Discussion and Development of Local Image and Video ModelsPrevious: >>108604726https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Zhttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Animahttps://huggingface.co/circlestone-labs/Animahttps://tagexplorer.github.io/>Qwenhttps://huggingface.co/collections/Qwen/qwen-image>Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>LTX-2https://huggingface.co/Lightricks/LTX-2>Wanhttps://github.com/Wan-Video/Wan2.2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girl>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkCollage: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
>mfw Resource news04/15/2026>DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching https://huggingface.co/tencent/DisCa>Lyra 2.0: Explorable Generative 3D Worldshttps://research.nvidia.com/labs/sil/projects/lyra2>AniGen: Unified S3 Fields for Animatable 3D Asset Generationhttps://github.com/VAST-AI-Research/AniGen>T2I-BiasBench: A Multi-Metric Framework for Auditing Demographic and Cultural Bias in Text-to-Image Modelshttps://gyanendrachaubey.github.io/T2I-BiasBench>Generative Refinement Networks for Visual Synthesishttps://github.com/MGenAI/GRN>VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenizationhttps://videoflextok.epfl.ch>DiffusionPrint: Learning Generative Fingerprints for Diffusion-Based Inpainting Localizationhttps://github.com/mever-team/diffusionprint>Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Modelshttps://github.com/deep-optimization/CoM-PT>Self-Adversarial One Step Generation via Condition Shiftinghttps://github.com/LINs-lab/APEX>See-through WebUIhttps://github.com/BeamManP/see-through-webui>ERNIE-Image: Repackaged model files for ComfyUIhttps://huggingface.co/Comfy-Org/ERNIE-Image04/14/2026>Nucleus-Image Releasedhttps://huggingface.co/NucleusAI/Nucleus-Image>ERNIE-Image: Text-to-image generation model built on a single-stream Diffusion Transformerhttps://huggingface.co/baidu/ERNIE-Image>Danbooru Dataset Filter: High-Speed Metadata Explorer for AI Traininghttps://github.com/ThetaCursed/Danbooru-Dataset-Filter>ChatGPT will praise the mood and 'bedroom/DIY texture' of fart sounds pulled from YouTube https://www.pcgamer.com/software/ai/chatgpt-will-praise-the-mood-and-bedroom-diy-texture-of-fart-sounds-pulled-from-youtube>RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Detailshttps://limuloo.github.io/RefineAnything
Has anyone managed to hook a NES emulator to Comfy yet?
>mfw Research news04/15/2026>Ride the Wave: Precision-Allocated Sparse Attention for Smooth Video Generationhttps://arxiv.org/abs/2604.12219>StructDiff: A Structure-Preserving and Spatially Controllable Diffusion Model for Single-Image Generationhttps://butter-crab.github.io/StructDiff>Scaling Exposes the Trigger: Input-Level Backdoor Detection in Text-to-Image Diffusion Models via Cross-Attention Scalinghttps://arxiv.org/abs/2604.12446>PromptEcho: Annotation-Free Reward from Vision-Language Models for Text-to-Image Reinforcement Learninghttps://arxiv.org/abs/2604.12652>MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transferhttps://arxiv.org/abs/2604.12281>Bridging the Micro--Macro Gap: Frequency-Aware Semantic Alignment for Image Manipulation Localizationhttps://arxiv.org/abs/2604.12341>LottieGPT: Tokenizing Vector Animation for Autoregressive Generationhttps://lottiegpt.github.io>Combating Pattern and Content Bias: Adversarial Feature Learning for Generalized AI-Generated Image Detectionhttps://arxiv.org/abs/2604.12353>Nucleus-Image: Sparse MoE for Image Generationhttps://arxiv.org/abs/2604.12163>HDR Video Generation via Latent Alignment with Logarithmic Encodinghttps://HDR-LumiVid.github.io>CoD-Lite: Real-Time Diffusion-Based Generative Image Compressionhttps://github.com/microsoft/GenCodec/CoD_Lite>SubFlow: Sub-mode Conditioned Flow Matching for Diverse One-Step Generationhttps://yexionglin.github.io/subflow>OFA-Diffusion Compression: Compressing Diffusion Model in One-Shot Mannerhttps://arxiv.org/abs/2604.12668>EDGE-Shield: Efficient Denoising-staGE Shield for Violative Content Filtering via Scalable Reference-Based Matchinghttps://arxiv.org/abs/2604.06063>Visual Preference Optimization with Rubric Rewardshttps://arxiv.org/abs/2604.13029>On the Robustness of Watermarking for Autoregressive Image Generationhttps://arxiv.org/abs/2604.11720
unblessed bread
comfy breat
>>108609285>it's insane to say this when anima has the best backgrounds of all current anime models, NAI 4.5 includedcompared to Midjourney, every model make shit backgrounds desu
>>108609846Midjourney still exists?
>>108607834can't believe a gozillion people responded to that post, Illustrious stopped being open source since v2.0 you know that right? lmao
>>108609897>gozillion people responded to that postThere's maybe six anons in these threads. Those replies are clearly samefags. Same thing happens whenever a """modern""" SDXL model drops.
>being spawned into a consciousness of a human before the matrix is madewhat a shit life, lmao
>>108609697>The Chinese always come out with really nice architectures, but they really can't into quality training data. Shame, the model had potential, but it's clearly slopped. It's very strange, due to Flux 2 VAE, some photos look very realistic, while others don't look it at all. They likely used a mixture of both slopped and real data, and it shows.that pisses me off because I thought that the humongus hype Z-image turbo generated would be a sign to those fuckers that people your models way more if you only train on real data, fucking souless bugs get your shit together!
>>108609942I feel ya anon, I thought I was fortunate to have grown during the golden age of video games but people in 50 years will be eating so good, like fucking Sworld art online shit but in real life, damn...
Is there a way that I can get a notification of some kind every time a prompt of mine finishes in webui?
>>108609942yeah you should've been born as a middle eastern woman in the 8th century so you'd get raped and stoned to death for being raped instead of posting shit online
>>108609990theres always worse, but theres always much better, especially for all eternity in the matrix where 99.9999% people that ever existed will actually be compared to this shit time.and sorry to burst your retarded libshit feminist bubble of the past that you randomly decided to add the imaginary foid suffering into this convo with, but not everyone in the past was raped. i know, shocker. being raped is only a common occurence if you are born as a dalit woman in india in the year of your brown sisters superpower of 2025.
in anima should we include the comma when using (emphasis,:1.3)? I remember reading somewhere about this but can't find it
>>108610046>being raped is only a common occurence if you are born as a dalit woman in india in the year of your brown sisters superpower of 2025.SAAR DO NOT REVEAL
>>108610124I put the comma outside the parenthesis but I don't know for sure.I believe it makes more sense because the tag comma isn't part of what you want to emphasize semantically, but technically I don't know how it precisely works.
>wai-animaok this is epic
>>108610223I always left outside, but I think I saw something saying to put inside because of how the attention or whatever worked, idk I wanted to find it again
>>108610384why the wai fags are working on an unfinished base model though? and do we know when the anima fags will be finished?
Ernie Turbo sadly seems to respond to traditional hi-res-fix upscaling in the same weird artifacty way that Z Image Turbo does. Picrel is Ernie with 8 steps @ 896x1152 -> 1.5x upscale with 4xFaceUpSharpDAT -> 8 steps second pass denoise @ 0.5 strength. Klein Distills at their standard 4 steps don't have this problem, they can even handle 2x without it going weird like this.
>>108609955It's just lazy and rushed, likely because they wanted to get a model like this out earlier than BFL as it collects quality data because even Flux.2 Max is still way behind.
>>108610430>workingJeetmixing some shitty loras together is hardly difficult "work" and can easily be replicated for the final version.
>>108610501then why Wai is so popular? ultimately the work of the anima fags should be way more respected, they're the ones making a real finetune after all
>>108609917god bless short shorts
>>108610508>Wai is so popular?Jeets are too lazy to pick and use loras when needed so they want prebaked shitmixes>ultimately the work of the anima fags should be way more respected, they're the ones making a real finetune after allI don't disagree
>>108610508Things that appeal to the lowest common denominator appeal to the masses by their very nature. See any pop artist.
>>108610537>>108610541fair enough, but the fact that Wai got recieved so well here shows that /ldg/ is browner than I expected :(
>>108610417god bless panchira and short skirts
>>108610548>Wai got recieved so well hereProof?
https://www.reddit.com/r/StableDiffusion/comments/1smfz58/ernie_turbo_is_pretty_awesome_i_think_this_is_my/glad that the ledditors are shitting on ernie, those chinks need to understand that Z-image turbo fucking exists, and that we wil never settle for less
why do you think about plebbitors at all
>>108610635why are you afraid to directly quote someone? see? there's a lot of questions that remains unanswered in this world
Holy ESL
>>108610490A model like what? More Chinkslop with no editing despite the fact they used an LLM that has a vision encoder?
Ever since I updated comfy I've been unable to generate photorealistic images with anima, did comfy nerf anima or something?
>>108610666>More Chinkslop with no editing despite the fact they used an LLM that has a vision encoder?there will be an edit model though, but ernie is as slopped as Klein so I really don't see the point, it'll probably be even worsehttps://xcancel.com/ErnieforDevs/status/2044290766349185257#m
I don't get anon's obsession with no editing. How often do you edit images? Would you dismiss local banana pro if it came without edit capability?Mediocre quality and plastic look are bigger problems with these models.
>>108610693I agree that plastic skin is a big issue, but editing is a powerful tool, I'm actually using NBP to make multiple scenes from one image input, and I can use those frames to make funny videos (first frame + last frame) with LTX for example
>>108610709Fair enough. Most of my edits are telling Klein to fix an issue in one image. I guess I am not creative enough for this.
>>108610693most of the people who come here to seethe about local models don't know anything more than basic free slop gens and nano banana, nano banana is the only way they get can "consistent" characters, edit models are super important to them because it's the only way they can make their AI influencers.
>>108610693>>108610731>usecase for a good editing model that could make anime characters and celebrities loras obsolete?
>>108610709>I'm actually using NBP to make multiple scenes from one image input, and I can use those frames to make funny videos (first frame + last frame) with LTX for examplesame, but with seedance, it's so funny to see how the model can transition from first frame to last framehttps://www.youtube.com/watch?v=CpoH9TGrwaE&t=81s
>>108610717Top nice
>>108610693>How often do you edit images?Nobody uses edit models for the same reason people don't use 3D generation models. BECAUSE THEY ARE CURRENTLY TRASH.When people get an edit model that doesn't lose quality when making iterative edits, that will be the default way most people will use all image gen ai.
>>108610790>When people get an edit model that doesn't lose quality when making iterative editsit'll only happen without a VAE
>>108610689yes check the code he added an "animanorealistic" function
>>108610800>it'll only happen without a VAECorrect.>>108610693>Mediocre quality and plastic look are bigger problems with these models.Also, with an actually good edit model, this would actually be a solved problem then since you could simple tell it what style you want or give it a realistic photo and tell it to gen shit like that.
>>108610813>simplesimply>>108610693>Would you dismiss local banana pro if it came without edit capability?No, but not because but despit of it. People would use it still because it would be a good model still, but to act as if the editing capabilities and reference image upload and understanding is not the main point of NBP is delusional.
>>108610818>tfw no bulge
@weebniggersgiven the color quality improvement of vpred models, in what fucking reality is not everyone doing that shit by now?
>>108610871vpred is actually a patchwork for the inherit flaws of these models... the actual fix, which is what people are in fact doing now, is flow models
>>108610892>what people are in fact doing now, is flow modelswith which models? they all seem to still average out the overvibrant colors througout the whole image without being able to get really dark/light images
>>108610916Flux, zimage, qwen and anima are all flow afaik, there are some attemps to turn sdxl into flow like chinknoob and the other one I forgot the name
>>108610692wat. Klein has significantly better raw image quality than either Ernie or Z Image, and can actually upscale properly
>>108608824>>108608861KITAAAA>>108610430Preview 3 came out so recently that I'm surprised they could pop out a finetune of it so soon. Then again, Wai was releasing new versions pretty frequently at one point.
>>108610970cutie
>>108610971>Klein has significantly better raw image quality than Z ImageZ image base maybe, Z image turbo no way
>>108611007jesus!
> Holy shxt… The rules of AI image generation just completely changed. A GPT-based model currently testing in the arena under the bizarre name 'duct-tape' is turning the global AI community upside down. What exactly did they feed this thing?> Native-level text generation with zero awkwardness. Chilling consistency maintained down to the pixel. Overwhelming illustration quality ready for immediate commercial use.>"Is it just downloading photos from the internet?" That was every tester's first reaction. It's that unbelievable.>A game-changer that instantly crumbles Nano Banana Pro's dominance has arrived. A lot of people might need to start packing their desks again.
>>108611038wait what? is this some kind of schizo fan fiction or something that really happened?
>>108611062keeeeek
>>108611062>or something that really happened?it'll happen
>>108611062it's marketing for the new openai slop model
>>108611028they're pretty cute
So it's now Civitai and Civitai.red?
>>108611156ernie would never
>>108610970>>108611007nice, this is anima to zit I assume
Don't buy that used 3090 goy.
>>108611160Yeah. Man, the sites were down a short while ago when I was trying to look up WaiAnima.
>>108611271>537x603kek, cute
>>108611278ss of a meme online
>>108611271Kek>Buy that downgraded nvidia>Looks slanted at you>It has AIEveryone claps
>>108611172>>108611196yes indeed
>>108611288chroma asian footfag is that you?
>>108611298no
>Midjourney V8.1 is live! Our iconic aesthetics are back w native 2K HD rendering - 3x faster and 3x cheaper vs V8. Full quality V8.1 1K mode is faster than V7 draft mode. Image prompts are back. New "Describe" is live - and you'll love our new moodboards & srefs. More soon <3
>>108611288Really nice, what LLM are you using to caption the photos? qwen or gemma4
>>108611362local models?
>>108611298>being this buckbroken lel
>>108611362>Image prompts are backrevolutionary
>>108611362im not even going to try it, look at their vibecoded slop site. this is just embarrassinghttps://alpha.midjourney.com/
>>108611362i usually hate the unrealistic, weird saturation, weird 3d render plastic look but that one doesnt seem as sloppy as usual although the fact that ZIT still blows everything except NBP out of the water with actual candid realism is hilarious
>>108611371didnt mean it in a bad way, hes based, i just didnt see him in a while, although i havent been here in a while eitheri thought it might be him given the oversaturated asian girl squatting gen and that it was interesting that he moved on from chroma to whatever that model is
>>108611362[SERIOUS DISCUSSION]How come Midjourney is the only model capable of doing rich colors without looking completely fried? Local cannot come close to this dynamic range without cranking a lora to weight 2+
>>108611387there's a reason Midjourney has a cult following, their images aren't realistic, but there's something in the colorimetry and shadows that makes it so nice to look at >>>/wsg/6127349
I remember when Midjourney was the boss. Now, lol.
>>108611406give everyone on local a dgx spark and we will have the same
>>108611429it's still the boss at aesthetics, there's still nothing like this
>>108611429>remember when Midjourney was the bossno? aside from some niches they mostly got their popularity because they allowed anyone to joing a discord server and start genning right then and there in a discord channel. they had 1 click button upscales and edits, this was the main thing that allowed youtuber normgroids to hype ai image gen easily to the normgroid masses, creating a positive feedback loop.
>>108611000yes way, ZIT can't do fine details on clothes / jewelry etc for example at the level of Klein to save its life
>>108611362>Image prompts are backWAT, how did you use it before?>>108611406They could always just post-process the image and crank up the vibrancy. In PS you can usually push a full 100% without banding/contrast issues (I've done it several times).
Preview 3 on the left, WaiAnima on the right. The refinements are appreciable.
>>108611443Klein is fucking ass, no amount of jewelry will fix the plastic skin and the shit details at a far away distance
If you never queued at least 7k images to gen, lower your tone while speaking here.
>>108611454fucked up the pose tho
Jailbreaking anima for realism is super fun. And uncensored, which is why it's fun.>using ZIT instead of SDXL for the hiresI will have to try that and post some gens maybe.
>>108611454BUT ANITARD SPAMMED THE THREAD AND HF CLAIMING IT WASUNTRAINABLE!!!!!!!!!
>>108611454impressive, show me more comparisons and I might download that model after all...
>>108611454Same prompt and seed at 1280x1600. Also significantly improved over vanilla preview 3. The latter looks so different that I reran it just in case I got a setting wrong somewhere, but nope.
Ernie will win. True seed and texture are awesome
>>108611368Thanks, I just use danbooru tags.
>imagepost "succeeds" but the post doesn't show up>retrying says there's a duplicate file, but the post doesn't show upJFC, how long is this breakage gonna last?>>108611454Same prompt and seed at 1280x1600. Also significantly improved over vanilla preview 3. The latter looks so different that I reran it just in case I had a different setting somewhere, but nope, there's no mistake.
https://huggingface.co/tencent/HY-World-2.0/tree/main/HY-WorldMirror-2.0Only 5gb?
>>108611545companies dont want to invest into something as niche as this since they wont suddenly gain some big piece of the market since the model wont be production ready anyway, so they invest just enough to try something new and gain #1 FOSS spot while using actual top R&D talent and compute on getting even 0.5% extra on 1 benchmark for their LLM
>>108611545>Only 5gb?+ 80gb of KV cache to hold 30 seconds of world in there kek
Is there ANY way to get good video generation that isn't filtered? All I want is to put images in a prompter, say what I want, and try to get it decent. Do I really have to into SD/ComfyUI shit these days because this shit gets increasingly censored?
>>108611490you told me this was another slop image of Klein I would've believed you
>>108611490>hard yellow bias across the image and even the skin>barely any detail on the skin>plasticy style>plastic hair like from slopped models from 2 years agodoa
>>108611570just do something else until we get the next generation of video models that distill seedance 2.0
>>108611490this look like plastic garbage, are you fucking serious anon? Z-image turbo mogs that shit
is there any truth to the rumor that civitai is gonna start deleting all i2v loras because of safety concerns?
>>108611536I like WAI more. I don’t know how it does it, but it knows how to make things with good quality. Many people say it’s “WAIslop,” but that only happens if you don’t configure the tags at all, especially with SDXL. I have high expectations for WAI Anima.
>>108611570it's over for based api.go try wan, sister. at least the fun is possible and tested kek :)
do illustrious loras work at all with wai anima, ai toolkit still not supporting anima means I cba yet
>>108611490That's sarcasm right?
wtf is wrong with 4chan? new hacking?
Cooked this LoRa. Tested this for a whole day. I think it's working well.https://civitai.red/models/2546093/shifty-nikke-goddess-of-victory-anima-lora?modelVersionId=2861325
>>108611829my guess is cloudflare, look at this shit: https://www.cloudflarestatus.com/
>>108611688people saying "waislop" are just the same people who will hate on whatever model is currently most popular (or literal SAAS shills)
>>108611656dunno but they've apparently made civitai red now.
>>108611489>>108611536Oh, so posts just might not show up for 10+ minutes.Interesting that poses from prior preview versions can resurface in WaiAnima.
>>108611946FINALLY, it took literal hours to get that imagepost through.
many sites broken right now
>>108612062fuck you pig, i know my rights
>Hrm, let's try this Hunyuan world model>It's just gaussian splats and each one eats up like 150mb of disk spacethe fuck? This is beyond useless.
>>108612173How does the model just know Asuka like that?
goddamn civit grenaded my setupall the md5s are gonna changemy API checks have gotten be redonemy spreadsheet has gotta be reworked
>>1086122013 pass custom lora and finetune. Is there another way to prompt asuka?
>>108611490I could swear this is klein. It has the same papery texture klein has sometimes.
really enjoying the ability to actually craft compositions in animait even (somewhat) understands how broken glass works
>>108611490ernie has that newish bright colored skin texture that's in qwen image 2.0. Not bad but clearly has the ai feel to it. >>108612062>>108611659>>108611156>>108611020>>108610960please anon can you share prompt fro some of these? are you using controlnet and img2img?
>>108610478yes i do
>>108611490im fapping
I'm confused. Why do people here hate Anima? It seems fine? Granted I'm not big on anime models, but what's to hate about it?
>>108612650it's astroturfing these threads and promotes shitposting.
>>108612650>people trolling>on 4chanshocking
>>108612650its trivial for anon to make himself appear as dozens
>>108612696Why did they hide unique IPs per thread anyway?
>>108612650It's 99% one troll who's jealous it got seed funding from ComfyUI.
https://motiftech.io/videoshowcase>2b>no soundwhy? why are the insisting on making useless models?
>>108613189
>>108613189>>2bVRAMlet bros, can we finally gen videos now?
Did you like HappyHorse? Then you will love HappyOyster kekhttps://xcancel.com/HappyOysterAI/status/2044618799089926428#m
>>108613279>Alibaba has gotten genuinely good on their craft>And that's the exact moment they stopped going localI hope you localkeks enjoyed being used as a free advertising tool, because remember, it's only local until it's good ;D
>>108613283>good on their craft
>>108613279
>>108613189>Quickstart / Usage>Requirements>Python 3.10+>CUDA-capable GPU with 24GB+ VRAM (e.g., A100, H100, RTX 4090)uhhhh
>>108612127>the fuck? This is beyond useless.that was my feeling during the whole 2026 year, only Klein turned out to be decent, the rest was a bunch of nothingburgers, and now that Alibaba left open source the future is so fucking grim
>>108613345>that was my feeling during the whole 2026 yearcome on anon, the year is far from over, the remontada is possible :d
>>108612713Traffic has been dropping steadily and it helps disguise the automated posts.
>>108613404>Traffic has been dropping steadilythat's a shame, I know that trooncord is slowly killing forums and shit but I don't want to be a fucking avatarfag, fuck that site, and I'm upset that the jew that bought it chickened out when he tried to implement IDs on NSFW threads, he had the perfect occasion to kill it kek
>>108613283I'm completely satisfied with Klein, and BFL will continue to provide me with new models in the future.Since I don't know any celebrities anyway and Trump isn't suitable for gooning, I can live with Klein's downsides.
>>108611852please stop posting lust provoking images
>>108613538>and BFL will continue to provide me with new models in the future.they won't, they gave us decent models only because they had to compete against Alibaba, now that Alibaba is gone they have no reason to give us better models and compete, Alibaba's death means BFL's death
>>108611406Kek, what? That has not been the case since SDXL, and even SDXL eventually got modified by the community to do true blacks.
>>108613608no one said anything about true blacks, it's not that thing that'll get us that vivid and varied set of colors midjourney is producing, and for the moment only that model can do something like that, only them know the secret sauce
>>108611362Midjourney is like the antithesis of Z-image turbo, every seed is a completly different image, but unfortunately for them, those images are often wonky, looks like the balance between quality and variety hasn't been reached yet
>>108613610>what are Klein, Chroma, any DiT model in existance...Chinese also recently released a model that is objecticely both more aesthetic and technically impressive than MJhttps://ernieimageprompt.com/Nothing will save MJ because it's pure slop.
>>108613684>Chinese also recently released a model that is objecticely both more aesthetic and technically impressive than MJ>https://ernieimageprompt.com/
>>108613684>the model that has been finetuned with only Nano Banana Pro's output is not slopcome on dawg
>>108613695the implication being nano banana pro is slop
>>108613707in terms of anime it's definitely slop, it always makes the same shit no matter what
>>108613734wrong thread anon
>>108613684Also >not friedZoom in on the hands herehttps://alpha.midjourney.com/jobs/de24932f-53fe-4fe6-8002-d90602f8f838?index=3This is not just a quirk of the latest model. Their VAE has been stuck in 2023 for the longest time. MJ was the king of aesthetics for a while, but it fell off hard and became a grift the moment more technically capable models became both cheaper and open sourced. I should say, a lot of their "aesthetics" are also stuck in that year.
I tried anime but my gens are all just black boxes, I use Forge Neo.Was there anything special I had to do? I think I got the right stuff.
>>108613753they stopped making effort and improving the architecture because they saw the aesthetics alone was making them rich as fuck, they don't need to touch anything else actually, why would you risk ruining the aesthetics for something more solid but also more boring (if they go that path they'll have to compete with fucking NBP and GPT Image 2), at least with aesthetic slop they have no rivals to compete withhttps://research.contrary.com/company/midjourney
>>108613740true
>>108613762hm>It only worked with beta scheduler for me, I have no idea whythis worked just now, but earlier beta wasn't. I guess it's just finicky
>>108613774case in point, MJ survives because other companies focus on make solid but sterile images, and to be fair, it's not like they don't want to do it, it's because we still haven't solved styles, NBP, as good as it is on editing cannot reproduce styles, no model actually can
>>108613762some samplers and schedulers that work with other architecture like SDXL family models just will not work
>>108613774>MJ, which aesthetically doesn't stand a chance against several local models released past 2025 across multiple departments, somehow stands a chance against an encyclopedic model released by Google that can create any image possible because it literally has seen almost the entire domain of images.I think you are very confused. If MJ got into an Arena and were competing against SOTA models it would laughably be at the bottom. MJ may have made its name off of aesthetics, but in the current day it is far from it. The vast majority of the people they attracted to their grift are not tech savvy, as they focused heavily on marketing to complete normies on social media, so their customers being blissfully unaware of better models in existence is their only advantage. It also does not help that Civit is a cesspool to this day, so when one thinks open source they immediately think of Civit and attribute those sloppers to current open source capabalities, which is far from truth.
>>108613827I'm using anima prompts I found on civitai but some were just not working for me, this is pretty weird. I think it's okay now maybe, after turning off speedups in the .bat. My opencv denoiser extension seems broken though, giving some error. That sucks.
>>108613851>MJ, which aesthetically doesn't stand a chance against several local models released past 2025 across multiple departmentare you living on the same universe as me? local models were aesthetically more varied and interesting during the SD1.5/SDXL days, nowdays it's just pure DiT era slop that can do 5 styles max, what are you even talking about?
>>108613871That's what I was wondering too. Old SD models had great aesthetics.
>>108611946waianima doa
>>108613871We now have edit models. So any style is possible on the fly with local. Aside from those, the quality of images (which are an objective aesthetic metric, unless you think bad hands are still aesthetic in this day and age), plus prompt understanding far trumps what MJ can output. The collage on this thread alone has more aesthetic variety than anything MJ can produce. It can't do proper amateur photography, nor multiple objects nor text coherently. MJ truly has no moat, and if you can't see that you're not any better than some brainrotted Zoomer from Tiktok who just learned about image models by scrolling through his feed.
>>108613947>We now have edit models. So any style is possible on the fly with local.no edit model can reproduce styles, I really start to believe you're living on another universe, how's life in there? do you call "matter" "antimatter"?
>>108613954i really start to believe that i'll never see a person in this thread shilling a saas model that isn't ESL
>>108613963>implying no browns use localyou never visited civitai right?
>>108613954May not be perfect, but they absolutely can, and they will only get better at doing so. Haven't really had trouble with Klein, but Ernie edit model will probably be even better at that as the style variety out of the box resembles what NBP can do.
>>108613983>May not be perfect, but they absolutely cancare to show an example? I
>>108612606Prompts are just danbooru tags, it's anima -> zit img2img, and zit I just remove the "masterpiece, best quality, score_7, photo \(medium\)" and keep the rest of the tags. But I did train an Anima lora on realistic images. FYI if you train on realism, do no use the "@" token, it pushes it towards illustration (I think).>masterpiece, best quality, score_7, photo \(medium\), sunna \(zenless zone zero\), zenless zone zero, 1girl, animal cutout, animal ear fluff, animal ears, bell, black bra, black choker, black panties, black thighhighs, blunt bangs, blush, bra, breasts, cat cutout, cat ears, cat girl, cat lingerie, cat tail, choker, cleavage cutout, closed mouth, clothing cutout, earrings, fake animal ears, fang, frilled bra, frills, green eyes, green hair, hair ornament, hairclip, jewelry, jingle bell, kemonomimi mode, leaning forward, looking at viewer, medium hair, musical note earrings, nail polish, navel, neck bell, one side up, open mouth, panties, paw pose, pink nails, ribs, skindentation, small breasts, solo, striped clothes, striped thighhighs, tail, thighhighs, toeless legwear, toenail polish, toenails, toes, underwear, underwear only, indoors
wtf anons. seedance 2.0 is way less censored now. it still has sensitive copyright filter system but nsfw prompts seems be able to go through model. All of these were text2video.https://litter.catbox.moe/iguqdbqozpr68taz.mp4https://litter.catbox.moe/sa4ic053p8jrmci1.mp4https://litter.catbox.moe/pawrqmmuiwrpv6yx.mp4https://litter.catbox.moe/ygs5abg61l7ltmy1.mp4https://litter.catbox.moe/afc8ep89li01gi0m.mp4
>>108614026>wtf anons. seedance 2.0 is way less censored now.damn, what API site are you using? it's for a friend
>>108614026the movements seem natural, they trained their model on porn or what? lmao
>>108614026>genning SAAS slop, downloading it and then uploading it and then posting it to a local thread
>>108614045don't cry, from time to time you need to be reminded how far you are from them
>>108614049don't breathe
>>108614053
>>108614035https://budgetpixel.com/budget pixel. Image2video is still fucked and doesn't allow human references. It seems like bytedance reach a ultimate compromise of the model for kike lawyers to back off from them. Would love to use local or seedream images for it but its a no go. There's rumors of a work round to trick the model but i don't know how it works.
>>108614026>heh i'll say my gguf ltx gens are seedance 2.0to what end?
>>108614026>>108614073put that here >>>/wsg/6126746 they might be interested, this is still a local thread after all
>>108614087the quality is too good and the sound is too clean to be ltx, you're dreaming
>>108614105the gooner slop in /vdg/ mog those gens anon.
>>108614087>Those videos aren't from Seedance, it's from LTX!>>108614137>LTX videos on /vdg/ mog those videos !So LTX > LTX?
>>108614168i'm saying your videos are trash and if you paid money for them in 2026 you should feel bad.
>>108614204>if you paid money for them in 2026you literally paid (((Jensen))) thousands of dollars to goon on a 5 second Wan 2.2 goon slop lol
>>108614219>you own a computer.is that meant to be an insult?
>>108614254>to own a computer, you must pay a 2000 dollars gpugood Nvdiakek, that's right, the more you buy, the more you save!
>>108614263>you must pay a 2000 dollars gpudo you think RTX cards have a coin slot for pesos?
>Saas shillfag was a gaymd owner who sperged out because he couldn't gen shit on his e-waste gpuKek, whatever helps you cope nonnie.
>>108614204>you must be ashamed to give anyone money to gen AI videos>>108614307>you're jealous because I gave daddy jensen a ton of money to gen AI videoskek
>>108614322so not only are you a poorfag who doesn't own a computer, you can't even afford to pay for saas gens lmao.
>>108614337>>108614204>if you paid money you should feel bad.
>>108610818catbox?
>>108614346if you can't even afford a few dollars a month for saas, you should probably just give up at this point.
>>108614026>>108614087I have to agree, that doesn't look like seedance 2.0
>>108614026>less censored nowdon't fall for this trap again. they can update the filter at any time
https://xcancel.com/bdsqlsz/status/2044726628920742398#mlooks like Nano Banana Pro will remain the king for a long time lool
>>108614373Anima>ZiT?
>>108614373>>108614433what's up, doc?
>>108614431that looks amazing
https://huggingface.co/spaces/silveroxides/Lodestone-Tagger-UI
>>108614469
did anyone try the new ltx2.3 distilled 1.1? the wan2gp readme says that video extensions should replicate the audio better now
>>108614478I'm not downloading a 22b model just for incremental improvements, I'll wait until they get closer to seedance 2.0 (it won't happen)
>>108614478I'd actually forgotten about that, thanks
>>108614476U mad?
>>108614495you're more vague than I'm mad
>>108614489>he doesn't have 10gig internet
>>108614478Nobody here has +12gb gpu, you have to another place fren
>>108614431Did it actually drop? Can't find any official info on this online anywhere, just teasers from a demo that was taken down days ago.
>>108614506works fine on my 12gb gpu
>>108614502>https://huggingface.co/spaces/silveroxides/Lodestone-Tagger-UI>Lodestone>tagger>ui>tagger , tag, tag pictures>huggingfaceLodestone released a image tagger in hugingface
lollmao
>>108614523genuinely what's the usecase for this shit?
>>108614383anon I'm not 100% dependent on closed source saas shit but i also cant be 100% dependent local toys. I just very pragmatic when it comes to this hobby. I like the diversity in choices in models both in saas and local route.
heh, ernie.fuck I'm old
>>108614552good Nvdiakek, that's right, the more you buy, the more you save!
>>108614561why is he driving on the wrong side of the road? he is going to crash and break all the milk bottles
>>108614469Isn't he still training it?Anyway it looks schizo like most lodestone shit. Seems to know insane amount of niche tags, but can't even determine more common ones reliably. And it's further useless since most of its knowledge is esoteric furfag shit. Maybe if you pruned furfag tags from it it could have some use, but I think I would just continue to use WD14 if I ever needed to do lora training for a tag based model.He also forgot to prune meta tags unrelated to image content like "English commentary" or "grandfathered content" and wasted compute and weights teaching model gibberish.This is also 5 gigs compared to a few hundred megabytes of typical tagger. Worth noting for quality/speed tradeoff when batch tagging a lot images.
>>108614540holyshit they banned playtime ai and deleted all his ltx 2.3 loras from civitai. i sense a great purged coming soon anons.https://civitai.red/user/playtime_ai_here some of his lora on civarchive https://civarchive.com/users/playtime_ai_
>>108613856are you using sage? you need non-empty negative prompt or you'll get black boxes no matter what
Grrr anima posts and anima doubts in anime generals!
>>108614648I was but I turned it off since it said I might want to. I had negative prompts too.I dunno what was going on but it works now.
>>108614677Usecase for dedicated anime generals?
bigma general
>>108614748Make you seethe
>>108614433>>108614444yes
>>108614568actually true if you're buying nvidia stock though
I can see that our friends on /lmg/ is also enjoying the Chinese culture kek>>108614665>>108614999
>>108614478I haven't used ltx for a while but it seems better, as in less jank. I know the limits of the model so I haven't tested any high speed chases or backflips, but it seems betterI'm not redoing my loras a thid time though, I'll wait for ltx3
>>108614995>>108614326are these just making cosplay or fan art into real or generated from scratch because those are very character accurate
Is there a way to suppress these fucking error reports in Comfy? I pulled for the first time in awhile, now it's spamming this shit every time I use a bypass switch between txt2img and img2img on my workflow.>0 errorsYEAH, NO SHIT YOU FUCKING FAGGOT
>>108615106use the "Click to Remove Element" extension, it removes a lot of useless shit from this bloated piece of software
>>108615062Nobody cares because Gemma4 bagged every qwen.
I finally got a 5090, what should I watch out for in generating stuff so as not to overwork my machine?
>>108615168fire
>>108615168Keep the boobs and butts on the small side. Every cup you go up increases the weight of the model.
>>108615106does it happen if you use a normal switch
>>108615180I see, better keep some water close by, don't want anything catching on fire>>108615218So I should make flatties? Smh, I guess I better start to love flatties AAA cup
>>108615255I'm using Fast Groups Bypasser and it happens every time I switch. Not sure about any other switches. There's a few other retarded "errors" it reports that aren't errors at all as far as my workflows are concerned, but I did what the other anon said and blocked the element, so it's all good now.
>>108614026kill yourself, why dont you go back to genning onsen pics for your friends in plebbit retard?
can you post good anime instead of jjk slop please?
>>108615327why dont you post any gens tho?
>>108615327It has cool characters and animation, but there is certain slop look, can't deny. I wonder if it's because mix of 3d animation. It has certainly different look with 90's anime filter
can you post good realism instead of zturbo slop please?
>>108615503that ain't his wheelhouse. you get >1girl with {pink|purple|orange|green} eyes,
>>108615327can you?
>>108615503
>>108615327open up
>>108615503nyo~~~
>>108614552catbox?
Fresh when ready >>108615635>>108615635>>108615635
>>108615519>>108615305>>108615223Cool gens