Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>106968093https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Neta Luminahttps://civitai.com/models/1790792?modelVersionId=2298660https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
THREE MORE YEARS OF SDXL!
>>106972449ty baker
>>106972482when ai developers get their shit together and stop with shitty safety guardrails and censorship would we have a solid illustrious successor by now. kinda feel sorry for grok fags.
If /ldg/ took the api pill already you would've seen how much better AI can be.
based kebab manhttps://xcancel.com/GozukaraFurkan/status/1980931218494624092#m
we should separate the general between 1girl and actual art
>>106972586that's right king, tell those fucking chinkoids how it really is.>>106972592waiting on you to post some actual art dog
we already do, actual art goes in the api thread
> comfyui, wan> switch loras and strengths 4 times, ram oom, restart> switch loras and strengths 2 times, ram oom, restart> switch loras and strengths many times without oom, gen all day, reboot computer> switch loras and strengths few times, ram oom againfucking hate this shitwhy can it work only 1/100 times like normal
>>106972631set pagefile to double your system ram, which by the sounds of it you have 32gb.
>>106972586> Single-GPU & Fast Inference> single H100
Can I do anything on a 12gb rtx 3060? Got a i9 14900k and 32gb of ram.
>>106972639what that'll do?
>>106972656yea.. anything sdxl, cut down flux.. animations will be slow tho
>>106972656You can run SDXL, the best local model ever developed
Blessed thread of frenship
>>106972668stop you from ooming, you silly billy.
>>106972449Last thread>SD 1.5 tier>SDXL is still bestI love how these fags blabber on about without a proper goal for what constitutes a decent model. The tech has already advanced on every aspect, but they are stuck on SDXL due to its aesthetics (mainly because of a very low IQ).NetaYume is a more than capable anime tune, with limitless potential for further improvement. Chroma base is also ready for its own anime tune.Yeah, Chroma is harder to teach styles. Don't forget you're comparing 2B to 12B. No, SDXL is not even Flux tier, fuck off and go back to your SD 1.5 tier shitmixes.
>>10697263964gbit's about 42gb used after the first genclearing cache and unloading models through model_manager.py, cuda clear and doing gc.collect frees not enough just prolong the agony
>yes, my 12b model is dogshit at learning compared to a 2b from 2023, but you should use it anyway because new thing is new!
>>106972573>>106972592>>106972606(You)
>>106972713man i don't know then, sorry. I have half your system ram and i never oom with wan. something else is fucked with your setup.>>106972731Nice.
>>106972691>Please continue to finetune chroma for me because the $200,000 i spent wasn’t enough to learn a single style. Trust me guys, 512x512 is the future!
>>106972691> Don't forget you're comparing 2B to 12B> extra 500% heavy> for 30% better result> still have to gacha
>>106972656>14900k>rtx 3060why?
>>106972691the truth of local models isif it cant run at decent speed on a gamer gpu it will stay shit.unless its a really large model that can only be run on custom whalecoomer hardware and happens to be good out of the boxno inbetweensonly exception being videogen because the barrier to entry there is higher in the first place
>>106972731Do you want us to post API gens here? You cried like a bitch last time you got mogged by Seedream
>>106972742>>106972746SDXL doesn't even learn a single concept. It overfits on training data, similar to SD 1.5. What you get is tasteless variations, which is why you can only mostly tag soup it. Garbage in, garbage out. Chroma actually genaralizes.
>>106972756>Do you want us to post API gens here?Did someone take your EBT?>You cried like a bitch last time you got mogged by SeedreamThat was another anon yesterday and I'm also a Chroma lover. These have been Qwen Edit 2509 though
>>106972756i wish there was a midjourney thread desu, i always see a lot of cool shit that uses gens from there
>>106972785yet shitty sdxl based finetunes > generalizing chroma
To that bro who was running Nunchaku Qwen with LoRA, can you share your workflow? Or anyone else doing the same.
Chroma users are a blight upon this general. It’s a trash failbake only enjoyed by balding 3dsloppers who mistake the training artifacts as ‘realism’. Every chroma gen posted looks like melted shit, and that is because the model is objectively poorly trained
>>106972639>set pagefile to double your system ramnta but this is also only a fix for some workflows and comfyui is only getting worse over time, i still get ooms every once in a while with 24 vram and 128gb ram and dynamic windows managed pagefile
>>106972815>nogentry again but not through tears next time
>>106972691A lot of you havent experienced what a proper anime finetune (NovelAI V4.5) is like to use compared to the utter dogshit we're served here locally.
>>106972749Replaced a much older CPU recently. Changed the GPU a few years back.
>>106972785>Chroma actually genaralizes.Not op, add my +1 for Chroma training. It's ridiculously good at picking up source training data and is being negged into a secret only autist coomers with motivation are aware of
>>106972749I can add another GPU for AI but it's expensive. I would also need a new power supply.
>>106972864>It's ridiculously good at picking up source training dataAny examples you can share?
>>106972670>IN A WORLD...OF NONSENSICAL SLOPPED DETAILS...
>>106972835NovelAI actually develops tech like #source#target which local has zero answer to. This is because local bakers don’t actually gen, so they don’t realize what needs improving
>>106972890the funny thing is, chroma could've done something similar, considering the entire dataset was given NLP captions with gemini
>>106972864They are just trolls. Can't possibly be upset with Chroma or NetaYume (meaning you have no imagination whatsoever), shill Illustrious, but then simultaneously shill API shit as if that doesn't make IL look like a joke in comparison.
Just a screeching voice fading in the windYou and your trani lostJust cope and go away
>>106972906NetaYume is incredibly artist dependent in my limited testing. Some artists give great results, a majority of the others end up leaving you with incoherent and melted details. The artifacts are unique, however, they're different from the typical VAE melt seen in SDXL
>muh 1girl bad>unironically shilling chroma rng slop
a local wan2.5 might even surpass sora 2. but that will never happen. the chinese prefer online humiliation, kek
>>106972932We're never getting another local WAN model btw.Even if we did, it'd be without audio (too dangerous!)
>>106972928>Oversized gianttess 1girl>Not doing anything novel, because the model no concept of anything other than some basic poses fed into it.That's IL/SDXL for you.
>>106972864>>106972906my vote's on yume for the next major step up, i just need to get my coomer motivation today to install a totally fucking different training script than what i got the other day and give it a gothough step one is getting training data for a good style to train.>pic unrelated
>>106972960do share if you find a good training script for netayume, i have some loras i need to rebake on it.
It's the same disabled faggot spamming for days, you know who he is just stop arguing.
>>106972890NetaYume comes close to NAI in prompt understanding (though it's obviously tagged differently so same prompts yield different results, but NAI prompts can be tailored for NetaYume). NAI is probably around 4-5B parameters due to its better grasp on text. Local is catching up, just a scaled NetaYume is all it needs.
Retard here, the main reason for the garbled details and backgrounds in XL based models is the VAE right? Could the VAE be retrained, or replaced to improve the small scale details? Or is it easier to just do a whole new model?Just wondering, because at least for me those models do most things I want, with exception of the mentioned things so I wondered why this one thing hasn't changed with all the retrains and variations of XL models having been made.
>>106972960For anime? Sure, for now (as we've yet to see a Chroma anime tune). But for realism Chroma is already by far the best model for it.
>>106973048Do you know about NewbieAI? It’s apparently yet another lumina finetune but they added 1b to it. Last I checked it’s supposed to release around the end of this year
>>106972879>>It's ridiculously good at picking up source training data>Any examples you can share?Search the archive. There's a no feeding sign>>106972906>They are just trolls.Out in force today>>106972960Will try when diffusion-pipe or OT supports it. That's better than expected
>>106973080Never heard of it but if this is it https://huggingface.co/NewBie-AI/NewBie_diffusion-model_repositorythen it's looking great.
>>106973055theres a reason it hasnt been donebecause it requires retraining the model as well so it outputs the right 'format' for the VAE to interpretalso i speculate that fixing the VAE problem would only expose other incoherency issues as the model was not trained on such precision and it can only get so far with its small sizemost gens benefit from a proper upscale anyways, regardless of shit VAE or not
>brown 3dcgvomit inducing FR, where's the netayume schizo with his gens? past thread was also devoid of good anime gals
Do you train illustrous loras with the same settings as XL/pony, or is there something that needs to be different?
Interpolate>SeedVR2OrSeedVR2>Interpolate
i just dont know what kind of 1girl to gen is the thing
>>106972928>>106973013Why does every medieval interior AI generated stuff look exactly the same?
>>106973145>Search the archive.I'm not digging through troves of endless garbage to find the next level prompt adherence you speak of. Back up your claims with proof.
>>106973243That's more a consequence of the shitmix he uses desu
>qwen edit 2509>prompt add a girl in the scene, keep everything else unchanged>the scene shifts a tiny bit and the details are offthis model a fucking joke. who would've thought that a bigger model than kontext can't even do basic shit correctly? wtf
>>106973273For being an edit model it sure does like to resize and warp images.
>>106973243it's the mixes man. they're trying to warn us about the shitmixes but we won't listen man.
>>106973233seedvr2>interpolate
>woaw chroma is so good guys its the best model!!>cant even do fishnets without shitting itself Not a good look
giving non artists the confidence of faggy artists was a huge mistake
>remember in my tired brain you can just prompt any kind of eyes you want>leave out the character tag and it's gonna wing it a bit>get thisd'aaaawwww adowable eyes gen i can't share the full image of because its a failgen that put her on top of the table with a gigantic hyper ass in focus
>>106973327Have you tried donating another $200000? It might be enough to allow him to upgrade to 768x768 training!
>>106973327Point me to a different model that can oneshot my bondage, girl on leash or pregnancy prompts anon-kun.API would filter 99% of my Chroma gens.
i think ani's lawsuit just flew over my house
>>106973243gets boring using "flower garden background", "forest", "jungle", "rocky area", and "grass field" background for majority of my gens. Building interior tend to be the weakness of sdxl.
where'd all the creative anons go?
>>106973080>>106973147Any more info on this? The repo is bare.
>>106973413Realistic? Not sure. Seedream 4.0 is uncensored, but probably not amazing at NSFW regardless.Anime? NovelAI V4.5 without a doubt.
Some Nodes Are MissingWhen loading the graph, the following node types were not found.This may also happen if your installed version is lower and that node type can’t be found.NunchakuQwenImageDiTLoaderNothing I fucking do fixes this. Yes, I manually downloaded and installed the correct wheels. Please anons, by God, help me, I'm going to rip my hair.
>>106973426Dl local models and get creative. OP has guides.
>>106973426monetize on twitter
comfy shoudl be dragged out on the street and shot
>>106973210Mostly
>>106973426>>106972047explains it nicely
>>106973442>Realistic? Not sure. Seedream 4.0 is uncensoredAh yes, Seedream>>106968701
>>106973487Sounds like bullshit to me
>>106973502>Ah yes, Seedream>no prompt providedWorthless comparison.
baiting schizos is the most fun part of these threads desu, you make some vague comment and some anon spends 3 threads fighting someone who only exists in their head
>>106973514facts
>>106972671>SDXL, the best local modelwhycome it be best?
>>106973382>i can't share the full image of because its a failgen that put her on top of the table with a gigantic hyper ass in focusthats not a failgen
>>106973502What's best is that Chroma can generalize, so I can modify as I want to.>>106973513Didn't include since I've posted it here many times, but it's>Amateur photograph, a Japanese idol woman, performing an advanced contortion pose indoors, likely in a studio setting. She is sitting on a surface with her legs bent backward and extended over her shoulders, so that her feet are positioned and touching over her head, displaying an impressive level of flexibility.>A white towel is draped over her front for modesty. She has straight black hair with bangs, and she wears a black wristband or watch on one wristTo showcase its generalization abilities, on Chroma, the prompt can be modified as much as I'd like:>>106967674>>106967761I can make it uncensored.https://files.catbox.moe/h90ted.pngIt's the ultimate benchmark against APIshit models, and none of them have ever reproduced anything like it even if it's not filtered, and I can also give the 1girl props in her hands etc...I can generate endless variations that satisfy my needs. It truly is an amazing model.
great you summoned the chroma-schizo
>muh model X>muh model Y>muh schizosam I the only one who doesn't care and enjoys his 1girls, as long as a model gives me nice pictures i don't give a shit
>>106973459Sorry you got molested trani
thsi it begins (again)
>>106973640this is generally why we're a bullied minority in these threads, the only subgroup unironically satisfied with the simpler things. Many such cases!>gens on comfyui>uses whatever model works>doesn't complain unless its a skill issue
Tried to gen a video at 360x480 and got an OOM.Set the resolution back to 480x640 for debugging purposes and it works fine.wtf is going on
Your personal opinion:For gooningWan with motion (undressing)?Chroma to undress?Which way anon?
>>106973640The thread is filled with browns whos primary fascination is not the tech because they are low iq, and given that in any online forum its always gonna be more likely to have those people post because they are terminally online and mentally ill, every forum will devolve into a clownshow of mostly those posters
>>106973643That would explain his behavior
>>106973614>none of them have ever reproduced anything like it even if it's not filteredWhich btw, Chroma is the closest to what I'm trying to achieve (inspiration was a real image of a girl doing a contortionist pose which I found hot as shit). That requires a certain degree of anatomical understanding that has never been seen in any other model due to how censored they tend to be.
>>106973552>thats not a failgenthank you for the encouragement.
schizo holocaust when
>>106973147>doesnt explain the datasetfucking garbage
>>106973444it means COMFY NUNCHAKU failed to load, check the fucking logs retard, and give them to chatgpt
>>106973786Nice
>>106973786>this guy slaps your 1girl waifu's asswhat do you do?
what's a generally good method for 2girl prompting in comfy? raw prompting + adetailer separating is working great but there's still going to be the expected clothing/lora bleeding.i see there's like a dozen options but most seem convoluted and need highly specialized node setups.
>>106973855its either regional prompting for sdxl based models, or just use a recent model and rawdog it (there might be some bleeding but with enough rolls you'll get decent results)
>>106973802>what do you do?rage against the machine
>update button: pressed>manager nodes: updated>huggingface repos: checked>model status: loaded*cracks knuckles* alright, it's time to gen some 1girls
>>106973855attention couple
>>106973855forge couple on a1111/forge forks. works best for 2d anime/cartoon but issues of bleeding are present with 3dcg, cgi and realistic styles.
>>106973732>ImportError: DLL load failed while importing _C: The specified module could not be found.I don't need ChatGPT to tell me that that's not particularly useful.
>>106971762>Is anyone running comfy with cuda 13 yet? any problems? I assume you would also need torch nightly and all that other shit.Yes, I had to recompile a few things especially for cuda 13 like sage attention but essentially everything works.
>>106974020gud
>>106972631Use linux.
>>106973656I don't understand your question : either you want video (wan) or image (chroma or whatever image model you like)
street is busy tonight
>>106974051I just wanna know anons preferencesLike I noticed here a lot of people using chroma (or qwen)
>>106972713>>106972822It's ironic how we went back to increasing pagefile size in the unholy year of 2025, how much ram is even enough at this point?
How the fuck do I install ersatzForge? I like reforge but this offers a couple interesting changes I want to try out. I figured it'd be easy to install since reforge works but trying gives me like 20 A4 pages worth of error codes and nothing of the output seems particularly useful like pointing out missing dependencies, how the fuck do you install this shit
>>106974062what artist?
>>106974080CIA chickens
>>106974080kino
>>106974062>NetaYumeWake me up when better finetunes of the base model are out, or when it offers better upsides for all the downsides it has.
>>106974070I gen with chroma or sdxl based local, then I animate with wan.It's not one or the other.
>>106974021good to know, thxhas there been a speed improvement or anything?
Why is prompt leaking a fucking thing?Is there any node that can prevent a prompt from a previous gen from leaking into the next gens that have a freaking different prompt all together?
>>106974076@kabaji
Trying to use ProductConsistency wan lora with cowgirl really works pretty well, but with blowjobs (the DR34MJOB one), I only got samefaces and horrible anatomy, like the guy having a dick mouth.I'll try other loras.
How's Qwen and Chroma?Can i run these checkpoints with 12gb VRAM?I want to know if they are worth it, i am using flux nf4 gguf but it's pretty meh, you cannot get good landscapes without massive amounts of blur and it often ignores prompt.It simply sucks at details.
>>106974137I noticed less OOM but honestly it can be general updates or drivers or anything else.
>>106974117perfect, what's your wf?
>>10697415412GB works theoretically, but my computer always shits itself when trying to run the full models, seems like 32GB RAM for swap isn't enough.