Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107045533https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Neta Yume (Lumina 2)https://civitai.com/models/1790792?modelVersionId=2298660https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQdhttps://gumgum10.github.io/gumgum.github.io/https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
https://huggingface.co/spaces/briaai/FIBOhttps://huggingface.co/briaai/FIBOeven SDXL can make better faces, the fuck is this shit lmao
>>107049293for a 8b model that's terrible, I love to shit on chroma (8.9b), but chroma looks way better than that
*yawn*
>>107049306>chroma looks way better than thatWith the same complex prompt it doesn't it looks much worse, it needs to be tweaked a lot and focused on 1 subject to nail realism, and its quite worse for prompt following
>>107049293>picPrompt was: >>107049239>Detailed photograph RAW of seven smiling friends of different races that are at a nightclub concert with dim lighting that is shining on their faces, behind them is a crowd of people dancing while fighting with large swords, everyone is holding a sword in their left hand and an intricate beer glass with differently colored beer in the right hand. Far behind them above the DJ there is a sign which has "Minimum drinKing age 021!" written on it in stylized cursive letters.
>>107049306That is the result of a training on a safe dataset. Could be saved though, and it looks way better than Flux dev does out of the box.
>>107049293SDXL can not gen that, not even close
>>107049293Interesting, it's got some of the Chroma kino, too bad my prompts are moderated so I can't test its anatomical knowledge thoroughly, but this may be the first non-Flux model I truly like for realism.
>>107049335being good at this kind of stunt prompt has absolutely nothing to do with safety lmao, people act like the mere presence of nude bodies somehow magically make a model good at wholy unrelated things.
>>107049361Has everything to do with it.Like >>107049359Almost perfect, but her tongue is broken. That's the kind of thing you'd expect out of a model that hasn't seen that many tongues. I also expect the pose variety to suck.
>>107049359>pircel
Almost same prompt as>>107037102minus panty part. Chroma gen is more accurate because I say Tokyo Metro train so the seat should be benches.
>>107049399
>NVIDIA RTX PRO 4500 Blackwell>32gb>200wworth getting? or just get a 5090?
>>107049425That is not the complete prompt>Amateur photograph, two young beautiful Japanese idol women, dressed identically in coordinated pink outfits, are fast asleep on a Tokyo Metro train, seated side-by-side with their heads gently resting against the plush seats. Both wear stylish pink berets, light pink jacket-like tops over white t-shirts, and short, pleated plaid skirts in shades of pink and white. Their feet, clad in white sneakers and white socks, are slightly elevated as they snooze. Faintly visible through the windows, the dazzling blue expanse of the sea and the glittering skyline of Tokyo pass by, a serene backdrop to their shared slumber.
>>107049436this shit is more expensive than the 5090 right?
>>107049436i dont understand whats hard about comparing like 3 things for peopleprice, vram size, and vram speedthat piece of shit is 896.0 GB/s compared to 5090s 1.79 TB/s
>>107049436>10,496 cuda coreslol
>>107049372no it doesn't lol
>>107049444not in my country
>>107049443meh who cares, FIBO looks like plastic shit with terrible details on the face, I don't give a shit how well it follows prompts if the aesthetic is picasso tier
>>107049384Kek. Asian bitches really smile wit they eyes, yo>>107049436How much? Probably would just go for the rtx 5000 72GB.
>>107049450you forgot the number of cuda cores
>>107049460Nothing is better than Chroma for realism anon-kun. It's not a fair comparison because Chroma is exponentially better than its own base model. But what I see in FIBO is a decent base for further tuning.
>>107049472>But what I see in FIBO is a decent base for further tuning.why not simply using Chroma as the base for further tuning anon-kun?
is he still pretending like you cant train aesthetics into a model
>>107049467i mean there are some more caveats too like the compute version which can fuck you over with not having capability for things like flash attention too but for anything newer you dont need to worry about most things beyond the 3
>>107020025Had to slightly modify this one as well>Amateur photograph captures a beautiful cute Korean idol woman with hair styled into voluminous pigtails adorned with multiple red ribbons. Her skin appears doll-like, with flushed cheeks and subtle red star-shaped patterns under her eyes, lending an unsettling innocence to her gaze. She wears a cream-colored off-the-shoulder ruffled top and shorts combo, paired with knee-high white socks and retro red Mary Jane shoes. Her expression, with slightly parted lips and wide, almost vacant eyes, is unsettling as she is intently next to a lifeless, creepy doll that sits next to her. The doll, with its porcelain features and vacant stare, mirrors her own eerie demeanor. Surrounding her on the floor are scattered sweets and fruits, while the background suggests an old-fashioned setting with light blue cabinets and a patterned floor, similar to her given image. The overall effect is a disturbing blend of charm and unsettling undertones, as if she is immersed in a macabre playtime. The woman is in her 20s, an adult. The room is dark and flash lightTrimmed part at end about disturbing/unsettling undertones, macabre playtime or dark room so it doesn't moderate it.
>>107049436>worth getting?No, it's way underpowered compared to a 5090.>or just get a 5090?Yes or if you have money to burn, a 6000 pro, which has the same compute as a 5090 with way more vram.
>>107049436Only if you plan on getting 2 of them
>>107049534the blur is more intense on the left size compared to the right side, that's funny it's the first time I'm seeing something like that
>>107049537why not 3 5090s then?
>>107049481Chroma style tunes shouldn't be limited to Chroma.
>>107049588>U vill buy ze pro sizthowzant and u vill be happy
>>1070495881800W
>>107049610wdym?you haven't replied my point. 3 5090s clearly have more cuda cores than a 6000 pro, so why not? I'm not letting you get away with this
>107049634retard alert
>>107049634can comfy run models with multiple GPUs on parallel though?
>>107049650https://github.com/pollockjj/ComfyUI-MultiGPU
>>107049686it doesn't make both gpu run at the same time though, it's not parallel
>>107049788>Seamlessly distribute .safetensors and GGUF layers across multiple GPUs if availableI think it should
>>107049293if i can compose every item in my image down to where i want it to be with json, i can run it through any refiner i want to get the output i care about. ie. any of these chroma gens that are refined with any number of other models
Any anon kind to help me out? i dowloaded SD Next to try it out but the gens are going slow af, in forge and comfyui everything takes around 35 sec to 1 minute at most, in SD next shit takes like 3 mins to give me a shitty gen, same settings as in forge or comfyui.i think its using my cpu instead of gpu but i can't find where to change it on the setitngs.
>>107049890New models are just too expensive to finetune. Chroma spent $200k and still had to cope with 512x512. No baker wants to touch the new stuff either because the base gens all look so synthetic and trash so a lot of training time gets wasted de-slopping the model
>>107049816no, it splits the model into 2 parts, the first gpu starts with the first part then the second gpu continues with the second part, they don't do it at the same time
>>107049959depending on the workflow you'll install more nodes, e.g. comfyui-multigpu model loaders to offload more from vram to system ram or the videohelpersuite video combine nodes that let you do more video codecs or frame interpolaion nodes that put more frames between generated frames with a different faster technique.
Latest git comfy now supports:--fast pinned_memoryFor faster memory offloading. You can also add this to speed it up even more:--async-offload
>>107050212>I've not been into this for a year or so. Last I was using easy diffusion with sd1.5.really? what brought you back?
sad
>>107049934bot's back again I see
>>107050212>>107050248>>107050260why she on fire? hot fart?
>>107050302it's a spicy cumfart
>>107050302because she is fire
>>107050313>dgxenjoying the njudea sponsorship?can you send over a rtx 6000 pro?
>>107050212didnt this happen metaphorical weeks ago (two or three days ago) i remember anon posting about it you should add a save queue feature
>>107050321MI300X is a bit better though.>>107050332The pinned_memory is from yesterday but it still needs some polish.
>>107050352is this neta yume? why do you like fennec girls, arent you tired of the same wifey?
Is sdxl still the best at nsfw hardcore?
>>107050352oh while you're here, can you make it so when you pass --disable-api-nodes, the API templates also get disabled?
>>107050402yes it is.
https://huggingface.co/Disty0/HunyuanImage3-SDNQ-uint4-svd-r32/tree/mainExplain to me why we can't just keep compressing this until we can run it on our plebian GPUs.4 bit of 4 bit of 4 bit.
>>107050499Isn't that version already tailor made for poverty stricken Brazilian GPUs?
>>107050499>why can't they keep compressing the jpg?you can do it, it won't look good though
>testing something new in a workflow>errored out earlier and adjusted settings>gets to the same point where it failed>get jumpscared by fucking adblock tab opening and flashbanging me telling me it's been updated
>>107050517It's a massive model. Those tend to respond to extreme quantization better. Another thing we could look at is potentially 1-2bit, then a proper gguf implementation and just CPU maxxing.
>>107050393fennec girl is cute and I'm not going to post my gf in these threads.>>107050405ok but that might take a while cause it's pretty minor.
https://huggingface.co/Disty0/Chroma1-HD-SDNQ-uint4-svd-r32 How you get this shit working for comfy? I refuse to use a cucked UI.
anti comfy schizo real quiet now
Is vace supposed to take a long time?I've been at this part for like 8 minutes now, 32gb vram.
>>10705062832gb isn't enough for a remotely fast speed
>>107050632How, the model is only 18gb.
>>107050605comfy has better stuff coming.
What the fuck.I feed this clip joiner workflow two nsfw clips to splice and merge, and this is what I get?? 0 frames related to the two videos.Took like 25minutes to gen.
>>107050605>SDNQanother meme entered the arena>supports SVDwait im not reading this guy's paper, is this supposed to be a quantiziation method or not?>only 1 example in the model cardcringe af, nunchaku (SVD) guys put more effort in their copequants
Which front-end etc should I choose for img2vid gens? And is there a specific Stable Diffusion version/installation I should pair it with?Thanks
>>107050820don't remember her prompt, have returned to varied 1girls. maybe another time.
test