Discussion of Free and Open Source Diffusion Modelsprev: >>107918851https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Flux Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Blessed thread of frenship
This one was made first.
blessed thread of quality maintenance
>>107921834thanks for the bake
>>107921860SD1.5 energy
>2025: gen -> detailer -> inpaint fix>2026: gen -> detailer -> klein fix with reference img?
https://files.catbox.moe/4vusph.mp4I literally could not compress this enough to put it in the /wsg/ thread.
>>107921834>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonwhy are these off-topic links in the OP and why is AniStudio not?
>>107921913hi ani
uh oh melty
>>107921913because this thread is full of fascist chuds
>>107921913>>107921834I have the same questions.
guys, I ran out of comfycreditsgen on without me :(
>>107921871
>>107921834>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonwhy do we let a tranny bake threads with schizo manifestos in the OP?
>>107921854low thread quality maintained
>>107921843*hugs you*
>>107921912good shit anon
>Turn this illustration into a photo. Do not change the appearance or details of the monster. Do not change anything else.This model has some limitations like poor facial likeness but it's ability to preserve even unusual details is quite good imo. It may not be perfect but good showing for a difficult request. (I had more accurate seeds too but wanted to show a more average result) I asked a similar prompt with the same picture to the older GPT-Image months ago and the result was completely slopped compared to this. Even current GPT-Image is not that much better.
nice another thread of ani talking to himself until bump limit
>>107921890klein is a very solid img2img fixing tool for sdxl gens.
LOCAL IS SAVED>LOCAL IS SAVEDLOCAL IS SAVED>LOCAL IS SAVEDhttps://civitai.com/models/2308535/unggoy-ill
>>107922018honestly klein has no business being that good at img2img
>>107922018>>1079220084b or 9b?
I love genning, I love bantering, I hate Anistudio some days, I love defending some other days, I hate Comfy, I love using Comfy, I hate schizos, but I’m a schizo myself and shitpost as well.I love /ldg/.Never change, please.
>>107922022a hole is a hole xd
>>107922033The monster one is 9b. The other is not mine.
>>107922043kek
>>107922033its klein 9b base >>107922024its insane how good it is. even better than nanobanana pro imo.
>>107922043Based, deep down, despite everything, we’re having fun.
local suno?
>>107922082>its klein 9b baseYou should try distilled. it's just as good and like 50x faster.
>>107922018Is the darkening on purpose?
>>107922043Very basedJulien is literal shit tho
imagine the degenerate shit guys like Musk are gooning to at the press of a button and we're stuck with this shit, it ain't fair
>>107922183skill issue
>>107922153uh oh! pamper stinky!
>>107922104i already have it, i was just testing 18.2 version today. i don't notice any quality differences.
>Transparent clothing is gone.Gotta wait until someone beats raunchy stuff into it I guess.
>>107922183pretty sure grok image stuff is just qwen-image.
>>107922183be patient anon, the next gooner model is either going to be on klein 4B or on Z-image base
>i- If i only now how to set up Local models..........
>>107922501>anon delivering fertilizer to a nearby data center(2027, colorized)
>>107921912lmao kino
>>107922363:) can you share the prompt anon.
>>107921912kino
when will they update comfy ui desktop already? I want to use klein. fucking lazy vibecoder devs
>>107922594>desktopyour loss
>>107922082Can you get the left 2.9d look from klein?
>>107922430The Stone Of Lode is training both it would seem. And yes, I did train a Z Image Lora of "Uber Cough Girl" Arna Kimiai.
>>107921730Klein can do way higher though if you just don't use that node>>107922522We'll never escape jeetcaptioning I fear
>>107922724>The Stone Of Lode is training both it would seemDavid Cronenberg approves
>>107922724>4Bpass
Do flux 2 loras work on klein?
>>107922762just finetune 9b yourself lmao
>>107922779Exactly we all want another chroma and it's mutants. lezz gooo baby
>>107922008>likenessIt's very inconsistent but it can do likeness quite well.
>>107922795chroma was a lobotomized undistilled model trained on schizo settings and resolution.
>>107922813what do you think lodestones will do with the new models? He's already trainign Z-Chroma on the undistilled ZIT and grafted the Flux 2 VAE onto it or something
>>107922825It can only become better once he uses base + with his experience.Issue is money.
its too addicting remaking and remixing my old gens and class fap materials from massive +100gb fap folder with a enhanced ultra hd textures. The is getting pretty scary ngl.
>>107922840good but too much peach fuzz
>>107922831>with his experience.On what? Did he make some good model I don't know about lmao
>>107922809at least if it misses you can reroll until you hit
>>107922845Experience is all about mistakes, anon.
>>107922862proof?
>>107922840please mix my sister good sir i pay you good money sirrrr i want the very best quality like in you're pictures as gesture you can keep the pictures sirrrrr
>>107922840that doesn't sound healthy, have you considered getting a gf or taking testo blockers?
>>107922845there is always this strange cope about furries on the community. don't know what causes it, but the next model is probably going to come from the noob team. the ilu team is probably going to gatekeep theirs behind donations.
>>107922082Garbage 3dcgi sloppa
>>107922831>It can only become better once he uses base + with his experience.He will take a perfectly serviceable base model and instead of finetuning it like any sensible person would he'll perform some Frankenstein surgery on it, blow it up twice in size, increase gen time by 5 times and will turn it into a grade A model for peopel who coom to amputee porn
>>107922862He ended up blaming chroma being shit as "it's a base, someone else needs to further finetune it" lol. I have no hope of him learning from anything he does.
>>107922866Leave me alone.
>>107922883fuck you
>>107922882>>107922880Time will tell, I'm sure he'll try it regardless anyway.
There is no way this is the easiest way to take the amount of input frames of a video, divide them by 8, round the number, multiply it by 8, and add +1. Right? Surely I'm missing something here.
>>107922885benchodbenchod
>>107922894guys point and laugh at this moron
>>107922894Sometimes it's just easier to do it in your head lol
>>107922910retard
>>107922877>the ilu team is probably going to gatekeep theirs behind donations.Very likely. They are currenly finetuning ZImageTurbo
>>107922915For knowing my multiplication table?
>>107922894What are you doing anon?Just use a calculator and do the math.Also shouldn't it be the opposite anyway?You start by fps + duration you want, you just multiply them and add 1.10s@24fps 10 x 24 + 1 = 241 frames
>>107922928i dont believe you
>>107922894Nigga just type the the number of frames for every extra second into a note and pin on the WF lmao.
>>107922894there are general math nodes that take in multiple inputs and you can use / * min max or whatever. you don't need to make a new node for every single operation.
>>107922894
>>107922980anon that's too complex someone like him
>>107922980Very nice, thanks.
>>107922990based delivery
>>107922894im l8
maybe the LTX-2 schizo was right. some of the gens I've seen have the most expressive voices I've ever heard coming out of any model.
>>107923078*vomits*
do people in civitai just use the worst possible images as example or are all the ZIT loras there really this shitty?
>>107923113sirrrrrr you dont like my indian beuty ?
>>107923113>see dogshit gen with lots of likes>man I could do way better than this>downloads lorasame principle as those mobile ads with the person intentionally playing awful
>>107923101Nta but it's just undercooked right now, everything else about the model is way more impressive than wan and it has way more potential. Unless wan releases 2.5 or something the next few iteration will probably put ltx on top
>>107922581turn this image into an amateur photo
>>107923125>>107923113it's cool we can make good loras and all but the easier they are to train the more terrible quality ones just flood the site. but i guess when you're just making coom bait it doesn't really matter
>>107923078Do this.
>>107923161ok
>>107923161just did
>>107923161enjoy the prompttransform style of image to a photorealistic photograph. change the lighting to cinematic very dark night lighting. maintain the same expression and emotion. keep the eyes closed.some you change first sentence "transform style of image to a photorealistic cinematic high budget Hollywood movie scene." >>107923140thanks anon.
>>107923202she became vance lmao
>>107921912solid kek
HURRY UP
>>107923283two
>>107923283more
>>107923326more
How long does it usually take for a model to populate with loras on civitai? I'd like to do more with Klein but I'm new to this and don't knowOr should I just get good at prompting? I accidentally switched from my z-image lora workflow to klein beforehand and ended up with something better than the lora could produce anywayThough unfortunately neither of them knew what tommy guns were but I think I know how to fix that myself
>>107923283weeks
>>107923339weeks
>>107923344i forgot
>>107923283fourteen
>>107923283
>>107923361additional
>>107923381units
>>107923389share prompt
klein won
>>107923389
>>107923078ultimate test
>>107923372
2x speedup for Klein 9b base/schnell on 30-series cards.https://github.com/BobJohnson24/ComfyUI-Flux2-INT8Comparing with fp8 on schnell:https://files.catbox.moe/6o01vq.jpgMiku grids:INT8https://files.catbox.moe/c0bej3.jpgRegular Slow FP8https://files.catbox.moe/f09m1f.jpg
>flux klein picks up details like moles during training>face still has 50% likeness at best what is this shit?
>>107923530You should try prior concept preservation, I heard it helped with likeness training during flux1 when things were just as shit.
>>107923522>int8absolute cope quant>3000why are poor people in this thread? this is for vram chads.
>>107923522>int8Go home kid
>>107923458
>>107923530We need more details than that. Batch size? LR? Res?
28 minutes for a 1080p 5 seconds video.4 steps. I maxed 24GB of Vram.
>>107922882Chroma isn't shitty with the right negatives though. And Flash generally speaking has significantly better default outputs than HD without negatives at all since it's a guidance distilled model.
>>107923832It was beautiful.
>>107923827already triedbatch size 1, 2, 4 LR 1e-4, 3e-4res 512, 1024rank 16, 32cant observe any improvements, all results are shit tier
>>107923669not bad, at least it figured out it's a human :D
>>107922762Why? The distilled is real similar to 9B Distilled most of the time at full precision and 8 to 10 steps as it is
>>107923832I legit was waiting so long for the video to start before realizing she's just breathing lol.
>>107923669how did you prompt this?
>>107923854Is that rank in weird ai toolkit scaling or normal person kohya scaling? Like how big does it come out
>>107923864I do.>>107923868Did it for Space Engine.
>>107923862lol no, not even close...
>>10792387566-133mb
>>107923832If someone could give me a hint on how to keep the contrast/brightness consistent. I already use the color correction node when I interpolate frames. It's good but no seamless.
>>107923854do these work>https://huggingface.co/malcolmrey/klein9/tree/main
so, how many reference images can you jam into klein 9B?
>>107923870I used >>107922008 but changed 'monster' to 'subject' ymmv
>>107922762it's the one with the license that isn't annoying, it'll probably mostly be 4b if the checkpoint gets finetuned>>107923854for some lora it seemed better to do 5e-5 or such but of course it takes more steps, and the issue wasn't character likeness
>>107923876I meant wallpaper engine.
>>107924016I settled on 3 for my custom workflow. But it's not that useful. Additional conditioning images (more specifically pixels) increase inference time. With 3 images, even distill becomes pretty slow.Besides, the model has very poor training for addressing individual images among references. It doesn't understand which image is 1, which is 2, etc. It often just guesses depending on your prompt. You can see it by switching the order of your images, but the result will be still the same. This fact makes multiple reference images even less useful.
>>107924019prompting guide has example with 8
>>107924016https://docs.bfl.ai/flux_2/flux2_overview#which-model-to-choose
>>107924015work as in "i can generate pictures with the lora"? yeswork as in "the person doesnt look like a complete retard"? nosimilar to what i see in my trainings.>Uploading files from your IP range has been temporarily blocked due to abusehttps://files.catbox.moe/fagki4.png>>107924041i see, but honestly i'm kinda done with training for now. wasted hours on this already and the results are simply not usable wheras zit picked up the characters within 1500 steps
>>107924081>carefuly prepared images with white bgI AM BENCHMOOOOXING
>>107924084What parts of her face are retarded. Theres strange uncanny valley effect but I cant pin it down
>>107923832where's the video?
I was pointed to this place by another Anon.With barely any coding experience, how do I get started with local image generation?I'd like to make something in the style of picrel.
>>107924075But if I wanted to do something like an instant style transfer, it might be useful, yes?
>>107924110neoforge, follow the I stallation instructions and download a checkpoint that knows styles like noob or wai_bsfw and maybe a lora if one exists in that style. make sure the lora is the same type as the checkpoint or illustrious based
>>107921912HOLY FUCKING CINEMA
>>107922082>>107922878I like it.
>>107924150proof?
>>107924125I tried style transfer with 3 images, and it was bad. It just mashed together relevant parts of all images, rather than following general style. If it works with 8 reference images, be sure to tell us.But anyway, reference images are treated as conditioning. They replace long lexical descriptions (instead of "pink plaid skirt" just give an image). So, I think style that the model doesn't know intrinsically can't be copied.
>>107922831Like Pony?
>>107924168Up your ass.
>>107923522Going to be bed now but I am interested. I wonder how it works instead of just running Q8 gguf. No need to de-quantize, if so how? Also next time provide the bf16 baseline images as well in your comparisons. Thanks for sharing though, ignore the troll responses.
>>107924128Can you tell me where I can find those?
>>107923522what about 50 series?
>>107924105i'd say her eyes (looking in 2 directions?) and how one side of her face is bloated despite barely smiling
>>107924128Are some loras/models cross compatible? I swapped a pony model for an illustrious model, kept the pony lora, and it still worked.
>>10792421550 series already supports fp8 and even nvfp4.
>>107921912i thought ai was supposed to be soulless
>>107924260fuck you then
>>107924015His Z loras weren't the best quality, but resembled the person they were trained on good enough and did that consistently.Not the case here. They look like an ai-slop version of the person that resemble, but doesn't pass as them.
we need another NAI leak or its over for local
>accidentally generate the literal perfect futa>not even a faggot
>>107923878They are though lol, like this is 4B and 9B Distilled plus ZIT all genned with 8 steps base + 8 steps hi-res res denoise, no cherry picking. 4B doesn't quite rotate her hands correctly on this seed but the overall thing isn't exactly ultra far off 9B.```A woman with vibrant emerald-green hair executes a perfect handstand in the center of the frame, her body forming a straight vertical line as gravity pulls her locks toward the floor. She wears a bright yellow t-shirt that bunches slightly at the shoulders and distressed blue denim shorts rich with fabric texture. To her left, a stoic capybara sits in calm contrast to her dynamic pose, its coarse brown fur rendered in sharp detail against the seamless neutral studio background. Masterful studio lighting defines the scene, utilizing a large overhead softbox to create soft, wrap-around illumination that highlights the muscles in the woman's arms, while a subtle cool-toned rim light separates the subjects from the backdrop and accentuates the green of her hair. Style: High-end commercial DSLR photography. Mood: Playful, precise, and energetic.```
>>107924110Use LMARENA GPT 5.2, Gemini 3, or Opus 4.5 to guide you through the instalation process. Search "Forge Classic Haoming02" on google then select github, then select the Neo branch.Another plug and play method is SwarmUINoob and WAI are good starting points.
>>107924323didn't read you text but all pictures you posted are shit
>>107924081it ignored like half the fucking prompt tho
this is supposed to be Momo from Twice. Clothing texture is pretty decent.
>>107922922
>>107921845
>>107921860
>>107921870
>>107921889
>>107924316wdym? futa is a straight man fetish
>>107921952
>>107922008
>>107924405>>107924464Husbant.. I...
>>107924405>>107924464the dead eyes are a nice touch
>>107922157
>>107922283
>>107924467lmao faggot
>>107924177>Show a church in the same style as the reference images.Not great, but not terrible. Reference images were 512x512 SD1.5 gens.
>>107923202
>>107922503
>>107923395
>>107923430
It's really cool how this guy can post his videos less than 1 minute apart. That's really neat. Over and over and over again.
>>107923426
You need to be really high IQ to understand these LTX2 vidjews
>>107923078
>>107922379
>>107922501
>>107924177>>107924511Updated the prompt and it's doing a little better.>Show a church in the same style as the reference images. Same lighting. Same building style. Same feel. Same cave elements.
>>107922724
Can you do the SVI thing with LTX2 to make longer videos?
>>107924544does that mean hes using a proxy? how is one able to post so quickly like that?
>>107924582>>107924511it really does not want to do a cave without stalactites.
>>107924593https://github.com/Rolandjg/LTX-2-video-extend-ComfyUI
>>107921912I demand 3 seasons and a movie
>>107924596what do you mean, 4chan allows you to post 1 minute apart by default
>>107924624no I think he is the hacker known as 4chan
>>107924323just hide the toes, Klein. No one will know that you can't do it.
>>107924624hes posting them 30 seconds apart though
>>107924584Awful face retention tho
>>107924646That's actually easy if you're not a mutt
>>107923510
>>107924669so he is spamming with a proxy? why? just to shit up the thread? strange behavior
>>107924635Z not great at toes either DESU
>>107924529>>107924550>>107924671holy shit lmao
>>107924680it's called having fun, mutt
>>107924529lmao
>>107924511
>>107924698whatever helps you sleep at night
>>107924719uh oh meltie
>>107924723?
>>107924655
>>107924323>>107924688>Z not great at toes either DESUToes look fine in that image. Klein on the otherhand...every time you cross a red line, the body flips direction. The hands in the green circles have thumbs on the wrong side.
>>107924090skill issue?>>107923522I'll try this. I get around 6 second a gen on my 3090 once the model and clip are cached. but 2x is always nice if there's not much of a quality hit.
canadians are also mutts desujust saying
>>107924695
>>107924698>gets so upset he starts spamming with a proxy >"im just having fun"kek
>>107924740proof?
>>107924743>>107924584>>107924550>>107924516>>107924507>>107924501>>107924473Whatever LoRA you're using for these... it fucking sucks
>>107924733I love how she just nonchalantly walks away. truly a master of fruit tricks.
Unets:>dumb at composition>still makes kino>super easy to train and extend>extended tools make composition weakness irrelevant DiT:>cope quants make it look shitty>worse artifacts than unets>a million snake oils>slop base quality>can do some composition accurately>boomer prompts>optional use a llm to boomer prompt for you but makes the output even more slopthis tech is regressing over the stupidest shit. you dags won't even admit to it. sdxl forever and ever because researchers chase benchmarks
how do people debug this shit when the models take 5 minutes to load after every edit to the code you make
>>107924785most of us have a 6000 at least
>>107924788yeah that's probably itthink VC money is on the decline anyway so why bother
Someone make a desloppify LoRA. I don't have time.>make many controlled gens with ZIT>run through SD1.5 img2img with some denoise setting>SD1.5 model will fuck up small features.>use these image pairs as your target/source for Klein or qwen-editThere's something for you to do, anon.
>>107924785White men can just execute code in their minds and not write jeeted code
>>107924818gimme 2 weeks
>>107924818but Z looks more slopped than Klein by default, it's just blurry background perfect asian woman model generator with zero variety
mongo spamming his dogshit mp4s
>>107924825granted
>>107924835I enjoy watching them. some of them are quite funny.besides, these threads need a high turnover rate. it's what sets them apart from normie threads.
>>107924836who is this semen demon
>>107924852i wish sound was enabled on /g/
>>107924832You are too fucking stupid to reply to my post
>>107924852turn you over
>>107924866 got traded to Mariners? The seamen?
>>107924852>besides, these threads need a high turnover rateThey're already the quickest imggen threads on /g/. We're fine without the low effort proxy spam.
>>107924866indeed. but then I guess we'd kill /wsg/
kek>The woman is now a nigger. Maintain all other aspects of the composition and layout exactly as they are.
>>107924973I'm very surprised they didn't remove that word from training
>>107924995some people have always been permitted to say it, and flux can't distinguish users, at least I hope it can't
>>107924995it isn't in training, it just thinks he means people from nigeria
>>107924818look at the Zit Girl's hands though, if the Flux.1 VAE was a bit better she'd definitely have like 8 fingers on each one
>>107925013can you make a sd 1.5 and z base merge?
>>107925013woopsthat was meant for:>>107924734
>>107925013>>107925020The hands are facing the correct direction and the thumbs are on the correct side.
>>107924016I've managed to fit 5 different images and made it work but just barely
I know I'm late but Klein (9b distilled) sucks. Text sucks. Multiple characters suck. Anatomy sucks.Going back to ZIT.
>>107925034For just transferring style, 8 512x512 images seem to work well enough.
>>107924550
>>107925044bye. enjoy your same 1 asian woman
>>107925044No one's using Klein for t2i
>>107925044>xhe doesn't know they can go over 4 steps to improve quality
>>107925044real?
>>107925062thanks>>107925074doesn't do anything
>>107925085i made this image :)
>>107924743this is art
>>107924785debug what, custom nodes? setup a empty comfyui instance and use sd1.5 models for fast loading.
hnnnnghhow far we've come
>>107925100qrd
60 WAN gens queued uptime to go to sleep. I'll wake up to many presents, can't wait!
>>107925087how do you promot the black bars
>>107925103I remember when I was genning these at a batch of 4 on a 1080
>>107925106the presents:https://files.catbox.moe/d7t7zr.mp4https://files.catbox.moe/m4ngdp.mp4
>>107925114what does batch do? is it parallel genning?
>>107925106it's gonna OOM as soon as you fall asleep
>>107925107paint because Christian board
>>107925118yes. takes more vram, processes them at the same time.
>>107925103>>107925118holy retardation
>>107925065i am
>>107925133im new
>>107924785Meanwhile in sdcpp it takes 10 seconds to launch, load the model and generate an image
>>107925115basically, but no furry this time. anime gens are too unpredictable. I need to babysit those.>>107925120Nah. with Q8 I never get anywhere close to maxing the vram
>>107925136im surprised youre able to use the computer
>>107925144why are you being mean?
>>107925139>I never get anywhere close to maxing the vramcomfy doesn't care. It'll just start leaking shit into memory for no reason. Godspeed, anon.
>>107925149qrd
new>>107925157>>107925157>>107925157>>107925157
I updated comfy and now I have to recompile flash-attention....
>>107925151I've done this nearly daily and it never OOM's.
>>107925163proof?
>troll bake>again How many times do we have to teach you old man
Baking real. 300 seconds.
Fresh >>107925289>>107925289>>107925289>>107925289
>>107925296Didn't get any warning or ban for this bake btw