Discussion of Free and Open Source Diffusion ModelsPrev: >>107894964https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Flux Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>WanXhttps://github.com/Wan-Video/Wan2.2>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Blessed thread of friendship
Where the hell is unsharded Qwen3-8B in BF16?
>>107896312>unsharded Qwen3-8Bwhat?
>>107896284ltx
>>107896301>picrelthat's very cursed>4B? Skill issue if you are using 9B. 9B also occasionally refuses to do nipples across seeds but I got a lot of it still.9B, I just asked to make an anime image realistic, I think the model didn't know what to do with nipple piercingsI didn't test multiple seeds so there's that
>>107896312https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>107896245lul
>>107896322one giant safetensor instead of multiple safetensor files you can't import in comfyui
>>107896330asking for cosplay might work better
>>107896333Thank you!
>>107896311Is she pointing at her feet?
Juggernaut the best model for making anime?
>>107896336whats qie
>>107896348Someone gotta clean 'em soles bucko
>>107896350Qwen Image Edit
>>107896360
Can any one of you troglodytes inform me as to what the current meta is?I'm still using noobAI (29+1+h)
>>107896374No fuck you, maybe don't name call while asking to be spoon fed
>>107896374you are in the meta. it's going to be six months before zimage base releases then you have to wait for the finetune
>>107896128>retard here. since Klein can do edits does that mean if you finetune it you'll need to include edit pairs as well?
>>1078963932 weeks*
>>1078964086 years*
>>107896341you are either a jaded loser or a zoomer
>>107896384I asked it before and not one faggot volunteered so I figured acting like a cunt would do it>>107896393Thanks anonSince you said base, I'm assuming the turbo ver currently released isn't all that good, yeah?
https://github.com/Tongyi-MAI/Z-Image/issues/126SHUT UP GWELLO YOU ARE IN NO POSITION OF POWE->Klein gets releasedACK, Chingsisies, what should we do?
>>107896429>I'm assuming the turbo ver currently released isn't all that good, yeah?are you joking? this model is amazing, that's why people are begging for Z-image base to be released
>>107896432my guess : internal politics + turbo model more popular than they expected + finetuning because the base model isn't that good and turbo was lightning in a bottle moment
>>107896440Well, that anon said I was in the meta when I said I'm using a noobai finetune so the natural conclusion is that zimage turbo doesn't replace itWhat's the pros and cons of zimage turbo? Is it good for anime, realism, classical paintings etc?
>>107896432would be funny if these gituhub randos piss off Tongyi so much that they don't release it, memeing Chinese Culture into reality
>>107896457z-image if you make only 10-20 images a day is the best for realismklein 9b if you want diversitychroma if you want 2d goon
>>107896457For it's size it's incredibly good for realism while being fast.It can also do complex prompts SDXL can never do without controlnet and regional prompt autism. The text, texture detail and backgrounds are infinitely better than SDXL too.
>>107896432I get the frustration but that's just asking for "get a refund" response
>>107896457Z-image turbo is distilled, so it can't be finetuned and can't replace SDXL base, which is why we need Z-image to be able to move on>Is it good for anime, realism, classical paintings etc?it's insanely good at realism, anime is ok I guess
>>107896473>chroma if you want 2d goonand 10 out of 100 images without anatomy abomination
>>107896465kek, i guess this is what happened to wan 2.5. if i recall they said something like "if you ask nicely" jokingly then the "community" collectively lost their shit lmao
>>107896484the quality of the good one makes up for it, it is a gacha alright
>>107896374Zimage and Chroma 2k. Klein for editing. 8gb vram minimum, 16gb is ideal.
ryan gosling in drive, netflix edition:
>>107896507why are you so obsessed with this dude, at least pick a cute girl
no styles is not meta
>>107896507it would appear you arent high enough off fent, mr. bond.>>107896516cause it makes libtards mad on social media, and he's a good test case for meme gens
>>107896516because it makes you mad
>>107896516>why are you so obsessed with this dudeI'm asking myself the same question when I see endless Charlie Kirk memes on tiktok desu
>>107896530but it doesn't?
>>107896533>>107896516enough, we all know what they want, they are all fags in deep denial and end up worshiping males as a way to cope with their lust
>>107896446you're probably close, my guess is that they want to make this right and have a really solid base model, but I'm afraid they'll overdo it and slop this shit instead of letting it act like a normal base model
>>107896530I'm tired of seeing his ugly face anon, it's not about being mad, this shit is gay and boring>>107896533same shit
>>107896542come on, CK and GF are ugly motherfuckers, if they were homo they would go for more handesome dudes lol
>>107896556You vill hear about trannies every day, you vill see the big lipped babboon every hour and you vill be happy.
>>107896561>applying logic to a coping mechanism
>>107896561did you miss the deep denial part? if they genned handsome men and got hard they'd have to accept they're gay
>>107896473I'm always in a complex situation in these cases because the things I enjoy in AI are pretty nicheNoob allows me to do pretty show-accurate anime/manga and the style of almost any artist (even some esoteric mfers) on danbooru>>107896475>>107896480I see, thank you guys for the context>>107896506>16gb is idealI gotta get a new card
>>107896581buy a 6000 btw
>>107896528one more with this guy.replace the face of the black man in image 1 with the man in image 2, who is holding a white bag of powder in a ziploc bag. Change the text "RUSH HOUR" to "FENT HOUR". leave the asian man on the left unchanged.klein 9b distill (grab the q8, it's a small model) is a lot of fun. also the ability to copy font styles is neat, seems to do it better than qwen.
>>107896542zoomers are just poisoned by politics, there is nothing much you can do about it, they will make a billion trump floyd and whatever else american politics related instead of obsessing over nice female curves
>>107896581>Noob allows me to do pretty show-accurate anime/manga and the style of almost any artist (even some esoteric mfers) on danbooruAt the expense of having some bad anatomy, background, composition and overall consistency/cohesion. It was good a year ago but I can't help but see all it's faults now.>>107896584For sure, tomorrow by midnight I'll be shipping you the receipt
Highest resolution you've gone with Klein9b and Zimage before it broke down?With klein I tried up to 2.5MP and it dealt with it fine.
>>107896596yeah it works fine at high resolutions, those modern models don't shit their pants like their predecessors, that's when you can see the field is improving for real
>>107896507
>>107896618lmao
>>107896618faggot
Is zimage natural language or does it understand booru prompts?
>>107896629i'm not 100% sure
make the image a wireframe like a technical document for the anime girl:
>>107896631can you make him into a girl?
>>107896629why would a model not trained on danbooru understand danbooru tags?
>>107896629>>107896630not surehttps://www.youtube.com/watch?v=sVyRkl5qNb8
make a chibi size plush doll of the anime girl made of fabric.
>>107896670
>>107896581
>>107896592>having some bad anatomyit gets hands right more often than not compared to the new models but they haven't been tuned yet so maybe in time it will be better>backgroundsI agree with you here>consistency/cohesionthe main complaint about dit models is the lack of variation, I wouldn't mark this as a point against sdxl
>>107896670>>107896686I'd get these
>>107896670>>107896686that's cute!
>>107896697>it gets hands right more often than not compared to the new models but they haven't been tuned yet so maybe in time it will be bettergonna are gonna have to lear the hard way that that is 100% tied to aesthetic tuning it to one specific style like anime for noob or photorealism for z image (plus tons of rl training)
>>107896432The real answer is internal company review processes.In research, the release of things backlogs for a billion different reasons because of internal reviewers.You know how we got all this "model coming soon" stuff on Github? People forget about it, then 6 months later the researchers are like "Model live on Huggingface!"It sucks but its the truth.
>>107896686
make a plastic anime figure of the girl on a round pedestal
>>107896716>You know how we got all this "model coming soon" stuff on Github? People forget about it, then 6 months later the researchers are like "Model live on Huggingface!"yeah I know what you mean, I remember this same situation happen once but I don't remember what model it was though, probably something mid lol
Where can I download Klein image edit workflow for comfy? I can only find the text to image workflow.
>>107896696EndearingNow do him turning into a titan>>107896697Good to knowI suppose I'll just wait a while before I try another model, doesn't seem like there's a local option that can fit my needs better than noob rn
>>107896738https://github.com/BlenderNeko/ComfyUI_ADV_CLIP_emb
how the fuck does klein know what image 1/2/3 is?
>>107896738look at templates in comfyui
>>107896745it stiches them together I think from left to right. You can describe it in other ways though.
>>107896732
>>107896697yeah i mostly agree with you. the newer models that /ldg/ are fawning over are powerful for realism but dont know characters, series, or proper danbooru tags. not really suited to anime gooner material if thats your thing
>>107896793with i2v or an image reference it doesn't need to know them nativelyfor anime image sources you have stuff like wai/illustrious which knows everything
>>107896756if black forest labs was nintendo they would start suing plastic doll factories for copyright infringement
>>107896807racist
>>107896800this is just cope and legwork to make a model work the way you expect ootb at this point. models are great when it's easy to use. loras are just as bad imo
>>107896800having a structured prompt tag system, which applies to noob, but also illustrious finetunes in general is a huge advantage. you might not be good at first, but you can study and get good
>>107896825if you aren't using an llm to write your prompts you are doing it wrong
>>107896756
>>107896836
>I stopped using 4chan since the hack. I now browse alt chans that actually care about their users, and don't need an userscript fighting their shitty design.
>>107896618How are you getting Klein to maintain the likeness so sharply? It's changing every person I try and manipulate to a horrible mess of JPEG compressed Flux-face.
>>107896885i'll tell you
>>107896807show me whatever chink model you think would do better at that kind of closeup while still realistically representing the peach fuzz and such
>>107896885be less retardedKlein 9B Distilled, 8 steps, "The man is now wearing blackface. Everything else is exactly the same." Input image resolutio same as output resolution.
>>107896875
>>107896905Not what I asked about, fuck you piece of shit
>>107896848nice
>>107896908watwhy am i a piece of shit for pointing out that Klein shouldn't be making significant changes to anything, generally
>>107896914(samefag) like in general, literally make the output the same res as the input, you'll get best results that way
>>107896913that head tho
why didnt NAI switch to chroma?
>>107896913good body, face needs to be 20s and not early 30s
>>107896885pretty much default
>>107896929>early 30sdo gweilo women really age like that? lol
>>107896927what the fuck are you talking about nigger, Noob stopped training when they ran out of money, in general
>>107896927they made their own from scratch model that is better than everything else for animie. They refuse to do realism since they allow loli so of course that would get them in trouble
>>107896942>Noob stopped training when they ran out of money, in generalwait what? what happened?
>>107896927isn't nai novel ai?in that case this:>>107896943
>>107896920it matches the original, and stubbornly refuses to get smaller no matter how I prompt it>>107896929better? can't get it to do any ages between this and the original
>>107896951i bet she fucks desi men
>>107896955her head looks too big lol, face is fine sureI guess it's the input image
>>107896948it was a long time ago dude lmao, like at least a yearalso the other guy is right, NAI usually means NovelAIonly zoomer retards associate "NAI" strictly with Noob
>>107896954who is noob and why did he ran out of money
>>107896955
I added a "max_images_allowed" parameter so that you won't have to bypass anything to switch from "2 images mode" to "1 image mode" or "no image mode" or whateverhttps://github.com/BigStationW/ComfyUi-TextEncodeEditAdvancedhttps://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>107896987buy an ad
>>107896964>I guess it's the input imageanime has always head huge heads proportionally to the body compared to a real human
>>107896987Your intro is about QIE. Does Klein have the same issue?
>>107896954>isn't nai novel ai?i'll never not love the chinks that made noob for stealing nai's name because of the seethe it causes them
>>107897002the zoom-in issue? it has, but it's way less severe than QiE, anyways, the vl_megapixels value must always be at 0 for Klein since it's used only for QiE
>>107896896(intentional samefag self reply)Seedream 4.5 gives this at max resolution for the exact same prompt, for the record. Continues to suffer from the retarded obsession of incel Chinks with weird fucking glowing eyes on white women and a general facial structure that resembles no real person who exists anywhere on earth.Like when I train loras I train them on actual photographs of actual normal looking real people, literally go out of my way most of the time to have a good spread of ethnic and age variance, and also caption the specific ethnicity and age of all people who appear, both male and female. As such I have no respect whatsoever for anyone who does anything less, and I especially don't give a shit about ugly retarded Chinese beauty standards that don't look good in any way to anyone who isn't nativly Chinese.
>>107896913make a 3d printed recreation of the character
>>107897012No, this : >The default TextEncodeQwenImageEdit node downscales your images to 0.15 megapixels before feeding them to the VLM.But you answered my question. You should write somewhere that your node can be used with Klein too.
>>107897015now prompt it to turn her back to og asuka
>>107896976
wan2gp now supports flux klein 4b and 9b.
>>107897021yeah I should probably change the readme at some point
>>107897033>it's the anon with the shitty skin fetishfuck me
klein lorasklein lorasklein lorasklein lorasklein loras
>>107897044post "good skin" the you nogen faggot, I dare you
>>107897022make this into an anime illustration
what schedulers and samplers are yall folx using with klein for realism? Res2/beta seems to kinda work for me at 1.5 cfg, steps
>>107897049a bit washed out colors but pretty good
>>107897044>shitty skinyou say that because she's black racist :'(
>>107896987>ConditioningNoiseInjectionNeat, zimage severely lacks variation so might give z another go. Will it support other models in the future?
>>107897058correct
>>107897041
17.6 seconds on a 4090, 9b distilledThe rug is a large, intricately patterned tapestry with a dominant circular mandala design in the center. The primary motif is a concentric radial pattern with multiple layers radiating outward, resembling a blooming flower or sunburst. At the center is a small, dark blue circle encircled by lighter blue and grey petal-like shapes forming a tight rosette. Surrounding this core, layers alternate in warm and cool tones—rust red, deep navy, soft purples, and desaturated oranges.The second ring features diamond-shaped petals with red centers and dark blue outlines, creating a sharp, rhythmic contrast. The next ring is a wide band of leaf- or feather-like shapes, pointing outward and alternating between muted blue and copper tones. These are bordered by thin concentric rings that divide each section cleanly, maintaining geometric precision.Beyond the central mandala, the rug transitions into a repeating motif of teardrop and eye shapes, arranged in a circular rhythm. These shapes are filled with detailed internal patterns: dots, lines, and floral curves, mostly in subdued hues of indigo, grey-blue, and maroon.The outer background of the rug is a deep violet or midnight blue, filled with tiny floral emblems and star-like dots, scattered evenly to create a celestial ambiance. There’s also a visible fabric texture throughout, giving the sense of a woven tapestry rather than a printed rug.
>>107897052cfg 2 + res6s + whatever the default node uses for scheduler since it's not explicit
>>107897078>res6sbro you gen 1 image per hour huh
NovelAI is still better than everything local, right? At least generally
>>107897033
lol. klein doesn't generate genitalia, but it swaps them in no problem.
>>107897009
>>107897060>Will it support other models in the future?I think it works on every models
>>107897079120s per image, I'm fine with it
>>107897088swaps them??for me half the time it adds panties, always conservative types
>>107896342Amusingly (and correctly) asking to make the image like cosplay = background will by default look like inner city sprawl because of conventions all being hosted in those environments, though of course you can just ask for whatever location. still, i've found just asking to make it into a real photo works completely fine; the big issue is more unnatural elements (wings, coloured eyes, wands/staffs, tails etc, or ofc very stylised art) stack up and make the face worse and more uncanny the more of them there are whether you call it cosplay or not. if you really badly want a nice aesthetic pic of a character like this, the play might honestly be (1) image edit to remove those, (2) convert to photo, (3) show it photo & original drawing & ask to add the staff/tail/wings/etc to the photo. Those individual elements might still look a little bit tacky but at least the face will preserve its good quality instead of coming across with poison baggage.
>>107897106>picture 1 with a skin mound>picture 2 with genitals>prompt "swap the genitals">boomsample size of 2 kek
my god, it's flawless, you actually can't even tell it's ai
>>107897141I see, I'll definitely try, this a good way to leverage other models good understanding of genitals
>>107897015
>>107897013Z Image version genned and upscaled the same way as the Klein onewow can't believe she looks like a completely generic as fuck unrealistic Asian-injected SD 1.5 esque sloppa 1girl with significantly worse detail than the Klein versionwho could have imagined these results
>>107897044its ok to admit that you want to colonize her anon. This isn't /pol/ or reddit.
>>107897166black women are based, that anon and his obsession with damaged leathery dirty skin is not
My very first Flux Klein gen. (9b base, fp8)>A woman with large breasts wearing a long gown at midnight. The image is so dark it is barely possible to see anything.I must admit I'm impressed. Many models would not be able to do an adequate job with this prompt. Whether it can do a gen that looks good on close inspection remains to be seen, but so far so good.
>>107897187
>>107897235I think it goes too hard on the night though, it's almost completly black lol
>>107896432>>107896446since base model will be so much slower than turbo, it needs to be sufficiently smarter that waiting 60s for a base gen is more appealing than waiting 50s for 5x turbo gens and filtering for the best result, e.g. needs to be so much better or so much more competent at edge case difficulty tasks that the turbo model frequently fucks up. W/ that said i dont know why theyd even call the models "turbo" and "base" of the "same" model after this much work, sounds like it's practically gonna be Z-Image V2. Might as well have co-released base on the day but told people not to directly use it for inference, as with Flux 2 Klein's base.>>107897148i know you're just joking around but i actually do think the clothes and hair etc on this gen look great. the chunky fingers and retarded eyes matched the input image which is a pretty valid decision you can easily explicitly prompt against and same for the neotenous jawline, the only other blatant mistake is one pinky finger being in grey glove material / almost unnoticeable, and i guess the zipper being gone. frankly i think its mindblowing that raw unfinetuned local can do this now.
>>107897250Did you read the prompt?
>>107897250nta but to be fair>The image is so dark it is barely possible to see anything.it did what it was asked lol
>input a low res image of a person>tell klein to upscale the image, and use a high res picture of the same person to grab detail fromwhy doesnt this work?
Uh oh.. gen #2 is SLOPPED. Time to play with settings and maybe img2img if necessary