/g/ - Technology

Discussion of Free and Open Source Diffusion Models

Prev: >>107894964

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux Klein
https://huggingface.co/collections/black-forest-labs/flux2

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of friendship
>>
File: 1747233206253473.jpg (3.99 MB, 4800x6912)
>>
Where the hell is unsharded Qwen3-8B in BF16?
>>
>>107896312
>unsharded Qwen3-8B
what?
>>
File: 1744254943357984.mp4 (3.79 MB, 1638x2048)
>>107896284
ltx
>>
>>107896301
>picrel
that's very cursed

>4B? Skill issue if you are using 9B. 9B also occasionally refuses to do nipples across seeds but I got a lot of it still.
9B, I just asked to make an anime image realistic, I think the model didn't know what to do with nipple piercings
I didn't test multiple seeds so there's that
>>
>>107896312
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>
File: 1764701272526086.png (1.77 MB, 2282x1225)
>>107896245
lul
>>
>>107896322
one giant safetensor instead of multiple safetensor files you can't import in comfyui
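For anyone stuck with only the sharded files: a minimal sketch of gluing shards into one file, assuming the standard safetensors layout (8-byte little-endian header length, JSON header, then the raw tensor bytes). The function names and file paths are made up for illustration, and whether ComfyUI accepts the resulting key names is a separate question this does not answer.

```python
import json
import struct


def read_header(path):
    """Return (header_dict, data_start_offset) for one .safetensors file."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    return header, 8 + header_len


def merge_shards(shard_paths, out_path, chunk_size=1 << 20):
    """Concatenate the tensor data of several shards into a single file.

    Hypothetical helper: offsets of each shard are rebased onto the end of
    the previous shard's data, so the merged buffer stays contiguous.
    """
    merged = {}
    plan = []  # (path, data_start, data_len) for the copy pass
    base = 0
    for path in shard_paths:
        header, data_start = read_header(path)
        header.pop("__metadata__", None)  # per-shard metadata is dropped
        data_len = max(
            (info["data_offsets"][1] for info in header.values()), default=0
        )
        for name, info in header.items():
            if name in merged:
                raise ValueError(f"duplicate tensor name across shards: {name}")
            begin, end = info["data_offsets"]
            merged[name] = {
                "dtype": info["dtype"],
                "shape": info["shape"],
                "data_offsets": [base + begin, base + end],
            }
        plan.append((path, data_start, data_len))
        base += data_len

    header_bytes = json.dumps(merged, separators=(",", ":")).encode("utf-8")
    header_bytes += b" " * (-len(header_bytes) % 8)  # pad header to 8 bytes

    with open(out_path, "wb") as out:
        out.write(struct.pack("<Q", len(header_bytes)))
        out.write(header_bytes)
        for path, data_start, data_len in plan:
            with open(path, "rb") as f:
                f.seek(data_start)
                remaining = data_len
                while remaining:
                    chunk = f.read(min(chunk_size, remaining))
                    if not chunk:
                        raise IOError(f"shard is shorter than its header says: {path}")
                    out.write(chunk)
                    remaining -= len(chunk)

# Example call (paths are placeholders):
# merge_shards(["model-00001-of-00002.safetensors",
#               "model-00002-of-00002.safetensors"], "model-merged.safetensors")
```

It streams the data in chunks, so it should not need the whole model in RAM. It does not convert dtypes or rename keys.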
>>
>>107896330
asking for cosplay might work better
>>
>>107896333
Thank you!
>>
>>107896311
Is she pointing at her feet?
>>
Juggernaut the best model for making anime?
>>
>>107896336
whats qie
>>
>>107896348
Someone gotta clean 'em soles bucko
>>
>>107896350
Qwen Image Edit
>>
File: I volunteer.gif (29 KB, 220x214)
>>107896360
>>
Can any one of you troglodytes inform me as to what the current meta is?
I'm still using noobAI (29+1+h)
>>
>>107896374
No fuck you, maybe don't name call while asking to be spoon fed
>>
>>107896374
you are in the meta. it's going to be six months before zimage base releases then you have to wait for the finetune
>>
>>107896128
>retard here. since Klein can do edits does that mean if you finetune it you'll need to include edit pairs as well?
>>
>>107896393
2 weeks*
>>
>>107896408
6 years*
>>
>>107896341
you are either a jaded loser or a zoomer
>>
>>107896384
I asked it before and not one faggot volunteered so I figured acting like a cunt would do it
>>107896393
Thanks anon
Since you said base, I'm assuming the turbo ver currently released isn't all that good, yeah?
>>
File: 1742500495187424.png (215 KB, 2524x1052)
https://github.com/Tongyi-MAI/Z-Image/issues/126
SHUT UP GWELLO YOU ARE IN NO POSITION OF POWE-
>Klein gets released
ACK, Chingsisies, what should we do?
>>
File: Z-image turbo.png (1.72 MB, 1280x720)
>>107896429
>I'm assuming the turbo ver currently released isn't all that good, yeah?
are you joking? this model is amazing, that's why people are begging for Z-image base to be released
>>
>>107896432
my guess : internal politics + turbo model more popular than they expected + finetuning because the base model isn't that good and turbo was lightning in a bottle moment
>>
>>107896440
Well, that anon said I was in the meta when I said I'm using a noobai finetune so the natural conclusion is that zimage turbo doesn't replace it
What's the pros and cons of zimage turbo? Is it good for anime, realism, classical paintings etc?
>>
>>107896432
would be funny if these github randos piss off Tongyi so much that they don't release it, memeing Chinese Culture into reality
>>
>>107896457
z-image if you make only 10-20 images a day is the best for realism
klein 9b if you want diversity
chroma if you want 2d goon
>>
>>107896457
For its size it's incredibly good for realism while being fast.
It can also do complex prompts SDXL can never do without controlnet and regional prompt autism. The text, texture detail and backgrounds are infinitely better than SDXL too.
>>
>>107896432
I get the frustration but that's just asking for "get a refund" response
>>
File: Z-image turbo.png (2.13 MB, 1280x720)
>>107896457
Z-image turbo is distilled, so it can't be finetuned and can't replace SDXL base, which is why we need Z-image to be able to move on
>Is it good for anime, realism, classical paintings etc?
it's insanely good at realism, anime is ok I guess
>>
>>107896473
>chroma if you want 2d goon
and 10 out of 100 images without anatomy abomination
>>
>>107896465
kek, i guess this is what happened to wan 2.5. if i recall they said something like "if you ask nicely" jokingly then the "community" collectively lost their shit lmao
>>
>>107896484
the quality of the good one makes up for it, it is a gacha alright
>>
>>107896374
Zimage and Chroma 2k. Klein for editing. 8gb vram minimum, 16gb is ideal.
>>
File: 1743459197440173.png (1.37 MB, 896x1152)
ryan gosling in drive, netflix edition:
>>
>>107896507
why are you so obsessed with this dude, at least pick a cute girl
>>
no styles is not meta
>>
File: 1765667012223871.png (1.44 MB, 1168x880)
>>107896507
it would appear you arent high enough off fent, mr. bond.
>>107896516
cause it makes libtards mad on social media, and he's a good test case for meme gens
>>
File: 1744018335711699.png (1.91 MB, 1296x1200)
>>
>>107896516
because it makes you mad
>>
>>107896516
>why are you so obsessed with this dude
I'm asking myself the same question when I see endless Charlie Kirk memes on tiktok desu
>>
>>107896530
but it doesn't?
>>
>>107896533
>>107896516
enough, we all know what they want, they are all fags in deep denial and end up worshiping males as a way to cope with their lust
>>
>>107896446
you're probably close, my guess is that they want to make this right and have a really solid base model, but I'm afraid they'll overdo it and slop this shit instead of letting it act like a normal base model
>>
>>107896530
I'm tired of seeing his ugly face anon, it's not about being mad, this shit is gay and boring

>>107896533
same shit
>>
>>107896542
come on, CK and GF are ugly motherfuckers, if they were homo they would go for more handesome dudes lol
>>
File: Flux2-Klein_00115_.png (2.49 MB, 1440x1440)
>>
>>107896556
You vill hear about trannies every day, you vill see the big lipped babboon every hour and you vill be happy.
>>
>>107896561
>applying logic to a coping mechanism
>>
>>107896561
did you miss the deep denial part? if they genned handsome men and got hard they'd have to accept they're gay
>>
File: 1742654724389070.png (952 KB, 896x1152)
>>107896473
I'm always in a complex situation in these cases because the things I enjoy in AI are pretty niche
Noob allows me to do pretty show-accurate anime/manga and the style of almost any artist (even some esoteric mfers) on danbooru
>>107896475
>>107896480
I see, thank you guys for the context
>>107896506
>16gb is ideal
I gotta get a new card
>>
>>107896581
buy a 6000 btw
>>
File: 1768495058857827.png (1.59 MB, 1360x752)
>>107896528
one more with this guy.

replace the face of the black man in image 1 with the man in image 2, who is holding a white bag of powder in a ziploc bag. Change the text "RUSH HOUR" to "FENT HOUR". leave the asian man on the left unchanged.

klein 9b distill (grab the q8, it's a small model) is a lot of fun. also the ability to copy font styles is neat, seems to do it better than qwen.
>>
>>107896542
zoomers are just poisoned by politics, there is nothing much you can do about it, they will make a billion trump floyd and whatever else american politics related instead of obsessing over nice female curves
>>
>>107896581
>Noob allows me to do pretty show-accurate anime/manga and the style of almost any artist (even some esoteric mfers) on danbooru
At the expense of having some bad anatomy, background, composition and overall consistency/cohesion. It was good a year ago but I can't help but see all its faults now.
>>107896584
For sure, tomorrow by midnight I'll be shipping you the receipt
>>
Highest resolution you've gone with Klein9b and Zimage before it broke down?
With klein I tried up to 2.5MP and it dealt with it fine.
>>
File: 1750027553857730.jpg (676 KB, 1504x1728)
>>
>>107896596
yeah it works fine at high resolutions, those modern models don't shit their pants like their predecessors, that's when you can see the field is improving for real
>>
File: Flux2-Klein_00129_.png (2.63 MB, 1264x1632)
>>107896507
>>
>>107896618
lmao
>>
>>107896618
faggot
>>
Is zimage natural language or does it understand booru prompts?
>>
>>107896629
i'm not 100% sure
>>
File: 1737928024700966.png (866 KB, 640x1632)
make the image a wireframe like a technical document for the anime girl:
>>
File: 1750630152729472.png (2.59 MB, 1796x1800)
>>107896631
can you make him into a girl?
>>
>>107896629
why would a model not trained on danbooru understand danbooru tags?
>>
>>107896629
>>107896630
not sure
https://www.youtube.com/watch?v=sVyRkl5qNb8
>>
File: 1756049948050662.png (1.28 MB, 640x1632)
make a chibi size plush doll of the anime girl made of fabric.
>>
File: 1767257510018198.png (1.31 MB, 816x1264)
>>107896670
>>
File: 1757623176966597.mp4 (1.18 MB, 320x768)
>>107896670
>>
File: 1768465468395546.mp4 (3.31 MB, 1592x2048)
>>107896581
>>
>>107896592
>having some bad anatomy
it gets hands right more often than not compared to the new models but they haven't been tuned yet so maybe in time it will be better
>backgrounds
I agree with you here
>consistency/cohesion
the main complaint about dit models is the lack of variation, I wouldn't mark this as a point against sdxl
>>
>>107896670
>>107896686
I'd get these
>>
>>107896670
>>107896686
that's cute!
>>
>>107896697
>it gets hands right more often than not compared to the new models but they haven't been tuned yet so maybe in time it will be better
you're gonna have to learn the hard way that this is 100% tied to aesthetic tuning toward one specific style, like anime for noob or photorealism for z image (plus tons of rl training)
>>
>>107896432
The real answer is internal company review processes.
In research, the release of things backlogs for a billion different reasons because of internal reviewers.
You know how we got all this "model coming soon" stuff on Github? People forget about it, then 6 months later the researchers are like "Model live on Huggingface!"
It sucks but it's the truth.
>>
File: 1749034714536976.mp4 (3.74 MB, 1292x2048)
>>107896686
>>
File: 1743266224367751.png (1.13 MB, 816x1264)
make a plastic anime figure of the girl on a round pedestal
>>
>>107896716
>You know how we got all this "model coming soon" stuff on Github? People forget about it, then 6 months later the researchers are like "Model live on Huggingface!"
yeah I know what you mean, I remember this same situation happen once but I don't remember what model it was though, probably something mid lol
>>
Where can I download Klein image edit workflow for comfy? I can only find the text to image workflow.
>>
>>107896696
Endearing
Now do him turning into a titan

>>107896697
Good to know
I suppose I'll just wait a while before I try another model, doesn't seem like there's a local option that can fit my needs better than noob rn
>>
>>107896738
https://github.com/BlenderNeko/ComfyUI_ADV_CLIP_emb
>>
File: 1754846995826639.png (216 KB, 820x952)
how the fuck does klein know what image 1/2/3 is?
>>
File: file.png (2.52 MB, 2047x1388)
>>107896738
look at templates in comfyui
>>
>>
>>107896745
it stitches them together I think from left to right. You can describe it in other ways though.
>>
File: 1746728656431076.mp4 (2.74 MB, 640x1024)
>>107896732
>>
>>107896697
yeah i mostly agree with you. the newer models that /ldg/ are fawning over are powerful for realism but dont know characters, series, or proper danbooru tags. not really suited to anime gooner material if thats your thing
>>
>>107896793
with i2v or an image reference it doesn't need to know them natively

for anime image sources you have stuff like wai/illustrious which knows everything
>>
>>107896756
if black forest labs was nintendo they would start suing plastic doll factories for copyright infringement
>>
>>107896807
racist
>>
>>107896800
this is just cope and legwork to make a model work the way you expect ootb at this point. models are great when it's easy to use. loras are just as bad imo
>>
>>107896800
having a structured prompt tag system, which applies to noob, but also illustrious finetunes in general is a huge advantage. you might not be good at first, but you can study and get good
>>
>>107896825
if you aren't using an llm to write your prompts you are doing it wrong
>>
>>107896732
>>
File: 1758609721508638.mp4 (1.5 MB, 576x896)
>>107896756
>>
File: 1766929188848771.jpg (431 KB, 1920x1344)
>>
File: 1739721515300015.mp4 (1.63 MB, 768x576)
>>107896836
>>
File: vz5.png (3.01 MB, 2560x2560)
>I stopped using 4chan since the hack. I now browse alt chans that actually care about their users, and don't need an userscript fighting their shitty design.
>>
>>107896618
How are you getting Klein to maintain the likeness so sharply? It's changing every person I try and manipulate to a horrible mess of JPEG compressed Flux-face.
>>
>>107896885
i'll tell you
>>
>>107896807
show me whatever chink model you think would do better at that kind of closeup while still realistically representing the peach fuzz and such
>>
File: Comparison.jpg (2.69 MB, 2496x1872)
>>107896885
be less retarded
Klein 9B Distilled, 8 steps, "The man is now wearing blackface. Everything else is exactly the same." Input image resolutio same as output resolution.
>>
File: 1745099928200689.mp4 (3.95 MB, 2048x2048)
>>107896875
>>
>>107896905
Not what I asked about, fuck you piece of shit
>>
File: 1754479483592823.jpg (541 KB, 1872x1392)
>>
File: Flux2-Image_00102_.png (1.46 MB, 848x1232)
>>107896848
nice
>>
>>107896908
wat
why am i a piece of shit for pointing out that Klein shouldn't be making significant changes to anything, generally
>>
>>107896914
(samefag) like in general, literally make the output the same res as the input, you'll get best results that way
>>
>>107896913
that head tho
>>
why didnt NAI switch to chroma?
>>
>>107896913
good body, face needs to be 20s and not early 30s
>>
File: file.png (1.33 MB, 2100x1350)
>>107896885
pretty much default
>>
>>107896929
>early 30s
do gweilo women really age like that? lol
>>
>>107896927
what the fuck are you talking about nigger, Noob stopped training when they ran out of money, in general
>>
>>107896927
they made their own from scratch model that is better than everything else for animie. They refuse to do realism since they allow loli so of course that would get them in trouble
>>
>>107896942
>Noob stopped training when they ran out of money, in general
wait what? what happened?
>>
File: 1737513274129046.png (2.44 MB, 1120x1392)
>>
>>107896927
isn't nai novel ai?
in that case this:
>>107896943
>>
File: Flux2-Concat_00109_.png (2.57 MB, 1694x1232)
>>107896920
it matches the original, and stubbornly refuses to get smaller no matter how I prompt it
>>107896929
better? can't get it to do any ages between this and the original
>>
>>107896951
i bet she fucks desi men
>>
>>107896955
her head looks too big lol, face is fine sure
I guess it's the input image
>>
>>107896948
it was a long time ago dude lmao, like at least a year
also the other guy is right, NAI usually means NovelAI
only zoomer retards associate "NAI" strictly with Noob
>>
>>107896954
who is noob and why did he run out of money
>>
File: 1768709490.png (1.65 MB, 992x1056)
>>
File: 1740897416925863.mp4 (1.81 MB, 832x1216)
>>107896955
>>
File: 1759315082119941.png (132 KB, 819x1044)
I added a "max_images_allowed" parameter so that you won't have to bypass anything to switch from "2 images mode" to "1 image mode" or "no image mode" or whatever
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>
>>107896987
buy an ad
>>
>>107896964
>I guess it's the input image
anime has always had huge heads in proportion to the body compared to a real human
>>
>>107896987
Your intro is about QIE. Does Klein have the same issue?
>>
>>107896954
>isn't nai novel ai?
i'll never not love the chinks that made noob for stealing nai's name because of the seethe it causes them
>>
File: 1768709707.png (1.78 MB, 1088x944)
>>
>>107897002
the zoom-in issue? it has, but it's way less severe than QiE, anyways, the vl_megapixels value must always be at 0 for Klein since it's used only for QiE
>>
File: ChinkDreamFourPointFive.jpg (3.81 MB, 3072x4096)
>>107896896
(intentional samefag self reply)
Seedream 4.5 gives this at max resolution for the exact same prompt, for the record. Continues to suffer from the retarded obsession of incel Chinks with weird fucking glowing eyes on white women and a general facial structure that resembles no real person who exists anywhere on earth.

Like when I train loras I train them on actual photographs of actual normal looking real people, literally go out of my way most of the time to have a good spread of ethnic and age variance, and also caption the specific ethnicity and age of all people who appear, both male and female. As such I have no respect whatsoever for anyone who does anything less, and I especially don't give a shit about ugly retarded Chinese beauty standards that don't look good in any way to anyone who isn't nativly Chinese.
>>
>>107896913
make a 3d printed recreation of the character
>>
>>107897012
No, this :
>The default TextEncodeQwenImageEdit node downscales your images to 0.15 megapixels before feeding them to the VLM.

But you answered my question. You should write somewhere that your node can be used with Klein too.
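To put the 0.15 megapixel figure in perspective, a rough sketch of what that downscale does to the dimensions while keeping the aspect ratio. The function name is made up, 1 MP is assumed to be 1,000,000 pixels, and the actual node may round or define megapixels differently.

```python
import math


def scale_to_megapixels(width, height, megapixels, pixels_per_mp=1_000_000):
    """Return (new_width, new_height) with roughly `megapixels` total pixels.

    Hypothetical helper: uniform scale factor, so aspect ratio is preserved
    up to rounding. `pixels_per_mp` is an assumption, not taken from the node.
    """
    scale = math.sqrt(megapixels * pixels_per_mp / (width * height))
    return max(1, round(width * scale)), max(1, round(height * scale))


for size in [(1024, 1024), (2048, 1024), (1280, 720)]:
    print(size, "->", scale_to_megapixels(*size, megapixels=0.15))
```

Under these assumptions a 1024x1024 input ends up a bit under 400 pixels per side, which is why fine detail in the reference image does not survive the trip through the VLM.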
>>
>>107897015
now prompt it to turn her back to og asuka
>>
File: 1751321610817989.png (2.67 MB, 1008x1536)
>>
File: 1741221835943003.mp4 (3.86 MB, 1890x2048)
>>107896976
>>
wan2gp now supports flux klein 4b and 9b.
>>
>>107897021
yeah I should probably change the readme at some point
>>
File: Flux2-Image_00120_.png (3.19 MB, 1232x2048)
>>
>>107897033
>it's the anon with the shitty skin fetish
fuck me
>>
klein loras
klein loras
klein loras
klein loras
klein loras
>>
>>107897044
post "good skin" the you nogen faggot, I dare you
>>
>>107897022
make this into an anime illustration
>>
what schedulers and samplers are yall folx using with klein for realism? Res2/beta seems to kinda work for me at 1.5 cfg, steps
>>
>>107897049
a bit washed out colors but pretty good
>>
>>107897044
>shitty skin
you say that because she's black racist :'(
>>
>>107896987
>ConditioningNoiseInjection

Neat, zimage severely lacks variation so might give z another go. Will it support other models in the future?
>>
>>107897058
correct
>>
File: 1767425939385074.mp4 (2.18 MB, 576x1024)
>>107897041
>>
File: Flux2-Klein_00013_.jpg (2.32 MB, 2048x2048)
17.6 seconds on a 4090, 9b distilled
The rug is a large, intricately patterned tapestry with a dominant circular mandala design in the center. The primary motif is a concentric radial pattern with multiple layers radiating outward, resembling a blooming flower or sunburst. At the center is a small, dark blue circle encircled by lighter blue and grey petal-like shapes forming a tight rosette. Surrounding this core, layers alternate in warm and cool tones—rust red, deep navy, soft purples, and desaturated oranges.
The second ring features diamond-shaped petals with red centers and dark blue outlines, creating a sharp, rhythmic contrast. The next ring is a wide band of leaf- or feather-like shapes, pointing outward and alternating between muted blue and copper tones. These are bordered by thin concentric rings that divide each section cleanly, maintaining geometric precision.
Beyond the central mandala, the rug transitions into a repeating motif of teardrop and eye shapes, arranged in a circular rhythm. These shapes are filled with detailed internal patterns: dots, lines, and floral curves, mostly in subdued hues of indigo, grey-blue, and maroon.
The outer background of the rug is a deep violet or midnight blue, filled with tiny floral emblems and star-like dots, scattered evenly to create a celestial ambiance. There’s also a visible fabric texture throughout, giving the sense of a woven tapestry rather than a printed rug.
>>
>>107897052
cfg 2 + res6s + whatever the default node uses for scheduler since it's not explicit
>>
>>107897078
>res6s
bro you gen 1 image per hour huh
>>
NovelAI is still better than everything local, right? At least generally
>>
File: 1761575028856961.mp4 (2.01 MB, 576x896)
>>107897033
>>
lol. klein doesn't generate genitalia, but it swaps them in no problem.
>>
File: 1750031417049621.mp4 (3.03 MB, 640x512)
>>107897009
>>
File: Flux2-Image_00126_.jpg (773 KB, 1280x1904)
>>
>>107897060
>Will it support other models in the future?
I think it works on every model
>>
>>107897079
120s per image, I'm fine with it
>>
>>107897088
swaps them??
for me half the time it adds panties, always conservative types
>>
>>107896342
Amusingly (and correctly) asking to make the image like cosplay = background will by default look like inner city sprawl because of conventions all being hosted in those environments, though of course you can just ask for whatever location. still, i've found just asking to make it into a real photo works completely fine; the big issue is more unnatural elements (wings, coloured eyes, wands/staffs, tails etc, or ofc very stylised art) stack up and make the face worse and more uncanny the more of them there are whether you call it cosplay or not. if you really badly want a nice aesthetic pic of a character like this, the play might honestly be (1) image edit to remove those, (2) convert to photo, (3) show it photo & original drawing & ask to add the staff/tail/wings/etc to the photo. Those individual elements might still look a little bit tacky but at least the face will preserve its good quality instead of coming across with poison baggage.
>>
>>107897106
>picture 1 with a skin mound
>picture 2 with genitals
>prompt "swap the genitals"
>boom
sample size of 2 kek
>>
File: Flux2-Concat_00141_.png (2.76 MB, 1796x1200)
my god, it's flawless, you actually can't even tell it's ai
>>
>>107897141
I see, I'll definitely try, this a good way to leverage other models good understanding of genitals
>>
File: 1760020691226156.mp4 (3.3 MB, 448x704)
>>107897015
>>
File: ZImageTurbo_Output_1251.png (3.23 MB, 1248x1824)
>>107897013
Z Image version genned and upscaled the same way as the Klein one
wow can't believe she looks like a completely generic as fuck unrealistic Asian-injected SD 1.5 esque sloppa 1girl with significantly worse detail than the Klein version
who could have imagined these results
>>
>>107897044
its ok to admit that you want to colonize her anon. This isn't /pol/ or reddit.
>>
>>107897166
black women are based, that anon and his obsession with damaged leathery dirty skin is not
>>
File: 1740083143952022.png (1.99 MB, 896x1744)
>>
File: Flux2-Klein_00001_.png (913 KB, 1024x1024)
My very first Flux Klein gen. (9b base, fp8)

>A woman with large breasts wearing a long gown at midnight. The image is so dark it is barely possible to see anything.

I must admit I'm impressed. Many models would not be able to do an adequate job with this prompt. Whether it can do a gen that looks good on close inspection remains to be seen, but so far so good.
>>
File: 1762382441486374.mp4 (3.88 MB, 1024x2048)
>>107897187
>>
>>107897235
I think it goes too hard on the night though, it's almost completely black lol
>>
>>107896432
>>107896446
since base model will be so much slower than turbo, it needs to be sufficiently smarter that waiting 60s for a base gen is more appealing than waiting 50s for 5x turbo gens and filtering for the best result, e.g. needs to be so much better or so much more competent at edge case difficulty tasks that the turbo model frequently fucks up. W/ that said i dont know why theyd even call the models "turbo" and "base" of the "same" model after this much work, sounds like it's practically gonna be Z-Image V2. Might as well have co-released base on the day but told people not to directly use it for inference, as with Flux 2 Klein's base.
>>107897148
i know you're just joking around but i actually do think the clothes and hair etc on this gen look great. the chunky fingers and retarded eyes matched the input image which is a pretty valid decision you can easily explicitly prompt against and same for the neotenous jawline, the only other blatant mistake is one pinky finger being in grey glove material / almost unnoticeable, and i guess the zipper being gone. frankly i think its mindblowing that raw unfinetuned local can do this now.
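On the 60s base gen vs 50s for 5x turbo gens point, a quick sketch of the best-of-N arithmetic. The per-gen success rates below are made-up placeholders, not measurements, and gens are assumed independent.

```python
def p_at_least_one(p_single, n):
    """Chance that at least one of n independent gens is a keeper,
    given each gen succeeds with probability p_single."""
    return 1.0 - (1.0 - p_single) ** n


# Hypothetical turbo success rates per gen; the result is also the
# single-gen hit rate base would need to match a batch of 5 turbo gens.
for p_turbo in (0.1, 0.3, 0.5):
    needed = p_at_least_one(p_turbo, 5)
    print(f"turbo hit rate {p_turbo:.0%} -> base needs about {needed:.0%} per gen")
```

So if turbo only nails a hard prompt a small fraction of the time, base has a realistic bar to clear. If turbo already lands it about half the time, base would need to be close to perfect per gen to be worth the wait.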
>>
>>107897250
Did you read the prompt?
>>
>>107897250
nta but to be fair
>The image is so dark it is barely possible to see anything.
it did what it was asked lol
>>
>input a low res image of a person
>tell klein to upscale the image, and use a high res picture of the same person to grab detail from
why doesnt this work?
>>
File: Flux2-Klein_00001_.png (1.77 MB, 1024x1024)
Uh oh.. gen #2 is SLOPPED. Time to play with settings and maybe img2img if necessary
>>
File: 1748706960440617.png (2.93 MB, 1248x1248)
>>
>>107896932
You know what? It might be the fp8 text encoder fucking my shit up... anyone know where to get an unquanted 9b safetensors file (that isn't sharded)?

Comfy only hosts fp8 and those have always been ass when it comes to quality.
>>
>>107897235
>>107897250
>The image is so dark it is barely possible to see anything.
I reckon it followed his prompt extremely well. However, I will say that I've had it generate black clothing that was extremely fucking close to just raw black paint bucket in an otherwise normal daylight image. I saw an anon suggest running at 8 steps instead of 4 for turbo and from my testing he's completely right, it's not much of a time hit (due to text encoder / model loading / vae time in practice) and you get a consistent improvement in all minor details and low attention areas. Don't trust me though, go test yourself on a half dozen images

What are the rules around "image 1" and "image 2" for multi image combination stuff? That's one thing I've still had inconsistent results with, I'm not sure if my wording is just poor or the model sometimes just takes a stab in the dark with what you want done, it feels like sometimes it interprets my prompts as if the instructions were swapped, adds the wrong character into the opposite image etc. Does it know the image order very well or am I better off prompting like "the image with the blonde" & "the cartoon image" etc?
>>
File: 1757898769449599.mp4 (3.86 MB, 1364x2048)
>>107897250
>>
File: 00027-314691688.png (757 KB, 832x1216)
>>107897235
prompt please:)? i want to emulate that level of darkness night lighting.
>>
>>
not a single literate person in this general
>>
>>107897288
I don't know if I can replicate it with a better cfg setting. The default was 5.0, which is WAY too high for most images. I'm trying really low values right now and it seems like the sweet spot will be somewhere between 1.0 and 2.0

Anyway the prompt was in my post lol
>>
>>107897250
that's what he asked for though lmao
ZiTNiggers be like
NOOO THE GEN ISN'T MUH EBIN REALISTIC
when the prompt is like "2D, anime only, it's a drawing, on paper, literally traditional media, pretty woman"
>>
>>107897241
but who was phone??
>>
File: 1765568549237914.png (2.58 MB, 1024x1536)
>>
>>107896293
>I CAN'T SEIG
>>
>>107897311
>ZiTNiggers be like
rent free lmao, at no point he said he was a ZiT enjoyer
>>
File: 1761930750881479.mp4 (3.97 MB, 2048x2048)
>>107897267
>>
File: file.png (1.44 MB, 2433x1573)
learn these meme skills now
>>
>>107897311
>NOOO THE GEN ISN'T MUH EBIN REALISTIC
yes this is me
>>
Can zit train character loras properly or is it completely trash if you want to train character loras?
I mean it is the distill after all.
>>
>>107897360
>every single wf now is just "stuff everything inside a big-ass subgraph"
>look inside
>noodle hell that nobody even bothers to tidy up
>>
>>107897373
you can easily train a character lora. use the adapter method

https://github.com/ostris/ai-toolkit
>>
File: 1742445106084969.png (2.17 MB, 1850x768)
>>107897360
>>
File: 1748868109725425.png (281 KB, 2528x952)
>>107897311
>NOOO THE GEN ISN'T MUH EBIN REALISTIC
desu even if it's not at the level of Z-image turbo it's quite impressive, this model can do both edit and pure text to image shit, I doubt Z-image edit will be much more realistic than Klein since they don't seem to be using RLHF like on turbo
>>
File: 1754578376564789.png (2.78 MB, 1008x1536)
>>
>>107897382
I don't mind it that much for the Flux 2 Klein default workflow, since it lets you easily toggle the same workflow between image modification and multi-image modification, and I added a third subgraph for regular t2i into mine. However, it was irritating having to fuck around with the subgraph's signature to get it using the gguf cliploader, since I'm using the qwen3 8b q8 gguf. The only way to do it seemed to be trusting black magic: make a new input pin and connect it to the input of a contained gguf cliploader node. That *doesn't* work if you just reconnect the existing "clip_name" pin to the new clip node, because doing that won't reset the type. So it's not just magic, it's magic which sometimes doesn't activate, with no apparent way to explicitly do it yourself. I had to make a new node and delete the old one. Also, noise randomize/increment doesn't seem to even work on the default subgraph for me, so I also had to pull that out into an external input node (fine, I wanted a single global one anyway, but it's more shipped-broken stuff).
>>
>1girl incel grifter islamophobic h8ters ITT
big sad. m(1g)ga
>>
File: ErikaKirk_ZiT_Output.png (2.68 MB, 2016x1152)
>>107897373
it's good if you train with Ostris V2 adapter. At least if you train for 100 epochs on a dataset of 50+ images that were meticulously captioned with jailbroken Gemini 3 Pro, with regular AdamW at 0.0005 model LR, Cosine With Restarts at 3 restarts, 5 gradient accumulation steps with a batch size of one, and 1024x1024 base resolution, and Dim 32 / Alpha 32. Picrel.
>>
>>107897451
Thanks, I'll give it a try.
>>
>tfw no albino gf
>>
lmao
https://civitai.com/images/116078924
>>
File: Flux2-Klein_00026_.png (779 KB, 704x1152)
I think I'm starting to understand Flux Klein base. Shockingly, the model it reminds me of most is... Flux 1 dev. Who could have guessed?
>>
>>107897443
oh, and the default workflow has a little trap to fuck up your gens btw. "ImageScaleToTotalPixels" converts the input image to exactly 1 megapixel in size in case retards feed it a 10 megapixel image and can't understand why their gen took half an hour. if you're using the official workflow make sure you delete or bypass that horseshit, for even some slight resizes it can turn what might have been a crystal perfect modification into a fried artifact-filled output with fuzzy aliased outlines (i noticed on a 2d img2img)
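
A rough sketch of what that resize does to the dimensions, assuming the node scales both sides by the same factor so the area lands on the megapixel target, and assuming one megapixel is counted as 1024*1024 pixels. `scale_to_total_pixels` is a hypothetical helper, not the node's actual source:

```python
import math


def scale_to_total_pixels(width, height, megapixels=1.0):
    """Return the (width, height) an image would be resized to so that its
    area is roughly `megapixels`, keeping the aspect ratio."""
    total = int(megapixels * 1024 * 1024)  # assumed definition of a megapixel
    scale_by = math.sqrt(total / (width * height))
    return round(width * scale_by), round(height * scale_by)


# An input already at the target area is left alone...
print(scale_to_total_pixels(1024, 1024))
# ...but an input only slightly under it still gets resampled.
print(scale_to_total_pixels(832, 1216))
```

The second case is the kind of slight resize described above: the image is already close to 1 megapixel, yet it still gets resampled by a couple of percent before it reaches the model.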
>>
File: MadisonBeer_ZiT_Output.png (2.63 MB, 1152x2016)
>>107897451
Madison Beer also trained the same wayish here
>>
>>107897510
no I know that, I was referring to the fact that ZiTNiggers literally don't care about the prompt, they just look at any comparison and assume the winner is whichever image is most realistic even when it's wholly uncalled for
>>
File: 1740019555955277.jpg (359 KB, 1616x1616)
>>
File: Comparison.jpg (1 MB, 2838x989)
>>107897311
to be fair you can remove some of the slop if you go for more steps. I think they made a mistake by going for a 4-step distillation process, it would've been more realistic if they did like Z-image turbo and went for an 8-step distillation process
>>
>>107897484
>account name ai images
>uploaded an ai vidya
I've been lied to
>>
>>107897525
isn't this the same comment i replied to in:
>>107897520
how did you delete it lmao
>>
>>107897533
>how did you delete it
I forgot to add the image :(
>>
Are both 9B and 4B Kleins edit models?
>>
>>107897536
yes
>>
File: Flux2-Image_00174_.png (1.27 MB, 848x848)
>>
File: 1742081993718920.png (66 KB, 1172x538)
https://www.reddit.com/r/StableDiffusion/comments/1qfylsf/flux2klein_training_lora_is_now_supported_in/
based
>>
>>107897491
Looks like my ex, Jen, but younger and with a better body.
>>
>bghirafag
>>
>>107897491
>the model it reminds me of most is... Flux 1 dev
I played a lot with flux 1 dev and I don't think of that model when playing Klein, it's way less slopped than that
>>
>>107897571
does that mean more than 64gb ram for 9b?
>>
Got three gens in a row of SD1.5-tier limbgore with Klein, then it went back to behaving... weird...

>>107897585
I was pretty good at un-slopping Flux 1 with my bag of tricks. e.g. picrel was a Flux 1 dev gen
>>
File: Flux2-Image_00189_.png (3.27 MB, 1424x2048)
>>
File: Flux2-Image_00192_.png (1.2 MB, 848x1120)
>>
File: Flux2-Concat_00193_.png (1.88 MB, 1692x1120)
Klein knows a lot of styles
>>
File: 1765516238566917.png (57 KB, 1255x613)
>>107897571
literally what?
>>
File: Flux2-Image_00197_.jpg (1.11 MB, 1280x1872)
>>
>>107897612
nice, what style is this?
>>
WAIT WHAT?
I didn't think to test, I expected it to be censored but apparently the 4B klein is super uncensored, even does penis out of the box. Nothing else before it did that:
https://civitai.com/posts/25961518
>>
File: Flux2-Concat_00200_.png (1.91 MB, 1688x1120)
>>107897637
Linocut / Woodcut
>>
File: TOTAL BRETZEL VICTORY.gif (172 KB, 220x220)
>>107897644
>even does penis out of the box. Nothing else before it did that
keek, Z-image who??
>>
File: Flux2-Klein_00192_.png (1.23 MB, 832x1248)
>>
File: 1748615619158850.png (2.32 MB, 1040x1488)
>>
>>107897644
>pony prompts.
disgusting
>>
ok yea, Klein 4B really is sdxl 2 now. Trainable on cheap hardware, knows a ton in many styles, is as uncensored as a base model can get...
>>
>>107897644
explain to me how this company went from "muhh safety is the most important thing to the world" to "oh yeah our model can make dicks now" in less than 2 months???
>>
>>107897644
>4B klein is super uncensored
No it's not. It turns panties into shorts. If it shows dicks it only means it has gay porn in it and not general lewds.
>>
File: Flux2-Concat_00213_.png (2.83 MB, 1696x1152)
>>107897665
GOLEM GET YE GONE
>>
>>107897644
>ublock blocks so much the site doesn't even work
heh
>>
>>107897684
do 4B base, not 9B
>>
File: Flux2-Image_00218_.png (1.66 MB, 848x1296)
>>
>>107897644
Isn't this just an SD gen? It's a very nice one, though presumably handpicked from a huge amount of inputs, but Civit just barely vaguely seems to be indicating it's a touched up gen from some SD1.x model, and definitely doesn't seem to say anywhere that this is Flux. I assume you're baiting. With that said, maybe you're not, since I've had plenty of Flux Klein gens happily generate basic pussy, and it's quite happy to do nipples (though sometimes skips if not explicitly prompted and creates smooth breasts). The 9B, at least, doesn't seem to know what cock is though.
>>
>its true
holy shit
https://files.catbox.moe/dsfwp7.png
>>
>>107897644
he probably just used reference images.
>>
File: lets goooooo.png (756 KB, 1400x700)
>>107897714
SDXL IS OFFICIALLY DEAD
>>
https://files.catbox.moe/nad3i6.png
bfl lost their mind
>>
>>107896987
>I added a "max_images_allowed" parameter
this is FUCKING retarded man, it's also completely non-standard on how comfy works, you're basically disabling inputs anyway, why add another source of disabling?
>>
Gonna try running Klein 4B base with the same prompt and settings as I had for 9B base, see what's different
>>
>>107897714
prompt?
>>
I can't post this directly because NSFW but holy shit lmao
https://files.catbox.moe/khnsk5.png
>>
>>107897726
>why add another source of disabling?
because if you want to go for 2 images to 1 image you have to bypass a lot of nodes manually, now THAT is retarded
>>
File: 1765588036239069.png (2.42 MB, 1024x1520)
milk train arrive
>>
not perfect of course but by FAR the best we have ever had out of a base model. SDXL was far worse
https://files.catbox.moe/jgwf3v.png
>>
>>107897733
>>107897725
base or distilled?
>>
https://files.catbox.moe/plpca2.png
https://files.catbox.moe/lrjmje.png
Yea, no idea how they did this. This is so risky for them
>>
>>107897733
prompt?? it also works on distilled?
>>
>>107897731
I mean the nodes are yours so you are free to do whatever, but disabling of groups of nodes is already a resolved problem, either with group disabling (easy use or rgthree? i dont remember) or with lazy switch nodes.
It's just clutter at this point
>>
>>107897749
not telling :3
>>
File: 1740119422124761.png (150 KB, 480x216)
>>107897755
pack it up boys it was a troll all along
>>
nipples need work for sure. It will be crazy with loras
>>
>>107897451
Why 5 grad acc steps for batch of 1?
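
Not the poster's answer, but the usual reason is memory: accumulating gradients over 5 micro-batches of size 1 gives the same averaged gradient as one batch of 5, without holding 5 samples in VRAM at once. A toy sketch with a made-up one-parameter model (all names and numbers here are illustrative assumptions, not anyone's training config):

```python
def grad_mse(w, xs, ys):
    """Gradient of mean squared error for the toy model y = w * x."""
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 6.0, 8.0, 10.0]
w = 0.5
batch_size = 1
accum_steps = 5

# Gradient from one real batch of 5 samples.
full_batch = grad_mse(w, xs, ys)

# Gradient accumulated from 5 micro-batches of 1 sample each,
# each scaled by 1 / accum_steps before being summed.
accumulated = 0.0
for x, y in zip(xs, ys):
    accumulated += grad_mse(w, [x], [y]) / accum_steps

effective_batch = batch_size * accum_steps
print(full_batch, accumulated, effective_batch)
```

The optimizer only steps once per 5 samples in both cases, so the effective batch size is 5 either way. Things like batch-dependent normalization or per-step noise can still make real training runs differ slightly.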
>>
>>107897752
>It's just clutter at this point
just leave the value at 3 if you want to keep doing your convoluted manual disabling shit lol
>>
unironically the greatest model to ever release openly
>>
>>107897725
you can face swap too, they dont care about "muh ethical AI" now

did you take a lewd pic and say "make image 1 in the style of image 2"?
>>
>>107897749
base
>>
>>107897725
Hello. Please share the prompt.
>>
>>107897770
>they dont care about "muh ethical AI" now
what happened in germany? did they recently elect the son of Hitler or something? what's with that sudden 180° turn
>>
>>107897634
Bro that is fucking great
>>
>>107897718
I get thumbs when I do that
>>
>>107897725
>>107897749
>>107897776

Nta but that is the edit model, so my guess would be just give it a pic of a cock as input and that's what you get.
>>
>>107897789
Because he was trolling. Notice all the beggars?
>>
>>107897780
nothing happened, it's just people in the company doing their shit
>>
>>107897793
no penis reference, but it only gets it right like 1/4 times. Still needs training. I cherry picked the good ones.
https://files.catbox.moe/jvnfo4.webp
>>
File: zit.jpg (100 KB, 1400x1800)
>>107897451
>loramancer rituals
i'm not saying this is a good lora, i'm just saying i only had <10 blurry FMV images to work with and i still use waifu diffusion tagger and the step count was whatever number looked the roundest to me.
>>
>>107897807
>no proof
>>
>>107897644
>even does penis out of the box.
>>107897807
>Still needs training.
you're making a blowjob lora or something?


