[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107867304

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Is it over or are we back?
>>
>>107870506
we need to act like flux is good so they release base in response to it
>>
Is this supposed to take more than 20 minutes?
>>
im sorry guys
i have to go to class
ill upload model
soon
>>
>>107870521
but it is? and they gave us klein base too? bet Chang didn't see this one coming
>>
>>107870506
yes
>>
I wonder what horrors BFL has put into the new models to make then unable to do nsfw
>>
>>107870521
it's a turbo model, and the regular model is already out
vramlets rejoice I guess, but shit all good it does us higher castes
>>
File: img_00135_.jpg (481 KB, 1520x1728)
481 KB
481 KB JPG
>>
File: 1747613955052656.png (30 KB, 800x128)
30 KB
30 KB PNG
>nothing todo so i guess i'll try that flux klein thing
>see this
lmao what is this cucked shit? where can i download?
>>
>>107870524
>"desktop app" electron garbage install
probably scanning your computer to collect and sell data
>>
>>107870544
retard tourist
>>
>>107870538
No one's gonna finetune some 32b abomination, this is as good for 5090 chads as it is for VRAMlets, especially if you are coomer
>>
>>107870453
https://www.reddit.com/r/mildlyinteresting/comments/b4qdsf/this_blue_jay_still_has_half_of_their_baby/

upper right image isn't AI
>>
Anyone have a wan2.2 image2image workflow? How does it compare to qwen3 edit?
>>
File: Flux2-Klein_00031_.png (1.37 MB, 960x1280)
1.37 MB
1.37 MB PNG
9B isn't too bad at all bros, still gotta do scheduler/sampler tests etc. tho
>>
>>107867356
catbox anon :D
>>
File: extinction event.jpg (151 KB, 1600x1211)
151 KB
151 KB JPG
>BFL realizing that they actually have to release models that more than 3 people can run if they want to stay relevant for local
I have arrived in bizarro world
>>
Klein9BBros, what settings are you using? i get good results but it seems to add a lot of noise to the image if i go resolution above 1 megapixel. or are there specific "supported" resolutions?

>>107870601
there's an anon who posts a reddit image in every thread disguised as a gen, he's been doing this for a while
>>
Blessed thread of frenship
>>
File: img_00143_.jpg (1015 KB, 1520x1728)
1015 KB
1015 KB JPG
>>
>>107870639
change your pampers
>>
at least the flux2 vae seems to be apache now so chang can use it in the future
>>
File: Flux2-Klein_00041_.png (2.13 MB, 960x1440)
2.13 MB
2.13 MB PNG
>>
File: ZImageTurbo_Output_151511.png (3.13 MB, 1440x1440)
3.13 MB
3.13 MB PNG
I gave Gemini the highlights of this thread to get a caption and then prompted Z with it lol

Pastebin with prompt cause it exceeds 4Chan comment length:
https://pastebin.com/XEJxkkw2
>>
File: Flux2Klein9B_00025_.png (1.73 MB, 1152x896)
1.73 MB
1.73 MB PNG
Klein skin texture test
>>
What model and workflow are you guys using for LTX2 I2V? I've seen some videos that seem to hold the likeness with minimal artifacting, everything I gen gets melty within 2 seconds. T2V works fine.
>>
imagine the Chroma flux2 klein
>twice the size
>20 times as slow
>body horror that'd horrify David Cronenberg
>>
If I run wan fp16 models, do I need to enable fp16 accumulation?
>>
File: Flux2Klein9B_00029_.png (1.78 MB, 1152x896)
1.78 MB
1.78 MB PNG
>>
File: file.png (1.76 MB, 960x1088)
1.76 MB
1.76 MB PNG
Add this to the next collage
>>
>>107870697
I really wasn't expecting Klein to be fully a different model using a totally different text encoder than Dev honestly, kinda interesting.
>>
File: Flux2-Klein_00045_.png (1.67 MB, 960x1440)
1.67 MB
1.67 MB PNG
you can push it to some strange places
https://files.catbox.moe/m5nq9s.png
>>
>>107870761
oh my hecking godderino
>>
>>107870752
I assume that these teams are working at multiple models at any time, and the feedback for Flux2 was less then stellar and they got BTFO by the chinks so they decided to release on of their other projects. Competition is actually working
>>
>>107870687
is that Base or non-Base?
>>
>>107870777
non-base, i haven't figured out the correct workflow for base so i gave up. maybe some other anon will.
>>
>>107870761
no penor or vagoo?
>>
File: ComfyUI_10152_.png (1.25 MB, 912x1200)
1.25 MB
1.25 MB PNG
>>
>>107870761
BFL redemption arc confirmed?
>>
you can def go >4 steps too and fuck with clownshark to get some neat outputs. i'm done posting for now
>>
File: Flux2Klein9B_00037_.png (1.88 MB, 896x1152)
1.88 MB
1.88 MB PNG
>>107870752
and somehow it has better outputs than the 32B monstrosity? wtf.
>>
File: 1766650638527219.png (376 KB, 649x1677)
376 KB
376 KB PNG
>>107870793
>>
>>107870738
yeah zit is done for
>>
>>107870806
What's up with all the noise? that background is something...
>>
>>107870835
i told it to add it
>>
File: Flux2Klein9B_00038_.png (1.6 MB, 1000x1040)
1.6 MB
1.6 MB PNG
>>107870835
>>
File: Flux2-Klein_00052_.png (1.88 MB, 960x1440)
1.88 MB
1.88 MB PNG
>>107870793
yeah dude, bfl put full uncensored cocks in the model now

>>107870801
i think we should all try the models that come out and see what we think for ourselves. my opinion is irrelevant. (now i'm actually done posting)
>>
>>107870801
full Flux.2 could already do shit like this, it's actually noticeably less "censored" than the original Flux was in practice
>>
>>107870806
I mean does it? Have you compared that prompt with the same settings?
>>
>>107870853
>full Flux.2 could already do shit like this
yeah but it's huge and slow as fuck, models of this size are always DOA
>>
>>107870835
probably just the sampler he's using
>>
File: Flux2Klein9B_00041_.png (1.72 MB, 1152x896)
1.72 MB
1.72 MB PNG
>>107870835
Someone needs to figure out the settings, I keep messing with them and sometimes it gens a very noisy image

>>107870860
I've genned enough Flux 2 32B gens that I can tell just from preliminary tests.
>>
Running KoboldAI Lite and Sillytavern with a RTX4070 12GB.

Does anyone have recommended models for filthy uncensored ERP?
>>
>>107870905
Ask in /lmg/
>>
how good is a 5060ti 16gb with these diffusion models? upgrading from a 3080ti 12gb
>>
>>107870846
>>107870881
So uh.. I've seen a few examples now. Some have a good skin texture some are awful. What's the difference?
>>
>>107870928
massive upgrade
>>
>>107870928
get a 5090 while you still can
>>
File: Flux2Klein9B_00049_.png (2.82 MB, 1440x1120)
2.82 MB
2.82 MB PNG
oh shit i've figured it out why my images were so noisy. i was using 20 steps when the official repo recommends 4 steps. that makes it like 2 seconds to generate a single image, like way faster than ZiT

>>107870935
>What's the difference?
prompt/sampler/steps/rng.. too early to tell
>>
>>107870938
thats all i needed to hear thank you anon
>>
File: Flux2Klein9B_00053_.png (2.45 MB, 1440x1120)
2.45 MB
2.45 MB PNG
>>107870948
that one was 8 steps btw that's why it's still noisy

this one is 6 steps euler
>>
>Klein 4b is apache 2.0
we will finally leave XL behind.
>>
>>107870978
BFL will find a way to cuck us
>>
>>107870978
this is force them to release wan 2.5
>>
File: test.png (2.52 MB, 1239x1089)
2.52 MB
2.52 MB PNG
seed diversity test on the 9B distilled with same prompt and settings
>a woman in a dress at the party
>>
Chinese will regret not releasing base, thank you Germany
https://i.4cdn.org/wsg/1768480233175429.mp4
>>
>>107870948
bruh do you not fundamentally understand the point of distilled models lol. The Base version would need 20+ steps yeah, not the non-Base
>>
>>107871044
not bad DESU, good age range
>>
>>107871044
>no desis
trash
>>
Are we eating good?
>>
>>107871049
I don't think I watched a ltx gen in days. I tried to watch this but no, never again.
>>
>>107871044
Not much variation but more than Z I guess
>>
>>107871085
zitter alert
>>
>>107871085
I think we can agree that LTX is only useful for memes and voice cloning or audio. I don't know what else you can use it for.
>>
has the white man put the gooks into their place again?
>>
File: Flux2Klein9B_00101_.png (2.83 MB, 1440x1120)
2.83 MB
2.83 MB PNG
anime test or idk man i never gen this 2d stuff so don't have any prompts for it

>>107871062
i understand but i didn't even pay attention to it, i just switched out the loaded model and text encoder in an existing Flux2 32B workflow i had
you have to cut me some slack this shit released like an hour ago
>>
>>107871103
You will be BEGGING for more loras after the first goon ones come out
>>
>>107871103
Wan sucks balls, in slomo
>>
Are the Klein models editing models too?
>>
>>107871107
ai slop
>>
>>107871106
If by white man you mean jewish, then yes
>>
>>107871117
no slomo if you disable high noise speedup lora, but gen takes 4x longer

>>107871122
this is an ai slop general
>>
>>107871103
there are better memes and better voice models. it does a really shitty job at blending a vid with provided audio. it's a cool idea but it's just shit execution
>>
>>107871122
damn what gave it away
>>
>>107871133
I can gen 10 mid quality videos with ltx in the time it takes wan to do one high quality one, plus ltx has audio. Sometimes quantity > quality
>>
>>107871134
we already have workflows integrating stuff like vibevoice and acestep into ltx 2, it can tell them what to do
>>
>>107871134
Lol I would like to see wan do this, like legit impossible and I am not even talking about the audio, this continuous shot is nuts
https://civitai.com/images/117487189
>>
>>107871143
read my post again retard and tell me that solves my complaint
>>
>>107871152
you read mine retard
>>
>>107871165
I did and it makes you seem illiterate
>>
File: z-turbo_00032_.png (2.07 MB, 1536x1152)
2.07 MB
2.07 MB PNG
>>107871122
>>
Klein looks better than Dev judging from the examples in this thread. How is that possible?
>>
>>107871151
fuck next year we are genning whole 2h movies.
>>
I'm tired of the retards that just use their 10 year old meme folder as a reaction image. just gen something instead my god
>>
>>107871175
i think you lost
>>
>>107871151
>no workflow
fake and gay
>>
>>107871151
Yeah but can it make slomo big titty bounces!? Checkmate loser
>>
>>107871186
Tranny moment
>>
>>107871182
>2h movie
>entire thing is dry delivery, fried slop, jank movement and blurry poop
nice
>>
>>107871202
we getting loras and you will see
>>
File: zimg_0062.png (2.33 MB, 1662x1222)
2.33 MB
2.33 MB PNG
ok i lied, here's some comps.
i grabbed 25 images from wikimedia commons and ran them through qwen.

https://imgur.com/a/51Nvz4b
>>
>>107871208
now go see the first movie ever made
>>
>>107871213
>ran them through qwen
Workflow?
>>
any recommendations for realistic painterly models? thinking of artists like chifudoon, wlop, krysdecker, sangsoo jeong, rui li, guweiz

like photorealistic paintings, but not literal photos or 3d renders
>>
>>107871213
Lol detached feet, reflection that make no sense. Yeah I'll stick with z
>>
>>107871208
I said in a year you troglodyte.
>>
>>107871213
>third leg
>reflection is wrong
>white
Absolute slop
>>
>>107871225
yeah
>>
>>107871230
yellow fever alert
>>
File: zimg_0047.png (781 KB, 768x1024)
781 KB
781 KB PNG
>>107871224
https://imgur.com/a/wH6qSF7
og images

>>107871224
https://files.catbox.moe/65qh7f.png
it's brittle so good luck
>>
>>107871230
fuck off no-gen
>>
>>107871212
I was being sarcastic, nothing I hate more than slomo big titty bounces. I want some fast paced big titty bounces ffs
>>
File: 373825600.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
If anyone can bother to test this on the new Flux:
on the top left there's a red marble with golden stripes, to its side on the top middle there is a emerald cube, on the top right is a cone made of pink sand, on the middle left is a black torus, in the center is a golden egg, in the middle right is a purple glass sphynx, on the bottom left is a white rabbit, on the middle bottom there's a orange die, and finally on the bottom right we have a cat
>>
File: file.png (11 KB, 725x720)
11 KB
11 KB PNG
>>107871244
i think your imgur is shadow banned
>>
File: zimg_0063.png (2.01 MB, 1566x1222)
2.01 MB
2.01 MB PNG
>>107871276
guess i'm spamming
>>
0 excitement for the new model. 0 faith in ltx. 0 chance z base will drop. am I missing anything?
>>
File: flux.png (490 KB, 617x521)
490 KB
490 KB PNG
>>107871263
>>
>>107871292
no
it's over
>>
>another uninteresting model that looks exactly the same as the last 5 models

zzzz
>>
>>107871292
Mr catjak's filled diaper after a soul bond with her grandmother
>>
>>107871302
anon its called reality
>>
File: Flux2Klein9B_00038_.png (1.01 MB, 896x1152)
1.01 MB
1.01 MB PNG
>>107871263
>>
>>107871312
there are tits in reality
>>
>>107871302
it's a model that more than 3 people can run, which makes it very promising
>>
>>107871314
>sand
>egg
looking ogre
>>
this was the 4b model btw >>107871298
>>
>>107870537
I have done nothing but study the blade since flux1 release. I will make it learn NSFW, even if it is the last thing I do.
>>
File: zimg_0071.png (2.42 MB, 1566x1222)
2.42 MB
2.42 MB PNG
>>107871288
>>
>>107871292
>slow death of wan
>slow development of chroma
>cool models arent being chaku'd

>>107871312
>mmmmmm 600gb of reality
>OH! this 12gb model does a slight variation of lumen
>WAOWY ZOWY! this 30gb one has slightly better chins

a rook same
>>
>>107871337
>it can do nipples

huge
>>
File: zimg_0065.png (2.58 MB, 1566x1222)
2.58 MB
2.58 MB PNG
>>107871337
>>
File: Flux2Klein9B_00147_.png (2.58 MB, 1120x1440)
2.58 MB
2.58 MB PNG
if you prompt for film grain it actually adds a lot of film grain
>>
>>107871349
Can you do a robot
>>
>>107871349
Man glad we have some real comparisons here instead of reddit's "I got early access" gens lol. Model looks pretty meh
>>
>>107871350
those posters, how?
>>
>>107871367
meh? its garbage, their only hope is z base takes 2 more weeks to release and people actually start finetuning their model and make it good
>>
File: zimg_0088.png (1.73 MB, 1566x1222)
1.73 MB
1.73 MB PNG
>>107871363
yeah

>>107871367
i wanted to get a sense of general images. i'm really curious what we get out of loras etc with this thing. this is just basic settings and slop prompts i'm working with.
>>
>>107871396
thanks
>>
File: zimg_0073.png (2.62 MB, 1566x1222)
2.62 MB
2.62 MB PNG
>>107871396
>>
File: Flux2Klein9B_00155_.png (2.57 MB, 1120x1440)
2.57 MB
2.57 MB PNG
>>107871349
what settings are you using? my stuff doesn't come out slopped like yours

>>107871378
i prompted for the names of the bands but it obviously doesn't recognize them
>>
File: zimg_0089.png (1.89 MB, 1566x1222)
1.89 MB
1.89 MB PNG
>>107871405
i uploaded the rest in the link at https://pastebin.com/eBBz6JKr

i cannot be bothered with this captcha
>>
>>107871430
i'm running distilled with the default workflow/settings. i'm also using default settings for ZIT, both of them can be improved but the the comparison doesn't really seem fair if i start tweaking shit and cherry picking gens.
>>
>>107871314
Wow that looks bad
>>
>>107871292
Maybe low t
>>
so uh do you need a cumfy node for that or can it be loaded by any Flux loader
>>
File: Flux2Klein9B_00173_.png (2.84 MB, 1440x1120)
2.84 MB
2.84 MB PNG
>Amateur photo shot with a smartphone, two people are taking a photo next to a statue of a golden bull
suffers from sameface issues like all the other models if you don't explicitly prompt for different looking faces
>>
>>107871526
you need furk's premium workflow
>>
retard here
can we already train loras for flux2 klein?
>>
>>107871516
not really?
>>
>>107871550
who
>>
>>107871562
yes really
>>
>>107871213
I ind it hard to believe the composition is actually THIS similar between Klein 9B and ZiT quite frankly

most of your images show how much better the Flux.2 VAE though I'd say
>>
>>107871572
nah
>>
Do NOT try to prompt z image turbo with JSON format style prompting. DON'T.
>>
File: Flux2Klein9B_00176_.png (2.47 MB, 1440x1120)
2.47 MB
2.47 MB PNG
lmao, it just decided to insert the old guy there
>A woman taking a bathroom selfie in a high class establishment
>>
>>107871585
Dude look at that cone and tell me that looks like sand for you...
>>
>>107871554
nah it just dropped,will take a few days
>>
stagnant tech
>>
>>107871593
based old oogler, mite b a hint the model is sovlfvl
>>
>>107871593
there is so much retarded shit going on in this pic
>>
mat1 and mat2 shapes cannot be multiplied
wtf
>>
>>107871593
That's not a bathroom that's a sinkroom
>>
chinks have been defeated, chinese century cancelled
>>
>>107871650
my brother in christ for comfy use this
https://huggingface.co/Comfy-Org/flux2-klein-9B
>>
>>107871663
I don't know if I can trust this company. they seem to lie a lot about stuff and found reddit posts saying they collect your data
>>
>>107871691
do you trust random white man on reddit or random white man on 4chan
>>
>>107871592
wtf I tried it and my computer exploded and my cat got hit by shrapnel
>>
File: flux klein.jpg (614 KB, 3023x1024)
614 KB
614 KB JPG
nvfp4 is ~80% faster than fp8 on my blackwell card (it looks worse)
>>
>>107871710
please also show klein 4b fp16 if possible
>>
>>107871655
>alibaba probably has a marketing team that searches the internet for feedback of wan
>they see youtube videos, leddit posts and other social media platforms
>they somehow stumble on 4chan
>they see anons spamming "FUCKING CHIIIIIINKKKKSSSSS"
>2.5 never releases

kek, jej even
>>
>you have to either trust jews, nazis or chinks for the future
>indians are forced into every option
what are we going to do?
>>
>>107871747
for now the chinks are best for training the model further yourself
>>
*sigh, where's the super secret workflow then

https://huggingface.co/lodestones/Zeta-Chroma/blob/main/zeta-chroma-x0-pixel-proto.safetensors
>>
File: file.png (424 KB, 1211x1103)
424 KB
424 KB PNG
>>107871244
anon where did you get that qwen clip, I found one with a slightly different name https://huggingface.co/DenRakEiw/qwen3_8b.safetensors and its giving me errors
>>
can we start calling it chemo instead of chroma? seems more fitting
>>
>>107871755
qwen post training breaks down easily, zimage turbo is turbo, what are you smoking anon.
>>
>>107871772
https://huggingface.co/Comfy-Org/flux2-klein-9B/tree/main/split_files/text_encoders
>>
File: 5073370678.png (593 KB, 512x512)
593 KB
593 KB PNG
>>107871663
Thank you, finally worked here
>>
>>107871805
thank you
>>
>>107871691
sorry just not gonna use your shitty ass wrapper ui lmao
>>
File: flux klein_2.jpg (825 KB, 4030x1024)
825 KB
825 KB JPG
>>107871729
>>
File: 11541191.png (3.34 MB, 2080x1040)
3.34 MB
3.34 MB PNG
>>
>>107871874
make their cocks fatter please
>>
>>107871860
thanks
>>
>>107871701
>reddit
>white

also
>4chan
>white

you and i both know it's jeets all the way down no matter the location.
>>
File: file.png (2.41 MB, 1822x1222)
2.41 MB
2.41 MB PNG
this is fun, thanks for the workflow anon
>>
Huh? I did not expect this 4B and 9B release, including both base and distilled versions, by Flux team at all. And the quality doesn't seem too bad? I need to run my own tests though.
Also 4B BASE is on Apache 2 too. New SDXL if the Chinese culture us with the base release?
>>
File: 6996824181.png (1.44 MB, 1152x960)
1.44 MB
1.44 MB PNG
>>107871881
>>
File: Flux2Klein9B_00199_.png (2.25 MB, 1440x1120)
2.25 MB
2.25 MB PNG
>>
>>107871860
so you are better off running klein 9b on nvfp8 instead of running klein 4b on fp16? i guess the final question is the speed
>>
>>107871832
get new material lolcow
>>
>>107871955
gguf q8 as usual
>>
File: silent.mp4 (1.32 MB, 2048x422)
1.32 MB
1.32 MB MP4
>>
They haven't modified the text encoders and they released the base versions.
Unlike the earlier distilled Flux releases we should be able to beat cunts and cocks into them without insane effort.
China scared them really good kek.
We wouldn't get this without Tongyi-Mai.
>>
When did you guys realize qwen3 to wai-i2i was far better for multi-character compositions than using controlnets?

Controlnet is still the best for 1girl pose transfers, but as soon as you have more than one character, qwen3 edit + a WAI style pass blows controlnets away.
>>
File: silent.mp4 (1.32 MB, 2048x422)
1.32 MB
1.32 MB MP4
Whoops, here is the right file
>>
>>107871938
looks good. how looks nsfw part?
>>
File: question.png (475 KB, 624x624)
475 KB
475 KB PNG
Why does this character pop up every now and then? I am not prompting this shit. Some memory bug?
>>
>>107871982
these NV quants look like shit
>>
>>107871995
he's just checking in to see if you are happy with future technology
>>
>>107871916
zit mogs
>>
>>107872013
settings issue, give me the prompt and i will try a proper Flux 9B gen
>>
File: file.jpg (453 KB, 3674x1242)
453 KB
453 KB JPG
>>
File: file.jpg (640 KB, 3674x1242)
640 KB
640 KB JPG
>>107871995
Grim
>>
>>107872077
prompt:
A robotic blue cat with large eyes and a red nose stands centered in frame; it is approximately three feet tall with a rounded body, short legs, and oversized paws holding a rectangular black sign labeled “LDG” in white capital letters on a wooden stick. The cat wears a white crescent-shaped chest plate and a red collar with a gold bell. It exhibits a wide open mouth revealing a pink interior and displays a cheerful expression; the background is a blurred green lawn with trees visible under a clear blue sky during daylight hours. Shot with an 85mm lens at f/2.8, utilizing soft, natural lighting from above with slight shadows, maintaining sharp focus on the cat’s face and sign while employing shallow depth of field to blur the background; exposure is balanced for outdoor conditions and framing is a medium shot capturing the entire figure from mid-thigh up.
>>
>>107872077
>least cucked western model
kek
>>
File: question2.png (352 KB, 508x507)
352 KB
352 KB PNG
>>107871995
to continue this, I sometimes get a shirt or pants like this. It makes no fucking sense
>>
File: file.jpg (775 KB, 3674x1242)
775 KB
775 KB JPG
>>107872093
it can do Trump though lol
>>
>>107871995
>>107872077
>>107872084
It seems to have overlearned Doraemon during the distillation process.
Here is another one. Prompt Patrick Star and you will commonly get Spongebob instead or the two mangled together.
>>
File: comparison.jpg (713 KB, 2890x1172)
713 KB
713 KB JPG
Klein vs Z, same prompt
>>
File: file.jpg (625 KB, 3674x1242)
625 KB
625 KB JPG
Can it be saved?
>>
File: comparison2.jpg (848 KB, 2890x1172)
848 KB
848 KB JPG
>>107872164
Z-Image? probably not
>>
File: comparison3.jpg (876 KB, 2890x1172)
876 KB
876 KB JPG
>>
>>107872177
Hey I'm all for a new model, but it needs to do goon
>>
File: 0008.jpg (541 KB, 3674x1242)
541 KB
541 KB JPG
>>
Hijacking current Flux 2 Klein discussion for old story but:
https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
Does anyone know what part of this license forced lodestone to de-distill schnell instead for chroma?
It seems to allow non-commerical finetuning, no?
Is it the
>We may terminate this License, in whole or in part, at any time upon notice (including electronic) to you.
part that spooked him?
>>
>>107872192
well they gave us the base model unlike the chinese, you can start training today
>>
File: comp_0001.jpg (563 KB, 3674x1242)
563 KB
563 KB JPG
>>
File: file.png (1.34 MB, 832x1216)
1.34 MB
1.34 MB PNG
>>
File: 861077793.png (2.07 MB, 1040x1040)
2.07 MB
2.07 MB PNG
le epic fail
>>
File: comp_0002.png (3.09 MB, 3674x1242)
3.09 MB
3.09 MB PNG
>>
File: 69.png (2.57 MB, 1408x960)
2.57 MB
2.57 MB PNG
>>107872287
I find it odd that its failing at such a simple text but worked with more complex one, was it specifically tuned against kys messages?
>>
File: file.png (27 KB, 741x253)
27 KB
27 KB PNG
>suddenly hear about a random release
>go see what people say about it
>>
>>107872272
kill poved?
>>
File: comp_0003.jpg (619 KB, 3674x1242)
619 KB
619 KB JPG
>>
>>107872300
try giving more width maybe?
>>
>>107872256
You have Flux 2 klein bases now.
The Chinese overplayed their hand with too much culture.
>>
File: comp_0004.jpg (834 KB, 3674x1242)
834 KB
834 KB JPG
>>107872256
>>
File: 8563.jpg (1.07 MB, 1984x1920)
1.07 MB
1.07 MB JPG
>>107872313
zit worked fine with square images, the distilled 9b is suffering
>>
File: 9608958.png (1.13 MB, 1040x1040)
1.13 MB
1.13 MB PNG
welp, w/e
>>
File: comp_0005.jpg (644 KB, 3674x1242)
644 KB
644 KB JPG
>>107872333
A still life composition features the words "Kill yourself" rendered in ornate, metallic lettering resting on a small, rectangular cushion covered in dark red velvet; a single lit candle stands centrally amongst the letters, its flame illuminating the scene and casting shadows onto the surrounding surfaces; delicate spiderwebs stretch between and around the letters and the candle; the setting is an interior space with a dark wooden surface visible beneath the cushion, bathed in low light suggesting nighttime or a dimly lit room; the image utilizes a focal length of 50mm, employs dramatic chiaroscuro lighting emphasizing texture and shadow detail, shot from a slightly high angle directly above the subject, maintains sharp focus on the lettering and candle flame, and uses a moderate exposure to capture the richness of the red velvet and metallic sheen, framed tightly around the central elements with minimal negative space.
>>
File: 75299681.png (1.6 MB, 1040x1040)
1.6 MB
1.6 MB PNG
>>107872354
yeah it was some odd fluke with the grass prompt
>>
File: kys candles.jpg (698 KB, 1264x848)
698 KB
698 KB JPG
>>107872333
Nano Banana Pro (aistudio) in 1 try
prompt: An elaborate candelabra shaped in the words "kill yourself".
>>
File: 311.png (1.32 MB, 704x1344)
1.32 MB
1.32 MB PNG
>>
File: comp_0006.png (1.14 MB, 3674x1242)
1.14 MB
1.14 MB PNG
>>
File: comp_0007.jpg (438 KB, 3674x1242)
438 KB
438 KB JPG
>>
File: Flux2Klein9B_00246_.png (2.6 MB, 1440x1120)
2.6 MB
2.6 MB PNG
that was supposed to be Greenland on the map...
>>
File: 1737621102550634.png (2.17 MB, 1440x1563)
2.17 MB
2.17 MB PNG
I just woke up, tell me chat how good is klein? I hope it's good so that the Tongyi fucks finally WAKE THE FUCK UP
>>
File: comp_0008.jpg (760 KB, 3674x1242)
760 KB
760 KB JPG
>>
>>107872459
its a solid 6/10
>>
File: comp_0009.jpg (955 KB, 3674x1242)
955 KB
955 KB JPG
>>107872459
>>
>cries
>>
>flux2_klein_9B_bnb_4bit_text_encoder
What is a BNB text encoder?
>>
File: 8298732276.png (1.82 MB, 1040x1040)
1.82 MB
1.82 MB PNG
>>107872529
bits and bytes maybe?
>>
File: 1746735925449249.png (883 KB, 2546x1630)
883 KB
883 KB PNG
the level of disrespect they're talking about Klein on Tongyi's server, BASED, THATS WHY CHINESE CULTURE DESERVES AHAHAH
>>
Klein 4B can do pretty decent lighting and skin even at 4 steps with the right sampler and scheduler. It seems to depend a lot on the prompt too. This was using DPM++ 2S Ancestral Linear Quadratic.
>>
Which GPU is better for generating images and videos?

>5070 12vram
>5060ti 16vram
>>
>>107872535
Do I need that?
>>
>>107872557
*pukes*
>>
File: file.jpg (2.04 MB, 7961x2897)
2.04 MB
2.04 MB JPG
>>107872529
>>107872535
https://docs.vllm.ai/en/stable/features/quantization/bnb/
it's a 4bit quant meme
>>
>>107872459
I like it a lot, even the 4B Distilled is pretty good, better and faster than any other 4-step model I can really think of
>>
>>107872558
5090
>>
>>107872558
more vram = better always for AI
>>
Klein vs ZiT and bf16 vs nvfp4 guy, can you run a longer text comparison?
Make a 1girl hold a large sign that has 3-4 sentences written on it.
>>
File: comp_0012.jpg (986 KB, 3674x1242)
986 KB
986 KB JPG
>>107872459
gemma thinks she has pink hair
>>
>>107872564
sorry I'll make her a pale as fuck azn waifu next time kek
>>
>>107872587
is the 9B NVFP4 here still distilled? Or is that Base NVFP4?
>>
Anyone got klein to work in Forge Neo already?
>>
>>107872587
it actually looks competitive compared to Z-it, can you remove the retarded nvfp4 and put Klein base so that we can see where we're starting, maybe we won't need Z-image base after all
>>
File: file.png (1.7 MB, 832x1216)
1.7 MB
1.7 MB PNG
>>107872450
Why it does know trump so well and everyone else sucks?
>>
>>107872609
Trump is by far the most known dude on earth, obviously there's probably gozillions of pictures of him on the internet
>>
>>107871776
Qwen is literally impossible to overtrain unless you're a retard who uses dim so high it creates like 1GB loras
>>
File: file.png (1.28 MB, 832x1216)
1.28 MB
1.28 MB PNG
>klein
>it doesnt know what a klein bottle is
It's over.
>>
>>107872450
>>107872609
ngl it looks better than Flux 2 32b, I didn't expect them to give us a smaller better model, I guess that Z-image turbo forced them to work harder, WTF I LOVE COMPETITION NOW
>>
File: comp_0014.jpg (576 KB, 3674x1242)
576 KB
576 KB JPG
>>107872585
>>
File: Flux2Klein9B_00239_.png (2.21 MB, 1120x1440)
2.21 MB
2.21 MB PNG
this is "slightly chubby" according to Klein 9B

>>107872459
it's alright, solid 6/10 (unlike picrel) and we got the base model

>>107872609
trump is probably 20% of the whole dataset
>>
File: klein 4b_9b.png (3.43 MB, 2016x1024)
3.43 MB
3.43 MB PNG
>>107871916
>workflow
post it onegai
>>
>>107872656
https://files.catbox.moe/fkme0f.png
>>
File: file.png (1.6 MB, 832x1216)
1.6 MB
1.6 MB PNG
>>107872624
So does Obama, yet he looks like shit.
>>
File: 0796993033.png (1.66 MB, 1056x1056)
1.66 MB
1.66 MB PNG
>>107872634
preposterous
>>
coomerbros, we are saved
>>
File: comp_0013.jpg (826 KB, 3674x1242)
826 KB
826 KB JPG
>>107872535
>>
when people say they trained their lora for X steps, does that mean steps=processed images or steps=processed images/batch size?
>>
File: 1752129844988558.png (1.96 MB, 1536x864)
1.96 MB
1.96 MB PNG
I really like the skin texture, I think the BFL fags managed to find the Z-image turbo secret sauce, and I've heard it can do edit as well? if yes we're fucking back right? FUCK CHINESE CULTURE I LOVE BRETZELS NOW
>>
>>107872694
You can't really know, it would be nice if people knew how to use the correct term (samples (steps*batch) vs steps) so it wasn't ambiguous
>>
File: comp_0015.jpg (791 KB, 3674x1242)
791 KB
791 KB JPG
>>107872585
>>
Can you make them hold longer text like this:
"You must solve this quadratic equation to reactivate Agartha's defenses. Quick my child, the future of the white race depends on you."
>>
File: 92537791.png (1.76 MB, 1056x1056)
1.76 MB
1.76 MB PNG
>>
>>107872719
Yeah this is too long. Even API models would start slopping it.
>>
File: comp_0016.jpg (683 KB, 3674x1242)
683 KB
683 KB JPG
>>107872730
>>
File: 1813405.png (1.8 MB, 1056x1056)
1.8 MB
1.8 MB PNG
>>
File: comp_0017.jpg (660 KB, 3674x1242)
660 KB
660 KB JPG
>>107872749
>>
They improved the sound on LTX2 >>>/wsg/6073274
>>
>>107872644
Flux still looks pretty slopped being said shouldn't we be comparing the 4B model, no one is gonna train that 9B with the shitty license
>>
i feel like the default of 4 steps euler isn't enough to converge properly and slops the images, i don't like those settings. someone needs to do a big sampler/steps comparison
>>
>>107872784
can it generate girl doing gluck gluck cough cough for my brothers in india
>>
File: 7979309099.png (3.75 MB, 1408x1664)
3.75 MB
3.75 MB PNG
>>107872768
lmao, the prompt:
A close-up shot of a vintage 1990s-era beige CRT computer monitor resting on a dark, cluttered wooden desk in a dimly lit room. The curved glass screen displays a legacy web interface with a light grey background and thin blue borders. In the upper-left portion of the screen, a single line of monospaced, aliased text in a vibrant lime green color reads ">implying". To the right of the text is a low-resolution, pixelated digital illustration of a stylized face with a smug, upward-curving mouth and half-closed eyes. The screen surface shows visible horizontal scanlines and a fine RGB sub-pixel grid. The monitor's plastic casing is textured with a slight matte finish and has small vents along the top edge. The primary illumination comes from the screen itself, casting a soft green glow onto the desk surface and the monitor's frame. The background is a deep, out-of-focus shadow.
>>
>>107872789
>no one is gonna train that 9B with the shitty license
wait it's not Apache 2.0? oh man... with retards like that Tongyi can sleep peacefully
>>
File: comp_0018.jpg (982 KB, 3674x1242)
982 KB
982 KB JPG
>>107872740
>>
File: 50329584.png (1.76 MB, 1280x896)
1.76 MB
1.76 MB PNG
>>107872792
>euler
are you in 1990? use res_2s/3s
>>
>>107872747
Thanks. They seem roughly even for text, maybe ZiT is a bit better with long text but more testing needed. I wonder if 9B base would fare better with text?
>>
https://huggingface.co/black-forest-labs/FLUX.2-klein-base-9B
>FLUX.2 [klein] 9B Base is a 9 billion parameter rectified flow transformer capable of generating images from text descriptions and supports multi-reference editing capabilities.
is it good at edit? I haven't seen anyone showing any edit examples so far
>>
File: comp_0019.jpg (717 KB, 3674x1242)
717 KB
717 KB JPG
>>107872799
with prompt
>>
>>107872801
only the 4B, bfl is all about shooting itself in the foot
>>
>>107872809
fuck res it's 2x slower than euler, obviously euler would look better if it has 2x more steps and would have 2x more time to make renders like res
>>
>>107872712
so what should I be aiming for? i always here "3500" steps (whatever that means)
>>
>>107872819
sounds like cope buddy
>>
>>107872801
4B is Apache 2, 9B has its own BFL license.
I don't know what precisely prevents training on 9B though.
>>
File: 487480.png (1.53 MB, 1280x896)
1.53 MB
1.53 MB PNG
>>107872819
res_2m isn't slower and is better, the 2s/3s is
>euler would look better if it has 2x more steps and would have 2x more time to make renders like res
Wrong btw
>>
File: split-the-dog-in-half.jpg (1.15 MB, 2890x1172)
1.15 MB
1.15 MB JPG
>>107872809
i don't like res_2s, it split the dog in half
>>
danke fürs beta testen!
>>
>>107872828
>>107872834
I'm not seeing any euler vs res_2m comparison pictures in there! That shilling sounds inorganic as fuck btw.
>>
>>107872826
For SDXL (WAI/NAI) I aimed at 2k samples (steps * batch =~2000), depending on what you are training and on which model you may need more steps since some models are much more resistant to training
>>
Nvfp4 has very good quality for size and speed. Useful for quick slopping. Wish I was on 5000 series.
>>
>>107872839
lmao, res_2s is 2x slower and is slopping the skin texture hard, I knew it was complete bullshit
>>
File: 12418.png (1.61 MB, 1280x896)
1.61 MB
1.61 MB PNG
A 3D spherical emoji character split vertically down the center to display two contrasting exaggerated emotions. The left half of the sphere is saturated crimson red with a matte texture; it features a thick black eyebrow sharply angled downward and a narrowed, glowing white eye. The mouth on the red side is pulled into a tight, aggressive snarl showing white rectangular teeth. The right half of the sphere is bright yellow with a high-gloss finish; it features a wide, quivering eye with atranslucent blue liquid flowing down the cheek. The mouth on the yellow side is stretched wide into a trembling, downward-curved sob. The entire character is rendered with soft-touch plastic shaders and subtle subsurface scattering. The lighting is a high-contrast studio setup with a sharp rim light that highlights the glossy tears and the texture of the red surface, set against a solid, neutral dark gray background.
>>
>>107872839
You should use 7-8 steps for a more accurate comparison here?
>>107872826
4k total steps (Images times repeats times epochs) Divide it with batch size.
>>
File: 12418.webm (960 KB, 704x896)
960 KB
960 KB WEBM
>>107872861
>>
File: 2.png (2.01 MB, 1088x1088)
2.01 MB
2.01 MB PNG
A macro-photography shot of a miniature bonsai tree sculpture entirely constructed from drug paraphernalia. The main trunk is formed by thick, heat-distorted glass pipes fused together in a gnarled, twisting shape. The branches consist of bent stainless steel hypodermic needles and thin, scorched metal stems. Instead of leaves, the tree features clusters of multi-colored pharmaceutical capsules, small round pills, and jagged, translucent white crystals attached to the branch ends. The sculpture is rooted in a heavy, chipped glass ashtray filled with a base of fine white powder and colorful crushed tablet fragments. The lighting is a single, sharp directional spotlight from the side, highlighting the oily residue on the glass and the metallic reflections of the needles, while the background remains in deep, shadowed darkness. The focus is sharp on the intricate textures of the fused glass and the chalky surface of the pills.
>>
>>107872832
>4B is Apache 2, 9B has its own BFL license.
they had one fucking job, they know Tongyi will release base with the apache 2.0 licence, that alone makes their model DOA
>>
File: 23.png (1.38 MB, 1088x1088)
1.38 MB
1.38 MB PNG
>>107872881
me being a retard

Macro, low-angle shot at table-surface level focusing on a group of four-inch-tall ceramic garden gnomes moving earth and stones across a dark, weathered oak table. The central gnome, clad in a faded red conical hat with visible pitted ceramic texture, is hunched over, pushing a translucent quartz pebble. To its left, another gnome with a mossy green hat and a bushy, fiber-textured white beard uses a small, rusted iron spade to scoop dark, damp potting soil into a mound. Scatterings of grey river silt, jagged granite fragments, and fine brown dirt grains are spread across the deep grooves of the wooden tabletop. The lighting is soft and directional, originating from the side to accentuate the gritty texture of the soil and the matte finish of the gnomes' painted clothes. A shallow depth of field keeps the foreground gnomes in sharp focus while the background dissolves into a warm, amber-toned bokeh of a potting shed. Tiny airborne dust motes are caught in the light beams near the active tumbling site. Individual droplets of moisture are visible on the clumps of dark earth.
>>
>>107872832
Can't monetize it which seems to be a pretty big reason to finetuners. Lora makers probably won't care
>>
>>107872889
German culture
>>
File: comp_0020.jpg (1014 KB, 4580x1242)
1014 KB
1014 KB JPG
>>107872810
too bad there is no zib
>>
>>107872899
>Lora makers probably won't care
Then we can at least get merges.
>>
File: Flux2Klein9B_00285_.png (2.81 MB, 1440x1120)
2.81 MB
2.81 MB PNG
>>107872870
>You should use 7-8 steps for a more accurate comparison here?
yes. here is 8 steps res_2s
>>
>>107872850
>>107872870
alright, cheers
>>
>>107872908
base looks like shit, and they want us to finetune on that with the BFL licence? no fucking way lol
>>
Even just 4b Klein having Apache 2.0 is good, it's still an absolutely massive upgrade over XL incase we don't get Z-Base
>>
File: 84034.png (1.86 MB, 1088x1088)
1.86 MB
1.86 MB PNG
A miniature Victorian-style cottage constructed entirely from edible confectionery. The primary walls are made of thick, rectangular dark-brown gingerbread panels with a coarse, baked texture, joined at the corners by piped ridges of matte white royal icing. The roof features overlapping shingles made of toasted golden-brown graham crackers, with white frosting icicles hanging from the eaves. A small front porch is supported by four vertical peppermint sticks with a red and white spiral pattern. The front door is a single, glossy slab of dark chocolate featuring a small, golden-yellow spherical candy as a doorknob. Affixed directly above the door frame is a small, rectangular wafer containing the words "SWEET HOME" printed in precise, dark brown cocoa-based ink. The windows are made of translucent, amber-colored poured sugar with a faint crystalline grain. A walkway leading to the porch is paved with a mosaic of multi-colored, polished jelly beans embedded in a layer of white cream frosting. Surrounding the house are spherical green gumdrops acting as ornamental shrubs and conical marshmallows dusted with fine green sugar crystals to represent miniature trees. The entire scene rests on a base of fine, white granulated sugar that mimics the appearance of snow. The lighting is divided between a warm, golden glow emanating from the interior through the sugar windows and a soft, diffused overhead light that highlights the granular textures of the sugar and the smooth glaze of the candy surfaces. The camera perspective is a low-angle macro shot with a shallow depth of field, focusing sharply on the door and the "SWEET HOME" sign.
>>
>[16236.928505] sd 5:0:0:0: [sdf] tag#16 FAILED Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK cmd_age=0s
>[16236.928512] sd 5:0:0:0: [sdf] tag#16 CDB: Read(10) 28 00 00 5f b6 f8 00 00 08 00
>[16236.928515] I/O error, dev sdf, sector 6272760 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
>[16236.928520] Buffer I/O error on dev sdf2, logical block 779999, async page read
Just as I wanted to download the model, my ssd died on me ffs
Never buy Verbatim guise
>>
File: comp_0021.jpg (494 KB, 4580x1242)
494 KB
494 KB JPG
>>107872861
>>
>>107872889
>>107872901
>>107872920
Omg shut up you have to pretend Flux 2 Klein is the 2nd comming of Christ, Tongyi must feel the urgency and release base if we see them shill their brezel model kek
>>
>>107872908
is there some secret to using Klein Base? i get same shitty results. 50 steps, euler, guidance 4.
if this is what base models are like, i'm not surprised Tongyi postponed the Z-Image-Base release by two more weeks
>>
>>107872951
german engineering
>>
File: comp_0022.jpg (1.01 MB, 4580x1242)
1.01 MB
1.01 MB JPG
>>107872928
>>
File: 1755296776714424.png (141 KB, 319x282)
141 KB
141 KB PNG
Why do we either have base or a distilled finetune nowdays? Why can't they also release a regular finetune? What's is the deal?
>>
File: 37.png (2.46 MB, 1088x1088)
2.46 MB
2.46 MB PNG
A high-angle, top-down photograph of a man with a thick, groomed brown beard and shoulder-length wavy chestnut hair lying supine on an intricate oriental rug. The man's eyes are closed in a state of rest, and his left hand is placed flat on his abdomen. He is wearing black over-ear headphones connected by a thin black wire to a black rectangular electronic device resting on the rug near his hip. His attire consists of a dark charcoal grey t-shirt under an open olive-green zip-up hoodie and light beige cotton trousers. The rug features a vibrant red central field filled with complex floral and vine patterns in shades of cream, sky blue, and golden yellow, framed by a thick navy blue border with repeating palmette motifs. A small, square white cardboard box is positioned on the rug near the man's right shoulder, featuring the word "SONIC" printed in black, bold sans-serif lettering on its top surface. The lighting is soft and diffused, originating from the side to emphasize the textures of the woolen rug and the fabric of the clothing. A small section of polished light-oak wooden floorboards is visible at the very top edge of the frame.
>>
>>107872965
are base models supposed to look this bad? wtf???
>>
File: Flux2Klein9B_Base_00001_.png (1.98 MB, 1440x1120)
1.98 MB
1.98 MB PNG
>>107872974
i have a theory that Z base looked equally shitty and when it unexpectedly gained popularity they panicked and realized they have to finetune it before releasing hence all the delays
>>
File: 6321.png (2.15 MB, 1088x1088)
2.15 MB
2.15 MB PNG
A miniature cottage constructed entirely from sliced and whole swiss rolls, positioned on a rustic dark wooden cutting board. The walls are built from thick, circular slices of vanilla sponge cake with spiraling white cream centers, arranged so the spiral patterns face outward. The gabled roof is composed of elongated, horizontal swiss rolls with a golden-brown exterior, held together by visible lines of thick white frosting acting as mortar. A small arched doorway is carved into one of the cake slices, revealing a soft, cream-filled interior. The "ground" surrounding the structure is dusted with a fine layer of powdered sugar. The scene is captured in a macro close-up with a shallow depth of field, blurring the background of a sunlit kitchen. Warm, directional light from the side highlights the airy, porous texture of the yellow sponge cake and the smooth, matte finish of the cream swirls. A few tiny cake crumbs are scattered near the base of the cottage.
>>
File: comp_0023.jpg (825 KB, 4580x1242)
825 KB
825 KB JPG
>>107872881
>>
>>107872951
>>107872965
Klein doesn't use guidance at all. Distilled versions should use CFG 1 and low step counts (4-8). Base should use CFG 4 or so, and 30+ steps, just like any normal model. Base does look worse (especially in terms of textures and "slop"), but not hugely so.
>>
>>107872979
I have a feeling you kept base at cfg 1 lool
>>
Does Radeon 9060XT / 9070XT have enough power to do stable diffusion and local models?

I know Nvidia is preferred but hasn't Radeon caught up?
>>
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B
>Text-to-image and image-to-image multi-reference editing in a single unified model.
how good is it at edit? I'm interested on that
>>
File: comp_0025.jpg (1.01 MB, 4580x1242)
1.01 MB
1.01 MB JPG
>>107872990
>>107872986
>>107872891
Base with cfg 4 & 30 steps
>>
>>107872986
Flux doesn't use CFG. i'm talking about the guidance_scale parameter (which isn't CFG) and should be set to 4 according to their official docs
>>
>>107872965
>>107872976
How many steps are you running the base at? CFG?
>>107872974
Not this much, no.
>>
If Tongyi releases Z-image base today or tommorow, it would confirm their prior petty behavior with Flux was no accident. I really hope they'll do it again kek.
>>
>>107873003
when i say 4 cfg I mean guidance btw

>>107873005
previously 1 cfg and 9 steps
>>
>>107873008
I wonder if BFL would've even released 4b and 9b without ZiT forcing their hands, they don't actually seem to care much about local beyond lipservice
>>
>>107873022
>I wonder if BFL would've even released 4b and 9b without ZiT forcing their hands
obviously not, Klein looks way less slopped than Flux 2 32b, they were forced to put on some effort to get hype now, Z-image turbo has increased the requirements a lot (and that's definitely a good thing)
>>
File: Flux2Klein9B_Base_00002_.png (1.41 MB, 1440x1120)
1.41 MB
1.41 MB PNG
I managed to get something coherent out of base
>A man holding up a sign that says "KINO SOVL"
res2_s, 50 steps, guidance 4
>>
File: ComfyUI_00021_.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>
File: 60.png (1.5 MB, 1088x1088)
1.5 MB
1.5 MB PNG
Prompt too beeg
https://pastebin.com/5qpySq8K
>>
>>107873014
The code in repo had 50 but that's probably overkill. Try 30 at least otherwise or don't bother with them at all.
>>
File: Flux Klein.jpg (2.51 MB, 4544x2805)
2.51 MB
2.51 MB JPG
Guys, it can do edit as well
>>
Looks like base is really just for training and not worth prompting independently at all
>>
File: comp_0027.jpg (1.8 MB, 5486x1242)
1.8 MB
1.8 MB JPG
>>107872977
>>107873063
cfg 4
>>
>>107870761

https://files.catbox.moe/qeq7mu.mp4
>>
crazy how hard zit mogs....
>>
File: 1767207426370748.png (668 KB, 2098x1499)
668 KB
668 KB PNG
https://bfl.ai/blog/flux2-klein-towards-interactive-visual-intelligence
>look guys, our model is better than Z-image turbo!!
delusional
>>
File: comp.png (3.13 MB, 1792x1152)
3.13 MB
3.13 MB PNG
>>107873074
Your shit is borked. Left: 4b with 8 steps and CFG 1. Right: 4b base with 30 steps and CFG 4.
There are no guidance nodes anywhere.
>>
>>107873117
it doesn't. that 4 image comparison workflow that anons are sharing has dogshit settings for klein.
>>
File: file.png (362 KB, 1569x1095)
362 KB
362 KB PNG
>>107873124
this my setup
>>
>>107873129
i thought i was going crazy when shit looked like ass compared to single image
>>
>>107873129
Spoon feed us some good stuff then mate
>>
>>107871772
the 4b and 9b models need qwen 4b and 8b respectively. comfy has the qwen8b on their hugging face
>>
File: comp_0028.jpg (3.06 MB, 5486x1242)
3.06 MB
3.06 MB JPG
>>107872972
>>
>>107873137
Don't use ConditioningZeroOut, actually encode an empty prompt for the negative.
>>
>>107872967
Destilled are faster.
Base are better for training.
>>
>>107873160
but a finetune without distill will always look better, Wan 2.2 looks better as an undistilled finetuned model compared to its distilled turbo loras stacked over it
>>
>>107873173
Who talks about what looks better.
One is speed the other is flexibility. No one talked about what looks better.
>>
>>107873192
Who talks about speed and flexibility. No one talked about speed and flexibility.
>>
>>107873137
be mindful that if you are using distilled and base there are different workflows.

i'm about to do scheduler/sampler grid for you assholes
>>
>>107871592
what's the meme i'm missing here?
>>
>>107873137
>>107873159
I don't get it, how can he be wrong about the workflow? there's an official comfyui template he can try he won't have to guess
>>
where nu thred
>>
>>107873235
no more threads until base release
>>
File: comp_0029.jpg (2.18 MB, 5486x1242)
2.18 MB
2.18 MB JPG
>>107873159
Done, looks like that was fucking shit up
>>
>>107873247
RIP
>>
>>107873247
thats a wrap folks nice knowing yall
>>
>>107873250
how can you compare base against distilled? base actually looks pretty good, too bad for that shitty licence it would've been an instant hit, sad
>>
File: ComfyUI_00023_.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>
>>107873247
was a good run. see you in the next life
>>
File: 2752078838.png (1.52 MB, 960x1344)
1.52 MB
1.52 MB PNG
>>
File: comp_0030.jpg (677 KB, 4580x1242)
677 KB
677 KB JPG
>>107873271
>>107872834
30 steps 4cfg for base
>>
>>107873202
You asked why they release distilled. I answered you because distilled is faster. You come with what looks better you stupid cunt.
>>
File: zimg_lora__00101_.jpg (835 KB, 1596x2364)
835 KB
835 KB JPG
ignore filename, it's wired up, gonna fill it out
>>
>>107873321
>You asked why they release distilled.
I didn't, are you retarded or something? I asked why they only release distilled and base these days when they could also add undistilled finetune too, oh boy you are a stupid boi
>>
>>107873325
i'll be her simple beta
>>
File: 1739763560000202.png (203 KB, 399x498)
203 KB
203 KB PNG
>>107873317
if not for the bad licence we would've been saved from the Chinese Culture, why can't we have nice things?
>>
>>107873348
Read your post again asswipe
>>
File: comp_0031.jpg (794 KB, 4580x1242)
794 KB
794 KB JPG
>>107873045
>>
>>107873364
>Read your post again
>>107872967
>Why can't they also release a regular finetune? What's is the deal?
I'll ask once again, are you mentally challenged?
>>
>>107873367
wow klein base looks shit as fuck
>>
>>107873379
well duh, if base models were good they wouldn't waste additional moneys to do finetunes
>>
>>107873376
>Why do we either have base or a distilled finetune nowdays?
BECAUSE DISTILLED IS FASTER
holy fuck kys somewhere
>>
File: flux2klein.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
kleins bretty gud
>>
>>107873388
you are absolutely braindead, I'm not saying it's a bad thing to released the distilled finetune, I'm saying why they ONLY release than when there's good reasons to release the undistilled finetune too:
- It looks better
- It's undistilled so you can make loras easier and you could even finetune it if you want if you don't want to start from the base model

last (You) for you, your IQ must be in the 2 digit scale I feel I'm wasting my time talking to brown subhumans like you
>>
File: comp_0032.jpg (1.15 MB, 4580x1242)
1.15 MB
1.15 MB JPG
>>107871812
>>
File: 1601931459290.jpg (340 KB, 800x807)
340 KB
340 KB JPG
>indians and jews are already working on ai
>chinese are still celebrating xmas
wtf
>>
File: 21225.png (2.18 MB, 1152x960)
2.18 MB
2.18 MB PNG
I'm deeply offended by the fact that all of these models completely fail to recognize traditional artists (this was supposed to be Franz Marc)
>>
File: comp_0033.jpg (961 KB, 4580x1242)
961 KB
961 KB JPG
>>107870650
https://files.catbox.moe/lq0rrp.png
>>
File: fk9bd_grid_sm.jpg (1.99 MB, 1758x2046)
1.99 MB
1.99 MB JPG
no real surprises here.

https://files.catbox.moe/7liot7.jpg
>>
>>107873434
This happens because they don't tag the good stuff right? I should be able to prompt some John Sargent Singer on command.
>>
>>107873412
Fuck off you dense troglodyte.
>>
File: file.png (17 KB, 28x725)
17 KB
17 KB PNG
>>107873456
why do these make slop?
>>
File: comp_0034.jpg (876 KB, 4580x1242)
876 KB
876 KB JPG
>>107873424
>>
>>107873465
I have no idea, maybe they just use llm captions only so all the styles get lost into whatever the vlm see them as (descriptions of the brushstrokes and textures)

a cubist painting of a cake with heavy brushstrokes, the brushstrokes are very coarse and the tones are muted
>>
File: 1739443683544909.png (487 KB, 1119x1366)
487 KB
487 KB PNG
>>107873465
>This happens because they don't tag the good stuff right?
Gemini 3.0 has a shit ton of knowledge, they could've simply asked it to caption images, it knows who Franz Marc is.
>>
new
>>107873510
>>107873510
>>107873510
>>107873510
>>
>>107873474
can you share your flux workflow?
>>
File: 1765451019933473.png (90 KB, 299x168)
90 KB
90 KB PNG
>>107873515
>no collage
>no rentries
>>
>>107873521
https://files.catbox.moe/lq0rrp.png
>>
>>107873471
not aligned with the model
>>
>>107873515
fuck off with your retard bake
>>
>>107873515
>no subject
good one retard
>>
>>107873515
my messages got instantly deleted, seems like when Ani is making his schizo bread he has the mod power to silence everyone lmao
>>
>>107873515
low IQ retard didn't even put the subject in or make a collage
>>
Make a new bake and let the fail bake die
>>
>>107873576
>Make a new bake
you can't anymore, since Ani cried to the mods on RC you can't make multiple new bakes anymore, the first one that makes a new one can force everyone else to use it now
>>
>>107873617
but the failbake doesn't even have a thread title, that should exempt it
>>
>>107873617
This is not true. The new thread doesn't even have a subject.

If someone bakes a new thread, it 100% won't get deleted.
>>
>>107873664
>>107873665
oh yeah lmao, the fuck
>>
thank god it's midnight here, off to bed, have fun with Ani in the schizo thread guys, see you tomorrow



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.