/g/ - Technology

File: 1756841350521697.jpg (586 KB, 2033x790)
Discussion of Free and Open Source Diffusion Models

Prev: >>107878539

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
>>107880290
benchod
>>
File: Flux2-Klein_00067_.png (1.63 MB, 832x1216)
>>
File: comp_0102.jpg (962 KB, 4580x1242)
>>
>tranibake
>>
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107880342
Based thread quality maintainer
>>
File: comp_0103.jpg (1.33 MB, 4580x1242)
>>
>>107880365
zit slays again
>>
Stinky bread
>>
>mfw
>>
File: comp_0104.jpg (1003 KB, 4580x1242)
>>107880420
>>
I HATE PYTOORCH
I HATE PYTOORCH
I HATE PYTOORCH
>>
File: 0.png (1.08 MB, 1072x536)
points for trying, because flux2 can't
or won't...
>>
>>107880325
kek
>>
>>107880284
lel. The ZIT one is what you often get with an empty prompt.
>>
File: Flux2-Klein_00092_.png (2.33 MB, 1950x1510)
so i trained a lora for flux.2 klein 9b distilled last night, locally. it was just a test run so this was 40 images (random women, no captions), 500 steps at 512 resolution. it's not very strong since it is grossly undertrained but it was viable on a 3090 (bf16), and used around 20GB of VRAM. I'm going to try one more longer run but I'll leave it to ai-toolkit (da goat), to do this shit properly.

>a photo of sks woman in a tank top and jeans

>>107880463
you don't even know the half of it brother
>>
File: img_00060_.jpg (791 KB, 1672x1264)
Let the machine live your fantasy!
>>
File: file.png (815 KB, 1096x918)
>>
File: img_00061_.jpg (1.69 MB, 3024x4032)
>>
>some faggot redditor used two of my gens in "his" "showcase" about Klein
show yourself Different_Fix_2217 you faggot. i know you're here.
>>
File: edit_12.jpg (3.94 MB, 2649x2649)
>>
File: Flux2-Klein_00006_.png (1.01 MB, 1168x880)
>Replace the girl on the right in image 1 with the girl in image 2. She should have the same pose as the girl in image 1, but have her outfit and facial expression from image 2 while looking at the boy on the left. She should not be wearing armor.
Damn, I can't believe this largely just werks.
>>
File: ComfyUI_temp_mpifk_00003_.jpg (2.07 MB, 3000x4000)
>>
File: Flux2-Klein_00094_.png (2.48 MB, 1950x1510)
>>107880515
more surprisingly, i can use a lora trained on 9b distilled on base. these are all early findings though so let's wait to see what's coming up.
>>
File: Capture.png (16 KB, 785x130)
>>107880596
What you gonna do about it?
>>
>>107880596
they hate us but farm us for updoots
>>
File: Flux2-Klein_00007_.png (1.15 MB, 1168x880)
>>107880622
Interestingly, 9B-distilled came out IMO worse than 4B-distilled.
>>
>>107880641
>What you gonna do about it?
Call you a faggot again.
>>
>>107880651
>>107880644
I will keep farming you, back to work ;)
>>
File: ComfyUI_temp_eqbcl_00052_.png (2.09 MB, 1741x1238)
>>
File: inputs.png (548 KB, 1250x910)
>>107880622
>>107880645
Inputs for reference
>>
File: ComfyUI_temp_eqbcl_00054_.png (3.36 MB, 1741x1238)
>>
File: ComfyUI_temp_eqbcl_00057_.png (2.96 MB, 1741x1238)
>>107880645
Try generating a few more, varying the seed changes the output a lot
>>
Wait are germans based now? Do we need a song praising their culture?
>>
File: klein9b_khqaf_00008_.jpg (284 KB, 896x1088)
>>107880290
>>
File: ComfyUI_temp_eqbcl_00058_.png (2.85 MB, 1741x1161)
>>
File: comp_small.jpg (2.88 MB, 3216x2015)
>>
File: ComfyUI_temp_eqbcl_00061_.png (2.76 MB, 1741x1161)
>>107880722
>>
>>107880719
the license on everything but klein 4b is still very retarded

but the model's training is getting more useful
>>
>>107880729
details please
>>
>>
File: Flux2Klein9B_00300_.png (1.42 MB, 1152x896)
>>
>>
how good is flux Klein edit compared to grok edit
>>
>>
File: source.jpg (481 KB, 1093x973)
>>107880752
klein distill edit
euler, simple, 8 steps
ModelSamplingAuraFlow for shift,
>Make the man in the front Vladimir Putin. He is wearing furs
>>
File: ComfyUI_temp_eqbcl_00071_.png (3.45 MB, 1740x1222)
>>107880787
should be same level since grok is based on flux, wouldn't be surprised if grok was already a modified version of klein since klein is so light to run compared to flux.2
>>
>>
Still no Z Image Base? or Z Image Illustrious or something?
>>
>>107880862
release is imminent, entered final verification phase
source is anime profile picture man from twitter
>>
File: ComfyUI_temp_eqbcl_00075_.png (3.78 MB, 1867x1248)
content sloppers are going to have a field day with klein
>>
>>107880893
internet is already slopped from closed source cloud models, this won't change much i believe
>>
File: ComfyUI_temp_eqbcl_00076_.png (3.61 MB, 1867x1248)
>>
>>107880814
why aura flow shift and not flux shift?
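
On the shift question, a minimal sketch of the two formulas as they are usually written: a fixed-factor shift (the SD3/AuraFlow style) and an exponential shift parameterised by mu (the Flux style). The function names here are hypothetical and the mu value is picked only for illustration. Whether the ComfyUI nodes use exactly these forms is an assumption. If they do, the two agree whenever the fixed factor equals exp(mu), so the practical difference would be in how the factor gets chosen (a constant you type in versus a value derived from image resolution), not in the curve itself.

```python
import math


def fixed_shift(shift: float, t: float) -> float:
    """Fixed-factor timestep shift (SD3/AuraFlow style), for t in (0, 1]."""
    return shift * t / (1.0 + (shift - 1.0) * t)


def exp_shift(mu: float, t: float, power: float = 1.0) -> float:
    """Exponential timestep shift (Flux style), for t in (0, 1]."""
    return math.exp(mu) / (math.exp(mu) + (1.0 / t - 1.0) ** power)


if __name__ == "__main__":
    mu = 1.15  # hypothetical value, for illustration only
    for t in (0.25, 0.5, 0.75):
        a = fixed_shift(math.exp(mu), t)
        b = exp_shift(mu, t)
        print(f"t={t:.2f}  fixed={a:.6f}  exp={b:.6f}")
```

With power left at 1.0 both columns print the same numbers, which is just the algebraic identity between the two forms.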
>>
File: Flux2-Klein_00109_.png (1.65 MB, 1024x1024)
>>107880814
brilliant, thanks.
>>
>>107880862
2
>>
File: file.png (1.94 MB, 1949x676)
>>
File: 1753277129028400.jpg (24 KB, 552x530)
>>107880290
I have a 5070 Ti but I'm still too brainlet to do this. ComfyUI is anything but comfy
>>
File: ComfyUI_temp_eqbcl_00077_.png (3.35 MB, 1858x1114)
>>107880912
Nah, klein is so light and easy to run that I bet this very moment there is a wave of AI slopa youtubers uploading their "THIS CHANGES EVERYTHING" and "OPEN SOURCE NANO BANANA PRO KILLER" videos about klein. Then redditors are going to flood r/stablediffusion and every other AI subreddit with their slop content. As soon as the AI grifter entrepreneurs see this, they will start hosting klein models on their sites (even if it's illegal), then flood every social media site there is trying to sell their service with shitty disguised ads: "how was this made?", "I made 10k this month using this tool". Then the AI sloppa youtubers are going to subscribe to said service and start generating the same content (anime to realism slopa, painting to realism slopa, etc.), the never-ending cycle of every AI tool there is.
I gotta admit klein is pretty good. You can see the limitations of every new model right away, but so far klein doesn't seem like a weak model at all. Can't wait to train my loras too.
>>
>>107880968
you're clearly not autistic enough anon maybe reading the whole thread a few times will cook your brain enough
>>
>>107880987
I heard "Flux Klein" is the next big thing right now.
How do I install it?
>>
>>107880968
made her happy
>>
File: ComfyUI_temp_eqbcl_00079_.png (3.42 MB, 1858x1114)
>>
>>107880978
>>107881014
Damn that's good, I'm sad because I hate BFL.
>>
so... no more comfy optimizations for ltx? this is the best it will ever work
>>
File: Flux2Klein9B_00306_.png (1.15 MB, 1152x896)
>>107880978
i fucking hate the modern internet so much.
>>
File: _f2k_00020.png (1.52 MB, 960x1280)
>>
>>107881014
what's the prompt? or catbox pls
>>
reddit grifter general
>>
playing with this new model reminds me how much heavy lifting the booru tag system does in terms of prompt adherence, hope they experiment with finetuning klein or z-base
>>
File: Flux2Klein9B_Edit_00009_.png (3.22 MB, 1839x1226)
>>
>>107881116
Tranny moment
>>
File: Flux2-Klein_00068_.png (2.51 MB, 960x1440)
you can also quite reliably glitch it out and get some neat gens
>>
File: ComfyUI_temp_eqbcl_00060_.png (2.82 MB, 1741x1161)
>>107881018
yeah, I didn't rate them either after flux.2 dev, but they really killed it with klein. I wonder if they learned/copied anything from z-image turbo (qwen text encoder) and added it before release. They really struck back at Qwen/ZIT. I bet the z-image base/edit model isn't that good compared to klein, they took too long to release it, and now here we are. Competition is fun
>>
>>107881147
greaseball moment
>>
HOW TO INSTALL FLUX 2 KLEIN ???????
>>
>>107881170
can't hear you, speak up
>>
>trying out flux klein
>have to test out new workflow i found
>install a million new nodes
>errors
>no even though you just downloaded a million shit automatically without knowing shit you need this obscure node that requires you to find a file on your computer and change a word in it before you download this one
>workflow doesn't even work
yeah i'm dumb and impaired
>>
>>107880968
You need to swallow the autism pill and suck 3 cocks minimum then you are ready for comfy
Or just use one of the forge forks, it's enough for most things >>107880968
>>
Just woke up from a coma, what model are we simping for now?
>>
>go to civit
>check out z-image model page
>z IMAGE mind you
>50 videos of cartoon whores gyrating

bros???
>>
>>107881199
if you see it posted here, and then reddit shortly after, that one
>>
5070ti enough for Klein9b ??
>>
>>107880325
lulz
>>
File: ComfyUI_01801_.png (1.74 MB, 1024x1024)
>>
File: Flux2K4b.jpg (173 KB, 1024x1024)
>>107881223
typically yes
>>
File: ComfyUI_01805_.png (1.05 MB, 1024x1024)
drink up
>>
File: ComfyUI_temp_sojfy_00001_.png (3.37 MB, 1858x1114)
>>107881026
now imagine if everyone had access to powerful gpus and ram... I think ... the government is right about limiting hardware access to generate AI tools
>>
>>
>>107881253
sweet jesus
>>
>>107881253
well yea, many of the political boomers unironically most bothered by them getting lampooned and parodied with AI
>>
File: ComfyUI_01807_.png (2.82 MB, 1088x1920)
honk honk
>>
>>107881250
I only drink spaghetti
>>
File: Flux2K4b.jpg (309 KB, 2048x2048)
>>
File: ComfyUI_01808_.png (2.86 MB, 1088x1920)
>>107881258
>>
File: ComfyUI_temp_sojfy_00003_.png (2.87 MB, 1577x1114)
>>
>>107881258
Make him wear Palestinian babies as slippers
>>
>>107881281
We are fucked if anime tuners jump on this shit model instead of 9b...
>>
File: Flux2-Klein-9bb-e_00001_.png (1.12 MB, 1408x736)
lmfao.. good job klein edit.. couldn't have photoshopped it better myself
>>
>>107880893
why is there always this harsh flash photography look?
>>
File: Flux2K4b.jpg (254 KB, 2048x2048)
>>107881299
they basically have to because of the license? 9b has so many restrictions.
>>
>>107881316
honestly kinda sovl
>>
>>107881149
Yeah I got garbage like this a lot, not sure where I went wrong
>>
File: ComfyUI_temp_sojfy_00005_.png (3.49 MB, 1867x1248)
>>107881263
>>107881269
now imagine if indians had access to powerful and cheap gpu, ram, nvmes, etc
>>
>>107881299
Even 4b is a massive improvement over XL, I want the XL era to fucking end
>>
File: ComfyUI_01809_.png (3.52 MB, 1088x1920)
>>
File: ComfyUI_03834_.png (1.11 MB, 1024x1024)
>>
hey guys i just discovered this really cool-
oh wait reddit grifter niggers lurk here nvm
>>
>>107881331
>>107881346
I'd rather they don't at all then. Fuck it's been too long, I don't want another sidegrade again
>>
>>107881223
I tried with 12 GB (8-bit version) and it was fine, although a bit slow.

>>107881316
lel
>>
>>107881348
Love it.
>>
>>107881349
You have never figured out anything. Stop larping.
>>
>>107881354
Are you actually retarded, 4b is a huge upgrade over SDXL
>>
File: Flux2K4b.jpg (310 KB, 2048x2048)
>>107881339
africans, asians or even specifically chinese etc. aren't very much of bother but just more fellow anime coomers... maybe at some point it'll be the same with indians? idk
>>
>>107881380
You call this >>107881380
a huge upgrade? retarded fuck
>>
File: ComfyUI_01818_.png (3.3 MB, 1088x1920)
cat girls will become real
>>
>>107881339
>>
>>107881275
>>
File: Flux2K4b.jpg (406 KB, 2048x2048)
>>107881354
ultimately one of the newer better-than-sdxl models will be finetuned. 4b or a derivative is certainly a candidate.
>>
>>107881400
Yes it is, have you ever used XL, the better VAE alone is worth it
>>
>>107881339

could u give ah nikka ah workflow gang?
>>
>>107881400
>>107881380
I accept now I am retarded lol, Meant this one >>107881331.
The hands are beyond help and details are smudgy as hell, it's just xl all over
>>
>>107881357
Slow like 5+ minutes, or 1 or 2 minutes?
>>
>>107881387
Chinese are cool in my book, they are living an economic miracle, much of the stuff you think about them is just propaganda, of course there are bad people everywhere, plus you have to see what they have given us in open-source AI models

Africans and indians are known to be scammers/grifters and sadly it seems that is part of their culture, of course there are exceptions, but there is a reason why youtuber channels about scammers always are about indians/africans. The last viral videos of deepfake are indians, that should tell you something
>>
>>107881223
>>107881436
same card
works fine for me, pretty fast, 20 seconds
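
For the "is my card enough" question, a rough back-of-the-envelope sketch: weight memory is just parameter count times bytes per parameter. This counts the diffusion model weights only. The text encoder, VAE, activations and any offloading are not included, so treat the numbers as a floor and not as a measured requirement. The 9B figure is taken from the model name, and `weight_gib` is a hypothetical helper.

```python
def weight_gib(n_params: float, bytes_per_param: float) -> float:
    """Memory needed for the raw weights alone, in GiB."""
    return n_params * bytes_per_param / 2**30


if __name__ == "__main__":
    n_params = 9e9  # assumed from the "9b" in the model name
    for name, bytes_per_param in (("bf16", 2), ("8-bit", 1)):
        print(f"{name}: {weight_gib(n_params, bytes_per_param):.2f} GiB")
```

That lines up with the reports in the thread: around 20GB used for bf16 training on a 3090, and the 8-bit version running on a 12 GB card.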
>>
File: ComfyUI_01827_.png (3.82 MB, 1088x1920)
adding film grain to any prompt makes it better for whatever reason
>>
File: Flux2K4b.jpg (520 KB, 2048x2048)
>>107881433
I don't claim perfect inference settings but it is certainly a flawed training.

Still has many advantages over SDXL: the prompt accuracy and the number of tokens it actually takes into account, the resolution flexibility, edit capability and so on.

One of these models will succeed SDXL.
>>
"change this image to a photorealistc style"
>>
File: 1745913377500221.png (1.48 MB, 2185x1027)
well klein failed my first test,
>>
>>107881485
ew
>>
so from what i can see ITT (thanks for beta testing btw) flux2 is generally a lot of fun, flexible, has very good editing capability, produces body horror and illogic bullshit every now and then, and gets mogged by zit on realistic humans (as expected)
yes?
>>
File: ComfyUI_01837_.png (2.57 MB, 1088x1920)
>>
>>107881436
like 5 minutes, but it's on a RTX A2000 (3060 equivalent). Much slower than ZIT.
>>
>>107881485
it is photorealistic, now tell it to add a million filters like real women
>>
>>107881400
you are retarded, you should compare with base XL before talking shit. the flux 2 VAE alone and the speed of training makes it worth it.


