[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107880290

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
cooomers eating good, XL era is ending
>>
euler, simple
>>
Qwen3 Image is better than Klein 9b right?

Is Klein just a good option for poorer people?
>>
File: Flux2-Klein_00113_.png (1.75 MB, 960x1440)
1.75 MB
1.75 MB PNG
it knows shakira?
>>
>>107883170
honestly the klein image edits i have seen here were very good
>>
SHAKIRA SHAKIRA
OO BABY YOU WANNA FUCK MY ASS
YOU MAKE A WOMAN GO MAD
SHANIQUA
TANIQUA
FEELING THE SIDES OF MY BODY
IMMMMM ON TONIGHT AND MY HIPS DONT LIE AND IM TRYING TO FEEL YOU BOY
LETS GO
MEXICO
>>
>>107883147
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
uh oh, low thread quality inbound!
>>
>>107883207
are you debo or ani
>>
>>107883197
I've been using the edit nodes on anime images, and qwen3 still has clearly better results/adherence.
>>
Why was the last one deleted?
>>
>>107883197
klein hits a little different
>>
File: Untitled.jpg (856 KB, 1951x1105)
856 KB
856 KB JPG
hmm!
>>
File: 1.webm (3.87 MB, 1920x1080)
3.87 MB
3.87 MB WEBM
Thoughts?
>>
>>107883256
green animation is largest
>>
>>107883256
white purple and green
>>
Do you guys miss the good old SD1.5 days? Remember when it was a simpler time and everyone was excited about making gens?
>>
>>107883274
no
>>
>>107883274
if you're not excited by every new improvement its your brain issue
>>
>>107883296
Wow, did you have to be this rude to me? Yes, I'm feeling depressed, alright, but there was no need to call me mentally ill.
>>
Blessed thread of frenship
>>
>>107883274
>getting stable diffusion to even work on Linux was a total fucking bitch with random python errors
>after trouble shooting for hours, your non flagship card that barely had any tensor cores (in my case i used a 1080 ti which had none) would either crash out or struggle to give you a half decent washed out image
No. Early sd 1.5 days sucked and I was largely unimpressed, immediately going back to cloud based shit.
>>
>>107883274
>This post brought to you by Greg Rutkowski and Alphonse Mucha.
>>
>>107883274
I remember the first time trying to get anything to work
>windows
>amd
>some shitty workarounds
>something onnx
>something converting models
>....
>like 10 minutes per picture
>gpu at 100%
I was happy when i had my picture of a pink cocktail on a table at a beach and was fascinated by it kek
>>
>>107883256
absolutely based they recommend using llama.cpp over pootorch
>>
>>107883274
the sidegrades actually felt impactful. nowadays it's "use this snake oil to get realistic skin!" (no difference). I remember ipadapter, loras and controlnets actually being useful and easy to plug and play. nowadays everything sucks and just plateaued
>>
>>107883388
benchod
>>
>>107883364
>Greg Rutkowski
I remember he was bitching about being used in the dataset and then as soon as he was removed everyone forgot he existed lol. AI was the only reason his generic fantasyslop got recognized.
>>
File: file.png (139 KB, 763x454)
139 KB
139 KB PNG
Hey people using klein edit, where is this node from?
>>
>>107883437
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced
>>
File: a.jpg (124 KB, 1024x768)
124 KB
124 KB JPG
>>
>>107883454
real?
>>
File: b.jpg (123 KB, 1024x768)
123 KB
123 KB JPG
>>
File: 1758235891120793.jpg (77 KB, 1080x1033)
77 KB
77 KB JPG
>>107883274
1.5 was dogshit. the anatomy was catastrophic. It required extensive retouching. luckily, i only discovered this garbage, a few weeks before the release of the XL version, kek
>>
File: 85544237.png (1.44 MB, 1344x832)
1.44 MB
1.44 MB PNG
>>107883274
no because it didn't make dicks, I want dicks
>>
>>107883501
There were a lot of 1.5 models that could make dicks
>>
>>107883546
that arm looks extremely huge/weird
>>
File: f763704952.jpg (78 KB, 512x512)
78 KB
78 KB JPG
>>107883274
I had fun making HR Giger interpretations of politicians.
Lost most of the gens tho
>>
>>107883274
1.5 is unoriginal soul. It's trained with everything and the kitchen sink. You ask these new "good" models for a kodak brownie photograph they don't know anything.
>>
File: comp_0111.jpg (833 KB, 6106x1020)
833 KB
833 KB JPG
>>107883454
>>
>>107883183
Only thing we know is that you are face blind
>>
>>107882616
Kleinfags in shambles
>>
>>107883596
about time you finally admitted that migu is a girl
took you ages
>>
File: comp_0114.jpg (1.18 MB, 5426x1132)
1.18 MB
1.18 MB JPG
>>107883481
>>
File: ComfyUI_temp_flflq_00004_.jpg (761 KB, 1728x2304)
761 KB
761 KB JPG
if i pass the SamplerCustomAdvanced an SD3 latent it doubled my image res? does that happen to anyone else?
>>
>>107883251
>just numbers in the filename
I now use
ComfyUI %date:yyyy-MM-dd hh-mm-ss% %KSamplerAdvanced.noise_seed%
>>
>>107883688
Nobody asked, though?
>>
File: 865.png (1.23 MB, 1280x896)
1.23 MB
1.23 MB PNG
>>
File: comp_0115.jpg (900 KB, 6106x1020)
900 KB
900 KB JPG
>>107883465
>>107883655
huh? when did i say migu isn't a grill
>>
File: comp_0116.jpg (1.13 MB, 4586x1308)
1.13 MB
1.13 MB JPG
>>107883686
>>
>>107883274
I remember in October 2022 being super excited about genning a 512x768 Injun lmao I thought it was amazing. That was SD 1.4.
>>
File: comp_0117.jpg (762 KB, 6360x988)
762 KB
762 KB JPG
>>107883729
>>
>>107883752
Just checked the archives and I posted it on /aco/ lmao
https://desuarchive.org/aco/thread/6779207/#6779754
>>
File: comp_0118.jpg (1.24 MB, 5340x1164)
1.24 MB
1.24 MB JPG
>>107883560
>>
>>107883612
kek'd
>>
>>107883718
It's for your edification, sequentialfag

>>107883776
Jesus
>>
>>107883752
oh hey it knows jessica alba?
>>
File: comp_0119.jpg (1.1 MB, 6020x1036)
1.1 MB
1.1 MB JPG
>>107883246
>>
File: ComfyUI_temp_flflq_00009_.jpg (750 KB, 1728x2304)
750 KB
750 KB JPG
>>
>>107883785
It vaguely knew a bunch of celebs, but this one's not Jessica Alba.
>>
File: mikuoal.png (960 KB, 1024x768)
960 KB
960 KB PNG
>>
File: comp_0120.jpg (1.05 MB, 4586x1308)
1.05 MB
1.05 MB JPG
>>107883798
>>
File: gemsune.png (1.37 MB, 1024x768)
1.37 MB
1.37 MB PNG
>>
File: Flux2K9b.jpg (193 KB, 1024x1024)
193 KB
193 KB JPG
>>107883274
i am still quite excited
>>
File: comp_0121.jpg (1.06 MB, 4334x1388)
1.06 MB
1.06 MB JPG
>>107883752
>>
>>107883274
i remember upgrading from 1080ti to 3090 during sd 1.5 days and was like holy shit i can gen 8 shitty 512x512 terrible images at once!
>>
File: comp_0122.jpg (1.49 MB, 5340x1164)
1.49 MB
1.49 MB JPG
>>107883823
>>
File: comp_0123.jpg (1.14 MB, 6106x1020)
1.14 MB
1.14 MB JPG
>>107883812
interesting
>>
>>107883834
Qwen wins again
>>
i will be excited once z image base drops, and then releases a noobai finetune which blows illustrious out of the water
>>
>>107883481
i don't have a single XL gen saved that wasn't an edit.
i keep loads of 1.5 models, some of which have probably disappeared, because creative weirdos easily fine-tuned 1.5 and then vanished.
>>
File: comp_0124.jpg (1.53 MB, 6106x1020)
1.53 MB
1.53 MB JPG
>>107883818
>>
File: 1704202647350540.jpg (113 KB, 784x676)
113 KB
113 KB JPG
Is there any website I can use to generate a 20-30 minute movie, that can do consistent scenes?
>>
File: Flux2Klein9B_Edit_00043_.jpg (1.56 MB, 3456x2304)
1.56 MB
1.56 MB JPG
>>107883798
>>
File: Flux2-Klein_00038_.jpg (1.02 MB, 1728x2304)
1.02 MB
1.02 MB JPG
>>107883904
kek
>>
>>107883891
<is the singularity out yet?
>>
>>107883891
>Jarvis, create a free website where users can generate a 30 minute movie that makes sense for free
>>
File: 42.png (1.47 MB, 960x1232)
1.47 MB
1.47 MB PNG
When will I stop getting these demonic captchas and get the normal ones? No wonder even the 4chan xt dev bailed out, this website works againsts its users
>>
File: 1754792957127325.png (365 KB, 619x403)
365 KB
365 KB PNG
wen base
>>
Tried chatgpt image gen the other day for work
Shit is light years away from local ngl
Local is just too unwieldy, you have to fucking study how to do anything with it
That shit was just plug and play and better results
So that is very unfortunate
>>
File: file5.jpg (19 KB, 250x203)
19 KB
19 KB JPG
>>107883925
>>107883949
So what's the best this horseshit AI can do then? 10 sec clips? that may or may not be consistent or have continuity
>>
File: Flux2-Klein_00049_.jpg (1.03 MB, 1728x2304)
1.03 MB
1.03 MB JPG
>>
File: Soon(TM).gif (1.86 MB, 498x274)
1.86 MB
1.86 MB GIF
Mr President, another Z-image commit has hit the tower.
https://github.com/kohya-ss/musubi-tuner/pull/843
>>
>>107883999
yes. it's only for gooning and memes but the memes kinda suck now and the new model has to have someone spend several hundred thousand dollars to train porn into the base model but it's already fried on release
>>
>>107884010
https://github.com/kohya-ss/musubi-tuner/pull/843#issuecomment-3759879680
>I will test and merge this as soon as the base weights are released.
based kohya not falling for their bullshit, no weight = no merge
>>
>>107883686
explain?
>>
>>107884017
How many more years until I can make 30 minute movie that is indistinguishable or at least very close in quality to the real thing?
>>
>>107883256
This sounds absolutely insane! Is there a paper somewhere where we could listen to samples? Realtime too, the TTS model I've been waiting for. Can we control emotions like in ElevenLabs?
>>
>>107883999
Yeah but some people manage. Today I found this guy lmao, makes 2-4 minute erotic films about Wonder Woman getting hypnotised/enslaved/fucked, with clearly several short scenes just stitched together.
https://www.deviantart.com/deviant-wonders/gallery/all
>>
saw some solid faceswaps with flux2 in the thread before. does it work with the basic workflow or do i need dark magic?
>>
File: image (41) (1).jpg (2.47 MB, 4032x1728)
2.47 MB
2.47 MB JPG
Krea still compares pretty well to both Klein and Z Image IMO. They're usually always pretty similar to each other too I guess because of Qwen as the TE, whereas T5 takes Krea in a bit of a different direction usually on the same prompt.
>>
File: Untitled.jpg (181 KB, 1140x693)
181 KB
181 KB JPG
>>107884029
i'm passing in 1152x864 and getting 2304x1728 out.
>>
>>107884053
krea looks burned af ngl
>>
>>107884049
You need a workflow for this with some special nodes for editing like this https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>
>>107883256
Thanks doc
>>
File: comfyexplication.png (20 KB, 825x183)
20 KB
20 KB PNG
>>107884060
ahh nice find, gonna test it later, I'm trying to train a klein lora, diffusion-pipe added support, its going pretty fast
>>
File: Flux2-Klein_00057_.jpg (808 KB, 1728x2304)
808 KB
808 KB JPG
>>107884060
>>
>>107884047
Looks stupid and inconsistent, I only saw one clip since all others need to login.

1/10 would not watch and would not bother with this horseshit.
>>
>>107883919
>>107884003
one thing that I noticed about klein is that it tends to give very manly hands to women
>>
>>107884065
cheers, will have a look at it
>>
File: comp_0125.jpg (892 KB, 4334x1388)
892 KB
892 KB JPG
>>107883164
>>
File: Flux2-Klein_00068_.jpg (841 KB, 1536x2304)
841 KB
841 KB JPG
>>107884089
ahh that explains it. good find anon.
good luck with training, i found it was almost as easy as zimage. are you doing i2i or t2i?
>>
>>107884089
>diffusion-pipe
ugh hate that one. so clunky to use and uses pointless deepspeed like everyone has fifty gpus
>>
>>107883147
Klein is better at edits than NBP. But it understands only a fraction of it, how did they do it?
>>
>>107884061
Yeah it's a bit contrasty. It retains more fine detail when upscaling then the other two though.
>>
File: Flux2K9b.jpg (137 KB, 1024x1024)
137 KB
137 KB JPG
>>107883842
hm. it did not become real
>>
>>107884121
thanks for the comparisons anon.
Looks like qwen is typically better but with the tradeoff that it can change things beyond what's been asked for
>>
File: radiance.jpg (104 KB, 848x1488)
104 KB
104 KB JPG
>>
>>107884151
>how did they do it?
getting humiliated by Alibaba and Z-image turbo does that to you, when your ego has been hurt you only want to prove everyone wrong so you work as hard as you can
>>
File: Flux2K9b.jpg (267 KB, 1024x1024)
267 KB
267 KB JPG
>>107883842 >>107883823
BTW I'm glad that prompt still works.

Some SD1.5 prompts did not fare quite as well on flux2 klein.
>>
I like that models tend to be more and more unified, for example they managed to make Flux 2 Klein good at both edititing and as a text2image model, that's how it should be, a model that can do it all at the same time
>>
>>107884170
gasper
>>
>>107884022
>based kohya not falling for their bullshit, no weight = no merge
Merging takes on second, only thing that takes time is reviewing, he can review at his leisure and merge the second Base drops
>>
>>107884222
>Merging takes on second
he needs to test the base model and see if the script works on it before approving and merging though
>>
>>107884229
didn't ask
>>
>>107884170
are you trying his latest radiance models?
https://huggingface.co/lodestones/Zeta-Chroma/blob/main/zeta-chroma-x0-pixel-proto.safetensors
>>
File: 1748621151560168.png (109 KB, 500x250)
109 KB
109 KB PNG
>>107884232
>>
>>107884239
puto
>>
Hey bros what's the good news
>>
File: diffusion-pipeflux2.png (42 KB, 1085x317)
42 KB
42 KB PNG
>>107884129
t2i, thats only supported for now


>>107884146
Is not that bad once you got it set it up
>>
>>107884251
i got fired
>>
>>107884251
I got hired because some fag got fired lmao
>>
File: Flux2K4b.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>107884251
main thing rn: flux2 klein is out

>>107884233
checking some old SD1.4/1.5 prompts on flux2 kein and current radiance

haven't tested zeta radiance yet
>>
so what is the use of flux-2-klein-base-9b over distilled?
>>
>>107884270
training (actually no, the licence is shit lul)
>>
https://www.reddit.com/r/StableDiffusion/comments/1qdl0dd/ltx2_vs_wan_22_the_anime_series/
Absolute cinema



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.