[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107880290

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
cooomers eating good, XL era is ending
>>
euler, simple
>>
Qwen3 Image is better than Klein 9b right?

Is Klein just a good option for poorer people?
>>
File: Flux2-Klein_00113_.png (1.75 MB, 960x1440)
1.75 MB
1.75 MB PNG
it knows shakira?
>>
>>107883170
honestly the klein image edits i have seen here were very good
>>
SHAKIRA SHAKIRA
OO BABY YOU WANNA FUCK MY ASS
YOU MAKE A WOMAN GO MAD
SHANIQUA
TANIQUA
FEELING THE SIDES OF MY BODY
IMMMMM ON TONIGHT AND MY HIPS DONT LIE AND IM TRYING TO FEEL YOU BOY
LETS GO
MEXICO
>>
>>107883147
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
uh oh, low thread quality inbound!
>>
>>107883207
are you debo or ani
>>
>>107883197
I've been using the edit nodes on anime images, and qwen3 still has clearly better results/adherence.
>>
Why was the last one deleted?
>>
>>107883197
klein hits a little different
>>
File: Untitled.jpg (856 KB, 1951x1105)
856 KB
856 KB JPG
hmm!
>>
File: 1.webm (3.87 MB, 1920x1080)
3.87 MB
3.87 MB WEBM
Thoughts?
>>
>>107883256
green animation is largest
>>
>>107883256
white purple and green
>>
Do you guys miss the good old SD1.5 days? Remember when it was a simpler time and everyone was excited about making gens?
>>
>>107883274
no
>>
>>107883274
if you're not excited by every new improvement its your brain issue
>>
>>107883296
Wow, did you have to be this rude to me? Yes, I'm feeling depressed, alright, but there was no need to call me mentally ill.
>>
Blessed thread of frenship
>>
>>107883274
>getting stable diffusion to even work on Linux was a total fucking bitch with random python errors
>after trouble shooting for hours, your non flagship card that barely had any tensor cores (in my case i used a 1080 ti which had none) would either crash out or struggle to give you a half decent washed out image
No. Early sd 1.5 days sucked and I was largely unimpressed, immediately going back to cloud based shit.
>>
>>107883274
>This post brought to you by Greg Rutkowski and Alphonse Mucha.
>>
>>107883274
I remember the first time trying to get anything to work
>windows
>amd
>some shitty workarounds
>something onnx
>something converting models
>....
>like 10 minutes per picture
>gpu at 100%
I was happy when i had my picture of a pink cocktail on a table at a beach and was fascinated by it kek
>>
>>107883256
absolutely based they recommend using llama.cpp over pootorch
>>
>>107883274
the sidegrades actually felt impactful. nowadays it's "use this snake oil to get realistic skin!" (no difference). I remember ipadapter, loras and controlnets actually being useful and easy to plug and play. nowadays everything sucks and just plateaued
>>
>>107883388
benchod
>>
>>107883364
>Greg Rutkowski
I remember he was bitching about being used in the dataset and then as soon as he was removed everyone forgot he existed lol. AI was the only reason his generic fantasyslop got recognized.
>>
File: file.png (139 KB, 763x454)
139 KB
139 KB PNG
Hey people using klein edit, where is this node from?
>>
>>107883437
https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced
>>
File: a.jpg (124 KB, 1024x768)
124 KB
124 KB JPG
>>
>>107883454
real?
>>
File: b.jpg (123 KB, 1024x768)
123 KB
123 KB JPG
>>
File: 1758235891120793.jpg (77 KB, 1080x1033)
77 KB
77 KB JPG
>>107883274
1.5 was dogshit. the anatomy was catastrophic. It required extensive retouching. luckily, i only discovered this garbage, a few weeks before the release of the XL version, kek
>>
File: 85544237.png (1.44 MB, 1344x832)
1.44 MB
1.44 MB PNG
>>107883274
no because it didn't make dicks, I want dicks
>>
>>107883501
There were a lot of 1.5 models that could make dicks
>>
>>107883546
that arm looks extremely huge/weird
>>
File: f763704952.jpg (78 KB, 512x512)
78 KB
78 KB JPG
>>107883274
I had fun making HR Giger interpretations of politicians.
Lost most of the gens tho
>>
>>107883274
1.5 is unoriginal soul. It's trained with everything and the kitchen sink. You ask these new "good" models for a kodak brownie photograph they don't know anything.
>>
File: comp_0111.jpg (833 KB, 6106x1020)
833 KB
833 KB JPG
>>107883454
>>
>>107883183
Only thing we know is that you are face blind
>>
>>107882616
Kleinfags in shambles
>>
>>107883596
about time you finally admitted that migu is a girl
took you ages
>>
File: comp_0114.jpg (1.18 MB, 5426x1132)
1.18 MB
1.18 MB JPG
>>107883481
>>
File: ComfyUI_temp_flflq_00004_.jpg (761 KB, 1728x2304)
761 KB
761 KB JPG
if i pass the SamplerCustomAdvanced an SD3 latent it doubled my image res? does that happen to anyone else?
>>
>>107883251
>just numbers in the filename
I now use
ComfyUI %date:yyyy-MM-dd hh-mm-ss% %KSamplerAdvanced.noise_seed%
>>
>>107883688
Nobody asked, though?
>>
File: 865.png (1.23 MB, 1280x896)
1.23 MB
1.23 MB PNG
>>
File: comp_0115.jpg (900 KB, 6106x1020)
900 KB
900 KB JPG
>>107883465
>>107883655
huh? when did i say migu isn't a grill
>>
File: comp_0116.jpg (1.13 MB, 4586x1308)
1.13 MB
1.13 MB JPG
>>107883686
>>
>>107883274
I remember in October 2022 being super excited about genning a 512x768 Injun lmao I thought it was amazing. That was SD 1.4.
>>
File: comp_0117.jpg (762 KB, 6360x988)
762 KB
762 KB JPG
>>107883729
>>
>>107883752
Just checked the archives and I posted it on /aco/ lmao
https://desuarchive.org/aco/thread/6779207/#6779754
>>
File: comp_0118.jpg (1.24 MB, 5340x1164)
1.24 MB
1.24 MB JPG
>>107883560
>>
>>107883612
kek'd
>>
>>107883718
It's for your edification, sequentialfag

>>107883776
Jesus
>>
>>107883752
oh hey it knows jessica alba?
>>
File: comp_0119.jpg (1.1 MB, 6020x1036)
1.1 MB
1.1 MB JPG
>>107883246
>>
File: ComfyUI_temp_flflq_00009_.jpg (750 KB, 1728x2304)
750 KB
750 KB JPG
>>
>>107883785
It vaguely knew a bunch of celebs, but this one's not Jessica Alba.
>>
File: mikuoal.png (960 KB, 1024x768)
960 KB
960 KB PNG
>>
File: comp_0120.jpg (1.05 MB, 4586x1308)
1.05 MB
1.05 MB JPG
>>107883798
>>
File: gemsune.png (1.37 MB, 1024x768)
1.37 MB
1.37 MB PNG
>>
File: Flux2K9b.jpg (193 KB, 1024x1024)
193 KB
193 KB JPG
>>107883274
i am still quite excited
>>
File: comp_0121.jpg (1.06 MB, 4334x1388)
1.06 MB
1.06 MB JPG
>>107883752
>>
>>107883274
i remember upgrading from 1080ti to 3090 during sd 1.5 days and was like holy shit i can gen 8 shitty 512x512 terrible images at once!
>>
File: comp_0122.jpg (1.49 MB, 5340x1164)
1.49 MB
1.49 MB JPG
>>107883823
>>
File: comp_0123.jpg (1.14 MB, 6106x1020)
1.14 MB
1.14 MB JPG
>>107883812
interesting
>>
>>107883834
Qwen wins again
>>
i will be excited once z image base drops, and then releases a noobai finetune which blows illustrious out of the water
>>
>>107883481
i don't have a single XL gen saved that wasn't an edit.
i keep loads of 1.5 models, some of which have probably disappeared, because creative weirdos easily fine-tuned 1.5 and then vanished.
>>
File: comp_0124.jpg (1.53 MB, 6106x1020)
1.53 MB
1.53 MB JPG
>>107883818
>>
File: 1704202647350540.jpg (113 KB, 784x676)
113 KB
113 KB JPG
Is there any website I can use to generate a 20-30 minute movie, that can do consistent scenes?
>>
File: Flux2Klein9B_Edit_00043_.jpg (1.56 MB, 3456x2304)
1.56 MB
1.56 MB JPG
>>107883798
>>
File: Flux2-Klein_00038_.jpg (1.02 MB, 1728x2304)
1.02 MB
1.02 MB JPG
>>107883904
kek
>>
>>107883891
<is the singularity out yet?
>>
>>107883891
>Jarvis, create a free website where users can generate a 30 minute movie that makes sense for free
>>
File: 42.png (1.47 MB, 960x1232)
1.47 MB
1.47 MB PNG
When will I stop getting these demonic captchas and get the normal ones? No wonder even the 4chan xt dev bailed out, this website works againsts its users
>>
File: 1754792957127325.png (365 KB, 619x403)
365 KB
365 KB PNG
wen base
>>
Tried chatgpt image gen the other day for work
Shit is light years away from local ngl
Local is just too unwieldy, you have to fucking study how to do anything with it
That shit was just plug and play and better results
So that is very unfortunate
>>
File: file5.jpg (19 KB, 250x203)
19 KB
19 KB JPG
>>107883925
>>107883949
So what's the best this horseshit AI can do then? 10 sec clips? that may or may not be consistent or have continuity
>>
File: Flux2-Klein_00049_.jpg (1.03 MB, 1728x2304)
1.03 MB
1.03 MB JPG
>>
File: Soon(TM).gif (1.86 MB, 498x274)
1.86 MB
1.86 MB GIF
Mr President, another Z-image commit has hit the tower.
https://github.com/kohya-ss/musubi-tuner/pull/843
>>
>>107883999
yes. it's only for gooning and memes but the memes kinda suck now and the new model has to have someone spend several hundred thousand dollars to train porn into the base model but it's already fried on release
>>
>>107884010
https://github.com/kohya-ss/musubi-tuner/pull/843#issuecomment-3759879680
>I will test and merge this as soon as the base weights are released.
based kohya not falling for their bullshit, no weight = no merge
>>
>>107883686
explain?
>>
>>107884017
How many more years until I can make 30 minute movie that is indistinguishable or at least very close in quality to the real thing?
>>
>>107883256
This sounds absolutely insane! Is there a paper somewhere where we could listen to samples? Realtime too, the TTS model I've been waiting for. Can we control emotions like in ElevenLabs?
>>
>>107883999
Yeah but some people manage. Today I found this guy lmao, makes 2-4 minute erotic films about Wonder Woman getting hypnotised/enslaved/fucked, with clearly several short scenes just stitched together.
https://www.deviantart.com/deviant-wonders/gallery/all
>>
saw some solid faceswaps with flux2 in the thread before. does it work with the basic workflow or do i need dark magic?
>>
File: image (41) (1).jpg (2.47 MB, 4032x1728)
2.47 MB
2.47 MB JPG
Krea still compares pretty well to both Klein and Z Image IMO. They're usually always pretty similar to each other too I guess because of Qwen as the TE, whereas T5 takes Krea in a bit of a different direction usually on the same prompt.
>>
File: Untitled.jpg (181 KB, 1140x693)
181 KB
181 KB JPG
>>107884029
i'm passing in 1152x864 and getting 2304x1728 out.
>>
>>107884053
krea looks burned af ngl
>>
>>107884049
You need a workflow for this with some special nodes for editing like this https://github.com/BigStationW/ComfyUi-TextEncodeEditAdvanced/blob/main/workflow/workflow_Flux2_Klein_9b.json
>>
>>107883256
Thanks doc
>>
File: comfyexplication.png (20 KB, 825x183)
20 KB
20 KB PNG
>>107884060
ahh nice find, gonna test it later, I'm trying to train a klein lora, diffusion-pipe added support, its going pretty fast
>>
File: Flux2-Klein_00057_.jpg (808 KB, 1728x2304)
808 KB
808 KB JPG
>>107884060
>>
>>107884047
Looks stupid and inconsistent, I only saw one clip since all others need to login.

1/10 would not watch and would not bother with this horseshit.
>>
>>107883919
>>107884003
one thing that I noticed about klein is that it tends to give very manly hands to women
>>
>>107884065
cheers, will have a look at it
>>
File: comp_0125.jpg (892 KB, 4334x1388)
892 KB
892 KB JPG
>>107883164
>>
File: Flux2-Klein_00068_.jpg (841 KB, 1536x2304)
841 KB
841 KB JPG
>>107884089
ahh that explains it. good find anon.
good luck with training, i found it was almost as easy as zimage. are you doing i2i or t2i?
>>
>>107884089
>diffusion-pipe
ugh hate that one. so clunky to use and uses pointless deepspeed like everyone has fifty gpus
>>
>>107883147
Klein is better at edits than NBP. But it understands only a fraction of it, how did they do it?
>>
>>107884061
Yeah it's a bit contrasty. It retains more fine detail when upscaling then the other two though.
>>
File: Flux2K9b.jpg (137 KB, 1024x1024)
137 KB
137 KB JPG
>>107883842
hm. it did not become real
>>
>>107884121
thanks for the comparisons anon.
Looks like qwen is typically better but with the tradeoff that it can change things beyond what's been asked for
>>
File: radiance.jpg (104 KB, 848x1488)
104 KB
104 KB JPG
>>
>>107884151
>how did they do it?
getting humiliated by Alibaba and Z-image turbo does that to you, when your ego has been hurt you only want to prove everyone wrong so you work as hard as you can
>>
File: Flux2K9b.jpg (267 KB, 1024x1024)
267 KB
267 KB JPG
>>107883842 >>107883823
BTW I'm glad that prompt still works.

Some SD1.5 prompts did not fare quite as well on flux2 klein.
>>
I like that models tend to be more and more unified, for example they managed to make Flux 2 Klein good at both edititing and as a text2image model, that's how it should be, a model that can do it all at the same time
>>
>>107884170
gasper
>>
>>107884022
>based kohya not falling for their bullshit, no weight = no merge
Merging takes on second, only thing that takes time is reviewing, he can review at his leisure and merge the second Base drops
>>
>>107884222
>Merging takes on second
he needs to test the base model and see if the script works on it before approving and merging though
>>
>>107884229
didn't ask
>>
>>107884170
are you trying his latest radiance models?
https://huggingface.co/lodestones/Zeta-Chroma/blob/main/zeta-chroma-x0-pixel-proto.safetensors
>>
File: 1748621151560168.png (109 KB, 500x250)
109 KB
109 KB PNG
>>107884232
>>
>>107884239
puto
>>
Hey bros what's the good news
>>
File: diffusion-pipeflux2.png (42 KB, 1085x317)
42 KB
42 KB PNG
>>107884129
t2i, thats only supported for now


>>107884146
Is not that bad once you got it set it up
>>
>>107884251
i got fired
>>
>>107884251
I got hired because some fag got fired lmao
>>
File: Flux2K4b.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>107884251
main thing rn: flux2 klein is out

>>107884233
checking some old SD1.4/1.5 prompts on flux2 kein and current radiance

haven't tested zeta radiance yet
>>
so what is the use of flux-2-klein-base-9b over distilled?
>>
>>107884270
training (actually no, the licence is shit lul)
>>
https://www.reddit.com/r/StableDiffusion/comments/1qdl0dd/ltx2_vs_wan_22_the_anime_series/
Absolute cinema
>>
>>107884233
nta is there an actual workflow for this?
>>
File: 00004-1646943154.png (3.03 MB, 1728x1344)
3.03 MB
3.03 MB PNG
>>
>>107884282
lol at the hunyuan grave and jensen laughing like an evil maniac
>>
>>107884273
Still good for training loras, only full finetuners care about licenses.
>>
File: x.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: Flux2K9b.jpg (151 KB, 1024x1024)
151 KB
151 KB JPG
>>107884282
kj boss and potato 3060 gpu and many other good references.

fucking saved.
>>
>>107884166
A Qwen Edit with the realism of the 2512 T2I model would be WAY better than Klein (but still slower). I hope they do one, regardless.
>>
so where Z-Base
>>
Why the fuck do you guys hate Pony so much?
>>
>>107884315
I hope Z-image edit will have the realism of Z-image turbo, this shit would be goated
>>
File: 1742982986447351.jpg (529 KB, 1024x1024)
529 KB
529 KB JPG
whats the verdict on klein? i get body horror with t2i and the edits make the images slightly pinker/orange
>>
>>107884344
>the edits make the images slightly pinker/orange
that's because they're still using a VAE in the year of our lord 2026, they should go pixel mode on edit models ffs
>>
File: 1755524702785364.jpg (640 KB, 992x1040)
640 KB
640 KB JPG
>>
damn. klein actually gives me better results than qwen when inpainting.
>>
Why didn't any of you tell me about this user on civitAI?

https://civitai.com/user/Synthdark8

Absolute god-tier anime style loras.
>>
File: 2952.png (529 KB, 1040x1040)
529 KB
529 KB PNG
>>
>>107884340
he's a cuck that removes artist styles on his models, fuck that horse fucker
>>
File: b54amn.png (1.94 MB, 1024x1024)
1.94 MB
1.94 MB PNG
>>
5090 but i got 2TB disk space and running out. i'm like the guy who parks his Porsche in front of a trailer park home
>>
>>107884372
>shovelware shit
>>
>>107884344
It still needs more training to replace SDXL for anime, but it's actually quite good. I can see people using it as an alternative to ZIT for various subjects.

Shame only 4b has an open license.
>>
File: radiance.jpg (122 KB, 848x1488)
122 KB
122 KB JPG
>>
>>107884253
I trust it even less for cumfart cocksucking. their formats suck and he won't make anything better than gguf imatrix
>>
kill ani
>>
File: Flux2-Concat_00019_.png (1.45 MB, 2394x672)
1.45 MB
1.45 MB PNG
>>107884344
It's best in class for a 9b model but isn't the best overall. It's fast, can handle edits on large images and is solid as an upscaler.
For t2i use the distilled, it's nothing special but it's fast.
For edits use the base model, it's slower but much higher quality
>>
File: 39.png (2.22 MB, 1216x1088)
2.22 MB
2.22 MB PNG
>>107884408
no
>>
File: 1742737178391062.jpg (263 KB, 1584x744)
263 KB
263 KB JPG
>change to high definition
alright it's good
>>
>>107884415
>For edits use the base model, it's slower but much higher quality
really? do you have some comparison pictures between a distilled edit and a base edit?
>>
>>107884423
benchod
>>
File: Flux2-Concat_00004_.jpg (2.6 MB, 3220x1547)
2.6 MB
2.6 MB JPG
>>107884427
prompt was
>add colour, keep detailed sketch style
>>
>>107884415
>For edits use the base model
how many steps? honestly think the distill results are great already
>>
>>107884441
damn, base managed to keep the sovl wheras the distilled version slopified it
>>
>>107884267
Too soon
>>
File: ykhjkz.png (1.93 MB, 1024x1024)
1.93 MB
1.93 MB PNG
>>
ACEStep 1.5 is apparently still in the oven and improvements are still being made bros, we'll be eating good

https://files.catbox.moe/jc3fgz.mp3
(7 min song with coherence and great audio/vocals, insane)

https://files.catbox.moe/6pmrsy.mp3


Also ACEStep dev has this to say about HeartMuLa

>It still sounds very impressive, the lyrics alignment is spot-on, and the details are incredibly realistic.
>However, the limitations are as follows: our 2B model supports 50 languages, which does cause a certain degree of capacity issues. Additionally, in terms of inference speed, the time it takes for this model to generate one song is enough for us to generate 200 songs.
>>
File: 54545454545.png (99 KB, 1186x825)
99 KB
99 KB PNG
>>107884478
These gens are from recent improvements that have been made
>>
>>107884478
>https://files.catbox.moe/6pmrsy.mp3
the guitar sounds so fake, looks like a cheap VSL from the 00s lmao
>>
>>107884478
doesn't feel quite right to me yet but it is progressing
>>
>>107884478
pretty cool! would love to have a local model music that doesn't suck
>>
>>107884478
Wow I had given up on ACEStep since 1.5 has been the next on their roadmap since last summer and nothing has happened
>>
>>107884421
yes but what about Mr catjak?
>>
>>107884526
Which part? The instruments on that first one sound really good, vocals mostly okay. No idea what 8k, 9k and 10k mean btw. If those atre training steps then the first one is the one with 10k steps, bottom is 8k.
>>
>all custom nodes are updated with no errors
>finally get x0 to run
>finishes gen
>100% static output

Ok, cool. Still not updating comfy though.
>>
File: 1747118981893426.png (499 KB, 1080x739)
499 KB
499 KB PNG
>>107884584
>Still not updating comfy though.
are you sure you won't update comfy once Z-image base will be released?
>>
File: Flux2-Klein_00147_.png (1.86 MB, 1008x1024)
1.86 MB
1.86 MB PNG
Great model, and the output has enough variation to keep genning.
>>
>>107884535
>1.5 has been the next on their roadmap since last summer and nothing has happened

You can try out 1.5 on their Discord and the dev there has been sharing updates/samples ever since it's been pretraining phase (even when you couldn't hear anything coherent out of it).
>>
>>107884605
tmws
>>
>>107884610
>5 hp noobs trying to reach for the master sword
>>
>>107884441
note Base is DEFINITELY giga-worse than Distilled in like, a lot of cases though. It might depend on the prompt / style.
>>
>>107884605
I dont care about z. I just want to gen and the constant downloading, tinkering, then deleting every new model every other week isn't fun.
>>
>>107884605
are people really expecting it to be better than Turbo?
>>
File: file.png (279 KB, 825x510)
279 KB
279 KB PNG
>>107884441
base is too noisy and distilled is too slopped, now hear me out, what if we merge them together and find the sweet spot
>>
>>107884668
I'm waiting for Z-image edit personally, there's a big chance it's gonna be even better than Klein
>>
i guess i will join the fun. Will a dual GPU setup work 5070ti + 5060ti work for 32gb of vram or do i have to spend 3x as much for a 5090 for 32gb vram? I already have 128gb in my system already.
>>
>>107884676
>Will a dual GPU setup work 5070ti + 5060ti work for 32gb of vram
it will, with that node
https://github.com/pollockjj/ComfyUI-MultiGPU
>>
>>107884610
There's a weird criss cross line pattern. It's most evident in the guy's hair but I think it's on the entire image.
>>
File: Flux2-Klein-T2I_00020_.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>107884703
Yeah, there's a few weird things which can happen with klein depending on the res, model size (4b or 9b), model type (base or distilled), number of steps, sampler etc
Makes it hard to assess quality
>>
>>107884730
I would say that it's a bit inconsistent, sometimes you can get a good quality image, and sometimes you have pure slop, I think it was undertrained a bit, maybe they've rushed it so that they can get some hype before Z-image base and Z-image edit destroys every competition once and for all
>>
>>107884669
there might actually be some benefit to that DESU, for both T2I and editing
>>
>>107884618
Cool, here's hoping it's closing in on a release
>>
File: ComfyUI_temp_fnyhl_00014_.png (3.6 MB, 1664x1216)
3.6 MB
3.6 MB PNG
>>
from what I've gathered ltx i2v is kinda fucked is that right? just wait and hope for them to fix it with an updated model?
>>
>>107884777
I think it's already fixed. I was struggling to get more than 10 seconds of video before but now I can go to 20+ without a problem. Didn't change much to my old workflow either other than add the new vae
>>
>>107884668
It will be better for training, the big question is if the loras will work well with Z-Image Turbo or if you will have to gen on Base as well, who knows, maybe there will be a new Turbo better aligned with the Base models.

Either way, for large finetuning, Z-Image Base will be what NSFW etc trainers will use both for the license and the quality of the model(s) despite only being 6B (which makes training much faster).
>>
File: ComfyUI_temp_ppqem_00002_.png (3.06 MB, 1216x1664)
3.06 MB
3.06 MB PNG
>>
>>107884777
correct, there are issues with i2v (you can sorta fix it by genning at 48fps) and audio.
Fixes for both, along with better portrait mode coming 'soon'
>>
Was making a character for some chatbot card
I busted my gpu before I got to generate it
If anyone is kind enough I'd appreciate something with the following


Illustrious nsfw as the checkpoint. Sexualized but not nsfw
No Lora
512 width
1024 height

Qipao,middle length hair, wine color hair, cowboy shot, big or huge breasts, fully clothed, wide hips, serious expression, portrait
Negative tag nsfw, cleavage

Was gonna make it myself but GPU died 3 days ago.
>>
>>107884802
>>>/r/
>>
>>107884802
There's free image generators online you know, and I'm not talking about Sora or Gemini.
>>
File: ComfyUI_temp_ppqem_00012_.png (2.94 MB, 1216x1664)
2.94 MB
2.94 MB PNG
>>
>>107884802
Which gpu and how did it die?
>>
File: ComfyUI_temp_ppqem_00013_.png (3.23 MB, 1216x1664)
3.23 MB
3.23 MB PNG
>>
>>107884791
it's not about the length it's more about all sorts of video issues and quirks with i2v which make it barely usable
>>107884800
I tried 48 fps but it didn't seem that much better, and after updating nodes and comfy it stopped giving coherent outputs with 48 and instead gave me a stuttery mess of a video for some reason
>Fixes for both, along with better portrait mode coming 'soon'
sounds good
>>
File: ComfyUI_temp_axihh_00026_.png (3.31 MB, 1056x1328)
3.31 MB
3.31 MB PNG
>>107884441
This is on distill with "keep everything else exactly the same, especially the grainy texture" and I also described the colors. The scheduler is the main culprit, I used shift=1 and simple.
>>
File: eiv0o6.png (2.11 MB, 1024x1024)
2.11 MB
2.11 MB PNG
>>
You can make extremely long ltxv videos with this: https://github.com/RandomInternetPreson/ComfyUI_LTX-2_VRAM_Memory_Management
>>
>>107884857
what wf are you using? neither the default or the bigstation one let me adjust shift or choose simple as the sampler
>>
>>107884798
blush is a bit much but still awesome

>>107884838
>>107884844
awesome gens

some anons bitch about it but sexy asians always keep me coming back
>>
>>107884911
https://files.catbox.moe/cjrc0u.json



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.