[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 1769494284112966.jpg (986 KB, 3264x1206)
986 KB
986 KB JPG
Discussion of Free and Open Source Diffusion Models

Prev: >>107978058

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
Spamming/flooding
There is already thread up at
>>107979673
>>107979673
nowhere close to bump limit
>>
>>107981239
Thanks, baker. Not every mod's up to speed on Animanon's MO, huh.
>>
>>107981282
stop thanking yourself its fucking cringe
>>
>>107981278
>>
>>107981278
>>107981297
being this angry at 6 in the morning isn't healthy
>>
>>
>>
File: 1758699429456155.png (2.08 MB, 1184x1344)
2.08 MB
2.08 MB PNG
>not in fagollage
SAD
>>
Z-Base ETA?
>>
File: file.jpg (1.73 MB, 979x2558)
1.73 MB
1.73 MB JPG
Do not engage with the samefagging avatarfag that likes to shit up this place, so he can promote his UI or something.
>>
File: 1750744585312644.png (2.64 MB, 1472x1056)
2.64 MB
2.64 MB PNG
'tis time for torture
>>
File: 1763653931345179.png (1.64 MB, 1440x1088)
1.64 MB
1.64 MB PNG
>>
>>107981838
repost
>>
File: 1752126289698970.png (1.71 MB, 1344x1344)
1.71 MB
1.71 MB PNG
>>107981830
soon
>>
Like look at these:
https://imgur.com/a/gvdhesS
Chroma is the ONLY image model that does not scream slop with every image.
>>
>>107981854
it's actually not, spot the differences thoughever :)
>>
>>107981861
zit slop
>>
File: radiance_x32.jpg (143 KB, 1280x1280)
143 KB
143 KB JPG
>>
>>107981867
your moms a zit slop faagggg
>>
>>107981869
>white skin
*pukes*
>>
>Chroma is the ONLY image model that does not scream slop with every image.
>>
>>107981870
rekt
>>
>>107981872
Yep is fucking bellows
>>
>>107981872
true, did you see for yourself: >>107981859
>>
File: 1763277389312589.png (2.18 MB, 1408x1120)
2.18 MB
2.18 MB PNG
imagine not liking 1girl standing
what causes this??
>>
>>107981869
HANDS
>>
If soon gets its own soon, I don't give a damn, what a culture
>>
File: 1745334022788081.png (2.25 MB, 1344x1184)
2.25 MB
2.25 MB PNG
1girl bros???
>>
File: collage.jpg (3.4 MB, 7295x4962)
3.4 MB
3.4 MB JPG
>>
>>107981859
You can't even prove that was chroma
>>
>>107981869
enough of radiance, its far too undertrained to be using still
>>
Any real bake? This is just another Ran troll thread that will get rightfully deleted
>>
>>107981887
I prefer 1girl sucking(dog cock)
>>
File: hmm.jpg (1.02 MB, 3477x5000)
1.02 MB
1.02 MB JPG
>>
>>107981859
most of those look like midjourney slop
>>
>>107981897
literally from here
https://civitai.com/models/860092/kegant?modelVersionId=2584964
>>
>>107981907
point doesnt change
>>
>>107981907
no metadata = fake
>>
>>107981907
>>107981905
>>107981904
>>107981903
evacuate troll bake, we need a proper cozy bread
>>
File: 3967753789.png (3.63 MB, 1344x1728)
3.63 MB
3.63 MB PNG
>>
>>107981905
>midjourney slop
midjourney is the least sloppy model but its not local and cant do nsfw so its shit
>>
File: F2Kb__00014_.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>107981830
never. It's a scam, there is no base, because zit is a shitmix.
>>
>>107981916
kill ani
>>
File: Flux2-Klein_00365_.png (1.2 MB, 1280x720)
1.2 MB
1.2 MB PNG
>>
File: 1765244598961505.png (1.79 MB, 1376x1152)
1.79 MB
1.79 MB PNG
1girl soaking wet
>>
>>107981928
wow
>>
>>107981933
>nooo asian don't look the same!
>*zit happens*
>uhh hmmm
>>
>>107981919
2 years ago maybe
>>
>>107981933
repost
>>
this tune is extremely good https://civitai.com/models/860092/kegant?modelVersionId=2584964
and chroma 2 is looking good already. I wonder if he will keep training klein or switch to z image base if it releases?
>>
>>107981922
fuck you benchod
>>
>https://huggingface.co/Tongyi-MAI/Z-Image-Base
https://huggingface.co/Tongyi-MAI/Z-Image-Base
>https://huggingface.co/Tongyi-MAI/Z-Image-Base
https://huggingface.co/Tongyi-MAI/Z-Image-Base
>https://huggingface.co/Tongyi-MAI/Z-Image-Base
https://huggingface.co/Tongyi-MAI/Z-Image-Base
>>
File: radiance_x32.jpg (130 KB, 1280x1280)
130 KB
130 KB JPG
>>107981888
optional for sfw
>>
File: 1760445724363845.png (2.27 MB, 1472x1056)
2.27 MB
2.27 MB PNG
>>107981936
you dont rike?
>>107981941
might actually be but I dont remember if I had it posted in the nuked thread :(
>>
>>107981941
actually it isn't, try to spot the difference tho ;)
>>
It's funny how the Z Base fanboys are already changing their tune and saying that it's going to be crap, but that the finetunes will achieve great things.

It's a bit reminiscent of the movie โ€˜Downfallโ€™, where Hitler conjures up the final victory and Steiner's army will strike the Russians.

Pure comedy.
>>
>>107981951
HANDS
A
N
D
S
>>
>>107981951
>brown goddess
i will now jack to your slop
>>
File: 1751138940205498.png (28 KB, 621x526)
28 KB
28 KB PNG
>>107981949
ITS REAL
>>
>>107981957
bruh its useless, this guy has been spamming for the past week his completely melty chroma x32 gens. before that he spent a whole month spamming the base radiance gens.
>>
>>107981951
>dark skin
*vomits*
>>
>>107981960
>base
>vae
retard
>>
>>107981951
this will be ai in 2018
>>
File: Flux2-Klein_00368_.png (1.47 MB, 1280x720)
1.47 MB
1.47 MB PNG
>>
File: 7807312.png (1.96 MB, 1152x1152)
1.96 MB
1.96 MB PNG
>>107981962
here, a klein gen
>>
File: radiance_x32.jpg (202 KB, 1280x1280)
202 KB
202 KB JPG
>>107981957
creative interpretation hands
>>
everything sucks
everything is gay and retarded
it's over
>>
>>107981976
chroma is so good!!!
>>
>>107981976
what are those things hanging from her "hand" wtf
>>
File: 1749272618236245.png (241 KB, 293x399)
241 KB
241 KB PNG
>>107981951
when are ai models gonna learn never to do skin like this?
>>
I notice WAN2.2 isn't in the OP anymore, is LTX-2 considered to be a flat upgrade or was there just not room and WAN2.2/2.1 are still the most stable/mature t2v/i2v models but LTX-2 is newer and therefore included as news?
>>
File: 1156313243.png (3.86 MB, 2016x1152)
3.86 MB
3.86 MB PNG
>>
>>107981990
nerd
>>
>>107981972
do an attack helicopter in terraria
>>
>>107981999
Tell me
>>
>>107982002
I don't have a terraria LoRA.
>>
File: Flux2-Klein_00369_.png (499 KB, 640x594)
499 KB
499 KB PNG
>>107982002
>>
what's the point of these reposts
>>
>>107982009
how do you know?
>>
File: radiance_x32.jpg (169 KB, 1280x1280)
169 KB
169 KB JPG
>>107981981
most likely nothing to worry about.
>>
>>107982011
that gave me a heli attack 2 flashback
>>
File: F2Kb__00015_.png (1.93 MB, 1024x1024)
1.93 MB
1.93 MB PNG
>>107982011
>>
>>107982028
slop
>>
File: 708.png (3.98 MB, 1949x1344)
3.98 MB
3.98 MB PNG
>>107981978
nah, klein, qwen and zit are awesome, I never imagined we'd get something like that to run on local, specially something like klein
>>
>>107982022
Because I made the image and the LoRA and I do not have a terraria LoRA.
>>
File: 156562284.png (3.15 MB, 1248x1824)
3.15 MB
3.15 MB PNG
>>
>>107982034
did you check?
>>
what is the first thing you'll generate with z image base releasing today?
>>
>>107982033
most of those lego parts don't exist
>>
>>107982038
okay i just checked how did you put it there i'm scared now
>>107982040
1girl standing
>>
>>107982040
>using base to gen
double retard
>>
>>107982038
Certainly, let me go check.
*Goes to back of store, looks at phone for a minute and comes back out*

Yeah I checked.
>>
Why are three anons pretending to be me?
>>
>>107981990
Wan is still very much the go to and the ops of these generals have been hijacked by the mentally ill who should not own a computer.
>>
>>107982042
not surprising that genai invents things that don't exist yet. it is supposed to do that for most uses.

constraints to just real lego parts would have to be trained separately.
>>
z image better be fucking amazing with all this blue balling
>>
>>107982002
>do an attack helicopter in terraria
I'm so fucking brainrotted I thought this phrase meant you were telling him to kill himself in Minecraft for being a trainee

>>107982040
>what is the first thing you'll generate with z image base releasing today?
Voyeur and creepshots of little girls in frilly microbikinis barefoot or in flip flops
But like the super intimate angles that really juxtapose how small and vulnerable and delicate they are

>>107982050
>using base to gen
>double retard
Yeah to check knowledge silly
>>
File: 1984587747.png (3.53 MB, 1248x1824)
3.53 MB
3.53 MB PNG
>>
File: 04296.jpg (728 KB, 1989x1328)
728 KB
728 KB JPG
>>107982042
They aren't lego parts thoughevverbeit, they are toy brick construction, plastic interlocking blocks, 3D relief sculpture, isometric depth, textured stud surfaces, layered geometric assembly, toy photography style, clean studio lighting, sharp edges, post-impressionist brick art, high-detail plastic molding, matte finish.
>>
>>107982021
There is a schizo who runs a bot that reposts old images and old posts in an attempt to ruin the threat, the same schizo who seethes about the OP constantly. Notice the massive surge in inorganic low-quality posts.
>>
>>107982063
kek true ani is a mentally ill avatarfagging retard
>>
File: 1753420599483241.png (659 KB, 800x800)
659 KB
659 KB PNG
>>107982040
>what is the first thing you'll generate with z image base releasing today?
nipples
>>
>>107982071
>z image better be fucking amazing with all this blue balling
It's not gonna be that good. I just hope the collective consciousness switches to focusing on local video afterwards
>>
>>107982075
zoomer
>>
File: 95555458.png (2.74 MB, 1264x1040)
2.74 MB
2.74 MB PNG
>>
>>107982082
those are pepperonis thoughverbait
>>
File: 2744012248.png (3.35 MB, 1824x1248)
3.35 MB
3.35 MB PNG
>>107982082
reminds me of this song https://www.youtube.com/watch?v=4BSn_ENB0IE
>>
>>107982071
z image is for training loras and for finetuning.
>>
z-image but for anime, completely killing illustrious and nai
>>
>make essentially a big asian girl lora into a model
>people love it and think you are the next coming of SD
>demand the holy base
>keep delaying as long as you can as you know you can't deliver
it will be funny seeing the community slowly realize they were duped, will probably take 1-2 months after people try fine tuning to extract the magic juice and realize it never existed
>>
>>107982100
FINETUNESSSS WILL FIX IT!!!!!!!!!!!!!!!!!!!!!!!!!
>>
>>107982099
fat pig
>>
File: 1367657413.png (3.19 MB, 1248x1824)
3.19 MB
3.19 MB PNG
>>107982112
as well she should be
>>
File: F2Kb__00016_.png (1.79 MB, 1024x1024)
1.79 MB
1.79 MB PNG
>>107982031
curse vishnu, you signal user
>>
>>107982115
>boob biger than head
>>
File: hf.png (165 KB, 944x761)
165 KB
165 KB PNG
nice deepfakes
>>
>>107982110
this, but unironically
>>
>>107982130
no
>>
I think Europeans are a lot more like the indians.

Like we have whole genres of humor they don't even accept as imports at their terminals.
>>
>>107982134
shutup fucking benchod
>>
>>107982132
the most used models are finetunes
>>
giving zit another test, now with a more coherent auto-gen prompt.

Breaking prompts is useful for revealing the actual nature of how models work. I broke zit previously, forcing it to reveal the obvious traits of loras.
>>
File: 336696108.jpg (345 KB, 1503x1344)
345 KB
345 KB JPG
>>
>>107982071
Lots of anons have been saying many times to temper your expectations. Personally, I think they will release something but we'll get cultured. Didnt tonguey say themselves its not as good as zurbo?

>>107982085
This. We have an image model every week, prefer video.
>>
>base isn't as good as distilled
No way
>>
File: zit_00023_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>107982155
Here's Zit.
>>
>>107982159
The z image examples chinese anime man have been posting look terrible. It will need a lot of finetuning, beyond just loras because the base appears to be falling apart
>>
>>107982129
*laughs in 80b*
>>
File: zit_00024_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>107982179
paintful how it's breaking down. Here's the other gen, since I do 2 up with zit.

>>107982175
There is no base, really. zit is just an oriental scam.
>>
File: 1759990701136567.mp4 (438 KB, 220x392)
438 KB
438 KB MP4
>>
You guys won't fucking believe what just flew over my house
>>
saying it now: klein 9b absolutely destroys z-base, it's not even close.
>>
File: 93740.png (630 KB, 1024x1024)
630 KB
630 KB PNG
>>107982202
z image base?
>>
>>107982076
This Chroma?
>>
what about z image base turbo
>>
>>107982213
zit
>>
>>107982197
closed source trash paypig.
>>
>>107982223
ltx isn't closed source retard
>>
>>107982221
Neat.
>>
>>107982227
but did he died?
>>
>>107982223
>instinctively assuming it's paid because it looks better than most localslop
localkeks starting to internalize api victory
>>
File: F2Kb__00017_.png (1.98 MB, 1024x1024)
1.98 MB
1.98 MB PNG
>>107982197
neat. Mall World -ish.

I've had dreams involving like.... weird overpasses.

>>107982194
>>107982179
annnnd compare Klein 9B base.

same prompt.

:^)
>>
File: radiance_x32.jpg (230 KB, 1280x1280)
230 KB
230 KB JPG
>>
>>107982204
correct:
>>107982252

the prompt:

a photograph. bleach washed kodachrome. the location is pearl desert-canyon , in the left: astral the-angelic-being snaring , in the middle: arcane the-fate-spinner

thundering , in the right: crimson-laced shattering the-dwarf-king scrying , a woman running amidst lime clawed coyote kicks green bold penguin , while penguin leaps cyan dull possum , while possum meows indigo chubby crab
>>
Hello all.

They have us working overtime to release Z-image. Omni however has been cancelled indefinitely.

Good night.
>>
>>107982262
and ofc I could run it through an llm to make it more natural language, but I want to offer models a chance to do their worst.
>>
Yes, I am the man who single-handedly destroyed z-image base.
>>
Who's this new schizo?
I really don't like to set up filters
>>
>>107982093
>zoomer
It's a good time to be young and not have kids yet

>>107982124
>boob biger than head
One of the best parts of AI is allowing me to experience over 80% of what I want from women with tits larger than their heads, without the massive sacrifices in lifestyle and increase in stress you had to do for most of human history to get the same thing

And I only care about super boobs for like 10 minutes a week at most when you had to commit all-in back then
>>
>>107982296
4chin filters are easy as fuck. Just add the names and terms you dont want to see. And at the "export settings" save that url then paste it in the url bar the next time you visit 4chins, it automatically reactivates the filters. Or think you can bookmark the url somehow to save pasting it in everytime (havnt tried that yet)
>>
>Ace step 1.5 delayed
>Z-image base entering an every tightening "soon" spiral

It's over.
>>
>>107982387
No, we want you to go hang off a rope
>>
>>107982393
I'm not catjak
>>
File: 85.png (1.7 MB, 960x1200)
1.7 MB
1.7 MB PNG
>>107982380
>>It's over.
>Klein is open source and runs on local hardware
It has barely even started, how can it be over?
>>
>>107982398
this. catjak runs the automated spambot, proxies, shits his pants and screeches about ani all the time. he has to go
>>
>>107982398
>>107982406
>not even 3 hours of sleep
Julienxisters... i think it might be over.....
>>
>>107982333
>he spends 10 mins jacking per week
boomer
>>
>>107982417
uh oh! troonjak melty!
>>
>>107982403
>Klein
Go ahead and generate nipples with it
>>
>>107982430
Who's that?
Your bull?
>>
>>107982403
we need z image to release so everyone realizes how much better klein is. z-base actually looks terrible in comparison
>>
>>107982449
But enough about your current predicament
>>
>>107982387
Julien is a literal schizo who talks about doxxing people and posts death threads
>>
>>107982460
not my problem. your grandmother and soul bonded partner can change your diaper for you.
>>
>>107982478
I didn't know catjak had a name! what a weirdo
>>
>>107982433
>>107981393
>>107981459
>>
>>107982478
meanwhile ranfaggot literally posts "kill ani" all the time
>>
>>107982478
julien is the one that claims you have sex with your grandmother and you are the one that posts gens of your grandmother as a sex slave
>>
File: 1692386474.jpg (268 KB, 1824x956)
268 KB
268 KB JPG
>>107982213
it's noob vpred actually.
>>
So what killed LTX2?
>>
Tranustudio
>>
>>107982512
DiTlers on suicide watch
>>
File: 4198.png (2.27 MB, 1024x1312)
2.27 MB
2.27 MB PNG
>>
>>107982518
Killed? I rarely use wan these days. Audio adds to much. I'm personally waiting for 2.5 till I train my own stuff though since video is so expensive to train. I know several other finetuners are doing the same.
>>
File: 563410.png (2.26 MB, 1040x1296)
2.26 MB
2.26 MB PNG
kek
>>
>>107982532
show eardrums. Ltx gives me tinnitus
>>
>>107982540
use the audio fix nodes they put out, no im not holding your hands, check the bandico discord and grab one of the million WFs
>>
>>107982526
Holy shit. This is literally me.
>>
File: 608424224.mp4 (3.41 MB, 576x832)
3.41 MB
3.41 MB MP4
It will forever remain a mystery to me how people get anything good out of ltx-2 distilled
>>
>>107982552
>check discord
how about u kys
>>
>>107982564
then suffer from the shitty gens / wfs shared here and never know good gens
>>
>>107982552
Their audio normalization sampler makes things 10 times worse, it makes voices sound even more metallic
>>
>>107982575
no it doesn't. Get a better WF from someone who used it correctly then. Also use that big I2V lora, it makes I2V night and day better
>>
>>107982563
1. Don't use distilled model, use dev version with distilled lora at 0.6 strength at 9-10 steps, optionally add details lora too
2. 720p is the bare minimum to get decent and consistent results, 1080p is preferable
>>
>>107982563
Curious if you're using the new i2v adapter LoRA.
>>
>>107982563
grab a good WF from the bandoco discord, not the shit seen here. The I2V adapter, the audio nodes, use BOTH the temporal AND spatial upscaler, use the new sigmas, Also for super fast moving scenes use a higher base fps
>>
>>107982563
It's a model made by jeets.
>>
wan is still king. ltx is just too fried, makes my ears bleed and can't make porn very well
>>
File: o_00001_.jpg (1.17 MB, 2304x1792)
1.17 MB
1.17 MB JPG
>>
>>107982600
>o_00001
>>
>>107982599
try the nsfw merge, its not bad. Reminder than wan took months to get decent at nsfw. The fried and ears thing is a bad wf thing
>>
>>107982511
julien is correct
>>
>>107982610
show me an output before shilling without proof
>>
>>107982495
I wish I trusted chroma tunes, I guess it's the best we got though
>>
>>107982610
nta, but why even really bother with ltx 2 when it's basically confirmed that something better is coming in short order?
>>
>>107982632
what if it isn't?
>>
>>107982599
99% of people are using it like wan which it can not do. It has tons of stuff going on like temporal compression. Grab a better WF

>>107982619
https://files.catbox.moe/te72cw.mp4
https://files.catbox.moe/cw8v8z.mp4
>>
>>107982619
https://files.catbox.moe/njl5di.mp4
https://files.catbox.moe/f1zbhx.mp4
https://files.catbox.moe/c1gcvy.mp4
>>
>>107982651
you have to be trolling. these voices are so metallic it gave me a headache withing the first few seconds. fuck you
>>
It's 10PM in China. It could release any second now...
>>
>>107982651
>>107982661
your licence to recommend models is revoked. you are a jeet slopper that doesn't know quality
>>
does analstudio support ZIT?
>>
File: 93796983.mp4 (3.58 MB, 768x1152)
3.58 MB
3.58 MB MP4
>>107982586
I guess higher res does seem to produce better results.
>>107982587
ya
>>
File: 06.png (2.15 MB, 1168x1168)
2.15 MB
2.15 MB PNG
>>
>>107982526
>>107982538
these can go wild on SA tiktok
>>
>>107982679
nigger show me wan doing good audio or long ass videos, or really anything like these without months worth of loras
https://files.catbox.moe/yt83gg.mp4
https://files.catbox.moe/p0cgcm.mp4

and on top of that they already said 2.1 and 2.5 would fix that
>>
File: o_00002_.jpg (1.48 MB, 2304x1792)
1.48 MB
1.48 MB JPG
>>
>>107982701
You better not have something like "girl trips and falls in the snow" as your prompt
>>
>>107982712
>SA tiktok
What does SA stands for here?
>>
with a natural language model like z-base is, for lora training a character is it better to add the character name into the captions.

So if you auto-caption 50 images, should you then go back to add the character name instead of say "green haired women"
>>
>>107982743
Yes. Unless your character has tons of hair colors / styles only caption the 'mutable' aspects of the image. The parts you DONT caption will be cooked into your lora. And / or you can do what you said and use a keyword. Then you use the keyword instead of just applying the lora. That has a tiny bit more control.
>>
>>107982717
wan doing long videos is already a thing. no video model can beat vibe voice at audio and the vids are all nsfw which ltx cannot do. now begone jeet!
>>
>>107982741
Sexual Assault.
>>
>>107982769
nta, but this is just Wan cope. It's on the way out and you need to accept that.
>>
>>107982756
hey anus does anustudio support ZIT and edit models yet
>>
>>107982717
nigger stop showing me slop that gives me headaches
>>
>>107982769
wan can not do voice / motion matching like ltx, its not even close. At least have experience with what you are talking about before you open your mouth.

https://files.catbox.moe/wunip1.mp4
https://files.catbox.moe/jftiwc.mp4
https://files.catbox.moe/eea5wn.mp4
https://files.catbox.moe/k29y60.mp4
https://files.catbox.moe/m3tt74.mp4
>>
File: screenshot.1769523738.jpg (174 KB, 1116x367)
174 KB
174 KB JPG
https://github.com/modelscope/DiffSynth-Studio
https://x.com/bdsqlsz/status/2016125638932095201
they said z-image has already been released lol

also, what is Z-Image-i2L? Z-Image-Lora?
>>
>>107982787
go make an issue about it instead of screeching here niggerjak
>>
Is it true that natural language based loras work better with fewer images like ~20, compared to SDXL/anime models just working better mostly with just as many good images as you can find
>>
File: 31892292.mp4 (3.52 MB, 832x1280)
3.52 MB
3.52 MB MP4
>>107982727
No, but it is stupid.
>A burst of movement cuts across a quiet winter landscape as footsteps crunch faster and faster, breath puffing in quick clouds. The run turns reckless, shoes skidding, arms windmilling, and then thereโ€™s a sudden leap fueled by momentum rather than planning. Gravity takes over. The body disappears into a deep, powdery drift with a soft whumph, snow erupting outward in a dull, airy explosion that swallows all sound for half a beat. The world settles. Wind sighs. Then motion resumesโ€”only the lower half remains visible, stuck upright in the snowbank like an exclamation point. Boots kick weakly at first, then with more urgency. Snow squeaks and collapses with each wriggle, tiny avalanches sliding down the sides. A muffled laugh and an indignant huff vibrate through the snow, followed by frantic, side-to-side wiggling as the trapped figure tries to regain balance. Each movement produces new sounds: the soft grind of packed snow, the hollow thud of shifting weight, the faint rustle of fabric brushing ice crystals. The wiggling grows more exaggerated, almost rhythmic, as if the snow itself is resisting, holding fast while the lower half stubbornly refuses to stay still. The scene lands as pure winter comedyโ€”energy, momentum, and gravity conspiring in a brief, ridiculous stalemate.
>>
>>107982798
>wan can not do voice / motion matching like ltx,
good because it's completely shit on ltx and I don't want that feature if it's complete garbage. I stopped watching these btw you are trying to blow out my eardrums clearly
>>
>>107982807
yeah everyone has Z-image except us
>>
ltxv can look better than wan if your willing to wait for the same gen times.
https://files.catbox.moe/g4kecb.mp4
https://files.catbox.moe/2tux6w.mp4
https://files.catbox.moe/secyev.mp4
https://files.catbox.moe/3udh6d.mp4
https://files.catbox.moe/g9dzvj.mp4
https://files.catbox.moe/ydn07o.mp4

>>107982832
you didn't even watch the vids then. Its night and day better than anything else. It was the main thing people were amazed at
>>
File: ComfyUI_07932.jpg (2.99 MB, 2160x1440)
2.99 MB
2.99 MB JPG
>>107982040
Nothing fun until I do some training.
>>
>>107982825
Holy slop. Jesus fuck, anon, stop using whatever LLM you're using. Write the prompt yourself, and describe things in simple prose
>>
>>107982825
>โ€”
>>
>>107982848
I tried that, but it was even worse. Seems like the model prefers long-slop prompts. If you got any prompts you think would work I'd be happy to try them.
>>
>>107982825
i like how she turns asian halfway through
>>
>>107982861
go check the bandico discord, someone made a node and a qwen finetune that helps a ton
>>
>>107982843
>you didn't even watch the vids then.
that is what I am telling you yes. congrats you finally read what I am saying. I am not watching any more vids because you gave me a headache and they all sucked up to that point. you suck, bandodoco sucks and ltx sucks. fuck you
>>
>>107982861
also your WF is shit. You are clearly not using temporal upscaling / the right scalars. I'm not helping every limp dick so >>107982874
>>
>>107982882
>The video gave me a headache.
If you're physically incapacitated by a video then you're the issue.
>>
>>107982861
don't listen to the retard
>>
>>107982882
so your just a troll then, got it.
>>
>>107982861
https://ltx.io/model/model-blog/prompting-guide-for-ltx-2
Give this a read. Or use this as an example for your (((((enhancement))))) prompt if you're illiterate and/or indian
>>
>>>/wsg/6078080
>Nobody ever commented or even insulted my LTX gens.
>>
>>107982775
what is sexual assault tiktok and how do these images relate to it lmao
>>
his videos are like someone screaming into your ear through a soup can, don't do it anons
>>
>>107982913
they are literally real music overlaid them for lip sync, thx for admitting you are trolling
>>
>>107982922
why didn't you start with those instead of raping my ears? also this isn't a case for using ltx to gen the whole thing at all and it's lipsync sucks
>>
>>107982935
>it's lipsync sucks
its literally sota. Better than even sora 2 / veo 3 there
>>
File: YJ Chick.webm (3.85 MB, 852x1280)
3.85 MB
3.85 MB WEBM
>>107982904
It looks both undertrained and fried, but they all kinda look like that, so...
>>
>another day of no base
what's our cope for tomorrow
>>
File: 854096928.mp4 (3.47 MB, 768x1152)
3.47 MB
3.47 MB MP4
Ah fuck it, someone will eventually come up with a finetune if it's worth it. At least it's funny.
>>
>>107982944
>Better than even sora 2 / veo 3 there
lol stop

>>107982946
this
>>
File: o_00004_.jpg (885 KB, 2304x1792)
885 KB
885 KB JPG
>>
>>107982897
>your
>>
>>107982421
>he spends 10 mins jacking per week
No anon I spend 10 minutes per week on super booba women because it is not my main fetish but I am still a straight white male

>>107982944
>its literally sota. Better than even sora 2 / veo 3 there
>>107982968
>lol stop
LTX does actually have SOTA lip sync matching Sora or veo it just sucks for literally everything other than taking heads. No movement or dynamism in the gens at all and don't even get me started on the aesthetic Holocaust that is LTX T2V, they don't even have an excuse for t2v being so bad when models like mochi was better and that was when no one knew what they were doing for video at all
>>
>>107982968
it is. sora 2's I2V is utter garbage
>>
>>107983013
>No movement or dynamism
Use the I2V adapter. Its night and day better
https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa

(nsfw)
https://files.catbox.moe/rt6pl7.mp4
https://files.catbox.moe/6g5l6g.mp4
https://files.catbox.moe/gnb035.mp4
https://files.catbox.moe/souqc7.mp4
https://files.catbox.moe/gzmnno.mp4
>>
>>107983013
>super booba
what even is this? boobs that can fly? invisiboobs? do they shoot lazers from their nipples?
>>
>>107983033
dude stop spamming the thread with these garbage gens.
we get it you love plastic slop.
buzz off
>>
>>107983048
stop trolling
>>
this released pretty recently btw, its a bandaid till 2.1 for proper I2V support
https://www.reddit.com/r/StableDiffusion/comments/1qnvyvu/ltx2_imagetovideo_adapter_lora/
>>
I feel bad for debo :(
>>
>IT'S SOTA YOU HAVE TO BELIEVE ME
>JUST DON'T GENERATE AUDIO, PROVIDE EXISTING AUDIO
>USE WAN FOR NSFW MOTION
>USE THESE SNAKE OILS
>JUST REFINE IT WITH WAN
>THE LIPSYNC IS SOTA! STOP CALLING IT SLOP!
>YOU HAVE TO BELIEVE ME!
please fucking stop you are embarrassing yourself. it's fucked at a pre training level and no cope can fix that
>>
File: o_00005_.jpg (1.26 MB, 2304x1792)
1.26 MB
1.26 MB JPG
>>
>>107983033
Nah man this shit isn't remotely comparable to using WAN2.2 14B at all. This is like Wan 5B tier. I knew the LTX-2 fags were full of shit.
>>
>>107983079
yawn
>>
>>107983079
Why so negative?
Things take time and until then I will wait before I can make memes of green frogs laughing and giving specific insults to anons in threads
>>
>>107981869
I tried the x32 proto wf and got tensor size errors, can you share your WF
>>
>GF poured water on my GPU because she saw me looking for loras on CivitAI and thought it was porn
It's over for me bros. Stuck on a shitty Laptop from 2010 from here on out.
>>
>>107983091
gota say, stop comparing gens that take 30 secs vs wan gens that take 10+ mins. 10+ min ltx gens are rarely posted but they look far better. They are bigger than the catbox file limit though so you will have to check reddit or discord, I often post to the bandoco discord in the ltx gens channel
>>
>>107983079
I don't really have a dog in this fight, but you're the one coming off as coping and defensive.
>>
File: file.png (36 KB, 779x55)
36 KB
36 KB PNG
>>107982959
There's still 48 minutes left.
>>
>>107983112
You're replying to a loser who's sole function is to ruin the vibes, call him disabled and move on.
>>
>>107983033
not sure if trolling or fecaloid
>>
>>107983105
your fault or choosing biological pussy instead of embracing digital waifu
>>
File: 1751302104837070.png (1.34 MB, 1168x880)
1.34 MB
1.34 MB PNG
Gooning and sooning in the septic tank
>>
Is klein 9b just generally a higher quality than 4b. or is there anything radically different about it? Does 9b also change the edited images liek changing brightness, saturation, like 4b does?
>>
File: Flux2-Klein_00376_.png (3.13 MB, 1920x1072)
3.13 MB
3.13 MB PNG
>>
What's that site that shows deleted civitai entries again?
>>
>>107983187
civitarchive
>>
File: Flux2-Klein_00377_.png (2.73 MB, 1920x1072)
2.73 MB
2.73 MB PNG
>>
>>107983191
thx
>>
File: Flux2-Klein_00382_.png (1.29 MB, 1280x720)
1.29 MB
1.29 MB PNG
>>
uh oh.... z image base is looking bad
https://x.com/bdsqlsz/status/2016128379842658815
>>
>>107983173
>Is klein 9b just generally a higher quality than 4b.
yes. 4b is just their model for poor people.
>>
>>107983105
Sue her for damages and dump her.
>>
>>107983229
oh no, lol. This is BAD. This is gonna be hilarious.
https://x.com/bdsqlsz/status/2016128384095683065/photo/1
>>
>>107983229
we already knew base would look worse than turbo. the entire point is finetuning and making loras to be used with the turbo. the loras will have more flexibility and be of higher quality.
>>
>>107983236
No, I mean BAD >>107983235
destroyed even, they fucked something major up
>>
uh oh kleinturdy melty
>>
>>107983229
since they retrained it extensively apparently will loras be compatible with ZIT or is it basically a new different model by now
>>
File: G_q3CfDWYAAqePQ.jpg (289 KB, 1152x864)
289 KB
289 KB JPG
look at this, the details are completely destroyed
>>
>benchmark pics
yawn
>>
okay now that z-image flopped HARD: what are we waiting for next?
>>
File: G_q6EJRbAAMB94b.jpg (282 KB, 1152x864)
282 KB
282 KB JPG
its SO bad and this is what they chose to share. People are gonna be pissed
>>
>>107983255
somehow looks worse than chroma v1
>>
File: G_q4HqQbAAYQzC5.jpg (268 KB, 1152x864)
268 KB
268 KB JPG
legit worse than chroma radiance and at least that is not done pretraining
>>
File: 9.png (1.58 MB, 1008x1232)
1.58 MB
1.58 MB PNG
>>107983262
A new NAI leak
>>
File: o_00007_.png (3.71 MB, 2304x1792)
3.71 MB
3.71 MB PNG
>>
>china warned him that it'll look bad at first
>anon warned him that it'll look bad at first
>he's still surprised that it looks bad at first
LEL
>>
File: G_q4oMbW0AAsMWE.jpg (128 KB, 1152x864)
128 KB
128 KB JPG
nice shoe... it seems to be missing half of it. Gonna gen some mutated bodies first thing
>>
another sd3 incoming, local loses yet again! keep licking those license boots, maybe another de-distill will save you!
>>
lawl this was foretold
>>
>>107983277
man i am so excited for character reference v2 and style reference v1
also nice gen
>>
>>107983283
you dont understand anon, this is night and day worse than klein base.
>>
>>107983283
it's just one klein shill. they'll fuck off eventually
>>
>>107983283
>ITS A HECKING BASE MODEL FOR FINETUNES YOU BIGOT!!! ANONS WILL BE PROMPTING KINO TRUST ME!!
the cope begins.
>>
>ltx2 looks bad
>uhm ackshually your workflow is shit!!!
>z base looks bad
>*crickets*
Curious
>>
Anyone know what happened to Debo? Five days missing, did he rage quit and retire from the hobby?
>>
>he doesn't know turbo was hyper tuned for asian 1girl
>he expects base to have the same quality
Oh I'm getting trolled aren't I
>>
>>107983322
I made plenty of non-asians with excellent quality.
>>
I'll try out Z-Omni-Hyper-Edit-etc whatever it's called now before judgement desu, whats with the negativity, if it sucks it sucks, if it's useable it's useable
bunch of sissies ITT
>>
>>107983250
ani said that it's not gonna be compatible with zit. they changed a lot in the base model, mostly trying to censor it. so they certainly are releasing it but it's useless without finetuning, even for loras
>>
File: klein.png (2.27 MB, 1440x1024)
2.27 MB
2.27 MB PNG
>>107983309
klein is good though, and the flux 2 vae means z image base would have been incredible to be worth using over it for finetunes
>>
watching localkeks cope over finetunes that never arrive is always hilarious. it has been at least 2 years now and the only finetunes of note were for sdxl
>>
>>107983345
>vramlet cope this hard
>>
>>107983335
no need. didn't you see the images? just save your bandwidth and energy. im feeling sleepy
>>
>>107983343
>ani said
Proof?
>>
>>107983344
>klein is good though
it cant learn faces though
>>
File: 03761.png (2.45 MB, 1312x976)
2.45 MB
2.45 MB PNG
>american triple letter agency agents trying to demoralize free and open source chinese models
Not falling for it buddy
>>
Reminder that Klein watermarks your images :)
>>
File: G_qyH6gWcAEqvyI.jpg (218 KB, 1152x864)
218 KB
218 KB JPG
>>107983358
the fuck are you on about, it literally can and its a good thing they didn't include them much in their training to not make them super biased like every other model

this is 50 fucking steps from z image btw. This is far worse than klein it seems
>>
>>107983298
Buy an ad.
>>
>>107983372
>it literally can
it literally cant
>>
File: ComfyUI_07965.jpg (3.67 MB, 1440x2160)
3.67 MB
3.67 MB JPG
>>107983229
>>107983235
It's cope, but Flux SRPO was helped out a lot by custom VAEs to help tame it's grubby output. I'd try those right away.
>>
>>107983374
bought :)
>>
>>107983383
it is already learning characters on chromas tune 3 days in. It 100% can
>>
migrate
>>107983392
>>107983392
>>107983392
>>
Any good 3D to real workflows?
>>
>no collage
>troll changed links
lilbro keeps trying huh
>>
the chinese shill farm is out in full force I see. It really must be releasing today
>>
>>107983391
>implying they tagged any characters because ponyman said no
there will be some character cluster that you can combine with a style cluster and this is a good thing
>>
chinkshit shills have nothing left going for them. z-base flopped terribly, qwen is plastic as ever, glm is a mess, hunyuan is bloated crap. meanwhile klein arrives out of nowhere and delivers way better results in a single model with edit capabilities. no waiting 3 months for klein edit or klein base.
>>
>>107983407
>ponyman
chroma is lodestone, it has nothing at all to do with pony
and lodestones dataset is public. It has everything
>>
>>107983395
what is this crap
>>
>>107983407
>character cluster
keeeeeek
>>
>>107983402
It's NAI
>>
>>107982807
https://www.modelscope.cn/models/DiffSynth-Studio/Z-Image-i2L

lol the diffsynth version got released first
>>
>>107983457
>image2lora
what even is this? Also wtf is dino v3?
>>
>>107983479
a model that take in an image and output the whole lora

it is worse than normal lora training based on their example though, maybe only use it when you lack training data



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.