[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107993481

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
Cookies!
>>
I bought spicy "hot" potato chips for today evening and /g/
>they are not spice
fuck what do I do now
>>
> 320/49
Explains why they had to postpone for 2 months.
Does not explain why only 2 months though.
>>
Chinese culture anon on suicide watch
>>
>>107995571
She looks like a man
>>
>>107995588
yeah that's Brie Larson
>>
>>107995477
blessed thread of frenship
>>
Now that the dust has settled, can Zedit be released already? Klein can't fulfill my needs and Qwen is so fucking slow and smooth slopped
>>
inb4 comfy and python le bad and why isn't anustudio in the OP
>>
>>107995583
>oh no! the rentry links are gone!
>wtf! who added the rentry links?
holy shit. this is the gayest, most retarded shit ever and everyone involved in this schizo rentry link shuffle should be fuckin ashamed
>>
File: o_00074_.jpg (793 KB, 3072x1024)
793 KB
793 KB JPG
>>
>>107995573
the chinese video models needs to start pushing more for creativity with concepts than just "muh realism".
>>
can the ani haters just dilate already? it's so fucking hard to have a discussion when schizos and trolls are spamming this childish faggotry
>>
>>107995573
z-image edit when
>>
there he is. he's doing the routine. new day, same song and dance. zzzzzzzzzzz
>>
>>107995571
Does it know her fucked up toes
>>
finetune when
>>
>>107995667
good question but i doubt it, the likeness isn't that great to begin with fellow fa/tv/irgin
>>
File: zit.png (2.8 MB, 1024x2304)
2.8 MB
2.8 MB PNG
>A still frame from the movie The Empire Strikes Back, showing Luke Skywalker with a blue lightsaber fighting Darth Vader with a red lightsaber, on a metal catwalk in Bespin. Their lightsabers are locked in a clash, with Luke straining to resist Vader's superior strength. The metal guardrail and nearby machinery are scarred and sparking from stray slashes.
These are ZIT, and the next post will have non-Turbo for comparison.
>>
>>107995681
>the nu-starwars lightsabers with the guards
lol overfit on disneyshit
>>
File: zi.png (3.58 MB, 1024x2304)
3.58 MB
3.58 MB PNG
>>107995681
And here's Z-Image non-Turbo for the same three corresponding seeds.
>>
>>107995689
it's over
>>
>>107995672
never, and we have it it will be very subpar, and there won't be any merges/mixes to fix it too, its all downhill from here (I for one am happy enough with klein to make memes and fool around)
>>
>>107995623
she looks like an ex-onlyfans model that converted to christianity and now speaks about the damages of the porn industry
>>
>>107995695
but kathleen is gone now....
>>
>>107995704
Her vaginal secretions will forever stain and stink the franchise
>>
File: zi-closeup.png (3.54 MB, 1024x2304)
3.54 MB
3.54 MB PNG
>>107995689
I know, right? Triggered the fuck out of me.

>>107995693
I also tried adding Mark Hamill to the prompt and getting a closer shot to see if his face would come out better, but it seems Z-Image doesn't know his face that well.
>A close-up still frame from the movie The Empire Strikes Back, showing Mark Hamill as Luke Skywalker wielding a blue lightsaber, fighting Darth Vader wielding a red lightsaber, on a metal catwalk in Bespin. Their lightsabers are locked in a clash, with Luke being pushed back and visibly straining to resist Vader's superior strength. The metal guardrail and nearby machinery are scarred and sparking from stray slashes.
>>
>>107995704
they poisoned the model, it's too late
>>
Feeling horny, any good loras or tunes yet?
>>
>>107995726
yeah
>>
>>107995726
civitai.com/models/2182021/z-wedgie-v2-is-slider-trained-on-z-image-base
>>
>>107995693
>>107995711
Forgot to mention these used 25 steps for non-Turbo, so going for longer might improve details. The close-up versions used a different set of seeds than the other two.
>>
>>107995739
>All sample images are made on Turbo.
lol
>>
>>107995742
25 steps sounds too low for Base
>>
>>107995711
Try to add some retro words to your prompt. The old trilogy isn't that polished.
>>
>>107995742
25 steps sounds too high for base
>>
>>107995721
no, people will post here until the next thread which may or may not have the rentry links, until the next thread which may or may not have the rentry links, rinse and repeat, and no one will care except for a couple of insane, demented losers
>>
What's stopping me from gathering a dataset on my own and just finetuning it on my own PC
>>
File: o_00075_.jpg (1.18 MB, 2304x1792)
1.18 MB
1.18 MB JPG
>>
File: Flux2-Klein_00948_.png (1.96 MB, 960x1392)
1.96 MB
1.96 MB PNG
>>
>>107995793
time
>>
God this shit sucks at guns even in base.
>>
>>107995756
The template suggests 30-50, but defaults to 25. I might try some 50-step runs for comparison.
>>
>>107995787
pretty sure the only guy who cares is the schizo who pretends to be ani and ran and keeps samefagging both sides
>>
>>107995801
But what if I have patience and I'm willing to wait?
>>
>>107995793
entropy
>>
File: 5678430561.png (2.22 MB, 1024x1280)
2.22 MB
2.22 MB PNG
>>107995796
Muadib
>>
>>107995793
It's gonna take forever on a single gpu. (unless you actually have a big setup then please go on)
>>
>>107995810
base slop
>>
>>107995808
nothing is stopping you. this guy is finetuning chroma on one (1) 4090

https://huggingface.co/SG161222/SPARK.Chroma_v1
>>
>>107995812
I have a 6000, can I do it?
>>
>>107995805
how do you explain the celeb spammer fanning the flames? genuine or a unrelated troll?
>>
>>107995803
When I started texting I went with that too, results become notceably better at around 40-50 steps. But takes twice as long too
>>
File: Zimage_base__00197_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>
>>107995805
I'm pretty sure I don't give a fuck, faggot
>>
>>107995823
its like a open wound, you have the main culprit but then bacteria starts to grow too
>>
>>107995803
50 is def the way to go
>>
>>107995818
Then why aren't anons in this thread making the finetunes they want instead of wishing for those finetunes to materialize?
>>
>>107995831
proof?
>>
>>107995803
comfy always sets the templates to like 20 steps when 50 is recommended
>>
>>107995819
In something like 5 years or so, yes, no one is stopping you
>>
>>107995840
because most are retarded poors
>>
>>107995802
>even in base
idk about the usage of "even" here, Base is not supposed to be "better" than Turbo as that model had RL aiming for higher quality and better anatomy. Base is just supposed to be "less slopped" and easier to fine-tune. You guys are still stuck in the early SDXL days where a distilled model automatically means body horror
>>
>>107995840
Because I can't afford my card to be busy 24/7 and basically locking me out of doing anything gpu intensive during training. I already train loras overnight when I sleep
>>
>>107995557
>Apparently turbo loras work with the base if you bump up the strength to ~2.5
tried that and just turned things into mush
>>
>>107995848
I don't get it, if it is so hard and slow how are people making illustrious tunes?
>>
cozy bread
>>
>>107995856
It's all lora shitmixes.
>>
>>107995830
thats much better than my attempts, nice
>>
>>107995856
>how are people making illustrious tunes?
They have access to gpu clusters or are mislabeling their loras as finetunes, some people also do very small scale fine tunes with few images.
>>
>>107995818
>Dataset Preparation... 80%
And using wayback it has been like that for weeks, what's going on
>>
File: Zimage_base__00198_.png (2.04 MB, 1024x1024)
2.04 MB
2.04 MB PNG
Muad'dib
>>
>>107995870
tldr
>>
>>107995802
>sucks at guns
try airplanes, faggot. The people compiling the datasets have to be the most emasculated faggots out there. Heavy machinery, airplanes, cars (unless it's on a movie), war machines, every model sucks at those.
>>
>>107995870
if you had to guess, why do you think that is
>>
File: Zimage_base__00199_.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>>
>>107995897
if i knew i wouldn't ask retard
>>
>>107995906
thanks for the aside
now if you had to guess, why do you think that is
>>
File: cum.png (2.26 MB, 1280x1024)
2.26 MB
2.26 MB PNG
>>107995906
He didn't ask for knowledge he asked for a guess
>>
>>107995906
what could possibly be the reason
>>
>>107995921
didn't ask
>>
File: z-image_00061_.png (3.63 MB, 2048x1280)
3.63 MB
3.63 MB PNG
>>
>Loras are inherently limited
>Can't finetune on my own computer
>Nobody else is making the finetunes
What's the point of this then?
>>
>>107995933
fingerbang yourself
>>
File: o_00076_.jpg (1.39 MB, 2304x1792)
1.39 MB
1.39 MB JPG
>>
>>107995933
we told you base was a meme
>>
>>107995818
>finetuning
He's training a lora on a couple thousand images max, merging it into the model, releasing only the merged model, and pretending it's a full finetune.
>>
>>107995928
Yes, I, in fact, did not ask.
>>
File: z-image_00062_.jpg (888 KB, 1280x1920)
888 KB
888 KB JPG
>>
Unironically ZB is SDXL tier. Something about it looks way off.

Flux (with loras) mostly solved the "authentic look", minus the fact that Flux is bogged down by censorship.
But ZB legitimately looks like SDXL gens, everything looking uncanny and made out of mashed potatoes, with unnatural lighting.
>>
>>107995933
>What's the point of this then?
To stop having hopes like a retard and just be glad when good shit pops up, I wasn't expecting for something like Klein to drop in a million years yet here we are, I was also not expecting to have any porn/anime model better then SDXL in a million years and still am not surprised by the fact that we still don't have it; the secret is to have no expectation.
>>
File: 25-vs-50.png (2.35 MB, 2048x818)
2.35 MB
2.35 MB PNG
>>107995803
Here's 25 steps versus 50 for the same seed (1028834557930147). Ehhh. It really likes those crossguards.
>>
File: 1748142215983165.mp4 (3.86 MB, 2048x1152)
3.86 MB
3.86 MB MP4
>we get the best open source world model yet
>people are still yapping about base slop
yall cringe

https://github.com/Robbyant/lingbot-world
>>
>>107995970
that looks terrible in both cases, what sampler and cfg are you using, are you using negatives?
>>
>>107995978
ooh wow look at the slop you can press wasd in!
>>
>>107995992
yet you glorify 1girl slop, curious
>>
>>107995970
they likely used scraped images off google search and if you google star wars algo probably pushes the most recent stuff up so that's what the training baked in
>>
>>107995997
oh do i?
>>
really starting to hate python and the whole community
>see project
>"works for python >= 3.10!" and cuda >=12.4
>clone
>install dependencies with python 3.11 and cuda 12.8
>install fails because some packages with their specified version are not available for that python/cuda combo
>okay guess i will go with the verified combination then
>install
>run script
>errors
this shit happens constantly. what the fuck
>>
>>107996004
yes you do
>>
>>107995997
proof?
>>
>>107995867
What are you talking about, it was at 40% and only recently got to 80%. Probably an issue with rendering on WM. I've been checking once or twice weekly.

t spark preview enjoyer
>>
File: settings.png (46 KB, 395x530)
46 KB
46 KB PNG
>>107995983
Generally defaults. No negative prompt.
>>
>>107996017
multistep is fail
>>
>>107996007
>he doesn't have a llm managing his comfyui, python, etc
imagine posting here and being so outdated
>>
>>107996007
babbie's first time with python dependencies
>>
>>107995978
https://arxiv.org/abs/xx.xx
I dunno if I can trust researchers that can't upload papers properly
>>
File: Zimage_base__00204_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
>>
>>107995978
1girl vagina cave exploration
>>
>>107996017
I haven't tried res multistep so idk but the images I got so far look way less melted on just euler beta. I still have to test more but it feels like negatives are needed with the base and improve output by a lot.
>>
File: choices.png (1.8 MB, 1387x700)
1.8 MB
1.8 MB PNG
guess which is turbo
>>
>>107995739
I guarantee you the same thing would look the same if it was trained on ZIT with adapter
>>
>>107996007
old /g/ returns. slowly, but surely
>>
is base worth it or nah
>>
>>107996042
well yes it would considering >>107995749
>>
File: 1739586020462438.png (2.74 MB, 1344x1344)
2.74 MB
2.74 MB PNG
before and after genning one (1) picture with z-image
>>
>>107996040
Left
>>
>>107996032
Shut up she is finally ready to settle
>>
File: 1745923328822164.png (342 KB, 1687x624)
342 KB
342 KB PNG
The z-image model released yesterday is just "z-image", the version they distilled into z-image-turbo. The true "base" model is the z-image-omni-base which has yet to be released.

I'm not knocking the model released yesterday, I've just seen like 10+ posts getting this wrong today and it was bugging me.
>>
>>107996040
both are z-image
>>
>>107996051
if you are satisfied with ZIT not really, but it knows alot more than Turbo and do much more things, but the gen times are a big turn off. I find it really fun to experiement and see what it can do.
>>
>>107996069
Brap
>>
>>107996063
based fellow hitlerposter
>>
File: o_00077_.jpg (1.52 MB, 2304x1792)
1.52 MB
1.52 MB JPG
>>
people who complain about zib gen times are unemployed loser. i just start genning an image before i got to work, and when i come home i get to enjoy it with a glass of red wine.
>>
>>107996094
ai babble
>>
>>107996097
There is no zib, you meant zi
>>
>>107996097
i gen at work
>>
File: 1655159844515.png (407 KB, 1373x770)
407 KB
407 KB PNG
>>107996069
I mean, clearly?
<picrel https://huggingface.co/Tongyi-MAI/Z-Image
>>
>>107996069
z-image can be confused with z-image turbo. so we need a name for it to differentiate it, and base just stuck. omni is gonna be omni and edit is gonna be edit. might as well call this base
>>
>>107995636
thread quality is noticeably worse without the links. it's a fact.
>>
>>107996125
retard
>>
>julienbake
>>
File: 1758461750228886.png (9 KB, 237x103)
9 KB
9 KB PNG
>>107996125
This anon is very smart
>>
>>107996135
as lon as he doesn't whine about comfy and starts advertising his UI and himself it's okay I guess
>>
File: o_00078_.jpg (1.3 MB, 3024x4032)
1.3 MB
1.3 MB JPG
>>
File: ZIB_00019_.jpg (1.67 MB, 1536x2048)
1.67 MB
1.67 MB JPG
>>
>>107996169
i wonder if omni will have this face disease
>>
>>107996158
toothpick test dude
>>
>>107996158
everything reminds me of her
>>
File: o_00078_.jpg (1.45 MB, 2304x1792)
1.45 MB
1.45 MB JPG
>>
a*i won
>>
Video chads, we're so back

SkyReels-V3
>Set the extension duration, choosing from 5 to 30 seconds.
https://github.com/SkyworkAI/SkyReels-V3
https://huggingface.co/Kijai/WanVideo_comfy_fp8_scaled/tree/main/SkyReelsV3

Self-Refining Video Sampling
>We present self-refining video sampling method that reuses a pre-trained video generator as a denoising autoencoder to iteratively refine latents. With ~50% additional NFEs, it improves physical realism (e.g., motion coherence and physics alignment) without any external verifier, training, or dataset.
https://github.com/agwmon/self-refine-video
>>
File: z-image_00065_.png (1.51 MB, 944x1280)
1.51 MB
1.51 MB PNG
do people still say that Z base failed?
>>
>>107996225
qrd
>>
>>107996213
>thread is dead
yeah he did
>>
stop replying to yourself
>>
>>107996228
How can something that didn't release fail?
>>
>>107996007
if it takes you more than twenty minutes to figure out you should go back to reading the manual and trying to understand what you're actually doing.
>>
>>107996293
no
>>
File: comp.jpg (1.24 MB, 1216x2661)
1.24 MB
1.24 MB JPG
"base loras work on turbo"
>>
File: Zimage_base__00208_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
>>107996314
i made this image
>>
>>107996324
proof?
>>
>>107996293
right, i'm going to debug the code because pythoniggers arent able to properly state the requirements. fuck off
>>
>>107996362
How about you join the fail dev and languish with him being unable to port shit to another language. Cry lil bro, keep crying like a bitch.
>>
>>107996309
You have to understand, the average redditor hears "it's a base model" and instantly assumes loras will work on Turbo. Even though the base we got isn't the same base used to train Turbo, it is significantly diverged from those weights. ZIB is not an ancestor model of ZIT, it's an alternate branch.

It doesn't help that some retard takes an undertrained lora that does basically nothing, uses it on Turbo, and says "wow look it works and it looks so good" when the only reason it "works" is that the lora is extremely weak and it effectively just adds a bit of noise when used on the wrong model.
>>
File: z-image_00112_.png (1.17 MB, 1024x768)
1.17 MB
1.17 MB PNG
>>107995970
>>107996037
Here's 50 steps with Euler and Beta. Pretty similar.
>>
File: file.png (386 KB, 1019x477)
386 KB
386 KB PNG
>>107996175
aegyo sal is non negotiable
if your model doesn't support this, it's basically useless
>>
>>107996431
its not a real thing retard, even your pic shows it, women shouldn't come with makeup unless i ask for it
>>
>>107996380
Is this AI? Can you list the tells? I'm trying to get better at spotting it.
>>
>>107996431
fucking bugs
>>
>>107996442
you gora did this retard, same reason why there are so many prostitutes
>>
>>107996431
why are gooks so insecure about their eyes
>>
>>107996462
see >>107996459
>>
>>107996462
https://en.wikipedia.org/wiki/United_States_military_and_prostitution_in_South_Korea
>>
File: o_00081_.jpg (913 KB, 2304x1792)
913 KB
913 KB JPG
>>
>>107996431
>year 2048
>ayygo sal becomes popular like fat asses in the 90s
>everyone looks stoned or stung by a bee
>"ayo girl you got fat eyes"
>>
File: base.jpg (2.65 MB, 2028x2028)
2.65 MB
2.65 MB JPG
>>
>>107995964
Wise anon
>>
>>107996472
>Yankee princess
>Yankee whore (양갈보; Yanggalbo
kek, im ganna use that name
>>
>>107996309
Yeah, my results weren't awful but I get the impression ZIT simply is not directly descended from ZIB, the way Klein Dist definitely is from Klein Base, so there's a bit of weight misalignment happening. So I'll just keep using ZIT + Ostris Adapter V2 which already worked completely fine anyways.
>>
File: z_imageBASEd_00005_.jpg (688 KB, 1520x1728)
688 KB
688 KB JPG
>>
What is Klein 9B distilled better than zit at? I’m only talking about i2t, no other features. From what I’ve tested Klein is way better at skin imperfections and things like moles, veins, cellulite, acne, bruises etc
>>
>>107996499
It has way better overall prompt adherence at least in English, IMO
>>
>>107996499
Much more output variation
>>
So did chodestones decide to tune zib or klein?
>>
>>107996481
ouch!
>>
>>107996481
is she a gnome?
>>
>>107996527
yeah
>>
>>107996527
Both
>>
>>107996539
Both?
>>
>>107996499
Kleins are better at "real" details and will be also be more useful for pics with no humans because the model is more creative, but you still have the inherited flux body horror. Way more than Zbase.
>>
>>107996553
yeah
>>
File: Zimage_base__00211_.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>
>>107996499
>i2t
you're talking about VL models?
>>
>>107996573
he doesn't know what he's talking about
>>
help i've got a job interview in 30 minutes and i can't stop fapping
>>
I'm racist but don't post on /pol/, what's the best t2i model for me
>>
>>107996585
eye contact and firm handshake
>>
>>107996561
proof?
>>
>>107996572
damn this sdxl finetune is bussin
>>
File: Zimage_base__00213_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>
>>107996588
sd 1.4
>>
File: z_imageBASEd_00012_.jpg (725 KB, 1520x1728)
725 KB
725 KB JPG
>>
So when does the actual base model that they originally announced come out
>>
>>107996611
>Base knows Beczinski
based base
>>
>>107996616
lol
>>
>>107996616
two weeks morre
>>
File: z_imageBASEd_00014_.jpg (641 KB, 1520x1728)
641 KB
641 KB JPG
>>107996625
lora
>>
File: z-image_00113_.png (1.16 MB, 768x1024)
1.16 MB
1.16 MB PNG
>>107996431
Oh, so there's a term for that. I always liked how they looked on Jennifer Love Hewitt, but didn't realize Koreans were purposely emphasizing them.

>this is Z-Image's idea of '90s teen Jennifer Love Hewitt
Grim.
>>
File: Zimage_base__00216_.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
Klein 9 edit distilled upscales like hires.fix anime really well, but it tends to flatten or genericize the shading and colors. Anyone trained a style LoRA on Klein 9b with [insert anime style] to fix this? What software and dataset size would you recommend to do that?
>>
>>107996625
aw yes, "Geeszwaf BekchinskI". I know that guy

https://www.youtube.com/watch?v=N6GAhY7TTxc
>>
File: 1741623961206521.jpg (609 KB, 1536x1440)
609 KB
609 KB JPG
playing around with ZIB 1MP 15 res2s steps + ZIT 1.5x 9 steps refinement (0.6 denoise)
I think I need higher denoise
>>
I'm going to finetune Z-Base
>>
>>107996683
ok then ill coarsetune it
>>
>>107996683
in two more weeks when it releases?
>>
>>107996683
I'll make the logo
>>
>>107996431
you studied the math
you studied the computers
you studied the light, the cameras and the lenses

but you didn't study korean makeup, so now your model is useless
>>
File: Zimage_base__00224_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
now go ahead and stick your hand in there and pick that up
>>
>be comfyui
>sampler seed set to fixed
>increment anyway
>>
>>107996702
look if Chinese Culture doesn't want to release Zase we'll just bake our own Zase. It's that simple. In a crowd of people so obsessed with Zase the sudden appearance of Zase is hardly surprising.
>>
>>107996431
is this the reason why ZIT girls comes with the fucking eyebags lmao fucking bugpeople
>>
File: Zimage_base__00226_.png (1.84 MB, 1024x1024)
1.84 MB
1.84 MB PNG
it really does like to butcher faces if they're not front and center in an image
>>
File: z_imageBASEd_00022_.jpg (853 KB, 1520x1728)
853 KB
853 KB JPG
thats some detail alright
>>
File: 1748844350903127.png (103 KB, 698x683)
103 KB
103 KB PNG
>load comfyui template
>press R to refresh node definitions
>this retarded node cluster breaks beyond repair
epic
>>
>>107996785
you got comfy'd
>>
File: Zimage_base__00227_.png (1.76 MB, 1024x1024)
1.76 MB
1.76 MB PNG
>>
>>107996792
gek
>>
ZIB feels like a censored, more unstable version of chroma. You have to be hyper specific with the parameters if you dont want that specific prompt to be too melted, and mostly you just have to change the prompt to work around it.

I'm using bf16 everything, no sage, tried multiple sampler/scheduler combos, 30-50 steps, 4-7 cfg, nothing seems to get rid of the melted look problem really.

Niche LoRAs with special knowledge generate more creative output but it still kinda comes out melted.

Are there any settings people found to fix this?
>>
File: 1398637088770s.jpg (13 KB, 248x250)
13 KB
13 KB JPG
>>107996796
time for some
>midnight wow!
>>
File: Zimage_base__00230_.png (1.84 MB, 1024x1024)
1.84 MB
1.84 MB PNG
Harrison Ford is back in... Indiana Space Jones
>>
alright time to make that image I've imagined
>loads character loras
>loads style lora
>loads background lora
>loads posture lora
>loads gesture lora
>>
>>107996785
fennec'd
>>
File: ComfyUI_08228.png (3.34 MB, 2160x1440)
3.34 MB
3.34 MB PNG
>>107995978
This'd be a cool VR experience... if they could produce 120fps, 8K stereoscopic video.

>>107996585
Remember to wash your hands!
>>
>>107996828
Lower flow value to 1
>>
flux klein loras give me extremely borked results with the distill. whats the fix? increase strength? it doesnt seem to help that much
>>
>>107996891
its a virtual interview, im probably going to have to keep tugging and keep the camera high
>>
File: Zimage_base__00238_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>
File: z-image_00011_.png (1.67 MB, 1216x832)
1.67 MB
1.67 MB PNG
>>
File: Zimage_base__00239_.png (1.7 MB, 1024x1024)
1.7 MB
1.7 MB PNG
>>107996909
amazing
>>
>>107996891
Jenny looks hot with her hair like that.
>>
>>107996896
as in the other thread: couldn't fix it, slower than usual training rate & many steps just gave me less borked results

best i figured out so far
>>
File: z_imageBASEd_00030_.jpg (938 KB, 1520x1728)
938 KB
938 KB JPG
>>
ok.. off to my interview for principal engineer.. if i get this shit im buyin a rtx 6000 pro
>>
File: Zimage_base__00245_.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
>>107996947
i see, cheers
how many steps would that be? LR around 1e-5?
>>
File: 82793.png (1.42 MB, 1520x784)
1.42 MB
1.42 MB PNG
>>
File: ComfyUI_08242.png (3.47 MB, 2160x1440)
3.47 MB
3.47 MB PNG
>>107996900
>>107996951
Good luck!

>>107996933
Messy hair on a woman always makes the look hotter... they have it so easy.
>>
>>107996970
best results were like 7e-6, it's obviously annoying how many steps it then uses

>>107996951
good luck. but you want an amd gpu for this?
>>
>>107996896
Are you using the Klein lora training adapter?
>>
>>107996997
yeah
>>
Klein 9 edit distilled upscales like hires.fix anime really well, but it tends to flatten or genericize the shading and colors. Anyone trained a style LoRA on Klein 9b with [insert anime style] to fix this? What software and dataset size would you recommend to do that?
>>
>>107997002
just use esrgan/regular upscalers if you want to preserve the original? the point of a diffusion model is to be generative
>>
>>107997019
No, but Klein adds some enhancements to the image that are more subtle and less destructive than Hires fix, and it also fixes some things for free. The only thing is that because it does this, it also changes the shader and the colors, and it ends up leaving everything looking like a children’s book illustration.
>>
>>107996991
what tool are you using btw? i'm still conflicted about the timestep distribution+shift that should be used for klein
>>
>>107997051
You can try including instructions in the prompt to try to preserve the original as much as possible but with fine stuff like that its hit or miss, specially if it conflicts with other parts of the prompt like you seem to have to "improve" the image
>>
>>107995978
Epstein Island, 6 Year old girl Pov, Amnesia: A Machine for Pigs aesthetics
>>
0girl
>>
>>107997131
post lora
>>
>>107997059
Can I make loras with that model?
>>
>>107997137
torch.zeros_like(girl)
>>
>>107996985
Catbox please?
>>
This is the worst general ever existed
>>
>>107997161
Tautological comment since you are here.
>>
>>107996828
Use a model that has RL training lol
>>
Just finetune your own checkpoint bro
>>
>>107997131
0girl is preferable to body horror
>>
There is an anon who is using his GPU as a heater, spamming ZiB gens all day.
>>
>>107996560
Klein distills are mostly fine at the same number of steps as ZIT, like 6 to 8ish
Just don't use the shit comfy stock workflow
>>
File: 1761431564954818.jpg (505 KB, 1392x1632)
505 KB
505 KB JPG
kino WF
>>
>>107996616
The one with even less aesthetic tuning than what they just released, you mean?
>>
File: zzz_00001_.png (510 KB, 1024x1024)
510 KB
510 KB PNG
>zbase
wtf i have this shit? i tried all workflows...
>>
>>107997056
aitk.

no clue about that, feel free to experiment with yet more parameters if you have that much compute
>>
>>107995477
/sdg/ core gens
>>
>>107997206
another localkek btfo
>>
>>107997206
>knotted
>>
File: 1747836159478539.jpg (715 KB, 1392x1632)
715 KB
715 KB JPG
>>
>>107997230
Nice sex doll
>>
I'm trying to follow the rentry guide for wan2.2 on comfyui but I can't get the right version of pytorch to be used.
It says it needs 2.7.1 but it always displays pytorch version: 2.10.0+cu128 when I start comfy.

Is it important or do I just ignore? MY current cope is that maybe the guide is outdated and it'll all be fine...
>>
File: z_imageBASEd_00049_.jpg (1 MB, 1520x1728)
1 MB
1 MB JPG
>>
>>107997056
Timestep distribution should be linear and 0 shifting. This affects how dataset images are denoised during training. Anything other than these settings is coping.
>>
I seem to recall there was a node for comfy that allowed you to put up some nodes as floating windows on screen, or something like that
Anyone know which one I'm talking about?
>>
>>107997217
>>107997226
i'm not going to cry. it's just a geeky curiosity. The turbo Z works perfectly. 500 gens already
>>
File: 1757823051711075.png (3.94 MB, 1296x1728)
3.94 MB
3.94 MB PNG
>>
>>107995681
Best I was able to get with Klein 9B Dist.
>>
>>107997244
sounds reasonable, i only remember people were using shift 3 for zit. but honestly I dont know enough about this
>>
>>107997238
>1suck
>>
>goon for hours daily
>suddenly sharp pain from pp to funhole
>>
>>107997177
Yes, unironically.
>>
File: 1741717275813340.jpg (847 KB, 1248x1824)
847 KB
847 KB JPG
>>
>>107997237
the 2.2 rentry guide was a half-assed edit of the 2.1 guide, so yeah it's outdated and shit. that's why it's not in the op now. just ask chatgpt or whatever to help
>>
File: 1766385398900624.png (3.71 MB, 1376x1568)
3.71 MB
3.71 MB PNG
>>
File: 1751767036833036.jpg (756 KB, 1248x1824)
756 KB
756 KB JPG
now I just need the cnet for base and I can goon forever
>>
>>107997339
Old controlnet doesn't work?
>>
>>107997355
nope
>>
File: z_imageBASEd_00059_.jpg (662 KB, 1520x1728)
662 KB
662 KB JPG
>>
>>107996985
thx.. went well we'll see
>>
File: Zimage_base__00247_.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>
File: 1751330977338124.png (3.98 MB, 1632x1376)
3.98 MB
3.98 MB PNG
>>
>>107997417
No model, even the nsfw ones, is able to do open mouth kissing, only chaste light lip touching.
>>
File: Zimage_base__00251_.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
>>
File: 1767617174757177.jpg (738 KB, 2016x1152)
738 KB
738 KB JPG
>tfw cant go higher than 0.6 denoise on 2nd pass otherwise the zit face creeps back in
>>
>>107997390
cool
>>
File: Zimage_base__00257_.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
>>107997267
Haha, Luke looks like a werewolf. Nice lighting/atmosphere, but poses could be better, esp. Vader.
>>
File: Zimage_base__00262_.png (1.79 MB, 1024x1024)
1.79 MB
1.79 MB PNG
>>
Not impressed.
>>
>>107997443
You can ask a booru-trained model for a french kiss, although the tongues often make no sense if you look closer.
>>
>>107996991
what? why would i want an amd gpu?
>>
>>107997530
he thinks the 6000 is amd. he just doesn't know it's rtx
>>
File: Zimage_base__00267_.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
baka my head>>107997548
>>
>>107997522
That's only part of the kiss. The part where the heads are slightly tilted and wide-opened mouths are entirely sealed together can't be done with any model.
>>
File: cursed.png (2 MB, 1536x1024)
2 MB
2 MB PNG
>>107996747
That reminds me I wanted to try a molten-metal look for Zealot's cursed arm. Got a couple of neat results, but so far not one that combines being large and all-orange while still looking good.
>>
>>107997572
i like the one on the left
>>
>>107997572
neat
>>
>>107997289
You should see a doctor. This could be sign of something serious. Gooning should not cause pain.
>>
new
>>107997567
>>107997567
>>107997567
>>
>>107997572
left one is really cool



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.