Discussion of Free and Open Source Diffusion Models
Prev: >>107993481
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb
>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
Cookies!
I bought spicy "hot" potato chips for today evening and /g/>they are not spicefuck what do I do now
> 320/49Explains why they had to postpone for 2 months.Does not explain why only 2 months though.
Chinese culture anon on suicide watch
>>107995571She looks like a man
>>107995588yeah that's Brie Larson
>>107995477blessed thread of frenship
Now that the dust has settled, can Zedit be released already? Klein can't fulfill my needs and Qwen is so fucking slow and smooth slopped
inb4 comfy and python le bad and why isn't anustudio in the OP
>>107995583>oh no! the rentry links are gone!>wtf! who added the rentry links?holy shit. this is the gayest, most retarded shit ever and everyone involved in this schizo rentry link shuffle should be fuckin ashamed
>>107995573the chinese video models needs to start pushing more for creativity with concepts than just "muh realism".
can the ani haters just dilate already? it's so fucking hard to have a discussion when schizos and trolls are spamming this childish faggotry
>>107995573z-image edit when
there he is. he's doing the routine. new day, same song and dance. zzzzzzzzzzz
>>107995571Does it know her fucked up toes
finetune when
>>107995667good question but i doubt it, the likeness isn't that great to begin with fellow fa/tv/irgin
>A still frame from the movie The Empire Strikes Back, showing Luke Skywalker with a blue lightsaber fighting Darth Vader with a red lightsaber, on a metal catwalk in Bespin. Their lightsabers are locked in a clash, with Luke straining to resist Vader's superior strength. The metal guardrail and nearby machinery are scarred and sparking from stray slashes.
These are ZIT, and the next post will have non-Turbo for comparison.
>>107995681>the nu-starwars lightsabers with the guardslol overfit on disneyshit
>>107995681And here's Z-Image non-Turbo for the same three corresponding seeds.
>>107995689it's over
>>107995672never, and we have it it will be very subpar, and there won't be any merges/mixes to fix it too, its all downhill from here (I for one am happy enough with klein to make memes and fool around)
>>107995623she looks like an ex-onlyfans model that converted to christianity and now speaks about the damages of the porn industry
>>107995695but kathleen is gone now....
>>107995704Her vaginal secretions will forever stain and stink the franchise
>>107995689
I know, right? Triggered the fuck out of me.
>>107995693
I also tried adding Mark Hamill to the prompt and getting a closer shot to see if his face would come out better, but it seems Z-Image doesn't know his face that well.
>A close-up still frame from the movie The Empire Strikes Back, showing Mark Hamill as Luke Skywalker wielding a blue lightsaber, fighting Darth Vader wielding a red lightsaber, on a metal catwalk in Bespin. Their lightsabers are locked in a clash, with Luke being pushed back and visibly straining to resist Vader's superior strength. The metal guardrail and nearby machinery are scarred and sparking from stray slashes.
>>107995704they poisoned the model, it's too late
Feeling horny, any good loras or tunes yet?
>>107995726yeah
>>107995726civitai.com/models/2182021/z-wedgie-v2-is-slider-trained-on-z-image-base
>>107995693
>>107995711
Forgot to mention these used 25 steps for non-Turbo, so going for longer might improve details. The close-up versions used a different set of seeds than the other two.
>>107995739>All sample images are made on Turbo.lol
>>10799574225 steps sounds too low for Base
>>107995711Try to add some retro words to your prompt. The old trilogy isn't that polished.
>>10799574225 steps sounds too high for base
>>107995721no, people will post here until the next thread which may or may not have the rentry links, until the next thread which may or may not have the rentry links, rinse and repeat, and no one will care except for a couple of insane, demented losers
What's stopping me from gathering a dataset on my own and just finetuning it on my own PC
>>107995793time
God this shit sucks at guns even in base.
>>107995756
The template suggests 30-50, but defaults to 25. I might try some 50-step runs for comparison.
>>107995787pretty sure the only guy who cares is the schizo who pretends to be ani and ran and keeps samefagging both sides
>>107995801But what if I have patience and I'm willing to wait?
>>107995793entropy
>>107995796Muadib
>>107995793It's gonna take forever on a single gpu. (unless you actually have a big setup then please go on)
>>107995810base slop
>>107995808
nothing is stopping you. this guy is finetuning chroma on one (1) 4090
https://huggingface.co/SG161222/SPARK.Chroma_v1
>>107995812I have a 6000, can I do it?
>>107995805how do you explain the celeb spammer fanning the flames? genuine or a unrelated troll?
>>107995803
When I started testing I went with that too, results become noticeably better at around 40-50 steps. But it takes twice as long too
>>107995805I'm pretty sure I don't give a fuck, faggot
>>107995823its like a open wound, you have the main culprit but then bacteria starts to grow too
>>10799580350 is def the way to go
>>107995818Then why aren't anons in this thread making the finetunes they want instead of wishing for those finetunes to materialize?
>>107995831proof?
>>107995803comfy always sets the templates to like 20 steps when 50 is recommended
>>107995819In something like 5 years or so, yes, no one is stopping you
>>107995840because most are retarded poors
>>107995802
>even in base
idk about the usage of "even" here, Base is not supposed to be "better" than Turbo as that model had RL aiming for higher quality and better anatomy. Base is just supposed to be "less slopped" and easier to fine-tune. You guys are still stuck in the early SDXL days where a distilled model automatically means body horror
>>107995840Because I can't afford my card to be busy 24/7 and basically locking me out of doing anything gpu intensive during training. I already train loras overnight when I sleep
>>107995557
>Apparently turbo loras work with the base if you bump up the strength to ~2.5
tried that and just turned things into mush
>>107995848I don't get it, if it is so hard and slow how are people making illustrious tunes?
cozy bread
>>107995856It's all lora shitmixes.
>>107995830thats much better than my attempts, nice
>>107995856
>how are people making illustrious tunes?
They have access to gpu clusters or are mislabeling their loras as finetunes, some people also do very small scale fine tunes with few images.
>>107995818>Dataset Preparation... 80%And using wayback it has been like that for weeks, what's going on
Muad'dib
>>107995870tldr
>>107995802>sucks at gunstry airplanes, faggot. The people compiling the datasets have to be the most emasculated faggots out there. Heavy machinery, airplanes, cars (unless it's on a movie), war machines, every model sucks at those.
>>107995870if you had to guess, why do you think that is
>>107995897if i knew i wouldn't ask retard
>>107995906thanks for the asidenow if you had to guess, why do you think that is
>>107995906He didn't ask for knowledge he asked for a guess
>>107995906what could possibly be the reason
>>107995921didn't ask
>Loras are inherently limited>Can't finetune on my own computer>Nobody else is making the finetunesWhat's the point of this then?
>>107995933fingerbang yourself
>>107995933we told you base was a meme
>>107995818>finetuningHe's training a lora on a couple thousand images max, merging it into the model, releasing only the merged model, and pretending it's a full finetune.
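For anyone wondering what "merging it into the model" means mechanically, here is a minimal sketch, assuming the usual LoRA formulation where the low-rank update is added to each base weight as W' = W + strength * (alpha / rank) * (up @ down). The function names and the toy matrices are made up for illustration and are not taken from any trainer mentioned in the thread; plain Python lists are used so it runs without torch.

```python
# Hypothetical sketch: fold a LoRA into a base weight matrix.
# All names here are illustrative, not any real tool's API.

def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    inner = len(y)
    cols = len(y[0])
    return [[sum(row[k] * y[k][j] for k in range(inner)) for j in range(cols)]
            for row in x]

def merge_lora(weight, lora_down, lora_up, alpha, strength=1.0):
    """Return weight + strength * (alpha / rank) * (lora_up @ lora_down).

    weight:    out x in   (base model layer)
    lora_down: rank x in  (often called A)
    lora_up:   out x rank (often called B)
    """
    rank = len(lora_down)
    scale = strength * alpha / rank
    delta = matmul(lora_up, lora_down)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(weight, delta)]

base = [[1.0, 0.0], [0.0, 1.0]]
down = [[1.0, 2.0]]       # rank 1, in = 2
up = [[0.5], [0.25]]      # out = 2, rank 1

merged = merge_lora(base, down, up, alpha=1.0, strength=1.0)
merged_strong = merge_lora(base, down, up, alpha=1.0, strength=2.5)
print(merged)
print(merged_strong)
```

Once merged, the result is just another set of full-size weights, which is why a merged LoRA and a full finetune look identical from the file alone. The `strength` knob is the same multiplier people bump up when a LoRA feels too weak on a given model.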
>>107995928Yes, I, in fact, did not ask.
Unironically ZB is SDXL tier. Something about it looks way off. Flux (with loras) mostly solved the "authentic look", minus the fact that Flux is bogged down by censorship.But ZB legitimately looks like SDXL gens, everything looking uncanny and made out of mashed potatoes, with unnatural lighting.
>>107995933>What's the point of this then?To stop having hopes like a retard and just be glad when good shit pops up, I wasn't expecting for something like Klein to drop in a million years yet here we are, I was also not expecting to have any porn/anime model better then SDXL in a million years and still am not surprised by the fact that we still don't have it; the secret is to have no expectation.
>>107995803Here's 25 steps versus 50 for the same seed (1028834557930147). Ehhh. It really likes those crossguards.
>we get the best open source world model yet
>people are still yapping about base slop
yall cringe
https://github.com/Robbyant/lingbot-world
>>107995970that looks terrible in both cases, what sampler and cfg are you using, are you using negatives?
>>107995978ooh wow look at the slop you can press wasd in!
>>107995992yet you glorify 1girl slop, curious
>>107995970they likely used scraped images off google search and if you google star wars algo probably pushes the most recent stuff up so that's what the training baked in
>>107995997oh do i?
really starting to hate python and the whole community
>see project
>"works for python >= 3.10!" and cuda >=12.4
>clone
>install dependencies with python 3.11 and cuda 12.8
>install fails because some packages with their specified version are not available for that python/cuda combo
>okay guess i will go with the verified combination then
>install
>run script
>errors
this shit happens constantly. what the fuck
>>107996004yes you do
>>107995997proof?
>>107995867
What are you talking about, it was at 40% and only recently got to 80%. Probably an issue with rendering on WM. I've been checking once or twice weekly.
t. spark preview enjoyer
>>107995983Generally defaults. No negative prompt.
>>107996017multistep is fail
>>107996007>he doesn't have a llm managing his comfyui, python, etcimagine posting here and being so outdated
>>107996007babbie's first time with python dependencies
>>107995978
https://arxiv.org/abs/xx.xx
I dunno if I can trust researchers that can't upload papers properly
>>1079959781girl vagina cave exploration
>>107996017I haven't tried res multistep so idk but the images I got so far look way less melted on just euler beta. I still have to test more but it feels like negatives are needed with the base and improve output by a lot.
guess which is turbo
>>107995739I guarantee you the same thing would look the same if it was trained on ZIT with adapter
>>107996007old /g/ returns. slowly, but surely
is base worth it or nah
>>107996042well yes it would considering >>107995749
before and after genning one (1) picture with z-image
>>107996040Left
>>107996032Shut up she is finally ready to settle
The z-image model released yesterday is just "z-image", the version they distilled into z-image-turbo. The true "base" model is the z-image-omni-base which has yet to be released.
I'm not knocking the model released yesterday, I've just seen like 10+ posts getting this wrong today and it was bugging me.
>>107996040both are z-image
>>107996051
if you are satisfied with ZIT, not really. it knows a lot more than Turbo and can do many more things, but the gen times are a big turn-off. I find it really fun to experiment and see what it can do.
>>107996069Brap
>>107996063based fellow hitlerposter
people who complain about zib gen times are unemployed loser. i just start genning an image before i got to work, and when i come home i get to enjoy it with a glass of red wine.
>>107996094ai babble
>>107996097There is no zib, you meant zi
>>107996097i gen at work
>>107996069I mean, clearly?<picrel https://huggingface.co/Tongyi-MAI/Z-Image
>>107996069z-image can be confused with z-image turbo. so we need a name for it to differentiate it, and base just stuck. omni is gonna be omni and edit is gonna be edit. might as well call this base
>>107995636thread quality is noticeably worse without the links. it's a fact.
>>107996125retard
>julienbake
>>107996125This anon is very smart
>>107996135
as long as he doesn't whine about comfy and start advertising his UI and himself it's okay I guess
>>107996169i wonder if omni will have this face disease
>>107996158toothpick test dude
>>107996158everything reminds me of her
a*i won
Video chads, we're so back
SkyReels-V3
>Set the extension duration, choosing from 5 to 30 seconds.
https://github.com/SkyworkAI/SkyReels-V3
https://huggingface.co/Kijai/WanVideo_comfy_fp8_scaled/tree/main/SkyReelsV3
Self-Refining Video Sampling
>We present self-refining video sampling method that reuses a pre-trained video generator as a denoising autoencoder to iteratively refine latents. With ~50% additional NFEs, it improves physical realism (e.g., motion coherence and physics alignment) without any external verifier, training, or dataset.
https://github.com/agwmon/self-refine-video
do people still say that Z base failed?
>>107996225qrd
>>107996213>thread is deadyeah he did
stop replying to yourself
>>107996228How can something that didn't release fail?
>>107996007if it takes you more than twenty minutes to figure out you should go back to reading the manual and trying to understand what you're actually doing.
>>107996293no
"base loras work on turbo"
>>107996314i made this image
>>107996324proof?
>>107996293right, i'm going to debug the code because pythoniggers arent able to properly state the requirements. fuck off
>>107996362How about you join the fail dev and languish with him being unable to port shit to another language. Cry lil bro, keep crying like a bitch.
>>107996309You have to understand, the average redditor hears "it's a base model" and instantly assumes loras will work on Turbo. Even though the base we got isn't the same base used to train Turbo, it is significantly diverged from those weights. ZIB is not an ancestor model of ZIT, it's an alternate branch.It doesn't help that some retard takes an undertrained lora that does basically nothing, uses it on Turbo, and says "wow look it works and it looks so good" when the only reason it "works" is that the lora is extremely weak and it effectively just adds a bit of noise when used on the wrong model.
>>107995970>>107996037Here's 50 steps with Euler and Beta. Pretty similar.
>>107996175aegyo sal is non negotiableif your model doesn't support this, it's basically useless
>>107996431its not a real thing retard, even your pic shows it, women shouldn't come with makeup unless i ask for it
>>107996380Is this AI? Can you list the tells? I'm trying to get better at spotting it.
>>107996431fucking bugs
>>107996442you gora did this retard, same reason why there are so many prostitutes
>>107996431why are gooks so insecure about their eyes
>>107996462see >>107996459
>>107996462https://en.wikipedia.org/wiki/United_States_military_and_prostitution_in_South_Korea
>>107996431>year 2048>ayygo sal becomes popular like fat asses in the 90s>everyone looks stoned or stung by a bee>"ayo girl you got fat eyes"
>>107995964Wise anon
>>107996472>Yankee princess>Yankee whore (양갈보; Yanggalbokek, im ganna use that name
>>107996309Yeah, my results weren't awful but I get the impression ZIT simply is not directly descended from ZIB, the way Klein Dist definitely is from Klein Base, so there's a bit of weight misalignment happening. So I'll just keep using ZIT + Ostris Adapter V2 which already worked completely fine anyways.
What is Klein 9B distilled better than zit at? I’m only talking about i2t, no other features. From what I’ve tested Klein is way better at skin imperfections and things like moles, veins, cellulite, acne, bruises etc
>>107996499It has way better overall prompt adherence at least in English, IMO
>>107996499Much more output variation
So did chodestones decide to tune zib or klein?
>>107996481ouch!
>>107996481is she a gnome?
>>107996527yeah
>>107996527Both
>>107996539Both?
>>107996499Kleins are better at "real" details and will be also be more useful for pics with no humans because the model is more creative, but you still have the inherited flux body horror. Way more than Zbase.
>>107996553yeah
>>107996499>i2tyou're talking about VL models?
>>107996573he doesn't know what he's talking about
help i've got a job interview in 30 minutes and i can't stop fapping
I'm racist but don't post on /pol/, what's the best t2i model for me
>>107996585eye contact and firm handshake
>>107996561proof?
>>107996572damn this sdxl finetune is bussin
>>107996588sd 1.4
So when does the actual base model that they originally announced come out
>>107996611>Base knows Beczinskibased base
>>107996616lol
>>107996616two weeks morre
>>107996625lora
>>107996431Oh, so there's a term for that. I always liked how they looked on Jennifer Love Hewitt, but didn't realize Koreans were purposely emphasizing them.>this is Z-Image's idea of '90s teen Jennifer Love HewittGrim.
Klein 9 edit distilled upscales like hires.fix anime really well, but it tends to flatten or genericize the shading and colors. Anyone trained a style LoRA on Klein 9b with [insert anime style] to fix this? What software and dataset size would you recommend to do that?
>>107996625aw yes, "Geeszwaf BekchinskI". I know that guyhttps://www.youtube.com/watch?v=N6GAhY7TTxc
playing around with ZIB 1MP 15 res2s steps + ZIT 1.5x 9 steps refinement (0.6 denoise)I think I need higher denoise
I'm going to finetune Z-Base
>>107996683ok then ill coarsetune it
>>107996683in two more weeks when it releases?
>>107996683I'll make the logo
>>107996431you studied the mathyou studied the computersyou studied the light, the cameras and the lensesbut you didn't study korean makeup, so now your model is useless
now go ahead and stick your hand in there and pick that up
>be comfyui>sampler seed set to fixed>increment anyway
>>107996702look if Chinese Culture doesn't want to release Zase we'll just bake our own Zase. It's that simple. In a crowd of people so obsessed with Zase the sudden appearance of Zase is hardly surprising.
>>107996431is this the reason why ZIT girls comes with the fucking eyebags lmao fucking bugpeople
it really does like to butcher faces if they're not front and center in an image
thats some detail alright
>load comfyui template>press R to refresh node definitions>this retarded node cluster breaks beyond repairepic
>>107996785you got comfy'd
>>107996792gek
ZIB feels like a censored, more unstable version of chroma. You have to be hyper specific with the parameters if you don't want that specific prompt to be too melted, and mostly you just have to change the prompt to work around it.
I'm using bf16 everything, no sage, tried multiple sampler/scheduler combos, 30-50 steps, 4-7 cfg, nothing seems to get rid of the melted look problem really.
Niche LoRAs with special knowledge generate more creative output but it still kinda comes out melted.
Are there any settings people found to fix this?
>>107996796time for some >midnight wow!
Harrison Ford is back in... Indiana Space Jones
alright time to make that image I've imagined>loads character loras>loads style lora>loads background lora>loads posture lora>loads gesture lora
>>107996785fennec'd
>>107995978This'd be a cool VR experience... if they could produce 120fps, 8K stereoscopic video.>>107996585Remember to wash your hands!
>>107996828Lower flow value to 1
flux klein loras give me extremely borked results with the distill. whats the fix? increase strength? it doesnt seem to help that much
>>107996891its a virtual interview, im probably going to have to keep tugging and keep the camera high
>>107996909amazing
>>107996891Jenny looks hot with her hair like that.
>>107996896
as in the other thread: couldn't fix it, slower than usual training rate & many steps just gave me less borked results
best i figured out so far
ok.. off to my interview for principal engineer.. if i get this shit im buyin a rtx 6000 pro
>>107996947
i see, cheers
how many steps would that be? LR around 1e-5?
>>107996900>>107996951Good luck!>>107996933Messy hair on a woman always makes the look hotter... they have it so easy.
>>107996970
best results were like 7e-6, it's obviously annoying how many steps it then uses
>>107996951
good luck. but you want an amd gpu for this?
>>107996896Are you using the Klein lora training adapter?
>>107996997yeah
>>107997002just use esrgan/regular upscalers if you want to preserve the original? the point of a diffusion model is to be generative
>>107997019No, but Klein adds some enhancements to the image that are more subtle and less destructive than Hires fix, and it also fixes some things for free. The only thing is that because it does this, it also changes the shader and the colors, and it ends up leaving everything looking like a children’s book illustration.
>>107996991what tool are you using btw? i'm still conflicted about the timestep distribution+shift that should be used for klein
>>107997051
You can try including instructions in the prompt to preserve the original as much as possible, but with fine stuff like that it's hit or miss, especially if it conflicts with other parts of the prompt, like you seem to have with "improve" the image
>>107995978Epstein Island, 6 Year old girl Pov, Amnesia: A Machine for Pigs aesthetics
0girl
>>107997131post lora
>>107997059Can I make loras with that model?
>>107997137torch.zeros_like(girl)
>>107996985Catbox please?
This is the worst general ever existed
>>107997161Tautological comment since you are here.
>>107996828Use a model that has RL training lol
Just finetune your own checkpoint bro
>>1079971310girl is preferable to body horror
There is an anon who is using his GPU as a heater, spamming ZiB gens all day.
>>107996560Klein distills are mostly fine at the same number of steps as ZIT, like 6 to 8ishJust don't use the shit comfy stock workflow
kino WF
>>107996616The one with even less aesthetic tuning than what they just released, you mean?
>zbasewtf i have this shit? i tried all workflows...
>>107997056aitk. no clue about that, feel free to experiment with yet more parameters if you have that much compute
>>107995477/sdg/ core gens
>>107997206another localkek btfo
>>107997206>knotted
>>107997230Nice sex doll
I'm trying to follow the rentry guide for wan2.2 on comfyui but I can't get the right version of pytorch to be used.
It says it needs 2.7.1 but it always displays pytorch version: 2.10.0+cu128 when I start comfy.
Is it important or do I just ignore? My current cope is that maybe the guide is outdated and it'll all be fine...
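A small sketch for comparing the version string comfy prints against the one a guide asks for. It only tells you which direction the mismatch goes, not whether it matters for a given workflow. The helper names are made up for illustration; the one thing worth noting is that comparing the raw strings would get this wrong, since "2.10.0" sorts before "2.7.1" as text.

```python
import re

def version_tuple(version):
    """Turn '2.10.0+cu128' into (2, 10, 0); the local '+cu128' tag is dropped."""
    public = version.split("+", 1)[0]
    parts = []
    for piece in public.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)

def describe(installed, wanted):
    """Say whether the installed version is older, newer, or equal to the wanted one."""
    have, want = version_tuple(installed), version_tuple(wanted)
    if have == want:
        return "exact match"
    return "newer than guide" if have > want else "older than guide"

result = describe("2.10.0+cu128", "2.7.1")
print(result)
```

To read the installed version without importing torch itself, the standard library's `importlib.metadata.version("torch")` returns the same kind of string and raises `PackageNotFoundError` when the package is missing from that environment.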
>>107997056Timestep distribution should be linear and 0 shifting. This affects how dataset images are denoised during training. Anything other than these settings is coping.
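A rough sketch of what these two knobs do, under stated assumptions: "linear" is taken to mean sampling t uniformly in [0, 1], and the shift is the SD3-style mapping t' = shift * t / (1 + (shift - 1) * t). In this parametrization the no-op value is shift = 1.0, so "0 shifting" above is read as "no shift applied"; trainers that expose the shift on a log scale would call that same setting 0. None of the names below come from a specific trainer.

```python
import random

def shift_timestep(t, shift=1.0):
    """SD3-style timestep shift for t in [0, 1]; shift=1.0 leaves t unchanged."""
    return shift * t / (1.0 + (shift - 1.0) * t)

def sample_timesteps(n, shift=1.0, seed=0):
    """Draw n uniform ("linear") timesteps, then apply the shift."""
    rng = random.Random(seed)
    return [shift_timestep(rng.random(), shift) for _ in range(n)]

plain = sample_timesteps(1000, shift=1.0)
shifted = sample_timesteps(1000, shift=3.0)
print(sum(plain) / len(plain), sum(shifted) / len(shifted))
```

With shift above 1 every sample is pushed toward t = 1, which in the SD3 convention is the high-noise end, so training spends more of its steps on noisier versions of the dataset images. Whether that helps or hurts for a given model is exactly what is being argued about here.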
I seem to recall there was a node for comfy that allowed you to put up some nodes as floating windows on screen, or something like thatAnyone know which one I'm talking about?
>>107997217>>107997226i'm not going to cry. it's just a geeky curiosity. The turbo Z works perfectly. 500 gens already
>>107995681Best I was able to get with Klein 9B Dist.
>>107997244sounds reasonable, i only remember people were using shift 3 for zit. but honestly I dont know enough about this
>>107997238>1suck
>goon for hours daily>suddenly sharp pain from pp to funhole
>>107997177Yes, unironically.
>>107997237the 2.2 rentry guide was a half-assed edit of the 2.1 guide, so yeah it's outdated and shit. that's why it's not in the op now. just ask chatgpt or whatever to help
now I just need the cnet for base and I can goon forever
>>107997339Old controlnet doesn't work?
>>107997355nope
>>107996985thx.. went well we'll see
>>107997417No model, even the nsfw ones, is able to do open mouth kissing, only chaste light lip touching.
>tfw cant go higher than 0.6 denoise on 2nd pass otherwise the zit face creeps back in
>>107997390cool
>>107997267Haha, Luke looks like a werewolf. Nice lighting/atmosphere, but poses could be better, esp. Vader.
Not impressed.
>>107997443You can ask a booru-trained model for a french kiss, although the tongues often make no sense if you look closer.
>>107996991what? why would i want an amd gpu?
>>107997530he thinks the 6000 is amd. he just doesn't know it's rtx
baka my head>>107997548
>>107997522That's only part of the kiss. The part where the heads are slightly tilted and wide-opened mouths are entirely sealed together can't be done with any model.
>>107996747That reminds me I wanted to try a molten-metal look for Zealot's cursed arm. Got a couple of neat results, but so far not one that combines being large and all-orange while still looking good.
>>107997572i like the one on the left
>>107997572neat
>>107997289You should see a doctor. This could be sign of something serious. Gooning should not cause pain.
new>>107997567>>107997567>>107997567
>>107997572left one is really cool