336 of My Gens Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107400410

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z Image Edit
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
is base out already?
killing myself
is there a special way to summon loras per region in krita? if i do it with the menu method theres bleed, if i do it with <lora:blahblahblah> i get "Server error: 'Conv2d' object has no attribute 'temp'"
you're delusional if you think the base is coming.
>>107402434looks nice. a little smol
>>107402435 I think they sincerely intended to until they realized what they had on their hands. I called it day one.
tongyi my butthole
>>107402435i've been waiting all week to call anons retarded for claiming base is 6B. I guess I'll never get that chance. oh well
Dedistill status?
>When the anons who called you a schizo for saying there won't be a base model don't get their base model.
It's called pattern recognition, and this pattern was the classic Frodo refusing to throw the ring into the fire maneuver.
>here is the base (distillied) for local cucks>also introducing saas BASE PROIt's going to happen isn't it
>grab prompt off of civitai>make the asian 1girl even more obvious>make her tits way biggerideal zimage workflow
>>107402505zit took everyone off guard, it's not a stretch to think it took tongey off guard too. The days only seem longer because you're anticipating, like a... kid at christmas.
>>107402476
Any recommendations for ZIT Ostris settings? Fine details like clothing patterns are lost with the default settings.
>>107402464i thought we were gunna undistill it first and then dedistill. er-how did it go again?
>>107402476While I have no doubt that's what they've decided, ella-style, it's an incredibly dumb decision. Illu 3 levels of dumb. The model got hyped because it was faster and smaller. Releasing base would help it entrench, sitting on it will simply force people to keep using zurbo, fin.
>>107402410Anyone with NAG on Z got a workflow? I can't get anything coherent out of it
The gay ring problem:_______________There is a very big and gay ring.If you put it on, you gain super powers.But, people will know you wear the huge gay ring.do you wear the ring?
>>107402482Funny how she's 20 at most posing as a grandma.
>>107402552mail order bride, the listing said 52 but they sent a 25
>>107402537It seems like shooting yourself in the foot is a rite of passage for all imagegen companies. Remember when Emad fought desperately to keep SD1.5 out of the public's hands?
Any cool models or nodes like ipadapter where I can combine two images together? be it style, faces, etc
>>107402561What was the prompt? I can't tell if the shadows imply something
>>107402561neat, it has 3d generated imagery style shadow errors.
>>107402563
>rite of passage
This wouldn't be the first time. Wan 2.5 was borderline though. While they never outright said they would open source it, they never corrected the people who interacted with it and said it would be open source.
>>107402568her boobs bounce
>>107402564Res4lyf has a slew of them with examples.
>>107402581excellent output
>>107402546what is so hard about this
is there a difference between increment/decrement and randomize
>>107402604are you for real right now?
>>107402522>CRUMBS ON HIS JACKETSES
>>107402527
just keep it default with cache text embeds and train for longer, like 8000 steps
also switch the adapter version to the new one, change v1 to v2 in input field
>>107402604
the seed (number thing) determines the "gen" you are creating, if you keep everything else the same. that's because it is the seed for the noise, in which the model fantasizes and conjures up the magical abominations that God said you shouldn't be messing with u fool!
MATH! IS! DEMONS!
>>107402618 (me)
also set quanting to - NONE - if you got 24gb
if you got 16, set it to none and set low vram true, if that doesn't fit properly then use that tensor offloading until it does
>>107402604 on a fundamental level no, randomise functions as both increment and decrement, but for those of us with a limited lifespan it's handy to have those options anyway
>>107402628I know what a seed is. So if I get an output that's close to what I want, I should increment or decrement rather than randomize
>>107402604
>>107402628
also, sequential seeds aren't similar to each other, despite being the next number.
the reason to keep the seed the same is if you are changing other things to see what effect they have. But you should be careful about such comparisons, because sometimes it's a fluke.
>>107402505 A way they can play this without fully destroying their image is to purposely modify the real base into a shittier "base", but hold on to the real base to later call it Z-Image-Pro on a web service/API
>>107402604Increment increases quality while decrement decreases quality while randomize randomizes quality of the gen. it's true, trust me, no need to fact check.
>>107402587was same prompt, different resolutionthis one is 'her boobs jiggle'
>>107402643
Doesn't matter, most people use random. a different seed is a totally different random noise.
but
sometimes you want to keep the seed the same and concentrate on fixing your gen. however, this can be a trap.
>>107402645 actually I am testing this out with Flux 2 right now, and incrementing generates very similar outputs, so it's working well. while randomize might generate dogshit again. idk. too much randomness involved
>>107402651oh no I've been doing it wrong
any type of zit lora I try just changes the look of everything. The loras don't mix at all if you're trying to maintain a consistent character. They change the look of all characters the model already knows too.
Is there any correlation between nearby seeds and similar output? There wasn't in the 1.5 days but I haven't actually checked for any other model.
>>107402664Sorry to disappoint, anon. adjacent seeds aren't related at all in terms of output.
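For anyone who wants to check the adjacent-seed claim at the noise level, here is a minimal sketch. It uses Python's standard `random` module as a stand-in for whatever generator a given UI actually uses (that substitution is an assumption), and the helper names `initial_noise` and `correlation` are made up for illustration. It only shows that the starting noise from seed N and seed N+1 is uncorrelated, while the same seed reproduces the same noise. It says nothing about how sensitive a particular model is to that noise.

```python
import random

def initial_noise(seed, n=10000):
    """Draw n Gaussian samples from a generator seeded with `seed`."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]

def correlation(a, b):
    """Pearson correlation between two equal-length lists."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5

# Same seed twice: identical noise. Adjacent seeds: unrelated noise.
same = correlation(initial_noise(10001), initial_noise(10001))
adjacent = correlation(initial_noise(10001), initial_noise(10002))
print(f"same seed: {same:.3f}, adjacent seeds: {adjacent:.3f}")
```

So increment/decrement is mostly a bookkeeping convenience: the neighbouring seed is easy to get back to, but it is not "close" in any useful sense.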
>>107402679
nope. not at all.
666 is a dangerous seed, don't use it.
>>107402579This time it hurts more not because of a potential language barrier confusing the messaging but how many times they mention or allude to "open-source". Their github quite literally states, under Base "By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development."
>>107402456 Base is for sure 6b, exact same architecture as Turbo. The researchers made a big deal about how cheap the model was trained. Why then, would they train a larger base model, then train a smaller model distilled from the big one (but still trained from randomly initialized weights)? Versus just taking the base model as-is and distilling it against itself as a teacher model (exact same thing Flux Schnell did). The latter is way more efficient and needs fewer training resources.
>>107402604No difference. Main benefit of incrementing/decrementing seed is you can recover a gen if there's some mishap.
>>107402679Yes your favorite number will give the best gens, rigorously proven with years of experimentation
>>107402694Maybe flux is better unquantized.
>>107402618>>107402632Thanks... But that will have to cook overnight to validate if it works. I am feeling a little chilly anyway, no need for a heater.
>>107402724oh cool be sure to publish your findings
>>107402527Set steps to 1000.
>>107402643Use the Inspire Ksampler with variation seed.
>>107402701The blackest black pill is that there was no language barrier and this is just Chinese culture.
>wikipedia copy paste slop prompting>it looks better than any game I've played
>>107402679No. Think of seeds as unique hashes. Be aware some samplers like Euler A add noise which makes using seeds pointless.
>>107402745True.But we win, they gave away too much for free. We are overcoming :^)
do you think alibaba can pull one more grift or will it be a different chang that swindles anon next time
>>107402714All unquantized (including the VLM)
>>107402759why can't you control ancestral noise?
So can I use zit to make porn yet?
I don't want to alarm you but there might be a serb or serbs in this thread.
>>107402771what's the captioning node? I like it.
>>107402670
>>107402786The original images were bulk captioned with a python script, and the result was fed into the "Prompt Cycler" node in ComfyUI.
>>107402694Is that a real place at the top or just a concept? Looks neat. Also zimage did a good job, damn.
supports whatnow?
>>107402810>ldg taking on the 1124lb world record deadlift
>>107402819>Qwen3-VL-32B-Instruct>66.7 GBI only have 64gb of system ram. I have 16gb of vram, so I guess theoretically I can run it?
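A rough back-of-the-envelope check for the RAM question, as a sketch only. The 66.7 GB figure is the checkpoint size quoted in the post; the 8 GB of headroom for the OS, activations and everything else is a made-up placeholder, and the function names are hypothetical. Whether a given loader can actually split the weights across system RAM and VRAM is a separate question this does not answer.

```python
def weights_gb(params_billion, bytes_per_param):
    """Approximate weight size in GB: billions of parameters x bytes each."""
    return params_billion * bytes_per_param

def fits(checkpoint_gb, ram_gb, vram_gb, reserved_gb):
    """True if the checkpoint fits in RAM + VRAM after reserving headroom."""
    return checkpoint_gb <= ram_gb + vram_gb - reserved_gb

bf16_estimate = weights_gb(32, 2)   # 32B parameters at 2 bytes each (16-bit)
checkpoint_gb = 66.7                # size quoted in the post
can_split = fits(checkpoint_gb, ram_gb=64, vram_gb=16, reserved_gb=8)
ram_only = fits(checkpoint_gb, ram_gb=64, vram_gb=0, reserved_gb=8)
print(bf16_estimate, can_split, ram_only)
```

On paper 64 + 16 GB covers it with a little room, RAM alone does not, so it only works if the weights can be spread over both.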
>>107402842where the fuck is Kandinsky faggot?
>>107402590Not working well with custom seed variation workflow
>>107402894he grabs her hair and drags her back to mordor for some bodymods :)
>>107401984someone's cranky
>>107402837It was concept art for the "International Chengdu Global Center" which is like a Las Vegas-style arcology.
>>107402909Can you make it where the animals on the wall are trying to talk, but they totally ignore them, then one of them shoots them?
>>107402658
>most people use random
Lifehack: using increment will let you rewind without reloading wf because the seed you've missed but decided to revisit is just a click away.
When do you think reddit is going to come to the same conclusion we all have? The base model is not coming.
>>107402897 What even is 'custom seed variation wf'?
>>107402985NTA but why not reload the workflow from the queue?
>>107402915This is so busy, noisy and jpegified it looks like this was genned with zurbo, not the other one.
>>107402779no
>>107402987You care about what they think?
>>107403013I mean kind of. Like I enjoy seeing them seethe and cope. I think that counts as caring in a way.
Is there some new tech like ipadapter available?I just remembered this ancient tech from the 1.5 days.
Why are they just shifting meaningless stuff around on their github and huggingface
so glad I bought 64 gigs of RAM for $200 several weeks ago. greatest purchase i ever made
Will the training go well?
>>107402999I do both, but most of the time rewinding seed is faster. And sort of allows for internal pipeline separation when working on complicated workflows: rewinding seeds most of the time and reloading from queue when you're lost and need to get back to things that worked.
>>107403028
>>107402842>Ovis
ComfyUI officially declares ZIT a "game changer", which indirectly implies it is superior to Flux.2
>>107402985
life hack: if you use a random seed, then you can't be known to have generated adjacent seeds.
This means that even if an adjacent seed violates Iranian law, you won't be punished for something they can't prove you genned.
example.
"pretty lady"
in chroma hd
you gen seed 10001
you post it
you gen seed 10002
you post it
you gen seed 10003
it shows a booby. you don't want to go to jail in iran, so you don't post it.
you gen seed 10004
you post it
you will now be visited by the Iranian secret police.
if only you'd used RANDOM SEEDS
>>107403036smoke?
>>107403050all the competition except maybe wan is so bad its unreal
>>107402999because comfy is a fucking piece of shit and randomly decides you can no longer do that
>>107403067
>>107402992https://www.reddit.com/r/StableDiffusion/comments/1p94z1y/get_more_variation_across_seeds_with_z_image_turbo/
>>107403045>>107402842I love the reference.
>>107403101
>>107403109kek
why is everyone on civitai training their z loras on pony outputs with tag prompts?
>>107402842so yeah, I don't see the ovis image support.
>>107403164First time?
>>107403169https://github.com/Andro-Meta/ComfyUI-Ovis2found this, but I doubt it can load the new one.
>>107403164That sounds retarded, how are you even supposed to prompt for something like that?
>>107403164tag prompts with many images add seed variance back to the model
>>107403165>advisor: we might have a vacancy for you, its volunteer wor->woman: no thanks>advisor: ..at a school
>>107403284women should work at home. They can setup to sell stuff they make, and they can buy property.Married women should cover their heads when out of the house, and should always obey their husbands and be cheerful in all things as possible.
am i tripping balls or raising ModelSamplingAuraFlow value fucks loras up?
>>107403061That's the weirdest train of thought I've read today.
>>107403284yes Gennie, you do have to work even when you're on your period"fuck that, i won't do it, you stupid bitch, i'm outta here"*sigh*welcome back to the table Gennie"im sorry, hormones"
>‘Post-Avatar depression syndrome’: why do fans feel blue after watching James Cameron’s film?Didn't take long, and ai can cure it.
>>107403032make him shove her down and sit down in her chair
>>107403330 Raising shift increases the scheduler curve's slope. So what's your scheduler (actually, don't answer, just plot them yourself).
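A minimal sketch for the "plot them yourself" part. It assumes the shift used by flow-model sampling nodes maps a timestep t in [0, 1] to `shift * t / (1 + (shift - 1) * t)`; that formula is my understanding of how ComfyUI's flow sampling applies shift, not something stated in the thread, so treat it as an assumption. The base schedule here is a plain linear ramp from 1 to 0, which is also an assumption and will differ from whatever scheduler is actually selected.

```python
def shift_sigma(t, shift):
    """Map a timestep t in [0, 1] to a shifted sigma (assumed formula)."""
    return shift * t / (1 + (shift - 1) * t)

def schedule(steps, shift):
    """Sigmas for a linear ramp from t=1 down to t=0, after shifting."""
    return [shift_sigma(1 - i / steps, shift) for i in range(steps + 1)]

# Print the curves side by side instead of plotting, to stay dependency-free.
for shift in (1.0, 3.0, 6.0):
    sigmas = schedule(9, shift)
    print(f"shift={shift}: " + " ".join(f"{s:.2f}" for s in sigmas))
```

Under these assumptions, a higher shift keeps sigma high for more of the run and then drops it faster near the end, so more of the step budget goes to the noisy part of the trajectory. That is one plausible reason a LoRA could behave differently as the value is raised.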
>>107403442
if you get nsfw output you probably were prompting something riskey anyway, so straight to jail
otherwise for this to be a problem, you would need the future agi to turn into a basilisk that will bruteforce generate all images in accordance with the rest of the settings that would influence the output for every single user/gen found online
you would basically need to be posting on an account/ip tied to you irl known to the government, and also at some point in that chain of posting images you would have to post that exact workflow for those sets of images once so they have the setup parameters, and then you would have to post 2 more, and probably many more gens before they can do this
and even then you have a free option to, after genning a boob, switch to random noise and keep posting like it was an unrelated decision you made for no particular reason
women discussing the largest corporate merger in history
vfx artists in trouble.
>>107403454kek, how'd you do that
>>107403418
>>107403442>output you probably were prompting something riskeynope, noob
>>107403503then you were at best using a model you knew had women in it and wasnt as safe and effective as SD3 was, so straight to jail
>>107403460A hole appears beneath the women and they fall into the flames.
The Iranian/Turk is fine, he took my advice and uses RANDOM SEEDS.
>>107403528Furk?
>>107403495
Now that you've had a chance to decide, which race do you prefer?Yellow race:>>107403418Red race:>>107403495Black race:>>107403543
I like all the races. I like forests, deserts, and the seaside. It's hard to pick a race.
>>107403061lmao you're funny nameGOD
What the fuck is this namefag's problem?
If my dataset images are mostly 1280*1280, is it worth also scaling to lower resolutions during training in ai-toolkit? i think that will scale all images to all those resolutions and train on those too, but is this better than just training at 1280*1280?
>>107403613lol, I don't care. His posts are mostly just noise to me. Rather have him than the schizo who samefags anonymously to shill his broken software.
praying for nsfw and danbooru tune of z-image
>>107403635There won't be a base model so no tune unfortunately.
>>107403621 If it's a tiny dataset, as most lora datasets are, consider mixing and matching between downscaling and crops instead. The goal is for the model to generalize better, and it does that not just with scale but with different composition as well. Also, training the entire lora on 1280 is overkill. It will simply be faster with large batches relegated to lower res.
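To put a rough number on the "faster at lower res" point, here is a small sketch comparing pixel counts per image. The bucket list (512, 768, 1024, 1280) is a hypothetical example and not ai-toolkit's default, and using pixel count as a proxy for step cost is an assumption: real cost also depends on the model's attention scaling, batch size and caching.

```python
def relative_pixels(side, reference_side=1280):
    """Pixel count of a square image relative to the reference resolution."""
    return (side * side) / (reference_side * reference_side)

buckets = [512, 768, 1024, 1280]  # hypothetical training resolutions
ratios = {side: relative_pixels(side) for side in buckets}
for side, ratio in ratios.items():
    print(f"{side}x{side}: {ratio:.2f}x the pixels of 1280x1280")
```

By this crude measure a 512 sample carries about a sixth of the pixels of a 1280 sample, which is why the lower buckets can run with much larger batches.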
>>107403637That never stopped Lodestone, you know (should have, but it didn't)
>>107402909>>107402927
>>107403667I 80 images, and dont mind the time cost to train everything at max quality. But if downscaling helps the model generalize better, why isn't 256*256 also enabled by default? Is it always better to also do 512*512 at least?
>>107403695>I 80 imagesI got 80 images
Qwen just updated their app for image editing. Probably means Qwen Image Edit 2511 is coming soon
>>107403695
>Is it always better to also do 512*512 at least?
Yes, most models train the bulk of it on 512, so you won't make a dent if it doesn't know 256
>>107403708
>so you won't make a dent if it doesn't know 256
you mean it won't matter if it doesn't know 256?
interesting, I'll train it at all resolutions from 1280 to 512 then
>>107403707Can you make bokeh look like a real DSLR instead of this iPhone blur?
the rt. hon chantelle
>>107403543>>107403549(or, if you are into bestiality, a subhuman BR*WN)
so it's over? No base?
>less than a week from a model release>IT'S OVER NOTHING HAPPENED WE GONNA STARVE
bros I'm trying nag out (using blue sky as nag) but I can't seem to get this shit working, are my nodes connected right? I'm using the eulerflow scheduler
cfg 1, shift 3
>>107403945Another asian 1girl has been defeated by the CROSS.
If I want the style of an image but have full control of the character, its pose, scene composition, angle etc, without the use of a lora, what's my options in comfy?
>>107403962Nanobanana api node.
https://www.reddit.com/r/StableDiffusion/comments/1pc2enz/z_image_turbo_controlnet_released_by_alibaba_on_hf/
uh oh. controlnets instead of edit model. bad sign.
>>107403961boom. asians defeated again.
>>107403977>no comfy
>tell the llm to describe this womans naked body>also tell it to keep the image sfw>it gaslights itself that the image isn't of a nude woman
>>107403982It's an exorcism.
It's amazing to watch the latent try to be a gook, but the crosses keep banishing the demonic race.
>>107404009nice try, GOOK-AI
btw, to unburn these, I can just have more iterations. idk how many are needed, but these are 9, and usually I do 80 if I want an unburned look, when I'm fighting the gook devils.
>generate a 1080x1920p image with zit>32gb vram usage 100%
>>107404050HEY YELLOW YIDS!
the reality is they were only targeting the asian market, and so they intentionally were trying to patch whites out of the defaults.
>>107404140
>>107404181
This from the op are the optimal settings for nag?
sooo where is the Base model?
Is this thing dead or something?The models are in the correct location and named properly.
Where base now?Where?!
the blank input for 3 steps guys are right.
https://www.reddit.com/r/StableDiffusion/comments/1p94z1y/get_more_variation_across_seeds_with_z_image_turbo/
I didn't use the wf, because I already know how to chain.
>>107404381But alsoI am blessed by the CROSS which SLAYS evil asian sluts.
the nameGOD knows how to chain guys
I'd rather chain girls
>>107403900there are months where nothing happens and there are days when months happen
/ldg/ - local degenerates
>>107404381Your gen and his have high pass effect in common, though. Which is weird, because in my experience, z isn't prone to falling into high pass when messing with denoise levels. It is, unlike qwen, very high-pass tolerant (thankfully).
noobs who can't adjust curves, need ai to do it for them.
one thousand anons just got TOLD[x] tolderino
noobs, man
https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union
>>107404461Hmm. Releasing another tool for the turbo model instead of the base. Interesting. Let me plot it on my chart.
>>107404471https://search.brave.com/images?q=round+tuit&source=webthe chart
>>107404476>>107404460Are these with NAG?
>>107404485no, the burned look is from cfg, which isn't necessary for this prompt. actually, more steps will make it much less burned. Here it is with neither cfg nor nag.
>>107403050You fools. The interest outmatched their expectations and they now decided to keep the better stuff behind lock and API key.
>>107404506>the burned lookGlad you picked up what I'm putting down.
>>107404519They literally just released a control net. Have patience you zoomer.
>>107404519I've been sounding the alarm bell for a few days now. They will not release the base model.
>>107404528Do you know what wan did before they pulled up the video ladder? Released wan animate. This is a consolation prize.
>>107404506>>107404522I'm kind of into the burned look.
>>107404447>Needing to postprocess his gens in PhotoshopNgmi
>>107404553
>>107404471They're waiting for the hype to die down a bit so they can release base or edit to get it back up again, prolonging the period of time that they are relevant. Relevance leads to active discussion, which results in more exposure.
>>107404565git gud
>>107404531
>base was supposed to come out last week
>bghira warned the chinese government
>orders to censor the dataset
>they are now rebaking base from scratch in a panic
>it will be offered api only so they can monitor and censor all the prompts
>damage control is starting
expect to see more and more "who needs base? turbo is all you need!" posts.
>>107402646
it's alibaba to decide
and you know their decision for wan 2.5
>>107404567>>107404553don't post every shitty output. maintain thread quality
I want Z Base to be SaaS, does that make me a bad person? I think not
>>107404602
this. it's only local until it's good. alibaba saw that it was a turbo 6b model and assumed it was generic slop, even the chinese leaker guy said it was only good at realism and wouldn't be as good as flux but rather an option for those who want speed.
now they see how popular it is and, just like with wan, want to lock it behind an api and monetize it through partner shilling with comfy API, which is why comfy is already mentioning "local and cloud" in every post about z-image. normalcattle will hear about 'uncensored lightning fast z image', search it, immediately see comfyorg results, and subscribe to the pro api thinking it's what everyone was talking about all along.
>>107404631Ever read Art Forum?
Is there a model good and cheap enough to compete against Gemini 3? They can release a good enough model for free to piss in Google's well though.
>>107402410>Z Image EditIt's just called Z-Image, retard
>>107404659well screenshot it, and when it releases you can z image edit it.
>A hybrid cross between a rat and cat.
Tried one of those meme animal prompts. Every other model seems to get at least some idea first try. Tried many gens on Z.
>A half cat half rat hybrid creature
>A hybrid of a rat and cat.
>A hybrid fusion between a rat and cat.
etc.
lol
is there lightx2v lora for flux 2?
was wan2.5 even a success? no one in the west uses it. why would they pull the same trick twice.
be sure to share something today!#growth #empathy #lovethumpshate
>>107404679none succeeded, though.
>>107404679>sd 1.5sovl
Hello real general~!
The new ZiT Controlnet is very good, have you guys used it?
https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union/tree/main
>>107404774We are back anime bross!
>>107404774arigaToT
>>107404774thank you ^^
Is there a local model or workflow that takes both starting and ending frame to generate a video transition between them?
Can you inpaint without issues with zimage yet?No? Still ass?
how do I make parts of the workflow run depending on a boolean?
my use case is prompt rewriting, I'd like to disable it on the fly, and I want to keep the logic inside the subgraph, so externally I'd just see a boolean toggle.
I was trying some logic if nodes, but the true/false shit still requires stuff before it to be processed.
>>107404863What issues are you having? I had no problem personally.
239s on a power throttled 250w 3090
>>107404873>I was trying some logic if nodes, but the true/false shit still requires stuff before it to be processed.I'm not sure what you mean by that. I made a minimal Z workflow that uses a different prompt depending on a boolean value. It uses Impact's conditional node. Does that solve your issue? https://files.catbox.moe/bkoec1.png
i have grown to despise the immature momfucker HugeTits fanclub even more than i do footfags.
>>107404873>>107404942Ah, I think I get it. You need a lazy conditional node, which Impact's isn't, so it crashes if the unused value is null. I believe I've used a lazy conditional node before, let me go look for it.
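To make the eager vs lazy distinction concrete, here is a plain Python sketch. It is not ComfyUI's node API: `rewrite_prompt`, `eager_switch` and `lazy_switch` are hypothetical names, and the "expensive" rewrite step is just a stand-in. The point is only that an eager switch receives both inputs already computed, while a lazy one evaluates just the branch it picks.

```python
calls = []

def rewrite_prompt(prompt):
    """Stand-in for an expensive prompt-rewriting step (hypothetical)."""
    calls.append(prompt)
    return prompt.upper()

def eager_switch(use_rewrite, rewritten, original):
    # Both arguments were already computed by the caller.
    return rewritten if use_rewrite else original

def lazy_switch(use_rewrite, make_rewritten, make_original):
    # Only the selected branch is ever called.
    return make_rewritten() if use_rewrite else make_original()

prompt = "a cat"

# Eager: the rewrite runs even though the toggle is off.
eager_result = eager_switch(False, rewrite_prompt(prompt), prompt)
eager_calls = len(calls)

# Lazy: the rewrite is wrapped in a callable and never invoked.
calls.clear()
lazy_result = lazy_switch(False, lambda: rewrite_prompt(prompt), lambda: prompt)
lazy_calls = len(calls)

print(eager_result, eager_calls, lazy_result, lazy_calls)
```

Both switches return the untouched prompt, but only the lazy one skips the rewrite entirely, which is the behaviour the boolean toggle use case needs.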
>>107404886I'm getting 150 at 300w at 1.6MP
>>107404976oh wait impact node works, I was using the easy-use nodes and that shit required both conditions to produce a value. arigatou
>>107404774Thanks for the news, sexy child!. I want to lick your whole body! ToT. Tell me, why don't you share theis news in our dedicated anime general? we are so lonely! So cute so sexy so child...ToT
>>107402618NTA but I think I got better results with v1. v2 feels a little overcooked while not picking up the likeness as well
>>107404774Thanks for sharing!
>>107404991Cool. If you do end up needing a lazy conditional, I found "Lazy Switch KJ" from KJNodes.
>>107404863Uh oh meltie!
>base was supposed to come out last week
>bghira warned the chinese government
>Xi asked Trump about it
>a sudden alien attack killed the lead researcher
>Trump said that the alien attack was more important than an image model
>Xi agreed
>order to stall the release to manage the third type encounter with the aliens
>they are now using the compute to understand the alien speech instead of finetuning zimage
>damage control is starting
expect to see more and more "I think I've seen something strange flying over my house last night!" posts.
why the hell is he samefagging so hard
>>107405043Who?
>>107404873I would recommend against it. Comfy is very hostile to if/else clauses. You may get one working, but put in several, and the WF will break where you least expect. Something to do with parsing that starts from end node, not the root. Consider vibe-coding a python script if you really need logic.
>>107405043You are mentally ill. Stop larping as the thread moderator.
>>107404886Was it worth it in the end?
>>107405006I agree, V1 looks like clean screengrabs from a Pixar movie while V2 looks like another style bleeds in somewhat
>>107405111I don't even know who that guy was talking about but your reaction tells me he was onto something.
>>107404774do i just load this model like in my old flux control net workflow?
>mentally ill schizo thinks wildcards are bad >proceeds to sit here 24/7 to bake threads with worst possible images
>>107404774i can't think of use cases, since zit is censored
>>107405142>worst possible imagesThis is slop, every image here is worst possible.