Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107342183

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/Comfy-Org/z_image_turbo

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
seedream will not be forgotten... we will have our revenge
>>107343661Just as a reminder for the big news.
gifted threed of frenship
Why does Z have such horrible case of sameface by default?Sure you can give it celebrity names but I want random people that don't exist.
>>107344176>she was a doll all alongrei bros... not like this!!!!!
>>107344182you gotta prompt face features/race. it's the price of consistency
don't care, my finetune will be betterno, actually, i will steal-i mean, use the design of the architecture as base inspiration to create my second own new architecture and surpass this new Z-image thingy everyone is talking about just be patient, don't worry
>>107344182tons of recent models have overfitting/seed variance issues. it's not just z image, but i don't know what's causing it
A large group of indian men who are barefoot running down the street in mumbai. The indian men are holding a sign saying "DO NOT REDEEM SAAR!".
>>107344194You could generate any 1girl or anything else you want and yet you choose to generate ugly jeets. Why are you gay?
>>107344194
You want more variety? Sure, return TO CLIP then. Mongrels.
>>107344199it's just a testalso I think they had insider flux 2 info and released it the same day on purpose.
>>107344182If it's Asian, I suspect their dataset is primarily portrait photography from Chinese TikTok/Douyin cosplayers and internet celebrities with the typical heavy beauty face filters.
>>107344191Overfitting due to overtraining in order to make them perform as intended by the creators, like in the LLM world.
>>107344194it wants to gen 6 fingers, but doesn't do it
>>107344183wcyd
I thought you fucks said Qwen image was fast. Why is it taking over 10 minutes just to shit out a megapixel?
How will the non-turbo version be?
>>107344202
Why is qwenvl taking barely any resources while it is reading an image? My system is very capable.
Model is already downloaded.
a sexy Japanese woman is at the beach sitting in a chair, reading a book. The title of the book is "how to beat flux 2".
>>107344220just use the nunchaku version with lightning lora, blazing fast
is z-image actually good or is it just the latest toy to shill?
>>107344245it makes non plastic people and is much faster, so yes it's good.
>>107344187fake kurumuz. give us Director Mode first
>>107344247like if you made this in flux she'd have a plastic doll sheen.
>>107344252Make it in flux
almost forgor we can modify a prompt without ending into an horror movie... Z is killing it
Ummm okay yeah it's looking pretty good, but how the fuck are we supposed to tag images for loras (if trainers ever find out how to train one)? Like, I can't imagine that right now, honestly.
Good ole danbooru-based autotaggers won't save us now, seeing that the model is perfectly capable of genning readable text.
And what about sex and stuff? Is it already over?
>>107344245Flux 2 is good but everyone is spamming z-slop because its so much faster
>>107344262flux2 at q4 (already an ultra cope quant for imagen) is 20gb, so go figure
>make qwenvl describe my nsfw gen and then gen it with zimageThat bundle of sticks, lold.https://files.catbox.moe/fnj6cg.jpg NSFW
>>107344272safe realistic model can't into danbooru degeneracy whaaat?
see, it's more natural vs flux which has the plastic/doll look.
>>107344262Got some flux slop for us to see? Sometimes the juice isn't worth the squeeze.
>>107344281is this flux? fucked up feet and the hand seems weird
>>107344272man stop with the normie frieren shit, PLEASE
>>107344260>And what about sex and stuff? Is it already over?Just one more finetune and it will fix everything, don't worry, just like lumina 2 and auraflow
So does it run, per step, faster than SDXL?
For me it's running a little bit more than twice as slow as SDXL.
But I am a 12gb VRAMlet, so I suspect CPU offloading plays a role.
Can any 24+gb VRAMkings here share how the speed compares to SDXL?
>>107344153Is it more common to use local models for image generation? Because in LLM world almost everyone just consumes through API services, most of the things you can run are garbage with no context. I mean running local is the only way to get nsfw anyway right? I'm going for anime style
>>107344285posted this earlier but I like how it does horse armor
>>107344325tell the horse to wear pants
>>107344280It's fun to try, anon.>>107344298That's an old image. I'm doing quad amputee fleshlights now.
>>107344258What model and what hoops do I need to jump through to gen like this?
>>107344322if you want to do image only perchance.org will give you unlimited nsfw. it's based on chroma and good enough.
How do you guys sleep at night knowing you use GGUF models?
>>107344332>quad amputeenugget bros...
I don't get why you guys are hyping up these bloated trash models that manage to be a step below Chroma HD somehow
Some fucking genshintron
>>107344336just run Z image turbo locally.
>>107344360Okay, booting it into Fooocus UI now, fingers crossed
>>107344360does it need nvdia cards?
>>107344338
I'm already reading guides to understand this shit. I'm gonna try running everything locally: I downloaded ComfyUI and some SDXL checkpoints like WAI-illustrious. Do you have any tips for prompting?
I'm gonna use anime models for now (it seems they are also lighter or something, I only have 12GB vram), but in the future I would like some realistic model to make 2D girls into 3D for fun
>>107344364If you're genning and not using a Nvidia card than stick to online sites bud
People already complaining even when it's this good, small and fast before the inevitable finetunes. As long as they release a solid base model it's going to be the new standard (for coomers).
>>107344371Small is definitely not what it is retard.
>comfy must be dragged into the streets and shot
I am testing Flux2 edit since there are no LoRa training for ZIT nor ZITedit. Better than QwenEdit at anatomy, but getting that stupid Flux blur effect.
>>107344369is it really that bad? isn't this a monopoly?
>>107344367
perchance isn't local, it's a website. with chroma try using natural language sentences to describe one aspect of an object or person, then end the sentence with comma separated tokens. like "anime girl has blue hair in braids, long, flowing, neatly plaited". don't try to describe a lot of different aspects in a single sentence, that seems to be how you get body horror.
so is 9 steps the standard/ideal?
>>107344393it would be if any of the other companies made something remotely competitive
>>107344395Does anyone have an actual example of it benefiting from more than 8?Most of the time, it changes little.
>>107344375kys vramlet
>>107344375Comfortably fits on a 5 years old consumer gpu. I'd say that counts as small.
>>107344395yes, actual inference will start after 1st step
>>107344393I've never used it but you can try zluda
>>107344178no FUCKING WAY
>>107344388What the fuck is that derp face kek.
This one came out better than expected.
>he doesn't have a fridge full of beer in his bathroomwhy even live
>>107344430Dude not cool. She's brave putting herself out there after the stroke.
Does the Zimmage work on 8gb or should I an hero
>>107344454It's literally built with 8GB as the target
>>107344178big if true, local is saved
>>107344178Every day I thank God for President Xi and the Chinese people. May the Jews never do to that country what they did to us
lightning Z lora waiting room
>>107344454turbo prompts in ~20s on 4060
>>107344469
2 bit quant z image is what we're all really waiting for
>>107344458
>>107344474
Okay thanks
description said "fits comfortably" on 16gb so I assumed that was the baseline
>>107344468the jew fears the 100 acre wood
>>107344454
I mean the worst you would have to do is go down to 8bit?
But it should work with some offloading. (Which it seems to do automatically when needed.)
>>107344469plus teacache
>>107344469 Is your computer made by Tiger Electronics?
doesn't know any characters therefore irrelevant until a big finetune
how the hell do you increment a value or randomize it with the new comfyui style?
>>107344191
It's the only way to remove anatomy issues: you basically overtrain on poses, gestures etc, so that while you have very little seed variation, you get correct anatomy.
There is sadly no magic to solve this. Chroma for example has a lot more variation between seeds, but it also has more anatomy issues. And it's not a model size issue either, GPT-4o etc have practically zero variation between seeds.
>>107344482>ANON'S COMPUTER?>FOUR GIGA BYYYYTTEESS
Z can't even do NSFW right if at all how is it being shilled this hard as the next big thing?
>>107344500KEK what model is this?
>>107344507Flux
>>107344509ah you know what i should've guessed that. thanks thoughbeit.
>>107344497Memes are moar important than NSFWYou would know that if your brain wasn't full of coom
>>107344507Z-Image Turbo
>>107344492Go back to the good nodes.
>>107344497just think of it as a bigger and better sdxl waiting for its noob finetune
>>107344176People are claiming Z is uncensored but this is barbie doll, or can you just not expect for it to get explicit unless you explicit prompt for genitals in a base model?
>>107344497training nsfw loras or finetunes for a model of this size is pathetically easyit's literally qwen but ten times faster and with a less pronounced neutral bias, this shit is insane
>>107344520I guess it's broken then, ok
>Update ST to check the new model>The UI has been shuffled around again for no reason.Why do they keep doing this.
Can't load Flux.2 by offloading anymore. Anyone know why?
>>107344538>Can't load Flux.2good
How often do you guys get anatomy errors with z-image?
Honestly all i can think now is i wanna pause all my genning until base releases, knowing it'll be trained with noob's dataset, it'll 100% replace illustrious/noob/pony for me permanently.. woah..>>107344552not super often. only times i can think of it happening were because of my prompt.
BUT CAN IT DO PEEPEE?
do we have z-image loras yet?
nb4 base is paywalled nb4 "but they said they wouldnt"
>>107344538Use Z Image instead.
Absurd. This shit is so good for its size. Not as good at different art styles as chroma obviously, but still insane nonetheless.
>>107344563Already have at least 300 different ones last i checked.
>>107344418The more complex your prompt the more a higher step count will do
>>107344393im almost certain you can use an amd card on linux with comfyui pretty smoothly
>>107344578so there's a benefit to higher step counts? thought anything over 9 was negligible.
>Negative prompts have no effect>Incorrect model sampling nodeWhy is the workflow comfy shared so bad?
>qwenvl is literally bugged and takes 500+s to read an image>swap to joycaptionNow we're talking.
>>107344588>so there's a benefit to higher step counts?Yes, for the reason described in the post you replied to
>>107344563It can read all SDXL and Flux branches fine somehow so you got countless too choose from
>>107344603no way, are you epically trolling?
>>107344603you're saying my flux loras work? no way
>>107344603lol
I can't even get it to work in SwarmUI (says missing backend text encoders) and Comfy is too confusing and enraging for my smooth brain so I will just go back to crying in the cry corner like I have been I guess.
Reminder Pony V7 is literally right here if you want a competent realism model that does NSFW really well.
Bruh..So I use joycaption to describe my nsfw images, been spamming a few just copy pasting the prompt. It just straight up genned cp with zimage..
ZIT edit has to beat Flux2 when it comes out.
>>107344662amazing humor, /hdg/ tumor
>>107344659use comfy's workflow and switch back to swarm's ui with it
>>107344669Anon, we need to investigate your hard drive
>>107344669THAT'S HIM OFFICAH
>>107344669...and?
>>107344669Why don't you take a seat?
>>107344669stop right there
>>107344360Does it do lewds? What about the edit variant?
>>107344493>Chroma for example has a lot more variation between seeds, but it also has more anatomy issues.less*It's a skill issue at this point desu. I mean in this case does the Z side look like good anatomy to you? Let's just say, there are situation where Z is unable to do what is being asked, and there's no way to fix the bad anatomy because it's hard baked. I won't lie that Chroma v40 in this case didn't take several tries to get it right, or gets the number of toes wrong now and then. But these are minor issues that are fixed if I keep regenning due to the gift of seed variety (even within the same seed). Or now with Chroma HD Flash, I get 10x less anatomy issues that I had on non-Flash versions, so it would probably get it first or second try.
>>107344705KINO
>>107344709fewer**
>>107344709>>107344723pingas***
the coherent backgrounds Z is able to pull off makes my dick rock hardGOD imagine how that noob dataset is gonna look with good backgrounds
>>107344737>GOD imagine how that noob dataset is gonna look with good backgroundsThat's the issue. If they overtrain on it it will nuke the background quality because you're training on images with godawful backgrounds for the most part. Can't really have your cake and eat it too.
>>107344745Just merge the datasets??????????
>>107344178
This is gonna be for the base model right? add this to the reasoning capability (that shit is what's making nano banana so great) and we'll end up with the best model ever, holy shit god bless china I love communism now!
https://xcancel.com/srameojin/status/1993793896397320193#m
This is looking real comfy.
>>107344754>I love communism now!>>107344745i trust the plan either way. the fact they didn't just release it as is, means they actually care. they're not benchmaxxing.
>>107344754>Prompt enhancer with z-image-turbo might be better . System prompt is on its way!what does that even mean?
>>107344500>>107344507>>107344516More like Z-image Negro am I right??
It's pronounced "zimmage"
holy shit Z yes COOK>>107344774haha lol
>>107344765https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>>107344789do young goku instead
>>107344569that looks great
take it downNOW
>>107344789>mfw I don't have to download a 50gb model to get something slopped and subpar
>>107344791very very interesting. i wonder how censored it's gonna be.
>>107344569those chinks showed that you can get great quality at normal size, this is what I always said, there's still a lot of room for improvement, we're just at the begining at this shit, that's why I find it sad that bfl and tencent went for the layerMaxxing thing, they implicitly admit they don't know how to improve their training process and go the easy way out
>>107344794it really cannot do kid goku sadly
hmm
>>107344765>>107344791https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.py
it knows how to do pasties
Holy shit how can it load and dump 32gb of vram so fast repeatedly? That can't be healthy.
>>107344709you could at least have tried
>>107344827That's what VRAM is for dude
>>107344706>Does it do lewds?It does boobs>What about the edit variant?Not released
>>107344821>it made r2d2 chineseLMAO
>>107344826>it knows how to do pastiesanother lora for flux i can safely delete
>>107344754>prompt enhancer is on its way!I'm more waiting for the reasoning personally, that's the most important part, with that your prompt adherence will be off the charts and it'll be able to make comics/manga pages with very vague prompts like on Nano Banana Pro
>>107344830It's chromajeet, his behavior is so fucking predictable and pathetic >>107333063
>>107344851it's really sad desu, I gave him the benefit of the doubt when he was defending chroma so hard (his arguments can hold up, it's a model with the best skin texture and can do NSFW out of the box), but when Z-image got released and showed how much superior it is and how good it it at rendering asian women I thought he would be thrilled by that, I think he's on sunk cost fallacy mode, he shilled chroma too hard to abandon it, many such cases
>>107344851kek
Can Z-Image pass the "girl lying on grass, upside down" test?
>>107344822>https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo/blob/main/pe.pypretty based system prompt if you ask me
The uncanny valley with zimage doing images from 2d is scary.
>>107344890try to go for "a woman disguised as [insert anime character]"
>>107344886that is one hell of a system prompt. wish i could do that with llm's without inflating gen time.>>107344890how are you guys img2img'ing zimage? never done it in cumfart before.
>>107344524I actually prompted for it to be censored, because I wanted to post it on this christian board. Otherwise it will do nudity no problem https://files.catbox.moe/hpla5v.jpg
is 7s/it on ancient vramlet gpu (2070 super 8gb vram) about what i should expect for ZIT, or is something wrong with my setup?
>>107344830Sure, you can engineer it to give you what Chroma gave me from the same prompt, but that's not the point. It has inferior prompt understanding.
>>107344821Ohhhh, dis vewy goowood
>>107344870of course it doesbut does it pass the "girl lying on grass, upside down and pointing a gun straight at the viewer?"
>>107344918this is actually really impressive not gonna lie, not a lot of models can nail that
so can this new shit run on anistudio?
>>107344821https://www.youtube.com/watch?v=lPm890OX_Dc
>>107344927You have alerted the poop dick schizo
Can joycaption recognize people/characters and enter their names? >>107344898Interesting.>>107344903There was a link to a page with a few workflows earlier.
>>107344927ani said support is going to be added soon. stopped using comfy a long time ago so i'll just wait for ani to implement it, no way im launching comfyui ever again
>>107344918bad anatomy on the left hand but very impressive
>>107344946>no way im launching comfyui ever againwe both know that you will
>>107344563
>do we have z-image loras yet?
for the moment no, but civitai has added z-image to their list, damn that was fast
https://civitai.com/models/2169035/z-image-turbo-workflow?modelVersionId=2442551
>>107344954nope. not giving schizo the satisfaction. ani did a great job on his interface, i have no reason to ever use anything made by spiteful schizo comfy
>>107344946I was tolerating comfy these past few months but had to update it to get Z-Image working, somehow they made the UI even worse.Quite an achievement.
>i have no reason to ever use anything made by spiteful schizo comfyreminder that ani forgot to take his trip off while schizoposting
>>107344960>>107344946>>107344927can tell the exact moment Jewlien logs in
>>107344954There's also forge forks, they'll also get Z-Image support eventuallyNo one should have to subject themselves to spaghetti
>>107344851>>107344864>>107344868Samefag anti-Chroma troll. The argument was that Chroma has worse anatomy, which would mean that Z somehow is perfect, which I showed it is not, nor is it able to handle the changing complexity of those prompts in particular.
>>107344974why do you keep posting this photo schizo
gottem
ignore the fucking schizoposting fellas, someone just woke up because /sdg/ is getting it at the exact same time too.
>>107344984its debo
chroma lost, hard.though it has a chance to win considering lodestone is open to working with alibaba on a noob+chroma finetune
>>107344992ITS ACTUALLY JULIEN DICKTARD
>>107344994>chroma lost, hard.it lost very hard, Z-image will soon have all the anime character in human history, we're getting close to the perfect model >>107344178
>>107344974Literally fucking obsessed with a guy who did nothing wrong. He's literally making one of the best UIs for the community.
Hey hey Anon, Anon here.
Bringing the usual. Euler only as of yet but I'll fuck around some more. Comprehensive style plot later today.
Thank you, China. This is pretty exciting.
Full sized box (jpg because of filesize):
https://files.catbox.moe/2xg6gg.jpg
>>107345001you fucking legendary cum god, thank you for your service.
>>107345000put the trip back on please
>>107345001what ui
fp8 is faster but I prefer bf16
>>107345024seems like he has time to trollbake though? https://desuarchive.org/g/search/username/Ani/tripcode/0gRLTHrqN2/type/posts/
OneTrainer update waiting room>>107345001Thanks for testing. Which one was your favorite?
>>107345023>ComfyUIis it really worth using this shitwait a week so proper uis implement z-model anon
Comfy should be dragged out on the street and shot
>>107345028that's not ani
Oh im raffin so hard.
>>107345034>wait a weekthat's too long, I don't want to waste any time having fun with this SOTA model
>>107345001how do you do these? some script that reads metadata, or are you doing it with nodes?
>>107344669HelloI would like you to guide me step by step how to install that model and reproduce the same output, thanks
>>107345044embarrassing addict
>>107345044based addict
>>107345044help ani with anistudio if you want features added fasterhttps://github.com/FizzleDorf/AniStudio
>>107345034Eh, who cares? Just use whichever uis you want, anon. Though, I prefer forge neo, it's easier integration for my game later
>>107345051
>>107345058supporting comfy is simply unethical, he's just not a decent person and his interface sucks. anons should know about the alternatives
>>107344994>though it has a chance to win considering lodestone is open to working with alibaba on a noob+chroma finetuneI would welcome a Z Chroma/bigASP tune, but as of now Chroma ain't going anywhere as it's still the best photoreal model. Since Z is overfit to give good results, we don't know what a tune would do, is it going to break the coherence of the model for stuff outside the training data like Qwen? If so, what is the point of that over Chroma?
>>107345066i'll take comfyanonymous over the guy who trollbakes and schizoposts in this thread all day
>>107345066>muhh decent personwho cares, I separate the art and the artisthttps://www.youtube.com/watch?v=3OV4VaNW4FU
>>107345064go back to ifunny faggot
>>107345001i can't use my eyes, what's the best?
oh, turning the normal txt2img workflow into an img2img is the easiest thing in the world and it's working just fine at 0.3 denoise
jesus people really do blow comfy's complexity out of proportion huh.
>>107345068it's comfy false flagging as ani. ani did nothing wrong, he deserves our support desu, he's based and hard-working but he doesn't have infinite time due to job and dating obligations. and ani hasn't made any ldg threads for months
>>107344822wait, that prompt enhancer thing is gonna be API only? OHNONONONO
>>107345066Sure. I think sd.next already supports z-image, but its UI is bloated as hell.
>>107345067>Chroma ain't going anywhere as it's still the best photoreal modelChroma's main gimmick is not photorealism, it's a vast library of mediums and art styles.
>>107345011
Comfy.
Forgot to add: It's all 9 steps, bf16.
>>107345032
Simple seems to work just fine, but I'm really surprised by ddim_uniform, I kinda dig it. Usually I like bong_tangent for a lot of models but it's really noisy here.
I'll hold off on final judgement, but simple and ddim_uniform seem worth playing around with.
>>107345087
If it's just another (V)LLM in the middle like all those other enhancers, all we need is the system prompts they're using. Pretty confident we can get that.
>>107345085>ani hasn't made any ldg threads for monthsAnd how do you know that, ``anon''? Sounds to me like you got caught with your pants down and are now trying to damage control.
>>107344912prompt engineering goes both wayschroma is extremely finicky with some tags to the point of changing the whole image style just by typing the wrong onealso the original argument was about anatomy
>>107345108daily routine at this point
>>107345096Yes, but Chroma has the handicap of T5 so it can't properly lock on to a style. Plus Z Edit if anything could potentially bridge any gap in styles.
Comfyui won, majorly. Spoke to thousands of webui users, pretty much all of them are switching to comfy for zimage.
>i can take the semi-real nova animal images i've been generating for months and make them much more realistic with a quick 8 step img2img nowholy. fuck. god if it didn't have that jpeg noise filter this would be insanely overpowered. that base checkpoint will officially light the scene on fire.
>>107345129based
>>107345124T5 is like 6 years old at this point, it's such a terrible and obsolete choice for a text encoder
>>107345124>the handicap of T5I wonder what it exactly is, isn't t5 pretty damn robust on paper atleast
>>107345110Be honest. You get the same image over and over again. That is boring. I can get that girl in 5 variations of that pose with Chroma. That is its power.
we're eating well, maybe qwen edit v3 will be out today too.
You can really appreciate how realizic Z-image is when you compare it against Flux desu
>>107345144Q: what's with the no seed variance?
>>107345155oh that's right, i should send my gens from early yesterday i did in flux to z. thanks for reminding mekek how fucked is that? feeding gens from a model way fucking bigger because that model is somehow less realistic.
>less parameters>generates faster>generates better resultsquite an achievement desu, it's so refreshing to get SDXL speed gens.
so how many months until the base is released?
I just woke up and I had to catch up to 3 threads, this is going so fast, even during the release of Wan it wasn't like that, damn
>>107345178do we have info on how long it took start to finish for them to get the turbo model?shit, i don't even know how big the noob dataset is.
>>107345155Yep, or compare to Qwen or Flux 2 Pro (which is about same realism). Good stuff in that case.
>>107345129God, I love Carol!
Trying comfy for the first time and I'm able to generate 1 image, but I get some decode out of memory error when trying to generate a second one, without even changing prompts or anything, at batch size 1. How does this shit work?
8 vram 16 ramlet btw
>it's time to shill the current thing
>>107345155God bless the chinks
>>107345202try using clean vram nodes.
Can I use joycaption directly on my workflow? I'm tired of copy pasting to a web page.
>>107345199same brother. my brain can't fully process the shit i'm making right now.i can integrate turbo z into my illustrious/noob realism workflow as the final step to make it properly realistic, barring the jpg noise issue. nuts.>>107345202the solution's in your statement, you're a fucking ramlet 'arry
>>107345205>the current thingbut Flux 2 is a current thing and no one is shilling it, they're all making fun of it
>>107345202
>but getting decode something out of memory error
use the tiled vae decode node instead
Fucking hell, no way this model can be 3 times lighter than qwen, it generates considerably better results.
>>107345231yep, not only it looks realistic but the anatomy and details are on point, something that Chroma failed to do
>>107345213>i can integrate turbo z into my illustrious/noob realism workflow as the final step to make it properly realistic, barring the jpg noise issue. nuts.post the workflow, carol-anon
>>107345124
>so it can't properly lock on to a style
There's no problem training styles on Chroma, Civitai is full of Chroma style loras.
The Chroma base model simply didn't caption many styles when training, so everything became 'generalized', which is how ai training works unless you separate concepts with specific captions.
For example, if you train on a ton of images of women captioned only as 'woman', the model will generalise all these women into a single 'look', which is why you need eye color, hair color, skin color, freckles, full lips, thin lips, ethnicity etc in the captions to instruct the model not to generalise all these concepts.
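To make the captioning point concrete, here is a minimal sketch of building per-image captions that spell out the attributes instead of a bare 'woman'. The function name, the attribute keys and the comma-separated format are all made up for illustration; real trainers and taggers each have their own caption conventions.

```python
def build_caption(subject: str, attributes: dict[str, str]) -> str:
    """Join a subject with its explicit attributes into one caption string.

    Hypothetical helper: the "value key" phrasing and comma separator are
    assumptions, not the format of any particular trainer.
    """
    # Skip empty values so missing attributes do not leave stray commas.
    parts = [subject] + [f"{value} {key}" for key, value in attributes.items() if value]
    return ", ".join(parts)


# A bare caption gives the model nothing to tell two images apart ...
generic = build_caption("woman", {})
# ... while explicit attributes keep the concepts separable.
specific = build_caption("woman", {"hair": "red", "eyes": "green", "lips": "thin"})
print(generic)
print(specific)
```

The idea is only that every trait you want to stay controllable at prompt time has to show up in the training captions.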
>>107345183>even during the release of Wan it wasn't like that, damnit's a good model and it can run fast on any modern card with 12GB+ vram
>>107345245workflow was probably a bad word, i meant like "my usual genning in sd forge, THEN img2img in comfyui".
>>107345250how do you even train chroma loras, i keep getting errors when testing the lora on comfy.
so how much vram to run all this goon sheet
>>107345263i use sd-scripts. works very well for training characters
>>107345250Speaking of Civitai, is there any good alternative or an archive? Their website is laggy and slow
Imagine if they used nemo instead of qwen 3b.
>>107345263
Diffusion-Pipe and OneTrainer work fine, at least I've used both to train Chroma loras and both work in Comfy. OneTrainer loras don't work in Forge though, while Diffusion-Pipe loras do.
OneTrainer has Chroma presets and Diffusion-Pipe has a config example.
>>107345271ill look into it, i tried a bunch of times with the onetrainer preset for chroma and keep getting header json issues when loading it to comfy
>>107345281
>Imagine if they used nemo instead of qwen 3b.
I have no doubt this model could be even better. what if they went for a 15b model instead, this shit would be Nano Banana Pro tier, and I'm not exaggerating
Is there an abliterated/uncensored joycaption somewhere?
>>107345294which preset did you use in onetrainer for chroma? im using the 16gb preset.
>>107345278I wish, but there really isn't. It's easily the worst site of its kind I've ever come across in terms of UI navigation and bloat, but it is THE place for lora / finetunes, and seemingly the last AI site that allows NSFW sharing, they had to drop celebrities though else the (((payment processors))) would ban them.
>in less than 12 hours i went from "i cant wait to delete all my flux loras" to "i can't wait to improve all my flux gens"god damn what a model
>See people shill this model hard>"It can't be that good right? I'll try it by myself and see if the realism is..."oh my... I apologize to the chinks, I wasn't familiar to their game
>>107345299theres a bunch of resources on the chroma discord
>>107345144I can talk positively about SD1.5 if I keep moving the goal posts too. But I'm not interested in doing so or arguing with a person who does that.>>107345310Joycaption is pretty much uncensored.
>>107345205no shilling, it's super fast, and makes better gens than the 35 gig model.
>>107345325So... there's no private/public tracker for AI models and lora?
>>107345314 16gb preset as well. If you get an error you should report it, Chroma loras have been working fine since support was officially merged. Are you sure you have an updated OneTrainer?
Complete retard here, how are the requirements compared to SDXL?
>>107345337Not that I'm aware of. Would be cool if there was.
>>107345129Can you share a img2img wf anon?
>>107345339bout 2x
okay. hear me out. Flux 3. and its 160gb
>>107345337torrent is lost technology for anyone born after 2000
>>107345339if you go for bf16 you need a bit more than 12gb of vram, you can offload to the ram though: https://github.com/pollockjj/ComfyUI-MultiGPU
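The 12gb figure lines up with simple weight-size arithmetic. A rough sketch, assuming the model has about 6B parameters (that count is an assumption inferred from the 12gb number, not something stated in the thread) and counting weights only, so the text encoder, VAE, and activations come on top:

```python
def weight_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory taken by the weights alone, in decimal GB."""
    return num_params * bytes_per_param / 1e9


# Assumption: ~6B parameters. Swap in the real count if you know it.
PARAMS = 6e9

for name, bytes_per_param in [("bf16", 2.0), ("fp8", 1.0)]:
    print(f"{name}: ~{weight_gb(PARAMS, bytes_per_param):.1f} GB of weights")
```

Whatever doesn't fit in VRAM after that is roughly what you'd have to offload to RAM.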
>>107345347>add load image node>add vae encode>connect load image node to vae encode>connect output of vae encode to latent imageezpz. adjust as needed if you have less than 16gb of vram though. can't guarantee you won't OOM.
>>107345339Uses about 13.5GB vram for me, runs great, 40 seconds for a 2048x1536 image. Faster if you go smaller obviously.
>>107345303NBP is probably powered by SOTA thinking models like Kimi K2 Thinking. Gemini 2.5 Flash was like trillions of params.
>>107345349>>107345353>>107345355>tfw 3060 vramletIt's over for the little guy. maybe i'll try it still
>>107345352still great for apps and games, I use 1337x or cs.rin.ru torrents.
>>107345331I was half joking. I just don't want to update CumUI but I guess I need to do it.Would be interesting to try some bit more artistic gens and different mediums and see if it bends or not.
>>107345358you really don't need that many parameters just to reason and rewrite prompts though, like c'mon, we're asking the model to render 1girl, not solve the Navier-Stokes equations
>>107345362like I said, offload 2 or 3 gb to the ram and you're good to go >>107345353
>>107345325Jesus, what about just index all the models on Civitai and link directly to their model pages? All I need is just a search and filtering without the autoplay gifs and bloated ui
>>107345371offloading usually meant awful speed. also never used comfy, i'm a reforge boomer. but yeah i'll try it
>>107345362 8GB I assume? It'll still work, just gotta offload more to RAM.
>>107345338I did a clean reinstall of OneTrainer, then hit the bat file to update and then redownloaded the lodestone repo. I keep getting this error:>Error while deserializing header: invalid JSON in header: EOF while parsing a value at line 1 column 0I made sure that the dataset is in place as well.
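For that kind of header error it can help to check the lora file itself before blaming Comfy. A minimal sketch, relying only on the documented safetensors layout (8-byte little-endian header length, then that many bytes of JSON); the function name is made up for illustration, and whether a truncated save is actually the cause here is a guess:

```python
import json
import os
import struct


def check_safetensors_header(path: str) -> dict:
    """Parse the safetensors JSON header; raise ValueError if it is missing or malformed."""
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        prefix = f.read(8)
        if len(prefix) < 8:
            raise ValueError("file is shorter than the 8-byte length prefix")
        # First 8 bytes: header length as an unsigned little-endian 64-bit integer.
        (header_len,) = struct.unpack("<Q", prefix)
        if header_len > size - 8:
            raise ValueError(
                f"header claims {header_len} bytes but only {size - 8} remain (truncated file?)"
            )
        raw = f.read(header_len)
    try:
        return json.loads(raw.decode("utf-8"))
    except (UnicodeDecodeError, json.JSONDecodeError) as exc:
        raise ValueError(f"header is not valid JSON: {exc}") from exc
```

Call it on the output file, e.g. `check_safetensors_header("my_lora.safetensors")` (placeholder filename). If it raises, the file was written incompletely or isn't a safetensors file at all, and the fix is on the training/export side rather than in the loader.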
>>107344994>>107344999>more anime slopew. anyway, chroma is still king for being able to do more weird and interesting shit. I can deform bodies (intentionally and unintentionally) in chroma where as in z it tries to correct it. but maybe that's what chroma needs, correction on unintended limb horror so a z and chroma merge would be legendary.>z becomes more flexible with weird shit>chroma becomes less body horrordis gon b good
How better quality wise is the BF16 compared to the FP8? worth the extra niggabytes?
>>107345377>offloading usually meant awful speed.not if you offload less than 20% of the total size, and the offloading method has improved a lot recently, the model is already so fast you won't notice a lot desu lol
>>107345329>>107345136can someone explain to me why does this child looks appealing to me WHAT THE FUCK, HELP
>>107345385If you have an RTX 3000, maybe. BF16 is usually the better option.
>>107345385nah stay on bf16, fp8 is a pretty bad quant in my opinion, if you're really desperate for something smaller go for Q8
>>107345400>>107345403thanks, i guess most of us are on the fp8 then huh kek was wondering why some of the gens here are higher quality. the fp8 must be the one with the really bad artifacts. grabbin the bf16 then.
where's the actual download I hate this huggingnigger website so much
>>107345362It runs pretty well even on 8GB
>>107345410>the fp8 must be the one with the really bad artifacts.it's not that bad but the difference is noticeable compared to the real deal
>2070 super>8 sec per step when loading BF16 in fp8 and offloading a small amount of the model to RAM>Try Q4_K_M>11 sec per step even though model is fully in VRAMJesus didn't realize it would be this bad on an older GPU. Well, at least it works.
>>107345397She's white
>>107345417what s/it?
https://www.youtube.com/watch?v=ZEcqHA7dbwM
>>107345397I got news for you, that means you're a pedo
>>107344493Might consistency of the transformation also be a benefit to their edit model? It seems to me less an error and more where image generation models are moving. Whether or not that's a good or bad thing is subjective
Should I be downloading the fp16 or the gguf Q8 quant?
>>107345426my man... q8 is 7s/it
>>107345397How compelling, now say that in public
>>107345444>>107345424>>107345403
>>107345445>my man... q8 is 7s/it
>>107345397She's of ageYou're a perfect biological man
>>107345354thanks anon
>>107345364it's worth it. you can actually make flux/qwen type gens in seconds, no speed lora needed. also, people look real not plastic.
>>107345430
>>107345329What are you even trying to argue?
>>107345397stop right there
>>107345463Yeah, in any case it's better experience than Chroma already.
Why does Comfy say I have 5068 MB available when it loads models? I have 8 GB VRAM and I made sure only a small amount was in use before launching Comfy. And CLIP is on CPU.
In a near future.
>>107344178What's the point of this fake screen? Give hope to some people, and then make them despair when nothing come out of this? That's it?If they really were ready to do h, their model would be able to do dicks, like hunyuan
>>107344791So this is why it comes bundled with the full LLM and not just the encoder. Makes sense. Really we should be able to do this in Comfy anyway, no reason we couldn't just load the LLM and ask it to enhance the prompt, it isn't a super technical thing or something that hasn't been done before
I think I understand why it's so noisy and pixelated, the shift isn't high enough, it's at 3 (default), you can increase that value with the ModelSamplingAuraFlow node
>>107345397cute ok obviously, appealing, nope
>>107345491Wait, that was a shitpost?
>>107345494Wait, you have to use model sampling? I thought it was optional and could be bypassed.
>>107345470MUH FUCKING DICK FUCK
>>107345397Welcome to the club
>>107345503it is technically optional (when you don't use it it sets the value at 3) but you can use it to increase its value if it's beneficial to you
>>107344791If it comes with a reasoning llm in it, does that mean something like, say, sillytavern could take advantage of it AND use it for image gen at the same time?
HOLY im gonna prealso BF16 is as fast as the fp8 lmao
>>107345491fuck
why not working saars
>>107345534>BF16 is as fast as the fp8 lmaodepends on the gpu generation
>>107344709Why you are an absolute retard? this is a base model, Chroma is a finetune, is better because has more porn data set with many poses, but if the furryfag did that with flux schnell, an old and ugly model, distilled, with this when we have the base model without any retard distillation, you can have the same but 10 times better in a future finetune of chroma2.
>>107345539What error have you encountered?
>>107345555woah holy tamoli what'd you prompt for this?
>>107345494here's another example, look at the wall there's no noise patterns anymore
>>107345546only considering that lodestone actually will do it, as he ran out of money long agoI hope he does, z image is also smaller than schnell
>>107345541niggerwell kek i'm so glad i bought this card. made the literal moon jump from the gtx 1080 too.>>107345494>>107345560holy shit really? it was that easy?
Does ComfyUI sideload to ram automatically? I imported that workflow, have 12GB vram and it still generated
>>107345560Nice, try adding some film grain using shift=7. Will that work?
>>107345467I'm saying that you moved goal posts many times already, and I can do the same. That profits nobody.Just enjoy chroma, its alright.
>IT WAS THE FUCKING SHIFT THE WHOLE TIMECOOOOMFFFYYY GET YOUR ASS IN HERE AND EXPLAIN THIS SHIT IN YOUR DEFAULT WORKFLOWWHY WAS IT NOT TESTED CONNECTED?
>>107345511Catbox the uncensored version.
>>107345588Nigga you can gen your own easily
>>107345582you mean it was bypassed by default?
>>107345592bypassed+not connected to begin with
>>107345582>>107345592>WHY WAS IT NOT TESTED CONNECTED?if the node is being bypassed it means the shift is at 3, there will always be a shift, but the default one might be too low
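For anyone wondering what the shift value actually does: in flow-matching samplers it remaps each noise level with `sigma' = shift * sigma / (1 + (shift - 1) * sigma)`. A small sketch of that remapping; the evenly spaced base schedule and the 8-step count are assumptions for illustration, real schedulers may space the steps differently:

```python
def shift_sigma(sigma: float, shift: float) -> float:
    """Flow-matching time shift: pushes intermediate sigmas toward the noisy end."""
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def shifted_schedule(steps: int, shift: float) -> list[float]:
    # Assumption: base sigmas evenly spaced from 1.0 (pure noise) down to 0.0 (clean).
    base = [1.0 - i / steps for i in range(steps + 1)]
    return [shift_sigma(s, shift) for s in base]


for shift in (1.0, 3.0, 7.0):
    sigmas = shifted_schedule(8, shift)
    print(f"shift={shift}: " + ", ".join(f"{s:.3f}" for s in sigmas))
```

The endpoints stay at 1 and 0 whatever the shift is; only the middle of the schedule moves. A higher shift keeps sigma large for more of the steps, so more of the step budget goes to the high-noise part of the trajectory, which is at least consistent with the noise patterns going away at shift 7.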
>>107345599Not connected you shouldn't even be able to gen anything since it's inline with the loading of the model.>>107345600I see, I'll try 7.
>>107345559>Polaroid SX-70 manipulation photograph
>>107345491Not fake nigger, the messages wew just rearranged for convenience.
>>107345600shift 7 seems like the best spot, this is my img2img workflow at 6 steps.
>>107345591>Nigga you can gen your own easilyI want that image uncensored.
Zimage seems very good at glowing/aura effects.
anyone testing zimage with cfg > 1?I'm trying 2.5 with some negatives
>>107345588>>107345615https://files.catbox.moe/a2s7pe.jpg
How was Z-Image-Turbo trained from Z-Image-Base
>>107345610>6 steps.why 6? it was trained at 8 steps lol
>>107345642too many steps turns the image into that taylor swift as an 80 year old gen from the last OPbut ill bring it back to 8 and also try 16 given i'm still testing.
>>107345634Doesn't seem to respond to negatives at all, and if you change cfg from 1.0 it literally doubles the generation time
Is there a forbidden lora trained on underage pussy or how do we go bout this dog
>>107345668for me it drastically enhances prompt adherence, but it makes the image worse looking. I'll take a look at tricks like skimmed cfg, maybe that can help. I don't care about speed, I'm using a 5090 with batch 4, I'm so used to long gens this is nothing
>>107345668>if you change cfg from 1.0 it literally doubles the generation timeThat applies to literally any model. After realizing that I never went back to being a cfg>1 cuck. You can still use negative prompts with NAG, but it probably doesn't work with this new model yet. https://github.com/ChenDarYen/ComfyUI-NAG
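The doubling falls straight out of how classifier-free guidance is computed: each step needs a conditional and an unconditional (negative prompt) prediction, combined as `uncond + cfg * (cond - uncond)`. At cfg = 1 that reduces to `cond`, so the second model call can be skipped. A toy sketch; `CountingModel` and its fake prompt weighting are stand-ins made up for illustration, not any real UI's code:

```python
def cfg_combine(cond: list[float], uncond: list[float], cfg: float) -> list[float]:
    """Classifier-free guidance: extrapolate from the unconditional toward the conditional."""
    return [u + cfg * (c - u) for c, u in zip(cond, uncond)]


class CountingModel:
    """Hypothetical stand-in for the diffusion model; only counts forward passes."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, latent: list[float], prompt_strength: float) -> list[float]:
        self.calls += 1
        return [x * prompt_strength for x in latent]


def denoise_step(model: CountingModel, latent: list[float], cfg: float) -> list[float]:
    cond = model(latent, 1.0)  # pass with the positive prompt
    if cfg == 1.0:
        return cond  # uncond term cancels out, so one pass is enough
    uncond = model(latent, 0.5)  # second pass with the negative/empty prompt
    return cfg_combine(cond, uncond, cfg)


for cfg in (1.0, 2.5):
    model = CountingModel()
    denoise_step(model, [1.0, 2.0], cfg)
    print(f"cfg={cfg}: {model.calls} model call(s) per step")
```

This is also why negatives do nothing at cfg 1: the unconditional prediction never enters the result.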
Sick.
>>107345668>>107345683if we had NAG for Z-Image we would definitely get some prompt adherence improvement
>>107345637Thanks.
>>107345687Same energy.
>>107345693kek i like that puu turned into a design on her shirt
>>107345693what model is that? qwen-edit?
>>107345560yep, I think this definitely fixes the noise
>>107345715it was kontext dev
>>107345682this model will never be fully uncensored because otherwise it would be able to gen cheese pizza insanely easythe only way this model can be fully uncensored is to lobotomize the absolute shit out of it so it forgets even the smallest glimpse of realistic data so it only prints 2Dand i have absolutely no fucking idea on how are chinks going to achieve that
Just needs a liiiiitle finetuning on some explicit material
>>107345494>>107345560>>107345722Kek, and I thought that issue was linked to the model itself, once again it's Comfy's fault :(
>>107345741>literally cumfy's fault because he didn't connect the node in his workflow
>>107345745>>107345745
>>107345733>it would be able to gen cheese pizza insanely easyI mean, it's kinda doing that already, just some proportions are bad
>>107345741Well, 3.0 shift is in their own scheduler config, that's kinda what you'd use first.
>>107345759>FlowMatchEulerDiscreteScheduleris that the simple scheduler though?
wonder if zimage base will be even better?
>>107345798Probably need to be finetuned before it can be considered production quality
>>107345608Well, we will see then, but they won't use the noob dataset as it is, I can see them cut a lot of the h from it. And I don't know how they will tackle artists mix with natural language, or the threshold for a character/artist to appear.That and the noob dataset is at least one year old, so retraining one year of loras will be painful, more if it still can't do nsfw posesHell it's not even a deal they will do an anime finetune, but if they do it's not going to be anytime soon and I can see ppl waiting for this before doing any big finetune of the base model
yesterday i did some testing with the fucking model shift but i only tested from 0-5, here's a bigger range: https://files.catbox.moe/7n0qwl.png
>>107346254You fucked something up because those are all identical.
another few, i guess it's very much prompt dependent when shift will make a difference:euler a> https://files.catbox.moe/scw8lo.png> https://files.catbox.moe/w1xxdu.pngeuler> https://files.catbox.moe/bz7bm4.pngprompts:>an analog film photo of a man holding a beer while sitting in the driver's seat of an old truck, a woman in a white bikini sits on the hood laughing>an amateur photo, an irish woman wearing lacy black top, purple hair, black bangs, posing in the middle of moshpit, high-angle selfie
>>107346305yeah the noodles had shift set up to the sampler and not scheduler, i fixed it here: >>107346323but also i redid that one:> https://files.catbox.moe/qo3gy7.png
>>107345382of course the chromashitter is antianimeit's hilarious how they out themselves
since my coffee is hitting, here's a clip-skip grid: https://files.catbox.moe/gow552.png