Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107405841

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/t

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
nigboigbo
how's the controlnet?
Comfy must be dragged out onto the streets and shot
tensions seem very high in the tongueass lab coomcord
comfy should be dragged into the sea on a yacht
>z-image>the loli is sitting in the man's lap>the man is holding her stomach>they are both sweaty>wan i2v>gyrating hips loraUOHHHHHOHHHHHHHHHHH
>>107408185
>https://comfyanonymous.github.io/ComfyUI_examples/z_image/t
Need to remove the trailing t, it was added by mistake.
>mfw
>>107408217And raped, coz of the implication
Haven't tried much realistic wan genning, but it looks real nice.
>>107408224finally some kino
>>107408229nice gen
>>107408220
>this is worse than Flux2 and QIE
>this single image input needs to stop.
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
So it's just one image input on the edit model? Uh oh...
>Z-Image-Base – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.>By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.>Z-Image-Base>unlock the full potential for community-driven fine-tuning and custom development.>
"beautiful woman"Glorious china.
https://civitai.com/models/1332651?modelVersionId=2460872Gonna bring that Midjourney feel
>>107408010Based shoulderpad enjoyer.>Messed up the lapelsA shame.
base never (ever)
>>107408010or it's te/embedding thing
has anyone shared turbo training settings or has furk already locked it behind pateron
Seek Christ, anons.
>>107408394default ostris should work
>>107408394
>https://www.youtube.com/watch?v=Kmve1_jiDpQ
tldw: The defaults, sigmoid if training a character
>>107408257
yeah it's SDXL after all.
on the plus side you can abuse the large canvas size to cram multiple images in.
>>107407694what were the settings? negative prompts? thanks for sharing :)
>>107408403>>107408410defaults suck though i was hoping for a bespoke ldg anon guide oh well
>>107408394
Is it worth training loras on the turbo model? I've been waiting for the base model. I tried a few loras from civitai and wasn't impressed at all, like >>107408348
I haven't tried anon's 80s fantasy and 2000s camera loras yet
>>107408422nigga we don't even have a wan guide
>>107408379> bug facebruh
probably a skill issue.printed on the center of a woman's shirt in a sans-serif font:"a b c d e f g h i j k l m n o p q r s t u v w x y z", the letters "l" "d" and "g" are red, the rest are black.the woman smiles with both thumbs up.
>>107408348That's why I love how good Z-image turbo is at details, you're not obligated to go for some zoom in of humans to make the image look good
>>107408422
For real-person training, 0.0001 (1e-4) works fine, but if you use any synthetic data, which always trains faster, you should drop down to ~0.00005 (5e-5). This range is likely the best for reasonably complex artstyles (as in not anime).
I use logit normal and flow shifting, but I'm training with Diffusion Pipe, not AI Toolkit.
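For anyone unsure what "logit normal and flow shifting" means in practice, here is a minimal sketch of how training timesteps are commonly drawn that way. It is not taken from Diffusion Pipe or AI Toolkit. The function name, the default mean/std and the shift value of 3.0 are assumptions for illustration, and it assumes the convention that t = 1 is the fully noisy end.

```python
import math
import random

def sample_timestep(rng, mean=0.0, std=1.0, shift=1.0):
    """Draw one training timestep in (0, 1). Hypothetical helper, not a trainer API."""
    # Logit-normal: push a Gaussian sample through a sigmoid,
    # which concentrates timesteps around the middle of the range.
    u = rng.gauss(mean, std)
    t = 1.0 / (1.0 + math.exp(-u))
    # Timestep/flow shift: shift > 1 moves samples toward t = 1,
    # shift = 1 leaves them unchanged.
    return shift * t / (1.0 + (shift - 1.0) * t)

rng = random.Random(0)
plain = [sample_timestep(rng) for _ in range(10000)]
rng = random.Random(0)
shifted = [sample_timestep(rng, shift=3.0) for _ in range(10000)]
```

Comparing the averages of `plain` and `shifted` shows the shift pulling the distribution toward the high-noise end while every value stays inside (0, 1).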
>>107408422its been like a few days no one is experienced enoughalso, how do you know it sucks?
>>107408400Imagine those claws... no. No!
>>107408469
>>107408185
ANIME DIFFUSION NEWS!

>Noob Models!
SeeleNoobAI (2048 native resolution): https://civitai.com/models/1445275/seele-noobai-sdxl
Chenkin Noob XL: (NoobAI ESP with new dataset of character)
https://civitai.com/models/2167995/chenkin-noob-xl
WAI Shuffle Noob
https://civitai.com/models/989367/wai-shuffle-noob

>Anime Lora Making Guide!
https://civitai.com/models/22530/guide-make-your-own-loras-easy-and-free

>Model News!
ZiT (Z-Image Turbo) Model: 6b model, fast, open source, doesn't understand booru tags.
UIs that support it: Comfy, Krita AI Diffusion, Neo Forge, Swarm, SD Next

>Anime ZiT LoRas!:
Frieren LoRA
https://civitai.com/models/2176854/frieren-beyond-journeys-end-sousou-no-frieren-z-image-lora
Flat Anime Style:
https://civitai.com/models/2175307/z-image-flatanimestyle
Ra Lilium Style:
https://civitai.com/models/2125529/ra-lilium-style
Nyalia Style:
https://civitai.com/models/2180136/nyalia-style
Anime Flat Style:
https://civitai.com/models/1952560/anime-flat-style
Teto:
https://civitai.com/models/2175612/kasane-teto-z-image-lora

ANIME CHARACTER LORA REQUESTS HERE!
>>107408448
The majority of Civitai loras for Turbo are just trained on synthetic slop output from other models.
It trains really well, both people and artstyles, BUT base will undoubtedly train better given that it's a model made to be a base for further training.
So you might as well wait unless you are REALLY eager or just like to experiment, like me.
I trained this Z-Image Turbo lora a bunch of threads ago, picked up the person with no problems:
https://files.catbox.moe/4pfomp.safetensors
>>107408522no frickin way this is ai
>>107408531put this shit on civitai omg what's your problem
>>107408543No celebrities allowed on CivitAI dude
please understand sir i need to use the pony outputs for training
>>107408557oh yeah my b
>>107408229Good style debo!
>>107408531who is it (so I can tag it)?
>>107408531
does it train well at 1024 resolution? how's the time vs chroma for example? Have you tried large rank 1gb loras on zImage, like the Emma Watson one you made earlier?
>>107408577it looks like cara delevigne, or however the hell you spell it
I hope I made it in time before the release of base.
inpainting on zit still ass?give me your best z-image inpainting workflowshaloing results are an instant disregardplease and thank you
>>107408593
Why not just run the gen through SDXL with a denoise of like 0.3 with a detailed controlnet, then do a cycle of upscale and downscale?
z shitmix status? dedistillation status?
>>107408584
Haven't tried 1024 yet. This one (Cara Delevingne) was trained at 512, which seems good enough and is really fast: ~1.25 s/it on a 5060 Ti.
>large rank 1gb loras like Emma Watson one you made earlier
Wasn't me, I only train people loras at rank 16.
This model was trained using Diffusion-Pipe: adamw, LR: 1e-4, rank 16, logit normal + flux shifting, 25 images, 100 epochs
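As a rough sanity check on those numbers, the sketch below turns "25 images, 100 epochs, ~1.25 s/it" into a step count and wall-clock estimate. It assumes batch size 1, no dataset repeats and no gradient accumulation, none of which the post states, and `training_budget` is a made-up helper name.

```python
def training_budget(num_images, epochs, seconds_per_step, batch_size=1):
    """Estimate total optimizer steps and minutes of training (hypothetical helper)."""
    # One epoch is one pass over the dataset; round up for a partial last batch.
    steps_per_epoch = -(-num_images // batch_size)  # ceiling division
    total_steps = steps_per_epoch * epochs
    return total_steps, total_steps * seconds_per_step / 60.0

steps, minutes = training_budget(25, 100, 1.25)
print(steps, round(minutes, 1))  # 2500 52.1
```

Under those assumptions the run is about 2500 steps and a little under an hour on the card mentioned.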
>>107408595laughed
>>107408531i'm curious how you're captioning. i'm using old captions from chroma with 70-100 tokens, but i've also heard people are training without captions at all.
>>107408595Based, more happy endings please!
>>107408651
>logit normal + flux shifting
Completely new settings for me. Here we go again.
>Wasn't me, I only train people lora at rank 16
My bad. I hope that dude tries to train that lora on zImage to see if it works
>>107408584
>does it train well with 1024 resolution? hows the time vs chroma for example?
i'd compare it to chroma for 1024. it learns slowly at 1024 but the end results are very good. 6000 steps is where i usually stop, but i'm pretty sure i could go even further for further quality increase.
hey guys base was just released.
psych
>>107408693>>107408702you devil youroguish behaviour
>>107408637why do I only ever get engagement bot tier replies to this questionyou don't need to reply if you have such bad ideas
>>107408693>>107408702Shame on you.
>>107408674
For this I only used 'rcng' as the caption, so basically no caption at all. You don't need rcng in the prompt, and none of the example images used it. For a bunch of images of a person that's perfectly fine; the model will easily spot the human being as the pattern to focus on.
For anything more varied, like an artstyle or clothing / photography styles, I just use JoyCaption with 'Write a long detailed description of this image.', check that it doesn't hallucinate stuff, and edit the resulting caption if it does.
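The "trigger word only" captioning described above can be sketched as a small script that drops a one-word caption file next to every image. This assumes the trainer reads a `.txt` file sharing the image's base name, which is a common convention but should be checked against the trainer's docs. The function name and extension list are illustrative.

```python
from pathlib import Path

IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}  # assumed set of image types

def write_trigger_captions(dataset_dir, trigger="rcng", overwrite=False):
    """Write '<image name>.txt' containing only the trigger word for each image."""
    written = []
    for image in sorted(Path(dataset_dir).iterdir()):
        if image.suffix.lower() not in IMAGE_EXTS:
            continue  # skip captions, notes and anything that is not an image
        caption = image.with_suffix(".txt")
        if caption.exists() and not overwrite:
            continue  # keep hand-edited captions unless told otherwise
        caption.write_text(trigger + "\n", encoding="utf-8")
        written.append(caption.name)
    return written
```

Calling `write_trigger_captions("path/to/dataset")` returns the names of the caption files it created, and existing captions (for example edited JoyCaption output) are left alone by default.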
https://www.youtube.com/watch?v=iNM5z8cCH8w>A 600k subscribers youtuber is making the promotion of Z-image turbogoddam
You should gen more cute maids
>>107408462Is this what you were looking for, anon?
you can prompt some pretty raw shit on zim turbo without loras???https://files.catbox.moe/1vrfjc.png
If they trained it on so many real images why do anons gens look so synthetic?
>>107408723I need non pepperoni nipples
>>107408729AI is an image synthetisizer.
I downloaded all this stuff and I dont know what to do lol
zit dark fantasy lora plus one other
>zit doesnt seem to know what a vampire isIt's so over.
>>107408681
>logit normal + flux shifting
flux shifting is typically known as timestep shifting or flow shifting, so maybe you're already using it.
I've only seen it referred to as 'flux shifting' in Diffusion-Pipe
>>107408691Good to know. I'm still gonna wait before training. I have this gut feeling that the base model might be massive and super slow. I hope I'm wrong
>>107408762what?
>>107408760that looked more like stark girl actually
>>107408777Nice teeth, bro.
>>107408762That's Konoru-chan, not a vampire you baka.
>>107408751sovl
>>107407500catbox this, please. I must be fucking something up because even with the bracket shit I never get anything close.
>>107408762teenage jennifer connelly in compromising positions
>>107408778>>107408760What's the prompt for cleavages? Or is it a lora?Zit either gives me nothing or nipples straight to my face
>>107408801>She wears a dark indigo velvet robe with a deep center split extending to her navel, revealing her torso and the cleavage between her gigantic breasts
>>107408780nothing wrong with the teeth I have no idea what you're talking about :^)
>>107408813>gigantic breasts>Still medium at bestPityThanks I'll try that
Anyone able to gen hitler well with zit?Having trouble getting the hair/mustache right.I thought it'd be easy with such a public figure
>>107408828chinese model, that size is gigantic for them
Z doesn't know Greta Thunberg :(
>>107408762falsegenned yesterday. well, this morning.
It knows vampires are jews.>>107408842keek
https://civitai.com/models/833507/apple-quicktake-150-digital-camera-style-zit-qwen-and-flux?modelVersionId=2461241>sovl -> sovless
>>107408821Is Brad Pitt still in hospital? I hope he is ok.
>>107408729Because the training was of photoshopped "real images" of women who have been more plastic than person since entering university.
Every day that passes I see soul being used as a synonym for old more and more
>>107408894many think old = good you are correct
>>107408531tested it in an sdxl checkpoint for science
>>107408886>Is Brad Pitt still in hospital?he still is :(
>>107408870>ugly -> beautiful
>>107408944Damn! I need to donate some money to him...
>>107408940Huh, it actually picks it up somewhat, interesting
>>107408593Here's an example where I turned the girl's pendant into a heart-shaped one.The workflow: https://files.catbox.moe/rb153f.png
>>107408944
so what happens now
>>107408940and positive weightsusing a forge that doesn't support zitreducing the strength of the lora like> <lora:4pfomp-(03a9d4d29935):0.5>all the way down to 0.1 changed absolutely nothing so that's why i went with changing the weight on the namemy prompting probably messes with the likeness a bit too
>>107408976no remorse for anne, she deserved it
>>107408397that's timotay
>>107409020better, but notrying to hide the problem by using tons of feathering isn't it anonthis isn't the way to get this solved
>>107409020what's after 1.5, she is becoming progressively more grinch-like
>>107409046>>107409020go 2.0 and prompt her with green skin
>>107409046(cara delevingne:5.0)
>>107409085fuckin kek
>>107409085Nepo-baby that stole the Christmas!
>>107407877What prompt did he use for the text???
>>107408604zimage is such slop for troonime
>>107409121Its like SDXL at 500x500 10 steps, reminds me of neta
Hitler remained elusive, so I had to settle for the next person in the shadow cabal
>>107409062nice one anon
>tfw AMDWhat's the best option for someone stuck with an AMD card?
>>107409184suicide
>>107409168>pony uh oh janjan aint gunna like tht one
>>107409184Doesn't Comfy have a AMD portable release ? If so that's your best bet.
>>107409166That's the face of a girl who just stole your doughnut.
>>107409189no but for real mah dogg, drop the shit ahh answers and tell me
which one is better?>>107409085my sides
>>107409170you can just slap in whatever and it'll happen
>>107409205Eat hers later
>>107409213
>>107408584
I got about 1.5 s/it at 1024 with my 4090. R16, 2900 steps and very verbose captioning. AI-Toolkit is really fucking shitty though; Z will probably fare a lot better if they release the base model and you can train on something that's not so completely ass.
>>107409213right, except for the "canon"
>>107409184buy an nvidia card
>>107409213right has better nails
>>107409206realistically sell your AMD card and get something better. If you're broke then be prepared to spend the whole day researching how to get the best out of it, and knowing that even then, it'll still be shit.
>>107409250This is nice!
>>107409166definitely AI. a girl that skinny would never eat a donut.
>>107409274do you see a bite out of that donut? she's pretending to eat it
>>107409250Dunno Z just has that dead AI look to it, flux and qwen looks convincingly ''artistic''
still love qwen edit 2509 (v2). it's so good. especially since the new one allows multi image and easy referencing with no latent stitching needed. (image1/2/3)replace the police officer in blue with the pink hair anime girl in image2, who is wearing a blue police uniform and badge, and kneeling on the black man on the floor. keep the anime girl's expression the same. Add the text "Bocchi the Cop!" to the top of the image.
>>107409290it's clearly trained on seedream slop
>>107409268This is candid shot. In the early 2000s Brad always carried a sword with him when he was in LA.
>>107409225Diffusion-Pipe has Z-Image Turbo supportOneTrainer seems to be waiting for Base
>>107409302could you share a flow for that?
>>107409260This autist knows his shit
>>107409313>Brad always carried a sword with him when he was in LA.Who doesn't ?
>>107409250Z-image anime is pretty generic looking but holy fuck man what does the BFL guys have against anime? Shit always is so melty and anatomy downright awful when it tries to do anime.>>107409290It's a real anime image captioned by qwen. qwen has the same generic style as Z.
>>107409321it's just the default comfy template for qwen image edit. if you updated comfy it's there, didnt change any settings.
>Z is supposed to be poorfag friendly>can't even get Turbo running on my 2070S in comfyhaha...
>>107409352oh, nice. I rarely look at the templatesthank you
>>107409290Z turbo only really looks nice when you find a good pocket of training since they essentially slapped a lora on top of it before releasing
>>107408185>https://comfyanonymous.github.io/ComfyUI_examples/z_image/tded link
>>107409355>2070Syeah, poor. not completely destitute, living in a crack-shack eating great value rice crispies with no milk
>>107409348I rely on rooftop Koreans.
>>1074093892070S is better than 3060 though. SDXL forks work fine...
>>107409383delete the last "t". don't know why that's there. baker fucking with it probably, fucking faggot
>>107409260>>107409232>>107409221it's subtle but noise injection does seem to have an effect
>>107409403>noise injection does seem to have an effectyou can also use thishttps://github.com/BigStationW/ComfyUi-RescaleCFGAdvanced
When you increase nag_sigma_end, you stop NAG earlier than expected. Not only does that make it faster, it also gives the model more time to do some cleanup in "normal mode". NAG is only important at the beginning, when it has to create the scene and add the relevant characters with its superior prompt adherence; the details should be handled without it.
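A toy illustration of that behaviour follows. It assumes NAG is applied only while the sampler's current sigma is above `nag_sigma_end`, which is how the post reads. This is not the node's actual source, and the sigma schedule is invented for the example.

```python
def nag_active(sigma, nag_sigma_end):
    """True while the current noise level is still above the cut-off (assumed rule)."""
    return sigma > nag_sigma_end

def count_nag_steps(sigmas, nag_sigma_end):
    """Count how many sampling steps would still run with NAG enabled."""
    return sum(nag_active(s, nag_sigma_end) for s in sigmas)

# Made-up 8-step schedule going from high noise to low noise.
sigmas = [1.0, 0.9, 0.75, 0.55, 0.35, 0.2, 0.1, 0.03]

late = count_nag_steps(sigmas, 0.1)   # low cut-off: NAG runs on 6 of 8 steps
early = count_nag_steps(sigmas, 0.5)  # higher cut-off: NAG runs on 4 of 8 steps
```

Because sigmas fall during sampling, a larger `nag_sigma_end` is reached sooner, so raising it hands more of the late, detail-refining steps back to normal sampling.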
>>107408828
>>107408970
anon's dark fantasy lora
>>107409471even migu is tired of his bullshit
>>107409471That's me! I am the one who is depicted in this image!
>>107409485That looks good, why doesn't he put that on civitai? >>107406069
Ladies, I think we jumped into the wrong system..
>>107409471Where did you get this image of??
>>107409423interesting
>>107409294>hummingbird kissCute
>>107409503that's pretty good, you're using a lora for that one?
>>107409471is that zimage? cute migu
I NEED to know the size of base!
>>107409314
>Diffusion-Pipe
Do I have to jump through a bunch of hoops and mirror HF to work offline like I do with AI-Toolkit?
>>107408774
For the last time, since you guys have brain damage and can't read the paper: the base model is the same size as the turbo model, 6B. It will be slower because you can't use 8 steps; you'll need 3-4x the steps to get an image, or even more, like prior models.
>>107409577eat a dick nigga
>>107409446nag sigma end being backwards from what you expect is so on brand for comfy
>>107409589512x512 images of only her face right?
>>107409591Then why is it taking so long to release if its just a worser turbo??
>>107409547nah just zit and a ludicrously long prompt
>>107409577Why, it won't be released anyway
The anime girl with large breasts sits at a desk in a Japanese classroom.even with minimal details wan does well. the kijai MoE lora for high helps a lot too (latest lightx2v update, but fixed)https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors
>neoforge>like 3 downloads>zit workflow for comfyUI>1 GORLLION DILLON DOWNLOADS
>>107409020any idea why cfg isn't configurable on a step by step basis? It's the same idea. So is nag. mess with the latent, after a step is finished.
>>107409591>>107409606It's not meant to gen on, it's meant to train loras on, and then gen those loras on Turbo
>>107409613>it won't be released anyway
holy shit i just got access to wan 2.5 from a friend. it's 12 different 28gb models, with a gradient of noise levels. I just trained a lora of a cat (12x 1.2gb safetensors) and the results are magical.
>>107409663Tree branches don't grow 1 ft from the ground. Whoever made this image has literally never stepped foot outside.
>>107409620The anime girl with sits and reads a book with the title "LDG" in a Japanese classroom.
>>107409606>Then why is it taking so long to release if its just a worser turbo??the potential is too big, if someone finetunes that monster it'll end up being too powerful for the goyims
still no base Z image?
>>107409591Paper is just a paper. Until I can run it on my own pc I treat it like it doesn't exist.
>>107409687can you share the flow for this?it's so damn clean
>>107409608>ludicrously long promptBut can it depict ludicrously high speed?
>>107409721uh no? lol
>>107409423>>107409523it does sound nice to be able to control cfg better.
>>107409691they're wrapping it and tying the bow.
>Loading checkpoint shards: 100%|##########| 3/3 [06:24<00:00, 128.13s/it]
does this take ages for anyone else?
>>107409721
it's just wan 2.2 from the templates
Lora setup for wan 2.2:
HIGH:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors
LOW: 2.2 lightning low:
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22-Lightning/old/Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors
1 strength for both
Finally got this setup to work, didn't think it ever would> First person view, sitting on a camp mattress inside a tent, the male legs of the subject are visible, slightly spread. A cute young woman sits across him on the other end of the same mattress. Her legs are intertwining with his. She is looking at the camera.
>>107409765kek that one is better
>>107409600No, 1024+ all around.
0.4 or so is the sweet spot for the fantasy lora. otherwise it seems like it is deforming stuff but this depends of course
so whats currently the best workflow for Z image turbo?
>>107409811>0.4 or so is the sweet spot for the fantasy lora.Oh shit I was at 0.8 that explains why it couldn't do text
>>107409811What lora are we talking about?
>>107409633> why does no one want to use inferior tool?
Has anyone trained a lora that actually works well at 1.0 strength
>>107409849-> >>107409499
>>107409519wut>>107409566yeah it's z
>>107409811prompt?
>>107409880>wutOf me, fuck. This image of me.
>>107408185proper workflow in SD reforge to upscale images?i'm using the standard 1024x1024 proper resolution but i want to upscale at least x3.I have 24GB vram.
>>107409403the one on the right is better
>>107409811
>>107409832
Do you think Comfy fucked up the lora "fix"? It's weird that we have to use a strength lower than 1 to get the normal effect of loras
>>107409184
Just use Linux and ComfyUI. It works. I gen with wan, Z, qwen, SDXL, chroma etc. on my 7900 XTX. LLMs with llama.cpp work great too; I have been running GLM Air and GPT OSS 120B with weights split between GPU and CPU.
>>107409261
>>107409189
nvidia FUD. I get why you would prefer Nvidia since it has better support, but why would you be such a mental slave that you actively encourage others to be stuck with a monopoly?
>>107409964It's a turbo model anyway and would make sense to lower the strength but I don't know anything.
>>107409981FUD lol... go back to slashdot
>>107409964I think this is all ZIT Turbo loras. If you read civitai descriptions, they all recommend really low weights. It probably has something to do with making a lora for a turbo model, which is why people are waiting for basedot.
>>107409693>>107409785Interesting theme
>>107409410bruh wtf, how can flux 2 fuck up the fingers like that?
>>107410021For a second, I misread that as "breastfeeding stall." Hot concept either way though.
>>107409811did you cook at 8k steps and high lr yet again?
>>107410049i didn't make the lora >>107409872
>>107409355
Switched to this
https://civitai.com/models/2169712/z-image-turbo-quantized-for-low-vram
and it ran fine. it ain't flipping over for my 2070S till its flipping over
>>107409880what did you prompt? comic of ___? any style prompts?
>>107409799why's she so.... demonic looking, the missing nose, and the alien eyes?
>>107410147You are playing demonic games sirwhere does it end sir
The camera zooms out on the anime girl wearing a japanese school uniform and she drinks a glass of water in a Japanese classroom.
>>107410041It's called panel van.
Zimg is kinda back to old days of warped spacetime and unreal distances.
>>107410072How did you run the caption? Through API or local? I want to try a LoRa with boomer captions.
>>107410182it's called sovl anon
>>107410133
>A horizontal two-panel web comic rendered in clean digital art style with crisp black outlines, flat colors, and expressive character animation
you can use llmarena to enhance your prompts. it's free but be careful what you put there as it's basically public
>>107410182stop! employees aren't allowed to pocket cash, you find anything.
>>107410172one moreThe camera zooms out on the anime girl who is wearing a japanese school uniform in a Japanese classroom.this turned out nice, no cut!
>>107409892
I try to follow a shopping list structure of
>general description of the image in a couple of sentences, listing the characters in the image and locations/qualities of items etc.
Followed by 2-3 sentence simple descriptions for each thing in the scene:
>character description 1
>character description 2
>character description 3
>asset(s) description(s)
>background description
Then keep working with the descriptions until the image looks ok or funny. No word salads.
>bro A1111/stable diffusion is outdated shit it's all about comfyui
>ok
>install everything
>get a good workflow
>plug in the same model/lora/etc I was using in A1111
>run
>the result is literally almost 1 to 1 the same thing A1111 spits out
Wow real fucking amazing comfy shills *smack*
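The shopping list structure above can be sketched as a tiny helper that joins the sections in that order. `build_prompt` and the example scene are made up for illustration, and nothing here is specific to any UI.

```python
def build_prompt(overview, characters, assets, background):
    """Join prompt sections in a fixed order: overview, characters, assets, background."""
    parts = [overview, *characters, *assets, background]
    # Drop empty sections so a missing asset list does not leave stray spaces.
    return " ".join(p.strip() for p in parts if p and p.strip())

prompt = build_prompt(
    overview="A photo of two hikers resting outside a mountain hut at dusk.",
    characters=[
        "The first hiker is a tall man in a red jacket, holding a thermos.",
        "The second hiker is a woman in a yellow raincoat, sitting on a bench.",
    ],
    assets=['A wooden sign above the door reads "LDG".'],
    background="Snowy peaks and a purple sky fill the background.",
)
```

Keeping each section as its own short string makes it easy to rework one description at a time and regenerate, which is the iteration loop the post describes.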
>>107410157when the power of the cross is too strong, it splits the demon into two non-demons even.
>>107410273if you don't need to change anything, then you don't need comfyui.butthe moment you do...
>>107410273I thought that's not supposed to happen unless you're using those nodes that replicates A1111 settings
>>107408717>https://www.youtube.com/watch?v=iNM5z8cCH8wouhh mama mia!
>>107410273I thought it was supposed to be worse, hmmm?
>>107410304
there's some minor differences but hardly worth using all these nodes and workflows when A1111 just worked fine with a generic simple GUI. I'm just not seeing the point of why I'd use comfy at all when the result is basically the same shit.
>>107410297
I mean idk? what's the best workflow that does regional prompting? maybe I'll give that a shot and see if it wins me over on comfy. This workflow seems to just gen with no special features besides all the adetailers just being included
>>107410273
>the result is literally almost 1 to 1 the same thing A1111 spits out
>>107410282
>>107410333Because you replicated the A111 workflow to do basic 1girl shit. If you had an imagination, what could you build in comfy?
>>107410186Local, with a simple python script
>>107410147Nice crat.
Can someone tell me how to get zit to give me a fucking side view?Probably skill issue on my part but it's always the subject starting directly at the viewer
>>107410356she even lost her left leg and is still happy, that's a girl of focus, commitment and sheer fucking will!
>>107410215illustrious?
is it just me or is zit overhypedfine tuned sdxl models look much nicer and are only slightly less controllable
>>107410373anything for the onion
>check civitai for other people's prompts>they're littered with useless fluff prose and LLM-ismsHow hard is it to just write a basic paragraph of your 1girl
>>107410400what more do you need other than>1girl, tomboy, flat chest
If you had to choose between Wan 2.2 and Z-Image for the "Model of the Year" award, what would be your pick?
>>107410409>1girl, tomboy,BASED>flat chestyou're describing a man anonhttps://www.youtube.com/watch?v=Zd8vzIRQLLM
>>107410423Wan 2.1
https://civitai.com/models/958009Is this good?
>>107410378Zimage.
>>107410429I woman isn't just a chest, it's the shoulder to hip ratio that makes a woman
>>107410466there's no women that exist with perfect flat chest, what are you smoking m8
>>107410423wan is more versatile, does animation obviously, also kinda works as an edit model and it probably has the best anatomy of any model. praying for a wan 2.5 christmas
>>107410452I don't see what this lora can do that the turbo model can't lol
>>107410486>no women that exist with perfect flat chestNo 3D women maybe, that's why 3D sucks
>>107410465Wow so cool anon!
>>107410423>If you had to choose between Wan 2.2 and Z-Image for the "Model of the Year" award, what would be your pick?Z-Image, local image models are now so close to the best API models, can't say the same for video models, we still don't have sound bruh>>107410499you're just describing a femboy dude lol
>>107410508>we still don't have sound bruhLTX-2 will be open-sourced next month and it does have sound. Let's hope it will be good
>>107410382>is it just me or is zit overhypedIt's underhyped. There's still performance to be had if you could disable the safety and asianizer.
>>107410465how'd you convince it to output that style and texture?
Can someone check on trooncord and see if they really said that? Can't believe we're getting the "2 more weeks" meme again :(
>>107409905Uhmmm... m'lady, why art thou alone in a place such as this? *tips fedora nervously*
>>107410016hot
>>107410465for a model focused on realism it can produce way better artistic shit than flux and Qwen Image lool
>prompt literally anything in Z>get a woman staring directly at cameraI see, so this is the future of AI...
>>107410543Chinese don't celebrate Christmas.
>>107410543>>107410565that remind me of last year when we were waiting for wan to be released and we had to wait until next year because of some chink christmas or something lol
>>107410543POOP!
>>107410543itsso fuckingover
>>107410565The Chinese mostly just think Christmas stuff is American-ish stuff. Sort of like how we think of their lamp festival as just Chinese stuff.
>>107410543>>107410630don't doom we don't know if they really said that, I want a discord screen right now!
Time for my hourly reboot because comfy get inexplicably slow.
New to anime genning, which is your favorite Z lora and SDXL lora and model? What should I learn more about, Z models or SDXL? Pros and cons of each one?
Comfyui newfag from earlier here, how exactly do I tell this thing to spit out 10 images per gen? similar to batch genning from A1111. I can't seem to find the node that handles that?
>>107410308let's hope the base model will be more receptive to styles, turbo is too restrictive on its choices
I love doing prude girls being cute but also hot. Does anybody know how I can increase details in lingerie using adetailer, or am I forced to do it with manual inpaint? (I wish I could make models for adetailer)
>>107410648just use the unload all models node
>>107410675This?
>>107410675there should be a batch size setting somewhere
>>107410672NTDMix/GENESIS lora, NlxlMix, Art-illustrious, 2DN.
>>107410704>does anybody knows how can i increase details in lingerie using adetailer or i'm forced to do it with manual inpaint >>107409423
>>107410713no it's not that
>>107410704Nice style!
>>107410713>>107410715I will look into those>>107410722idk who this is
>>107410672And RouWei, massive finetune of illustrious with more concepts/artstyles/better natural language prompting.
I thought Batch was for genning multiple workflows simultaneously, whereas the Queue was for genning multiple workflows serially.
>>107410704>natural language promptingSnake oil
>>107410713>>107410715nta but what's the difference here? wtf is batch size vs batch count
>>107410736>idk who this isit's me.... the batch count setting functions as if you pressed the run button ten times. that's not what you want, right?
>>107410753>>107410745You
>>107410716>Yeah no, 3 inches is totally fine
>>107410765
I mean it is, but the thing is this just spits out 10 different generations on 10 different workflows I need to shuffle through. Is there a way to just give me 10 images? A1111 would give them to me in a tiled list (that I can individually go through 1 by 1)
>>107410704M'lady... *tips fedora(Download segmenter model from CivitAI... *adjusts glasses*...for the lingerie detection, if you know what I mean. *winks awkwardly(Then select it in ADetailer.
>>107410769Sure, so use booru prompting and enjoy the 14m image finetune.
>>107410753how should i do it and do you get better results?
first person perspective from above a japanese woman dressed as Hatsune Miku standing in the water of a swimming pool at night who is wearing a sleeveless white blouse and black miniskirt, in Japan. Miku's arms are outstretched, and she is smiling.
>>107410776
Batch size set on the latent image node is not the same as pressing the run button x times. Set batch size 4, seed to increment, and run. Run again: the first image from the second batch will not be the second image from the first batch.
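A toy sketch of why that happens, assuming the whole batch is filled from one seeded generator in a single pass. This is not ComfyUI's actual code; `batch_noise` and `latent_len` are made-up names for illustration, and plain `random` stands in for the real noise tensor.

```python
import random

def batch_noise(seed, batch_size, latent_len=4):
    """Hypothetical stand-in: one seeded generator fills every image in the batch."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(latent_len)]
            for _ in range(batch_size)]

first_run = batch_noise(seed=100, batch_size=4)
second_run = batch_noise(seed=101, batch_size=4)   # seed incremented between runs
repeat_run = batch_noise(seed=100, batch_size=4)   # same seed, same batch size

# Same seed and batch size reproduces the whole batch.
print(first_run == repeat_run)
# Image 2 of the first batch is not image 1 of the next batch.
print(first_run[1] == second_run[0])
```

Under this assumption, image N of a batch comes from the Nth slice of one seed's noise stream, so it only lines up with a single-image run of the same seed when N is 1.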
>>107410794added: anime style image
>>107410540
I think it's a combo of "pale color, muted color, painting \(medium\), oil painting \(medium\)"
This prompt was basically random, I threw my old noobai prompts at the wall and looked at what would stick.
>>107410559
I'd say Z has a decent amount of non-realistic art styles in it. But if we get a Chroma styled finetune it's gonna be insane.
Poll
https://poal.me/kse2wp
https://poal.me/kse2wp
https://poal.me/kse2wp
https://poal.me/kse2wp
https://poal.me/kse2wp
>>107410807in the style of an oil painting:
RouWei is shit at both natural language and Booru tags.
End of the discussion.
>>107410818
>No Flux 2
come on anon, give it a chance and show everyone how nobody gives a fuck about that model kek
>>107410818wait what is illustrious v2?
>>107410818
zimage improved everything a lot but we could still do almost all of the things it offered, just with more effort and a lot slower
wan 2.1 on the other hand actually allowed video generation to be of any reasonable quality to be usable by anyone at all, and later did so at 10x the speed with loras
>>107410818
>no Qwen
>no Flux 2
>no Hunyuan 2 or 3
>chroma
>>107410851
>wan 2.1 on the other hand actually allowed video generation to be of any reasonable quality to be usable by anyone at all
I'd argue this was Hunyuan Video. Did they ever release the 720p version btw?
>>107410818What about Qwen Image Edit? I have a lot of fun with that model
>>107410818Why not SDXL?
>>107410878in another world hunyuan would have gotten the community effort and probably would have taken that place, but that just didnt happen, it was a worse model
>>107410879
qie was an incremental improvement over kontext dev but was also objectively worse for some things like smaller changes on a person's face, so it can't be the model of the year
>>107410891A lot of the things that made Wan usable at decentish speeds before lightning loras came along were developed for Hunyuan Video, like teacache and torchcompile.
a next level cfg would basically run qwen image edit every step, to remove unwanted things, and enforce wanted things, also with knowledge of prior steps.
you NERDS need to stop SCARING bass.
Chroma can do early CGI pretty nicely, I was going for the fallout 1 death screen with this one
>>107411074looks like Conan Exiles
>>107411074proompt
>>107410870
Would anyone here actually use Qwen-Image as a "daily driver" for T2I (not editing)?
It's just too slopped to be useful, and if you care about doing advanced/high end stuff you may as well just use the nano banana pro API
tf kinda tent is this
>>107411136
A group of people in an unemployment line outside a building in the city named "UNEMPLOYMENT OFFICE". the people are all wearing tshirts that say "flux 2".
>>107411114
A landscape of a desert in the style of the videogame Fallout 1. There are buildings in ruins and ancient technology scattered along. There is a human ribcage and skull lying in the foreground. Daylight. Detailed, volumetric lighting, 4k, high res, old CGI style
Chroma 1-HD is worse with this btw, seems like it's a fine-tune to speed up high res gens
So what's the value/node/whatever in comfyui I have to tweak to make the generation "randomize" more of the final result while still mostly adhering to the prompt? I want to see some more variety in the end results without changing the core character/theme
https://www.reddit.com/r/StableDiffusion/comments/1pchpjb/quick_psa_the_stablediffusioncpp_implementation/
>With my 2060 6GB and the fp16 hack I get about 7.5-8s/it on z-image with comfyui. I've tried several different speedups like cache-dit and the gguf nodes (to limit offloading), but they either look noticeably worse (cache-dit) or make no difference (gguf).
>Now with StableDiffusioncpp I'm getting 4s/it, nearly a 2x speed increase without any noticeable quality degradation.
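Quick sanity check on the "nearly 2x" claim, using only the s/it figures quoted from the Reddit post. The variable names are mine and nothing here was measured.

```python
# Seconds per iteration as quoted in the post (not measured here).
comfy_low, comfy_high = 7.5, 8.0   # ComfyUI range on the 2060 6GB
sdcpp = 4.0                        # stable-diffusion.cpp figure

# Speedup is old time per step divided by new time per step.
speedup_low = comfy_low / sdcpp
speedup_high = comfy_high / sdcpp
print(f"{speedup_low:.3f}x to {speedup_high:.3f}x")  # prints "1.875x to 2.000x"
```

So the quoted numbers work out to roughly 1.9x to 2x per step, for that one card and setup.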
>>107411130what lora?
leave bass alone
tried to install NAG for a basic z workflow, but I'm getting an error. does anyone know what I have to look up to fix this? I'm running the patientx fork of comfy
https://files.catbox.moe/hxkzcj.json
>ValueError: Model type <class 'comfy.ldm.lumina.model.NextDiT'> is not support for NAGCFGGuider
>>107411204
that's because you didn't install the good NAG branch, it's this one
https://github.com/scottmudge/ComfyUI-NAG
all these fucking NAGgers i swear to god
I am not afraid anymore..I am a NAGger
I get OOMs with NAG.
>>107411166makes sense, comfy is trash
>>107411229thanks! it's working now
>>107411300
>it's working now
if you don't know the right parameters for NAG, I recommend these
>cfg 1, nag_scale 3, nag_tau 1, nag_alpha 0.25, nag_sigma_end 0.75
>>107411309>shoes
>>107411167psxam
Close enough kek
>>107411324>>107411190kino
>>107410023it's not a fined tuned model anon. Jesus vramlet hours today
>>107411324dont worry /ldg/, i will hunt down the bass
>>107411341>it's not a fined tuned model anon.no one is gonna finetune a distilled 32b model with a shit licence when Z-Image exists, are you fucking retarded?
FOR ZE VATERLAND
>>107411309hey, thats pretty cool, sexy school girl shoes too
>>107411165PLEASE REPLY
>>107411341What...a fucking 32B model will need a finetune to be good?
>>107411353A-ALL I EVER WANTED
>>107411229Why hasnt the original devs just update their repo? Whats going on?
>>107411366That's pretty cool. First I thought it was the ComfyUI dev.
>>107411341>muh vramletsdead model enjoyer
>>107411400
>Why hasnt the original devs just update their repo?
maybe they got them
>*insert bogdanov phone meme since the max limit of image replies has been reached :(*
>>107411375Was to see you smiling
>>107411400multiple devs are sick of the comfy update humiliation ritual. they might be fed up as well
bake so I can post the good spidermans already
Baker?
>>107411401>>107411366sorry, fixed
This is the last /ldg/ thread.
ahhhh image limit ahhh
>>107411436
Comfy broke NAG because he changed the names (layers, blocks, etc) of things that nothing actually uses yet.
i want my money back, this baker isn't doing his job
>>107411475
yeah but there are some PRs that fix that and are just waiting to be merged, all they have to do is press one button and they're good to go...
>>107411158Black farting logs studio still got 300 million in investments because of flux 2, you don't hate boomers enough.
Ahh Baker-Sama ahh... ahh...I- hmmmg... I-I have slop to p-post...Ahh.. B-Baker-Sama please don't tease me like this...
No bread until bass
Rehydrate yourselves.
NOOO NOT THE IMAGE LIMIT!!
>>107411158holy same face, and they all look underage
wait, there's a limit on how many images you can post? since when?
it's over, this is how /ldg/ dies
>>107411523Since forever.
>>107411522That would explain how they released such a shitty model
>>107411528nah, never happened in the past couple years
I want newfags to leave.
>>107411314thanks, i'll try those
new thred>>107407830>>107407830>>107407830
>>107411558nigger
>>107411558oh debo you sneaky bastard
Nevermind this image limit, why cant I post an image from incognito anymore? This site keeps getting shittier mang
>>107411610its so mossad can track you, they don't finna mess with incognito
No base.No bake.This is the end.
>>107411632pack it up folks, its been... something
fine, I'll do it myself.
Come on I need to post anime girls>>107411653
/ldg/...Forgive me...*dies*
>>107411725>>107411725>>107411725
>>107411727heroic bake