Previous /sdg/ thread : >>107314383>Beginner UIEasyDiffusion: https://easydiffusion.github.ioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Advanced UIComfyUI: https://github.com/comfyanonymous/ComfyUIForge Classic: https://github.com/Haoming02/sd-webui-forge-classicreForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeStability Matrix: https://github.com/LykosAI/StabilityMatrix>Early Preview UIAniStudio: https://github.com/FizzleDorf/AniStudio>Flux.2 Devhttps://comfyanonymous.github.io/ComfyUI_examples/flux2https://huggingface.co/black-forest-labs/FLUX.2-devhttps://huggingface.co/city96/FLUX.2-dev-gguf>Qwen Image & Edithttps://docs.comfy.org/tutorials/image/qwen/qwen-imagehttps://huggingface.co/Qwen/Qwen-Imagehttps://huggingface.co/QuantStack/Qwen-Image-GGUFhttps://huggingface.co/QuantStack/Qwen-Image-Distill-GGUFhttps://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF>Text & image to video - Wan 2.2https://docs.comfy.org/tutorials/video/wan/wan2_2https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUFhttps://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUFhttps://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF>Chromahttps://comfyanonymous.github.io/ComfyUI_examples/chromahttps://github.com/maybleMyers/chromaforgehttps://huggingface.co/lodestones/Chroma1-HDhttps://huggingface.co/silveroxides/Chroma-GGUF>Models, LoRAs & upscalinghttps://civitai.comhttps://tensor.arthttps://huggingface.cohttps://tungsten.runhttps://yodayo.com/modelshttps://www.diffusionarc.comhttps://miyukiai.comhttps://civitaiarchive.comhttps://civitasbay.orghttps://www.stablebay.orghttps://openmodeldb.info>Index of guides and other toolshttps://rentry.org/sdg-link>Related boards>>>/h/hdg>>>/e/edg>>>/d/ddg>>>/b/degen>>>/vt/vtai>>>/aco/sdg>>>/u/udg>>>/tg/slop>>>/trash/sdg>>>/vp/napt
>>107328651op time to remove ani, no one asked for it, no one uses it, no one cares for it
>mfw Resource news11/25/2025>FLUX.2: Frontier Visual Intelligencehttps://bfl.ai/blog/flux-2>FLUX.2-dev-GGUFhttps://huggingface.co/orabazes/FLUX.2-dev-GGUF>FLUX.2 Day-0 Support in ComfyUI: Frontier Visual Intelligencehttps://blog.comfy.org/p/flux2-state-of-the-art-visual-intelligence>Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokenshttps://wakalsprojectpage.github.io/comt-website>DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generationhttps://zehong-ma.github.io/DeCo>Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoninghttps://github.com/hqhQAQ/Syn-GRPO>Learning Plug-and-play Memory for Guiding Video Diffusion Modelshttps://thrcle421.github.io/DiT-Mem-Web>DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detectionhttps://huggingface.co/datasets/Chaos2629/Diffseg30k>FlowPortal: Residual-Corrected Flow for Training-Free Video Relighting and Background Replacementhttps://gaowenshuo.github.io/FlowPortalProject>Trump Launches Genesis Mission, Harnessing AI for US Energy, Science and Security Dominancehttps://www.capitalaidaily.com/president-trump-launches-genesis-mission-harnessing-ai-for-us-energy-science-and-security-dominance11/24/2025>cc12m-1mp_plus-realistic: Filtered CC12M dataset for 1mp+ realismhttps://huggingface.co/datasets/opendiffusionai/cc12m-1mp_plus-realistic>simpletuner v3.1.3 with Kandinsky5, ACE-Step music training, and a webUIhttps://github.com/bghira/SimpleTuner/releases/tag/v3.1.3>Hunyuan 1.5 step distilled lorashttps://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/tree/main>MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Modelshttps://github.com/itsnotacie/MMT-ARD>Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formatshttps://github.com/SooLab/AllPath
>mfw Research news11/25/2025>ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptationhttps://arxiv.org/abs/2511.19145>LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Contexthttps://lumitex.vercel.app>Are Image-to-Video Models Good Zero-Shot Image Editors?https://arxiv.org/abs/2511.19435>Breaking the Likelihood-Quality Trade-off in Diffusion Models by Merging Pretrained Expertshttps://arxiv.org/abs/2511.19434>In-Video Instructions: Visual Signals as Generative Controlhttps://arxiv.org/abs/2511.19401>Growing with the Generator: Self-paced GRPO for Video Generationhttps://arxiv.org/abs/2511.19356>SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservationhttps://arxiv.org/abs/2511.19320>Evaluating Dataset Watermarking for Fine-tuning Traceability of Customized Diffusion Models: A Comprehensive Benchmark and Removal Approachhttps://arxiv.org/abs/2511.19316>BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignmenthttps://limuloo.github.io/BideDPO>ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detectionhttps://arxiv.org/abs/2511.18780>LAST: LeArning to Think in Space and Time for Generalist Vision-Language Modelshttps://arxiv.org/abs/2511.19261>STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolutionhttps://jychen9811.github.io/STCDiT_page>FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generationhttps://arxiv.org/abs/2511.19137>When Semantics Regulate: Rethinking Patch Shuffle and Internal Bias for Generated Image Detection with CLIPhttps://arxiv.org/abs/2511.19126>Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generationhttps://arxiv.org/abs/2511.19049>VeCoR - Velocity Contrastive Regularization for Flow Matchinghttps://p458732.github.io/VeCoR_Project_Page
>mfw MORE Research news>One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Controlhttps://mizhenxing.github.io/One4D>Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generationhttps://arxiv.org/abs/2511.18919>DiP: Taming Diffusion Models in Pixel Spacehttps://arxiv.org/abs/2511.18822>ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusionhttps://arxiv.org/abs/2511.18742>VLM in a flash: I/O-Efficient Sparsification of Vision-Language Model via Neuron Chunkinghttps://arxiv.org/abs/2511.18692>Exploring Weak-to-Strong Generalization for CLIP-based Classificationhttps://arxiv.org/abs/2511.18396>Synthetic Curriculum Reinforces Compositional Text-to-Image Generationhttps://arxiv.org/abs/2511.18378>MagicWand: A Universal Agent for Generation and Evaluation Aligned with User Preferencehttps://arxiv.org/abs/2511.18352>ConsistCompose: Unified Multimodal Layout Control for Image Compositionhttps://arxiv.org/abs/2511.18333>Seeing What Matters: Visual Preference Policy Optimization for Visual Generationhttps://arxiv.org/abs/2511.18719>CoD: A Diffusion Foundation Model for Image Compressionhttps://arxiv.org/abs/2511.18706>Now You See It, Now You Don't - Instant Concept Erasure for Safe Text-to-Image and Video Generationhttps://arxiv.org/abs/2511.18684>Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivershttps://arxiv.org/abs/2511.18673>Robust Posterior Diffusion-based Sampling via Adaptive Guidance Scalehttps://arxiv.org/abs/2511.18471>Point-to-Point: Sparse Motion Guidance for Controllable Video Editinghttps://arxiv.org/abs/2511.18277>Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image Generationhttps://arxiv.org/abs/2511.18281
>>107328855its np, i had the news open all day and just wasnt moving through it at all. just me being extremely lazy and digging up an excuse, lol
Good evening, anons! I hope everyone is doing well :]
>>107330108helloall the buzz is about flux2. only the mightiest of GPUs can run it
>>107330187Heya, Debo!! Great to see you again :]Ohhh sounds exciting!! I'll have to check it out!
>>107330257there's a few links in the resource news for the gguf and comfy implementation. apart from that, I can't help, flux2 is beyond my power level
>>107330321Downloading it now! I hope it works for me haha
Last one from megood night anons>>107330108Thanks PW, gn :)
>>107330377I'm expecting a full report on my desk or you're losing your sdg pension>>107330449early nightgn
>>107330449Good night, Quokkanon!! Sleep well :D>>107330458LOOL Yes sir! I'll have it on your desk before I go to bed!
>>107326889https://suno.com/song/5c797d02-7fb8-428c-9981-233fc37f0089
i miss schizo anon
>>107324001This is great news!>>107326889>https://suno.com/s/1WB2m2b0Vs3UTLxcCool instrumental.>>107330482I'm looking forward to seeing what you can gen with the new model.>>107331069>https://suno.com/song/5c797d02-7fb8-428c-9981-233fc37f0089Nice remix and lyrics.
>>107332095Thank you for all the nigbobumps.
flux.2 dev>Lower VRAM (~24-32G) - RTX 4090 and 5090>LowerThis makes me sad.gm
>gm
Gonna need a H200 to use the flux.2 as designed.
>>107328651Remove AniStudio from OP
>>107334086i don't know, i kinda like the idea of anistudio, the problem is it's just too early and barely functional
ComfyUI with the right custom nodes can do everything Anistudio promises but better and with more control. Why would anyone switch to an unproven, preview build?
I'm a huge fucking retard and I need a big spoon feeding right into my mouth.I literally just installed this, how do I load models?No images I save appear to have any metadata that ComfyUI can load.
>>107334086>>107334140Ani herself requested it to be removed from all OPs
>>107334271>4chan server filenameexif/workflow data is stripped automatically when uploaded so you're not going to find any workflow on images. The anon has to share the image via catbox.moe (or any other file uploading site) for the workflow to be present.
>>107334140Maybe in a year AniStudio will be worth the OP space, for now, it's garbage.
>>107334354proof????
>>107334410I tried it with a catbox image too and it didn't work, but I guess whoever posted it just didn't include the metadata then. So I just drag and drop the image into the UI and it should be able to read it?
>>107334086I repeat:you don't even post here. why do you care?
>>107334495download and drag and drop this into comfyui, it has workflow. Not my image, found elsewhere...https://files.catbox.moe/24i01w.png
>>107334510Ani himself said to remove his UI from the OP. And this genereal must grow and evolve, not stagnate with the same old non functioning UIs,
>>107334547>>107334594Okay yeah that does work, if I can find some in the style/context I want that'll make it easier to figure out what kind of workflows I need.
>>107334598It's labeled as an early preview, which should not cause confusion among new people.
>>107334598>same old non functioning UIsI dont think so, as this anon >>107334839said, early preview means it's actively developed and removing it will kills visibility and potential user feedback.
flux2 with my promptsapparently ai-toolkit can train a lora for itwill have to see over the 4 day weekend
Morning anons
>>107335508will we have to call her flux2girl?>>107335544gm
>>107335147The other sd.cpp based uis are way better and even more popularSo by your logic we should add them all or is there a hidden reason why we shouldn't add them?
>>107335569>will we have to call her flux2girl?ermchromluxgirlor flomagirl
>>107335508Not bad.Is that with the 4-bit version or do you rent an H200 or equivalent?
>>107335693fp8 https://huggingface.co/silveroxides/FLUX.2-dev-fp8_scaledregular model will run on a 5090 i believe
on the plus side, flux2 handles styles (and knows styles) better than most local modelson the negative, it seems kind of soulless. trying to get a good prompt flow going now
on the other other hand, no need to upscale
Good morning, anons! I hope everyone is doing well :]
>>107335823gm did you see >>107327360>>107327393>>107327439>>107327452>>107327504
>>107335840Heya Flux girl anon!! It's so good to see you again!Omg these are amazing hahaha I love those!
lel that chinese model everyone's talking about sure BTFO flux 2 tho, i'll have to play with that later too>>107335873flux2 can definitely have fun times
>>107335823gmpretty early for youany luck with flux2 yet?
>>107335883I just got it working this morning hahaIt took an hour and 45 mins for my first test LOLI figured out what I did wrong tho hahaI like how you can copy the style of other images/gens! That's so cool!>>107335946Yeah haha I gotta go do some work stuff pretty soon but I don't think I'm gonna stay all dayThese gens are done with Flux 2! :]I just got it working like 30 mins ago haha
>>107335823gm PWDay off?
>>107335706Well, I only have 12gb of vram... I'll have to skip until I can do an upgrade.
>>107335979>I gotta go do some work stuffyou dont have tuesdays off anymore? >These gens are done with Flux 2! :]looks good. I kinda wanna see whats up with that mobile game in the background
wait so is this the tranny hang out hugbox? i dont understand the "how is your day" posting
>>107336028comfy unloads to regular ram so you could probably fit it in thomaybe
>>107336043you have to go back
>>107336014Good morning!! :DSuper cool gen! I love that outfit!Kinda hahaha I gotta go shopping in a couple hours but I think that's all I plan to do today>>107336034I didn't for a while but I think I will again soon after the holidays!Thanks hahaha! I like how those came out!
>>107335632Ani lurks here we should support him. Unlike other sp.cpp based UI developers who don't hang around here, Ani participates. That's why I think we should help him out.
>>107336161I understand, but if Julien himself wants it delisted, we should respect his wishes. No dev wants an unfinished project representing him publicly
>>107336062ty - have fun>>107336062the mistral clip is what's causing the oom, I'll try another workflow.
>>107336287>the mistral clip is what's causing the oomyep, maybe there's a way to load that to cpu (if you have the ram that is). i was trying to load my usual workflow which has gemma3-27b as llm (to use as a prompt generator and pass it to flux2) but it was way too much lol
>>107336263>>107335147Support is one thing, spoonfeeding a broken UI to newfags in the main thread list is another. If it's in the OP, people assume it's a viable alternative to Forge or Comfy and it's not there yet, maybe keep it in a rentry or a separate paste until it's functioning at least.
>>107336302>load that to cpuyes. that was a good tip. There was a cpu option. >loaded completely; 95367431640625005117571072.00 MB usable, 1280.59 MB loaded, full load: TrueKEK - it thinks I have 9x1030 MB of VRAM. So it crashed.
>>107336263 sorry, it was to him >>107336161
>>107336459>loaded completely; 95367431640625005117571072.00 MB usablei get that all the time
>>107336585I get CUDA kernel errors
>>107336188>>107336394homeless pw is cute>>107336643>pw spell misfires and she turns herself into a ragdoll
there's too much to keep track of today lol
>>107336658Thanks!! :DLOL! It's crazy how much like the original doll gens!
behold, z-image (running locally)
now to wait for a comfy workflow lol
>>107336915z-image is finally out? I've been looking forward to trying it out after flux2 was too fat for me
>>107336979just out now lolhttps://huggingface.co/Comfy-Org/z_image_turbo/tree/main/split_fileswaiting on a workflow but i used the python script they provided for these
>>107337011>https://huggingface.co/Comfy-Org/z_image_turbo/tree/main/split_filesA Q8 quant should come in around ~7GB which will fit in a lot more GPUs than Flux.2 did.
>>107337094yeah flux2 despite the glitz may be DOA now lol
>>107337316looks pretty good
>>107337350it's pretty fantastic and fast as hell>8/8 [00:05<00:00, 1.40it/s]that was straight 1536x1152
>>107337350>>107337374and apparently it knows porn positions and stuff, not just nsfw lolthis shit's ridiculous
BIGGER
i'm easily impressed
What's with the slopper in /sdg/? Somehow I feel like if a typical anon spams the same character it wouldn't even be much of a deal.Here, he runs the same scene over and over again with no cohesion. It's like abstract without the abstract. It's not going to make anyone ask "what's the story?" It's just a bunch of shit mashed together with the same keywords.
>>107338496no one asked
>>107338509nor did anyone ask for that junk
>>107338522no one asked you
>>107338496enjoy your "story"
>>107338496You're going to have to be more specific, quality is dead here since nigbo chased off all the good posters
>>107338496>>107338595two new models drop, all the prompters are eating good. you, on the other hand, spend your thanksgiving eve recycling boring, worn-out nogen drama baiting
uh oh
>>107338567have you just been using the default euler/simple or have you tried other combos?
dear past mepls pls pls document at least some things before you deprovision everythingthank youfuture mehaha
>>107339146nah in integrated it into my regular workflow, so 24 steps z-image with deis_2m + double sigma stuff (bong tangent)then pass it to a tile upscaler with chroma and more bong tangent for another 20 steps lelit's overkill but it seems to be workingyou're using zimage now? how fast is it for you?
>>107339170lel
>>107339170nice to see youhappens to the best of ushappy thanksgiving eve>>107339178>you're using zimage now? how fast is it for you?yeah. very speedy. 40s at 20 steps. cranked up to 30 steps @ ~60s
>>107338704>spends all 365 days recycling boring, worn-out prompts
>>107339248henlo :)there is so much wrong with my setup right now and the outputs that i do not know where to start fixing it hahai just realized that the outputs are not even squares anymoreso much for being able to clone the exact setup to a different environmenti need time, stupid real life haha
>>107339399>i need time, stupid real life hahayou don't have extra down time this week?
>>107339248nice, push up the res man lol
>>107339489tru
ok now stuff crashed, back to the machine room for me, gn frens haha>>107339453no not reallyand my problem is that when i have some free time i just work more hahaoh well
110s>>107339603gn
eh nm that upscaling was too much loli'll just gen straight at the high resolution
now that everyone can use z-image, the slop will flowit was the best of times, it was the worst of times
>>107339666>>107339732like these>>107340088winrar
>>107340111thx
Also, Black Forest Lab has a playground to try the pro version.
>>107340219bfl lost this one, manz-image just rolls over itexcept the edit (z-image supposedly has an edit model coming soon(tm)flux2 is nice, but it got mogged
>>107340329>bfl lost this one, manits kind of ironic cuz flux's claim to fame was that it popped up after sd3 was too large and too bad. now flux2 gets undercut in a similar fashion
Back to slowness... took 7 minutes with Q3.gguf on a 5070Ti 16gb.
retrying my animal racing prompt on zindex but it definitely doesn't understand at all. prob needs the negative to work
Hello again, anons! :]
>>107340961hello
>>107341149Heyyy Koff!! It's so great to see you again! I loved your new song!
>>107341216i am glad you liked it.i heard this song this evening, one of the nicer ones i have heard lately, sadly i doubt a.i. can ever match it:https://youtu.be/3EBTk5brQVY
nite
>>107341359Ohhh I like this! It's really mellow and unique!I tried to make something close to the stylehttps://suno.com/s/5JxcUbqoPpsrBAhB>>107341548Good night!! Sleep well :]
>>107340961hellothere's yet another new model out. people like it more than flux2: >>107337011>>107341548gn, sorry I missed you
>>107342075LOOL just when I thought I caught up hahaha! I'll try it out!
Any advice for genning anything that features a couple?Concepts from one character keep bleeding into the other character and vice-versa... I tried using "BREAK" but it doesn't seem to do much
ZIMAGE, 7 seconds..we back.
>>107342154lol, yeah, its kinda crazy to get two new sota models back to back. might be more surprises before the end of the year too>>107342203there's not really a silver bullet for prompting multiple characters. newer models can understand prompts better but will still bleed, plus they'll be much slower on averagecloud models are way ahead of local on this front. people have done really impressive composites with nano banana and gpt imagefor local, you can try looking into regional conditioning or latent couple. there are some nodes/workflows but its kind of finicky and unreliable>>107342334you can gen an image faster than a rocketeer can kill you
>>107342438Oh wow! This gens in 4 seconds! Much better than waiting 2 mins for Flux 2 hahaha!
>>107342454>>107342334Nice!Specs? I doubt I'll be able to run it on my 6GB laptop but it's just really impressive
>>107342454you can pump the resolution a lot too. its pretty crazy>>107342487z-image is very small. actually smaller than flux1, I think. idk if there are quants out yet tho
>>107342487I have 24gb (4090)!I think you might be able to run it! It's way faster than anything i've used so far I think! Also the stuff you gotta download isn't too big either>>107342526That's one of the first things I tried hahaha! Slowly going up! This one only took 9 seconds
15 seconds!Plus it got the characters right haha
>>107342652>the witch mafia plans a murder
24 seconds! Gonna have to add a thing to save as jpg haha it got too big>>107342663LOL
>>107342663>All those broccoli perms Damn, now that's hyper real
Wow I don't think i've ever made gens this big haha35 seconds
>>107342764lol, zoomer heavy dataset it seems
>>107342790Maybe this model knows more zoomer celebs
this shit is so crazyat least now people cant bitch about my gens since everyone's going nuts with both z and flux>>107328651op get rid of ani and add z-image you crazy bastard
>>107342926only one guy complains about your gens and he only does it because he wants attention, not because he has valid opinions
>>107342926I'll get zimage added. It's going to look a bit weird without either docs or github.io tutorial linked and also a quant repo for now.
3840x2160 takes 55 seconds but things get weird hahahaIt also slows down my pc and goes over 4mb a lot, even as a jpg
>>107342963are you just testing the limits of your gpu on how high you can gen? because if you just want high res dont go higher than like 1536 and then do an upscale with model or whatever, much easiereven if you use a slow upscaler yo've already saved what, a minute or a minute and a half anyway>>107342961lel i know, i was being facetiousstill, everyone's mixing all kinds of shit up now, it's funny, i feel like my gens arent pushing the limits anymore
I will need to know how z-image performs on quokka benchmarks
>>107342999Yeah hahaha I just wanted to see how high I could get it with the basic workflow! It was fun to play with it :]I'll likely play around with upscaling tomorrow, getting kinda sleepy hahaI really like this model, my only thing is that I noticed that if you gen with the same proompt over and over it looks almost the same as the one before it but i'm sure that could be easily fixed>>107343122It does a pretty good quokka hahaha
>>107343145yeeah people have mentioned different seeds look the same on the same prompt, but i havent seen that then again i use 4 or 5 different seed generators on different nodes lol
>>107343155>different seeds look the same on the same promptI definitely see that, even with fairly wildcarded prompts. that behavior is what ultimately turned me off of base flux1
>>107343122>>107343145Gotta say it's pretty accurate to real pics (unlike flux or chroma), nice, can't wait to try it.
>>107343173there's only so many ways a news blurb can look no?
Next Thread>>107343201>>107343201>>107343201>>107343194Have you tried your group shot from the last thread in z-image yet?
>>107343155Good idea haha I might have to try something similar tomorrow!>>107343181I hope it works well for you!! :DI'm sure it will!
>>107343194the layout, composition and positioning, even the anchors are nearly identical between them all. there's a lot of room for variety that isnt being explored. here's the same thing with the DJ prompt, before I was able to loosen it up some
i was going to skip this gen but then i saw she was making a proper thumbs down>>107343206oh i should lol>>107343245ah i see it