Previous /sdg/ thread: >>106878537

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Flux.1 Krea
https://docs.comfy.org/tutorials/flux/flux1-krea-dev
https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
https://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>mfw Resource news

10/14/2025

>QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
https://github.com/NVlabs/QeRL
>Scaling Language-Centric Omnimodal Representation Learning
https://github.com/LCO-Embedding/LCO-Embedding
>ChatGPT will soon allow erotica for verified adults, says OpenAI boss
https://www.bbc.com/news/articles/cpd2qv58yl5o
>Diffusion Transformers with Representation Autoencoders
https://rae-dit.github.io
>DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training
https://fenghora.github.io/DiT360-Page
>High-resolution Photo Enhancement in Real-time: A Laplacian Pyramid Network
https://github.com/fengzhang427/LLF-LUT
>Towards Self-Refinement of Vision-Language Models with Triangular Consistency
https://github.com/dengyl20/SRF-LLaVA-1.5
>AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration
https://avocado-captioner.github.io
>ComfyUI DreamOmni2 Node
https://github.com/HM-RunningHub/ComfyUI_RH_DreamOmni2
>Graph Your Own Prompt
https://darcyddx.github.io/gcr
>VORTA: Efficient Video Diffusion via Routing Sparse Attention
https://github.com/wenhao728/VORTA
>Syn-Vis-v0: A Dataset of Synthetic Faces
https://huggingface.co/datasets/retowyss/Syn-Vis-v0
>Silly Caption: Lightweight, brower-based AI autocaptioning tool
https://github.com/obsxrver/SillyCaption

10/13/2025

>Boosting Multi-modal Keyphrase Prediction with Dynamic Chain-of-Thought in Vision-Language Models
https://github.com/bytedance/DynamicCoT
>Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
https://stable-video-infinity.github.io/homepage
>MMAudioSep: Taming Video-to-Audio Generative Model Towards Video/Text-Queried Sound Separation
https://github.com/sony/mmaudiosep
>Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
https://kangliao929.github.io/projects/puffin
>mfw Research news

10/14/2025

>GIR-Bench: Versatile Benchmark for Generating Images with Reasoning
https://hkust-longgroup.github.io/GIR-Bench
>Point Prompting: Counterfactual Tracking with Video Diffusion Models
https://point-prompting.github.io
>FACE: Faithful Automatic Concept Extraction
https://arxiv.org/abs/2510.11675
>InfiniHuman: Infinite 3D Human Creation with Precise Control
https://yuxuan-xue.com/infini-human
https://arxiv.org/abs/2510.11650
>IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment
https://ryanchenyn.github.io/projects/IVEBench
>Massive Activations are the Key to Local Detail Synthesis in Diffusion Transformers
https://arxiv.org/abs/2510.11538
>Uncertainty-Aware ControlNet: Bridging Domain Gaps with Synthetic Image Generation
https://arxiv.org/abs/2510.11346
>DiffStyleTS: Diffusion Model for Style Transfer in Time Series
https://arxiv.org/abs/2510.11335
>Zero-shot Face Editing via ID-Attribute Decoupled Inversion
https://arxiv.org/abs/2510.11050
>ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation
https://nenhang.github.io/ContextGen
>BLEnD-Vis: Benchmarking Multimodal Cultural Understanding in Vision Language Models
https://arxiv.org/abs/2510.11178
>Demystifying Numerosity in Diffusion Models -- Limitations and Remedies
https://arxiv.org/abs/2510.11117
>Compositional Zero-Shot Learning: A Survey
https://arxiv.org/abs/2510.11106
>CoDefend: Cross-Modal Collaborative Defense via Diffusion Purification and Prompt Optimization
https://arxiv.org/abs/2510.11096
>DEMO: Disentangled Motion Latent Flow Matching for Fine-Grained Controllable Talking Portrait Synthesis
https://arxiv.org/abs/2510.10650
>IUT-Plug: A Plug-in tool for Interleaved Image-Text Generation
https://arxiv.org/abs/2510.10969
>DreamMakeup: Face Makeup Customization using Latent Diffusion Models
https://arxiv.org/abs/2510.10918
>mfw MORE Research news

>DreamMakeup: Face Makeup Customization using Latent Diffusion Models
https://arxiv.org/abs/2510.10918
>SceneTextStylizer: A Training-Free Scene Text Style Transfer Framework with Diffusion Model
https://arxiv.org/abs/2510.10910
>Discrete State Diffusion Models: A Sample Complexity Perspective
https://arxiv.org/abs/2510.10854
>VLM-Guided Adaptive Negative Prompting for Creative Generation
https://shelley-golan.github.io/VLM-Guided-Creative-Generation
>Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey
https://arxiv.org/abs/2510.10671
>Scalable Face Security Vision Foundation Model for Deepfake, Diffusion, and Spoofing Detection
https://fsfm-3c.github.io/fsvfm.html
>ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
https://arxiv.org/abs/2510.10606
>UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation
https://arxiv.org/abs/2510.10575
>Head-wise Adaptive Rotary Positional Encoding for Fine-Grained Image Generation
https://arxiv.org/abs/2510.10489
>When Images Speak Louder: Mitigating Language Bias-induced Hallucinations in VLMs through Cross-Modal Guidance
https://arxiv.org/abs/2510.10466
>DREAM: A Benchmark Study for Deepfake REalism AssessMent
https://arxiv.org/abs/2510.10053
>Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs
https://arxiv.org/abs/2510.10426
>ReMix: Towards a Unified View of Consistent Character Generation and Editing
https://arxiv.org/abs/2510.10156
>VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework
https://arxiv.org/abs/2510.10269
>Semantic Visual Anomaly Detection and Reasoning in AI-Generated Images
https://arxiv.org/abs/2510.10231
>Prompt Optimization Meets Subspace Representation Learning for Few-shot Out-of-Distribution Detection
https://arxiv.org/abs/2509.18111
First for containment general
i miss schizo anon
>>106894991Your gens are all garbage
How does it feel samefagging and spamming a thread and general for so long? as if a new thread would change anything about the situation of this general...
>>106896342what do you mean, anon?
>>106894678I'm here AMA, the real one >>106896342 this was me
>>106896345I mean making a new thread will not fix the problems here. The same people will keep samefagging and spamming low effort gens. We need more actual discussion and genuine content instead of the same spam every thread.t.schizo anon
>>106896361how does this affect you personally though?
>>106896376It clogs up the thread with garbage and makes it harder to find good posts and discussions. When half the thread is just spam gens and samefagging it kills any conversation.
>>106896376>>106896345Oh sorry, thought I was in /ldg/, my bad! forget what I said. Bye!
>>106894991
>>106895588
I do not think your gens are garbage but I do not like or I am unsettled by your character's face, although I understand that is precisely what you like about that character. What I would like to know is what that character represents to you
t. schizo anon
>>106896350imposter
A lyft driver told me her life story yesterday. It was wild.
get a discord you blogposting faggets
>>106897202Sounds like you are butthurt.
>>106897220weird how /s*g/ anons always make up stuff as copes
We are artists. We have refined taste that normal people cannot understand.
I've been using AI for a while now, mostly coding. But since I discovered what you can do with this stuff I've basically been nonstop gooning for days, is this normal? Being 100% serious here
>>106897717Ok
Asking for a fourth time because /g/ has 6 gorillion different AI generals.Can I get an idiot-proof QRD on what I should be using right now for local gen?>Haven't touched Stable Diffusion since like March>Still using Stable Diffusion reForge>This is the model I was using lasthttps://huggingface.co/nnnn1111/models-moved/blob/9ab536a5b4612faf1e13e40ce3915747bff906df/illustriousXLPersonalMerge_v30Noob10based.safetensorshttps://civitai.com/models/835655/illustrious-xl-personal-merge-noob-v-pred05-test-merge-updated>It produced the least shitty results back then but I still feel like I'm stuck in 2022 using it (shitty faces, bad anatomy, very poor results even with negative prompt and Hires. fix, takes dozens of gens to make something decent) and I'm very jealous of NAI faggots who can simply proompt and coom>I don't know if my loras will work with anything else if I get something newer or better>I'm only slightly familiar with the SD interface but I'm still not too too sure of what I'm doing (usually only mess with CFG scale and check Hires. fix, I don't know what the hell sampling methods do, for example)>Just generally retarded and need help
>>106898017The links are in the OP
>>106898017I think illustrious is still the most preferred model line for anime. there's no NAI equivalent that you can run locally
>>106898216>there's no NAI equivalent that you can run locallyHow come local image gen has been stuck in the same place for 3 years just like local voice gen?
>>106898230
local voice gen has actually made some progress recently. vibevoice made a lot of noise (heh) recently
local image gen has a lot of new entries too. chroma finished its training, and there's a bunch of newer models like hunyuan3, qwen-image, and others
but ultimately local is limited by 1) what bones companies want to throw to the open source world and 2) what is going to fit into consumer hardware
>>106897717YeahMy goon phase was like 3 months and at the end i needed to stop and go to a doctor because my balls were in agonyAfter that i decided to become schizo anon and haunt californians
>sloptober
>gooning too much makes you schizophrenic/s*g/ solved another mystery
>>106898603we've been defunded by doge :(
>>106898603>no stairsHow nice of the architects
>>106898802Sad
DIFFUSION THREADS ARE UNDER CONSTANT ATTACK BY TROLLS AND SCHIZOS.REMEMBER:POST WITH AN AVATAR OR TRIPCODE ONLY.RESPOND TO AVATARFAGS ONLY.PROTECT OUR SPACE. USING IDENTIFIERS IS THE ONLY WAY TO KEEP THE TROLLS OUT.
>>106898844thank you anonymous nogen poster
>>106898935>>106898944cute couple
>>106898950It's her sister
>>106898950nice ghost.
i'm literally shitting right now
>>106898971
shes tryn to be spooky but is too cute to pull it off
>>106899180
>only you can prevent forest fires
Does the order of descriptions in the prompt matter? (Example: Background, then Lora for character, then character description, then Lora for style, then style description, etc) Or can I just keep adding random shit I think about between generations?
Afternoon anons
>>106899680
I actually legitimately remember Smokey the Bear posters hanging on the walls in my school. Man, I feel old.
>>106899985Yes
>>106900027Yes to which one??
>>106900034To the one where yes is an actually comprehensible and legitimate reply.
>>106900057I'm debo
>>106900073No I'm debo
>>106898844threads are under attack from people genning random slop aka using models that can't generate simple things like leaves, flowers and aesthetic space because they are badly trained and the user is blatantly ignorant and can't understand settings aka nigbo posts, either that or he has terminal astigmatism (truth hurts I'm afraid)
>ga
>dollar store koff ghostsgrim
>>106899985
it depends. using the t5 encoder, position is very important. using clip, position importance is ui-specific (forge will apply relative token weighting while comfy will use absolute weighting)
>>106899989
ga
I hope he's certified to be in the cockpit
>>106900022
I wonder what the zoomer equivalent of smokey is. maybe newer generations just accept the world is going to burn down so they don't think about it
>>106900577I dont think its "acceptance". I think its just apathy.
Anybody got a good Halloween prompt?
>>106900577they got more of the boomer "duck and cover" style narrative but swapped out the ruskies with climate change. i'd wager there's less social cohesion the boomers could rely on to manage whatever anxieties were caused by the narratives, which obviously zoomers don't have.
>>106900593
a rose by any other name
>>106900754
the climate change scare tactics worked on me. I'm anxious of the world melting
>/me burns another rain forest to generate an image
I get the irony
>>106901167i thought they were going with the water use thing, with a weird notion that the water used is, thermodynamics be damned, permanently destroyed forever.
>>106901167Yeah, what I said and what you said are not the same thing.
>>106901272
the water use thing? you mean data centers wasting water? or just society's consumption of potable water at large? both are issues -- not because water disappears forever but because it gets increasingly costly to access and provide usable water
with data centers, they often add chemicals to the water to make it cool more efficiently or to be less corrosive, effectively "destroying" it because it's now toxic. they would need to spend money to treat it to return it back to the system, but big business isn't well known for spending money just cuz it's the right thing to do
>>106901335i didn't really think about the chemical aspect, although i'm not clear on how nasty they get with it. the way i usually encounter this narrative is more along my violation of thermodynamics quip. so i'll go with "destroying" being technically accurate but i have my reservations as to whether the people deploying it know
>Use a SFW model
>Put "nsfw" in the negative prompt
>Gen
>Still paywalled
I am not trying to make porn Civit fuck you
Anyone able to gen a face hugger on a jack-o-lantern?
mh
>>106902723you're watermarking your gens now?
>>106901795
i had to cheat and use gpt-image-1, could try the prompt on something else:
"an alien parasite with a pale, leathery body. it has eight long, jointed limbs extending from a central fleshy mass. its limbs are finger-like and end in rounded tips, posed as if grasping. the top of its body is segmented and ridged, while the underside is smoother and swollen like an organ sac. a long, segmented tail coils beneath it, thick at the base and tapering toward a sharp point. the creature is veined, wet, and slightly translucent, with pinkish-beige skin. high detail. horror lighting. The creature is attached to the FACE of the jack-o-lantern (creature face on pumpkin face) we see creature only from behind. the legs and tail are wrapped tightly around the pumpkin."
mr president, another ai general has hit the board >>106904218
>>106904379oh nm, its just ldg and they fucked up the subject. lol idiots
>>106904388lmao
>>106904933This scene reminds me of when I was in elementary school and we had the D.A.R.E. program and they were trying to convince us that some of our peers would try to get us to drink wine coolers and if that ever happened, we should say no. Before that I had never heard of wine coolers ever. Apparently it's like fruit juice mixed with wine and sold in like soft drink sized bottles. To this day, nobody has ever offered me a wine cooler and I have never actually seen one.
>>106904987DARE was the biggest lie. I was led to believe strangers would be offering me free drugs all the time.
Here's your controller bro
>>106905087send this to elon so he makes the tesla engineers redesign their robots
>>106905139We like Dem curvy bots
>>106904222nice work still
Schizo anon here AMA
>>106902723>*sniffs*:3
Guess everyone's at work or picrel
>>106908600*or in the real thread(s)
>>106909131>he says, posting here
>>106908187
chocolate with orange notes is so good. its usually dark chocolate though rather than milk chocolate
>>106908600
threads been pretty ded this week
>>106909380Have you ever had one of those chocolate orange things that you're supposed to whack open and it crumbles into slices that are usually around for the holidays?
I haven't touched image generation in a while. I would like to modify textures for a game; they are flat 2D atlases (like picrel) that are applied to a 3D model.
Can image generation help me with that? For example, adding some details to a texture, changing some clothing, upscaling them, or making (mostly color) variants?
>>106909634
YES, I loved those when I was a kid. man I totally forgot about those. and those were def milk chocolate
I might buy one of those off amazon now... you're a bad influence lol
>>106909723
I've never seen someone try this but it seems like it would be possible. qwen-image-edit is the current premiere edit model so I'd recommend trying that and seeing how it works for this usecase
>>106909769Can I get a guide on how to use it locally? I only have 12GB of vram.
>>106909804
here is the quantized model and there's some usage instructions in there (although a bit obtuse)
https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509
>>106909769Lol sorry, it was the first thing that came to mind
>>106909891
I also saw that there was some GGUF quantization: https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF
Which quantization should I use?
>>106910075Weirdly cozy
>>106909152where else would i go and why? my home is /s*g/
qwen edit 2509:
>>106909966I couldnt tell you the difference between them, desu
>>106910310Ok
>>106910405We live in a society
>>106910405with wan 2.2:
>>106910452
Morning anons
>>106909966
>Which quantization should I use?
With your 12GB vram you'll want to use either of the Q3 versions to keep image generation times fast. Using anything bigger will overflow into system ram (or just straight up crash from OOM errors) and push gen times into minutes instead of staying below that.
>>106909966
I use Q8, which is like 20/21GB, with a 4080 (16GB) and my gens take like 30 seconds. It depends on how much system RAM you have, so you don't necessarily need a model smaller than your VRAM.
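The trade-off in the two replies above can be roughed out with some arithmetic. This is only a sketch under loud assumptions: the 20B parameter count is assumed (check the model card), the bits-per-weight figures are approximate values for these GGUF quant types rather than actual file sizes from the repo, and the 2 GB headroom for activations, text encoder and VAE is a guess. The function names are made up for this example.

```python
def estimate_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (10^9 bytes): params * bits / 8."""
    return params_billion * bits_per_weight / 8


def fits_in_vram(size_gb: float, vram_gb: float, headroom_gb: float = 2.0) -> bool:
    """True if the weights plus an assumed working headroom fit in VRAM."""
    return size_gb + headroom_gb <= vram_gb


# ASSUMPTIONS, not values taken from the thread or the repo:
PARAMS_BILLION = 20.0  # assumed parameter count of the image model
BITS_PER_WEIGHT = {    # approximate bits per weight for each quant type
    "Q3_K_S": 3.5,
    "Q4_0": 4.5,
    "Q8_0": 8.5,
}

if __name__ == "__main__":
    for name, bpw in BITS_PER_WEIGHT.items():
        size = estimate_size_gb(PARAMS_BILLION, bpw)
        verdict = "fits" if fits_in_vram(size, vram_gb=12.0) else "spills to system RAM"
        print(f"{name}: ~{size:.1f} GB -> {verdict} on a 12 GB card")
```

Under these assumptions only the Q3 estimate stays inside 12 GB, and the Q8 estimate lands near the 20/21GB figure quoted above. A quant that doesn't fit can still run with part of it offloaded to system RAM, just slower, which is why both replies can be right.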
Did LDG die or something lol?
>>106910694you can put either sdg or ldg into the catalog filter/search box and the subjectless ldg thread shows up.
>>106910694yes. ldg lost
Anyone else feel that the pages are going faster than usual?
>gm
>>106910925
10mins after the post above me and this thread was on page 4 so yes.
>nigbo
trying this combo for wan with the new lora, seems to work decent:
>>106911584
4 steps
>>106911614Amazing
>>106911614the anime girl shakes hands with hatsune miku who walks in from the right.
>>106911645I love this!
>>106911956These gens are so cozy. No idea what's going on, not the left with the plywood cutout of that dude and the people just chilling, but cozy nonetheless.
>>106912001
looks like a community picnic maybe? I'm sure it's a good time
I was thinking about something you said, like "I thought all places got fall colors" when I was out for a run recently. our fall colors are mostly just a variety of greyish-yellows and browns, lol
>one day and 16 hours ago
I want to train a super resolution AI model. I have a collection of images and low resolution versions of those images. Some of the OG high resolution images are missing and I want to recreate them as closely as possible using the pairings I already have (the artstyle is the same throughout the collection).
What can I do? My idea was to train a super resolution model with my collection as regularization images or something similar, but I don't know.
does anyone know how to prompt picrel? - https://files.catbox.moe/rwgu80.jpeg
not so much concerned about the magazine cover style, more about the pose with chair and the sparkling water and lighting
>>106912586>moistnice taste anon
I keep looking at hlky's github hoping to see a green dot show up
the demonizer
https://suno.com/s/WVUux3t0OToIGQS0
>>106910452this could be an actual animation from the game lol
so alive
>containment general
>>106914200She finally got some help. Nice
>nigbobumping
>>106914251In the grim darkness of the far future there is only war.
>>106914200>>106914281this series is wild. just different wildcards or did you change other stuff?
>>106914378
one of my nodes is for manual input. i add whatever there and it applies certain wildcards too
currently reading a warhammer 40k novel called 'mechanicum' and used some of the descriptions
>>106914457>>106914408Agree with >>106914378epic scenes
>>106914490some are kind of neat, some are just weird
damn lines
>>106915028oo cyclops girl is back
>>106915042yeah they come out that way sometimes, not sure why exactly.
wheres pw anyway
>>106915117hopefully he's in a better place
cribbed prompt from 8/23, barbwire outline on the hat is a nice touch.
>>106915191we are all pw on this blessed day(except pw who isn't here)
these old prompts were weird
>>106915304
whats old is new again
thats why i like changing models once in a while, cuz I can spam all my old prompts again
>>106915331they're doing this to me tomorrow
dunno where the fuck this came from
losing the deebs. just kidding, he's wasting his life reading fake chinese pre-prints preporting to have created
ah fuck. is it more retarded to flub a post or to read fake chinese preprints claiming to have invented the image diffusion god?
oh don't mind me, i'm just a simple minded retard
Next Thread
>>106915697
>>106915697
>>106915697

>>106915678
>read fake chinese preprints claiming to have invented the image diffusion god?
Hopefully he's >>106915450 done scanning them because I'm sending this next thread nao!
Last one from me
Good night anons
>>106915698
a (you) from baker-san?!
i'll never wash this 4chan pass again!
>>106915699
night!
since we're doin golden oldies
https://suno.com/s/SEMiHaK2LUHQYqru
>>106915678
he knows me so well
I had to catch up cuz I had todays and tomorrow (I can see the future)
>>106915698
ty baker san
>>106915699
gn
>>106896602
i just remembered i never got back to you. i don't remember exactly, but it probably happened a few times accidentally and i was like "yes!" unsettling is the vibe. and idk, i guess i have a black eye fetish, idk lmao