Previous /sdg/ thread : >>106618193

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-GGUF

>Flux.1 Krea
https://docs.comfy.org/tutorials/flux/flux1-krea-dev
https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
https://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
First for containment general
>nigbo
>>106633701Sloppy
>>106633701Toppy
>>106633773>>106633798
Post some examples of why Flux, Chroma, Qwen that you are shipping here are supposedly better than Pony or Illustrious.
Or is there another reason?
>>106634091no
>>106634091Fuck off
>Pony and Illustrious reign supreme yet again.
>>106634727glorp
Why are all gens here shit compared to the ones in /ldg/ and /adt/? I mean just look at their collages
>>106634907What do you mean?
>>106634907They use shitty models mentioned in OP.
>mfw Resource news
09/19/2025
>UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets
https://github.com/fnlp-vision/UnifiedVisual
>DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images
https://github.com/kzmngt/DACoN
>MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks
https://huggingface.co/datasets/inclusionAI/MultiEdit
>Universal Metal Flash Attention
https://github.com/bghira/universal-metal-flash-attention
>Lucy Edit Dev (5B): Open-weight video editing
https://huggingface.co/decart-ai/Lucy-Edit-Dev
>RamTorch: RAM is All You Need
https://github.com/lodestone-rock/RamTorch
09/18/2025
>Wan-Animate: Unified Character Animation and Replacement with Holistic Replication
https://humanaigc.github.io/wan-animate
>Noise-Level Diffusion Guidance: Well Begun is Half Done
https://github.com/harveymannering/NoiseLevelGuidance
>EDITS: Enhancing Dataset Distillation with Implicit Textual Semantics
https://github.com/einsteinxia/EDITS
>Cross-modal Full-mode Fine-grained Alignment for Text-to-Image Person Retrieval
https://github.com/yinhao1102/FMFA
>LivePyxel: Accelerating image annotations with a Python-integrated webcam live streaming
https://github.com/UGarCil/LivePyxel
>LLM-I: LLMs are Naturally Interleaved Multimodal Creators
https://github.com/ByteDance-BandAI/LLM-I
09/17/2025
>Runge-Kutta Approximation and Decoupled Attention for Rectified Flow Inversion and Semantic Editing
https://github.com/wmchen/RKSovler_DDTA
>Lego-Edit: A General Image Editing Framework with Model-Level Bricks and MLLM Builder
https://github.com/xiaomi-research/lego-edit
>Superpixel Anything: A general object-based framework for accurate yet regular superpixel segmentation
https://github.com/waldo-j/spam
>China blocks sale of Nvidia AI chips
https://arstechnica.com/tech-policy/2025/09/china-blocks-sale-of-nvidia-ai-chips
>mfw Research news
09/19/2025
>DF-LLaVA: Unlocking MLLM's potential for Synthetic Image Detection via Prompt-Guided Knowledge Injection
https://arxiv.org/abs/2509.14957
>MARIC: Multi-Agent Reasoning for Image Classification
https://arxiv.org/abs/2509.14860
>[Re] Improving Interpretation Faithfulness for Vision Transformers
https://arxiv.org/abs/2509.14846
>Not All Degradations Are Equal: A Targeted Feature Denoising Framework for Generalizable Image Super-Resolution
https://arxiv.org/abs/2509.14841
>Dataset Distillation for Super-Resolution without Class Labels and Pre-trained Models
https://arxiv.org/abs/2509.14777
>Frame Sampling Strategies Matter: A Benchmark for small vision language models
https://arxiv.org/abs/2509.14769
>Chain-of-Thought Re-ranking for Image Retrieval Tasks
https://arxiv.org/abs/2509.14746
>Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Alignment in MLLM
https://arxiv.org/abs/2509.14735
>Attention Lattice Adapter: Visual Explanation Generation for Visual Foundation Model
https://arxiv.org/abs/2509.14664
>Generalizable Geometric Image Caption Synthesis
https://arxiv.org/abs/2509.15217
>Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation
https://arxiv.org/abs/2509.15185
>Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models
https://arxiv.org/abs/2509.15156
>WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance
https://worldforge-agi.github.io
>QuizRank: Picking Images by Quizzing VLMs
https://arxiv.org/abs/2509.15059
>AutoEdit: Automatic Hyperparameter Tuning for Image Editing
https://arxiv.org/abs/2509.15031
>AToken: A Unified Tokenizer for Vision
https://arxiv.org/abs/2509.14476
>mfw Yesterday's Research news
09/18/2025
>Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions
https://arxiv.org/abs/2509.14165
>SAIL-VL2 Technical Report
https://arxiv.org/abs/2509.14033
>GenExam: A Multidisciplinary Text-to-Image Exam
https://arxiv.org/abs/2509.14232
>Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification
https://arxiv.org/abs/2509.13922
>SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation
https://arxiv.org/abs/2509.13848
>Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models
https://arxiv.org/abs/2509.13836
>BWCache: Accelerating Video Diffusion Transformers through Block-Wise Caching
https://arxiv.org/abs/2509.13789
>Generative Image Coding with Diffusion Prior
https://arxiv.org/abs/2509.13768
>Iterative Prompt Refinement for Safer Text-to-Image Generation
https://arxiv.org/abs/2509.13760
>Controllable-Continuous Color Editing in Diffusion Model via Color Mapping
https://arxiv.org/abs/2509.13756
>StyleProtect: Safeguarding Artistic Identity in Fine-tuned Diffusion Models
https://arxiv.org/abs/2509.13711
>BiasMap: Leveraging Cross-Attentions to Discover and Mitigate Hidden Social Biases in Text-to-Image Generation
https://arxiv.org/abs/2509.13496
>EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing
https://tianyucodings.github.io/EdiVAL-page
>An Empirical Analysis of VLM-based OOD Detection: Mechanisms, Advantages, and Sensitivity
https://arxiv.org/abs/2509.13375
>Humor in Pixels: Benchmarking Large Multimodal Models Understanding of Online Comics
https://arxiv.org/abs/2509.12248
>Synthesis and Perceptual Scaling of High Resolution Naturalistic Images Using Stable Diffusion
https://arxiv.org/abs/2410.13034
debo has arisen
>vibe killed
>>106633701
this is still chroma? I see it in the filename obv but that style is very different. looks closer to an illustrious based gen
>>106633864
>mfw
>>106634727
glorp
>>106635025
gm
slightly awkward news timing
>>106635033
>>106635052
yah chroma (2k +v48 detail cal+staging large merged)
seems people are mixing shit up to see what works best now but i havent really kept up with what's what
>>106635650Good gen
>>106635711ty
tfw nogen
>>106635052
>https://github.com/Cypress-Yang/SongBloom
Songbloom is pretty good for a local model
Here's an example >>106635822
It's only a 2B model too.
Morning anons
>gm
>>106636685morning
>>106635964
>>106636144
goblin hours
>>106636303
interesting. example sounds super crunchy but maybe local will use this model to evolve
>>106636685
gm
happy friday
>>106636971There was some better example posted earlier but I linked the one with a screenshot. Seems like he's using something to process the reference audio. Need to test it out, haven't downloaded it yet...
https://desuarchive.org/g/thread/88697361
three years ago.. the good ol days
>>106637454
>checks to see what I was up to three years ago
>cyberpunk cityscapes
ah, pretty much my OG bread and butter
>>106637506i was doing my usual boring portraits
>>106637400I like sad clowns. I should make some too.
>>106637523hard to frown with clowns
your clown gf, sir
concluding clown hour
New comfy is cancer...
>>106637942To add: not sure if it's wise to install Songbloom nodes. Seems bit iffy.
>>106637516
how interesting that even that long ago you had already established a style. far from boring
now I'm compulsively back to cityscapes
>>106637956
I know text encoders are limited but have you tried brackets? Using brackets is a common thing with LLMs. This is not exactly precise in terms of description.
>summary: [ portrait, closeup ],
>subject: [ grey alien, big black eyes ],
>clothes: [ white t-shirt, orange skirt, white sneakers ],
>medium: [ cinematic lighting, film grain ],
>background: [ dark background, black background ]
I think this will work better with more robust encoders like chroma.
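A minimal sketch of how the bracketed layout above could be assembled from labelled groups, so that sections can be swapped in and out (for example by a wildcard script). The function name `build_prompt` and the dict layout are assumptions made for illustration; nothing here is tied to ComfyUI, Forge, or any other UI.

```python
def build_prompt(sections: dict[str, list[str]]) -> str:
    """Join labelled groups of tags into one bracketed prompt string.

    Hypothetical helper: each key becomes a label and its tags are
    wrapped in square brackets, in the order the dict provides them.
    """
    parts = []
    for label, tags in sections.items():
        parts.append(f"{label}: [ {', '.join(tags)} ]")
    return ", ".join(parts)


prompt = build_prompt({
    "summary": ["portrait", "closeup"],
    "subject": ["grey alien", "big black eyes"],
    "clothes": ["white t-shirt", "orange skirt", "white sneakers"],
    "medium": ["cinematic lighting", "film grain"],
    "background": ["dark background", "black background"],
})
print(prompt)
```

Whether a given text encoder actually treats each bracketed group as isolated is a separate question; this only produces the string.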
>>106638086brackets in text encoder conditionings affect the token weighting
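For reference, a simplified sketch of the bracket weighting convention popularized by the AUTOMATIC1111 webui, where each level of `( )` multiplies attention by 1.1 and each level of `[ ]` divides it by 1.1. This is an illustration under that assumption, not any UI's actual parser: the function name `weighted_segments` is hypothetical, explicit `(text:1.3)` weights and escaped brackets are not handled, and other UIs may treat square brackets differently or ignore them.

```python
def weighted_segments(prompt: str, step: float = 1.1) -> list[tuple[str, float]]:
    """Split a prompt into (text, weight) pairs based on bracket nesting.

    Assumed convention: "(" raises the weight by a factor of `step`,
    "[" lowers it by the same factor. Unbalanced closers are ignored.
    """
    segments: list[tuple[str, float]] = []
    buf: list[str] = []
    round_depth = 0
    square_depth = 0

    def flush() -> None:
        # Emit the buffered text with the weight of the current nesting level.
        text = "".join(buf).strip(" ,")
        if text:
            weight = step ** (round_depth - square_depth)
            segments.append((text, round(weight, 4)))
        buf.clear()

    for ch in prompt:
        if ch in "()[]":
            flush()  # text before the bracket keeps the old depth
            if ch == "(":
                round_depth += 1
            elif ch == ")":
                round_depth = max(0, round_depth - 1)
            elif ch == "[":
                square_depth += 1
            else:
                square_depth = max(0, square_depth - 1)
        else:
            buf.append(ch)
    flush()
    return segments


print(weighted_segments("a cat, (red hat), [blurry]"))
```

Under this convention, the square brackets in the structured prompt above would de-emphasize every group equally rather than isolate them.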
>>106638141
Yeah, that should be the point: they should help isolate tokens from other groups.
Maybe ComfyUI does not care about it, I don't know. LLMs benefit from this format. At least smaller ones.
>>106638163
*from
I have dementia. 10 years to live btw.
>>106638086
>>106637956had some, like these, very psychedelic/weird, i wish i could remember the prompts, back then i wasn't so locked into standard practices, and sd 1.4/1.5 was more free
>>106638199
>black eyes
Orange bleeds already.
This is some sdxl model made for porn anyway.
>>106638163
with these more recent prompts, I have been trying a strategy of somewhat-isolating concepts just by virtue of how the wildcards are laid out. my understanding of how imgen sampling works though is it just looks at all the tokens and tries to produce them all, which is why token bleed has always been a thing. thats why its generally a better strategy to use natural language descriptions that correlate tokens together, rather than keyword lists that will be more haphazard in retrieval
>>106638170
>10 years to live
:(
the chatgpt AGI hypermind will save you. only $499/mo
>>106638200
if you wanted to make a whole project of it, you could skim through all these old archives and train a lora off your 1.4/1.5 gens. could be quite a trip
>i wish i could remember the prompts,
I'm a hoarder, so I still have my prompts from all the way back then. none of them are useful, ironically
>sd 1.4/1.5 was more free
a double edged sword. there were fewer guardrails but it was also way more willing to produce garbage
>>106638338
Yeah I see what you mean.
>>106638338I will need to replace my company contract with a new life long subscription with OpenAI, I'm lying on a bed and LLM constantly feeds my brain with new impulses.Beautiful!
She's bit too blurry.
>>106638444I like it.
Qwen can make comfy gens.
Well this one is from API only variant of Flux actually.
But I think it fits here more than Dall-e thread.
>>106638523>>106638469I posted these because these are bit like photographs from 1970s.
>>106638541sick
This is the new chroma girl.
>>106638834denied
>>106638837?
>>106638837Why are you so offensive?
>>106638834quite the great gen
>>106638995Thank you. I appreciate when it is coming from you.
>>106638892why are you so... YUCCKY??
>>106639352I'm sorry but if you want to talk with me you need to address the poster with the right attitude.
https://www.youtube.com/watch?v=vkUpfw4Hf3w
>>106639675I'm confused about jewtube. Pick up one song, it will automatically continue with its ai generated playlist.
>>106639761
https://www.youtube.com/watch?v=yq9xqXjnC6k
I know you hate it.
>>106639772
i like them, dare say love. a favorite as a kid
https://youtu.be/Qb0vPMhwWjU?si=fvyaLQUwl7mPk0KG
another i liked as a kid, don't really care much for it now. really liked the album art
>>106639830Thanks, I'm going to listen to this and smoke a cigarette.
>>106639830It's progressive. I love progressive music.
>>106639879
https://youtu.be/FLRtymQWndM?si=sG4Qqr2FmSpA2LpP
>>106639957Seattle...
>>106640006I wish I could travel to Alaska!
>>106640041
me too, tired of midwestern summers
https://youtu.be/5fqq9XGE2co?si=w2mGxcnFgyDZGNFc
>>106640099Thanks, I have forgotten all the music. I deleted 100gb of images anyway.
>>106640099This is not a date service. I live in Finland and you live in Alaska.
I have been using forge for years. what does comfy let me do that forge does not? I would always like more control over what i'm making but i'm just not clear on what exactly comfy offers
>>106639707
whatever keeps the normies listening to ads
>>106640099
describe a midwestern summer
>>106640127
>average sdg poster
>>106640099
I love you anyway.
Let's see a new song...
It could be offensive...
i hate (((doctors)))
>>106640099
https://www.youtube.com/watch?v=1s3fMaBdWD8
I love to play this because it's offensive.
>>106640154Google classifies this as problematic.
>>106640119
alright
https://youtu.be/VRWwJZ44Sh4?si=l9GljDYDAiwIKjmG
>>106640139
just horribly hot and muggy, very miserable
>>106640154
incredibly unappealing thumbnail image, not going to listen, sorry
>>106640180None taken.
>>106640139She is literally me.
This is my punk.
https://www.youtube.com/watch?v=XUJ_z3Y30o4
no
>>106640877butt
>>106640889AI buttz
>>106640805yes
>>106640901maybe
when you accidentally generate a Gatorade commercial
>>106640938
>men live like this and see no problem
>>106640982
where did you get this picture of my thought bubble?
>>106641043
sorry distracted by operation
>>106641095now this guy knows how to gen
>>106641110turn the cfg to 11
>>106641157ok
>>106641179JUST DO IT
>>106641198i did. this is cfg 11
>>106641203>mfw euler
>>106641210it's a good scheduler saar
>>106641220
*sampler
i bet you use simple or normal scheduler too you savage
i use automatic, like God intended
>>106641266i should go back to forge or whatever supports chroma
>>106641266reforge supports chroma just fine, chroma doesn't support me.
investigation
https://suno.com/s/udpM1f6lbFxVSmgD
>>106641398Seems like ok.
>>106641406seems like we agree
>>106641398Cool, now rip a guitar solo at 2 mins or otherwise it's not worth a listen.
>>106641417You need to PUSH it. Every song has a climax.
>>106641418no u.
>>106641423this one has a special purpose.
>>106641430Sorry daddy.
>>106641439you are not the target then. be not afraid
>>106641418Now let's pray it doesn't take them another 2 years to learn an additional 2 boring keys
>>106641475always leading back to you
this shithole sucks my ass
>>106641540i dont think thats how shitholes work
>>106641703you fucking snake! meet me in the otherworld, and i will destroy you.
>>106641804Thank you daddy.
>>106641840you're welcome, nigger.
i can't find the beginning,
and i don't know what's the truth?
is it me, or sunshine?
or both? or none
die in your millions,scum.do not reproduce. period.that means all of us. everywhere and when
except a slim neckwhich i created out of nothing, and promised to it that there would be no interference.
always leading back to you
Yo
it keeps leading back to you!
i feel a coma coming on
>>106641993Ok daddy, let the coma in.
>>106641927
yo
>>106641993
when I was young, I thought it was "acoma" rather than "a coma". like, you get a bad case of acoma or whatever
Gm!!! ;3
>>106642187Yo what's good
>>106642192nm, just winding down for the night. up to anything fun this weekend?
>>106642224
Nice. Nah just coasting. Thinking about ordering a beanbag chair. Nothing exciting lol.
>>106642232
>Thinking about ordering a beanbag chair.
lol, I've actually had this thought too. I have a room I don't really use and thought "would be cool to have a bean bag chair in here and I could lounge around"
but I'd prob wanna be on my laptop and it'd be awkward to use on a beanbag chair I think. plus I don't wanna be responsible for a beanbag chair
don't let me bias your beanbag aspirations though
Last one from me, good night anons
>>106642271I mean, you could sort of pull up your legs and dig your heels into the crevice that's made by your butt when you sit in it and then place the laptop sort of on an angle between your knees and stomach. I know that sounds awkward as a description, but I've seen many people use laptops on a beanbag chair.
>>106642277
gn
>>106642279
possibly. but not definitely. so it'd be a risky purchase. I've already perfected couch lounging so my lounging capacity is pretty up there
are you thinking a normal beanbag chair or a gigantic beanbag chair?
>>106642293I'm actually thinking about like a very large bean bag chair. Something that I could like recline in and take a nice nap.
>>106642329hell yea. go big or go home
>>106642404Woooo
>>106642413
I almost made it to the next thread but i'm dying!
>>106638834Represents sdg in all its glory
i miss schizo anon
oh dear
Next Thread
>>106643240
>>106643240
>>106643240
>>106637454
Back when I manually renamed filenames for the first image. Gud tymes.
>>106643244the man, the myth, the legend!
>>106643289hail, kot. well met.