Previous /sdg/ thread : >>108606788>Beginner UIEasyDiffusion: https://easydiffusion.github.ioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Advanced UIComfyUI: https://github.com/comfyanonymous/ComfyUIForge Classic: https://github.com/Haoming02/sd-webui-forge-classicStability Matrix: https://github.com/LykosAI/StabilityMatrix>Z-Imagehttps://comfyanonymous.github.io/ComfyUI_examples/z_imagehttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Flux.2 Dev/Kleinhttps://comfyanonymous.github.io/ComfyUI_examples/flux2https://huggingface.co/black-forest-labs/FLUX.2-devhttps://huggingface.co/black-forest-labs/FLUX.2-klein-4Bhttps://huggingface.co/black-forest-labs/FLUX.2-klein-9B>Chromahttps://comfyanonymous.github.io/ComfyUI_examples/chromahttps://huggingface.co/lodestones/Chroma1-HDhttps://huggingface.co/silveroxides/Chroma-GGUF>Animahttps://huggingface.co/circlestone-labs/Anima>Qwen Image & Edithttps://docs.comfy.org/tutorials/image/qwen/qwen-imagehttps://huggingface.co/Qwen/Qwen-Image>Text & image to video - Wan 2.2https://docs.comfy.org/tutorials/video/wan/wan2_2>Models, LoRAs & upscalinghttps://civitai.comhttps://huggingface.cohttps://tungsten.runhttps://yodayo.com/modelshttps://www.diffusionarc.comhttps://miyukiai.comhttps://civitaiarchive.comhttps://civitasbay.orghttps://www.stablebay.orghttps://openmodeldb.info>Index of guides and other toolshttps://rentry.org/sdg-link>Related boards>>>/aco/sdg>>>/b/degen>>>/d/ddg>>>/e/edg>>>/gif/vdg>>>/h/hdg>>>/r/realistic+parody>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vp/napt>>>/vt/vtaiOP https://rentry.co/twkuk8tz
First for shithole general
"adorable Quokka" according to ERNIE image turboLmao
>>108624720it's an adorable stuffed quokka
>mfw Resource news04/17/2026>ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handlinghttps://yjx-research.github.io/ControlFoley>TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokenshttps://research.nvidia.com/labs/toronto-ai/tokengs>MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generationhttps://aka.ms/mm-webagent>Qwen2D-VAEhttps://huggingface.co/Anzhc/Qwen2D-VAE>ComfyUI HY-World 2.0 — WorldMirror 3Dhttps://github.com/AHEKOT/ComfyUI_HYWorld2>Anima Style Explorer: A free web tool for ComfyUI styleshttps://anima.mooshieblob.com>Stanford AI Index Report 2026https://hai.stanford.edu/assets/files/ai_index_report_2026.pdf04/16/2026>Motif-Video 2B: A micro-budget text-to-video diffusion transformer from Motif Technologieshttps://motiftech.io/videoshowcase>HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worldshttps://huggingface.co/tencent/HY-World-2.0>ErnieTurbo_extracted_lorahttps://huggingface.co/GuangyuanSD/ErnieTurbo_extracted_lora/tree/main04/15/2026>DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching https://huggingface.co/tencent/DisCa>Lyra 2.0: Explorable Generative 3D Worldshttps://research.nvidia.com/labs/sil/projects/lyra2>AniGen: Unified S3 Fields for Animatable 3D Asset Generationhttps://github.com/VAST-AI-Research/AniGen>T2I-BiasBench: A Multi-Metric Framework for Auditing Demographic and Cultural Bias in Text-to-Image Modelshttps://gyanendrachaubey.github.io/T2I-BiasBench>Generative Refinement Networks for Visual Synthesishttps://github.com/MGenAI/GRN>VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenizationhttps://videoflextok.epfl.ch>DiffusionPrint: Learning Generative Fingerprints for Diffusion-Based Inpainting Localizationhttps://github.com/mever-team/diffusionprint
>mfw Research news04/17/2026>Seen-to-Scene: Keep the Seen, Generate the Unseen for Video Outpaintinghttps://arxiv.org/abs/2604.14648>Prompt-to-Gesture: Measuring the Capabilities of I2V Deictic Gesture Generationhttps://arxiv.org/abs/2604.14953>Beyond Prompts: Unconditional 3D Inversion for Out-of-Distribution Shapeshttps://daidedou.sorpi.fr/publication/beyondprompts>Flow of Truth: Proactive Temporal Forensics for I2V Generationhttps://arxiv.org/abs/2604.15003>AnimationBench: Are Video Models Good at Character-Centric Animation?https://animationbench.github.io>DVFace: Spatio-Temporal Dual-Prior Diffusion for Video Face Restorationhttps://arxiv.org/abs/2604.14560>Geometrically Consistent Multi-View Scene Generation from Freehand Sketcheshttps://arxiv.org/abs/2604.14302>Analysis of Regularization and Fokker-Planck Residuals in Diffusion Models for Img Genhttps://arxiv.org/abs/2604.15171>Step-level Denoising-time Diffusion Alignment with Multiple Objectiveshttps://arxiv.org/abs/2604.14379>Prompt-Guided Image Editing with Masked Logit Nudging in Visual Autoregressive Modelshttps://arxiv.org/abs/2604.14591>Towards Design Compositinghttps://arxiv.org/abs/2604.14605>LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectorieshttps://rockeycoss.github.io/leapalign>The Courtroom Trial of Pixels: Robust Image Manipulation Localization via Adversarial Evidence and Reinforcement Learning Judgmenthttps://arxiv.org/abs/2604.14703>Reward-Aware Trajectory Shaping for Few-step Visual Generationhttps://arxiv.org/abs/2604.14910>Deepfake Detection Generalization with Diffusion Noisehttps://arxiv.org/abs/2604.14570>Switch-KD: Visual-Switch Knowledge Distillation for VLMshttps://arxiv.org/abs/2604.14629>Bird-SR: Bidirectional Reward-Guided Diffusion for Real-World Image Super-Resolutionhttps://arxiv.org/abs/2602.07069
darn teeth
>>108624914nice
lel
>>108625483hmm, which is the vile one
>>108625573the one in the back
>>108624720Nice. lolHere's my ERNIE.
>>108626018>>108624720nice. i plan on getting around to trying it, how hard is it to prompt compared to like klein/chroma/zit?
>>108626170So far feels easier, I couldn't change the aspect ratio without distortion/multiple subjects appearing. Follow styles, and knows characters..
>>108626170>>108626199do we know what text encoder ernie uses? I don't see it mentioned on the model card
>>108626351ministral 3 3b afaict
>>108624734Now it turned into a koala>>108626018Kek, heebs will not divide us
>>108626369I dont think I've ever done anything with minstrel. is it the same as qwen basically?
>>108626392yeah but french instead of chinese. idk if they just use the text encoder built into the model or what. the comfy template also uses a prompt enhancer which i'd expect is kind of like that one thing from the news we were talking about the other day. i'm still downloading so i'm just talking out my ass atm
>>108626420>yeah but french instead of chinesegod damn it, I just finished learning mandarin and now I have to learn french too? wǒ zhè cāodàn de féi zhái rénshēng..
>>108626369oof and yikeseven t5 would be better
>>108626825omg lewd
>>108626935it's a skin colored bodysuit
ERNIE Quokkas are so pink
>>108626966it's either a disease or they've turned carnivorous
This one came out better>>108626991New species!
>>108626966i remember some sdxl mix long ago kept drawing quokkas as green. if I had a quarter for every time quokkas had a spurious correlation with a random color.....
zit version of chromagirl (zgirl?) has a real death-cult vibe to her, i dont get it
>>108627088should try out ernie and see what kind of stuff erniegirl gets up to
>>108627101>would have to update comfyuii dont know about that
>>108627141don't be a pussy it'll be fine, unless it's been months and then idk maybe ur fucked lol. there haven't been any issues i've seen on desktop for a while now and i update everytime it nags me like a good comfyslave
>>108624673i was going hard on SD back in 2023 or 2024 and it worked on my gtx1080 8gb but now that is simply not possible. What are the boys using nowadays and how much do i need to pay to get up to speed?
>>108627450z-image-turbo might work on that card, it's enough vram anyway. might be a tad slower than on a 30xx+
>>108624673Kik Epp23gTele Bgftg33Make a Lora of my gf?
>>108627450>how much do i need to pay to get up to speed?a better question is how much you're willing to pay
ernie is pretty literal, but did ascii anyway. who needs color anyway?
>>108627605thats the most ascii yet
>>108627621workin on integrating it into an existing workflow. getting distracted by some mouseman baka
>>108627651the thread yearns for mouseman baka
>>108627659he is very stubborn. also this thing is very good at text
will need a lot of tweaking that i'm not in the mood to tweak. gonna try models i guess. also "adult female" = "chinese" so there's that
>>108627837>i'm not in the mood to tweak.just do what I do and jam all your old prompts into it with reckless abandon
>>108627880working on it, it's a little fiddly bc i gotta massage the noodles... THEY'RE ALWAYS CHINESE?? I TOOK MY MEDS I SWEAR TO GOD
this is aids. i'm kicking the ernie can down the road. why are they chinese?!?
>>108627926the noodles are chinese? I prefer my noodles to be japanese, personally>>108627952>why are they chinese?!?doesn't ernie have negs?
>>108627958probably, idk the template is fucking retarded. they shove the whole thing in a subgraph bc they know ppl see your average workflow graph and run screaming in terror. but that subgraph doesn't expose negatives. besides, what kind of model makes you put "chinese" in negatives? i'm moving on, it's caturday
>>108627966people only use subgraphs ironically>what kind of model makes you put "chinese" in negatives?maybe the chinese felt this way about SD. "why do I have to put 'white people' in the negatives?!?
>>108628004i was thinking about that as i wrote my previous reply. i didn't have a good answer except "we won, and in victory: the end"
still chinese
>>108628278its your subconscious reaching out. deep down you is mao's great proletarian cultural revolution and you have an unquenchable thirst for the blood of landlords
>>108628317>i want a kingno not like that!
>>108628323what is a king
>>108628337hopefully not mao! but if it is, then it is what it is.
Everything looks like AI to me now. Even supposedly real images. >>108627621maybe i should explore the galaxy
>>108628573>Everything looks like AI to me now. Even supposedly real images.these things happen... welcome!
>>108628580Thanks. I guess the future finally caught up with me.
>>108628614i took this picture! where did you get it?!
>>108628666Your mom said I could have it. I laughed so much she insisted that I should take it.nice devil's trips btw
rabid penguinhttps://suno.com/s/8SAYG3928UkqCFONhttps://youtu.be/Gzg7i4iKw-A
gn then.
>>108628990gngm
G'mornin Anons,
>>108630347morning
>>108630350Gm, nice details/gradient in gen!
>>108630380ty
>ernieyeah...
i miss schizo anon
hmmm
i cant say i'm impressed by ernie>mistral 3eh>prompt enhancer (actually gemma3-2b or 4b)eh>asian(chinese) women no matter the prompteh
>>108631069>>asian(chinese) women no matter the promptcentury of chinese prosperity
>>108631162might be ok for more simple prompts but not the stuff i throw at it
>>108631069>asian(chinese) women no matter the promptThat's interesting. Mistral isn't even chinese, neither is gemma.
>>108631215yah but ernie is, and if it's trained on say 80% chinese women (for the "woman" token) then there you go
>>108631244Yeah, makes sense. I just vide coded a GIMP plug-in that runs an image edit workflow.
>>108631293*vibe
the TE (mistral) and "prompt enhancer" (gemma3 if you even use it) are fine, the issue is the image model (ernie) doesnt have enough knowledge of say styles/things (probably because they spent so much training the text/multilanguage aspect). so while for certain things it may work well enough (especially text), if you throw a lot of things it has no idea about even if the TE/PE do, it's pointlessi'd say chroma (and more recent XL finetunes) > zit/f2.klein > flux in terms of local "knowledge" for image models. ernie would sit between zit/flux on thatlike this image, doesnt know what a space marine looks like lel>>108631293>I just vide coded a GIMP plug-in that runs an image edit workflow.nice, share it pls
>>108631304>nice, share it plshttps://litter.catbox.moe/ee68zgsfs9evey64.zipexpand directory in ~/.config/GIMP/3.2/plug-ins/Should work with the out of the box edit workflows from templates but didn't test all. Just gwen 2512.
>>108631342thx
Last time I fiddled with stuff like SD was when automatic1111 webui was pretty new.Lookin to get up to date with this, specifically to use img2img to change styles (eg photo to a drawing)Can someone give me a general point into the right direction? Where do I have to look? Theres so much shit now. no need for an explaination
>>108631874step one is always install a UI from the OP and gen a pictureforge is the spiritual successor to a1111. comfyui is more premiere
Morning anonsPinkuokka lmao>>108631874Flux Kontext shouube able to run on reforge, still unsure how to use comfy myself.
>>108631874Assuming you have a GPU with 12+gb vramStart by installing comfyui from https://github.com/comfyanonymous/ComfyUIInstall all the dependencies.Go to the templates and click in image, there are image edit templates. Use one which works for you.Skipping lots of steps but that's what the thread news is about.
>>108631907>>108631909>>108631920alright thanks anon i think i can work with that
>we get a brand new theme poster>nobody engages them>just keep spamming slop This is why everyone left
>>108632226you're still hereyou didnt do shit eitherbe the change you want to see
>>108632284OK debo
>>108632288you're dumb
>>108632226>brand new theme posterwat
hi
tfw nogens
>>108632379hello
>>108632379good afternoon
new>>108633149>>108633149>>108633149