Previous /sdg/ thread : >>108633149>Beginner UIEasyDiffusion: https://easydiffusion.github.ioSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Advanced UIComfyUI: https://github.com/comfyanonymous/ComfyUIForge Classic: https://github.com/Haoming02/sd-webui-forge-classicStability Matrix: https://github.com/LykosAI/StabilityMatrix>Z-Imagehttps://comfyanonymous.github.io/ComfyUI_examples/z_imagehttps://huggingface.co/Tongyi-MAI/Z-Imagehttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Flux.2 Dev/Kleinhttps://comfyanonymous.github.io/ComfyUI_examples/flux2https://huggingface.co/black-forest-labs/FLUX.2-devhttps://huggingface.co/black-forest-labs/FLUX.2-klein-4Bhttps://huggingface.co/black-forest-labs/FLUX.2-klein-9B>Chromahttps://comfyanonymous.github.io/ComfyUI_examples/chromahttps://huggingface.co/lodestones/Chroma1-HDhttps://huggingface.co/silveroxides/Chroma-GGUF>Animahttps://huggingface.co/circlestone-labs/Anima>Qwen Image & Edithttps://docs.comfy.org/tutorials/image/qwen/qwen-imagehttps://huggingface.co/Qwen/Qwen-Image>Text & image to video - Wan 2.2https://docs.comfy.org/tutorials/video/wan/wan2_2>Models, LoRAs & upscalinghttps://civitai.comhttps://huggingface.cohttps://tungsten.runhttps://yodayo.com/modelshttps://www.diffusionarc.comhttps://miyukiai.comhttps://civitaiarchive.comhttps://civitasbay.orghttps://www.stablebay.orghttps://openmodeldb.info>Index of guides and other toolshttps://rentry.org/sdg-link>Related boards>>>/aco/sdg>>>/b/degen>>>/d/ddg>>>/e/edg>>>/gif/vdg>>>/h/hdg>>>/r/realistic+parody>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vp/napt>>>/vt/vtaiOP https://rentry.co/twkuk8tz
>mfw Resource news04/20/2026>Elucidating the SNR-t Bias of Diffusion Probabilistic Modelshttps://github.com/AMAP-ML/DCW>(1D) Ordered Tokens Enable Efficient Test-Time Searchhttps://soto.epfl.ch>Frequency-Aware Flow Matching for High-Quality Image Generationhttps://github.com/OliverRensu/FreqFlow>From Zero to Detail: A Progressive Spectral Decoupling Paradigm for UHD Image Restoration with New Benchmarkhttps://github.com/NJU-PCALab/ERR>China’s Alibaba launches 10,000-card computing clusterhttps://www.scmp.com/tech/article/3349335/ai-race-us-intensifies-chinas-alibaba-launches-10000-card-computing-cluster>Modly: Local, open source, AI-powered image-to-3D mesh generationhttps://github.com/lightningpixel/modly>DCW: Elucidating the SNR-t Bias of Diffusion Probabilistic Modelshttps://github.com/AMAP-ML/DCW04/19/2026>ZPix: Local AI image generator and editor powered by open image models. https://github.com/SamuelTallet/ZPix>Comfy Canvas: Local inline layer based image editorhttps://github.com/Zlata-Salyukova/Comfy-Canvas04/18/2026>Rose: Range-Of-Slice Equilibration PyTorch optimizerhttps://github.com/MatthewK78/Rose04/17/2026>ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handlinghttps://yjx-research.github.io/ControlFoley>TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokenshttps://research.nvidia.com/labs/toronto-ai/tokengs>MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generationhttps://aka.ms/mm-webagent>Qwen2D-VAEhttps://huggingface.co/Anzhc/Qwen2D-VAE>ComfyUI HY-World 2.0 — WorldMirror 3Dhttps://github.com/AHEKOT/ComfyUI_HYWorld2>Anima Style Explorer: A free web tool for ComfyUI styleshttps://anima.mooshieblob.com>Stanford AI Index Report 2026https://hai.stanford.edu/assets/files/ai_index_report_2026.pdf
>mfw Research news04/20/2026>Towards In-Context Tone Style Transfer with A Large-Scale Triplet Datasethttps://arxiv.org/abs/2604.16114>Beyond Text Prompts: Precise Concept Erasure through Text-Image Collaborationhttps://arxiv.org/abs/2604.15829>Motion-Adapter: A Diffusion Model Adapter for Text-to-Motion Generation of Compound Actionshttps://arxiv.org/abs/2604.16135>TwoHamsters: Benchmarking Multi-Concept Compositional Unsafety in Text-to-Image Modelshttps://arxiv.org/abs/2604.15967>Repurposing 3D Generative Model for Autoregressive Layout Generationhttps://fenghora.github.io/LaviGen-Page>The Amazing Stability of Flow Matchinghttps://arxiv.org/abs/2604.16079>DINOv3 Beats Specialized Detectors: A Simple Foundation Model Baseline for Image Forensicshttps://arxiv.org/abs/2604.16083>Sketch and Text Synergy: Fusing Structural Contours and Descriptive Attributes for Fine-Grained Image Retrievalhttps://arxiv.org/abs/2604.15735>AHS: Adaptive Head Synthesis via Synthetic Data Augmentationshttps://keh0t0.github.io/AHS>VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effectshttps://arxiv.org/abs/2604.16272>Adapting in the Dark: Efficient and Stable Test-Time Adaptation for Black-Box Modelshttps://arxiv.org/abs/2604.15609>From Competition to Coopetition: Coopetitive Training-Free Image Editing Based on Text Guidancehttps://arxiv.org/abs/2604.15948>UniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMshttps://arxiv.org/abs/2604.15871>Efficient Video Diffusion Models: Advancements and Challengeshttps://arxiv.org/abs/2604.15911>Making Image Editing Easier via Adaptive Task Reformulation with Agentic Executionshttps://arxiv.org/abs/2604.15917>Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Modelshttps://arxiv.org/abs/2510.21783
once again i only find the right flow when i'm about ready to go crash
>>108649890pls acknowledge my 420 masterpiece, best quality. it includes coreal
>>108649879>>108649908bruh i'm high and half sleep, i just posted on new thread lelbut now that i see it, great job!definitely me rnshould've said lolweed lmao blaze it 420
gn all
>>108650022gn
that's it for me
>>108650557gn
i miss schizo anon
Is it compute intensive to create LoRAs? Are the links in the rentry still mostly what I should be looking for guides etc. or have things changed?
>>108652391No, making a lora takes like 3 to 5 minutes with a 30 image dataset on a rtx 3060 12gb vram card
Morning Anons
>>108653975gm
>>108653975gmtry gpt-image-2 yet? eagerly awaiting the quokka metric
>>108654030It's not in their API yet. Maybe in an hour or so
gpt-image-2, some random bbs prompt that was sitting in the workflow already. guess i need to update my nodes to support openai to try it for real
this pricing table is weird, why 1024x1024 cost more than 1024x1536? bc reasons. idk what quality that test image was, the playground thing is pretty ass
>>108655520seems gpt-img-2 is mogging everyone (but still not great at nsfw)
this is nuts>>108654985>>108655069local when?
>>108655734apikeks will do anything to prove their models are better except create art
>>108655662i want to really kick the tires on it but i'm swamped at work and my nodes don't support it yet so it'll have to wait. full quality comparison of same image in gpt-image-2 vs zit (litterbox bc catbox closed due to jeetery)https://litter.catbox.moe/aft91j864n8zd33u.png
nano banana 2 is more faithfully ansi, and at 4k is fuckin yugehttps://litter.catbox.moe/h8xj8pr8g765ic2z.jpg
>>108656283based ANSI enjoyer
>>108656219>i'm swamped at workquit your job so you can gen moreelon and sam are giving us all AI bux soon anyway
nb2 looks like ass, but mostly bc it decided to do really ugly southpark style lmaohttps://litter.catbox.moe/b1lcws35gq1v9pge.png
>>108656557papercraft brought in a bit of a south park correlation, lol
>>108656590that prompt was just kind of cursed, it had "stacked paper layered cardstock" in there, nb2 is capable of making good ones lol. interestingly gpt-image-2 decided to ignore that part
>>108657245YOU MAD BITCH! MODACHODE
my hand on the right
>>108657944are you waterfalls anon?
>>108657875you would >>108657944omg he backnice to see you
>>108658017>wouldalways
mfw it's time to crash
>>108658140me holding the sword (paradox)
>>108658197the bug just has a large forearm
>boner in the promptand with that, gn all
>>108658228gn