[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 00022-2573822097.jpg (396 KB, 1344x1728)
396 KB
396 KB JPG
Previous /sdg/ thread : >>108493483

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
File: o_00713_.png (1.13 MB, 1280x768)
1.13 MB
1.13 MB PNG
>>
Shithole general
>>
>>
File: o_00714_.png (1.81 MB, 1152x896)
1.81 MB
1.81 MB PNG
>>
>>108501784
And you are still here, curious.
>>
Morning anons
>>
File: 00003-4187648827.png (3.4 MB, 1344x1728)
3.4 MB
3.4 MB PNG
>>108502001
morning. never forget
>>
>gm
>>
>>
>mfw Resource news

04/01/2026

>DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
https://carlofkl.github.io/dreamlite

>MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation
https://vcbsl.github.io/MMFace-DiT

>Hallucination-aware intermediate representation edit in LVLMs
https://github.com/ASGO-MM/HIRE

>CutClaw: Agentic Hours-Long Video Editing via Music Synchronization
https://github.com/GVCLab/CutClaw

>Extend3D: Town-Scale 3D Generation
http://seungwoo-yoon.github.io/extend3d-page

>PixlStash 1.0.0 release candidate
https://github.com/Pikselkroken/pixlstash/releases/tag/v1.0.0rc3

>adetailer-hires-sync: Automatically enables ADetailer in Forge
https://github.com/KazeKaze93/adetailer-hires-sync

03/31/2026

>See-through: Single-image Layer Decomposition for Anime Characters
https://github.com/shitagaki-lab/see-through

>VRAM Pager: Compressed GPU Memory Paging for Diffusion & Video Models
https://github.com/willjriley/vram-pager

>TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark
https://github.com/IDLabMedia/tgif-dataset

>Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting
https://differential-query-painter.github.io/DQ-painter

>Drift-AR: Single-Step Visual Autoregressive Generation via Anti-Symmetric Drifting
https://github.com/aSleepyTree/Drift-AR

>INSID3: Training-Free In-Context Segmentation with DINOv3
https://visinf.github.io/INSID3

>OmniColor: Unified Framework for Multi-modal Lineart Colorization
https://github.com/zhangxulu1996/OmniColor

>Gen-Searcher: Reinforcing Agentic Search for Image Generation
https://gen-searcher.vercel.app

>V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video LLMs
https://github.com/xinyouu/V-CAST

>GEMS: Agent-Native Multimodal Generation with Memory and Skills
https://gems-gen.github.io

>RAWIC: Bit-Depth Adaptive Lossless Raw Image Compression
https://github.com/chunbaobao/RAWIC
>>
>mfw Research news

04/01/2026

>Quantization with Unified Adaptive Distillation to enable multi-LoRA based one-for-all Generative Vision Models on edge
https://arxiv.org/abs/2603.29535

>SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation
https://arxiv.org/abs/2603.29186

>Abstraction in Style
https://arxiv.org/abs/2603.29924

>Stepper: Stepwise Immersive Scene Generation with Multiview Panoramas
https://fwmb.github.io/stepper

>Gloria: Consistent Character Video Generation via Content Anchors
https://yyvhang.github.io/Gloria_Page

>PromptForge-350k: Large-Scale Dataset and Contrastive Framework for AI Image Forgery Localization
https://arxiv.org/abs/2603.29386

>MEDiC: Multi-objective Exploration of Distillation from CLIP
https://arxiv.org/abs/2603.29009

>Multi-Feature Fusion Approach for Generative AI Images Detection
https://arxiv.org/abs/2603.29788

>CIPHER: Counterfeit Image Pattern High-level Examination via Representation
https://arxiv.org/abs/2603.29356

>MacTok: Robust Continuous Tokenization for ImgGen
https://arxiv.org/abs/2603.29634

>Diffusion Mental Averages
https://diffusion-mental-averages.github.io

>Unify-Agent: Unified Multimodal Agent for World-Grounded Image Synthesis
https://arxiv.org/abs/2603.29620

>SHIFT: Stochastic Hidden-Trajectory Deflection for Removing Diffusion-based Watermark
https://arxiv.org/abs/2603.29742

>Unbiased Model Prediction Without Using Protected Attribute Information
https://arxiv.org/abs/2603.29270

>Omni-NegCLIP: Enhancing CLIP with Front-Layer Contrastive Fine-Tuning for Comprehensive Negation Understanding
https://arxiv.org/abs/2603.29258

>MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines
https://ryanpo.com/multigen

>Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models
https://arxiv.org/abs/2602.15772

>When Test-Time Guidance Is Enough: Fast Image/Video Editing with Diffusion Guidance
https://arxiv.org/abs/2602.14157
>>
>mfw YESTERDAY's Research news

03/31/2026

>On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers
https://contextual-repulsion.github.io

>DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
https://carlofkl.github.io/dreamlite

>ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks
https://arxiv.org/abs/2603.27862

>EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation
https://arxiv.org/abs/2603.28405

>TokenDial: Continuous Attribute Control in Text-to-Video via Spatiotemporal Token Offsets
https://tokendial.github.io

>Attention Frequency Modulation: Training-Free Spectral Modulation of Diffusion Cross-Attention
https://arxiv.org/abs/2603.28114

>Diversity Matters: Dataset Diversification and Dual-Branch Network for Generalized AI-Generated Image Detection
https://arxiv.org/abs/2603.27800

>MathGen: Revealing the Illusion of Mathematical Competence through Text-to-Image Generation
https://arxiv.org/abs/2603.27959

>Rethinking Structure Preservation in Text-Guided Image Editing with Visual Autoregressive Models
https://arxiv.org/abs/2603.28367

>OPRO: Orthogonal Panel-Relative Operators for Panel-Aware In-Context Image Generation
https://arxiv.org/abs/2603.27637

>Customized Visual Storytelling with Unified Multimodal LLMs
https://arxiv.org/abs/2603.27690

>GEditBench v2: A Human-Aligned Benchmark for General Image Editing
https://arxiv.org/abs/2603.28547

>Inference-time Trajectory Optimization for Manga Image Editing
https://arxiv.org/abs/2603.27790

>Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment
https://arxiv.org/abs/2603.27987

>SonoWorld: From One Image to a 3D Audio-Visual Scene
https://humathe.github.io/sonoworld

>CoPE-VideoLM: Leveraging Codec Primitives For Efficient Video Language Modeling
https://microsoft.github.io/CoPE
>>
File: 00022-2979839411.png (2.17 MB, 1024x1280)
2.17 MB
2.17 MB PNG
>>
>>108502481
>adetailer-hires-sync: Automatically enables ADetailer in Forge
>Manually toggling the checkbox each time is friction.
>This extension hooks into the hires fix button and manages the ADetailer checkbox automatically:
checking a box is too much for some people
>>
File: deCC_zi_00028_.png (2.71 MB, 1920x1033)
2.71 MB
2.71 MB PNG
>>108502532
lol I wasnt sure if there was more to it or not. its rare for me to find anything to give to the forge folks though so I included it
>>
File: o_00715_.png (1.31 MB, 1280x768)
1.31 MB
1.31 MB PNG
>>
gm
>>
File: deCC_zi_00029_.png (2.78 MB, 1920x1033)
2.78 MB
2.78 MB PNG
>>108502778
gm
>>
File: o_00717_.png (2.11 MB, 1920x1080)
2.11 MB
2.11 MB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.