[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: 3-corp-art.png (1.42 MB, 896x1152)
1.42 MB
1.42 MB PNG
Previous /sdg/ thread : >>107653059

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
>mfw Resource news

12/25/2025

>Lumi Tools v1.1.0 adds LLM processors, new utility nodes, and more
https://github.com/illuminatianon/comfyui-lumi-tools/releases/tag/v1.1.0

>Input-Adaptive Visual Preprocessing for Efficient Fast Vision-Language Model Inference
https://github.com/kmdavidds/mlfastlm

>Rethinking Direct Preference Optimization in Diffusion Models
https://github.com/kaist-cvml/RethinkingDPO_Diffusion_Models

>ComfyUI-LG_SamplingUtils
https://github.com/LAOGOU-666/ComfyUI-LG_SamplingUtils

12/24/2025

>PhotoMapAI: fast, modern image browser and search tool for large photo collections
https://github.com/lstein/PhotoMapAI

12/23/2025

>StoryMem: Multi-shot Long Video Storytelling with Memory
https://kevin-thu.github.io/StoryMem

>Qwen-Image-Edit-2511
https://huggingface.co/Qwen/Qwen-Image-Edit-2511
https://huggingface.co/lightx2v/Qwen-Image-Edit-2511-Lightning
https://huggingface.co/unsloth/Qwen-Image-Edit-2511-GGUF

>CASA: Cross-Attention via Self-Attention for Efficient VL Fusion
https://kyutai.org/casa

>The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
https://github.com/WeichenFan/UAE

>MaskFocus: Focusing Policy Optimization on Critical Steps for Masked ImGen
https://github.com/zghhui/MaskFocus

>Efficient Zero-Shot Inpainting with Decoupled Diffusion Guidance
https://github.com/YazidJanati/ding

>ComfyUI-SpectralVAEDetailer
https://github.com/SparknightLLC/ComfyUI-SpectralVAEDetailer

>Wan2.1 NVFP4 quantization-aware 4-step distilled models
https://huggingface.co/lightx2v/Wan-NVFP4

>Majoor Assets Manager for ComfyUI
https://github.com/MajoorWaldi/ComfyUI-Majoor-AssetsManager

12/22/2025

>Region-Constraint In-Context Generation for Instructional Video Editing
https://zhw-zhang.github.io/ReCo-page

>Infinite-Homography as Robust Conditioning for Camera-Controlled VidGen
https://emjay73.github.io/InfCam

>SAM 3 Segmentation Agent Now in ComfyUI
https://github.com/adambarbato/ComfyUI-Segmentation-Agent
>>
>mfw Research news

12/25/2025

>VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs
https://arxiv.org/abs/2512.21194

>HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
https://arxiv.org/abs/2512.21338

>Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models
https://sytwu.github.io/BeyondMemo

>GriDiT: Factorized Grid-Based Diffusion for Efficient Long Image Sequence Generation
https://arxiv.org/abs/2512.21276

>ACD: Direct Conditional Control for Video Diffusion Models via Attention Supervision
https://arxiv.org/abs/2512.21268

>DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation
https://dreamontage.github.io/DreaMontage

>FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting
https://arxiv.org/abs/2512.21104

>T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
https://arxiv.org/abs/2512.21094

>Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations
https://arxiv.org/abs/2512.21004

>FluencyVE: Marrying Temporal-Aware Mamba with Bypass Attention for Video Editing
https://arxiv.org/abs/2512.21015

>Generalization of Diffusion Models Arises with a Balanced Representation Space
https://arxiv.org/abs/2512.20963

>UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement
https://arxiv.org/abs/2512.21185

>Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection
https://arxiv.org/abs/2512.20937
>>
>mfw Yesterday's Research news

12/24/2025

>AI Image Generators Default to the Same 12 Photo Styles, Study Finds
https://gizmodo.com/ai-image-generators-default-to-the-same-12-photo-styles-study-finds-2000702012

>SemanticGen: Video Generation in Semantic Space
https://jianhongbai.github.io/SemanticGen

>FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
https://arxiv.org/abs/2512.20561

>CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation
https://arxiv.org/abs/2512.20362

>TAVID: Text-Driven Audio-Visual Interactive Dialogue Generation
https://arxiv.org/abs/2512.20296

>How I Met Your Bias: Investigating Bias Amplification in Diffusion Models
https://arxiv.org/abs/2512.20233

>AMoE: Agglomerative Mixture-of-Experts Vision Foundation Model
https://arxiv.org/abs/2512.20157

>HEART-VIT: Hessian-Guided Efficient Dynamic Attention and Token Pruning in Vision Transformer
https://arxiv.org/abs/2512.20120

>Item Region-based Style Classification Network (IRSN): A Fashion Style Classifier Based on Domain Knowledge of Fashion Experts
https://arxiv.org/abs/2512.20088

>UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images
https://arxiv.org/abs/2512.20479

>How Much 3D Do Video Foundation Models Encode?
https://vidfm-3d-probe.github.io

>Few-Shot-Based Modular Image-to-Video Adapter for Diffusion Models
https://arxiv.org/abs/2512.20000

>Learning to Refocus with Video Diffusion Models
https://arxiv.org/abs/2512.19823

>Beyond Vision: Contextually Enriched Image Captioning with Multi-Modal Retrieva
https://arxiv.org/abs/2512.20042
>>
File: l-nbp-2025-12-26_00011_.png (1.29 MB, 1376x768)
1.29 MB
1.29 MB PNG
>>107670781
no version in readme no problems :)
>>
File: deJS_zi_00019_.png (2.63 MB, 1408x1536)
2.63 MB
2.63 MB PNG
news, posted
thread, filled
xmas, over
food, eaten
sleep, awaits
gn
>>
File: l-nbp-2025-12-26_00015_.png (1.49 MB, 1376x768)
1.49 MB
1.49 MB PNG
>>107670843
gn
>>
trying out the old SDXL quokka LoCon on illustrious, he turns into a bird like pokemon
>>107670843
gn anon :)
>>
yep, illustrous identifies quokka as bird, lmao
>>
File: l-nbp-2025-12-26_00027_.png (1.31 MB, 1376x768)
1.31 MB
1.31 MB PNG
>>
File: l-nbp-2025-12-26_00045_.png (1.74 MB, 1344x768)
1.74 MB
1.74 MB PNG
>>
File: l-nbp-2025-12-26_00052_.png (1.62 MB, 1344x768)
1.62 MB
1.62 MB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.