[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: SourceBook2.jpg (246 KB, 1828x2350)
246 KB
246 KB JPG
Previous /sdg/ thread : >>107583401

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
First for containment general
>>
Morning anons
>>
File: 0866609184concession.jpg (211 KB, 2007x2007)
211 KB
211 KB JPG
>>
Forgot image lul
>>
>>
>gm
>>
>>
Where's the news
>>
>>107603482
Look in the previous thread. >>107583401
>>
>>
>>
File: SDG_News_00008_.png (2.61 MB, 1728x1344)
2.61 MB
2.61 MB PNG
>mfw Resource news

12/19/2025

>Generative Refocusing: Flexible Defocus Control from a Single Image
https://generative-refocusing.github.io

>FlashPortrait
https://francis-rings.github.io/FlashPortrait

>AdaTooler-V: Adaptive Tool-Use for Images and Videos
https://github.com/CYWang735/AdaTooler-V

>SFTok: Bridging the Performance Gap in Discrete Tokenizers
https://github.com/Neur-IO/SFTok

>Alchemist: Unlocking Efficiency in T2I Model Training via Meta-Gradient Data Selection
https://kxding.github.io/project/Alchemist

>Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image
https://github.com/facebookresearch/MMRB2

>Pixel Seal: Adversarial-only training for invisible image and video watermarking
https://github.com/facebookresearch/videoseal

>RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing
https://replan-iv-edit.github.io

>The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text
https://worldcanvas.github.io

>Trainable Log-linear Sparse Attention for Efficient Diffusion Transformers
https://github.com/SingleZombie/LLSA

>SmartGallery for ComfyUI
https://github.com/biagiomaf/smart-comfyui-gallery

>Photo Tinder - Desktop app for image triage and ranking
https://github.com/relaxis/photo-tinder-desktop

12/18/2025

>Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
https://github.com/QwenLM/Qwen-Image-Layered

>Vibe Spaces for Creatively Connecting and Expressing Visual Concepts
https://huzeyann.github.io/VibeSpace-webpage

>Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning
https://github.com/JoeLeelyf/Skyra

>SoFlow: Solution Flow Models for One-Step Generative Modeling
https://github.com/zlab-princeton/SoFlow

>TagScribeR v2: Dataset curator powered by Qwen 3-VL
https://github.com/ArchAngelAries/TagScribeR

>Meet the New ComfyUI-Manager
https://blog.comfy.org/p/meet-the-new-comfyui-manager
>>
>mfw Research news

12/19/2025

>Detecting Localized Deepfakes: How Well Do Synthetic Image Detectors Handle Inpainting?
https://arxiv.org/abs/2512.16688

>EasyV2V: A High-quality Instruction-based Video Editing Framework
https://snap-research.github.io/easyv2v

>Yuan-TecSwin: A text conditioned Diffusion model with Swin-transformer blocks
https://arxiv.org/abs/2512.16586

>GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
https://arxiv.org/abs/2512.16853

>StageVAR: Stage-Aware Acceleration for Visual Autoregressive Models
https://arxiv.org/abs/2512.16483

>Geometric Disentanglement of Text Embeddings for Subject-Consistent Text-to-Image Generation using A Single Prompt
https://arxiv.org/abs/2512.16443

>Factorized Video Generation: Decoupling Scene Construction and Temporal Synthesis in Text-to-Video Diffusion Models
https://arxiv.org/abs/2512.16371

>EverybodyDance: Bipartite Graph-Based Identity Correspondence for Multi-Character Animation
https://arxiv.org/abs/2512.16360

>PixelArena: A benchmark for Pixel-Precision Visual Intelligence
https://pixelarena.reify.ing/project

>VIVA: VLM-Guided Instruction-Based Video Editing with Reward Optimization
https://viva-paper.github.io

>Instant Expressive Gaussian Head Avatar via 3D-Aware Expression Distillation
https://research.nvidia.com/labs/amri/projects/instant4d

>Kling-Omni Technical Report
https://arxiv.org/abs/2512.16776

>FrameDiffuser: G-Buffer-Conditioned Diffusion for Neural Forward Frame Rendering
https://framediffuser.jdihlmann.com

>REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion
https://arxiv.org/abs/2512.16636

>DeContext as Defense: Safe Image Editing in Diffusion Transformers
https://arxiv.org/abs/2512.16625

>C-DGPA: Class-Centric Dual-Alignment Generative Prompt Adaptation
https://arxiv.org/abs/2512.16164
>>
>mfw Yesterday's Research news

12/18/2025

>CLIP-FTI: Fine-Grained Face Template Inversion via CLIP-Driven Attribute Conditioning
https://arxiv.org/abs/2512.15433

>Image Complexity-Aware Adaptive Retrieval for Efficient Vision-Language Models
https://arxiv.org/abs/2512.15372

>Generative Preprocessing for Image Compression with Pre-trained Diffusion Models
https://arxiv.org/abs/2512.15270

>MMMamba: A Versatile Cross-Modal In Context Fusion Framework for Pan-Sharpening and Zero-Shot Image Enhancement
https://arxiv.org/abs/2512.15261

>Null-LoRA: Low-Rank Adaptation on Null Space
https://arxiv.org/abs/2512.15233

>Robust and Calibrated Detection of Authentic Multimedia Content
https://arxiv.org/abs/2512.15182

>Borrowing from anything: A generalizable framework for reference-guided instance editing
https://arxiv.org/abs/2512.15138

>3DProxyImg: Controllable 3D-Aware Animation Synthesis from Single Image via 2D-3D Aligned Proxy Embedding
https://arxiv.org/abs/2512.15126

>Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets
https://lowlevelbanana.github.io

>DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models
https://arxiv.org/abs/2512.15713

>End-to-End Training for Autoregressive Video Diffusion via Self-Resampling
https://guoyww.github.io/projects/resampling-forcing

>VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression
https://kylesargent.github.io/vlic

>Spatia: Video Generation with Updatable Spatial Memory
https://zhaojingjing713.github.io/Spatia

>Where is the Watermark? Interpretable Watermark Detection at the Block Level
https://arxiv.org/abs/2512.14994

>TalkVerse: Democratizing Minute-Long Audio-Driven Video Generation
https://zhenzhiwang.github.io/talkverse

>InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization
https://arxiv.org/abs/2512.15644
>>
>>
File: 00086-1712971008.jpg (1.13 MB, 2048x2560)
1.13 MB
1.13 MB JPG
>>
File: 00088-2099698442.jpg (1.26 MB, 2048x2560)
1.26 MB
1.26 MB JPG
>>
>>
>>
File: FYtcB7WXoAMJLsK.jpg (243 KB, 1664x1664)
243 KB
243 KB JPG
found an old account from 2022 with a.i. stuff of mine.
>>
File: FYufZgYXwAA5Q2c.jpg (607 KB, 1664x1664)
607 KB
607 KB JPG
>>
File: FY2j49fWIAY5nb5.jpg (442 KB, 1664x1664)
442 KB
442 KB JPG
>>
File: FY2j9D1XkAEH33u.jpg (244 KB, 1152x2048)
244 KB
244 KB JPG
>>
File: FY4AHeaX0AUB0yc.jpg (358 KB, 1664x1664)
358 KB
358 KB JPG
>>
File: FZCVrEgWIAA3MB6.jpg (448 KB, 2048x819)
448 KB
448 KB JPG
>>
File: FZR3oBaXgAAcDbB.jpg (427 KB, 1152x2048)
427 KB
427 KB JPG
>>
File: FZhQSkvXgAAP8fH.jpg (45 KB, 512x512)
45 KB
45 KB JPG
>>
File: FZwBF5gWIAIPkuG.jpg (130 KB, 576x832)
130 KB
130 KB JPG
mfw
>>
File: FZ0rx5zWQAIyjgT.jpg (89 KB, 832x512)
89 KB
89 KB JPG
>>
>>
File: FZ13ohuWQAAMHlh.jpg (57 KB, 832x832)
57 KB
57 KB JPG
>>
>>
File: FaKKbcRXEAMKjhB.jpg (102 KB, 512x704)
102 KB
102 KB JPG
this'll conclude the 2022 throwback
>>
>>
File: z-img_00085_.png (1.9 MB, 1024x1536)
1.9 MB
1.9 MB PNG
WTH is this new Captcha
>>
File: z-img_00093_.png (1.74 MB, 1024x1536)
1.74 MB
1.74 MB PNG
>>
>>107605463
>WTH is this new Captcha
what are you, some kind of robot?
>>
File: z-img_00102_.png (2.1 MB, 1024x1536)
2.1 MB
2.1 MB PNG
>>107605590
after using all my brain cells i made sense of it :P
>>
>>107605610
i'm keeping my eye on you, clanker
>>
File: z-img_00094_.png (1.76 MB, 1024x1536)
1.76 MB
1.76 MB PNG
>>107605651
It's me.. ~pzzt* me... The 1guy who keeps posting blond blue eyed 1girls.
[*.........]
[*****...]
[*******]
Self check complete, switching to lurker mode.
>>
>>
File: 0x0730477002x0.jpg (885 KB, 4608x3584)
885 KB
885 KB JPG
>>
File: z-img_00127_.png (1.74 MB, 1024x1536)
1.74 MB
1.74 MB PNG
>>
>>
>>107605785
i like those
>>
>>
File: z-img_00116_.png (1.6 MB, 1024x1536)
1.6 MB
1.6 MB PNG
>>107605991
Thank you kindly :)
>>107606001
I tried elf ears with img-z-img and it tended to turn out asian :D
>>
File: z-img_00002_.png (2 MB, 1024x1536)
2 MB
2 MB PNG
>>
File: 00003-3571797359.jpg (695 KB, 1344x1728)
695 KB
695 KB JPG
>>
File: deKF_zi_00002_.png (3.22 MB, 2176x1152)
3.22 MB
3.22 MB PNG
>>107606643
can I get a piccolo in this style?
>>
Thoughts on this?
https://github.com/Comfy-Org/ComfyUI-Manager
>>
File: deKF_zi_00001_.png (2.41 MB, 2176x1152)
2.41 MB
2.41 MB PNG
>>107607081
I think its a pretty cool guy



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.