/g/ - Technology

File: 1739947702797547.png (2 MB, 1747x1112)
Previous /sdg/ thread: >>107191070

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Flux.1 Krea
https://docs.comfy.org/tutorials/flux/flux1-krea-dev
https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
https://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>
>mfw Resource news

11/13/2025

>Kandinsky-5.0-I2V-Pro-sft-5s-Diffusers
https://huggingface.co/kandinskylab/Kandinsky-5.0-I2V-Pro-sft-5s-Diffusers/tree/main

>Causally-Grounded Dual-Path Attention Intervention for Object Hallucination Mitigation in LVLMs
https://github.com/CikZ2023/OWL

>Diversifying Counterattacks: Orthogonal Exploration for Robust CLIP Inference
https://github.com/bookman233/DOC

11/12/2025

>Multi-modal Deepfake Detection and Localization with FPN-Transformer
https://github.com/Zig-HS/MM-DDL

>3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation
https://yunhonghe1021.github.io/NOVA

>xdit-comfyui-private: Parallel Multi GPU worker
https://github.com/xdit-project/xdit-comfyui-private

>Moondream 3 HF
https://huggingface.co/NyxKrage/moondream3-hf

11/11/2025

>StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
http://streamdiffusionv2.github.io

>Turbo-DDCM: Fast and Flexible Zero-Shot Diffusion-Based Image Compression
https://amitvaisman.github.io/turbo_ddcm

>Hilbert-Guided Block-Sparse Local Attention
https://github.com/Yunge6666/Hilbert-Local-Attention

>SUP Toolbox App: Gradio web interface for SUP-Toolbox image restoration and upscaling
https://github.com/DEVAIEXP/sup-toolbox-app

>Gausian Native Editor: Fast, native video editor and preview tool built in Rust
https://github.com/gausian-AI/Gausian_native_editor

>Minimalistic Comfy Wrapper WebUI
https://github.com/light-and-ray/Minimalistic-Comfy-Wrapper-WebUI

11/10/2025

>DeepEyesV2: Toward Agentic Multimodal Model
https://visual-agent.github.io

>Role-SynthCLIP: A Role Play Driven Diverse Synthetic Data Approach
https://github.com/huangfu170/Role-SynthCLIP

>Ovi 1.1 Update: Temporal-consistent 10-second video generation
https://github.com/character-ai/Ovi#-ovi-11-update-10-november-2025

>OpenAI Blowing As Much As $15 Million/Day On Sora Videos
https://www.forbes.com/sites/phoebeliu/2025/11/09/openai-spending-ai-generated-sora-videos
>>
>mfw Research news

11/13/2025

>Revisiting Cross-Architecture Distillation: Adaptive Dual-Teacher Transfer for Lightweight Video Models
https://arxiv.org/abs/2511.09469

>DBINDS -- Can Initial Noise from Diffusion Model Inversion Help Reveal AI-Generated Videos?
https://arxiv.org/abs/2511.09184

>FSampler: Training Free Acceleration of Diffusion Sampling via Epsilon Extrapolation
https://arxiv.org/abs/2511.09180

>Ultra-Light Test-Time Adaptation for Vision-Language Models
https://arxiv.org/abs/2511.09101

>Composition-Incremental Learning for Compositional Generalization
https://arxiv.org/abs/2511.09082

>Negative Entity Suppression for Zero-Shot Captioning with Synthetic Images
https://arxiv.org/abs/2511.08909

>DT-NVS: Diffusion Transformers for Novel View Synthesis
https://arxiv.org/abs/2511.08823

>BayesQ: Uncertainty-Guided Bayesian Quantization
https://arxiv.org/abs/2511.08821

>Harnessing Diffusion-Generated Synthetic Images for Fair Image Classification
https://arxiv.org/abs/2511.08711

>FGM-HD: Boosting Generation Diversity of Fractal Generative Models through Hausdorff Dimension Induction
https://arxiv.org/abs/2511.08945

>From Structure to Detail: Hierarchical Distillation for Efficient Diffusion Model
https://arxiv.org/abs/2511.08930

>Rethinking generative image pretraining: How far are we from scaling up next-pixel prediction?
https://arxiv.org/abs/2511.08704

>SPEED-Q: Staged Processing with Enhanced Distillation towards Efficient Low-bit On-device VLM Quantization
https://arxiv.org/abs/2511.08914
>>
File: deRA_cHD_00020_.png (3.85 MB, 2150x1229)
>>
File: 00004-3519584096.png (2.15 MB, 1792x1024)
lake
>>
>>107201110
>>107201181
reposting question

Also is it me or has Comfy generally regressed? It's unstable as fuck now.
>>
File: ComfyUI_wan_00010_.mp4 (3.5 MB, 1280x720)
>>107201680
>>
File: deRA_cHD_00021_.png (3.45 MB, 2150x1229)
>>107201680
can't argue with that
>>
File: 05mcollage.jpg (439 KB, 2413x1880)
>>
File: 00009-2694588641.png (1.73 MB, 1792x1024)
>>107201721
in her final turn around she has an ass on the front and back. i think i'm in love
>>
File: 00010-2694588644.png (1.84 MB, 1792x1024)
https://sora.chatgpt.com/d/gen_01ka029cksea9s1gvg7zfrbqjk
>>
File: deRA_cHD_00022_.png (3.69 MB, 2150x1229)
>>107201837
Invalid URL (GET /backend/project_y/profile/drafts/v2/gen_01ka029cksea9s1gvg7zfrbqjk)
>>
File: 00015-2179046265.png (1.63 MB, 1792x1024)
>>107201861
weird. uh wait try it now. apparently, you need to publish things? idk sora is retarded, probably one of the worst UIs ive ever used in my life, and i use ffmpeg on a regular basis.
>>
File: 00016-2179046267.png (2.08 MB, 1792x1024)
https://sora.chatgpt.com/p/s_6916ca4b506c8191981eba893d89c043
have to come up with some better ideas... i'm also forced to contend with "what does bunchan sound like" which, frankly, i've never thought about for even a moment
>>
File: deRA_cHD_00026_.png (3.41 MB, 2150x1229)
>>107201904
invalid GET
I even tried pressing the esc key twice (like the movie hackers do). no luck
>>
File: deRA_cHD_00027_.png (3.88 MB, 2150x1229)
>>107201929
that one worked
fuckin' tin-fingers
>>
File: 00021-2993617756.png (2.34 MB, 1792x1024)
>>107201930
oh man after all this ur gonna be so disappointed lmao. lemme try catbox
https://files.catbox.moe/fjivzo.mp4
https://files.catbox.moe/32o2kl.mp4
>>
File: 00022-2993617759.png (2.02 MB, 1792x1024)
>>
File: deRA_cHD_00030_.png (3.69 MB, 2150x1229)
>>107201949
I can never be disappointed in bunchan
>>
File: 00031-1993946341.png (2.12 MB, 1792x1024)
>>
Something about these generals feels very Israeli
Can't put my finger on it, but you can just feel it in your gut sometimes
>>
File: deRA_cHD_00032_.png (3.62 MB, 2150x1229)
>>107202105
why hasn't bibi bought me a 5090 then?
>>
File: 00032-1993946343.png (1.83 MB, 1792x1024)
>>107202105
shalom! be afraid, the mossad is hiding in your JPEGs
>>
File: gen.out2.jpg (217 KB, 1889x1469)
>>
File: deRA_cHD_00033_.png (3.95 MB, 2150x1229)
>>
File: 00037-2006822991.png (1.93 MB, 1792x1024)
>>
File: 00038-2006822993.png (2.25 MB, 1792x1024)
>>
>>107201681
On closer analysis, it seems like rocm/pytorch defaults to ROCm 7.1, and I think my 6700XT really doesn't like any ROCm version above 6.4.
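
To check which ROCm version a PyTorch install was actually built against, something like the Python sketch below might help. It assumes a ROCm build of PyTorch, where `torch.version.hip` holds the HIP version string (its major.minor normally tracks the ROCm release; it is `None` on CUDA and CPU builds). The 6.4 ceiling is only the observation from this post, not an official support limit, and the helper names are hypothetical.

```python
import re
from typing import Optional, Tuple

# Assumption taken from the post: 6.4 is the newest ROCm release
# that still behaves on this particular card. Not an official limit.
MAX_KNOWN_GOOD: Tuple[int, int] = (6, 4)


def parse_major_minor(version: Optional[str]) -> Optional[Tuple[int, int]]:
    """Pull (major, minor) out of a string such as '6.4.43482-0f2d60242'."""
    if not version:
        return None
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def rocm_is_too_new(version: Optional[str],
                    limit: Tuple[int, int] = MAX_KNOWN_GOOD) -> bool:
    """True when the parsed version is strictly newer than the limit."""
    parsed = parse_major_minor(version)
    return parsed is not None and parsed > limit


def installed_hip_version() -> Optional[str]:
    """Return torch.version.hip, or None if torch is missing or not a ROCm build."""
    try:
        import torch
    except (ImportError, OSError):
        return None
    return getattr(torch.version, "hip", None)


if __name__ == "__main__":
    hip = installed_hip_version()
    if hip is None:
        print("No ROCm build of PyTorch found")
    elif rocm_is_too_new(hip):
        print(f"PyTorch was built against ROCm/HIP {hip}, newer than {MAX_KNOWN_GOOD}")
    else:
        print(f"PyTorch was built against ROCm/HIP {hip}, within the known-good range")
```

If it reports something newer than 6.4, the usual way out would be installing a PyTorch wheel or container tag built against an older ROCm release rather than the default one.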
>>
File: WO_SEK_NA_CHICK_2.jpg (1.09 MB, 4608x3584)
>>
>gm nigbos
>>
Hey Lumi.

What model do you use to generate this schizoart?
>>
i miss schizo anon
>>
he's here. he's lumi
>>
File: WO_SEK_NA_CHICK_4.jpg (961 KB, 4608x3384)
>>
File: WO_SEK_NA_CHICK_3.jpg (930 KB, 4608x3584)


