/g/ - Technology



File: Variant 3_1211_091845.jpg (247 KB, 1792x2304)
Previous /sdg/ thread : >>107737741

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
>mfw Resource news

01/02/2026

>Qwen Image 2512 System Prompt
https://huggingface.co/spaces/Qwen/Qwen-Image-2512/blob/main/app.py

>Memorization in 3D Shape Generation: An Empirical Study
https://github.com/zlab-princeton/3d_mem

>ComfyUI-Niutonian-Themes
https://github.com/Niutonian/ComfyUI-Niutonian-Themes

>Qwen-Image-2512-Turbo-LoRA
https://huggingface.co/Wuli-art/Qwen-Image-2512-Turbo-LoRA

>Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples
https://github.com/davelee-uestc/nsf_debiasing

01/01/2026

>From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing
https://hjrphoebus.github.io/X-Dub

>Guiding a Diffusion Transformer with the Internal Dynamics of Itself
https://zhouxingyu13.github.io/Internal-Guidance

>DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
https://diffthinker-project.github.io

>Think Before You Move: Latent Motion Reasoning for Text-to-Motion Generation
https://chenhaoqcdyq.github.io/LMR

12/31/2025

>Qwen-Image-2512
https://huggingface.co/Qwen/Qwen-Image-2512

>Qwen Image 2512 Lightning 4Steps Lora By LightX2V
https://huggingface.co/lightx2v/Qwen-Image-2512-Lightning

>ComfyUI-HY-Motion1: A ComfyUI plugin based on HY-Motion 1.0 for text-to-3D human motion generation
https://github.com/jtydhr88/ComfyUI-HY-Motion1

12/30/2025

>HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation
https://github.com/Tencent-Hunyuan/HY-Motion-1.0

>ThinkGen: Generalized Thinking for Visual Generation
https://github.com/jiaosiyuu/ThinkGen

>SD.cpp-WebUI: Lightweight, browser-based web interface for stable-diffusion.cpp
https://github.com/taltoris/SD.cpp-WebUI

>PurifyGen: Risk-Discrimination and Semantic-Purification Model for Safe T2I Generation
https://github.com/AI-Researcher-Team/PurifyGen

>CoFi-Dec: Hallucination-Resistant Decoding via Coarse-to-Fine Generative Feedback in LVLMs
https://github.com/AI-Researcher-Team/CoFi-Dec
>>
File: bbs-zit-2026-01-03_00106_.png (2.62 MB, 1792x1024)
>>
>mfw Research news

01/02/2026

>GARDO: Reinforcing Diffusion Models without Reward Hacking
https://tinnerhrhe.github.io/gardo_project

>Iterative Inference-time Scaling with Adaptive Frequency Steering for Image Super-Resolution
https://arxiv.org/abs/2512.23532

>SoulX-LiveTalk Technical Report
https://arxiv.org/abs/2512.23379

>AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization
https://arxiv.org/abs/2512.23537

>Deep Learning for Art Market Valuation
https://arxiv.org/abs/2512.23078

>The Quest for Winning Tickets in Low-Rank Adapters
https://arxiv.org/abs/2512.22495

>PTalker: Personalized Speech-Driven 3D Talking Head Animation via Style Disentanglement and Modality Alignment
https://arxiv.org/abs/2512.22602

>Neighbor-Aware Token Reduction via Hilbert Curve for Vision Transformers
https://arxiv.org/abs/2512.22760

>Visual Autoregressive Modelling for Monocular Depth Estimation
https://arxiv.org/abs/2512.22653

>DeMoGen: Towards Decompositional Human Motion Generation with Energy-Based Diffusion Models
https://jiro-zhang.github.io/DeMoGen

>Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework
https://arxiv.org/abs/2503.10704

>Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models
https://arxiv.org/abs/2512.21815

>EasyOmnimatte: Taming Pretrained Inpainting Diffusion Models for End-to-End Video Layered Decomposition
https://arxiv.org/abs/2512.21865

>Towards Long-window Anchoring in Vision-Language Model Distillation
https://arxiv.org/abs/2512.21576

>FUSE: Unifying Spectral and Semantic Cues for Robust AI-Generated Image Detection
https://arxiv.org/abs/2512.21695

>Inference-based GAN Video Generation
https://arxiv.org/abs/2512.21776
>>
File: SDG_News_00024_.png (2.14 MB, 1728x1296)
>>107749846
I forgot I genned a news pic. too late. gn
>>
File: bbs-zit-2026-01-03_00129_.png (2.88 MB, 1792x1024)
<3
>>
File: _00030_.png (2.89 MB, 960x1600)
>>
File: winter night.webm (3.64 MB, 1920x640)
>>
File: 5654655664.jpg (183 KB, 832x1216)
>>
File: winter night 2.webm (3.45 MB, 1920x640)
>>107750774
Nice
>>
i miss schizo anon
>>
File: 00000-302066147.png (483 KB, 512x512)
>>
File: 222.webm (2.08 MB, 512x512)
>>107751117
>>
>>107751206
cool
>>
is there an easy free option for generating video from the illustrious slop i generate locally?
i run a gtx 1070 so obviously i can't do it on my machine, and trying to generate on civitai is not viable because it takes a fuck huge amount of tokens.
>>
Neeeeeeeerrrrrrddddddsssssssssss
(New Year Edition)
>>
File: 00000-1540226053.jpg (1.71 MB, 1792x2304)
>>
File: deSI_zi_00001_.png (2.14 MB, 1792x1152)
>>107751525
video needs too much compute for there to be free options

>>107752312
no u (have a happy new year)
nice to see you
>>
File: 00001-903276678.jpg (1.45 MB, 2304x1792)
>>
File: 000000_50673_.png (3.18 MB, 1711x968)
>>
File: 00002-1548823748.jpg (675 KB, 2304x1792)
>>
File: XQ_SE_CLASSICCANDY_01.jpg (740 KB, 4096x3608)
>>
File: ghoould.jpg (140 KB, 700x393)
>>107752796
>>
File: 00003-234695243.jpg (1.09 MB, 2304x1792)
>>
File: deSI_zi_00003_.png (2.21 MB, 1792x1152)
>>
File: 000000_50709_.png (2.46 MB, 1702x961)
>>107753215
>trying to get an old world tartarian airship attached..
>>
File: 00004-1105555908.jpg (1.06 MB, 1792x2304)
>>
File: 00005-2506781170.jpg (1.44 MB, 2048x2560)
>>
File: deSI_zi_00006_.png (2.23 MB, 1792x1152)
>>
So I've been watching Alchemy of Souls and I found a waifu.
>>107753345
ear fins, nice touch.

>>107753465
nice
>>
>>107749834
Anyone here trained LoRAs for SDXL models? i'm working on one as soon as i get the dataset ready (it's an OC, so i'm doing it all myself using a PonyXL model), and was wondering if you tag things like expressions or actions. when i made LoRAs before (on 1.5) i found it best to have as many expressions as possible in the dataset but not tag them, otherwise when you want the character to do a specific emotion it only draws from the images tagged with that emotion. was also wondering if i should avoid tagging actions like hand on hip/open mouth/looking away, etc. also i have a huge LoRA training guide i'm reading through if anyone wants it, someone in the thread the other day linked it

https://github.com/bghira/SimpleTuner#quickstart-guides
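>>
>>107754329
not a definitive answer, but the "keep the expressions in the dataset, leave them out of the captions" approach you describe can be sketched like this. assumptions: captions are comma-separated booru-style tags, and EXPRESSION_TAGS, strip_tags and the "mychar" trigger are hypothetical names for illustration, not part of SimpleTuner or any other trainer. whether untagged expressions are the right call for SDXL is a separate question.

```python
# Hypothetical tag set; adjust to whatever your tagger actually emits.
EXPRESSION_TAGS = {"smile", "open mouth", "angry", "crying", "looking away"}


def strip_tags(caption: str, unwanted: set[str]) -> str:
    """Return the caption with every tag in `unwanted` removed.

    Tags are compared case-insensitively after trimming whitespace.
    """
    kept = []
    for tag in caption.split(","):
        tag = tag.strip()
        if tag and tag.lower() not in unwanted:
            kept.append(tag)
    return ", ".join(kept)


if __name__ == "__main__":
    # "mychar" stands in for whatever trigger token you settle on.
    print(strip_tags("mychar, 1girl, smile, hand on hip, open mouth", EXPRESSION_TAGS))
```

run it over each caption .txt before training and keep the originals, so you can compare a tagged and an untagged run.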
>>
File: deSI_zi_00010_.png (2.26 MB, 1792x1152)
>>107753781
>Alchemy of Souls
kdrama, I'm assuming?
>>
>>107754329
Update: i was told not to use "ohwx" as the instance prompt and instead use a token the SDXL model will recognise that is similar to my character. my character is an OC but anime based, so should i find an anime character the model would recognise and base the token on that to make training better?
>>
File: deSI_zi_00013_.png (2.36 MB, 1792x1152)
>>
File: 00006-2709072481.jpg (1.13 MB, 2304x1792)
>>
File: ComfyUI_00104_.jpg (374 KB, 1792x2304)
>>
File: deSI_zi_00020_.png (2.19 MB, 1792x1152)
>>
File: ComfyUI_00109_.jpg (386 KB, 1792x2304)
>>
>>107754381
ah probably. not sure. it's about magic, mages and my waifu. I don't watch kdrama.
>>
File: file.png (92 KB, 1004x679)
we /rocm windows/ now
>>
File: deSI_zi_00022_.png (2.32 MB, 1792x1152)
>>107755854
nice. into the rocm universe
>>
File: deSI_zi_00028_.png (2 MB, 1792x1152)
>>
hello /sdg/
i recently bought a 5060 ti 16gb and while trying to prompt utilizing some aspects within stable diffusion and comfyui, i run into errors:

pytorch seems to be trying to use infinite memory, causing my SD or ComfyUI to crash suddenly, i have no idea what could be causing this and i've tried using other versions of pytorch, is this a common thing?

stable diffusion works fine for the most part but seems to run into problems whenever i try to use flux too, i would be very happy if anyone knew a fix to this
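>>
not a fix, and i can't tell the actual cause from your post, but one thing worth ruling out is the model weights simply not fitting in 16gb. a rough back-of-the-envelope check, assuming a flux-dev-class model of about 12B parameters (that number is an assumption, check the model card for the file you actually load). weight_gib and fits are hypothetical helper names, and the estimate ignores activations, the text encoder and the VAE:

```python
# Bytes per parameter for common weight formats (standard sizes).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "bf16": 2.0, "fp8": 1.0}


def weight_gib(num_params: float, dtype: str) -> float:
    """Approximate size of the model weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


def fits(num_params: float, dtype: str, vram_gib: float) -> bool:
    """True if the weights alone fit in VRAM (ignores activations, text encoder, VAE)."""
    return weight_gib(num_params, dtype) <= vram_gib


if __name__ == "__main__":
    params = 12e9  # ASSUMPTION: roughly 12B parameters
    for dtype in ("bf16", "fp8"):
        size = round(weight_gib(params, dtype), 1)
        print(dtype, size, "GiB, weights fit in 16 GiB:", fits(params, dtype, 16))
```

if the full-precision weights come out larger than your card, a smaller quant is the first thing to try before swapping pytorch versions again.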


