[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: k_00777_.png (3.09 MB, 1920x1080)
3.09 MB
3.09 MB PNG
Previous /sdg/ thread : >>107899825

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://huggingface.co/jayn7/Z-Image-Turbo-GGUF

>Flux.2 Dev
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/city96/FLUX.2-dev-gguf

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>>/r/realistic+parody
>>
>mfw Resource news

01/19/2026

>kohya-ss/sd-scripts v0.10.0 released
https://github.com/kohya-ss/sd-scripts/releases/tag/v0.10.0

>Radiance: Professional HDR Image Processing Suite for ComfyUI
https://github.com/fxtdstudios/radiance

>M3DDM+: An improved video outpainting by a modified masking strategy
https://github.com/tamaki-lab/M3DDM-Plus

>ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
http://facebookresearch.github.io/ShapeR

>VidLeaks: Membership Inference Attacks Against Text-to-Video Models
https://zenodo.org/records/17972831

>Moonworks Lunara Aesthetic Dataset
https://huggingface.co/datasets/moonworks/lunara-aesthetic

01/18/2026

>VIBE: Visual Instruction Based Editor
https://huggingface.co/iitolstykh/VIBE-Image-Edit

>Arthemy Live Tuner SDXL ComfyUI
https://github.com/aledelpho/Arthemy_Live-Tuner-SDXL-ComfyUI

>Pixel-Perfect Aligner (AI Fix) for GIMP 3
https://github.com/CombinEC-R/Pixel-Perfect-Aligner

>Stable AI Flow: Phase-Locked Live AI Filter
https://github.com/anttiluode/StableAIflow

>ComfyUI-Flux2Klein-Enhancer: Conditioning enhancement node for FLUX.2 Klein 9B
https://github.com/capitan01R/ComfyUI-Flux2Klein-Enhancer

>DiffusionDesk: Self-hosted Creative AI server integrating stable-diffusion.cpp and llama.cpp
https://github.com/Danmoreng/diffusion-desk

>WAN 2.6 Reference-to-Video is available in ComfyUI
https://blog.comfy.org/p/wan26-reference-to-video

01/17/2026

>FLUX.2 Prompting: Prompting Guide - FLUX.2 [klein]
https://docs.bfl.ai/guides/prompting_guide_flux2_klein

01/16/2026

>ComfyUI-CapitanFlowMatch: Optimal samplers and schedulers for rectified flow models
https://github.com/capitan01R/ComfyUI-CapitanFlowMatch

01/15/2026

>FLUX.2 [klein]: Generate and edit in less than a second with state-of-the-art quality
https://bfl.ai/models/flux-2-klein

>ComfyUI-TBG-ETUR: 100MP Enhanced Tiled Upscaler & Refiner Pro. Enhance Your Images with TBG's Upscaler
https://github.com/Ltamann/ComfyUI-TBG-ETUR
>>
>mfw Research news

01/19/2026

>Your One-Stop Solution for AI-Generated Video Detection
https://arxiv.org/abs/2601.11035

>PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models
https://arxiv.org/abs/2601.11087

>CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation
https://lucaria-academy.github.io/CoDance

>SoLA-Vision: Fine-grained Layer-wise Linear Softmax Hybrid Attention
https://arxiv.org/abs/2601.11164

>ATATA: One Algorithm to Align Them All
https://arxiv.org/abs/2601.11194

>Enhancing Vision Language Models with Logic Reasoning for Situational Awareness
https://arxiv.org/abs/2601.11322

>When Are Two Scores Better Than One? Investigating Ensembles of Diffusion Models
https://arxiv.org/abs/2601.11444

>MHA2MLA-VLM: Enabling DeepSeek's Economical Multi-Head Latent Attention across Vision-Language Models
https://arxiv.org/abs/2601.11464
>>
File: k_00831_.png (581 KB, 896x1152)
581 KB
581 KB PNG
>>
File: deGG_zi_00015_.png (3.2 MB, 2176x1152)
3.2 MB
3.2 MB PNG
we havent seen baker anon since around xmas. would he really vanish without even saying goodbye?
>>
File: k_00832_.png (1.45 MB, 896x1152)
1.45 MB
1.45 MB PNG
>>107914677
idk. it is depressing. we seem in the latter days of sdg. i am pretty burnt out.
>>
File: deGG_zi_00016_.png (3.2 MB, 2176x1152)
3.2 MB
3.2 MB PNG
>>107914783
post cadence hasn't been too bad lately, keeping close to a thread a day. I do miss a lot of posters though. maybe we'll see them again some day
>>
klein lora attemp 1 (.3) worked well enough
>>
>>
I created a kpop band. kek

Full video
https://files.catbox.moe/cljkr0.mp4
>>
File: deGG_zi_00018_.png (3.26 MB, 2176x1152)
3.26 MB
3.26 MB PNG
>>107915098
nice. how do you rate f2k trainability compared to chroma or zimg?

>>107915215
this was all local? pretty nice pacing and scene structure. the faces get pretty demonic in parts tho
how long did the whole project take?
>>
>>107915340
>how do you rate f2k trainability compared to chroma or zimg?
taking into account that the more i've trained loras the more i've learned what works and waht doesnt, and that i dont tend to go back to retrain old loras with new knowledge...
i know chroma well and i know what doesnt work on it well too, what works is a challenge because of the nature of training (datasets, samplers, optimizations)
given that..
chroma is hardest and longest to train, but if done right it's like you add your character/style to the model
z-image is fast to train but tends to overfit (because it's not a base model+i didnt spend too much time on it)
klein is not even fully implemented (in onetrainer anyway) and it's as fast if not faster than z, and so far seems to preserve the base model stuff without bullying it (chroma) or overfitting (z). kinda of best of both worlds, but i'm still ironing out some quirks.
it took 6 hrs or so to do 50 epochs (around 2500 steps). using 250+ source images with meh captions, batch 10 at res 1024
same thing on chroma would've been maybe 15-20 hrs (if not more), and i never tried that many source images on z, but probably around 5-6 hrs too
>>
gonna leave the chromagirl training overnight, see how it does at 100 epochs lel
>>
>>
>>107915340
>how long did the whole project take?
About 3-4 days, all local. Yeah, quality is bad because of just 12gb. Still amazing what you can do with just 12gb.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.