/g/ - Technology



File: 1769681690665286.jpg (1.39 MB, 1967x1967)
Discussion of Open Source Diffusion Models

Previous: >>108494530

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
File: 1766194561883981.jpg (2.5 MB, 3024x4032)
>>
File: 441314770780331.png (1.06 MB, 1664x2432)
>>
>mfw Resource news

04/01/2026

>DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
https://carlofkl.github.io/dreamlite

>MMFace-DiT: A Dual-Stream Diffusion Transformer for High-Fidelity Multimodal Face Generation
https://vcbsl.github.io/MMFace-DiT

>Hallucination-aware intermediate representation edit in LVLMs
https://github.com/ASGO-MM/HIRE

>CutClaw: Agentic Hours-Long Video Editing via Music Synchronization
https://github.com/GVCLab/CutClaw

>Extend3D: Town-Scale 3D Generation
http://seungwoo-yoon.github.io/extend3d-page

>PixlStash 1.0.0 release candidate
https://github.com/Pikselkroken/pixlstash/releases/tag/v1.0.0rc3

>adetailer-hires-sync: Automatically enables ADetailer in Forge
https://github.com/KazeKaze93/adetailer-hires-sync

03/31/2026

>See-through: Single-image Layer Decomposition for Anime Characters
https://github.com/shitagaki-lab/see-through

>VRAM Pager: Compressed GPU Memory Paging for Diffusion & Video Models
https://github.com/willjriley/vram-pager

>TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark
https://github.com/IDLabMedia/tgif-dataset

>Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting
https://differential-query-painter.github.io/DQ-painter

>Drift-AR: Single-Step Visual Autoregressive Generation via Anti-Symmetric Drifting
https://github.com/aSleepyTree/Drift-AR

>INSID3: Training-Free In-Context Segmentation with DINOv3
https://visinf.github.io/INSID3

>OmniColor: Unified Framework for Multi-modal Lineart Colorization
https://github.com/zhangxulu1996/OmniColor

>Gen-Searcher: Reinforcing Agentic Search for Image Generation
https://gen-searcher.vercel.app

>V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video LLMs
https://github.com/xinyouu/V-CAST

>GEMS: Agent-Native Multimodal Generation with Memory and Skills
https://gems-gen.github.io

>RAWIC: Bit-Depth Adaptive Lossless Raw Image Compression
https://github.com/chunbaobao/RAWIC
>>
>mfw Research news
>>108502500
>>108502506
>>
File: 57622191487741.png (626 KB, 832x1216)
>>
Mugen vs animal anon? For nsfw ofc

Also Jenner, share your LoRA please, I start to have a kink for her
>>
>>108502834
Anima unless you are a fudding lolcow thread schizo or member of their shillcord. Not even close.
>>
>>108502834
sdxl
>>
File: 132110849987576.png (995 KB, 1344x768)
>>
>julienbake
>>
Qwen and Klein are great in a lot of ways but I miss Chroma's soul. I haven't been keeping up with the Chroma meta since 1.0, though. I heard the flash heun lora was good but it looked really slopped (skill issue?) and Chroma really needs negative prompts. What are people using these days?
>>
File: Klein9BDistilledRemakes.jpg (3.54 MB, 2312x2604)
>>108502637
Remade this a couple slightly different ways (different prompt approaches) with Klein 9B Distilled
>>
File: 123436057959205.png (952 KB, 1152x896)
>>
>>108502903
I use either base or the DC-2K memeversion. As for flash loras, the rank 64 are the best.
>>
>>108502685
cake lady looks delish, catbox?
>>
>>108502928
The framing/composition is too simplistic, if you want to make it look more realistic you have to make them stand more off-center, make the framing not perfectly horizontally aligned, etc. Probably add a bit of photo blur and turn down the color saturation.
>>
File: 491221717145059.png (773 KB, 832x1216)
>>
>>108502954
>DC-2K
What is this? I found the files but there isn't any description or guidance on sampler settings. Is it just a drop-in replacement for Chroma-HD? Is it meant to gen at 2k?

As for the flash lora, I was using rank 64 and it was generating images that looked like base flux when I was trying to do something realistic. I gave up pretty quickly but it really didn't look promising.
>>
>>108502994
>Is it meant to gen at 2k?
No it's just more stable at higher than 1mpx gens. Other than that it's just chroma.
Idk I don't really gen 3dpd so I can't help you here.
>>
>>108502994
No one besides the five furtroons who basement dwell their cord 7/24 knows what the fuck exactly the trillion different schizo chroma spinoff experiments precisely are supposed to be.
Chroma is an astroturfed failbake and you will only waste your time with it. None of the variants or autistic workflows work without the most aggressive cherry picking and seed lottery game.
>>
File: Klein9BVsZIT.png (2.87 MB, 1566x1168)
What ZIT thinks white people look like is the biggest issue with it IMO, it's really not that realistic at all when it comes to faces in quite a lot of cases, you can immediately tell it's a Chinese model just by looking at the eyes most of the time
>>
>>108503012
Got it. I have to wonder if the flash lora was trained for anime or something because it absolutely tanked the realism which Chroma usually excels at.

>>108503018
Fuck off, we've got people in this thread that are still using fucking SDXL. Chroma has been surpassed in a lot of ways by recent models but I've made plenty of Chroma images that I just can't replicate with new models.
>>
>>108503059
desu there are multiple flash loras. I have like 5 and each is baked differently and gives different results.
>>
File: 385555633032208.png (959 KB, 896x1152)
>>
teto is so shit as a waifu it's not even funny anymore
>>
Chroma is only good/better if you're a furfags... If not just pick any Qwen/ZiT/Klein nsfw.
>>
>>108503312
Hermano, we're all furfags
>>
>>108503023
and don't forget to mention that zit has far more trouble with loras than klein
>>
>chinese anime man's hyped up 'new model' was API wan 2.7
local really is dead. absolutely zero developments this entire year.
>>
>108503312
>same one argument
>>
>>108503444
No he was referring to Nucleus MoE whose diffusers PR seems to be stuck in development hell.
>>
>>108503444
I'm actually glad local is stagnating. it means there's no reason for me to upgrade my PC. which saves me money. i'm going to ride out my 3090 128 GB of ram for the next 5 years.
>>
File: wan.png (287 KB, 708x608)
>>108503483
No he wasn't, he quoted his own post to refer to wan 2.7. Whatever nucleus is, it will be worse than z-image, qwen 2512, and flux klein. Local has nothing left, even Noob has to resort to shitty GLM-image because Qwen refuses to release image 2.0. The era of API is here
>>
>>108503525
Didn't check his recent posts, damn that sucks.
>>
>>108503525
so is Wan 2.7 image not something to be excited for?
>>
I thought anon would have a good april fools joke but instead he's just repeating the same troll he always uses :(
>>
File: 789507225759739.png (826 KB, 832x1216)
>>
>>108503579
Never mind, i didn't read that it was API. i thought it was being open sourced.
>>
The least they could do is release earlier wan versions like 2.5 but they won't even do that.
>>
>>108503591
it's only local until it's good
>>
>>108503603
they REALLY hate the idea of people using their own hardware to make shit. they probably think it's going to be a huge legal risk.
>>
>>108503636
I thought china was supposed to be heckin based and redpilled, saving AI from western jews and censorship, releasing local models to end capitalism and defeat OpenAI's monopoly?? Don't tell me that was just a bunch of a shill astroturfing after all and they sold out to SaaS just as quickly as everyone else
>>
File: 16436738523.jpg (137 KB, 640x705)
Looking through some of my old folders, people had it figured out in 2023 apparently.
>>
>>108503685
even cumfart sold out to saas
>>
>>108503685
Pretty much. You saw how China bent the knee and is going to strip Seedance of any ability to do anything before releasing it locally. China is ultimately just as scared of America as the rest of the world is.
>>
File: 1749783908067763.jpg (467 KB, 1744x1432)
*zitslops all over u*
>>
>>108502726
it sent shivers down my spine
>>
>>108503718
good, no one wants to train an overfit model.
>>
Klein (at least 4b) and qwen are censored. Qwen phroot is all or nothing. Moving to Pony or Illustrious
>>
>>108503418
Klein generates body horror with loras.
>>
>>108503712
you sound jealous
>>
>>108503902
you sound like a little bitch boy saas cuck
>>
>>108503953
you post about saas here all the time though
>>
>>108503972
I don't faggot


