/g/ - Technology

Discussion of Free and Open Source Diffusion Models

Previous: >>108384322

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

03/17/2026

>Self-E: Self-Evaluation Unlocks Any-Step Text-to-Image Generation
https://github.com/XinYu-Andy/SelfE

>Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning
https://github.com/UnicomAI/CoTj

>EditHF-1M: A Million-Scale Rich Human Preference Feedback for Image Editing
https://github.com/IntMeGroup/EditHF

>Representation Alignment for Just Image Transformers is not Easier than You Think
https://github.com/kaist-cvml/PixelREPA

>AdapterTune: Zero-Initialized Low-Rank Adapters for Frozen Vision Transformers
https://github.com/salimkhazem/adaptertune

>PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling
https://x-gengroup.github.io/HomePage_PaCo-RL

>LTX-2.3 NVFP4
https://huggingface.co/Lightricks/LTX-2.3-nvfp4

>Gamers react with overwhelming disgust to DLSS 5’s generative AI glow-ups
https://arstechnica.com/gaming/2026/03/gamers-react-with-overwhelming-disgust-to-dlss-5s-generative-ai-glow-ups

>Nvidia's Nemotron coalition brings eight AI labs together to build open frontier models
https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidias-nemoclaw-coalition-brings-eight-ai-labs-together-to-build-open-frontier-models

03/16/2026

>Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models
https://github.com/NVlabs/finite-difference-flow-optimization

>MemRoPE: Training-Free Infinite Video Generation via Evolving Memory Tokens
https://memrope.github.io

>MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization
https://chenyangzhu1.github.io/MoKus

>Less Data, Faster Convergence: Goal-Driven Data Optimization for Multimodal Instruction Tuning
https://github.com/rujiewu/GDO

>Rethinking VLMs for Image Forgery Detection and Localization
https://github.com/sha0fengGuo/IFDL-VLM
>>
>mfw Research news

03/17/2026

>Early Failure Detection and Intervention in Video Diffusion Models
https://arxiv.org/abs/2603.14320

>Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework
https://arxiv.org/abs/2603.14936

>PHAC: Promptable Human Amodal Completion
https://arxiv.org/abs/2603.14741

>CamLit: Unified Video Diffusion with Explicit Camera and Lighting Control
https://arxiv.org/abs/2603.14241

>Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models
https://arxiv.org/abs/2603.14186

>Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models
https://arxiv.org/abs/2603.14504

>IMS3: Breaking Distributional Aggregation in Diffusion-Based Dataset Distillation
https://arxiv.org/abs/2603.13960

>TMPDiff: Temporal Mixed-Precision for Diffusion Models
https://arxiv.org/abs/2603.14062

>LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion
https://zengqunzhao.github.io/LatSearch

>Diffusion Reinforcement Learning via Centered Reward Distillation
https://arxiv.org/abs/2603.14128

>Single Image Super-Resolution via Bivariate À Trous Wavelet Diffusion
https://arxiv.org/abs/2603.07234

>SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation
https://sk-adapter.github.io

>RAZOR: Ratio-Aware Layer Editing for Targeted Unlearning in Vision Transformers and Diffusion Models
https://arxiv.org/abs/2603.14819

>Workflow-Aware Structured Layer Decomposition for Illustration Production
https://arxiv.org/abs/2603.14925

>Texel Splatting: Perspective-Stable 3D Pixel Art
https://arxiv.org/abs/2603.14587

>GenState-AI: State-Aware Dataset for Text-to-Video Retrieval on AI-Generated Videos
https://arxiv.org/abs/2603.14426

>AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas
https://arxiv.org/abs/2603.14770
>>
>>108393213
>Neighbors
>>>/vg/vpcai
>>
>mfw MORE Research news

>MotionCFG: Boosting Motion Dynamics via Stochastic Concept Perturbation
https://arxiv.org/abs/2603.14073

>Spectrum Matching: a Unified Perspective for Superior Diffusability in Latent Diffusion
https://arxiv.org/abs/2603.14645

>Not All Directions Matter: Toward Structured and Task-Aware Low-Rank Adaptation
https://arxiv.org/abs/2603.14228

>Seeking Physics in Diffusion Noise
https://arxiv.org/abs/2603.14294

>CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models
https://arxiv.org/abs/2603.14957

>FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection
https://arxiv.org/abs/2603.14220

>Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders
https://arxiv.org/abs/2603.14536

>Balancing Saliency and Coverage: Semantic Prominence-Aware Budgeting for Visual Token Compression in VLMs
https://arxiv.org/abs/2603.14892

>M2IR: Proactive All-in-One Image Restoration via Mamba-style Modulation and Mixture-of-Experts
https://arxiv.org/abs/2603.14816

>ASAP: Attention-Shift-Aware Pruning for Efficient LVLM Inference
https://arxiv.org/abs/2603.14549

>Towards Generalizable Deepfake Detection via Real Distribution Bias Correction
https://arxiv.org/abs/2603.14005

>GameUIAgent: An LLM-Powered Framework for Automated Game UI Design with Structured Intermediate Representation
https://arxiv.org/abs/2603.14724

>Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
https://haozheliu-st.github.io/mos-homepage

>Secure and Robust Watermarking for AI-generated Images: A Comprehensive Survey
https://arxiv.org/abs/2510.02384
>>
>>108393263
>>108393269
Damn man, that's a lot. Any particularly interesting paper there?
And thanks for all the work, by the way.
>>
File: ComfyUI_00024_.jpg (208 KB, 1024x1536)
>>108393213
Welcome to the final days and last threads of /ldg/, relax, and make yourself at home! ^^
>>
File: deBU_zi_00022_.png (1.97 MB, 1536x922)
>>108393281
tuesday dumps are the biggest. I've never known why
>Any particularly interesting paper there?
depends on what you're interested in. I personally found this interesting:
>SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation
>>
>>108393351
u wish
>>
What's the best local way to add porn audio to porn video gens? I don't want to reroll LTX-2 50 times until I get something barely usable...
>>
can someone link an online upscaler? I just need it for one pic
>>
File: 1748681442674816.png (64 KB, 747x707)
>>108393405
mmaudio nsfw. it needs schizo negatives though. pic related is what I came up with for now
>>
>>108393445
>fizzle
>>
>>108393351
Do you sell any souvenirs or pins as a memento?
>>
>>108393445
a man of taste i see.
>>
>>108393453
hurr durr herrr burr durrr berr durr durr ddurrrr
kill yourself
>>
>>108393445
how light/heavy is it to use?
>>
>>108393454
Yes!^^
I have many, which memento do you prefer?

I was there for the last /ldg/ threads
Survived the last /ldg/ threads
Last /ldg/ thread witness
I posted in the final /ldg/
Present at the last /ldg/ threads
Gone but not forgotten, /ldg/
>>
>>108393445
kill yourself obsessed faggot loser
>>
>>108393475
I think it was like 10 seconds a gen. it's not great btw so you might need to crank out a couple dozen gens before you get something decent but it's all we have
>>
>>108393438
HELP
>>
>>108393438
>>108393493
>>/g/aicg
>>/g/dalle
>>
>>108393493
we are LOCAL chads here.
>>
long dick general
>>
>>108393502
long dead general :3
>>
File: dsfsdfsdfsdfsdf.jpg (303 KB, 1608x1011)
>>108393500
>>108393501
I fucking hate you AI nerds
I asked bing to upscale it, what the FUCK is this?
>>
>>108393481
Neat, I want one with Hatsune Teto and "I was there for the last /ldg/ threads" phrase
>>
>>108393493
POST THE FUCKING IMAGE I'LL DO IT
>>
Newfag here, what checkpoint/model is best for undressing? I want to stick as close as possible to the orginal. Is inpaint better than I2I with a good prompt?
I'm fucking around with klein 9b (unstableRevolutionF2K) with inpaint/prompt, but the results seem far too random. I would appreciate any advice.
>>
>>108393584
>>
>>108393592
pervert.
>>108393601
why this image? mouth open or closed?
>>
>>108393631
>why this image?
I like Alysa, her mouth is open.
>>
File: 1743679619882543.png (870 KB, 832x1248)
>>
>>108393257
Based.
>>
>>108393643
sovl
>>
>>108393637
can't do it, sorry. didn't notice it's a celeb. need a lora for her. impossible to land that face
>>
File: image.jpg (938 KB, 3328x1535)
https://civitai.com/models/2239459/akashicpulse-eqvae
Is this model good, or is it snake oil or cobra oil? Apparently, it uses a new kind of VAE
>>
>>108393738
>illus
>EA grift
>>
File: 1687597503950061.jpg (117 KB, 841x1024)
>>108393631
For a moment, I had forgotten where I was; I think I’m going to ponder every decision I’ve made that led me here.
>>
>>108393738
>XL
>EA
>4ch VAE "rework"
>>
>>108393738
All SDXL VAEs are dogshit
>>
>>108393738
https://civitai.com/models/2239459/akashicpulse-eqvae?dialog=commentThread&commentId=1051235
>In other words, let my broke ass experiment with the things I interested in man, I released my models for free too despite my $200 monthly salary
On one hand, I admire an ESL poorfag at least trying. On the other, he is a retarded ELS poorfag.
>>
>>108393738
pretty sure eqvae is just a means to accelerate training, adapting an already trained model is retarded
>>
File: 1751776369955191.jpg (652 KB, 1536x1536)
which way?
>>
>>108393876
360 degrees and walk away
>>
>>108393884
r u mewgagay
>>
File: 1771089967870519.jpg (552 KB, 1536x1536)
tfw share threads with homosexgggs
sad :(
>>
>>108393890
no the turbo look no longer interests me
>>
>>108393781
deep inside you already know there is no way out of this hole.
>>
File: 1751497030571427.jpg (582 KB, 1328x1744)
>>108393912
whats ure poison
>>
>>108393923
nigga talk like a normal human bean and we cooperate. turbo just looks so flat and lifeless
>>
>>108393923
base obviously also what the other anon said
>>
File: 1666018572963908.png (8 KB, 296x170)
>>108393915
Not if it looks so good. What can I say, weak I am. The new GPU needs to work on something.
>>
File: 1770769888863187.jpg (3.42 MB, 4284x5712)
can someone fix this?
>>
>>108393485
works pretty well, miles better than LTX at least and it takes only 5 seconds per gen so I can crank through about 20 in the same time it takes a single LTX gen
thanks!
>>
>>108394079
what kind of resolution is that? this image is fucking with me, go away.
>>
File: 1765798010612280.mp4 (1.74 MB, 720x720)
>>108393876
>>
>>108394152
which kiss lora is that?
>>
>>108394163
https://civitaiarchive.com/models/1881060?modelVersionId=2186130
>>
>>108394245
thanks man
>deleted
wtf
>>
>>108394338
No problem, bro. Most of playtime_ai's stuff was removed from Civitai when he got banned for making a slider LoRa for making people skinny or something.
>>
>>108394357
those slider loras are verboten!
bannedtai went down the gutter so fast, oh man. please send me back to 2023 when shit was fresh
>>
>>108394357
>a slider LoRa for making people skinny or something.
no way
>>
Hey brothers!
Long time user here, I would like some spoonfeeding.
I have automatic1111 with a Pony XL model
Is there something BETTER in the past like, year and a half?
Or are we still in lategenland?
I'm mainly asking to know if I should bother changing my whole setup for 0.2 improvement or not
>>
File: 1761891389656443.jpg (545 KB, 1328x1640)
>muh zit look
>>
>>108394523
illustrious based models like noobai are the better choice nowadays, still XL so it's an easy drop-in replacement
anima is a new model on a different architecture currently in the making and the preview versions are promising so far
>>
>>108394575
You are completely right, and I apologize for essentially gaslighting you. I did process your audio.
>>
>>108394523
You could perhaps look at OP
>>
File: 65.png (1.22 MB, 896x1152)
>+ + works much better on anima2
well that's sure an improvement
>>
>>108394523
no, absolutely nothing new has come out since automatic1111 and pony. that's right, nothing
>>
I'm trying to make a LoRA for a photoreal checkpoint using concepts from cartoon images, and what I've found is that the 2.5D, sloppy semi-real images really poison the dataset. It's working pretty well so far with cartoon images and a few photoreal ones.
>>
>>108393445
don't leave us hanging. you've gotta post a gen of that
>>
is it worth using a wan 2.2 merge like smooth mix for nsfw photoreal instead of base wan 2.2?
>>
I have just installed Comfy and got some models. What nao? I think I'll make some Loras next.
>>
>>108395096
No, use base wan2.2 and just add loras specific to whatever you want to do.
>>
>>108394523
just try it out and see if it works for you
there's also this one:
https://civitai.com/models/2053259/wan-22-enhanced-nsfw-or-svi-or-camera-prompt-adherence-lightning-edition-i2v-and-t2v-fp8-gguf
you can download the svi version and use it for single clips just fine
>>
I've fucked around with video models, but i don't know much about image models...
I remember there being an image model where you could choose two or more starting images and it would take all the people and place them together in one image?
Anyone knows which one that was, and is it still current, or has it become outdated in favor of a newer one?
Pls help me goon properly, /ldg/
>>
>>108395466
Flux Klein 9B and 4B
Qwen image edit.
>>
>>108395466
>Pls help me goon properly, /ldg/
difficult on a blue board but not impossible
>>
>>108395502
Thank you!
Which one performs better, in your opinion, between Flux 9B and Qwen edit?
>>
>>108395561
Qwen has cool shit like https://huggingface.co/fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA available but Klein is more vramlet friendly.
>>
>>108395561
>Which one performs better

Kind of an apples and oranges thing here. Qwen is probably better overall for "most" applications. But Klein is faster overall and good enough that the difference is negligible. I feel Qwen can do more out of the box though, but the speedup LoRAs hurt that capability somewhat.

If your goal is to goon, Qwen can do a little more, but both will fight you without LoRAs.
>>
Without Catjack /ldg/ is a dead general~
So irrelevant~
A rotten corpse~
>>
Gpt slop
>>
cozy breas
>>
where can I get LTX2 loras? been searching on civitai with filters but havent found any

am I retarded?
>>
File: file.png (6 KB, 603x100)
huh
>>
>>108395764
go to models, filter with ltx2
>>
File: 1753280645767498.jpg (145 KB, 1440x1920)
nano banana mogs
>>
>>108395740
What's wrong with letting the thread proceed at a normal pace and mostly having the discussion be centered around Free and Open Source Diffusion Models?
>>
File: badface3.jpg (799 KB, 1179x2556)
>>108395862
NBP always generates the same ugly 1girl face, same composition and same negative space, only jeets and clueless normies like that model lel
>>
File: file.png (47 KB, 1152x373)
>searching github for a proper implementation of spectrum
>some literal "proper" implementations appear
>the guy makes 3 custom nodes instead of one
beggars cant be choosers I guess
>>
My cute little feet stepping on /ldg/'s rotten corpse~
>>
you're a middle-aged man
>>
>>108393445
examples???
>>
>>108395888
are you sure this isn't just vibecoded shit?
and maybe it doesn't even work on anima lol
>>
>>108395888
>spectrum
wat do
>>
>>108396031
predicts every n step, speeding up the gen.
i use this repo for anima
https://github.com/ruwwww/ComfyUI-Spectrum-sdxl
>>
Qwen Edit sucks. It makes the image blurry and loses details. It's better to photoshop something, then inpaint.
>>
nuh uh
>>
>>108393213
Age judging is retarded, I'd date a crazy murderous wench who is 15 years older than me, I don't care, I'd probably submit to the most negative relationship conditions too, I'd probably date that alien in the top right of the OP as well. Fight me.
>>
File: 1769887718788588.jpg (319 KB, 1440x1920)
>>108395883
that looks like nano banana 2, or he used a slopped image as a reference
NBP does not produce slop if you prompt properly
>>
File: badface2.jpg (830 KB, 1179x2556)
>>108396269
you're too obtuse to see, but that NBP 1girl face is everywhere, see pic related
>>
>>108396269
god i hate bongs, go see a dentist if you're ashamed
>>
File: 1764563516150916.jpg (670 KB, 1440x1920)
>>108396297
i see it. its probably because theyre using json slop prompts and the model regresses to a generic face
>>
File: IMG_9653.jpg (387 KB, 1179x2556)
>>108396323
No, it's because Google isn't that dumb. That's why all of NBP's images look the same. Do you really think Google would just hand out an ultra-realistic image generator for people to scam and generate fake news/content with, without even being able to identify its images?

IG is full of "AI influencers" with the same 1girl NBP face, same centered composition, same everything lol. Once you've seen them, it's really easy to identify them.
Thats why I said only jeets and normies fools fall for them
>>
>phonefag arguing with nbfag
make it stop
>>
>>108396379
bogdanoffs...


