Discussion of Free and Open Source Diffusion Models

Previous: >>108384322

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>mfw Resource news
03/17/2026
>Self-E: Self-Evaluation Unlocks Any-Step Text-to-Image Generation
https://github.com/XinYu-Andy/SelfE
>Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning
https://github.com/UnicomAI/CoTj
>EditHF-1M: A Million-Scale Rich Human Preference Feedback for Image Editing
https://github.com/IntMeGroup/EditHF
>Representation Alignment for Just Image Transformers is not Easier than You Think
https://github.com/kaist-cvml/PixelREPA
>AdapterTune: Zero-Initialized Low-Rank Adapters for Frozen Vision Transformers
https://github.com/salimkhazem/adaptertune
>PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling
https://x-gengroup.github.io/HomePage_PaCo-RL
>LTX-2.3 NVFP4
https://huggingface.co/Lightricks/LTX-2.3-nvfp4
>Gamers react with overwhelming disgust to DLSS 5’s generative AI glow-ups
https://arstechnica.com/gaming/2026/03/gamers-react-with-overwhelming-disgust-to-dlss-5s-generative-ai-glow-ups
>Nvidia's Nemotron coalition brings eight AI labs together to build open frontier models
https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidias-nemoclaw-coalition-brings-eight-ai-labs-together-to-build-open-frontier-models
03/16/2026
>Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models
https://github.com/NVlabs/finite-difference-flow-optimization
>MemRoPE: Training-Free Infinite Video Generation via Evolving Memory Tokens
https://memrope.github.io
>MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization
https://chenyangzhu1.github.io/MoKus
>Less Data, Faster Convergence: Goal-Driven Data Optimization for Multimodal Instruction Tuning
https://github.com/rujiewu/GDO
>Rethinking VLMs for Image Forgery Detection and Localization
https://github.com/sha0fengGuo/IFDL-VLM
>mfw Research news
03/17/2026
>Early Failure Detection and Intervention in Video Diffusion Models
https://arxiv.org/abs/2603.14320
>Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework
https://arxiv.org/abs/2603.14936
>PHAC: Promptable Human Amodal Completion
https://arxiv.org/abs/2603.14741
>CamLit: Unified Video Diffusion with Explicit Camera and Lighting Control
https://arxiv.org/abs/2603.14241
>Fair Benchmarking of Emerging One-Step Generative Models Against Multistep Diffusion and Flow Models
https://arxiv.org/abs/2603.14186
>Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models
https://arxiv.org/abs/2603.14504
>IMS3: Breaking Distributional Aggregation in Diffusion-Based Dataset Distillation
https://arxiv.org/abs/2603.13960
>TMPDiff: Temporal Mixed-Precision for Diffusion Models
https://arxiv.org/abs/2603.14062
>LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion
https://zengqunzhao.github.io/LatSearch
>Diffusion Reinforcement Learning via Centered Reward Distillation
https://arxiv.org/abs/2603.14128
>Single Image Super-Resolution via Bivariate À Trous Wavelet Diffusion
https://arxiv.org/abs/2603.07234
>SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation
https://sk-adapter.github.io
>RAZOR: Ratio-Aware Layer Editing for Targeted Unlearning in Vision Transformers and Diffusion Models
https://arxiv.org/abs/2603.14819
>Workflow-Aware Structured Layer Decomposition for Illustration Production
https://arxiv.org/abs/2603.14925
>Texel Splatting: Perspective-Stable 3D Pixel Art
https://arxiv.org/abs/2603.14587
>GenState-AI: State-Aware Dataset for Text-to-Video Retrieval on AI-Generated Videos
https://arxiv.org/abs/2603.14426
>AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas
https://arxiv.org/abs/2603.14770
>>108393213
>Neighbors
>>>/vg/vpcai
>mfw MORE Research news
>MotionCFG: Boosting Motion Dynamics via Stochastic Concept Perturbation
https://arxiv.org/abs/2603.14073
>Spectrum Matching: a Unified Perspective for Superior Diffusability in Latent Diffusion
https://arxiv.org/abs/2603.14645
>Not All Directions Matter: Toward Structured and Task-Aware Low-Rank Adaptation
https://arxiv.org/abs/2603.14228
>Seeking Physics in Diffusion Noise
https://arxiv.org/abs/2603.14294
>CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models
https://arxiv.org/abs/2603.14957
>FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection
https://arxiv.org/abs/2603.14220
>Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders
https://arxiv.org/abs/2603.14536
>Balancing Saliency and Coverage: Semantic Prominence-Aware Budgeting for Visual Token Compression in VLMs
https://arxiv.org/abs/2603.14892
>M2IR: Proactive All-in-One Image Restoration via Mamba-style Modulation and Mixture-of-Experts
https://arxiv.org/abs/2603.14816
>ASAP: Attention-Shift-Aware Pruning for Efficient LVLM Inference
https://arxiv.org/abs/2603.14549
>Towards Generalizable Deepfake Detection via Real Distribution Bias Correction
https://arxiv.org/abs/2603.14005
>GameUIAgent: An LLM-Powered Framework for Automated Game UI Design with Structured Intermediate Representation
https://arxiv.org/abs/2603.14724
>Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
https://haozheliu-st.github.io/mos-homepage
>Secure and Robust Watermarking for AI-generated Images: A Comprehensive Survey
https://arxiv.org/abs/2510.02384
>>108393263
>>108393269
Damn man, that's a lot. Any particularly interesting paper there? And thanks for all the work, by the way.
>
>>108393213
Welcome to the final days and last threads of /ldg/, relax, and make yourself at home! ^^
>>108393281
tuesday dumps are the biggest. I've never known why
>Any particularly interesting paper there?
depends on what you're interested in. I personally found this interesting:
>SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation
>>108393351
u wish
What's the best local way to add porn audio to porn video gens? I don't want to reroll LTX-2 50 times until I get something barely usable...
can someone link an online upscaler? I just need it for one pic
>>108393405
mmaudio nsfw. it needs schizo negatives though. pic related is what I came up with for now
>>108393445
>fizzle
>>108393351
Do you sell any souvenirs or pins as a memento?
>>108393445
a man of taste i see.
>>108393453
hurr durr herrr burr durrr berr durr durr ddurrrr
kill yourself
>>108393445
how light/heavy is it to use?
>>108393454
Yes!^^
I have many, which memento do you prefer?
I was there for the last /ldg/ threads
Survived the last /ldg/ threads
Last /ldg/ thread witness
I posted in the final /ldg/
Present at the last /ldg/ threads
Gone but not forgotten, /ldg/
>>108393445
kill yourself obsessed faggot loser
>>108393475
I think it was like 10 seconds a gen. it's not great btw so you might need to crank out a couple dozen gens before you get something decent but it's all we have
>>108393438
HELP
>>108393438
>>108393493
>>/g/aicg
>>/g/dalle
>>108393493
we are LOCAL chads here.
long dick general
>>108393502
long dead general :3
>>108393500
>>108393501
I fucking hate you AI nerds
I asked bing to upscale it, what the FUCK is this?
>>108393481
Neat, I want one with Hatsune Teto and "I was there for the last /ldg/ threads" phrase
>>108393493
POST THE FUCKING IMAGE I'LL DO IT
Newfag here, what checkpoint/model is best for undressing? I want to stick as close as possible to the original. Is inpainting better than I2I with a good prompt? I'm fucking around with klein 9b (unstableRevolutionF2K) with inpaint/prompt, but the results seem far too random. I would appreciate any advice.
>>108393584
>>108393592
pervert.
>>108393601
why this image? mouth open or closed?
>>108393631
>why this image?
I like Alysa, her mouth is open.
>>108393257
Based.
>>108393643
sovl
>>108393637
can't do it, sorry. didn't notice it's a celeb. need a lora for her. impossible to land that face
https://civitai.com/models/2239459/akashicpulse-eqvae
Is this model good, or is it snake oil or cobra oil? Apparently, it uses a new kind of VAE
>>108393738
>illus
>EA grift
>>108393631
For a moment, I had forgotten where I was; I think I'm going to ponder every decision I've made that led me here.
>>108393738
>XL
>EA
>4ch VAE "rework"
>>108393738
All SDXL VAEs are dogshit
>>108393738
https://civitai.com/models/2239459/akashicpulse-eqvae?dialog=commentThread&commentId=1051235
>In other words, let my broke ass experiment with the things I interested in man, I released my models for free too despite my $200 monthly salary
On one hand, I admire an ESL poorfag at least trying. On the other, he is a retarded ESL poorfag.
>>108393738
pretty sure eqvae is just a means to accelerate training, adapting an already trained model is retarded
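for reference, a rough sketch of what the equivariance regularizer looks like as I understand the EQ-VAE idea (my reading, not anything from this model card): it asks the decoder to commute with simple spatial transforms applied in latent space, which shapes the latent during training

```latex
% Sketch: for encoder $E$, decoder $D$, and a random spatial transform
% $\tau$ (e.g. rescaling or rotation), the regularizer is roughly
\mathcal{L}_{\mathrm{eq}} = \big\| D\big(\tau(E(x))\big) - \tau(x) \big\|_2^2
% added on top of the usual reconstruction/KL objective.
```

since the term acts on how the latent space is organized during training, bolting it onto a model that already finished training buys little, which is the anon's point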
which way?
>>108393876
360 degrees and walk away
>>108393884
r u mewga
gay
tfw share threads with homosex
ggs
sad :(
>>108393890
no the turbo look no longer interests me
>>108393781
deep inside you already know there is no way out of this hole.
>>108393912
whats ure poison
>>108393923
nigga talk like a normal human bean and we cooperate. turbo just looks so flat and lifeless
>>108393923
base obviously
also what the other anon said
>>108393915
Not if it looks so good. What can I say, weak I am. The new GPU needs to work on something.
can someone fix this?
>>108393485
works pretty well, miles better than LTX at least and it takes only 5 seconds per gen so I can crank through about 20 in the same time it takes a single LTX gen
thanks!
>>108394079
what kind of resolution is that? this image is fucking with me, go away.
>>108393876
>>108394152
which kiss lora is that?
>>108394163
https://civitaiarchive.com/models/1881060?modelVersionId=2186130
>>108394245
thanks man
>deleted
wtf
>>108394357
No problem, bro. Most of playtime_ai's stuff was removed from Civitai when he got banned for making a slider LoRA for making people skinny or something.
>>108394357
those slider loras are verboten!
bannedtai went down the gutter so fast, oh man. please send me back to 2023 when shit was fresh
>>108394357
>a slider LoRa for making people skinny or something.
no way
Hey brothers!
Long time user here, I would like some spoonfeeding.
I have an automatic1111 setup with a Pony XL model.
Is there something BETTER from the past, like, year and a half?
Or are we still in lategenland?
I'm mainly asking to know if I should bother changing my whole setup for a 0.2 improvement or not
>muh zit look
>>108394523
illustrious based models like noobai are the better choice nowadays, still XL so it's an easy drop-in replacement
anima is a new model on a different architecture currently in the making and the preview versions are promising so far
>>108394575
You are completely right, and I apologize for essentially gaslighting you. I did process your audio.
>>108394523
You could perhaps look at OP
>+ + works much better on anima2
well that's sure an improvement
>>108394523
no, absolutely nothing new has come out since automatic1111 and pony. that's right, nothing
im trying to make a lora for a photoreal checkpoint using concepts from cartoon images, and what i've found is the 2.5d sloppy semi-real images really poison the dataset. it's working pretty well so far with cartoon images and a few photoreal.
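the curation step anon describes can be sketched as a simple tag filter (all tags and filenames here are hypothetical, adjust to your own captioning scheme): drop anything tagged as the 2.5d/semi-real middle ground, keep clean cartoon plus a few photoreal anchors

```python
# Hypothetical tags that mark the "2.5d" middle ground that poisons
# a cartoon-concept dataset for a photoreal checkpoint.
POISON_TAGS = {"2.5d", "semi-realistic", "2.5d_style"}

def keep_image(tags):
    """Return True if an image's tag list is safe for the training set."""
    tags = {t.strip().lower() for t in tags}
    return not (tags & POISON_TAGS)

# Toy dataset: (filename, tag list) pairs, names invented for the sketch.
dataset = [
    ("001.png", ["cartoon", "1girl"]),
    ("002.png", ["2.5D", "1girl"]),          # poisoned: dropped
    ("003.png", ["photorealistic", "1girl"]),
    ("004.png", ["semi-realistic", "solo"]), # poisoned: dropped
]

kept = [name for name, tags in dataset if keep_image(tags)]
print(kept)  # ['001.png', '003.png']
```

in practice you'd read the tags from the sidecar .txt caption files your trainer uses and move rejects out of the image folder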
>>108393445
don't leave us hanging. you've gotta post a gen of that
is it worth using a wan 2.2 merge like smooth mix for nsfw photoreal instead of base wan 2.2?
I have just installed Comfy and got some models. What nao? I think I'll make some Loras next.
>>108395096
No, use base wan2.2 and just add loras specific to whatever you want to do.
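mechanically, stacking loras on a base model is just summing low-rank deltas onto the frozen weights; a toy pure-Python sketch (invented 2x2 shapes, real checkpoint weights are just much bigger matrices):

```python
# Each LoRA contributes a low-rank delta alpha * (B @ A) on top of the
# frozen base weight W; stacking several is just summing the deltas.

def matmul(B, A):
    """Plain-Python matrix product of B (m x r) and A (r x n)."""
    return [[sum(B[i][k] * A[k][j] for k in range(len(A)))
             for j in range(len(A[0]))] for i in range(len(B))]

def apply_loras(W, loras):
    """loras: list of (alpha, B, A) tuples; returns W + sum(alpha * B@A)."""
    out = [row[:] for row in W]
    for alpha, B, A in loras:
        delta = matmul(B, A)
        for i in range(len(out)):
            for j in range(len(out[0])):
                out[i][j] += alpha * delta[i][j]
    return out

W = [[1.0, 0.0], [0.0, 1.0]]          # frozen base weight
B = [[1.0], [0.0]]                     # rank-1 factors
A = [[0.0, 2.0]]
print(apply_loras(W, [(0.5, B, A)]))  # [[1.0, 1.0], [0.0, 1.0]]
```

which is also why "base + task loras" keeps working when a merged checkpoint drifts: the base weights stay untouched and each delta can be dialed with its own alpha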
>>108394523
just try it out and see if it works for you
there's also this one:
https://civitai.com/models/2053259/wan-22-enhanced-nsfw-or-svi-or-camera-prompt-adherence-lightning-edition-i2v-and-t2v-fp8-gguf
you can download the svi version and use it for single clips just fine
I've fucked around with video models, but i don't know much about image models... I remember there being an image model where you could choose two or more starting images and it would take all the people and place them together in one image? Anyone know which one that was, and is it still current, or has it become outdated in favor of a newer one?
Pls help me goon properly, /ldg/
>>108395466
Flux Klein 9 and 4B
Qwen image edit.
>>108395466
>Pls help me goon properly, /ldg/
difficult on a blue board but not impossible
>>108395502
Thank you! Which one performs better, in your opinion, between Flux 9B and Qwen edit?
>>108395561
Qwen has cool shit like https://huggingface.co/fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA available but Klein is more vramlet friendly.
>>108395561
>Which one performs better
Kind of an apples and oranges thing here. Qwen is probably better overall for "most" applications. But klein is faster overall and good enough that the difference is negligible. I feel qwen can do more out of the box though, but the speedup LoRAs hurt that capability somewhat. If your goal is to goon, Qwen can do a little more, but both will fight you without LoRAs.