/g/ - /ldg/ - Local Diffusion General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/ldg/ - Local Diffusion Genera(...) 06/04/26(Thu)03:11:45 No.108976783

File: highlights_g_108972752_17(...).jpg (1.33 MB, 4554x2500)

/ldg/ - Local Diffusion General Anonymous 06/04/26(Thu)03:11:45 No.108976783

Discussion and Development of Local Image, Video, and Music Models

Previous: >>108972752

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

Anonymous
06/04/26(Thu)03:16:13 No.108976797

Anonymous 06/04/26(Thu)03:16:13 No.108976797

1st for Ani.

Anonymous
06/04/26(Thu)03:18:56 No.108976809

Anonymous 06/04/26(Thu)03:18:56 No.108976809

so like. where are the 1girls yo. like I came for the hot pointy chinned 1girls

Anonymous
06/04/26(Thu)03:19:19 No.108976810

Anonymous 06/04/26(Thu)03:19:19 No.108976810

sfw vageen is ascended

Anonymous
06/04/26(Thu)03:32:12 No.108976841

Anonymous 06/04/26(Thu)03:32:12 No.108976841

File: 85358566.webm (2.37 MB, 256x448)

2.37 MB WEBM

>bullet impacts sounding like drums
made me leap out of my chair and scream "KINO!!!!!!!!!!!!!!!!!" 3 times and then do a partial backflip
https://files.catbox.moe/qjzu25.mp4

Anonymous
06/04/26(Thu)03:33:54 No.108976848

Anonymous 06/04/26(Thu)03:33:54 No.108976848

File: 3089031.jpg (38 KB, 540x540)

38 KB JPG

In the standalone anima trainer, does flash attention made things faster in exchange for quality hit, or just faster?

Anonymous
06/04/26(Thu)03:34:24 No.108976849

Anonymous 06/04/26(Thu)03:34:24 No.108976849

File: 1777497626575143.png (1.29 MB, 2259x1315)

1.29 MB PNG

>Scorsese uses FLUX
zit faggots keep seething

Anonymous
06/04/26(Thu)03:34:37 No.108976850

Anonymous 06/04/26(Thu)03:34:37 No.108976850

>>108976848
ask AI

Anonymous
06/04/26(Thu)03:44:38 No.108976878

Anonymous 06/04/26(Thu)03:44:38 No.108976878

File: wan22 Scail vs Bernini.mp4 (3.98 MB, 1920x1080)

3.98 MB MP4

Tested Wan22 Bernini. Here are my initial result on the single test case.

R2V: Subject to Video generation.
Best: 0.8 Megapixel 81 frames at 30 FPS, OOM on higher res/frames length on a 5090/128gb RAM. Heavily dependent on subject resolution, so best results may varies. Most accurate at 30FPS, lowering FPS seems to degrade reference accuracy. Accuracy also degrades after 81 frames, just like Wan22 base I guess. Bernini can be extended if you are determined to stich 81 frames video together. Seems to lose out against SCAIL on ease of use, VRAM requirements, but SCAIL can only do rigid open pose reference. Bernini can supposedly can do more things, need to test further.

>vid related, Bernini 81 + 81

https://github.com/Comfy-Org/ComfyUI/pull/14216

https://bernini-ai.github.io/

Anonymous
06/04/26(Thu)03:46:14 No.108976887

Anonymous 06/04/26(Thu)03:46:14 No.108976887

>>108976878
everyday I hate myself for being a VRAMlet

Anonymous
06/04/26(Thu)03:47:12 No.108976889

Anonymous 06/04/26(Thu)03:47:12 No.108976889

I don't get the appeal of video generation

Anonymous
06/04/26(Thu)03:48:20 No.108976896

Anonymous 06/04/26(Thu)03:48:20 No.108976896

>>108976878
make moot do cute things

Anonymous
06/04/26(Thu)03:48:43 No.108976898

Anonymous 06/04/26(Thu)03:48:43 No.108976898

>>108976889
its ok, its not your fault you were born brown

Anonymous
06/04/26(Thu)03:51:46 No.108976914

Anonymous 06/04/26(Thu)03:51:46 No.108976914

>>108976889
making porn of unsuspecting women

Anonymous
06/04/26(Thu)03:55:59 No.108976928

Anonymous 06/04/26(Thu)03:55:59 No.108976928

File: technologyidea.mp4 (660 KB, 720x480)

660 KB MP4

>>108976887

Anonymous
06/04/26(Thu)04:00:08 No.108976945

Anonymous 06/04/26(Thu)04:00:08 No.108976945

Is it possible to train a lora on small (<64x64) sprites?

Anonymous
06/04/26(Thu)04:03:19 No.108976957

Anonymous 06/04/26(Thu)04:03:19 No.108976957

File: bernini_s.png (1.67 MB, 871x1080)

1.67 MB PNG

>>108976878

Anonymous
06/04/26(Thu)04:15:54 No.108976997

Anonymous 06/04/26(Thu)04:15:54 No.108976997

>>108976783
I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT I HATE FACE DRIFT

Anonymous
06/04/26(Thu)04:15:55 No.108976998

Anonymous 06/04/26(Thu)04:15:55 No.108976998

File: ComfyUI_00724_.png (894 KB, 896x1152)

894 KB PNG

Anonymous
06/04/26(Thu)04:19:41 No.108977012

Anonymous 06/04/26(Thu)04:19:41 No.108977012

>>108976998
ma'am, i need that for studying

Anonymous
06/04/26(Thu)04:21:58 No.108977016

Anonymous 06/04/26(Thu)04:21:58 No.108977016

>>108976889
for realistic nsfw, nothing else has the prompt understanding and adherence

Anonymous
06/04/26(Thu)04:22:02 No.108977017

Anonymous 06/04/26(Thu)04:22:02 No.108977017

>>108976889
Its only good for porn. For other stuff its cringe

Anonymous
06/04/26(Thu)04:22:06 No.108977019

Anonymous 06/04/26(Thu)04:22:06 No.108977019

File: ZIT.png (2.71 MB, 1536x1536)

2.71 MB PNG

Surely Krea 2 open release won't be like their previous open release and will be better than ZIT, a model from half a year ago, right?

Anonymous
06/04/26(Thu)04:24:47 No.108977031

Anonymous 06/04/26(Thu)04:24:47 No.108977031

>>108975467
not that hard, though 100k seems a bit thin for a full finetune. get a regularisation data set at the very least
LR at batch size 1 around 6e-6 to 8e-6, scale up from there correspondingly
captioning is the painful part, what i did is run through WD14 or animetimm first, then filter out false positives that pop up when one uses these models on photos (asian, realistic, etc), then gemma4 31b with grounding from these tags and a good system prompt

i recommend to not tune existing photography tags like photo (medium) or cosplay girl as your main triggers, but do something fresh like an artist tag. trying to build atop the existing ones only resulted in slop semi realism for me

Anonymous
06/04/26(Thu)04:26:08 No.108977037

Anonymous 06/04/26(Thu)04:26:08 No.108977037

File: ComfyUI_00725_.png (965 KB, 896x1152)

965 KB PNG

Anonymous
06/04/26(Thu)04:27:22 No.108977043

Anonymous 06/04/26(Thu)04:27:22 No.108977043

File: 324654.webm (3.42 MB, 256x448)

3.42 MB WEBM

pretty good seed for the plane

Anonymous
06/04/26(Thu)04:37:18 No.108977081

Anonymous 06/04/26(Thu)04:37:18 No.108977081

File: 584565.webm (2.86 MB, 256x448)

2.86 MB WEBM

american ship cloaking technology captured on film
https://files.catbox.moe/nw00el.mp4

Anonymous
06/04/26(Thu)04:52:24 No.108977124

Anonymous 06/04/26(Thu)04:52:24 No.108977124

File: t.mp4 (1.33 MB, 480x720)

1.33 MB MP4

>>108976889
1girl, plot

Anonymous
06/04/26(Thu)04:57:22 No.108977143

Anonymous 06/04/26(Thu)04:57:22 No.108977143

>>108977124
Hot glue gun to ass? I'd rather take a tattoo

Anonymous
06/04/26(Thu)04:58:22 No.108977149

Anonymous 06/04/26(Thu)04:58:22 No.108977149

File: il_794xN.5782544826_g3hx-(...).jpg (102 KB, 794x794)

102 KB JPG

>>108976783
Baker, next OP, please:
"Discussion and Development of Local Image, Video, Music and Anime Models"

Anonymous
06/04/26(Thu)05:05:09 No.108977173

Anonymous 06/04/26(Thu)05:05:09 No.108977173

File: 1.jpg (418 KB, 1552x832)

418 KB JPG

Anonymous
06/04/26(Thu)05:07:25 No.108977182

Anonymous 06/04/26(Thu)05:07:25 No.108977182

>>108977173
MEW my beloved

Anonymous
06/04/26(Thu)05:09:56 No.108977192

Anonymous 06/04/26(Thu)05:09:56 No.108977192

>>108977019
You have Anima, why care?

Anonymous
06/04/26(Thu)05:10:41 No.108977193

Anonymous 06/04/26(Thu)05:10:41 No.108977193

File: file.png (3.7 MB, 1328x1776)

3.7 MB PNG

>>108977173
stop posting my gf

Anonymous
06/04/26(Thu)05:17:13 No.108977215

Anonymous 06/04/26(Thu)05:17:13 No.108977215

File: 3.jpg (309 KB, 1136x1136)

309 KB JPG

>>108977182
>>108977193

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.

Janitor applications are now open. Apply here!