/g/ - /ldg/ - Local Diffusion General - Technology


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Anonymous
/ldg/ - Local Diffusion Genera(...) 10/31/24(Thu)01:09:52 No.103035679

File: the longest dick general.jpg (2.32 MB, 3264x1472)

/ldg/ - Local Diffusion General Anonymous 10/31/24(Thu)01:09:52 No.103035679

Discussion of free and open source text-to-image models

Previously baked bread : >>103024144

Tasteless Retards Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5L/M
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large
https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai

Anonymous
10/31/24(Thu)01:15:16 No.103035714

Anonymous 10/31/24(Thu)01:15:16 No.103035714

>>103027007
That looks legit, how did you do that?

Anonymous
10/31/24(Thu)01:21:47 No.103035758

Anonymous 10/31/24(Thu)01:21:47 No.103035758

File: 2024-10-30_00012_.png (1.48 MB, 720x1280)

1.48 MB PNG

>>103035670
>something important with pixelwave 3, make sure to use dpmpp_2m for the sampler and sgm_uniform for the scheduler, do at least 25 steps
Interesting. Why is that? Here is a prompt with PixelWave, some LyingSigma too. And DynamicThresholdingFull.

just euler, beta, otherwise.

Anonymous
10/31/24(Thu)01:24:09 No.103035776

Anonymous 10/31/24(Thu)01:24:09 No.103035776

File: 2024-10-31_00002_.png (1.04 MB, 720x1280)

1.04 MB PNG

Adding some earlier the lyingsigma un-detail.

Anonymous
10/31/24(Thu)01:24:32 No.103035779

Anonymous 10/31/24(Thu)01:24:32 No.103035779

File: 002491.jpg (2.74 MB, 1536x2560)

2.74 MB JPG

Anonymous
10/31/24(Thu)01:25:19 No.103035783

Anonymous 10/31/24(Thu)01:25:19 No.103035783

>>103029174
Looks neat!

Anonymous
10/31/24(Thu)01:26:24 No.103035791

Anonymous 10/31/24(Thu)01:26:24 No.103035791

>>103029288
This reminds me of Facing Worlds.

Anonymous
10/31/24(Thu)01:27:24 No.103035798

Anonymous 10/31/24(Thu)01:27:24 No.103035798

>>103035784
nonono bro I'm doing dedistilled cfg=1 lmao

Two models at once, pixelwave highly realistic. Let me try it.

Anonymous
10/31/24(Thu)01:28:25 No.103035803

Anonymous 10/31/24(Thu)01:28:25 No.103035803

>>103035714
c-can I t-take y-your p-picture? (the chines are very demure)

Anonymous
10/31/24(Thu)01:28:34 No.103035806

Anonymous 10/31/24(Thu)01:28:34 No.103035806

>>103035758
>that image
that's the issue with PixelWave, the greats details of Flux are gone, now it looks like a SDXL image with its shitty VAE, maybe because he insisted on training the model with only 1k resolution pictures

Anonymous
10/31/24(Thu)01:29:35 No.103035811

Anonymous 10/31/24(Thu)01:29:35 No.103035811

>>103035798
yeah my b I was about to respond to another post not you kek

Anonymous
10/31/24(Thu)01:30:36 No.103035815

Anonymous 10/31/24(Thu)01:30:36 No.103035815

>>103035758
Those are the parameters the creator specified and i find it gives the best results by far

Anonymous
10/31/24(Thu)01:34:33 No.103035835

Anonymous 10/31/24(Thu)01:34:33 No.103035835

File: 2024-10-31_00003_.png (820 KB, 720x1280)

820 KB PNG

>>103035806
It seems like more adherence = worse details, it's a tradeoff.

>>103035776
The same exact settings but with PixelWave.

Anonymous
10/31/24(Thu)01:48:47 No.103035935

Anonymous 10/31/24(Thu)01:48:47 No.103035935

File: 2024-10-31_00005_.png (1.27 MB, 720x1280)

1.27 MB PNG

>>103035835
Same, but bypassed dynamic threshholding, and lyingsigmas, and I'm using the correct dpmpp_2m, sgm_uniform.

It's "better".

One problem with this whole topic is how it intersects the idea of image, culture, class, etc etc. photography, artwork, architecture, pragmatism.

ai is the ultimate social battleground.

Anonymous
10/31/24(Thu)01:52:24 No.103035958

Anonymous 10/31/24(Thu)01:52:24 No.103035958

>desert storm was before the DJI Phantom

Tech continues to advance at a rapid pace. We just have stopped recognizing that our lives are altered over and over in ways that are not predictable, but which are mostly negative.

Anonymous
10/31/24(Thu)01:55:14 No.103035976

Anonymous 10/31/24(Thu)01:55:14 No.103035976

File: 20241031T055303Z_00001_.jpg (1.86 MB, 2560x1440)

1.86 MB JPG

Anonymous
10/31/24(Thu)02:16:55 No.103036088

Anonymous 10/31/24(Thu)02:16:55 No.103036088

SOMEONE FIX THE OUT OF FOCUS BOKEH BLUR FLUX BULLSHIT!
Dedistilled, Pixelwave, whatever, it's still there I can't escape it!

Anonymous
10/31/24(Thu)02:19:10 No.103036096

Anonymous 10/31/24(Thu)02:19:10 No.103036096

>>103036088
the only way I found to remove the bokeh is to go for that is by using the Lying Sigma Sampler node
https://github.com/Jonseed/ComfyUI-Detail-Daemon

Anonymous
10/31/24(Thu)02:40:19 No.103036204

Anonymous 10/31/24(Thu)02:40:19 No.103036204

File: cm2wxe11q00ni336pchdclmgq.webm (742 KB, 1696x960)

742 KB WEBM

Anonymous
10/31/24(Thu)02:41:44 No.103036215

Anonymous 10/31/24(Thu)02:41:44 No.103036215

File: cm2wxp4oa00bp336p1m27uql9.webm (547 KB, 1696x960)

547 KB WEBM

Anonymous
10/31/24(Thu)02:49:37 No.103036246

Anonymous 10/31/24(Thu)02:49:37 No.103036246

File: 2024-10-31_00007_.png (1.45 MB, 720x1280)

1.45 MB PNG

Ariana Grande lora

Anonymous
10/31/24(Thu)02:50:50 No.103036249

Anonymous 10/31/24(Thu)02:50:50 No.103036249

>>103036096
I really want to know how that's happening anyway - how can it know which parts to blur?

Anonymous
10/31/24(Thu)02:52:33 No.103036257

Anonymous 10/31/24(Thu)02:52:33 No.103036257

>>103036249
>how can it know which parts to blur?
anon, models are deep learning neurons, they implicitly knows how the world works, so they probably finetuned it in a way that it should only make bokeh pictures

Anonymous
10/31/24(Thu)02:54:15 No.103036267

Anonymous 10/31/24(Thu)02:54:15 No.103036267

File: wtf.png (395 KB, 1566x1166)

395 KB PNG

wtf

Anonymous
10/31/24(Thu)02:57:37 No.103036278

Anonymous 10/31/24(Thu)02:57:37 No.103036278

File: 00076-1639706402.png (375 KB, 496x496)

375 KB PNG

from an unreleased GeoCities image lora

Anonymous
10/31/24(Thu)02:58:38 No.103036283

Anonymous 10/31/24(Thu)02:58:38 No.103036283

File: 00046-3614240975.png (255 KB, 344x496)

255 KB PNG

>>103036278

Anonymous
10/31/24(Thu)02:59:12 No.103036288

Anonymous 10/31/24(Thu)02:59:12 No.103036288

>unreleased
Awh, man.

Anonymous
10/31/24(Thu)02:59:39 No.103036294

Anonymous 10/31/24(Thu)02:59:39 No.103036294

File: 00034-4277168806.png (257 KB, 600x400)

257 KB PNG

>>103036283

Anonymous
10/31/24(Thu)03:00:42 No.103036298

Anonymous 10/31/24(Thu)03:00:42 No.103036298

File: 00067-1257821327.png (234 KB, 496x496)

234 KB PNG

>>103036288
I'll release it eventually, just gotta generate some good examples images and get a better feel for it. I could upload it to catbox right now, I think I'm gonna do that right now in fact.

Anonymous
10/31/24(Thu)03:02:54 No.103036310

Anonymous 10/31/24(Thu)03:02:54 No.103036310

File: esoteric_knowledge.jpg (92 KB, 900x669)

92 KB JPG

>>103036267
Seems like this image will never not be relevant.

Anonymous
10/31/24(Thu)03:03:12 No.103036312

Anonymous 10/31/24(Thu)03:03:12 No.103036312

File: 00017-3628111074.png (136 KB, 384x384)

136 KB PNG

>>103036298
https://files.catbox.moe/qqy5s0.safetensors
Trigger is "Geocities image." at start of prompt. I recommend using a weight of 1.5 since 1.0 still has that Flux DoF blur.

Anonymous
10/31/24(Thu)03:03:28 No.103036313

Anonymous 10/31/24(Thu)03:03:28 No.103036313

>>103036298
Flux? XL? 3.5?

Anonymous
10/31/24(Thu)03:04:21 No.103036321

Anonymous 10/31/24(Thu)03:04:21 No.103036321

>>103036313
Flux, trained locally on a 3060. It takes hours but thankfully I have the time.

Anonymous
10/31/24(Thu)03:05:16 No.103036324

Anonymous 10/31/24(Thu)03:05:16 No.103036324

>>103036312
TY anon, when I break out flux again I'll try it. The examples you're posting look so good.

Anonymous
10/31/24(Thu)03:06:48 No.103036332

Anonymous 10/31/24(Thu)03:06:48 No.103036332

File: 00005-3902850286.png (410 KB, 768x384)

410 KB PNG

>>103036312
Also I recommend generating at 512 resolution or lower, but text sometimes breaks down at around 300px. I tried song lyrics and it always spit out "Are a jaded?" instead of "Are you jaded?".
>>103036324
Do post your results, I'm proud of how this turned out.

Anonymous
10/31/24(Thu)03:07:43 No.103036340

Anonymous 10/31/24(Thu)03:07:43 No.103036340

>>103036310
true, the line is really thin kek

Anonymous
10/31/24(Thu)03:09:54 No.103036350

Anonymous 10/31/24(Thu)03:09:54 No.103036350

>>103036332
i almost dont believe this is AI

Anonymous
10/31/24(Thu)03:11:46 No.103036357

Anonymous 10/31/24(Thu)03:11:46 No.103036357

>>103036350
Flux is incredible at producing low-quality looking images if you train it on them. I attribute it to the VAE personally, but even in that image you can see the generic blur in the back. Again, using 1.5 weight is probably better but I didn't bother when I generated that one.

Anonymous
10/31/24(Thu)03:13:11 No.103036366

Anonymous 10/31/24(Thu)03:13:11 No.103036366

File: ComfyUI_00007_.png (1013 KB, 1024x1024)

1013 KB PNG

>>103036350
>i almost dont believe this is AI
that's the Flux effect, I had the same reaction for that picture until I was able to make it myself on flux dev

Anonymous
10/31/24(Thu)03:17:00 No.103036385

Anonymous 10/31/24(Thu)03:17:00 No.103036385

File: 00115-3436883531.png (2.03 MB, 1024x1024)

2.03 MB PNG

Here's a different experiment from a while back, using a weirdcore aesthetic lora without a prompt (only using the activation).
I generated a metric fuckton of these, and they look nothing like what the weirdcore aesthetic is supposed to be.

Anonymous
10/31/24(Thu)03:18:02 No.103036389

Anonymous 10/31/24(Thu)03:18:02 No.103036389

File: 00181-2114428040.png (1.88 MB, 1024x1024)

1.88 MB PNG

>>103036385
There's way too many to reasonably post, so here's two more

Anonymous
10/31/24(Thu)03:19:05 No.103036397

Anonymous 10/31/24(Thu)03:19:05 No.103036397

File: 00140-2816918863.png (1.08 MB, 1024x1024)

1.08 MB PNG

>>103036389
This one's closer, at least with the text.

Anonymous
10/31/24(Thu)03:26:03 No.103036427

Anonymous 10/31/24(Thu)03:26:03 No.103036427

File: 00144-3930504532.png (1.59 MB, 1024x1024)

1.59 MB PNG

>>103036389
I lied.

Anonymous
10/31/24(Thu)03:29:42 No.103036443

Anonymous 10/31/24(Thu)03:29:42 No.103036443

File: 00174-3930504562.png (2.27 MB, 1024x1024)

2.27 MB PNG

>>103036427
Alright, NOW this is the last one. I have to stop now or else I'll get carried away.

Anonymous
10/31/24(Thu)03:35:15 No.103036467

Anonymous 10/31/24(Thu)03:35:15 No.103036467

File: cm2wyg5j7002z336opmfmympv.webm (567 KB, 1696x960)

567 KB WEBM

Anonymous
10/31/24(Thu)03:36:25 No.103036472

Anonymous 10/31/24(Thu)03:36:25 No.103036472

File: cm2wygfu9003u336oe93jh4ps.webm (1.11 MB, 1696x960)

1.11 MB WEBM

Anonymous
10/31/24(Thu)03:36:26 No.103036473

Anonymous 10/31/24(Thu)03:36:26 No.103036473

File: 2024-10-31_00009_.png (1.21 MB, 720x1280)

1.21 MB PNG

>>103036246
dedistilled and lyingsigmas

Anonymous
10/31/24(Thu)03:37:46 No.103036479

Anonymous 10/31/24(Thu)03:37:46 No.103036479

>>103036443
>1girl

Anonymous
10/31/24(Thu)03:38:35 No.103036483

Anonymous 10/31/24(Thu)03:38:35 No.103036483

File: 00015-3930504503.png (1.36 MB, 1024x1024)

1.36 MB PNG

>>103036479

Anonymous
10/31/24(Thu)03:39:39 No.103036486

Anonymous 10/31/24(Thu)03:39:39 No.103036486

>>103036312
>>103036385
>>103036389
>>103036397
>>103036483
I'd take a whole thread of them. Incredible.

Anonymous
10/31/24(Thu)03:45:49 No.103036514

Anonymous 10/31/24(Thu)03:45:49 No.103036514

File: 00030-3141513107.png (1.52 MB, 1024x1024)

1.52 MB PNG

>>103036486
I dumped an absolute SHITLOAD of them in a dickscord server, I'll try just compiling them and making a catbox (or other file service if it's too big)
Glad you appreciate them! This is the last one I'll post in the thread, for realsies this time.

Anonymous
10/31/24(Thu)03:47:18 No.103036517

Anonymous 10/31/24(Thu)03:47:18 No.103036517

>>103036310
real

Anonymous
10/31/24(Thu)03:53:45 No.103036560

Anonymous 10/31/24(Thu)03:53:45 No.103036560

>>103036514
Sorry, doesn't look like it's happening. The zip is 1.4GB (granted i did absolutely no filtering) and I can't find a good file sharing site. There's one I specifically remember that was very similar to catbox but had an upload limit of 1gb but now I can't find it. If anyone knows what I'm talking about please post it here.

Anonymous
10/31/24(Thu)03:55:20 No.103036571

Anonymous 10/31/24(Thu)03:55:20 No.103036571

>>103036560
>upload limit of 1gb
https://litterbox.catbox.moe

Anonymous
10/31/24(Thu)03:56:39 No.103036578

Anonymous 10/31/24(Thu)03:56:39 No.103036578

>>103036571
I meant a base upload of 1GB, where it doesn't expire, and presumably their equivalent of litterbox would support larger sizes.

Anonymous
10/31/24(Thu)03:57:42 No.103036583

Anonymous 10/31/24(Thu)03:57:42 No.103036583

>>103036578
everything expires eventually, catbox is a major exception to this
maybe use pixel drain or mega

Anonymous
10/31/24(Thu)03:58:00 No.103036591

Anonymous 10/31/24(Thu)03:58:00 No.103036591

File: file.webm (1.3 MB, 1280x768)

1.3 MB WEBM

https://github.com/jy0205/Pyramid-Flow
Babe wake up, they released their promised base model trained from scratch
>We have switched the model structure from SD3 to a mini FLUX to fix human structure issues, please try our 1024p image checkpoint and 384p video checkpoint (up to 5s). The new miniflux model shows great improvement on human structure and motion stability. We will release 768p video checkpoint in a few days.

Anonymous
10/31/24(Thu)04:02:31 No.103036623

Anonymous 10/31/24(Thu)04:02:31 No.103036623

>>103036583
My upload speed is fucked unfortunately, so I'm currently sitting my ass down and saving everything I posted to discord, and I'll zip that instead.

Anonymous
10/31/24(Thu)04:04:21 No.103036642

Anonymous 10/31/24(Thu)04:04:21 No.103036642

>>103036591
>please try our 1024p image checkpoint
oh nice a new local image model

Anonymous
10/31/24(Thu)04:07:52 No.103036666

Anonymous 10/31/24(Thu)04:07:52 No.103036666

File: 002514.jpg (2.14 MB, 1536x2560)

2.14 MB JPG

Anonymous
10/31/24(Thu)04:09:10 No.103036676

Anonymous 10/31/24(Thu)04:09:10 No.103036676

>>103036623
It's still gonna take a bit. Will post when done.

Anonymous
10/31/24(Thu)04:09:39 No.103036680

Anonymous 10/31/24(Thu)04:09:39 No.103036680

>>103036591
>they released their promised base model trained from scratch
Not quite, they released the 384p video sure, but not the 768p yet, so I guess we'll wait for some more before testing it seriously again. I hope that one will be as good as Mochi, at least those chinks made image2video possible

Anonymous
10/31/24(Thu)04:16:44 No.103036718

Anonymous 10/31/24(Thu)04:16:44 No.103036718

>>103036676
Here it finally is: pixeldrain 7Za5iN4D

Anonymous
10/31/24(Thu)04:18:08 No.103036726

Anonymous 10/31/24(Thu)04:18:08 No.103036726

>>103036718
God fucking dammit, I left out a bunch of images. Oh well, nobody gives a shit, it can wait.

Anonymous
10/31/24(Thu)04:47:11 No.103036884

Anonymous 10/31/24(Thu)04:47:11 No.103036884

File: image-2024-10-30T223057.848.png (755 KB, 1024x1024)

755 KB PNG

Anonymous
10/31/24(Thu)04:48:16 No.103036889

Anonymous 10/31/24(Thu)04:48:16 No.103036889

File: 1071461689.png (1.33 MB, 768x1344)

1.33 MB PNG

Anonymous
10/31/24(Thu)04:49:01 No.103036891

Anonymous 10/31/24(Thu)04:49:01 No.103036891

>>103036591
The last time they said "a few days" that meant 3 weeks, I'm not going to install and test it for 384p.
While i understand this is primarily to get some sppedups and vram reductions from devs to use on the 768p model I don't apprecitate being told "a few days" when it is more likely to be several weeks.
I am grateful to all devs for the toys they provide.

Anonymous
10/31/24(Thu)04:56:18 No.103036934

Anonymous 10/31/24(Thu)04:56:18 No.103036934

File: 2024-10-31_00010_.png (1.31 MB, 720x1280)

1.31 MB PNG

>>103036473

Anonymous
10/31/24(Thu)04:57:34 No.103036943

Anonymous 10/31/24(Thu)04:57:34 No.103036943

>>103036726
FINALLY it's done: https://files.catbox.moe/fmsoan.7z

Anonymous
10/31/24(Thu)05:15:35 No.103037061

Anonymous 10/31/24(Thu)05:15:35 No.103037061

>>103036891
I didn't believe their "few days" bullshit, you can't pretrain a model in a few days lawl

Anonymous
10/31/24(Thu)05:30:37 No.103037164

Anonymous 10/31/24(Thu)05:30:37 No.103037164

File: 2024-10-31_00012_.png (1.44 MB, 720x1280)

1.44 MB PNG

>>103036934
"desert" storm. Time to find out the TRUTH.

Anonymous
10/31/24(Thu)05:31:04 No.103037168

Anonymous 10/31/24(Thu)05:31:04 No.103037168

File: 00088-3930504576.png (1.45 MB, 1024x1024)

1.45 MB PNG

Anonymous
10/31/24(Thu)05:36:16 No.103037209

Anonymous 10/31/24(Thu)05:36:16 No.103037209

File: 2024-10-31_00013_.png (1.51 MB, 720x1280)

1.51 MB PNG

>>103037164
What do they know that they aren't telling us?

Anonymous
10/31/24(Thu)05:37:30 No.103037217

Anonymous 10/31/24(Thu)05:37:30 No.103037217

>>103037061
the thing is, once they've got the parameters set for the dataset at lower resolution, which probably took iteration and tests, they can just plug that into the larger model if it's the same dataset just at a higher resolution and get a comparable result in terms of what the model learns, but with finer fidelity.
But admittedly, "a few days" is kinda sus.

Anonymous
10/31/24(Thu)05:41:38 No.103037239

Anonymous 10/31/24(Thu)05:41:38 No.103037239

File: 2024-10-31_00014_.png (965 KB, 720x1280)

965 KB PNG

>>103037209

Anonymous
10/31/24(Thu)05:53:19 No.103037297

Anonymous 10/31/24(Thu)05:53:19 No.103037297

File: 2024-10-31_00015_.png (446 KB, 720x1280)

446 KB PNG

Data.

Anonymous
10/31/24(Thu)06:04:55 No.103037379

Anonymous 10/31/24(Thu)06:04:55 No.103037379

File: grid-0327.jpg (336 KB, 1792x2304)

336 KB JPG

Anonymous
10/31/24(Thu)06:07:24 No.103037393

Anonymous 10/31/24(Thu)06:07:24 No.103037393

there are literally 20+ flux loras out there that are meant to do female nudity. Which is a reliable one worth using?

Anonymous
10/31/24(Thu)06:14:57 No.103037440

Anonymous 10/31/24(Thu)06:14:57 No.103037440

File: altrevy.jpg (194 KB, 1024x1024)

194 KB JPG

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.