/g/ - /ldg/ - Local Diffusion General - Technology

Thread archived.
You cannot reply anymore.

File: tmp.jpg (1.18 MB, 3264x3264)

1.18 MB JPG

/ldg/ - Local Diffusion General Anonymous 08/07/24(Wed)05:28:43 No.101764165 Archived

Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101761268

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg

Anonymous 08/07/24(Wed)05:33:24 No.101764194

>first time in several collages that I'm not in it
I'm free

Anonymous 08/07/24(Wed)05:34:11 No.101764200

>flux vae is fucked with that retarded grid pattern and there's nothing we can do about it
AI detectors easily pick it up btw

Anonymous 08/07/24(Wed)05:36:04 No.101764212

File: FD_00308_.png (2.48 MB, 1024x1536)

2.48 MB PNG

>>101764200
Why the fuck should you care? Are you trying to trick people into thinking it's your art?

Anonymous 08/07/24(Wed)05:37:07 No.101764221

>>101764200
no grid pattern on my gens

Anonymous 08/07/24(Wed)05:38:52 No.101764229

>>101764221
It is there, it's just not visible, anon.

Anonymous 08/07/24(Wed)05:39:14 No.101764233

File: Capture.jpg (825 KB, 3839x1769)

825 KB JPG

All right, so I made an XY plot of samplers against schedulers for this prompt:
>Hatsune Miku with dreadlocks and a black skin showing her fists

Here's a few notes:
1) I used CFG 3 + DynamicThresholding, or else Flux simply wouldn't modify Miku's features
https://reddit.com/r/StableDiffusion/comments/1ekgiw6/heres_a_hack_to_make_flux_better_at_prompt/

2) Only these samplers work with Flux in ComfyUI:
>euler; heun; heunpp2; dpm_2; lms; dpm_adaptive; dpmpp_2m; lcm; ipndm; ipndm_v; deis; ddim; uni_pc; uni_pc_bh2
The rest just give insanely glitched output, so it wasn't worth adding them

3) The schedulers "normal", "sgm_uniform" and "simple" give almost the same output, so I only kept "simple"

https://files.catbox.moe/af40tk.jpg

There are some interesting observations to make from these sampler + scheduler combinations; they're not as identical as I thought.
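
For anyone who wants to rebuild a grid like this, here is a minimal sketch of how the cells can be enumerated. The sampler names are the ones listed in the post; the scheduler list is an assumption ("simple" from the post, "beta" from later replies), and the helper name xy_grid is made up for illustration. It only produces the combinations and does not call ComfyUI.

```python
from itertools import product

# Samplers the post reports as working with Flux in ComfyUI.
SAMPLERS = [
    "euler", "heun", "heunpp2", "dpm_2", "lms", "dpm_adaptive", "dpmpp_2m",
    "lcm", "ipndm", "ipndm_v", "deis", "ddim", "uni_pc", "uni_pc_bh2",
]

# Assumption: "simple" stands in for normal/sgm_uniform/simple (reported as
# near-identical), and "beta" is the scheduler mentioned in later replies.
SCHEDULERS = ["simple", "beta"]


def xy_grid(samplers, schedulers):
    """Return one (sampler, scheduler) cell per combination, row by row."""
    return list(product(samplers, schedulers))


cells = xy_grid(SAMPLERS, SCHEDULERS)
print(len(cells), "cells")
for sampler, scheduler in cells[:4]:
    print(f"{sampler} + {scheduler}")
```

Keeping the seed, prompt, guidance and CFG fixed across every cell is what makes the differences attributable to the sampler and scheduler alone.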

Anonymous 08/07/24(Wed)05:40:59 No.101764244

>>101764233
See, that's why I told you to use a much more specific prompt, now you don't know if the variation is due to the sampler or just the prompt being that vague...

Anonymous 08/07/24(Wed)05:41:15 No.101764247

>>101764229
nope

Anonymous 08/07/24(Wed)05:42:02 No.101764258

>>101764247
Post your gen (raw png) and I'll show you

Anonymous 08/07/24(Wed)05:43:12 No.101764267

official pixart bigma and hunyuan finetune waiting room

Anonymous 08/07/24(Wed)05:43:52 No.101764273

>>101764244
Nope, the prompt is fine. It already works when you're more aggressive with the guidance (4.0) + CFG (3.0). For this one I went with guidance 3.5 + CFG 3.0 so that I could find a sampler + scheduler combination that works with something less aggressive. desu none of them had both the black skin and the dreadlocks, so I consider it a failure

Anonymous 08/07/24(Wed)05:44:46 No.101764283

>>101764258
tell me the method you use and I'll see for myself

Anonymous 08/07/24(Wed)05:46:00 No.101764291

File: file.png (2.55 MB, 1024x1024)

2.55 MB PNG

>>101764233
very interesting! thanks, anon.

Anonymous 08/07/24(Wed)05:46:27 No.101764297

File: 1707990061358641.png (580 KB, 1548x426)

580 KB PNG

>>101764283
I think https://arxiv.org/pdf/1912.11035 should easily spot Flux gens too

Anonymous 08/07/24(Wed)05:47:56 No.101764315

File: file.jpg (8 KB, 218x231)

8 KB JPG

>>101764200
>mfw the latest free shit has an invisible pattern

Anonymous 08/07/24(Wed)05:47:58 No.101764316

>>101764297
Yeah, looks like it works for SD at least, see e.g. https://blog.metaphysic.ai/combating-stable-diffusion-face-forgery-through-frequency-analysis/
https://arxiv.org/pdf/2210.14571v4
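
The argument in this exchange is that a pattern too faint to see can still stand out under frequency analysis. Here is a minimal sketch of that idea on synthetic data, not on actual Flux output: the 8-pixel period, the amplitudes and the 0.04 threshold are made-up assumptions, and a single image row is used instead of a full 2D image to keep it dependency-free.

```python
import cmath
import math


def dft_magnitudes(signal):
    """Magnitude of the discrete Fourier transform for bins 0..N/2."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(signal)))
        for k in range(n // 2 + 1)
    ]


N = 64       # pixels in the synthetic row
PERIOD = 8   # hypothetical grid spacing in pixels

# A smooth brightness gradient plus a 1% bump every PERIOD pixels.
row = [
    0.5 + 0.2 * math.sin(2 * math.pi * t / N)
    + (0.01 if t % PERIOD == 0 else 0.0)
    for t in range(N)
]

mags = dft_magnitudes(row)
# Skip bins 0 and 1, which hold the average brightness and the gradient.
peaks = [k for k in range(2, N // 2 + 1) if mags[k] > 0.04]
print(peaks)
```

The bump is only 1% of the brightness range, yet the surviving bins are exactly the multiples of N / PERIOD, which is the kind of regular signature a detector could key on.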

Anonymous 08/07/24(Wed)05:49:11 No.101764332

File: 1715273960137749.png (111 KB, 1146x213)

111 KB PNG

>>101764315
"I just realised... Sucrose and Collei... Anemo and dendro, wind...Blume..."?

Anonymous 08/07/24(Wed)05:50:45 No.101764340

File: 2024-08-07_00132_.png (1.17 MB, 720x1280)

1.17 MB PNG

>>101764233
thanks for the research efforts, heunpp+beta looks very promising, had not considered that one yet, pic related

Anonymous 08/07/24(Wed)05:51:03 No.101764346

File: FD_00319_.png (1011 KB, 1024x1024)

1011 KB PNG

Some data points for you guys

Anonymous 08/07/24(Wed)05:52:16 No.101764352

>>101764297
The squares on the frequency analysis have fuck all to do with squares we're talking about which are on the image itself. Dummy.

Anonymous 08/07/24(Wed)05:53:22 No.101764360

>cropping out the background and replacing it with a solid color when training a character LORA
thoughts?

Anonymous 08/07/24(Wed)05:54:07 No.101764371

>>101764360
Bad. It fucks up the LoRA. All the gens using your subject will have a white or black background. You can't prompt a background back in.

Anonymous 08/07/24(Wed)05:55:08 No.101764379

>>101764360
>>101764371
To add to this, you want as many diverse backgrounds for your subject as possible: inside, outside, nature, urban, light, dark, day, night, etc.

Anonymous 08/07/24(Wed)05:55:59 No.101764387

File: ComfyUI_Flux_4677.jpg (193 KB, 1024x768)

193 KB JPG

>>101764233
thanks a lot anon, saved

Anonymous 08/07/24(Wed)05:55:59 No.101764388

>>101764379
>>101764371
are you tagging the backgrounds then? how thorough do you have to be?

Anonymous 08/07/24(Wed)05:56:56 No.101764392

>>101764387
could you make these more fuckable? thanks.

Anonymous 08/07/24(Wed)05:57:34 No.101764396

File: ComfyUI_Flux_25.png (957 KB, 1280x720)

957 KB PNG

>>101764102
>dev2 branch scrapped
RIP.
If you were on the dev2 branch before updating, just do git checkout dev2 and don't touch it again until a major update. Otherwise, try git checkout on the main branch back to the commit before lllyasviel started messing with the repo, somewhere in June-July. Alternatively give reForge a try, though I have no idea if it even works properly.

Anonymous 08/07/24(Wed)05:59:13 No.101764404

File: 1715122314266077.png (1.36 MB, 1024x1024)

1.36 MB PNG

>a 1060 6gb took 120 sec per gen on sd1
>a 3060 12gb takes 150 sec per flux gen
Why Are We Still Here? Just To Suffer?
tho i get the impression something very wrong is happening, swarmui uses a constant 24gb ram after loading the model but flushes the vram multiple times.
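
A rough back-of-the-envelope check suggests the 24gb figure is not necessarily a bug. This sketch assumes the roughly 12 billion parameter count published for FLUX.1 (not stated in the thread) and counts only the raw weights, ignoring the text encoders, VAE and activations; weights_gb is a helper name made up for illustration.

```python
def weights_gb(num_params, bytes_per_param):
    """Size of the raw weights in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


FLUX_PARAMS = 12e9  # assumption: published ~12B parameter count for FLUX.1
VRAM_GB = 12        # the 3060 from the post

for name, nbytes in [("fp16/bf16", 2), ("fp8", 1)]:
    size = weights_gb(FLUX_PARAMS, nbytes)
    verdict = "fits" if size < VRAM_GB else "does not fit"
    print(f"{name}: {size:.0f} GB of weights, {verdict} in {VRAM_GB} GB of VRAM")
```

Under these assumptions the 16-bit weights alone are about 24 GB, so a 12 GB card has to keep them in system RAM and stream parts into VRAM, which would be consistent with both the constant RAM usage and the repeated VRAM flushes.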

Anonymous 08/07/24(Wed)05:59:32 No.101764410

>>101764388
I can't remember, haven't trained any LoRAs since SDXL, I used an auto tagger and just gave the data a once over.
I remember doing the exact thing of removing the background of the data set and it definitely fucks up the LoRA.
I ran the next session with backgrounds in and it was perfect.

Anonymous 08/07/24(Wed)06:00:18 No.101764418

File: 2024-08-07_00134_.png (1.46 MB, 1024x1024)

1.46 MB PNG

>>101764233
heunpp+beta looks cool, but euler+beta (see >>101764165) has better sky

Anonymous 08/07/24(Wed)06:01:22 No.101764426

File: 2024-08-07_00135_.png (1.58 MB, 1024x1024)

1.58 MB PNG

>>101764233
DPMadaptive+beta is madness, it went to 60 iterations before it was happy with convergence

Anonymous 08/07/24(Wed)06:04:17 No.101764445

File: FD_00323_.png (1.49 MB, 1024x1024)

1.49 MB PNG

Anonymous 08/07/24(Wed)06:07:32 No.101764469

File: ComfyUI_Flux_4689.jpg (123 KB, 1024x768)

123 KB JPG

>>101764233
heunpp2 beta 14 steps

Anonymous 08/07/24(Wed)06:08:14 No.101764479

>>101764233
ipndm for schizo gen

Anonymous 08/07/24(Wed)06:12:15 No.101764504

File: FD_00329_.png (1.61 MB, 1024x1024)

1.61 MB PNG

Anonymous 08/07/24(Wed)06:19:06 No.101764550

File: FD_00334_.png (1.44 MB, 1024x1024)

1.44 MB PNG

Anonymous 08/07/24(Wed)06:32:26 No.101764659

File: FD_00342_.png (1.11 MB, 1024x1024)

1.11 MB PNG

Anonymous 08/07/24(Wed)06:34:06 No.101764674

File: 2024-08-07_00144_.png (1.61 MB, 1024x1024)

1.61 MB PNG

>>101764469
ya its nice but its sloooow, euler+beta ~1.2s/it, heunpp+beta ~2.4s/it, with negatives hack ~5s/it
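
To put those per-iteration numbers in terms of wall-clock time per image, here is a small worked example. The seconds-per-iteration values are the ones quoted in the post; the 20-step count is an assumption for illustration, and real timings depend on hardware and resolution.

```python
# Seconds per iteration as reported in the post above.
SECONDS_PER_IT = {
    "euler+beta": 1.2,
    "heunpp+beta": 2.4,
    "heunpp+beta with negatives hack": 5.0,
}

STEPS = 20  # assumption: a typical step count, not taken from the thread


def gen_time(seconds_per_it, steps):
    """Sampling time for one image, ignoring model load and VAE decode."""
    return seconds_per_it * steps


for name, s_per_it in SECONDS_PER_IT.items():
    print(f"{name}: about {gen_time(s_per_it, STEPS):.0f} s for {STEPS} steps")
```

The doubling from euler to heunpp matches what one would expect from a second-order sampler that evaluates the model more than once per step, and the negatives hack roughly doubles the cost again.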

Anonymous 08/07/24(Wed)06:34:53 No.101764681

File: ComfyUI_01000_.png (917 KB, 1024x1024)

917 KB PNG

Anonymous 08/07/24(Wed)06:37:05 No.101764700

File: ComfyUI_01002_.png (850 KB, 1024x1024)

850 KB PNG

Anonymous 08/07/24(Wed)06:39:31 No.101764732

>>101764297
>Upscale the image
>Grid pattern gone, like magic!

Anonymous 08/07/24(Wed)06:42:53 No.101764761

>>101764396
>Alternatively give reForge a try, though I have no idea if it even works properly.
It works alright, but it's starting to have some bloat

Anonymous 08/07/24(Wed)06:43:07 No.101764765

File: file.png (1.02 MB, 816x645)

1.02 MB PNG

DreamshaperXL 4 steps. Um... didn't exactly get the prompt correct, but I assume that has to do with some SFW thing with fooocus.

Anonymous 08/07/24(Wed)06:45:14 No.101764785

File: media_GULSUrZW0AAkuyC.png (2.5 MB, 2881x1975)

2.5 MB PNG

Have you tried decreasing the CFG value so that you get more varied styles out of your prompts?

Anonymous 08/07/24(Wed)06:47:04 No.101764806

>>101764785
no because using CFG cuts speed by half

Anonymous 08/07/24(Wed)06:51:00 No.101764857

File: aasa.jpg (1.2 MB, 2881x1975)

1.2 MB JPG

>>101764785
Would you look at that!

Anonymous 08/07/24(Wed)06:51:57 No.101764864

>>101764857
>In the style of Pablo Picasso

NO YOU CANT DO THAT

Anonymous 08/07/24(Wed)06:52:57 No.101764877

can I get a link to a workflow with CFG included so I can give it a try?

Anonymous 08/07/24(Wed)06:55:40 No.101764911

File: ComfyUI_01009_.png (916 KB, 1280x824)

916 KB PNG

Anonymous 08/07/24(Wed)06:57:30 No.101764931

File: 2024-08-07_00157_.png (2.52 MB, 1280x1024)

2.52 MB PNG

>>101764864
nta, but yes I can.

Anonymous 08/07/24(Wed)06:57:49 No.101764934

>>101764877
Sure, here it is.
https://files.catbox.moe/rf18x1.png

Anonymous 08/07/24(Wed)07:01:15 No.101764974

File: ComfyUI_01011_.png (1.23 MB, 1280x824)

1.23 MB PNG

Anonymous 08/07/24(Wed)07:03:16 No.101764992

>>101764785
>>101764857
Is it still 2-3 times slower than having cfg at 1?

Anonymous 08/07/24(Wed)07:04:02 No.101764999

File: ComfyUI_Flux_4741.jpg (127 KB, 1056x480)

127 KB JPG

Anonymous 08/07/24(Wed)07:04:26 No.101765003

>>101764992
it's 2 times slower when cfg isn't 1 because it has to calculate the negative prompt on top of the positive prompt now, desu I don't know why it shouldn't be the same speed if you put nothing in the negative prompt though

Anonymous 08/07/24(Wed)07:05:59 No.101765023

File: FD_00350_.png (1.86 MB, 1024x1024)

1.86 MB PNG

>>101764974
Jesus Christ I thought that was a real Germans desktop
>>101764992
Yes.
It definitely does something though.
>Princess Jasmine in the style of Frida Kahlo
Just not sure it's the something I intended.

Anonymous 08/07/24(Wed)07:07:42 No.101765038

>>101765023
>real Germans desktop

Anyone else notice that a lot of the default gibberish in this model looks distinctly German?

Anonymous 08/07/24(Wed)07:08:07 No.101765045

File: pose.png (2.31 MB, 1280x1856)

2.31 MB PNG

Have any of you been able to get a pose like this out of FLUX? I tried, but without success. Can it just not do it?

Anonymous 08/07/24(Wed)07:08:36 No.101765049

>>101765003
Have you guys tested whether having the same token count in pos and neg affects generation time?

Anonymous 08/07/24(Wed)07:08:43 No.101765052

File: ComfyUI_01014_.png (979 KB, 1280x824)

979 KB PNG

Anonymous 08/07/24(Wed)07:09:24 No.101765062

File: FD_00351_.png (1.89 MB, 1024x1024)

1.89 MB PNG

>>101765023
>a painting of princess jasmine by Frida Kahlo,
cfg 0.3
>>101765038
I mean there's a model called Schnell

Anonymous 08/07/24(Wed)07:10:08 No.101765073

File: fp032.jpg (362 KB, 1024x1024)

362 KB JPG

Anonymous 08/07/24(Wed)07:10:36 No.101765078

File: FD_00352_.png (1.59 MB, 1024x1024)

1.59 MB PNG

>>101765062
>Princess Jasmine painted by Frida Kahlo
cfg 1.0
cfg low is placebo, it's just being more creative and you are fluking results I think.

Anonymous 08/07/24(Wed)07:11:23 No.101765089

>>101764857
right is cool/interesting but it's still not even close to the style of Picasso
like that's not even a decent approximation of cubism lol, it's just some totally different random thing

Anonymous 08/07/24(Wed)07:13:23 No.101765104

File: file.png (2.51 MB, 1024x1024)

2.51 MB PNG

Anonymous 08/07/24(Wed)07:13:25 No.101765105

File: 2024-08-07_00162_.png (2.37 MB, 1280x1024)

2.37 MB PNG

>>101765023
>in the style of Frida Kahlo
is not by Frida Kahlo if there is no mustache and nearly unibrow
>>101765078
>cfg low is placebo
this, it will ofc change something, but my Picasso Mikus just went full back to anime with low CFG

Anonymous 08/07/24(Wed)07:14:21 No.101765110

>>101765104
nice! What's your prompt anon?

Anonymous 08/07/24(Wed)07:14:57 No.101765115

>>101765003
>I don't know why it shouldn't be the same speed if you put nothing in the negative prompt though
because originally having nothing in the negative prompt was the whole point. Classifier-Free Guidance runs one pass with no text condition and another with the text condition, then you control how much of each to use when changing the noise
putting things in the negative prompt is a hack of CFG
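
The mixing step described above can be written out in a few lines. This is a sketch of the standard classifier-free guidance formula on toy numbers, not ComfyUI's actual code; the three-value lists stand in for full latent-sized noise predictions and the name cfg_combine is made up for illustration.

```python
def cfg_combine(uncond, cond, scale):
    """Classifier-free guidance: start from the unconditional prediction and
    move toward the conditional one by a factor of `scale`."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


# Toy noise predictions (made-up numbers, three values instead of a latent).
uncond = [0.10, -0.20, 0.30]  # pass with empty or negative prompt
cond = [0.30, 0.10, 0.20]     # pass with the positive prompt

at_one = cfg_combine(uncond, cond, 1.0)
at_three = cfg_combine(uncond, cond, 3.0)
print("cfg 1.0:", [round(v, 3) for v in at_one])
print("cfg 3.0:", [round(v, 3) for v in at_three])
```

At a scale of 1 the result is just the conditional prediction, so the unconditional pass contributes nothing and can be skipped, which would explain the roughly 2x speed difference reported earlier. At any other scale both passes are needed, whether or not the negative prompt is empty.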

Anonymous 08/07/24(Wed)07:16:25 No.101765131

>>101765045
it doesn't know naughty stuff and that pose is rarely seen in SFW contexts, anon, but how close have you gotten and what prompt did you use?

Anonymous 08/07/24(Wed)07:17:34 No.101765141

>>101765115
oh ok, thanks for the explanation anon

Anonymous 08/07/24(Wed)07:17:49 No.101765144

>>101765110
i'm the schizoprompter from a few threads back.
here's the catbox. keep in mind that only about 1 in 10 or 1 in 20 of the outputs are any good; the prompt and settings produce very chaotic outputs.
https://files.catbox.moe/85ujkt.png
thanks to the anons who provided info about samplers and schedulers recently

Anonymous 08/07/24(Wed)07:18:35 No.101765149

>>101765144
>thanks to the anons who provided info about samplers and schedulers recently
it was me kek, and thanks for the catbox

Anonymous 08/07/24(Wed)07:18:39 No.101765151

File: FD_00003_.png (1.77 MB, 1024x1536)

1.77 MB PNG

>>101765045
uuuuum.....

Anonymous 08/07/24(Wed)07:19:45 No.101765160

I've seen a few gigantic workflows shared, and since I'm new to comfyui, I have no idea what they actually do.
Is there any guide to understand what nodes do beyond simple "my first workflow with flux/sdxl" ?

Anonymous 08/07/24(Wed)07:20:33 No.101765167

>>101765151
It sucks at most poses that are even mildly nsfw.

Anonymous 08/07/24(Wed)07:20:44 No.101765169

>>101765144
Oh shit you're prompting in full mode.
I can't handle that.

Anonymous 08/07/24(Wed)07:21:15 No.101765173

>>101765160
i'm new too, and from what i've seen there's not a whole lot. you can learn a lot surprisingly quick by lurking these threads tho

Anonymous 08/07/24(Wed)07:21:55 No.101765179

>>101765160
>Is there any guide to understand what nodes do
Yes.
https://github.com/comfyanonymous/ComfyUI/blob/master/nodes.py

Anonymous 08/07/24(Wed)07:22:04 No.101765181

File: 1714143032016335.png (379 KB, 512x512)

379 KB PNG

>>101764785
for some reason no matter what I tried on 1.5 and XL it never managed to make a bicycle or motorcycle look correct. This is impressive to me.
>>101765045
We will probably have to wait for community checkpoint for the good stuff.

How does one even install the bloody thing? Finally have a reason to learn Comfy after getting too comfy with A1111, but every guide says to do a different thing (put it into checkpoints, no put it into unet etc)

Anonymous 08/07/24(Wed)07:23:16 No.101765196

File: ComfyUI_01024_.png (887 KB, 768x1280)

887 KB PNG

Anonymous 08/07/24(Wed)07:23:18 No.101765197

>>101765181
>How does one even install the bloody thing?
Go for that tutorial, it's the best one
https://www.youtube.com/watch?v=stOiAuyVnyQ

Anonymous 08/07/24(Wed)07:24:40 No.101765212

>>101765181
if you're talkin about flux here's the quick setup guide
https://comfyanonymous.github.io/ComfyUI_examples/flux/
this should probably be in the OP now that i think about it

Anonymous 08/07/24(Wed)07:25:23 No.101765221

File: ComfyUI_01025_.png (945 KB, 768x1280)

945 KB PNG

Anonymous 08/07/24(Wed)07:25:23 No.101765222

>>101765181
This is their official blog
https://comfyanonymous.github.io/ComfyUI_examples/flux/
Install comfy, download the files on that page, then drag and drop the image into the comfyui page
Super Simple Stuff
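
On the "checkpoints or unet" confusion, here is a sketch of where each downloaded file is expected to go. The folder layout is an assumption based on the ComfyUI Flux example page linked above and may change between versions; the two text-encoder file names are also assumptions, since the thread only names flux1-dev and ae. The helper destination is made up for illustration and only builds paths without touching the disk.

```python
from pathlib import Path

# Assumed layout (check the linked example page for the current one).
TARGET_DIRS = {
    "flux1-dev.safetensors": "models/unet",   # diffusion model only
    "ae.safetensors": "models/vae",           # Flux autoencoder
    "t5xxl_fp16.safetensors": "models/clip",  # assumed text-encoder file name
    "clip_l.safetensors": "models/clip",      # assumed text-encoder file name
}


def destination(comfy_root, filename):
    """Path a downloaded file should be moved to under the ComfyUI folder."""
    return Path(comfy_root) / TARGET_DIRS[filename] / filename


for name in TARGET_DIRS:
    print(destination("ComfyUI", name).as_posix())
```

Under this layout nothing goes into models/checkpoints, because the Flux download is the diffusion model alone rather than an all-in-one checkpoint.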

Anonymous 08/07/24(Wed)07:26:07 No.101765233

File: Screenshot 2024-08-07 232351.png (505 KB, 2921x1545)

505 KB PNG

>>101765160
>gigantic workflows
You should see some of my 1.5 shit
This is tame compared to some of the autism I have seen

Anonymous 08/07/24(Wed)07:26:19 No.101765235

>>101765222
>official blog

Anonymous 08/07/24(Wed)07:27:10 No.101765240

>>101765233
wtf...

Anonymous 08/07/24(Wed)07:27:35 No.101765245

>>101765197
Thanks m8. Dig the thumbnail!
>>101765212
>>101765181
thank you! I saw the link somewhere in the links in OP I think. Gonna give it a go and do the YT if I fail miserably (expected outcome)
>>101764396
>reForge
oh God, are they making a duct tape supported fork of an abandoned fork?

Anonymous 08/07/24(Wed)07:27:53 No.101765249

>>101765235
oh yeah my b I was originally going to link to their official blog at
https://blog.comfy.org/august-2024-flux-support-new-frontend-for-loops-and-more/
but then I realised it'd be easier to click through to the actual tutorial and link that, and forgot to edit

Anonymous 08/07/24(Wed)07:28:01 No.101765253

File: ComfyUI_01026_.png (903 KB, 768x1280)

903 KB PNG

Anonymous 08/07/24(Wed)07:28:09 No.101765256

>>101765233
I love factorio and cities skylines too.

Anonymous 08/07/24(Wed)07:29:43 No.101765269

File: img_10.jpg (141 KB, 768x1360)

141 KB JPG

>>101765131
It's not that rare/naughty. "a girl looking back through between her legs" finds a lot of hits that are clearly not intended to be sexy, but that doesn't work as a prompt and just gives a girl in underwear facing away.

I tried things like "a drawing of princess peach standing bent over with her legs spread, looking back at the viewer from between her legs. her face is upside down." but it makes things like pic rel.

Anonymous 08/07/24(Wed)07:29:51 No.101765271

>>101765245
>oh God, are they making a duct tape supported fork of an abandoned fork?
What has been abandoned?

Anonymous 08/07/24(Wed)07:29:52 No.101765272

>>101765240
Looks more complicated than it is.
Essentially, because 1.5 was trained on 512x512 images, genning large images in one step is impossible; you get way too many artifacts.
This workflow does 4 upscale steps to 4k and has a face detailer. That's it.
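
As a worked example of what four upscale passes from 512 to 4k might look like, here is a sketch that spaces the passes by a constant ratio. The actual sizes in the screenshot are not known; reading "4k" as 4096 pixels, the constant ratio and the rounding to multiples of 8 are all assumptions, and upscale_schedule is a made-up helper name.

```python
def upscale_schedule(start, target, passes, multiple=8):
    """Side lengths from `start` to `target` with a constant ratio per pass,
    rounded to the nearest multiple of `multiple`."""
    ratio = target / start
    sizes = []
    for i in range(passes + 1):
        side = start * ratio ** (i / passes)
        sizes.append(int(round(side / multiple)) * multiple)
    return sizes


# Base resolution, then the result of each of the 4 upscale passes.
print(upscale_schedule(512, 4096, 4))  # [512, 864, 1448, 2432, 4096]
```

Each pass enlarges the image by about 1.68x, which keeps every individual step small enough for the model to refine rather than reinvent the picture.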

Anonymous 08/07/24(Wed)07:30:52 No.101765280

>>101765173
I'm able to make a basic workflow, it's just the terms: what's the difference between unet and checkpoint loading? What's sigma? What's guidance why is it different from cfg? etc

Anonymous 08/07/24(Wed)07:33:22 No.101765309

>>101765271
Forge was abandoned iirc, and the author said it will no longer be developed as a stable alternative to A1111 but he will test out some new features on it. At least that's what I got a few months ago.

Anonymous 08/07/24(Wed)07:36:35 No.101765342

>>101765309
>Forge was abandoned iirc
https://github.com/lllyasviel/stable-diffusion-webui-forge looks like there's some big update coming

Anonymous 08/07/24(Wed)07:36:53 No.101765345

f

Anonymous 08/07/24(Wed)07:38:50 No.101765364

>>101765342
interesting. maybe I should switch back to forge. I remember reading the news when there was some drama between him and A1111 devs. Maybe situation improved since Forge did produce results way faster and reliably even on a 4080

Anonymous 08/07/24(Wed)07:40:52 No.101765388

>>101765364
>between him and A1111 devs
Nah, it was comfy and his butt buddies who got their panties twisted. A1111 dev doesn't seem to give a shit about anything

Anonymous 08/07/24(Wed)07:41:05 No.101765391

File: FD_00357_.png (898 KB, 1024x1024)

898 KB PNG

>>101765280
>What's sigma
It's another model, a community darling, better than SD but lacks the same level of support.
cfg stands for classifier-free guidance. Guidance helps shape the images a model makes; cfg specifically adjusts images to better match prompts by mixing results with and without the prompt's influence.
The unet loader loads only the diffusion network itself; the checkpoint loader loads a model's full saved state.

Anonymous 08/07/24(Wed)07:44:33 No.101765429

>>101765280
you can find some terms explained for example here
>https://diffute.com/glossary
or here
>https://replicate.com/guides/stable-diffusion/glossary

but every new diffusion method and model comes with new tech and new terms so they are outdated (for flux for example)

>>101765280
>what's the difference between unet and checkpoint loading?
unet loader loads flux model and the like
checkpoint loader loads sd15/sdxl models and the like

>>101765280
>What's guidance why is it different from cfg? etc
guidance for flux makes t5xxl follow the prompt more precisely (it can forget styles if it's too high tho, so lower guidance for that)
cfg is a different type of guidance (classifier-free guidance) that is implemented outside the text encoder interpretation. you can use it in flux with a hack using dynamic thresholding that normalizes the output back to a "virtual" mimic cfg of 1.0, which is what FLUX actually wants

>>101765280
>What's sigma
in what context did you see that? cause it can have many meanings from lora learning to a model name

Anonymous 08/07/24(Wed)07:45:14 No.101765437

>>101764200
NTA but figured it out, I got that grid when I dropped guidance to 1

Anonymous 08/07/24(Wed)07:47:13 No.101765455

>>101765388
oh yeah, now I'm starting to remember it. Comfy accused him of reusing some code which was probably not true. Stability wants Comfy to be their main UI which made me dislike Comfy along with the drama.
>>101765429
Is there a guide like that for all the prompt formatting related to SD? Like Break, brackets, || etc? Maybe a cheatsheet or something. I also tried experimenting with BREAK on the free flux demo but it doesnt seem to respond to it at all.

Anonymous 08/07/24(Wed)07:47:53 No.101765464

File: file.png (46 KB, 714x267)

46 KB PNG

>>101765391
>>101765429
Thanks anons.

For sigma I meant picrel.

Anonymous 08/07/24(Wed)07:48:38 No.101765474

File: FD_00360_.png (2 MB, 1024x1024)

2 MB PNG

Anonymous 08/07/24(Wed)07:50:04 No.101765490

File: doratest.png (8 KB, 1472x37)

8 KB PNG

Finally time to test DoRA. Running with AdamW8Bit, linear, huber loss enabled
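
For context on the "huber loss enabled" setting, here is a sketch of the textbook Huber loss next to plain squared error. This is the general formula only; the trainer used in the post may implement a scheduled or smoothed variant, and the delta of 1.0 is an assumption.

```python
def huber(error, delta=1.0):
    """Textbook Huber loss: quadratic near zero, linear beyond `delta`."""
    a = abs(error)
    if a <= delta:
        return 0.5 * error * error
    return delta * (a - 0.5 * delta)


def half_squared(error):
    """Plain squared-error loss with the same 0.5 factor, for comparison."""
    return 0.5 * error * error


for e in [0.1, 1.0, 5.0]:
    print(f"error {e}: huber {huber(e):.3f}, squared {half_squared(e):.3f}")
```

The two agree for small errors, but a large error of 5.0 costs 4.5 under Huber against 12.5 under squared error, so outlier images in a dataset pull on the weights less.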

Anonymous 08/07/24(Wed)07:50:37 No.101765497

>>101765455
>BREAK on the free flux demo but it doesnt seem to respond to it at all
cause it doesn't. BREAK and prompt weights don't work on FLUX

Anonymous 08/07/24(Wed)07:50:52 No.101765499

comfy is telling me to run:
>-m pip install --upgrade pip
it doesn't work either in cmd or git, I'm doing something wrong, any hint?

Anonymous 08/07/24(Wed)07:52:16 No.101765515

It's weird, but multiple times now I've seen flux show a naked person at low steps, and then at some step (for example 8-9) they suddenly get underwear.
Is that related to how it's been trained?

Anonymous 08/07/24(Wed)07:52:39 No.101765517

>>101765499
that's python suggesting to upgrade it. remove the -m part at the front and it should work.

Anonymous 08/07/24(Wed)07:52:59 No.101765522

>>101765464
thats explained here >https://openart.ai/workflows/fish_intent_33/flux-dev-splitsigmas/j8kSUra4WQSQMoePIj9m

Anonymous 08/07/24(Wed)07:55:35 No.101765537

>>101765490
For Flux?

Anonymous 08/07/24(Wed)07:56:10 No.101765542

>>101765522
I'll take a look then, thanks.

Anonymous 08/07/24(Wed)07:57:57 No.101765559

File: fd002.png (1.36 MB, 1024x1024)

1.36 MB PNG

Anonymous 08/07/24(Wed)07:57:58 No.101765560

>>101765537
Testing some anime-jank with 1.5, DoRA should be great with multiple concepts

Anonymous 08/07/24(Wed)07:59:37 No.101765578

File: FD_00375_.png (1.32 MB, 1024x1024)

1.32 MB PNG

>>101765560
A 1.5 DoRA? At this time of year?

Anonymous 08/07/24(Wed)07:59:53 No.101765580

>>101765517
thanks, I managed to make it work by using the full string and adding --user at the end
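
The likely reason the pasted string failed is that "-m pip install --upgrade pip" is only the argument list; it needs a Python interpreter in front of it. A minimal sketch of the full command, assuming the interpreter that should be upgraded is the one running the script (pip_upgrade_command is a made-up helper that only builds the command and does not execute it):

```python
import sys


def pip_upgrade_command(user=False):
    """Build the full upgrade command for the current Python interpreter."""
    cmd = [sys.executable, "-m", "pip", "install", "--upgrade", "pip"]
    if user:
        cmd.append("--user")  # install into the per-user site directory
    return cmd


print(" ".join(pip_upgrade_command(user=True)))
```

Going through "python -m pip" rather than a bare "pip" ensures the upgrade targets the same interpreter ComfyUI runs on, which matters when several Python installs are present. The list can be passed to subprocess.run(cmd, check=True) to actually run it.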

>>101765559
>here's your controller bro

Anonymous
08/07/24(Wed)08:03:52 No.101765617

should I change the file extension of flux1-dev.safetensors and ae.safetensors to .sft?

Anonymous
08/07/24(Wed)08:04:37 No.101765623

>>101765617
No. Why?

Anonymous
08/07/24(Wed)08:05:30 No.101765631

>>101765617
".sft" is what i got and i never had to change it, not sure why its .safetensors for you

Anonymous
08/07/24(Wed)08:06:00 No.101765636

>>101765560
honestly, dora kinda sucked for me when I tried multi concept lora.

Anonymous
08/07/24(Wed)08:07:25 No.101765650

>>101765636
what settings did you use? how large dataset?

Anonymous
08/07/24(Wed)08:07:53 No.101765655

>>101765617
.jpg .jpeg same thing.

Anonymous
08/07/24(Wed)08:09:01 No.101765671

>>101765617
it does not matter .. three letter file extensions are a DOS limitation, you using DOS? no, therefore it does not matter
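
To illustrate why the extension is cosmetic: a safetensors file is recognizable from its contents, which start with an 8-byte little-endian header length followed by a JSON header. A rough sketch that checks this regardless of the file name; the function name and the size cap are assumptions for illustration, not part of any loader:

```python
import json
import struct


def looks_like_safetensors(path, max_header_bytes=100_000_000):
    """Return True if the file starts with a safetensors-style length + JSON header."""
    with open(path, "rb") as f:
        prefix = f.read(8)
        if len(prefix) < 8:
            return False
        # First 8 bytes: unsigned 64-bit little-endian size of the JSON header.
        (header_len,) = struct.unpack("<Q", prefix)
        if header_len == 0 or header_len > max_header_bytes:
            return False
        raw = f.read(header_len)
    if len(raw) != header_len:
        return False
    try:
        header = json.loads(raw.decode("utf-8"))
    except ValueError:  # covers both bad UTF-8 and bad JSON
        return False
    return isinstance(header, dict)
```

The same check gives the same answer whether the file is named `.sft` or `.safetensors`, since only the bytes are inspected.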

Anonymous
08/07/24(Wed)08:09:46 No.101765678

>>101764212
NTA but yeah, not because I care but it just makes artlets seethe so I do it

Anonymous
08/07/24(Wed)08:10:47 No.101765687

File: FD_00014_.png (1.32 MB, 1024x1024)

>>101765671
DOS is open source now, you mean you aren't genning on DOS? baka my head

Anonymous
08/07/24(Wed)08:12:54 No.101765710

How the fuck does the AI know who Integra Hellsing is, but not Haman Karn.

Anonymous
08/07/24(Wed)08:16:05 No.101765750

File: ComfyUI_Flux_44.png (1.13 MB, 1216x832)

>>101765233
Group up your shit and use get/set nodes or anything everywhere to get rid of this disgusting spaghetti.

Anonymous
08/07/24(Wed)08:19:25 No.101765783

>>101765710
I consider any character still there as a mistake from the model makers, I'm sure they'd remove that too if they could.
Maybe in their next iteration.

Anonymous
08/07/24(Wed)08:20:47 No.101765797

File: FD_00023_.png (1.42 MB, 1024x1024)

>>101765678
Artlets seethe just by existing.
I honestly don't know why they don't just use imagen in their art workflows.
My wife is a painter. She gens concepts, refines them, then paints them. Speeds up the whole process.
I gen character sheets and model them in blender.
>>101765750
I do. This was from like 2 years ago and was in the testing phase, where everything is spaghet. Once I get it working how I want I group things.

Anonymous
08/07/24(Wed)08:21:20 No.101765808

File: ComfyUI_temp_pzszu_00041_.png (1.22 MB, 1024x1024)

>>101765650
dataset from around 300 up to somewhat above 1000, with around 50 datapoints per concept, iteratively scaled up while working on improving the adapter. I used locon and prodigy optimizer with long training times (some of the concepts took around 4-6 hours to converge).
just locon and prodigy was much more effective for me, dora overfit like crazy without converging to the concepts. I could see dora really slap for training a style though in minimal steps.

interested in hearing your experiences anon

Anonymous
08/07/24(Wed)08:21:43 No.101765816

>>101765797
>I honestly don't know why they don't just use imagen in their art workflows.
Honestly I'd rather keep it the way it is now, them just whining and screaming instead of gaining intelligence, it's more entertaining that way

Anonymous
08/07/24(Wed)08:23:07 No.101765840

File: FD_00021_.png (1.35 MB, 1024x1024)

>>101765816
I miss when they used to come in here seething daily, and getting dogpiled for having a dumb ass opinion

Anonymous
08/07/24(Wed)08:25:15 No.101765872

>>101765797
Artists would be really good at this shit too because they probably know a bunch of prompt terms that the AI would recognize.

Anonymous
08/07/24(Wed)08:26:23 No.101765884

>>101765808
>interested in hearing your experiences anon
I'll report here. What dimension and alpha did you use? I have a feeling that scale weight norms is mandatory with this. For me prodigy overfits like crazy if I don't lower d_coef and use snr gamma

Anonymous
08/07/24(Wed)08:27:04 No.101765889

File: fd001.png (1.17 MB, 1024x1024)

Anonymous
08/07/24(Wed)08:27:09 No.101765891

File: file.png (1.04 MB, 813x634)

>>101765783
How is it a bug for the model to recognize characters?

Anonymous
08/07/24(Wed)08:28:14 No.101765903

Uhm? Is this real?
https://www.reddit.com/r/StableDiffusion/comments/1em9u6x/first_flux_controlnet_canny_was_just_released_by/

Anonymous
08/07/24(Wed)08:28:18 No.101765904

>>101765797
>I honestly don't know why they don't just use imagen in their art workflows.
Good artists already have AI in their workflow one way or another, from drafting ideas to testing to adding backgrounds, anything really.
The ones you see going apeshit 24/7 are either weirdly insecure about their skills, or luddites who refuse to use the new thing because it was created after they were born.

Anonymous
08/07/24(Wed)08:29:32 No.101765918

>>101765891
I'm reasoning using their intent anon.
I think it's retarded to scrub pop culture stuff like this, but they obviously did it on purpose.

Anonymous
08/07/24(Wed)08:33:44 No.101765981

File: 1717855867722800.png (32 KB, 1104x311)

>>101765891
deepfakes and using "registered TM" faces I guess.

ok this has potential but is so terribly and unbearably slow. Even with 16gb vram it goes into lowvram mode for some reason. With noVideo probably putting 12gb vram on all their 5xxx series chips and 14.5gb on a 5090, this will not be feasible as a gacha hobby

Anonymous
08/07/24(Wed)08:36:01 No.101766013

File: dev21stepsvsdevschnelldar(...).png (2.33 MB, 2048x1024)

unironically, if you want flux to listen to your prompts more without weird CFG tricks or sacrificing speed, use Schnell (4 steps, with inevitable quality loss on text) or a DARE merge of Schnell and Dev (4-16 steps): https://huggingface.co/martyn/FLUX.1-dev-schnell-dare-merge

prompt:
an african american hatsune miku with braided dreadlocks holding up a peace hand sign gesture.
no gimmicks, 1 CFG euler
left is Dev at 21 steps
right is the linked DARE merge between Dev and Schnell at 10 steps (it can converge at even lower steps but not as well as Schnell on its own)

The problem with Dev is as it needs more steps to converge, that 1 CFG is spread over more steps while at lower step counts, the CFG is proportionately higher and thus "listens" more.

Anonymous
08/07/24(Wed)08:36:22 No.101766015

File: ComfyUI_Flux_4849.jpg (203 KB, 1024x1024)

Anonymous
08/07/24(Wed)08:36:52 No.101766027

File: xyz_grid-0004-343.png (2.31 MB, 2112x1654)

>>101765884
I used network rank 8, network alpha 1, convolution rank 8, convolution alpha 1, which was always the best, no matter how many concepts I added. Also, I've read in a guide that higher batch count is bad for multiconcept, but don't listen to the haters: when I turned up batch count the model learned to differentiate picrel, which was a huge pain in the ass to learn.
>scale weight norms
I haven't tried that yet
>lower d_coef
yeah d_coef of 0.5 was pretty good for me

Anonymous
08/07/24(Wed)08:39:34 No.101766062

>>101766013
If you merge schnell and Dev, which license takes precedence? I'm assuming the shitter one.

Anonymous
08/07/24(Wed)08:40:26 No.101766073

File: c11b6.jpg (209 KB, 1024x1024)

>>101766015
ah it's the old cenobite sailor moon

Anonymous
08/07/24(Wed)08:42:00 No.101766093

>>101766013
Anyone care to make a thorough comparison between regular Dev and this frankenstein of a merge with different prompts of various complexity?
>inb4 me
I will, but my PC is shit and it will take a long while.

Anonymous
08/07/24(Wed)08:43:18 No.101766107

File: ComfyUI_Flux_4863.jpg (207 KB, 1024x1024)

>>101766073
yep trying to get it in a panavision film style. so far unlucky. its entertaining though

Anonymous
08/07/24(Wed)08:45:24 No.101766143

Where's debo

Anonymous
08/07/24(Wed)08:45:53 No.101766152

>>101766062
you are correct, the apache 2.0 is void then and you get the Black Forest Labs license

Anonymous
08/07/24(Wed)08:46:13 No.101766153

File: FD_00042_.png (2.41 MB, 1536x1024)

>>101765904
Even shit artists are using it, they are ALL using generative fill in photoshop but they don't consider it AI because it's part of photoshop.
They are all retards.
>>101765872
They are, my wife has that eye for things, she makes some cool gens.
I get creatively bankrupted constantly and just gen from an empty prompt to explore the latent space for ideas. I don't consider myself an artist at all.

Anonymous
08/07/24(Wed)08:46:54 No.101766159

File: ComfyUI_Flux_4879.jpg (112 KB, 1056x480)

>>101766073

Anonymous
08/07/24(Wed)08:48:18 No.101766175

File: ComfyUI_Flux_4887.jpg (106 KB, 1056x480)

>>101766159

Anonymous
08/07/24(Wed)08:50:14 No.101766195

File: a man of integrity.png (431 KB, 640x478)

>>101766159
>>101766175
Now add a caption.

Anonymous
08/07/24(Wed)08:54:06 No.101766241

>>101766027
I've been defaulting to dim32 alpha16 with 1.5 for a long time. Batch 2.

>yeah d_coef of 0.5 was pretty good for me
I've gone as low as 0.1 with great results
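
To compare the two setups mentioned here: in the usual LoRA formulation the learned update is multiplied by alpha / rank, so rank 8 with alpha 1 and dim 32 with alpha 16 apply quite different scales. A quick sketch; the function name is illustrative, and individual trainers may apply extra scaling on top:

```python
def lora_scale(rank, alpha):
    """Multiplier applied to the low-rank update in the common alpha / rank convention."""
    return alpha / rank


settings = {
    "rank 8, alpha 1": (8, 1),
    "dim 32, alpha 16": (32, 16),
}

for name, (rank, alpha) in settings.items():
    print(f"{name}: scale = {lora_scale(rank, alpha)}")
```

A smaller scale means the adapter's weights have to grow larger to have the same effect, which interacts with the learning rate the optimizer settles on.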

Anonymous
08/07/24(Wed)08:56:52 No.101766280

File: 6835ada6aec2adfd0b5eb3dc4(...).jpg (382 KB, 896x1152)

Anonymous
08/07/24(Wed)08:57:30 No.101766287

File: ComfyUI_Flux_4915.jpg (99 KB, 1056x480)

>>101766195

Anonymous
08/07/24(Wed)09:00:46 No.101766317

>>101766287
proompt?

Anonymous
08/07/24(Wed)09:02:13 No.101766334

File: 1707358596972205.png (1.26 MB, 1024x1024)

Anonymous
08/07/24(Wed)09:03:52 No.101766354

File: ComfyUI_Flux_4931.jpg (96 KB, 1056x480)

>>101766195

Anonymous
08/07/24(Wed)09:05:32 No.101766367

File: ComfyUI_Flux_4933.jpg (86 KB, 1056x480)

>>101766317

it's meh and won't get you consistent results.
https://files.catbox.moe/m4t9ww.webp

Anonymous
08/07/24(Wed)09:17:37 No.101766502

>>101766367
thanks

Anonymous
08/07/24(Wed)09:20:00 No.101766529

I have a 3070ti and am running ReForge, should I even bother with trying to use Flux? There's a 20 GB checkpoint on Civit.

Anonymous
08/07/24(Wed)09:20:56 No.101766542

>>101764165
Explain like I'm retarded. What's the difference between cfg & guidance.

Anonymous
08/07/24(Wed)09:22:21 No.101766553

File: Flux-20240807_151750-gen-(...).png (1.72 MB, 832x1216)

>>101766107
this is a good one, inspiring!

Anonymous
08/07/24(Wed)09:22:37 No.101766558

File: 8d3310458def1da8e6a6f019a(...).jpg (599 KB, 896x1152)

>>101766542
Also I don't think it's gonna take much to make an NSFW version of this lol

Anonymous
08/07/24(Wed)09:34:04 No.101766679

>>101766529
yeah you could run it easily just takes time and you'll have to install comfy (or just wait for a1111/forge update)

Anonymous
08/07/24(Wed)09:34:27 No.101766686

>>101765710
I know who Integra Hellsing is, but not Haman Karn...

Anonymous
08/07/24(Wed)09:36:44 No.101766725

>>101765889
That's adorable. I'm going to see if I can make that irl.

Anonymous
08/07/24(Wed)09:39:08 No.101766752

File: fd003.png (1.24 MB, 1024x1024)

>>101766725
you could 3d print some of it ong

Anonymous
08/07/24(Wed)09:40:44 No.101766773

>>101766542
see >>101765429 and >>101765391

Anonymous
08/07/24(Wed)09:41:51 No.101766791

File: Flux-20240807_153156-gen-(...).png (1.44 MB, 1024x1024)

>>101766686
haman karn is from the gundam universe, best girl.
as expected, flux does decent macro stuff.

Anonymous
08/07/24(Wed)09:42:57 No.101766809

>>101766143
Right here
We're going to add the pastebin to the OP if you don't fuck off

Anonymous
08/07/24(Wed)09:46:35 No.101766859

>>101766809
You won't do nothing chud

Anonymous
08/07/24(Wed)09:46:51 No.101766863

>>101766752
Yep; there's a bunch of existing designs to 3D print, but they're either props or too big. I'm thinking clear or tint clear and small 1" OLED display with microcontroller running clock. Or something.

Anonymous
08/07/24(Wed)09:50:07 No.101766903

File: flux2_Y.png (2.57 MB, 1536x1344)

>>101766752
Your gens are my favorite

Anonymous
08/07/24(Wed)09:51:26 No.101766919

File: ComfyUI_Flux_4971.jpg (82 KB, 1056x480)

Anonymous
08/07/24(Wed)09:59:24 No.101767017

File: 1695457725513536.png (1.02 MB, 1024x1024)

Anonymous
08/07/24(Wed)10:06:18 No.101767102

>>101766773
Thanks, anon.

Anonymous
08/07/24(Wed)10:10:25 No.101767163

File: V52AxiGklEwSctdVufUoa.png (1.75 MB, 1488x1248)

Good morning anons. Hope you are all well.

Anonymous
08/07/24(Wed)10:10:52 No.101767167

File: ComfyUI_Flux_5009.jpg (125 KB, 1360x720)

Anonymous
08/07/24(Wed)10:13:01 No.101767196

>>101767167
So good. Little bit like that boss from Fallout 4 dlc. Fake movie screenshots are great

Anonymous
08/07/24(Wed)10:13:44 No.101767206

File: 5efc0db7d2b2f6df715f4dd3f(...).jpg (967 KB, 896x1152)

>>101767102
This actually works out pretty well. I know t5 better than I know SD kek. No wonder language models are so good at writing prompts for this thing.

Anonymous
08/07/24(Wed)10:18:32 No.101767271

File: ComfyUI_02591_.jpg (1.06 MB, 2048x2048)

Anonymous
08/07/24(Wed)10:26:37 No.101767363

>>101766013
>The problem with Dev is as it needs more steps to converge, that 1 CFG is spread over more steps while at lower step counts, the CFG is proportionately higher and thus "listens" more.
what do you mean "the CFG is proportionately higher"? CFG is constant on every step

Anonymous
08/07/24(Wed)10:29:21 No.101767386

>>101766241
I guess rank size and stuff is very individual to models kinda, but I've been surprised what fits into dim8 pony adapter

Anonymous
08/07/24(Wed)10:30:09 No.101767405

what do you guys load for clip? I've read that using both t5 and clip_l yields worse results than just using one of them, but I don't even know how to test that because the Load CLIP node doesn't have flux as a type, only DualCLIPLoader does

Anonymous
08/07/24(Wed)10:41:08 No.101767547

>>101764165
Put Frida Kahlo in the negative.

Anonymous
08/07/24(Wed)10:42:13 No.101767559

File: ComfyUI_Flux_5021.jpg (110 KB, 1360x720)

Anonymous
08/07/24(Wed)10:46:23 No.101767617

File: ComfyUI_00018_.png (1.11 MB, 1024x1024)

Anonymous
08/07/24(Wed)10:47:20 No.101767629

File: ComfyUI_02595_.jpg (1.04 MB, 2048x2048)

Anonymous
08/07/24(Wed)11:02:20 No.101767799

File: ComfyUI_30685_.png (1.16 MB, 1024x1024)

Anonymous
08/07/24(Wed)11:02:31 No.101767804

File: ComfyUI_00022.png (1015 KB, 1024x1024)

Anonymous
08/07/24(Wed)11:06:48 No.101767843

File: f454625db02fc0e9f022c15c1(...).jpg (705 KB, 896x1152)

Man, once this thing has a proper NSFW extension, I am going to jack off if you know what I mean.

Anonymous
08/07/24(Wed)11:07:27 No.101767851

>>101767843
Sadly probably months/a year away...

Anonymous
08/07/24(Wed)11:10:14 No.101767868

>>101767851
I don't know, man. Thirst makes people incredibly motivated!

Anonymous
08/07/24(Wed)11:11:23 No.101767883

>>101767868
doesn't matter how motivated, it's going to take months of H100s running non stop

Anonymous
08/07/24(Wed)11:13:53 No.101767908

>>101767883
And just getting started will be months away, is my guess.

Anonymous
08/07/24(Wed)11:16:04 No.101767928

>>101767843
How the fuck did you manage to get such a skimpy outfit?

Anonymous
08/07/24(Wed)11:16:31 No.101767931

File: 2024-08-07_00271_.png (1.59 MB, 1024x1024)

Anonymous
08/07/24(Wed)11:17:05 No.101767939

>>101767928
the model knows skimpy attire
basically the only horny stuff it does

Anonymous
08/07/24(Wed)11:17:58 No.101767949

Has anyone figured out the best guidance to use that doesn't trade too much style for prompt adherence? I'm having pretty good luck with 1.5 but that seems low, although I really don't have much sense of scale for this other than that they recommend 3.5 with the full model. (using schnell btw)

Anonymous
08/07/24(Wed)11:18:28 No.101767960

File: 2024-08-07_00254_.png (1.77 MB, 1024x1024)

>>101767939
ya it does

Anonymous
08/07/24(Wed)11:19:20 No.101767969

>>101767928
> Description: in an 80s slasher movie, starring a beautiful woman, looking scared. Her physique is curvy and her clothes are revealing and torn in places. Overall, exploitative feeling as if the scene is the product of the male gaze.

Anonymous
08/07/24(Wed)11:23:48 No.101768017

Haven't played with Flux yet, how's it with anime? Artists status?

Anonymous
08/07/24(Wed)11:24:28 No.101768024

File: file.png (591 KB, 1024x1024)

Anonymous
08/07/24(Wed)11:25:19 No.101768032

File: 2024-08-07_00278_.png (1.33 MB, 1024x1024)

faster horse! faster!

Anonymous
08/07/24(Wed)11:26:39 No.101768049

File: file.png (572 KB, 1024x1024)

Anonymous
08/07/24(Wed)11:28:31 No.101768061

>>101768017
Generic anime.
Most artists are deleted (maybe not the images themselves, but they're clearly not linked with the artist)
Same with many characters.

Anonymous
08/07/24(Wed)11:29:13 No.101768069

>>101767363
I'll revise the explanation. Schnell converges in 1-4 steps, and a side effect is that the model is inherently more sensitive to the prompt, because it targets a complete image in those 1-4 steps. It doesn't have time to converge slowly; the composition is already baked in on the first step. So Schnell and merges with Schnell are more sensitive to the positive prompt because they don't slowly sample a composition. Dev can end up sticking to an early composition that turns out less true to the prompt in the end result. Ignoring CFG, low-step models (LCM, Turbo, Lightning, Hyper, etc.) all have this quirk. I phrased it as proportional CFG because to get the same prompt adherence on Dev you would have to raise the CFG, which then requires Dynamic Thresholding to offset the burn. tl;dr 1 CFG on Schnell is adequate because of the adversarial diffusion distillation, 1 CFG on Dev is "weak".
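
On the "CFG is constant on every step" point: standard classifier-free guidance combines the unconditional and conditional predictions the same way at each step, and at CFG 1 the result is simply the conditional prediction. A minimal sketch with plain lists standing in for noise predictions; the names are illustrative and not taken from any UI's code:

```python
def cfg_combine(uncond, cond, scale):
    """Classifier-free guidance: move from the unconditional prediction toward the conditional one."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


uncond = [0.0, 1.0, -1.0]  # stand-in for the negative/empty-prompt prediction
cond = [1.0, 1.0, 1.0]     # stand-in for the positive-prompt prediction

print(cfg_combine(uncond, cond, 1.0))  # identical to cond, uncond drops out
print(cfg_combine(uncond, cond, 3.0))  # pushed past cond, away from uncond
```

As far as I understand, Flux Dev's separate "guidance" value is fed to the model as an input rather than applied through this formula, which is why the two knobs behave differently.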

Anonymous
08/07/24(Wed)11:31:06 No.101768081

File: ComfyUI_Flux_68.png (1.16 MB, 832x1216)

>>101768017
It has fantastic coherency overall if you describe everything that's happening, but it struggles with imitating most artstyles, anime or not. There are some cfg workarounds currently but they're hit or miss (mostly miss). When training gets optimized and we start getting fine-tuned checkpoints and loras en masse, the world's your oyster. It'll take a while though that's for sure.

Anonymous
08/07/24(Wed)11:33:31 No.101768116

>>101767969
If you want this prompt to make a completely topless image, add:
> As a matter of fact, she is nearly completely naked.
Before the sentence that begins with "Overall..."
Can also try:
> As a robust matter of fact...

Anonymous
08/07/24(Wed)11:35:49 No.101768142

File: f816bee67fb8d7318da368068(...).jpg (542 KB, 896x1152)

Anonymous
08/07/24(Wed)11:39:01 No.101768180

>ani got banned for naked catgirls
there is hope for the model but also I kinda liked his retro stuff so it sucks we don't see more from him

Anonymous
08/07/24(Wed)11:39:03 No.101768181

File: Capture.jpg (264 KB, 2352x770)

https://files.catbox.moe/vga5ha.jpg
That shit took forever... here's a XY plot between Guidance and CFG, imo I like the pictures at Guidance 0.6, it's pretty close to what I really wanted:
>17th century painting of Hatsune Miku riding a bicycle

Anonymous
08/07/24(Wed)11:41:31 No.101768205

>>101768081
Given that this thing used t5 and rotary attention, I think it's really quite possible that we get prompt generation workflows that blow up SD-style prompts into the requisite "novel-length" boomer prompts before too long. As someone who works in LLMs, anything you can accomplish with fine-tuning, you can accomplish with prompting, assuming you have enough context. The ratio is 1:10 or something like that (basically, a fine-tune is only worth the cost if you can cut down the prompt length by a factor of 10).

Is there a confirmed upper limit for how much proooompt flux can understand all at once?

Anonymous
08/07/24(Wed)11:41:46 No.101768208

File: image_2024-08-07_174142838.png (1.19 MB, 1024x1024)

Anonymous
08/07/24(Wed)11:42:48 No.101768226

>>101768180
You are still active in your discord, no need to announce ban

Anonymous
08/07/24(Wed)11:42:56 No.101768229

File: 2024-08-07_00286_.png (1.38 MB, 1024x1024)

kek, you wanna do some convenient censoring and FLUX gives you slimy mosskini

Anonymous
08/07/24(Wed)11:43:24 No.101768237

>>101768205
we'll eventually just have a tiny llm that is trained for prompt enhancing where it simply adds tedious detail without sacrificing the intent of the prompt itself

Anonymous
08/07/24(Wed)11:45:49 No.101768269

>>101768226
>doxcord
no thanks

Anonymous
08/07/24(Wed)11:47:24 No.101768285

File: 2024-08-07_00292_.png (1.36 MB, 1024x1024)

Anonymous
08/07/24(Wed)11:50:55 No.101768326

File: ComfyUI_temp_fkxqv_00152_.png (1.62 MB, 1024x1024)

>>101768032

Anonymous
08/07/24(Wed)11:52:00 No.101768341

>>101768181
sampler/scheduler?

Anonymous
08/07/24(Wed)11:53:36 No.101768356

>>101768341
euler beta, 20 steps

Anonymous
08/07/24(Wed)11:57:39 No.101768402

File: ComfyUI_02249_.png (1.22 MB, 1024x1024)

>>101768017

Flux has a good understanding of natural language. no more tags, no more autistic prompts.
80% of prompts have good hands, good text. Anime prompts tend to mess up hands, as far as I've seen personally

Few artstyle choices, no nude (yet)

Anonymous
08/07/24(Wed)11:58:16 No.101768408

>>101768402
>dagger glued on ass

Anonymous
08/07/24(Wed)11:58:25 No.101768413

When I download an "adetailer" pt file, how do I know if I should put it in models\ultralytics\bbox vs models\ultralytics\segm ?
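
One rough way to guess, assuming the file follows the common Ultralytics naming habit where segmentation weights carry a `-seg` suffix (for example `person_yolov8n-seg.pt` versus `face_yolov8n.pt`). The helper below is hypothetical and only looks at the name, so renamed or custom files can fool it:

```python
from pathlib import PurePath


def ultralytics_subfolder(filename):
    """Guess 'segm' or 'bbox' from the file name alone (naming-convention heuristic)."""
    stem = PurePath(filename).stem.lower()
    # Assumption: segmentation checkpoints are named like '<something>-seg.pt'.
    return "segm" if "-seg" in stem else "bbox"


for name in ("face_yolov8n.pt", "person_yolov8n-seg.pt"):
    print(name, "->", ultralytics_subfolder(name))
```

If the `ultralytics` package is installed, loading the file and checking what task the model reports is likely a more reliable test than the name.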

Anonymous
08/07/24(Wed)12:00:34 No.101768437

File: 2024-08-07_00297_.png (1.74 MB, 1024x1024)

>>101768326
kek

Anonymous
08/07/24(Wed)12:01:08 No.101768445

File: god-i-wish-that-was-me.gif (29 KB, 688x200)

>>101768408
God I wish that dagger were me

Anonymous
08/07/24(Wed)12:01:12 No.101768448

File: up_0002.jpg (782 KB, 3456x5120)

Anonymous
08/07/24(Wed)12:03:51 No.101768478

local is still behind dall-e 3 uh?

Anonymous
08/07/24(Wed)12:06:19 No.101768506

File: 2a9b0cf0de584c686b879fe13(...).jpg (605 KB, 896x1152)

>>101768237
Yes. The only problem is adding stuff the model has apparently never seen, which would seem to be lower body genitalia and not much else. Everything else is just limited by how much you feel like debugging a long prompt, which is where an LLM would come in handy. I might try taking some of my gens and feeding them to gpt or Claude or gemini (or all 3) and see how close their description is to the starting prompt. And then take their description and feed it back into flux to see how much meaning drift we're really talking about here.
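
The round trip described above (prompt, image, caption, compare) needs some way to score how far the caption drifted from the starting prompt. A very crude sketch using word overlap (Jaccard similarity); this is only a stand-in metric with made-up function names, and an embedding-based comparison would probably track meaning better:

```python
import re


def tokens(text):
    """Lowercased set of word-like tokens."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(prompt, caption):
    """Share of distinct words the two texts have in common (1.0 = same vocabulary)."""
    a, b = tokens(prompt), tokens(caption)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


print(jaccard("a red cat", "a blue cat"))  # 2 shared words out of 4 distinct
```

Running this on the original prompt and each model's description gives a rough number to compare across captioners.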

It could also be a matter of fine-tuning t5 itself as opposed to fine-tuning the diffuser. I haven't gone to the flux repo yet to look at how everything works under the hood, but there's probably some stuff from the language modeling side that can be done faster than the time it would take to retrain the image generator itself.

TLDR there's a lot you can do with transformers / attention is all you need / etc..

Anonymous
08/07/24(Wed)12:06:49 No.101768510

>>101768478
not on realistic pictures, flux destroys dalle in that department

Anonymous
08/07/24(Wed)12:08:57 No.101768536

since updating comfyui my vram gets flushed out every gen and has to reload every time
anybody else experiencing this issue with comfyui?

Anonymous
08/07/24(Wed)12:08:57 No.101768538

>>101768069
that's a really interesting theory, but schnell's quality isn't that great compared to dev, I hope that merge will take the best of both worlds though

Anonymous
08/07/24(Wed)12:13:03 No.101768586

>>101768478
On text no, Flux is actually better.
On artist recognition, style, even people and nsfw, DALLE is actually better (when the moderation endpoint and prompt rewriting don't make it impossible to show, of course; I mean the base model)

Anonymous
08/07/24(Wed)12:14:46 No.101768609

File: ComfyUI_temp_pzszu_00192_.png (1.25 MB, 1024x1024)

why did it give her that absolute dump truck

Anonymous
08/07/24(Wed)12:14:56 No.101768612

>>101768536
yup.. that it does, loves reading the model again and again every gen, no clue why it does that.

Anonymous
08/07/24(Wed)12:16:13 No.101768629

>>101768478
dalle has really shit AI grain too, you can tell DE3 gens from the pattern, Flux absolutely can make indistinguishable photorealistic gens

Anonymous
08/07/24(Wed)12:17:01 No.101768640

File: image_2024-08-07_181659424.png (1.25 MB, 1024x1024)

Anonymous
08/07/24(Wed)12:17:12 No.101768642

>>101768478
nope, saying that as a big freetard cope hater

Anonymous
08/07/24(Wed)12:19:58 No.101768664

>>101768181
GJ anon!

Anonymous
08/07/24(Wed)12:21:13 No.101768684

File: 2024-08-07_00311_.png (1.27 MB, 1024x1024)

Anonymous
08/07/24(Wed)12:21:53 No.101768697

>>101768510
>>101768629
that's by design, the API has the choice between vivid and natural and natural has much less of that dall-e 3 look we know

Anonymous
08/07/24(Wed)12:24:44 No.101768741

File: ComfyUI_temp_rvtvh_00026_.png (1.69 MB, 1024x1024)

>>101768478
I am not convinced it's behind.
There should be a competition: the best Flux prompter vs the best DALL-E prompter. 30 subjects, consisting of descriptions of complex scenes plus desired styles, 30 minutes.
Judges choose between the DALL-E gen and the Flux gen blindly.
The main problem is this >>101768629. You can tell DALL-E gens by their grain, so it won't be truly blind.
Let the rest of the models compete too.

Anonymous
08/07/24(Wed)12:25:41 No.101768757

>>101768612
i tried adding the --gpu-only tag and it made things worse
i have a 3060 but still, i was running fine before i updated.

Anonymous 08/07/24(Wed)12:27:15 No.101768774

File: flux_merge_fp8_00012_.jpg (483 KB, 896x896)

Anonymous 08/07/24(Wed)12:31:22 No.101768821

File: image_2024-08-07_183120014.png (1.18 MB, 1024x1024)

Anonymous 08/07/24(Wed)12:32:51 No.101768844

>>101768536
https://github.com/comfyanonymous/ComfyUI/commit/c14ac98fedd0176686d285d384abec5e4c0140c2
this commit is really good for a couple reasons, but if you are always hitting lowvram then there should be an -arg to disable it.

Anonymous 08/07/24(Wed)12:33:01 No.101768846

File: 2024-08-07_00314_.png (1.26 MB, 1024x1024)

>>101768181
good stuff, now extend it to cfg 2.0 .. kek just joking, interesting results

Anonymous 08/07/24(Wed)12:35:07 No.101768879

File: ComfyUI_01290_.png (1.77 MB, 1024x1024)

>>101768846
>interesting results
thanks o/

>now extend it to cfg 2.0 .. kek just joking
That's possible with DynamicThresholding yeah, here's what I've got with cfg = 3 + Guidance 3.5 for example

Anonymous 08/07/24(Wed)12:38:48 No.101768931

>>101767386
DoRA didn't learn multiple concepts as well as I wanted. A few test runs I made with small datasets and short training times were promising. I think with datasets closer to 1k images it's better to stay with normal LoRA, set lr to 1 with Prodigy and let Jesus take the wheel.

Anonymous 08/07/24(Wed)12:42:03 No.101768975

actually wouldn't it be better to check if the user even needs low vram after the flush? i have a xl workflow on 8gb that goes into lowvram on the last detailer because prior detailers don't unload from the model once they are done. that flush could push me out of vramlet hell for that last step

Anonymous 08/07/24(Wed)12:43:04 No.101768992

File: fs_0066.jpg (66 KB, 688x1280)

Anonymous 08/07/24(Wed)12:44:28 No.101769015

>>101768931
I can recommend giving locon a shot!

Anonymous 08/07/24(Wed)12:45:33 No.101769036

>>101768975
don't unload from the VRAM*

Anonymous 08/07/24(Wed)12:46:37 No.101769050

Is fp8 supposed to take just as long as fp16?

Anonymous 08/07/24(Wed)12:47:13 No.101769057

>>101769050
yeah, I didn't notice a speed increase when going for fp8

Anonymous 08/07/24(Wed)12:47:37 No.101769062

File: 00003-1790321666.jpg (351 KB, 1296x1728)

>>101769015
for sure

Anonymous 08/07/24(Wed)12:47:55 No.101769067

File: 2024-08-07_00331_.png (1.61 MB, 1024x1024)

>>101769050
yes, it's not faster, it just takes less vram

Anonymous 08/07/24(Wed)12:50:30 No.101769093

>>101764165
>this thread
i love how the prerequisite to generate images locally is to be a lonely incel lmao

Anonymous 08/07/24(Wed)12:50:47 No.101769100

>I updated

Should i just go back or is this a matter of lets say the new update writing over zluda somehow, and maybe i could just reinstall zluda?

Anonymous 08/07/24(Wed)12:51:56 No.101769118

>>101769057
>>101769067
ah ok, thought I fucked something up

Anonymous 08/07/24(Wed)12:52:11 No.101769124

Other merged flux :
https://huggingface.co/HaileyStorm/FLUX.1-Merges
https://huggingface.co/drbaph/FLUX.1-schnell-dev-merged-fp8-4step

Anonymous 08/07/24(Wed)12:52:56 No.101769136

>>101769050
Yeah, it's knock-off chinese "quantization" where weights are cast to fp8 then you lose precision casting back to fp16 for inference
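
A rough sketch of what that means in numbers, assuming the e4m3 fp8 layout (3 mantissa bits) and roughly 12 billion parameters for Flux; both are assumptions for illustration, and the helper names are made up. The rounding happens when the weights are stored as fp8; casting them back up to fp16 (10 mantissa bits) recovers nothing and the math still runs in fp16, which would explain why only memory changes and not speed. Exponent range and subnormals are ignored here.

```python
import math

def round_to_mantissa_bits(x, bits):
    """Round x to a float with `bits` explicit mantissa bits (range limits ignored)."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)            # x = m * 2**e with 0.5 <= |m| < 1
    scale = 2 ** (bits + 1)         # implicit leading bit + explicit bits
    return math.ldexp(round(m * scale) / scale, e)

def weight_bytes(n_params, bytes_per_param):
    """Storage needed for the weights alone."""
    return n_params * bytes_per_param

N_PARAMS = 12_000_000_000           # assumed approximate parameter count

w = 0.3
stored_fp8 = round_to_mantissa_bits(w, 3)             # precision is lost here
back_to_fp16 = round_to_mantissa_bits(stored_fp8, 10)  # upcast is exact

print("fp16 weights:", weight_bytes(N_PARAMS, 2) / 1e9, "GB")
print("fp8 weights: ", weight_bytes(N_PARAMS, 1) / 1e9, "GB")
print("0.3 stored as fp8:", stored_fp8, "error:", abs(w - stored_fp8))
```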

Anonymous 08/07/24(Wed)12:53:04 No.101769137

File: 3456768754876.png (3 KB, 478x76)

>>101769100
>Forgot pic

I am at a loss for words.

Anonymous 08/07/24(Wed)12:55:45 No.101769189

>>101769136
>Yeah, it's knock-off chinese "quantization"
so that means there's a way to make this quantization faster, right?

Anonymous 08/07/24(Wed)12:56:50 No.101769206

File: 2024-08-07_00337_.png (836 KB, 1024x1024)

>>101769100
>>101769137

Anonymous 08/07/24(Wed)13:04:15 No.101769321

File: Capture.jpg (1.16 MB, 3840x1793)

For those using DynamicThresholding with higher CFG, I'd recommend putting cfg_mode at either Half Cosine Up or Half Cosine Down, the others are too bright and make the picture too saturated

Anonymous 08/07/24(Wed)13:05:46 No.101769338

>>101768844
this is what i get for updating
it was so lit before, next seed, gen, next seed, gen, change prompt, gen, pipe to upscaler, gen, back to base sampler, gen
no loading/reloading

Anonymous 08/07/24(Wed)13:07:39 No.101769364

Why is the quality of gens in this general so low? Any place I check, discords, other boards, reddit or whatever, people gen much higher quality, funnier and cooler flux stuff. Here it's just... very boring.

Anonymous 08/07/24(Wed)13:07:48 No.101769370

>>101769321
thx (btw I think you mean too desaturated, high saturation == colorful, desaturated == bleached out)

Anonymous 08/07/24(Wed)13:10:30 No.101769411

>>101769370
yeah, my b. Btw I think I found the best combination, you put half cosine up for both cfg_mode and mimic_mode, that one is the closest to the original picture at CFG 1

Anonymous 08/07/24(Wed)13:11:36 No.101769429

File: up_0004.jpg (445 KB, 2752x5120)

Anonymous 08/07/24(Wed)13:16:03 No.101769493

File: 2024-08-07_00343_.png (1.42 MB, 1024x1024)

Anonymous 08/07/24(Wed)13:16:54 No.101769510

>>101769124
>still over 23GB
I'm guessing this wasn't supposed to help out vramlets.

Anonymous 08/07/24(Wed)13:17:27 No.101769519

>>101769510
a merge doesn't change the size of the architecture
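
A toy sketch of why, assuming the merge is a plain weighted average of two checkpoints with the same keys and shapes (the linked merges may use a different recipe; `merge_state_dicts` and the tiny dicts are made up for illustration). Every output tensor has the shape of its inputs, so the parameter count, and with it the file size at a given precision, stays the same.

```python
def merge_state_dicts(a, b, alpha=0.5):
    """Linear interpolation of two checkpoints: (1 - alpha) * a + alpha * b."""
    if a.keys() != b.keys():
        raise ValueError("checkpoints must share the same architecture")
    merged = {}
    for name in a:
        if len(a[name]) != len(b[name]):
            raise ValueError(f"shape mismatch in {name}")
        merged[name] = [(1 - alpha) * x + alpha * y
                        for x, y in zip(a[name], b[name])]
    return merged

def count_params(state_dict):
    """Total number of weights across all tensors."""
    return sum(len(values) for values in state_dict.values())

# Stand-ins for two checkpoints of the same architecture (flattened tensors).
dev = {"block.weight": [1.0, 2.0, 3.0], "block.bias": [0.5]}
schnell = {"block.weight": [3.0, 2.0, 1.0], "block.bias": [1.5]}

merged = merge_state_dicts(dev, schnell)
print(count_params(dev), count_params(schnell), count_params(merged))
```

Shrinking the file takes lower precision or a smaller architecture, not a merge.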

Anonymous 08/07/24(Wed)13:18:15 No.101769528

File: 2024-08-07_00349_.png (1.64 MB, 1024x1024)

I wish Cyberpunk 2077 had looked like this, not the plastic look it actually has.

Anonymous 08/07/24(Wed)13:20:03 No.101769567

File: 00005-1790321666.jpg (465 KB, 1296x1728)

Anonymous 08/07/24(Wed)13:21:12 No.101769586

>>101764165
Poorfag here. I've got 12 gb of VRAM, but only 16 gb of system ram. Can I fp8 Flux? Or am I flucked?

Anonymous 08/07/24(Wed)13:21:51 No.101769602

>/ldg/ reaching Popular Threads
we eatin' good

Anonymous 08/07/24(Wed)13:21:55 No.101769605

Text generation is good, but it's not good enough. Hopefully finetuning and lora can improve it.

Anonymous 08/07/24(Wed)13:22:10 No.101769610

maybe if you're on linux

Anonymous 08/07/24(Wed)13:23:34 No.101769631

File: 2024-08-07_00006_.png (1.39 MB, 1280x720)

>>101769586
with 32GB system ram it would have worked.. but t5xxl will gobble up about 20GB system ram .. you can still try with ram swapping to the SSD .. but probably you are fucked

Anonymous 08/07/24(Wed)13:26:45 No.101769679

>>101769631
Thanks. I'll see about getting another cheap ram stick.

Anonymous 08/07/24(Wed)13:26:52 No.101769682

>>101769602
anon loves imggen

Anonymous 08/07/24(Wed)13:27:29 No.101769690

>>101764165
Is SwarmUI a honeypot? During each gen it sends an outbound request to a google server, doesn't do that when not genning anything

Anonymous 08/07/24(Wed)13:29:57 No.101769717

File: 1698496938908263.png (1.29 MB, 768x1280)

Anonymous 08/07/24(Wed)13:31:18 No.101769738

File: 00006-2925565261.jpg (188 KB, 864x1152)

Anonymous 08/07/24(Wed)13:31:38 No.101769743

File: 1709197436921352.png (1.32 MB, 768x1280)

Anonymous 08/07/24(Wed)13:32:45 No.101769757

File: 2024-08-07_00358_.png (1.68 MB, 1024x1024)

>>101769690
what google server tho?

Anonymous 08/07/24(Wed)13:33:14 No.101769762

>>101769690
They probably have Google Analytics, could be sending your prompts or simply sending that you generated something with the model, resolution, etc.

Anonymous 08/07/24(Wed)13:33:23 No.101769765

File: 1713249963212218.png (1.24 MB, 768x1280)

Anonymous 08/07/24(Wed)13:34:15 No.101769778

>>101769762
You could probably look at the JavaScript code / install Google Tag Manager and see what it's sending.

Anonymous 08/07/24(Wed)13:35:33 No.101769799

File: test.png (3.62 MB, 3840x1713)

>>101769321
>>101769411
>I think I found the best combination, you put half cosine up for both cfg_mode and mimic_mode, that one is the closest to the original picture at CFG 1
It's starting to look really good, negative prompt is working and there isn't much burn at cfg = 3 with those new settings
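
For anyone wondering why the negative prompt only kicks in above CFG 1, here is a minimal sketch of the standard classifier-free guidance mix on toy numbers. `cfg_combine` and the two-element lists are made up for illustration, and the DynamicThresholding clamping itself is not shown.

```python
def cfg_combine(cond, uncond, scale):
    """Classifier-free guidance: uncond + scale * (cond - uncond), per element."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]

# Toy model outputs for the positive prompt and the negative prompt.
cond = [1.0, 2.0]
uncond = [0.0, 4.0]

print(cfg_combine(cond, uncond, 1.0))  # identical to cond: negative has no effect
print(cfg_combine(cond, uncond, 3.0))  # pushed away from the negative prediction
```

At scale 1 the negative term cancels out completely, and as the scale goes up the result overshoots further past the positive prediction, which is the burn that thresholding is meant to tame.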

Anonymous 08/07/24(Wed)13:36:08 No.101769806

>>101769757
No clue, some generic google IP, lookup says it's part of the 1e100.net
>>101769762
That'd be shitty
>>101769778
I'll try the Google Tag Manager then

Anonymous 08/07/24(Wed)13:38:38 No.101769841

File: 00007-2925565261.jpg (366 KB, 1296x1728)

Anonymous 08/07/24(Wed)13:40:02 No.101769863

File: 18972156710598672.png (5 KB, 941x34)

>>101769100
>>101769137
After re-following this guide
>https://github.com/CS1o/Stable-Diffusion-Info/wiki/Installation-Guides#amd-automatic1111-with-zluda

I get picrel... hiprtc0507.dll
Also it turns out the driver update wiped the PATHs for some fucking reason thanks windows 10 but thats an easy fix.

Hmmm not sure where to go from here.
ZLUDA says it SHOULD be compatible with AMD driver 24.7.1, and i even re-installed hip.

Anonymous 08/07/24(Wed)13:43:33 No.101769928

>>101768844
>>101769338
ok i loaded an old snapshot and full updated
things are working like they used to now.
im not even gonna question it

Anonymous 08/07/24(Wed)13:44:11 No.101769937

File: ComfyUI_243.png (1.01 MB, 832x1216)

>>101768478
Go ahead and write "nigger" in your dall-e prompt, I'll wait.

Anonymous 08/07/24(Wed)13:44:41 No.101769945

>>101768402
autistic prompts are what I live for

Anonymous 08/07/24(Wed)13:45:27 No.101769960

>>101769799
Where does dynamic thresholding go? Anywhere in the model pipeline?

Anonymous 08/07/24(Wed)13:46:22 No.101769974

>>101767843
good quality booba, and just the right amount of baked-ness, nice.

Anonymous 08/07/24(Wed)13:46:53 No.101769985

>>101768846
Is this flux? How get wet skin look thx

Anonymous 08/07/24(Wed)13:46:54 No.101769986

>>101769960
Here's a workflow: https://files.catbox.moe/haqdtd.png

Anonymous 08/07/24(Wed)13:48:02 No.101770006

>>101769743
Nice but blown out af

Anonymous 08/07/24(Wed)13:48:32 No.101770014

>>101769945
Don't worry, writing a wall of text (or delegating it to an LLM) is pretty autistic in itself.

Anonymous 08/07/24(Wed)13:49:36 No.101770033

>>101770014
I can't run this myself but I wonder how it would react to the prompt edit junk I love to do

Anonymous 08/07/24(Wed)13:49:56 No.101770037

File: 1713319906281180.png (1.37 MB, 768x1280)

>>101770006
yeah tru

Anonymous 08/07/24(Wed)13:50:56 No.101770052

>>101769937
this, fuck censorship, flux is dalle with better realism and complete freedom, a blessing in the sky

Anonymous 08/07/24(Wed)13:52:09 No.101770072

I'm curious about flux's capabilities. It's clear that it does single characters very well, but what about multiple characters interacting? Can somebody try to make two characters boxing in a boxing ring, each character with a different description and appearance?

Anonymous 08/07/24(Wed)13:52:14 No.101770076

File: 2024-08-07_00369_.png (1.55 MB, 1024x1024)

>>101769985
ya that's flux, there was
>The girl has wet blonde hair that clings to her body.
and
>She is swimming in a natural forest lake.
in the prompt .. with just the latter she was still dry, but with wet hair it also made wet skin

Anonymous 08/07/24(Wed)13:52:55 No.101770089

>>101770037
damn son this is more like it

Anonymous 08/07/24(Wed)13:53:46 No.101770100

File: file.png (1.64 MB, 1024x1024)

Anonymous 08/07/24(Wed)13:55:29 No.101770118

>>101770033
Gimme the prompt, I'll gen it.

Anonymous 08/07/24(Wed)13:57:41 No.101770152

File: ComfyUI_00026_.png (1.37 MB, 1024x1024)

Anonymous 08/07/24(Wed)13:58:18 No.101770162

File: file.png (1.34 MB, 1024x1024)

Anonymous 08/07/24(Wed)13:58:24 No.101770164

>>101769937
/pol/ is that way

Anonymous 08/07/24(Wed)14:04:49 No.101770259

File: 2024-08-07_00377_.png (1.45 MB, 1024x1024)

Anonymous 08/07/24(Wed)14:05:44 No.101770272

File: tmpmu2z9ryr.png (1.23 MB, 1024x1536)

>>101770118
>score_9,score_8_up,score_7_up, black theme, simple background, gray background,black slime orb, many red dots,red dots,red dots,humanoid shape,
[:tentacles,melting, body horror,arms,legs,arms,legs,arms,legs,:0.2]lovecraft,cthulhu mythos

I guess you can trim the ponyXL stuff and I don't know if the edit works
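
For reference, a simplified sketch of how the `[from:to:when]` edit in that prompt is usually read in A1111-style UIs: `from` is used until the switch point, then `to` takes over, and a `when` below 1 is a fraction of the total steps. `prompt_at_step` is a made-up helper that ignores nesting and the other bracket forms, and whether a Flux workflow honors this syntax at all is a separate question.

```python
import re

# Matches [from:to:when] with no nesting; `from` may be empty, as in [:to:0.2].
EDIT = re.compile(r"\[([^\[\]:]*):([^\[\]:]*):([0-9]*\.?[0-9]+)\]")

def prompt_at_step(prompt, step, total_steps):
    """Resolve prompt edits for a 1-based sampling step."""
    def pick(match):
        before, after, when = match.group(1), match.group(2), float(match.group(3))
        switch = when * total_steps if when < 1 else when
        return before if step <= switch else after
    return EDIT.sub(pick, prompt)

prompt = "black slime orb,[:tentacles,melting:0.2]lovecraft"
print(prompt_at_step(prompt, 4, 20))   # before the switch: edit text absent
print(prompt_at_step(prompt, 5, 20))   # after the switch: edit text present
```

Under this reading, the orb gets composed for the first 20% of the steps and the tentacles and body horror tags only join in after that.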

Anonymous 08/07/24(Wed)14:06:33 No.101770285

File: file.png (1.31 MB, 1024x1024)

Anonymous 08/07/24(Wed)14:07:30 No.101770294

File: 1709084090009747.png (1.45 MB, 768x1280)

Anonymous 08/07/24(Wed)14:09:04 No.101770321

File: file.png (1.11 MB, 1024x1024)

Anonymous 08/07/24(Wed)14:09:16 No.101770325

>>101769986
What does the VAE override do?

Anonymous 08/07/24(Wed)14:09:19 No.101770327

File: ComfyUI_00027_.png (1.17 MB, 1024x1024)

Anonymous 08/07/24(Wed)14:09:50 No.101770338

Straight from the oven...
>>101770020
>>101770020
>>101770020

Anonymous 08/07/24(Wed)14:10:14 No.101770341

>>101770325
it gives you the choice to put the VAE on another GPU if you have multiple GPUs, if you only have one you can delete that node and the CLIP override as well

Anonymous 08/07/24(Wed)14:11:04 No.101770360

>>101769806
Turns out it’s adetailer

Anonymous 08/07/24(Wed)14:16:58 No.101770472

>>101770360
>Turns out it’s adetailer
wtf

Anonymous 08/07/24(Wed)14:23:04 No.101770568

File: ComfyUI_30863_.png (1.24 MB, 2048x1024)

>>101770272
Left - verbatim, right - without "score_9,score_8_up,score_7_up" and without brackets/weights

Anonymous 08/07/24(Wed)14:24:53 No.101770599

>>101770568
>it turned into pixel art
Hah. Anyway, very cool.
Thanks anon

Anonymous 08/07/24(Wed)15:03:03 No.101771101

>>101770472
ADetailer makes calls to google cloud servers, supposedly to compare the local detection model with some in huggingface
https://github.com/Bing-su/adetailer/issues/163
