/g/ - Technology






File: long dick general.jpg (2.04 MB, 2465x3264)
Black Forest Pt. 3: Localchads Won Edition

Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>101674851

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
blessed thread of frenship
>>
File: 1722080365983147.jpg (86 KB, 740x1232)
Hail nigga forest . They cooked
>>
official pixart bigma, lumina 2 and hunyuan finetune waiting room, now with flux 12b fp8
>>
Anyone know what resolutions it's trained for?
>>
>>101678411
>now with flux 12b fp8
it's already here
https://huggingface.co/Kijai/flux-fp8/tree/main
>>
>>101678458
any it seems. even "long" pics in either direction work
>>
>>101678470
Still, I'm a bit suspicious it might be making the model a bit worse/dumber. I'd rather just gen at the proper resolutions.
>>
File: 806.png (1.05 MB, 1768x312)
>>101678458
all of them it seems
>>
>>101678465
For the retards here: what specs does this version run on?
>>
File: asasa.jpg (13 KB, 301x429)
What's the best scheduler for euler?
>>
>>101678529
a bit more than 12gb of vram
>>
>>101678518
I'm not sure it can go too far though, 2048x2048 gives duplication
>>
File: 8169372046.png (1.04 MB, 304x2008)
>>101678537
I think the only ones that work are simple and sgm_uniform; they're (practically?) identical
>>
>>101678271
I'm so sick of this ritual posting
are you d*bo??
>>
>>101678590
wow thats a crazy res
>>
File: file.png (1.32 MB, 1280x800)
>>
File: Capture.jpg (396 KB, 3087x1688)
>A worker gives coins to a pharmacist in the street, and a sign reads: "How ironic, don't you think?", anime style
Sometimes it has great prompt understanding, sometimes not; it lacks a bit of consistency
>>
>>101678632
boring reality btfo
>>
>>101678632
no need for boring reality
>>101678638
why only one clip
>>
File: Capture.jpg (35 KB, 922x628)
>>101678651
>why only one clip
what do you mean? I should put the text on the 2 of them?
>>
>>
File: 37.png (9 KB, 310x147)
>>101678669
>>
File: file.png (1.91 MB, 1280x800)
>>
File: Capture.jpg (339 KB, 3230x1485)
>>101678695
I already have that
>>
>>101678669
try it out and see if its better
>>
>>101678705
Crosswalk is almost perfect
>>
>>101678718
nothing changed kek
>>
>>101678529
>>101678554
Haven't tried the fp8 on disk version yet but have been loading with fp8 and running fine on 3080 10gb card (both dev and schnell). Might need --novram when launching Comfy.
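For reference, the memory flags just go on the launch command, something like this (a minimal example; flag names per comfy/cli_args.py, pick whichever your card needs):

python main.py --lowvram
python main.py --novram

Per the flag descriptions, --lowvram splits the unet into parts to use less vram and --novram is for when even that isn't enough.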
>>
File: file.png (1.65 MB, 1280x800)
>>
File: 824195.jpg (428 KB, 2144x1072)
>>101678711
shouldn't you put the prompt on both fields?
left: bottom field only
right: two fields
a man a woman holding hands
>>
File: 319.png (2.69 MB, 1072x1072)
>>101678757
top field only
LMAO
>>
File: file.png (1.2 MB, 1280x800)
>>
File: file.png (1.58 MB, 1280x800)
>>
>>
File: ComfyUI_00118_.jpg (972 KB, 3283x1504)
>>101678757
Holy fuck you're right... but that's retarded, why didn't he make a single text box that applies to both of them? it's annoying to have to copy-paste the prompt every time...
>>
File: file.png (1.7 MB, 1280x800)
>>
File: file.png (1.82 MB, 1280x800)
>>
Why does it load and unload for each gen? I have 54gb of ram and 24gb of vram and I'm running the DiT model on fp8
>>
File: 4645455630.png (18 KB, 742x172)
>>101678871
I use it like this
>>
>>101678902
yeah but you don't have the guidance scale on the classic CLIP Text Encode
>>
>>101678871
>>101678916
right click convert widget to input, then feed a text box into both
>>
>>
File: Capture.jpg (29 KB, 927x308)
>>101678923
which one?
>>
File: file.png (727 KB, 1280x800)
>>
>>101678554
Does comfy support CPU or Vulkan?
If so, anyone tried a gen on CPU? How many days did it take?
>>
File: Capture.jpg (124 KB, 2182x873)
>>101678923
that's it, it's good to go?
>>
>>101678299
ugly anime desu
>>
The only problems are the lack of knowledge of anime culture and pop culture, the mutant feet in some poses, and the censorship for NSFW. So the next step is to finetune the model; I think only the Pony guy and a couple more could do a proper finetune with their data.
>>
>>101678964
I tried cpu earlier with a 10900K and canceled. For 1x 1024x1024 it showed 80 minutes.
>>
File: file.png (499 KB, 1280x800)
>>
>>101678991
we've been waiting on that next step for quite some time now. prepare to wait quite some time longer
>>
>>101679009
Sonichu?
>>
File: Capture.jpg (137 KB, 1844x1336)
>>101678923
something like this?
>>
>>101679070
good job anon you figured it out
>>
>>101679075
It gives me different outputs though, so I'm not sure it's the right way of dealing with it
>>
File: Capture.jpg (295 KB, 2788x1526)
>>101678901
Yeah, that's twice as long because of the loading shit, how to fix that??
>>
File: file.png (633 KB, 1280x800)
>>
>>101678997
I'm a diffusion pleb. Are they compute bound or memory bandwidth bound (like LLMs)?
Trying to work out if efficiently leveraging an AMD APU could accelerate generation significantly or not.
>>
File: Capture.jpg (324 KB, 3248x1622)
For those who want those nodes (it has the negative prompt + guidance scale + a single text box for both clip and t5xxl), you can get the metadata here:
https://files.catbox.moe/imf60c.png
>>
>>
>>101678250
Holy shit I took a break while waiting for Bigma and what the fuck just happened here bros, did the Dalle weights just drop?
>>
>>101679295
Flux dropped and it's what SD3 should've been. Upper half uncensored and really good quality and prompt comprehension.
>>
>prompt goth
>instantly turns into sloppastyle
grim, but perhaps deserved
>>
>>101679172
all this anon knows is AMD has pisspoor support
srry imggen bros are mostly retarded
>>
File: ComfyUI_temp_cdifl_00027_.png (2.27 MB, 2400x1600)
>https://github.com/comfyanonymous/ComfyUI/commits/master/
>Hack to make all resolutions work on Flux models.

So I updated Comfy to get this commit from an hour ago, and now Flux can directly generate coherent 2400x1600 images apparently. Probably even higher, though I haven't tried yet. What the fuck?

Even the hands are perfect which is INSANE at this res, any other model you'd be lucky to get a coherent face at 2K without hiresfix let alone hands.
>>
>>101679317
>Flux dropped and it's what SD3 should've been.
this, I wish it wasn't a 12b model though, that's too big, 10b would've been the sweet spot
>>
>>101679295
>did the Dalle weights just drop?
It sure does look like it doesn't it
>>
>>101679222
Thanks anon, much appreciated :)
>>
File: ComfyUI_00133_.png (3.1 MB, 2048x2048)
>>101679368
Oh nice, I just made a 2048x2048 res output, it's only asking me for 15gb of vram for the fp8 DiT model, that's cool to see how much we've improved in a single day kek
>>
So going forward I guess we're only going to get models that are great at coherence but have a slopped sovlless style and can't imitate even public domain artists very well? Kind of sad desu
aesthetics were always more important to me than coherence, and it seems like we're going backwards on aesthetics
>>
>some poor anon still using 1.5
>>
i really hope this model gains traction. it's getting a lot of attention but it's still so expensive to finetune. hopefully some people come along thinking "now's our chance!" and finally get cooking. this really does feel like local's dall-e moment, it's a big step up in a lot of ways
>>
>>101679368
>>101679448
>dalle3 at home made high res obsolete
that's cool to be able to render high res images like that, it will help for details that's for sure
>>
File: file.png (1.15 MB, 1280x800)
>nogen doomposter is upset he has to describe images with words instead of using artist names
>>
>>101679368
Just got into Comfy earlier today. How do you update it? Does a simple git pull just work or are there any other commands I should do?
>>
>>101679490
holy fucking cope
>>
File: 1706405090986084.png (544 KB, 512x768)
>>
>>101679524
Say sike
>>
>>101679514
just use the "Direct link to download" build; in that zip you have an updater .bat ready to be used
https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file
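and if you went with a git clone instead of the standalone zip, the manual equivalent should just be the usual (run from the ComfyUI folder, in whatever env you launch it with; the requirements reinstall is in case they changed):

git pull
pip install -r requirements.txt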
>>
File: fluxDiffAttempt.png (39 KB, 600x851)
>>101677984
https://github.com/motexture/FluxDiff

if you get an error with fbgemm.dll either dl random libomp140.x86_64.dll on pytorch forums or get it from comfy venv

by default it'll download 16.8gb model + vae as .bin [.cache\huggingface\hub\models--motexture--FluxDiff\snapshots]
and then it'll expect a .safetensors (either fix it in code or just duplicate and rename, otherwise it'll redownload .bin again)

tested on 12g vram, you can try to cast/offload

also, reforge dev is working on implementing flux image model
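if you'd rather script the duplicate-and-rename workaround, something like this should do it (untested sketch, assumes the default HF cache path from above):

import shutil
from pathlib import Path

# snapshot dir FluxDiff downloads into, per the post above
snaps = Path.home() / ".cache/huggingface/hub/models--motexture--FluxDiff/snapshots"
for bin_file in snaps.glob("*/*.bin"):
    target = bin_file.with_suffix(".safetensors")
    if not target.exists():
        # duplicate instead of renaming so it doesn't redownload the .bin
        shutil.copy2(bin_file, target)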
>>
flux inpainting model when
>>
>>101679447
it's a pleasure o/
>>
File: file.png (927 KB, 1280x800)
>doomposter thinks I'm talking about him
>>
fp8 feels pretty good, what i was hoping for when i first tried the model. no unloading bullshit
>>
>>101679531
I use Linux...
>>
File: tadpoles.png (1.36 MB, 1024x1024)
>A bunch of tadpoles swimming in a pond
bake. again
>>
>>101679553
>no unloading bullshit
you also have that anon? is that a bug or something? that makes me so annoyed... >>101679126
>>
>>
>>101679070
How do you get the Prompt box?
When I search, I just see CLIP Text Encode (Prompt) show up
>>
>>101679618
just use this metadata you'll get everything, and I double clicked on the dot near t5xxl to make the prompt text appear >>101679222
>>
>>101679573
I had it with 24gb vram, it seems the fp16 is just slightly over. One of the nodes should have an option to switch to fp8_e4 or some shit in the model loading node; that allowed me to fit it comfortably without it reloading every prompt. If that isn't enough and you have less vram then idk, there might be another solution, i think i saw some shit about 12gb vram somewhere
>>
Any way to load the text encoder on one gpu and the image model on another for flux? I'm capping out with 24GB of VRAM.
>>
File: ComfyUI_00136_.jpg (696 KB, 2692x1488)
>>101679448
>>101679368
you lose a lot of prompt understanding if you go too far though, I guess it works great when it's a simple scene
>A concert with Donald Trump as bassist and Hatsune Miku as singer, the audience is ecstatic and all raise their hands to the sky.
>>
File: file.png (1.41 MB, 1280x800)
>>
>>101679542
Appreciate the effort, but all I could get were wonky frames like a sketchy cartoon style even with realism. Not a lot of motion.
>>
i really like that one
>>
>>101679644
I have this loading -> unloading shit for 24gb vram + 56gb ram and fp8 DiT + fp16 text encoder :(
>>
>>101679542
wait, we can already use their text to video model locally?
>>
>>101679638
Based, ty anon I was using the comfy example and there was no negative prompt on there.
>>
>>101679435
Just looking at pics in this thread, Dalle 3 is obsolete and so is MJ v6.1, this model very much looks like what ClosedAI was planning to release with GPT 4o (which never came out), this is a massive win for local.
>>
>>101679761
Issue is that it's so big 99% of people are not going to be able to train it.
>>
sieg heil danke deutscher mann
>>
File: file.png (1.25 MB, 1280x800)
>>
>>101679750
That's because this model is supposed to work with only cfg = 1, and cfg = 1 means you can't use a negative prompt. It works at higher cfg though, just be careful not to fry your picture >>101679669
>>
File: ComfyUI_00007_.png (853 KB, 1024x1024)
SAI bros... not like this...
>>
>>101679761
>so is MJ v6.1
I would like this to be true but it doesn't have anywhere near the art aesthetics or style understanding of MJ 6.1
Amazing coherence but yeah, the style and soul are just not there for art
>>
File: ComfyUI_00017_.png (945 KB, 1024x1024)
Nice night for a walk
>>
>>101679808
For realistic shit this model is easily API level, only MJ is better and not by much, we're so back
>>
>>101679783
At the end of the day the question is how trainable it is. If it's like SDXL then it's a problem. If it's like Sigma then it's not. SDXL was extremely slow and took forever to figure out new concepts. Pixart is relatively fast and learned new concepts fairly quickly. If the only problem is renting an 80 GB GPU then people will swallow it if they get their money's worth in a day or week.
>>
File: file.png (337 KB, 1280x800)
>>
>>101679722
This is something else, the name is just a coincidence I think. Flux=Flow, etc
>>
Can flux do coom yet?
>>
>>101679860
Not great, needs to be finetuned
https://files.catbox.moe/3pbilx.jpg
>>
>>101679860
it can do tasteful PG-13 / R-rated coom out of the box
>>
File: ComfyUI_00009_.png (1.45 MB, 1920x1080)
>1920x1080 works
We eating good tonight chads
Good for them to release over the weekend too
>>
>>101679860
Yes it can
https://files.catbox.moe/b09u2v.png
>>
>>101679803
That's super useful anon much obliged. What cfg do you recommend for this? Default (3.5)?
>>
>>101679885
>>101679871
>>101679873
Guess it's time to reinstall, hope the macbook can take it.
>>
>>101677984
literally malware
>>
>>101679871
>Not great
you're joking? the anatomy is almost perfect, this will be a blast to finetune
>>
File: file.png (1.43 MB, 1280x800)
I know there's going to be some gems in this dataset.
>>
>>101679892
>What cfg do you recommend for this? Default (3.5)?
You're talking about the guidance scale, that's not the CFG; if you want that it's in this metadata >>101679222
Even on the API they put CFG = 1, that's why they didn't display a negative prompt.

And desu the cfg fries the model really quickly; I'd go for the lowest value, 1.1, so that you can still get a great picture + be able to use the negative prompt
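if anyone wants the diffusers equivalent instead of Comfy nodes, the distilled guidance is just the guidance_scale argument there. A minimal sketch based on the model card usage (not a drop-in for the workflow above):

import torch
from diffusers import FluxPipeline

# load the dev checkpoint; bf16 to keep memory sane
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # offload parts to RAM to fit consumer vram

image = pipe(
    "a photo of a forest clearing",
    guidance_scale=3.5,        # distilled guidance, not true CFG
    num_inference_steps=50,
    generator=torch.Generator("cpu").manual_seed(0),
).images[0]
image.save("flux_out.png")

note there's no negative prompt argument here at all, which matches the cfg = 1 behavior; the negative prompt trick only applies in UIs that run real CFG on top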
>>
>>101679172
I believe that diffusion is more compute bound than LLMs. Diffusion uses a few slow evaluations (~steps), while LLMs require lots of fast evaluations (one for each token ~ word).
>>
Anyone else getting unbelievably slow gens for flux? It's only using 14GB/24GB so that's not the issue, but 20 steps takes 10+ mins
>>
>>101679929
You're definitely on the CPU. On a 4090 Schnell was 20s and Dev was 50s
>>
>>101679885
Should have known the second I saw no "Safety" section in the release
>>
>>101679929
yeah that's an issue, for me the inference takes 30-40 seconds, but the problem is the loading -> unloading, why the fuck does it do that? :(
>>
>>101679871
Topless seems to work fine.
Bottomless on the other hand is proving challenging. For example:
>>101679885
Note the one on the right, model can be situationally prude.
>>
>>101679959
>Note the one on the right, model can be situationally prude.
yeah, desu it will be easily fixed with a finetune, the model doesn't seem to be that brainwashed compared to SDXL for example
>>
>>101679458
>So going forward I guess we're only going to get models that are great at coherence but have a slopped sovlless style
patience is a virtue
>>
after testing a whole afternoon I think Kolors (with automated LLM prompt translation into chinese) is much, much better than Flux for art

but Flux is going to be the new gold standard for photography and memes
>>
File: file.png (1.53 MB, 1280x800)
>>101679959
There are little to no genitals in the dataset and they certainly didn't train on those words. So if you want them you're going to have to dig deep and be clever. It's almost smart enough that you can poorman generate them by description.
>>
>>101679919
kek, nice gen anon
>>
returning oldfag here
haven't been into imagegen for a couple years, what's the best way to prompt nowadays? i'm more used to the "tag-style" prompts like "a beautiful white woman, blonde hair, blue eyes, cinematic, studio lighting, hyperrealistic, 4k uhd, award winning, kodak film" etc.
but a lot of the examples i see on newer models have full sentences, especially on "demo prompts" that often have a hilariously large amount of adjectives and fluff like "A stunning and beautiful white woman stands in the dramatic, breathtaking, pronounced cinematic lighting. Her thought-provoking expression stands in stark contrast to the plain background - an enchantingly magical pure white."
whereas some of the prompts i've seen here are more normal but still have a more "sentence-style" structure
do modern models work best with natural language sentences or is the tag style still the best method?
>>
>>101679999
Use both
>>
File: Capture.jpg (153 KB, 2796x737)
When I simply change seed, there's no unload -> reload, but when I change the prompt, the unload -> reload starts again, hmm...
>>
>>101680021
Prompt has to be encoded which is a different model. Seed just changes the color pattern the generation starts with.
>>
>(SUSPENDED:1.2)
>>
>>101679973
Agreed with everything else this model really shines, it's the last missing piece
>>101679925
Holy shit the negatives work now! Ty anon
>>
lol the devs fucked it up so bad, really how can they release the model? they will get the shitstorm of the decade in a week
>>
>>101680021
maybe when it's reusing the same prompt it just keeps the saved clip embeds that it generated earlier and gets rid of the model weights
so when the prompt changes it needs to load the clip weights again
>>
>>101680045
still, it's not normal to have this constant loading -> unloading, some anons don't have this issue though
>>
File: RTX 6090 Ti.png (1.06 MB, 768x1216)
Are you ready?
>>
>>101680053
What's wrong with it?
>>
>>101680060
omg 26.5 gb vram
sugoi!!!
>>
>>
File: ComfyUI_00015_.png (1.72 MB, 1920x1080)
Model is b*sed
>>
huh, crazy how good it is at text. better than proprietary models
>>
>>101679959
Bottomless is mostly featureless it seems, when it works
https://files.catbox.moe/lk0zai.png
>>
>>101680092
omg it looks great, maybe the nipples are a bit weird though
>>
>>101680021
Been a while since i used Comfy, but doesn't it have some strategy to save memory by unloading models? You may be hitting your memory limit. There was an option to not unload models from memory i think, but you may swap or just OOM instead of reload.
>>
File: file.png (1.2 MB, 1280x800)
>>101680053
SAI survived SD 1.5. With the Olympics and the election Twitter won't even have time to stir up the Staceys to hand wring about consensual images. Also it barely knows celebrities so you're not going to see much on that front. Actually shocked it knows politicians.
>>
>>101680061
it can generate cunny apparently
>>
>>101680092
This seems like something that could actually be fixed by loras, it isn't deliberately ruined.
>>
File: Capture.jpg (341 KB, 2675x1774)
>>101680110
No I'm not hitting any limit, I still have room to spare and it just wants to unload for some reason
>>
>>101680122
It can't >>101680092
>>
>>101679938
Bizarre, both my CPU and GPU are pinned
Not sure what's going on
>>
>>101680114
>With the Olympics and the election Twitter won't even have time to stir up the Staceys to hand wring about consensual images.
that's so true, it's the perfect moment to release that model
>>
File: 1704386294103.jpg (249 KB, 1024x1024)
How do we deal with the existential dilemma of images that are close to perfect (with respect to personal style choices), but just slightly flawed, yet correcting the flaw would take 20x more effort than simply generating 100 new images?

I just feel such a constant strange disconnect. I make a hundred images, each of which is flawed in some tiny way so that none are perfect, and none can be really chosen as "the best". They're all just different interpretations of an idea. I really wonder if this is going to cause some kind of psychological damage.
>>
>>101680155
You're playing with a gacha machine, just go with the flow.
>>
>>101680129
I think i found the option:
>https://github.com/comfyanonymous/ComfyUI/blob/master/comfy/cli_args.py#L117
--disable-smart-memory
Give it a go anyway.
Can comfy normally split the model into multiple cards?
>>
>>101680155
nice style, what was the model and prompt to get this style?
>>
>>101680173
>--disable-smart-memory
trying this now, praying multi-card support exists because i have a handful of 12ish gig ones and im kicking myself for not just getting a 3090
>>
>>101680178
dalle
>>
tryna generate trump punching hillary in the face during a boxing match, but it keeps making them friendly
>>
>>101680173
Now it frees up the VRAM when the gen is over, it's worse lol
>>
>>101680222
>but it keeps making them friendly
I noticed this too when trying to make a gigachad laughing and pointing at a fat whore
It just made them buddy buddy
>>
File: 1.png (2.2 MB, 1232x856)
>>
>>101680226
Kek.. sorry about that. Have you ever genned with comfy and seen it use more than one card? Can it actually use more than one? it's a 24gb model...
>>
>>
>>101680222
getting closer
really impressive likenesses and body coherence either way
>>
>>101680255
I'm on a fp8 mode, so it only uses 12gb of vram, that's enough for my 3090, and Comfy doesn't have a multigpu feature so the 3060 I have is just sleeping
>>
File: Capture.jpg (315 KB, 2384x1566)
It's sad this model doesn't know much trivia about styles, I asked it to make a ps1 render and it completely ignored that, even an aesthetic like that is ignored
>>
File: 2414586.png (1.13 MB, 768x768)
>>101680286
yeah, those models are too tuned for "aesthetics" to appeal to people
>>
>>101680267
It still needs some extra buffers in memory to keep the latent and do some work.
i also found this:
>vram_group.add_argument("--highvram", action="store_true", help="By default models will be unloaded to CPU memory after being used. This option keeps them in GPU memory.")
--highvram. May as well give it a go. There's also --gpu-only, but i don't know how they interact with each other.
>>
>>101680136
>>101680122
It can generate photorealistic children with nipples, there will be a shitshow about this.
>>
>>101680328
I don't think anyone cares about that anymore
>>
>>101680319
I tried highvram, it forced the gpu to take both the image model + the text encoder into the vram, of course that's impossible that's over 24gb
>>
>>101680335
lol
LMAO
>>
>>101680346
People care about
>OMG [THING] CAN DO [NEW THING]
They already think models can do this.
>>
File: 98377712.png (734 KB, 735x873)
>>101678250
damn buddy...
>>
>>101680353
All it takes is one random person to push it into the public consciousness and all the normies will be out for blood
>>
when are we going to get a model that can actually UNDERSTAND the prompts like LLMs can?
like if i type in "a table but deflated like a balloon" i want to see a table deflated like a balloon but instead it just gives me normal tables ("table" is an example, not the actual prompt i tried)
the model can't seem to understand some foreign concepts like that, while LLMs can more or less easily grasp the concepts easily
>>
>>101680328
Why are you pretending that the SD models can't do that since 2022?
>>
>>101680367
That is true of literally everything, this is old news. Unless someone (You?) decides to push it hard, nobody cares about AI slop anymore.
>>
>>101680374
>when are we going to get a model that can actually UNDERSTAND the prompts like LLMs can?
for that, we'll need better models than t5xxl and go for llama9b for example
>>
>>101680374
LLMs don't understand stuff like that either
>>
>>101680328
Mindblowing revelation: If you train a model on X and Y, it can generate X+Y.
There is no cure for this short of shortcircuiting the laws of logic and physical reality (which some people try).
>>
File: 36.png (1.16 MB, 336x1904)
>>101680374
Give a research team 1 billion dollars and they will do it for you in 2 weeks
>>
>>101680414
lmao
>>
>>101680344
>I'm on a fp8 mode
>Comfy doesn't have a multigpu feature
>of course that's impossible that's over 24gb
You're trying to do the impossible, then. I expected fp8 would ~halve the size. Even if all the models end up being 12gb or a little under, the thing needs some working memory to do the work.
>>
>>101680414
impressive, very nice
>>
Does 3 mins an image sound about right for a p40?
>>
>>101680431
but why does it free some memory when doing a new gen? I have enough room to spare, it's not like a new gen will ask for more memory than the first one >>101680129
>>
>>101680414
im crying
>>
>>101680092
this could be fixed with inpainting using normal pony model i guess, a bit inconvenient but better than a kick in the teeth.
>>
File: 5.png (1.62 MB, 1560x520)
>>
>>101680129

Maybe you're getting screwed by Nvidia driver memory management. I'd rather OOM than have the Nvidia driver screw things up. Look up how to disable it or roll back to before Nvidia added that feature.

https://nvidia.custhelp.com/app/answers/detail/a_id/5490/~/system-memory-fallback-for-stable-diffusion
>>
>>
File: ComfyUI_08693_.png (883 KB, 1368x768)
i fucking hate my life
>>
>>101680222
success
can't get hillary to look as beaten up as I'd like, but I'm declaring victory on this prompt
>>
File: ComfyUI_00154_.jpg (635 KB, 3072x1449)
Don't settle for 20 steps, that's not enough
>>
What kind of optimizations did Replicate or the black forest people do so that schnell literally takes 1 second to generate on Replicate? Dev is about 15 seconds.
>>
>>101680466
miku would never say that >>101678159
>>
>>101680499
>low step count doesnt look ass
Insane
>>
>>101680482
I'm sure that's a bug in ComfyUI, there's already an issue about it, and memory fallback isn't likely to be the culprit; it also unloads on the RAM side
https://github.com/comfyanonymous/ComfyUI/issues/2046
>>
>>101680508
20 really isn't that low for any model
>>
>>101680499
yeah any interesting or weird composition seems to benefit from cranking steps to 50

unfortunate since that means 75 seconds per image even on my 3090 in fp8
>>
whats the token count for it tho
>>
>>101680519
same, it's kinda slow but meh, quality > quantity, always
>>
>>101680129
to add to what >>101680482 said you can run
python -c "import sys; print(sys.executable)"
on cmd to get your system python path
>>
File: Image.jpg (2.6 MB, 1920x2176)
>>
>>101680496
2016 was two decades ago
>>
>>101680552
yet trump will beat up another woman for the election kek
>>
>Back to several mins per generation for larger images
Just like the good ol' days
>>
File: ComfyUI_00157_.png (1.22 MB, 1024x1024)
>>101680466
>>
>>101680563
weird
>>
>>101680592
forced meme
>>
>>101678991
Training flux finetunes is a different beast from standard SD and SDXL.
Also the smallest flux model (which is Apache 2.0) is not even that good. Sure it's better than SD 3.0 medium, but it's not good enough to dedicate tens of thousands up to hundreds of thousands of dollars to finetuning right off the bat.
Anything that prohibits commercial use is out of the question for basically every single "big" finetuner. No one will drop that amount of cash on a model they can't monetize in any way.
Pixart Sigma, HunyuanDiT, Lumina and Flux "schnell" allow commercial usage.
Kwai-Kolors and Flux "dev" do not, yet those two are the best out of the box in many aspects.

Kwai-Kolors has the best anatomy understanding; poses, hands and feet are good. It has a nice and crisp style. Negatives for it are Chinese prompting, bad NSFW out of the box, horrible anime quality, and a lack of styles and concepts.
Flux "dev" is the best model overall and has superior prompt understanding and text generation. Biggest negatives are the size of the model and the inability to train loras locally (not even an RTX 5090 will cut it).
>>
FUCK YOU REPLICATE
MY PROMPTS AREN'T NSFW
>>
>>101680576
kekd
>>
>>101680595
cope weirdo
>>
File: 38763.png (2.14 MB, 1024x1024)
>>
>>101678682
>>101678818
>>101678934
>>101679226
>>101679599
>>101679982
>>101680048
>>101680075
>>101680447
>>101680491
>>101680538
Got a catbox for any of these? Really dig the style, curious how to get these kinds of results and what the general setup would look like.
>>
>>101680592
letting a man beat up a woman in the olympics is pretty fucking weird, but when trump does it he's just taking out the trash so it's fine
>>
>>101680602
Use it through the api or telegram glowie bot that apparently some anon put up @imgfun_bot
>>
>>101680596
>Also the smallest flux model (which is Apache 2.0) is not even that good. Sure it's better than SD 3.0 medium, but it's not good enough to dedicate tens of thousands up to hundreds of thousands of dollars to finetuning right off the bat.
>Anything that prohibits commercial use is out of the question for basically every single "big" finetuner. No one will drop that amount of cash on a model they can't monetize in any way.
I'm sure a big finetune on schnell can beat flux dev, so why not go that path yeah

>Kwai-Kolors has the best anatomy understanding; poses, hands and feet are good. It has a nice and crisp style. Negatives for it are Chinese prompting, bad NSFW out of the box, horrible anime quality, and a lack of styles and concepts.
And it's not a DiT model, that makes it obsolete from the start
>>
>>101680552
I'm not even american I just think the mental image of them having a ring fight is funny

Another thing that's impressive about Flux is what it infers about body types
Like Hillary's body here is soft around the midsection like an old person's tends to be, the kind of body you'd expect a woman her age to have
if you tried to do this with SDXL she'd just have a boxer's body
>>
>>101680630
forgot the image like a retard
>>
>>101680592
the zoomer has received his new programming
>>
>>101680610
this stuff doesn't work on 4chan man
>>
>>101680618
https://files.catbox.moe/z5hiho.png
>>
File: 254344.png (1.69 MB, 1024x1024)
>naruto looks like a anime girl even when male is the first word in the prompt
waow
>>
>>101680624
>confusing women for men isnt weird
if you say so

>>101680652
why are you weirdos replying to me if it doesn't work. you're compelled to declare how not weird you are, only to attach incredible weirdness to it
>I'm not weird but let me tell you how much I think about trans people!!!!
weird
>>
>>101680655
looks like naruto if he was drawn on some shotacon doujin kek
>>
>>101680596
In LLMs there are many people finetuning 70B and 120B parameter models with licenses similar to flux dev; I think it's just a question of time until some rich nigger with 5 or more H100 NVLs trains a good finetune. As I said, the Ponyfag could be the one, since the license doesn't forbid monetizing with donations.
>>
File: le-sad.jpg (27 KB, 500x346)
I have 12GB VRAM but only 16GB system RAM till the end of the month, then I can upgrade to a max of only 32GB of system RAM. But I do have an entire 250GB SSD dedicated to swap with discard enabled. Will it run or nah?
>>
>>101680667
anon, I'm afraid I must again draw your attention to the fact that you are on 4chan
>>
>>101680674
>But I do have an entire 250GB SSD dedicated to swap
Why?
>>
>>
>>101680685
because it prevents the system from hanging when i run out of ram? Anyway it helps if you have a low ram system and are doing stable diffusion.
>>
is flux 8b that much worse than 16b?
>>
>>101680685
go on say something dumb about how i should care about wearing out the SSD that i paid £15 for second hand...
>>
>a child's crayon drawing of a house
sovl
>>
>>101680712
No I was going to say most recommend the max amount of swap being double system RAM. 250GB sounds like so much.
>>
what are best prompts for that amateur photo look
>>
File: ComfyUI_01158_.png (1.39 MB, 1024x1024)
>>101680642
hey anon this is actually kind of fun
>>
>>101680740
kek nice
this one is really technically impressive since other models generally can't do upside down people without mangling them
>>
>>101680740
oh also, tip: starting the prompt with "espn footage of" seems to be better for getting that slightly lofi television camera look
>>
>>101680725
it's not much, I've seen it use up to 100 GB of swap when I did a video through animatediff that was about 2 minutes long. If the swap wasn't that big it would have failed for sure; also one time my GPU crashed, and when checking journalctl the last thing that happened was memory pressure flushing caches, then seconds later the GPU died.

So yeah, if you have memory issues with stable diffusion try increasing your swap file/partition. It works because there is more virtual memory available, albeit slower.
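on linux growing swap is just a few commands, something like this (example size, adjust to your disk):

sudo fallocate -l 64G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile

and add it to /etc/fstab if you want it to survive reboots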
>>
File: Image.jpg (1.18 MB, 1920x1088)
>>
>>101680642
>>101680740
kek
>>
File: ComfyUI_01159_.png (1 MB, 1024x1024)
>>101680757
anon i kneel, thanks.
>>
>>101680762
>its not much, i've saw it used up to 100 GB swap when i done a video through animatediff that was about 2 minutes long
You might just wanna ask someone for a short term loan so you can get that ram ASAP, your hard drives are taking a beating
>>
>>101680725
>most recommend the max amount of swap being double system RAM.
this is just general copypasta from every know-it-all online since like forever. You can have as much swap space as you like. The general assumption of double your ram is no different than general assumptions about how big a boot partition should be; it will always depend on what you actually plan on doing and how much you will actually need.

>>101680775
nah i can wait. I don't do loans.
>>
I can't seem to make a girl stab a beast and there be blood and gore. Also can't make her give a middle finger while holding a can of pepsi. Sadly, that means it's censored, unlike Dalle. Still, this is an interesting model.
>>
>some rich nigger with 5 or more H100 NVLs trains a good finetune
It takes way more effort than just being rich. You need a good dataset. You need to curate that dataset. You need to label that dataset. Then you need to know what the fuck you are doing too.

https://www.reddit.com/r/StableDiffusion/comments/1dbasvx/the_gory_details_of_finetuning_sdxl_for_30m/
>>
>>101680801
from what dalle red team testers said the dataset wasn't actually censored at all and during the testing phase they could generate extreme gore and disturbing shit
the 'safety' is all in having GPT-4 act as the middleman between you and the API and cockblock disallowed prompts, without that the model is actually capable of really dark stuff
I guess that's something you can do when you're closed source and not sharing the weights
>>
>>101680826
Sad. There is nothing wrong with being able to gen extreme gore and disturbing shit as long as it's not super photorealistic and just 80s movie or anime style.
>>
>>101680817
/lmg/ here, first time?
>>
>>101680724
>>
>>101680826
Yes, and on Azure you can disable the NSFW and prompt filters (only the basic filter remains) and see the raw dalle3 dataset power: https://catbox.moe/c/lfnwjt
>>
>>
>>101680826
That is the "secret sauce". You train it with everything and then you just perform post-filtering with a multimodal vision model that looks at the prompt+image and estimates the level of "harm". Then you just set a cutoff point for what level of "harm" you tolerate and call it a day.

If you go back to the time before the SD 2.0 release, you can read Emad's and other SD employees' messages or listen to the public discord calls (still on youtube I think).
They were grappling with the issue of CP. The issue was that if a model is capable of doing nudity and also children, then it is always capable of combining those into nude children, even if the training data has never seen a nude child. That is the only reason they and everyone else prune all nudity from the dataset for models that they give out.
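The filtering logic itself is trivial, something like this (a toy sketch of the idea only; the scorer is a hypothetical stand-in, nobody outside those labs knows the actual classifier):

from PIL import Image

HARM_CUTOFF = 0.5  # wherever the provider decides to draw the line

def harm_score(prompt: str, image: Image.Image) -> float:
    # stand-in for the multimodal vision model that rates prompt+image
    return 0.0  # stub

def moderate(prompt: str, image: Image.Image):
    # train on everything, filter after the fact
    if harm_score(prompt, image) >= HARM_CUTOFF:
        return None  # blocked, the user never sees it
    return image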
>>
>>101680852
cute
looks like it's good at the crayon style, has a nice texture and doesn't feel slopped like the digital art style it tries to do
>>
>>101680859
you have the prompts for any of these? curious how they'd transfer to flux
>>
>>101680866
Anon, you didn't get it, the prompt for ALL of those is literally just DeviantArt + a jailbreak so that the API doesn't rewrite it; it just shows how depraved unchained dalle is, and deviantart specifically is represented like that in their dataset.
>>
>>101680863
Hopefully eventually one of the real models will leak so we can finally have a not-shit local model
>>
similar to the anons' findings above with crayon prompts, using "beatrix potter drawing of" seems to produce a hand drawn looking art style
doesn't really look like beatrix potter at all but it's nice and not slopped looking

>beatrix potter drawing of a cozy stone cottage in the forest
>>
>>101680859
wow why have i never seen these before
>>
>>101680859
neat
even ignoring the content, there's some sovl some of the drawing styles there that's quite hard to get when using it on chatgpt
>>
>>101680849
That said, I did get a result that somewhat resembles what I was after. I guess the key to a good result is being less precise
https://files.catbox.moe/ve7hxh.png
>>
>>101680912
*some sovl in
>>
>>101680859
I wouldn't say that's "DeviantArt". It's more like dalle trying to gen "deviant" "art"
>>
>>101680921
this works with filtered llms too when you're trying to generate smut but they have a filter on your prompts
you take advantage of the model's intelligence by having it infer what you want rather than stating it outright
>>
>>101680897
It's a shame that stylization is inferior to SDXL. Is that by choice? Or is it by training with AI images that have super generic styles?
>>
File: ComfyUI_00019_.png (1.09 MB, 1024x1024)
>>101680775
>>101680725
>>101680674
Of course it was able to do it :P took a long time to load the initial models though
>>
>>101680960
AuraFlow has the problem too, I think it's an artifact of AI dataset captioning. Gonna have to stick to Kolors for art gens for now unfortunately
>>
>>101680960
More neutral / less style-biased dataset most likely. And I'm sure dalle did DPO training.
>>
File: Image.jpg (838 KB, 2880x2176)
>>
>>101680962
are you drunk?
>>
i like how it drew a face on this house unprompted. it's cute.
>>
>>101680960
I think the vision models people are using to caption their dataset are only describing the content of the image and not going into detail about the style at all

like the model will describe the composition of the image incredibly accurately but won't say much about the art style except to note that it's art and not a photograph, and it likely won't make any guesses as to the name of the artist either

so then the resulting model trained on those captions has amazing understanding of the content of an image, but it doesn't really know much about style other than "photograph/not a photograph"
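you can picture the difference in the captioning instruction itself, something like this (hypothetical prompts, not any team's actual pipeline):

# the kind of instruction that produces style-blind captions
CONTENT_ONLY = "Describe everything in this image in detail."

# one that would actually preserve style information
STYLE_AWARE = (
    "Describe everything in this image in detail, then describe the medium "
    "and art style (e.g. oil painting, PS1-era render, 90s anime cel)."
)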
>>
The image quality is pretty great at 10 steps
>>
>>
SAI releases a 2B model with miserable quality, so everyone can see for themselves how shitty it is
BFL publishes a 12B model with extreme quality that only a few can use, and they advertise it for free with their enthusiasm

too bad for the vram poor, but simply smarter
>>
File: ComfyUI_00025_.png (1.19 MB, 1024x1024)
>>101681011
go fuck a tree dork.
>>
For those struggling with image quality, with these types of models it often helps to add "aesthetic" at the end of your prompt if that's what you're going for, for instance https://files.catbox.moe/5xbcfw.png
>>
>>101681052
up the steps to 50, it makes the images even better imo
>>
>>101681131
isn't pixart really small? so there's groups working on stuff for the vram poor as well
>>
>>101681153
It also takes several minutes.
>>
really impressed with how it follows prompts, it gives exactly what I tell it.
>>
>>101681166
but you also don't need 20 tries for a good picture with crippled hands - like pre flux
>>
>>
>>101681131
even if you have a 24GB vram card you are still too vram poor to make loras on this thing.
>>
>>101681140
your guidance is too high
>>
>>101681141
>https://files.catbox.moe/5xbcfw.png
She's about to become the next phineas gage with that umbrella
>>
>>101681232
I get pretty good hands at 10. Might step up later when things get faster, but for now I'm happy.
>>
>>101681141
I loaded your catbox image into comfy and your prompt was:
>1girl. anime, holding an umbrella, glitch art
there's no 'aesthetic' in there at all
>>
>>101681250
he might be using schnell, its images look overcooked like that even at low guidance
all 'turbo' type models are like that, I can't stand them
>>
Even with the style issue, all it needs is thousands of LoRAs or a very clever solution, so it's not over just because it can't copy styles right away; a capable model is the first step, refinement can happen later.
>>
>>101681297
I switched it up. It was
>1girl. anime, holding an umbrella, aesthetic
>>
>>101681311
>>101681250
I'm just using the example workflow, switching things around now, I did a little sharpening on that last image also at about 0.20 to see the effect as the first image was a little blurry. I'm trying words like ultra sharp and aesthetic like that other anon said.
>>
bros, is it over for the faggots over at SAI???
Unironic question
>>
been trying to wrangle it to give me PC-98 type graphics. so close but so far.
>>
>>101681328
thanks
>>
>>101681340
what percentage of consumer GPUs can run flux?
unironic question
>>
>>101681350
I can

Sent from my GeForce RTX 4090
>>
>>101681340
pixart stabbed the knife
kolors twisted it
flux ran it through SAIs asshole
>>
File: Untitled.jpg (1.98 MB, 3760x2216)
Top images: 2560x1440
Bottom images: 1280x720

It's really interesting how the quality deteriorates at higher resolutions. They just "look worse" by being less interesting.
>>
>>101681350
12GB can run it, although a bit slowly. So a lot of them. I bet it will get faster and more efficient in the coming days as well.
>>
>>101681367
>12GB can run it
this isn't true
>>
The oven stays hot and the bread just keeps coming...

>>101681353
>>101681353
>>101681353
>>
>>101681367
>I bet it will get faster and more efficient in the coming days
just like pixart? and hunyan? right?
spoiler: they never got more efficient
if you just wanna dream, then dream whatever fantastic dreams you want. don't come here asking for conversation to validate your dreaming though.
>>
>>101681372
ty baker
>>
can this thing only do euler or something?
>>
>>101681394
Anon, those are already small, and aren't huge leaps. There's no reason to make them run more efficiently. Remember the early days of local diffusion?
>>
>>101681371
i'm using 3060 12GB and only 16GB ram and it runs aka works on my machine, now stfu.
>>
>>101681421
dpmpp2pm (non-sde) also works
looks worse though imo, overcooked
>>
File: Image.jpg (1.43 MB, 2880x1088)
>>101681372
Nice bake
>>
Sigma is still the king of smokes, guns and cigars but this is damn nice
https://files.catbox.moe/gse06h.png
>>
>>101681436
listen, I'm responding to someone asking "is it over for SAI". if you don't want to acknowledge the problem of accessibility, then you don't need to involve yourself in this conversation. for everyone else, SAI still has a large market of GPUs to reach that don't run these 13gb+ models
>>
>>101681476
Problems of accessibility? What are you talking about? It will get more efficient to run: layer specific quants, finetunes, optimizations for the architecture, more. You are simply wrong.
>>
>>101681497
>What are you talking about?
>>101681350
>It will get more efficient to run
>>101681394
>>
>>101681514
Anon, you are retarded or poor and in denial. The majority of people genning images locally have 12 or more GB of VRAM. There is no accessibility issue already, and it's only going to become more accessible. People are going to focus on this model far more than any model since the NAI leak, since it's an actual, definitive jump in quality. Nobody will realistically be using anything but derivatives of this model in 4 months, unless something better comes out.
>>
>>101681394
Anon why are you getting so triggered and hasty? The cost to run at home is only $500-700, well within the allowance of most households. If you can't afford then just rent a GPU. Also, ever heard of a distilled model?
>>
>>101681542
>The majority of people genning images locally have 12 or more GB of VRAM
just wanna quote this incase you delete your post out of embarrassment at some point
>>
>>101681542
Last time I checked (end of last year) people still recommended GPUs with 8GB.
>>
>>101681556
people have been saying things will not improve for like months, including that we will never get local text to video; by the end of the year you will probably be the one that looks like a retard. Within 5 years home owned GPUs will probably have 1TB VRAM, you think that's impossible? Look at how computers evolved over the last 20 years you tard. We now have SSDs that have write speeds of 5 GB/s, that is miles away from the old SSD tech where you'd be lucky to get 480 MB/s
>>
>>101681686
>Within 5 years home owned GPU's will probably have 1TB VRAM
It's more likely that AI shit won't be done on GPUs anymore than that.
>>
>>101681686
>by the end of the year you will probably be the one that looks like a retard
how much VRAM will consumer hardware have by the end of the year? please just give a specific number
>>
>>101678321
total janny death
>>
>>101679146
>male Pikachu tail


