/g/ - Technology






File: the longest dick general.jpg (2.8 MB, 2137x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102940941

2 4 8 16 32 64 128 Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://8876bd28ee2da4b909.gradio.live

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: 00002-1975399016.png (3.33 MB, 1280x1920)
>>102949088
Oh yes

More
>>
>>102949167
You can't do that and make a good model ever
>>
>>102949205
>You can't do that and make a good model ever
this, at this point I'm just waiting for China to deliver the goods, the west has lost the AI battle
>>
billions must gen
>>
>>102949193
I've got a very old gen from you
>>
>>102949211
China gives no shits about IP, but they have their own problems around nsfw, so I'm not sure they'd do nice things either.
Though at least the base quality would probably be leagues better than never using anything copyrighted, only "allowed" "safe" content.
The consent obsession in the west is turning everything into shit.
>>
>>102949238
>China has a no shit given about IPs, but they have their own problem around nsfw,
desu they aren't much more cucked on nsfw than the west, for example Moshi can do female nudes just fine
>>
File: 00011-2688074747.png (3.38 MB, 1280x1920)
>3 of my gens made it into the OP
Excellent

>>102949229
Great stuff, flux?
>>
>>102949245
>Moshi can do female nudes just fine
kino piercel
https://files.catbox.moe/t6276z.mp4
>>
>>102949245
Oh the difference is just that individuals in the chinese companies have no problems with nsfw used in their training, while the law is very anti nsfw in general, so they always tread lightly, at least in public.

In the west it's the opposite: nothing nsfw is illegal, so the individuals censor out of conviction. It's kind of sad really.
>>
File: chwnol.png (2.35 MB, 1152x2052)
>>
File: 1679020662560538.png (3 MB, 1152x2052)
>>102949252
SDXL
>>
>>102949265
>In the west it's the opposite: nothing nsfw is illegal, so the individuals censor out of conviction. It's kind of sad really.
amen anon, fucking amen... I'm just glad China exists at this point because if we had to only rely on the west to get good shit I would end up depressed kek
https://www.youtube.com/watch?v=XZcN6lIVmSo

To be fair, SD3.5 got a little better in that department, it can do female nudes now, I guess removing Emad the prude from the team was a good idea after all
>>
>>102949252
since you're still using webui/forge, you should play with this extension https://github.com/muerrilla/sd-webui-detail-daemon
I remember using it but it never got ported to comfy, thanks to the migu poster for posting about it in the last thread
>>
>>102949318
>I remember using it but it never got ported to comfy, thanks to the migu poster for posting about it in the last thread
any equivalent for comfy?
>>
>>102949295
>Emad the prude
my view is that most of them are prudes, they all write the same "safety" shit (which always means no nsfw)
>>
>>102949346
https://www.reddit.com/r/comfyui/comments/1g9wfbq/simple_way_to_increase_detail_in_flux_and_remove/
>>
File: 00000-2710090048.png (2.84 MB, 1280x1920)
>>102949284
What model, it looks good

>>102949318
I will look into it, thanks mate
>>
>>102949371
thanks man
>>
>>102949318
that dev is so good, this is another extension that doesn't have a comfyui port
https://github.com/muerrilla/stable-diffusion-NPW
>>
File: 00178-3093194836.jpg (439 KB, 1248x1824)
>>
File: file.jpg (3.88 MB, 5884x3188)
>>102949371
Idk about that method, I can't find a good value between 0.95 and 1, it's not consistent, what value consistently looks the best to you anon?
>>
>>102949477
they all look great anon, 0.95 seems to be the perfect value
>>
File: 00023-1651728872.png (1.77 MB, 1024x1536)
>>
File: cryingpepe.png (1.85 MB, 1120x1120)
that one pic in the collage with the miku tranny is savage
>>
>>102949522
>they all look great anon
look at the prompts, it's really hit or miss, sometimes they miss the text that's supposed to be displayed, sometimes it doesn't understand it as well, I mean for you that's all right, you go for simplistic 1girl images, but if you go past that I'm not sure it's a good deal
>>
>>102949536
He will never be a Migu, many such cases :(
>>
File: 00024-558990164.png (1.88 MB, 1024x1536)
>>102949318
This detail daemon extension seems to work wonders, thanks mate
>>
File: file.png (599 KB, 960x467)
kek
>>
>>102949546
well, you could try re-formatting your prompt, have you tried using an LLM bot to rewrite your prompts? that can help with formatting and grammar
>>
Can you multigpu on the video models that just got released?
I have 1 3090 and 1 3080...
>>
OK but why would you do any of this
>>
File: file.png (394 KB, 2453x1347)
>>102949601
because it's on ComfyUI, you can put the text encoder on the 2nd gpu, but that's all, I don't think you can do inference parallelism
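if you're curious what that looks like outside of ComfyUI, here's a minimal sketch in plain torch/transformers (repo id and prompt are just placeholder assumptions): run the text encoder on cuda:1 and ship the embeddings to cuda:0 where the diffusion model lives

import torch
from transformers import T5EncoderModel, T5Tokenizer

# big text encoder on the second gpu, so gpu 0 keeps its vram for the diffusion model
tok = T5Tokenizer.from_pretrained("google/t5-v1_1-xxl")
enc = T5EncoderModel.from_pretrained("google/t5-v1_1-xxl", torch_dtype=torch.float16).to("cuda:1")

ids = tok("a woman dancing in a hotel room", return_tensors="pt").input_ids.to("cuda:1")
with torch.no_grad():
    emb = enc(ids).last_hidden_state.to("cuda:0")  # hand the embeddings over to gpu 0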
>>
>>102949575
don't thank me, thank migu, I totally forgot about that extension until he posted about it
>>
>>102949613
thanks, is that faster doing that or is there no point?
I've also seen that :

https://github.com/victorchall/genmoai-smol
But it's monogpu.
>>
File: file.png (27 KB, 2823x91)
>>102949622
>thanks, is that faster doing that or is there no point?
I guess that's faster the moment you want to change the prompt, it won't unload and reload the text encoder, but desu it's not that much of a deal, making a video is fucking long so you won't care if you saved 10 sec on the text encoder kek
>>
File: j2zouwj2jlwd1.webm (173 KB, 600x600)
>>102949477
here's another example
>>
>>102949585
>50s horror (makeup)
>80s horror (soul)
>modern horror (cgi)
>>
>>102949642
I see, sad
>>
File: merged_image.jpg (3.74 MB, 5632x2048)
>>102948557
I did some blind comparisons with flux lite. I've got four sets of images of the same 10 seeds of this knight guy.

Lite plus artstyle lora vs Dev Q8 plus artstyle lora:
Results: 1 vote for lite, 9 votes for dev.

The artstyle lora does mostly work, but it's noticeably capturing only like 90% of the style.
And here's the second comparison without the artstyle lora:

Lite with no lora vs Dev Q8 with no lora:
Results: 3 votes for flux lite, 7 votes for dev q8

So in conclusion: flux lite is 23% worse in exchange for being 23% faster.
>>
File: file.png (2.04 MB, 2048x1024)
>>102949648
>>102949618
https://www.reddit.com/r/comfyui/comments/1g9wfbq/comment/lte0rdg/?utm_source=share&utm_medium=web2x&context=3
seems like they improved on the Sigma thing with this "LyingSigmaSampler" node, it adds details without changing the overall picture
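for anyone wondering what it's actually doing under the hood: it just scales the sigma the model is told about inside a step window, so the model thinks the image is cleaner than it really is and compensates with extra detail. a minimal sketch of that idea, assuming a k-diffusion style denoiser (my guess at the mechanism from the reddit post, not blepping's actual node code):

import torch

def make_lying_model(inner_model, dishonesty_factor=-0.05,
                     start_percent=0.1, end_percent=0.9,
                     sigma_max=14.6, sigma_min=0.03):
    # wraps a k-diffusion style denoiser: inner_model(x, sigma) -> denoised
    def wrapped(x, sigma, **kwargs):
        # crude progress estimate from the current sigma (assumption: the real
        # node converts the percents via the sampler's own timestep schedule)
        s = float(sigma.max()) if torch.is_tensor(sigma) else float(sigma)
        progress = (sigma_max - s) / (sigma_max - sigma_min)
        if start_percent <= progress <= end_percent:
            # the lie: tell the model there's less noise left than there really
            # is, so it sharpens instead of smoothing
            sigma = sigma * (1.0 + dishonesty_factor)
        return inner_model(x, sigma, **kwargs)
    return wrapped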
>>
>>
>>102949674
>Lite with no lora vs Dev Q8 with no lora:
>Results: 3 votes for flux lite, 7 votes for dev q8
ok so that's a fucking nothingburger, Q8 is 12gb big whereas flux lite is 16gb big, and Q8 wins lol
>>
>Moderated: QUALITY
That's a new one
Still lets you download the gen though, just doesn't show it to you online for some reason
>>
File: 00069-1911574376.png (3.16 MB, 922x2765)
>>
File: 02131.jpg (2.31 MB, 1664x2432)
>>
File: 00074-670133703.png (3.73 MB, 922x2765)
>>
File: 00003-24852473.png (3.43 MB, 2400x960)
>>
File: 00076-2964468965.png (2.32 MB, 2592x864)
>>
File: file.png (356 KB, 2482x1222)
https://github.com/kijai/ComfyUI-MochiWrapper
Ok gentlemen, I just did a bf16 vs fp8 comparison, they have the exact same settings (pircel image)

>A 22 years old woman dancing on the Hotel Room, she is holding a Pikachu plush
bf16
https://files.catbox.moe/fubqwj.webm
fp8
https://files.catbox.moe/92gksm.webm
>>
>>102949717
>>102949730
>>102949738
How does one achieve this
>>
File: Untitled.png (36 KB, 1026x251)
>>102949295
3.5 Medium is apparently multi-res while Large isn't, so it might be better even
>>
>>102949764
Not use models with zero aesthetic like flux trained on synthetic slop and instead use models trained on actual art
https://civitai.com/models/833294/noobai-xl-nai-xl
>>
>>102949772
>3.5 Medium is apparently multi-res while Large isn't, so it might be better even
there's no way a 2b model is gonna be better than an 8b model, right? I can feel SD3.5M will be a distilled version of SD3.5-8b
>>
>>102949675
it's amazing how webui extension devs mog the comfyui ones: adetailer, npw, detail daemon, resharpen, hires-fix-tweaks, webui-controlnet, I could go on... meanwhile custom nodes, all they do is pollute your comfyui with schizo options and tweaks, the only good custom node dev is kijai, the rest are really autistic
>>
File: 02133.jpg (1.59 MB, 1664x2432)
>>102949717
>>102949730
im in awe
>>
>>102949795
You assume everyone's purpose is to create "quality". Some of us want to generate an image that corresponds to the prompt.
>>
File: 00004-449654167.png (1.02 MB, 832x1216)
>>
>>102949755
We were so hyped after those previews... Minimax at home they said...
>>
>>102949675
How's that work?
>>
>>102949816
>You assume everyone's purpose is to create "quality". Some of us want to generate an image that corresponds to the prompt.
if this was true, SD3M would be a popular model and not a meme, because it follows the prompt well
>>
>>
File: 00023-3053294472.png (3.88 MB, 1037x3110)
>>
>>102949823
>We were so hyped after those previews... Minimax at home they said...
we'll get Minimax at home anon, it'll be the HD version, that's the one they probably use in their demos
https://www.genmo.ai/blog
>Today, we are releasing our 480p base model, with Mochi 1 HD coming later this year.
>>
File: 00008-3694283219.png (985 KB, 832x1216)
>>
>>
File: 00000-2196781888.png (3.86 MB, 1152x2016)
>>
File: file.png (491 KB, 3402x1562)
https://github.com/kijai/ComfyUI-MochiWrapper
Has anyone managed to make sage attention work on windows? got those weird ass errors
>>
File: 00078-3780410220.png (2.86 MB, 922x1843)
>>
>>102949889
These look nice.
>>
>>102949823
Genmo falls off very quickly as soon as you stray from the training data. Heavy cherry picking is needed too. But let's not lie to ourselves: img2video is the only use case valuable for actual production companies and larger projects since you need some way to control subject consistency.

>>102949847
Even if the HD version is also underwhelming and suffers from the same issues as the 480p version, if it's Apache 2 we'll just have to wait for PonyGenmo in 2025. I highly doubt many people will be using the HD version locally given the VRAM requirements though, even if there's a god-tier 4bit quant of the model.
Hopefully the eventual fine-tunes improve prompt adherence too because after being spoiled by the natural language understanding of Flux and the Chinese video models it's really frustrating when genmo doesn't listen. The only consolation is I wasted 20 cents of some Silicon Valley incubator's seed funding on the failed gen
>>
File: 00014-1700103974.png (1.11 MB, 832x1216)
>>
>>102949800
no, it's literally a different model apparently. He's saying 3.5 Large is just a finetune of old 3.0 Large, but 3.5 Medium was done up from scratch which let them add new stuff. So he believes that's why it has multi-res training but Large doesn't. It definitely won't be distilled like Large Turbo, also.

I think it's possible for it to have worse prompt adherence but better image quality, at least.
>>
>>102949908
>Even if the HD version is also underwhelming and suffers from the same issues as the 480p version, if it's Apache 2
that's my concern, maybe it won't be apache 2 for the HD version
>>
>>102949935
Handsome guy. At first I thought he was a lady.
>>
>>102949908
>highly doubt many people will be using the HD version locally given the VRAM requirements
I'm sure that's possible, the 480p + fp8 version only asks for 12gb of vram during inference and only 10gb of vram during vae decoding if you go for vae tiling
>>
>>102949960
In that case we cope with 480p or just wait 6 months for the new hotness base model. As soon as a company finds a way to be profitable with AI video, competition will increase, and Minimax beating out kling and now the open source release of genmo (with the paper coming out soon allegedly) shows that there's no moat for video models
>>
>>102950000
You underestimate the VRAM of the average normie, anon
Even an enjoyable SDXL experience is still too out of reach for a lot of the population since they're on 8GB or less cards
>>
>>102950017
The vram issue is because inference software sucks.
>>
vram is waaaayyyyy too large, for ai.

The reason to have loads of vram, for games, is that players can be very random.
>>
>>102950070
>vram is waaaayyyyy too large, for ai.
true, but we have no other choice, you can't get a good model with 1b, maybe if we improved the transformer architecture that would be possible, we'll see about that
>>
>>102950017
SDXL is totally fine in Comfy with Nvidia Turing or higher arch cards even at 6GB
>>
>>102950017
How many normies do you think are running AI models locally? Their interest in AI is playing for 5 minutes with whatever the new shilled thing (DALLE3/Suno/Minimax/etc.) is and moving on
>>
>>102950129
>How many normies do you think are running AI models locally?
a lot, there's a lot of discussion in the US about destroying the local AI ecosystem by making laws that would make it impossible to train uncucked local models
>>
File: 00019-3822636690.png (1.08 MB, 832x1216)
>>
>>102950165
>there's a lot of discussion in the US about destroying the local AI ecosystem by making laws that would make it impossible to train uncucked local models
Huh? Where?
>>
>>102950182
https://leginfo.legislature.ca.gov/faces/billNavClient.xhtml?bill_id=202320240SB1047
In Commiefornia especially, a lot of AI companies are there unfortunately
>>
File: file.png (290 KB, 2808x1526)
I FUCKING HATE BUILDING BINARIES IT NEVER WORKS FUUUUUUUUUUUUUUCK
>>
File: 02135.jpg (3.13 MB, 1792x2304)
>>
>he's actually trying to build flash attention
Give up lol, you're wasting your time. You have already spent more time than you'd ever save by slightly faster gens.
>>
>>102950218
>slightly faster gens
flash attention is useful for memory, which is the most important resource in AI
>>
>>102950017
According to the latest steam survey around 25% of users have 12gb or more of VRAM, which is pretty good considering a big portion of steam users are poverty Russians and Sudacas just playing DOTA2 on shitboxes
>>
>>102950194
>>102950165
>make impossible to train uncucked local models
Technically if you're using less than $10,000,000 in compute and keeping the model to yourself it's exempt. Still gay desu.
>>
>>102950257
>According to the latest steam survey around 25% of users have 12gb or more of VRAM
That sounds crazy when 12GB+ vram only really started existing at all after the RTX 2000 series
>>
>>102950334
>if you're using less than $10,000,000 in compute and keeping the model to yourself it's exempt. Still gay desu.
If the US doesn't want to do it, someone else will, and that's exactly why China will win the AI race: they don't want to kill AI progress, quite the opposite, they want to make the best AI possible, oh well.
>>
>>102950370
it's not that surprising, games nowadays aren't optimised anymore so you need a shit ton of vram, and the 3060 has a lot of vram and is cheap as fuck because Nvidia was making money off the crypto grifters during the crypto boom in 2021 kek
>>
>>102950113
If you're okay with a single pass and no hiresfix and consider that "totally fine" sure

>>102950381
Video games being unoptimized doesn't give thirdies money they can spend on new GPUs though. I guess it doesn't matter since the future of inference is with NPU/IPUs anyways
>>
>>102950420
>I guess it doesn't matter since the future of inference is with NPU/IPUs anyways
what's that?
>>
>>102950443
Neural/Inference processing units. Basically dedicated hardware for running inference of models. Right now they're just embedded into the CPU and used for small things like helping blur your webcam when on a zoom call etc but both AMD and Nvidia (as well as a lot of startups probably) are working on larger discrete accelerators
source: I worked at AMD for a bit on a project related to NPUs
>>
>>102950494
those NPUs will be able to do graphics stuff as well? Like video games and shit? because the advantage of a GPU is that it can do both video games and AI
>>
>>102950494
>Neural/Inference processing units. Basically dedicated hardware for running inference of models.
how much faster will it be? Let's take a comparison: how much faster would an NPU be than a 3090, for example, on inference?
>>
>>102950498
No, they're only for running inference. The advantage of an NPU is that it'll be much much cheaper than a $3k 4090 for the same ML performance, which is interesting to me because I don't really care about gayming

>>102950519
>how much faster will it be? Let's take a comparison, how much faster a NPU would be against a 3090 for example on inference?
Depends on the NPU. It'll be more efficient for sure. I wasn't working on the ML side of things but more the hardware side so I can't give you any estimates but I'm assuming it'll be similar to ASICs for crypto mining where they completely mog GPUs at the same price point since they're optimized for that specific task
>>
https://github.com/kijai/ComfyUI-MochiWrapper
Ok I managed to make sage attention work, here are the steps:
1) Install triton with those binaries
https://github.com/woct0rdho/triton-windows/releases/tag/v3.1.0-windows.post5
2) Install python 3.11.9 on your computer
https://www.python.org/downloads/release/python-3119/
3) Go to C:\Users\Home\AppData\Local\Programs\Python\Python311 and copy the "libs" and "include" folders
4) Paste those folders onto ComfyUI_windows_portable\python_embeded
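5) sanity check that it all works (assuming the sageattn entry point from the sageattention README):

import torch
from sageattention import sageattn

# dummy attention inputs: (batch, heads, seq_len, head_dim)
q, k, v = (torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16) for _ in range(3))
out = sageattn(q, k, v, is_causal=False)
print(out.shape)  # torch.Size([1, 8, 128, 64]) -> triton + sage are working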
>>
>>
>>102950544
forgot step 0) -> install sage attention -> pip install sageattention
>>
>>102950544
>>102950555
>sage attention
>Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively
Neat
>>
>>102949411
>my chatbot at the start
>by the third reply
>>
1girl, river, walking
>>
>>102950597
OMG IT TETO
https://www.youtube.com/watch?v=dDlljvDSLSg
>>
https://github.com/comfyanonymous/ComfyUI/commit/f82314fcfcc4d83b307f30f06e77db44e95686cf#diff-ff903427b64d57103d983ee5eeb6c33ffb5ab760526a48a35ad42d5afafdf2fbR360-R361
Interesting, so the beta had an issue on ComfyUI, maybe that could explain why that one tends to overburn the image compared to the other samplers
>>
>>102950542
>Depends on the NPU. It'll be more efficient for sure.
what about the memory though? it's the most important thing, is it easier to add memory onto an NPU compared to a GPU?
>>
File: ComfyUI_34401_.png (1.5 MB, 848x1024)
>>
>>102950597
Now do 1girl, river, wanking
>>
>>102950742
nice style
>>
File: file.png (3.64 MB, 2638x1452)
>>102950748
>wanking
>>
>>102950724
>what about the memory though? it's the most important thing, it's easier to add memory onto a NPU compared to a GPU?
Memory is cheap anon, don't let Nvidia and AMD trick you into thinking it's not because of their ridiculous prices at the data center tier. They could both sell us a 32GB vram graphics card for under $1000 if they wanted to. There's just no reason to ever do that because they know people will pay 4x the price or more for it due to the gold rush and their fiduciary duty to shareholders to maximize profit
>>
>>102950761
the problem is that those are the same greedy companies (Nvidia and AMD) that will make those NPUs, so yeah maybe NPUs will be fast, but if they only have 16gb of vram you're just as fucked as if you had 16gb of gpu vram
>>
>>102950420
You can do hi-res-fix up to like 1.5x with 6GB VRAM + 16GB RAM on an SDXL model no problem. Again the card has to be Nvidia and it has to be Turing or later, though.
>>
File: ComfyUI_34406_.png (1.43 MB, 848x1024)
>>
so many 1girl sloppu ;_;
>>
Remember this?
https://blackforestlabs.ai/up-next/
I thought they would never release it to the public, but now that we got Mochi, maybe they'll try to enter into competition with them, that would be cool desu
>>
>>102950788
Sure but my point is that a professional or prosumer tier NPU will be cheaper than the equivalent GPU for equivalent inference performance

>>102950906
>the card has to be Nvidia and it has to be Turing or later, though.
I stand corrected then. How long would a 1024x1024 gen hiresfixed to 1536x1536 take on 6GB+16GB? I consider anything longer than 2 minutes per gen "unusable"
>>
>>102951259
>my point is that a professional or prosumer tier NPU will be cheaper than the equivalent GPU for equivalent inference performance
Idk about that, if only Nvidia makes good ones, they'll just make them expensive as fuck like their overpriced GPUs
>>
flux gave this bitch a amulet
>>
>>
File: file.png (944 KB, 735x856)
>>102951284
I read it as "a mulet" so I was expecting this kek
>>
File: file.png (129 KB, 2808x653)
>>102950544
>>102950200
btw with this method you'll be able to build your own binaries of flash attention
>>
>>102951284
It didn't fill her other equipment slots though
>>
File: 00038-1638469766.png (763 KB, 1152x896)
>>102951140
These models are all trained really well for 1girl slop, usually if I try to make anything else it's shit. Yes there's a skill issue component, but it's easier to go with the flow and produce 1girls.
>>
>>102950568
>>Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively
I wanted to verify by myself, so here you go:

>Donald Trump mocks and laughs at a kneeling, weeping Kamala Harris
>121 frames, 64 steps, seed 42

>Sage: 35:28<00:00, 33.26s/it, 16.2gb VRAM during inference
https://files.catbox.moe/dyi3cm.webm
>flash_att: 42:49<00:00, 40.14s/it, 16.2gb VRAM during inference
https://files.catbox.moe/dqxrat.webm

Sage is definitely faster, use it >>102950544
>>
File: 00003-4037236168.png (2.02 MB, 1344x768)
I wanted a fisherman catching a bass. Instead I get... a bass fisherman? I mean, if I'd written the words bass fisherman in the prompt, I'd get it.
>>
>>102952099
So it's faster, but not 2.7x faster
I wonder how long it'll take to go from 30+ minutes to under 5 between optimizations and hardware improvements
>>
File: 3380426930.png (1.11 MB, 896x1152)
>>
File: 2611978228.png (1.03 MB, 896x1152)
>>
File: file.png (105 KB, 2773x619)
>>102952099
the neat part is that sageattention is also used on image models like flux, and it's also faster than going for the sdpa optimisation, I used to get 3.6s/it on sdpa, now I'm at 2.91, let's go dude!
>>
>>102952128
>So it's faster, but not 2.7x faster
*2.1x, and 2.7x was for sageatt vs xformers, but yeah of course they went for the most extreme of cases to get those numbers (as all researchers do kek), a 1.2x speedup is still really cool though, and when I look at those 2 videos I notice the quality on sage is better, so it's a win/win situation there
>>
Does anyone know what cuda malloc does? When I disable it I don't notice any difference
>>
>>102949755
Thanks for the comparison.
I'll give bf16 a go today. I think you had an "unlucky seed" for your test btw.
>>
>>102952265
>I'll give bf16 a go today. I think you had an "unlucky seed" for your test btw.
probably yeah, it was on fp8 too, unfortunately you can't go too far on bf16, when I go over 60 frames it overflows, I wish there was something in between fp8 and bf16 so that you get the quality and enough room to stack up the frames
>>
File: 405816733.png (1.22 MB, 896x1055)
>>
>>102952287
>I wish there was something in between fp8 and bf16 so that you'll get the quality and enough space to stack up the frames
Why does bf8 not exist for fp8 like how bf16 exists for fp16?
>>
File: 3562288793.png (1.1 MB, 896x1152)
>>
File: bComfyUI_132649_.jpg (1.34 MB, 3072x1536)
>>
kek, so this general just devolved into coomposting too. it's just sdg 2.0 now
>>
>>102952412
These are great, having a human for scale is a nice touch
>>
So can you run that mochi thing on 2x3090, on comfy (on linux)?
>>
>>102949193
>>102949229
>>102949252
>>102949318
>>102949358
>>102949371
>>102949409
>>102949522
>>102949599
are these real or AI? I don't have my glasses right now
>>
>>102950544
https://reddit.com/r/StableDiffusion/comments/1gb07vj/how_to_run_mochi_1_on_a_single_24gb_vram_card/
Just made a long ass tutorial for those who weren't able to make Mochi work on their computer
>>
File: screenshot.png (52 KB, 2406x1896)
>>102945840
ok I've made this program (claude did) to help simplify the second half of the process:
https://github.com/rainlizard/EasyQuantizationGUI/releases

For anyone on Windows who wants to convert the 24GB flux1-dev.safetensors file to 12GB .gguf it should be pretty easy now.
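if anyone wonders what the conversion actually does to the weights, this is roughly the math of Q8_0-style quantization (a toy sketch of the idea, not the real gguf byte layout the tool writes):

import torch

def quantize_q8_0_toy(w: torch.Tensor, block: int = 32):
    # each block of 32 weights becomes 32 int8 values + one fp16 scale,
    # ~34 bytes instead of 64, which is how 24GB lands near 12GB
    flat = w.flatten().float()
    pad = (-flat.numel()) % block
    flat = torch.cat([flat, flat.new_zeros(pad)]).view(-1, block)
    scale = flat.abs().amax(dim=1, keepdim=True) / 127.0
    scale = torch.where(scale == 0, torch.ones_like(scale), scale)
    q = torch.round(flat / scale).clamp(-127, 127).to(torch.int8)
    return q, scale.half()

def dequantize_q8_0_toy(q: torch.Tensor, scale: torch.Tensor):
    return (q.float() * scale.float()).flatten()  # what the loader does at runtime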
>>
>>102952977
>So can you run that mochi thing on 2x3090, on comfy (on linux) ?
you can make it work with a single 3090 >>102953060
>>
File: mochi.png (4 KB, 1081x22)
guys I finally got the "room heater" comfyui extension
>>
>>102953224
>150 steps
what? you can go for lower, the default value for Mochi is 64 steps
>>
>>102953068
>For anyone on Windows who wants to convert the 24GB flux1-dev.safetensors file to 12GB .gguf it should be pretty easy now.
nice job anon, I'll keep this link in mind, I wanna make ggufs of other flux variant models
>>
>>102953278
I think the examples used by mochi use 200, and I can wait. I can't fucking install sageattn so I'm using pytorch's default attention, how much slower does it make the process?
>>
>>102953302
>I think the examples used by mochi use 200
I've heard it was 64, how did you get that number?

>I can't fucking install sageattn so Im using pytorch's default attention
Why? It works on both windows and linux >>102950544
>>
>>102953325
>how did you get that number?
half-remember reading it somewhere like yesterday, maybe I made it up idk
>>
>>102953325
I know but triton gives me a message about not being able to find Windows Kit/10/Include and I've been fiddling with pip shit for an hour now so I've given up
>>
>>102953357
>triton gives me a message about not being able to find Windows Kit/10/Include
did you do this? >>102950544
>3) Go to C:\Users\Home\AppData\Local\Programs\Python\Python311 and copy the "libs" and "include" folders
>4) Paste those folders onto ComfyUI_windows_portable\python_embeded
>>
>>102953372
Whoops, no. I'll do it after I get this gen, thanks
>>
>>102953060
Why specify 24gb when it works on a 16gb card, well at least in Linux it does.
>>
>>102953525
>Why specifiy 24gb when it works on a 16gb card
true, but my tutorial is for both fp8 and bf16 and your 16gb card can't handle the bf16
>>
File: ComfyUI_SD35L_0302.jpg (160 KB, 1152x896)
Flux 8B model
https://huggingface.co/Freepik/flux.1-lite-8B-alpha
>>
>>102953753
it's still distilled right?
>>
The Russian teens are middle eastern indians now wtf. What garbage captioner did the genmo team use or is the website secretly modifying prompts?
>>
Here's the better of the two to show it wasn't just a fluke
>>
>>102953771
>>102953782
>1696x960
that's the official resolution you got when downloading those videos? if yes then it means their demo is using the HD version, we've only got the 480p model locally so far
>>
>>102953545
I tried, you are right, much sadness :(
Good reddit post though.
>>
>>102953835
>I tried, you are right, much sadness :(
it's ok, the fp8 isn't that different from the bf16, you don't lose much and you can still run Mochi
>Good redit post though.
thanks :3
>>
>>102953790
That's the official resolution yeah. That would explain why the catbox webms of the 480p gens look so much worse.
>>
>>102953848
All my gens with fp8 have had considerable "wavy mirage" effects on motion, I'd hoped the bf16 would reduce that; facial coherence and generation is sometimes as spot on as FaceDetailer for sdxl but it's wildly unpredictable through the 8 or so gens I've done so far.
This is the first time since I bought my GPU last Christmas (4060ti) that I've wanted a new card due to the processing time, it was good while it lasted.
>>
File: file.png (65 KB, 716x676)
>>102953918
>All my gens with fp8 have had considerable "wavy mirage" effects on motion, I'd hoped the bf16 would reduce that,
I think it does yeah >>102949755, but the quality is still not on par with their demo (which probably uses the HD version), I'm trying to increase the resolution to see what it does kek (this resolution and fp8 ask for 16gb of vram)
>>
>>102953918
>All my gens with fp8 have had considerable "wavy mirage" effects on motion
maybe Comfy's new scaled fp8 could fix this, it's supposedly better in quality than the regular fp8, dunno how to make those though
https://huggingface.co/comfyanonymous/flux_dev_scaled_fp8_test
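no idea how comfy generates them either, but "scaled" presumably means storing a per-tensor scale next to the fp8 weights so the full e4m3 range gets used instead of casting blindly. a toy sketch of that reading (an assumption on my part, needs torch >= 2.1 for the float8 dtype):

import torch

def to_scaled_fp8(w: torch.Tensor):
    # map the largest weight onto e4m3's max representable value (448);
    # a blind .to(float8) would waste most of the range on small tensors
    scale = w.abs().max().clamp(min=1e-12) / 448.0
    return (w / scale).to(torch.float8_e4m3fn), scale

def from_scaled_fp8(q: torch.Tensor, scale: torch.Tensor):
    return q.to(torch.float16) * scale  # dequantize at load/inference time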
>>
She's looking at me judgingly because of my promptletness

The online site's text prompt moderation works like Luma's, where one prompt works but a very similar one with one word changed doesn't. In this case "bathroom" was blocking the prompt until I changed it to "apartment"
>>
File: 3948116548.png (937 KB, 1152x896)
>>
>>102954000
>...dunno how to make those though
I don't think Comfy would do it specifically for mochi but some techniques might be applicable for people in that area to use if they wanted to try.
Also, I think captcha is about to break, had a few just now where the slider doesn't change the image.
>>
Remember that meme? Good times.
https://xcancel.com/__theben/status/1829554120270987740
>>
File: ComfyUI_01567_.png (1.92 MB, 1024x1024)
>>
>>102953070
thanks anon
>>
File: ComfyUI_01569_.png (1.42 MB, 1024x1024)
SD3.5L is pretty decent so far
>>
>>102954405
anatomy is very bad for me and it doesn't work well with resolutions outside of 1024x1024, but they said they used an improved architecture for sd 3.5 medium so i'm waiting for that
>>
File: file.webm (813 KB, 1712x960)
Yep, their demo is definitely the HD one, because the demo renders at 1696x960, and when I use that resolution on the local 480p model I get this blurry shit kek
>>
>>102954433
>they said they used a improved architecture for sd 3.5 medium so i'm waiting for that
desu they should've used the improved architecture for the 8b as well, they could've beaten flux that way, their 2b model won't move the needle I'm afraid
>>
>>102954448
im assuming the 8b was just released as a way to generate hype and the medium model is what they put most of their work into. if they released the medium model by itself it would have been met with disappointment ig
>>
File: file.png (105 KB, 360x521)
>>102954466
it's just a 2b model, they shouldn't focus on this little shit, they had the chance to beat flux with their 8b and they didn't go for it, why are they so fucking retarded??
>>
>>102953068
Just curious, are we able to quantize any of the general image diffusion models like sd 3 or pony and run it from sd webui forge?
>>
>>102954490
they are a collapsing company, i doubt they have the resources to focus on the 8b anymore. back when sd3 released, the old 8b was the one they locked behind the api to generate some coin out of desperation and even that failed because they were outclassed by the competition
>>
what do you jabronis put in your negative when genning with flux? Bad hands, bad image, ugly, deformed? Does any term in the negative improve overall image quality?
>>
File: ComfyUI_01577_.png (1.4 MB, 1024x1024)
>>102954433
Okay, I'll keep having a good time though
>>
>>
>>
>>102950544
>>102953060
Do I have to use portable Comfy? I assume dropping those Python311 libraries into my Python 312 install won't work?
>>
>>102954569
>is happy generating white plates with strawberries and chocolate splattered around
I guess I'll envy you.
>>
>>102954529
No, the only use is if your gen has something you didn't want, add that to negs and regen.
>>
File: ComfyUI_temp_xgpij_00025_.png (1.94 MB, 1088x1360)
>>
>>102954490
>they had the chance to beat flux with their 8b and they didn't go for it
They tried, a Base Model with naked women is the proof they really tried.
The bad thing with incompetence is that you can't do it no matter how hard you try.
>>
File: 00030-2621750539.png (1.21 MB, 832x1216)
>>
>>102954707
>I assume dropping those Python311 libraries into my Python 312 install won't work?
I guess you can install python 312 on your computer and copy those folders over, just don't forget to also download the triton build made for python 312
>>
File: ComfyUI_temp_xgpij_00029_.png (2.26 MB, 1088x1360)
>>
>>102954940
I mean, they will be using a better architecture for SD3.5-2b; if they had also done that for the 8b, I'm sure they would've caught up with Flux
>>
>>102954707
>Do I have to use portable Comfy?
If you know where the python.exe that runs your Comfy is located, you can remix this tutorial to make it work I guess
>>
File: ComfyUI_temp_xgpij_00035_.png (2.17 MB, 1088x1360)
>>
https://github.com/kijai/ComfyUI-MochiWrapper
https://huggingface.co/Kijai/Mochi_preview_comfy/blob/main/mochi_preview_dit_GGUF_Q4_0_v1.safetensors
>GGUF.safetensors
didn't know it could work like that :d
>>
>>102955078
can I run this shit on my 3060 12gb?
>>
>>102955097
without any issues, fp8 was asking for a bit more than 12gb of vram
>>
>>102955078
ok, I'm mildly aroused now.
>>
>>102955078
this mf transformed a gguf into a safetensor, that's a fucking genius move if you ask me, maybe if we do the same for the flux ggufs and transform them into safetensors, they'll be as fast as fp8 and won't be slow motherfuckers when we add loras on top of them?
>>
File: 00009-170385697.jpg (996 KB, 1280x1920)
>>
>>102955168
neat
>>
File: 3603366327.png (1.16 MB, 1152x896)
>>
>>102955078
it's crazy to think of where we were at the end of 2022 compared to now
>>
File: file.jpg (2.13 MB, 7262x1795)
it's impressive how much of an impact this tiny node has
https://reddit.com/r/comfyui/comments/1g9wfbq/simple_way_to_increase_detail_in_flux_and_remove/
>>
File: file.jpg (931 KB, 1763x2304)
https://civitai.com/models/883426/verus-vision-10b?modelVersionId=988886
This is the first finetune of dedistill, I like the skin texture, it doesn't look like plastic anymore
>>
File: file.png (227 KB, 2123x959)
>>102955467
>This is the first finetune of dedistill
>finetune
not even close, it's just a Lora merge
>>
File: file.png (24 KB, 918x227)
They'll never let us down
>>
File: file.png (61 KB, 927x677)
A little bread from heaven
>>
>>102955532
they know what to do, a bigger model will make their shit relevant, we're waiting BIGMA my dear Mr. Lawrence
https://www.youtube.com/watch?v=jElCDsfptVU
>>
>>102955554
I know you love to burn money doing inefficient shit, but efficiency makes it possible to train big models. Or maybe you like having big models restricted to people with $100k in hardware?
>>
>>102955566
>Or maybe you like having big models restricted to people with $100k in hardware?
why are you whining like that? the llm fags are dealing with fucking 70b models, you have no idea how good you have it
>>
>>102955593
Yeah, must be nice relying 100% on Meta lmao
>>
File: file.png (397 KB, 960x876)
>>102955593
>>102955598
a tale as old as time
>>
>>102955548
goddamnit just give us the model from the demo I don't give a fuck give me new toys
>>
>>102955610
you're already tired of the SD3.5 toy from yesterday??
>>
>>102955619
you should've stopped caring about SAI a while ago
>>
File: test3.jpg (2.82 MB, 3072x1440)
>>102955467
I tested it a few threads ago, it doesn't really improve over regular dedistill while removing a lot of flexibility. I believe any perceived skin detail increase is just the extra grain Verus adds.
Left to right, Verus>DeDistillfp8>Distilledfp8
>>
>>102955643
I kinda do, last time I gave a fuck about them was during the SD3M fiasco, it was funni, and then they released SD3.5, I tested it out for an hour yesterday and quickly noticed it wasn't at the level of flux, put it in the trash and went on with my life, I'll care about them for the next release though, if it's still inferior to flux it'll be 1 more hour, if not then they'll have succeeded in their redemption arc kek
>>
>>102954992
>>102955029
What model is that?
>>
>>102955655
I see, what I notice though is that the dedistilled version seems to have more realistic humans than vanilla flux dev, but it's just one picture so it's hard to make a definitive conclusion about that
>>
>>102955078
https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main

Which one is recommended? The Q4? The fp8? The bf16?
Is there a difference in quality?
>>
File: file.jpg (2.04 MB, 7961x2897)
>>102955731
>Which one is recommended? The Q4? The fp8? The bf16?
>Is there a difference in quality?
so far we only have image models as a baseline, but fp8 seems to add some blur glitches during movement >>102949755 >>102953918
>>
>>102955665
I immediately saw potential in XL despite the initial reaction from anon but I also saw potential in sigma which, to be fair, got some love but never became the meta. regardless, the trajectory of SAI and the people who left for BFL seems to be larger and larger models which is not something I want. I do like flux but I'm not going to delude myself into thinking its ecosystem will take any less time to flesh out than XL - in fact, we all know it'll take MUCH longer. I'm sure some are okay with waiting and don't mind the hardware requirements but I think it's idiotic to completely forgo the idea of making smaller, better models. flux is bloated as hell and sd3 appears to suffer from le safety demon. it's like software in general being more about "lets put more shit in it" rather than "how can we optimize" or whatever.
I could simply be poorfag coping but I don't think I am.
>>
>>102955775
thanks anon, the videos are both very blurry anyway, I hope all local gens aren't that bad
I'll probably try both anyway
>>
>>102955885
we won't get the same quality as their API demo because they are using the HD version, and we are not
https://www.genmo.ai/blog
>Today, we are releasing our 480p base model, with Mochi 1 HD coming later this year.
>>102954440 >>102953790 >>102953884
>>
>>102955838
I just don't get how people don't see that Pixart gets decent results with 600m, much better than SD 1.5 or base SDXL, so why wouldn't a 1.6B model with an extremely efficient architecture be something that could be very good, especially for a niche model? We don't need every model to be the kitchen sink, in fact Pony is a perfect example of a niche model that everyone likes. There's no reason to believe that Sana 1.6B can't be the next hyper specific booru anime model.
>>
>>102955899
oh I see, and I'd guess the hd one would be unusable locally unless you get it to work on multiple h100s...
>>
>>102955775
q4 produced very blurred output and needed the frame_batch cranked right down. Testing again on a small gen rn using the default prompt and seed
>>
>>102955926
>oh I see, and I'd guess the hd one would be unusable locally unless you get it to work on multiple h100s...
not at all, I was able to make a high resolution video on my 3090, of course the result looks like absolute shit because the model was never intended for it, it was just to test whether the VRAM would be enough, and it is: fp8 asks for 16gb of VRAM at that resolution, we're good o/ >>102954440
>>
>>102955958
oh very cool, hopefully the hd one is released soon then
>>
>>102955902
I think base sigma being so undertrained compared to the competition did it. Perhaps some of it was skill issue, and likely the audience for base models doesn't care about aesthetics (which it had in spades). I could schizo-babble about the west conspiring to kill any chinese competition by means of making their models look bad or something, but I think the real reason is something else.
I know for a fact that, somehow, many were filtered by its install. Somehow they didn't know where to place the files for sigma but then when Flux/SD3 arrived, magically they remembered or figured it out.
I wish I knew the answer to your question, but I don't.
>>
>>102955939
I retract this, seems it was just a bad gen idk...
iterate, iterate, iterate.
>>
>>102956002
It's the retarded bigger number meme. There's a reason why the Xbox 2 was called the Xbox 360 (because of the PS3). But I also think it was the timing, Pixart is too small of a model and SAI was promising a Flux-like super model so no one wanted to switch and even to this day ComfyUI doesn't natively support Pixart.
>>
>>102956036
>It's the retarded bigger number meme.
so much of a meme that the SOTA local model is the biggest one
>>
>>102956036
>ComfyUI doesn't natively support Pixart.
i have a suspicion that this is due to some dumbass politics between comfy and city et al.
>>
>>102956045
The problem with Flux in particular is no one challenges the idea that 12B is anything but a means of preventing local competition. It's what I would've done if I was monetizing in this space: you purposely make base models impossible to train and you get to build a loyal audience of window lickers while ensuring no one will make a Pony model to compete against you. We already see the results of this, BFL gets way too much respect despite ghosting us.
>>
File: file.png (46 KB, 827x417)
https://www.reddit.com/r/comfyui/comments/1g9wfbq/comment/lte0rdg/?utm_source=share&utm_medium=web2x&context=3
Ok now that's impressive
https://imgsli.com/MzExNjQ2
>>
>>102956074
>BFL gets way too much respect despite ghosting us.
I think you overestimate the % of trainers in the ecosystem, this model is currently being downloaded more than a million times per month, the very vast majority of people are just using models, not training them, to them Flux is excellent and they'll never see its shortcomings in terms of training because they will never do something like that, so their feelings towards BFL can only be overall positive, they got in their hands a model that is consistently good and that's it
>>
>>102956079
 dishonesty_factor 
is probably the funniest name for a setting I've seen thus far
>>
>>102956122
Just sucks because this will arrest local development for a year or more.
>>
>>102955467
>>102955508
>verus-vision was lauded as the first flux finetune by anon
>it's not actually a finetune
holy kek
>>
>Hi everyone, yes, we are still alive! Thank you for your attention to SANA, our latest work on efficient text-to-image generation. It was developed jointly by people from NVIDIA, MIT, and Tsinghua University.
>We are preparing to open source SANA recently (waiting for the company's approval process, but whether it can be open source depends on the company's approval result). If you have any suggestions, you can leave a message directly in the channel, send an email to the SANA team (enzex@nvidia.com/junsongc@nvidia.com), or fill in this Google sheet
>https://docs.google.com/spreadsheets/d/1rQWGYdswcl8O6V5Vu3AqtBh9PotkmTkxN2inI_njDy0/edit?gid=0#gid=0
anything you guys wanna tell them?

>We initially plan to support ControlNet and expand to Video generation. We hope that with community feedback, SANA will get better and better.
i had a feeling they were planning on using sana as a base for a video gen model, only way the super compressed vae would make sense
>>
>>102956249
Super compressed VAE is good for both video and high resolution images. Also less tokens required for the model to learn which means per parameter efficiency goes up.
>>
>>102956079
https://imgsli.com/MzExNjYx
this shit is really amazing, I always felt flux was a bit empty on its image, that node fixes that perfectly
>>102956140
kek
>>
>>102956283
it's not good when it wrecks eyes and hands and any other small detail
>>
>>102956295
You can use Flux I don't care
>>
>>102956249
https://github.com/NVlabs/Sana/issues/3#issuecomment-2434357814
>The released version will be further trained. This is a prototype demo for experience.

>>102956283
>Also less tokens required for the model to learn which means per parameter efficiency goes up.
you are right, i completely forgot about that
>>
>>102956249
>anything you guys wanna tell them?
yes, why do they want to compress the VAE so much? their model is small enough, a good quality VAE makes all the difference, especially on details, that's not something you can just overlook and compress
>>
>>102956310
>duh why woood u wunt efficansy
>>
>>102956295
you also have to take into account how much that model is trained / the webp compression from the demo. not saying the vae DOESN'T contribute to this but there are other factors at play
>>
>>102956300
might as well just close your eyes while you generate if you want the model to be good no matter the reality
>>
>>102956310
>yes, why do they want to compress the VAE so much?
this is probably why
>expand to Video generation. We hope that with community feedback, SANA will get better and better
they want to use sana's research to dip their toes into video gen, it would make sense to min-max performance then
>>
>>102956317
they're not efficient at all, it looks like shit, that's the problem anon
>>
>>102956345
No it doesn't, and I know this for a fact given I've been posting many Sana gens and no one has said anything :)
Almost like you're just a dumb BFL employee
>>
>>102956365
>Almost like you're just a dumb BFL employee
says the Sana employee
>>
>>102956344
It always makes sense to min-max performance because it's impossible to predict what happens when things run 8 times faster
>>
>>102956244
at least that means that the result is dissapointing only because he didn't really finetune the model, what a shame
>>
now now you guys, let's not fight, we are all big adults here. if you can, please make sure to give the sana guys some good constructive criticism. i've never seen anyone else ask the community for advice like this before
>>
>>102956376
For example, there are multiple training loss techniques no one uses because the overhead is way too high (i.e. perceptual loss) despite them objectively improving convergence and final output
>>
>>102956338
that's true, but after seeing the effects of the sdxl/1.5 vae and cascade I'm definitely super skeptical of any model that does this super compression shit
>>
File: ComfyUI_temp_xhnhk_00015_.png (2.18 MB, 1088x1360)
>>
File: ComfyUI_temp_xhnhk_00016_.png (2.1 MB, 1088x1360)
>>102956288
gj miqu anon
>>
>>102956405
There are pros and cons for everything, but please keep demanding your filet mignon and acting like that's the only thing in the world.
>>
File: AnimateDiff_00001.webm (566 KB, 872x488)
10 fucking minutes
>>
>>102956441
>AnimateDiff
I thought you used Mochi for that one lol
>>
File: 00081-2227089261.png (1.11 MB, 832x1216)
>>
>>102956453
Oops, that is Mochi. I forgot to change the filename prefix
>>
>>102956023
retracting this (again), I get blurry low coherence with the Q4 compared to fp8. the default 168 frames json has an error where I have to rebuild the video_combine node or it doesn't save the image or vid
(Some Time Later...)
fp8
https://files.catbox.moe/sj6ecc.mp4
ggufq4
https://files.catbox.moe/e1cg5r.mp4
idkwtf is going on.
>>
File: ComfyUI_temp_xhnhk_00019_.png (2.05 MB, 1016x1280)
>>
>>102956440
the (sdxl mainly) compression together with low parameters and bad te IS one of the main things bottlenecking image generation right now
have u ever tried just taking a normal image and encoding it and decoding it? im sure u will hecking love the result
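the roundtrip is a few lines with diffusers if anyone wants to see the damage for themselves (assuming the stabilityai/sdxl-vae repo id; small faces and text are where it falls apart):

import torch
from diffusers import AutoencoderKL
from diffusers.utils import load_image
from torchvision.transforms.functional import to_tensor, to_pil_image

vae = AutoencoderKL.from_pretrained("stabilityai/sdxl-vae").to("cuda").eval()
img = to_tensor(load_image("input.png").convert("RGB")).unsqueeze(0).cuda() * 2 - 1  # to [-1, 1]
with torch.no_grad():
    latent = vae.encode(img).latent_dist.sample()   # 8x smaller spatially
    recon = vae.decode(latent).sample.clamp(-1, 1)  # back to pixel space
to_pil_image((recon[0] * 0.5 + 0.5).cpu()).save("roundtrip.png")  # compare against input.png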
>>
>>102956474
>idkwtf is going on.
it's not complicated, Q4 is too aggressive to be usable
>>
>>102956482
The biggest bottleneck is training speed and requirements. We can survive just fine on SDXL VAE quality if it means 8 times faster training and 8 times lower total requirements.
>>
File: ComfyUI_temp_xhnhk_00024_.png (1.78 MB, 1016x1280)
>>
>>102956474
Ok so basically going this low is unusable.
>>
>>102956498
sure training speed is a big problem but if the absolute quality ceiling is not good then there is no point in training in the first place
>>
>>102956523
You people were happy to slurp up shitty SD 1.5, I think you are just moving around requirements arbitrarily. And again, 100% VAE recreation is not the be-all-end-all of a model, in fact it's a very superficial requirement.
>>
>>102956523
>if the absolute quality ceiling is not good then there is no point in training in the first place
this, case closed

>>102956549
>You people were happy to slurp up shitty SD 1.5
are you a retard? we had no other choice back then, now we have
>>
>>102956497
It's probably this, I'm overthinking, searching for answers from the perspective of "I don't know enough about all the parameters in front of me".
>>102956516
Seems so, but I will polish the turd for a bit and see.
>>
>>102956565
It's okay, you can use Flux. I'll use Sana. We'll see who gets bored first, I'd imagine Flux will get boring when you realize Loras are extremely limited. Also as always, I never see the Flux apologists ever posting gens.
>>
File: ComfyUI_temp_xhnhk_00029_.png (2.1 MB, 1088x1360)
>>
File: ComfyUI_02466_.png (1.65 MB, 1024x1024)
>>102956079
can you pls post a workflow
>>
https://www.reddit.com/r/StableDiffusion/comments/1gb07vj/comment/ltjdvlm/?utm_source=share&utm_medium=web2x&context=3
>Apparently 200 steps is the official number too, I haven't dared to go that high yet.
HOLY SHIT ARE THEY SERIOUS?? THATS ALREADY TOO SLOW WITH 50 STEPS AAAAAAA
>>
Yeah I'm starting to get really disappointed in genmo, prompt adherence and world knowledge is really shit compared to minimax or kling
And I've only been using the website
Maybe I should be prompting using tags and commas instead of natural language?
>>
File: file.png (597 KB, 1963x612)
>>102956614
what workflow do you need? it's just one new node to add in between, it's not rocket science lol
>>
File: ComfyUI_temp_xhnhk_00033_.png (1.86 MB, 1088x1360)
>>
>>102956371
Sana is a company?
>>
File: ComfyUI_temp_xhnhk_00034_.png (2.31 MB, 1240x1424)
>>102956637
have you tried prompting in chinese?
>>
>>102956637
yeah their HD version isn't that good, but that's probably why they're not releasing it now but at the end of the year? They're probably trying to improve it I guess
>>
So am I getting this right that neither SDXL nor Flux offer any benefit to anime generations, which have already been perfected with SD1.5? I mean if you want to use a Pony you need to use SDXL, sure, but inherently it doesn't seem to add anything.
>>
>>102956685
Pony does actual porn with accurate genitals. SD 1.5 cannot do that.
>>
File: ComfyUI_temp_xhnhk_00036_.png (2.7 MB, 1240x1424)
>>
>>102956685
ive been liking some pony models for anime quite a bit compared to other ones i've tried
is there a better low res model to look at? kinda stopped doing ai gens for a while when I swapped from my 3090 and rocm didn't yet support rdna3 for a while so I missed out on a lot between sd1 and now
>>
File: but where.jpg (298 KB, 3088x1636)
>>102956645
but where exactly?
my shit looks like picrel
>>
File: file.png (21 KB, 2785x77)
>>102956618
https://youtu.be/4lVUuuuJU7c?t=5
>>
Cutest 1girl so far but unfortunately she's 2young

>>102956670
kek I haven't but I've seen examples on the website using Arabic, Cyrillic etc. and it seems to understand those languages. Genmo is a US company so English should be best

>>102956679
>They're probably trying to improve it I guess
Yeah their pricing page implies you get early access to new models so hopefully the new model is better AND it makes BFL release theirs in some way too
>>
File: 1715241882023243.jpg (1.14 MB, 1536x2048)
>>
File: ComfyUI_temp_xhnhk_00038_.png (2.61 MB, 1240x1424)
>>
File: ComfyUI_07742_.png (1.2 MB, 1024x1024)
>>102956718
yeah it won't work with the KSampler, take this workflow: https://files.catbox.moe/a0snom.png
and also I modified the script so that it allows for more decimals on the values: https://files.catbox.moe/4gxohm.py
>>
File: ComfyUI_temp_xhnhk_00040_.png (2.57 MB, 1240x1424)
>>
File: ComfyUI_temp_xhnhk_00041_.png (2.79 MB, 1568x1568)
>>
>>102956613
>>102956744
>>102956772
unrealistic, no girls post here nor would they wear merch
>>
>>102956714
check out noobxl
>>
File: wut are theset.jpg (108 KB, 1676x972)
>>102956759
thanks bro, wut are these red ones?
>>
>>102956826
It's too bad illustrious has those issues. I choose models based on their names. This is why I will be using Sana over Flux.
>>
>>102956854
some nodes to put the text encoder on my second gpu, you don't need them I guess, just go with the regular loaders you're currently using
>>
>>102956826
>noobxl
when proxl?
>>
File: ComfyUI_temp_xhnhk_00046_.png (1.71 MB, 1080x1344)
>>102956815
>he doesnt know about LDG brand clothing
>>
how much vram do you recommend for a beginner UI?
>>
Keep it rolling

>>102956911
>>102956911
>>102956911
>>
>>102956826
i think i did try their prerelease stuff but would have to totally rework my prompts that im using on pony atm
the early access one did generate an image but it does seem to be missing data that models like pony have for specific niches
>>
File: ComfyUI_temp_xhnhk_00048_.png (1.5 MB, 1080x1344)
>>
File: watdo.jpg (33 KB, 1085x615)
>>102956879
ok makes sense, but now I get the error in pic related.
do I have to change any settings?
>>
>>102956958
did you update comfyui?
>>
>>102956936
noobxl v5 pred has e621 images in its dataset so that should help fill in some of the more niche tags.

>totally rework my prompts that im using on pony atm
yes, if you use pony score tags on an illustrious model you will be labeled a jeet
>>
>>102955958
There is another option: they're just using upscalers, I mean 1696x960 is literally double 480p. The resolution looks like 480p but upscaled.
>>
>>102956977
I did
>>
>>102957013
Idk man, their API demo output look way better than what we have locally, something else is going on
>>
>>102957006
i swapped all of that out for the typical danbooru quality tags they suggest (and I already use typically) but still wasn't really getting what I was asking for from it
its in progress and still training or whatnot so will probably wait to fully judge it once the full model releases
ive had issues trying to get certain tags to show up in anime-only/anime-primary trained models so it might just be because of that in general when that's their focus
those models also seem to overcook quite a bit earlier at higher cfg, at least that's how illustrious seems to be
>>
>>102957016
oh what node did you get that error? ComfyUi colors the culprit node
>>
File: file.png (1.93 MB, 1024x1024)
>>
>>102957063
"SamplerCustomAdvanced"
>>
>>102955508
That explains a lot: even this guy doesn't have the hardware to train a full Flux model, and you think SD3.5 Large trains easily when it's just 4B smaller? I doubt it. The reality is his frankenmerge is better than the finetunes we've had till now, which shows that de-distilled is the path. Sadly, not many people have the hardware to do it, and the pony creator won't train a real model. Can't we do a fucking crowdfunding to rent cloud compute for a month and train our own model?
>>
any1 know some good chinese artists? maybe sana has lots of that in the dataset and would look really cool
>>
>>102957100
can you show a screen of your workflow? something's weird here
>>
File: watdooot.jpg (476 KB, 3592x1980)
>>102957124
>>
>>102957158
you haven't used my custom script, your LyingSigmaSampler doesn't have all the decimals, and my workflow was going for values that your current node cannot reach, that's probably why you got an error
>>
>>102957198
>you haven't used my custom script
but I did.
I opened it up with the editor and pasted it in and saved it.
>>
>>102957255
then why do you have 0.1 and not 0.01? you should restart ComfyUI to get it working
>>
File: error2t.jpg (345 KB, 3363x1982)
>>102957280
I did and I still get the same error
>>
>>102957301
can't help you further, I suggest you use a workflow that works for you and then reconstruct everything from it, instead of using my workflow which is somehow incompatible with yours
>>
>>102957048
Is call prompt enhacer, the same with replika and flux.dev and some faggot that have skill issue saying the same moths ago.
>>
>>102949176
Could you give me a picture of the blonde?
>>
>>102956067
I mean it doesn't natively support Kolors either TBF. Only Hunyuan for some reason I think.
>>
>>102954448
I think it'll see interest if it has strong baseline resolution support / image quality, even with the almost certainly worse prompt adherence vs 3.5 Large
>>
File: file.png (2.22 MB, 2590x1227)
https://xcancel.com/OpenAI/status/1849139783362347293
>We are sharing a new approach, called sCM, which simplifies the theoretical formulation of continuous-time consistency models, allowing us to stabilize and scale their training for large scale datasets. This approach achieves comparable sample quality to leading diffusion models, while using only two sampling steps.
Really interesting
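for context, the two-step recipe from the original (discrete-time) consistency models paper, which sCM's continuous-time formulation builds on: the network $f_\theta(x, \sigma) \approx x_0$ jumps straight from any noise level to a clean estimate, so a second step is just a partial re-noising plus one more jump:

$$x_1 = f_\theta(x_T, \sigma_{\max}), \qquad \tilde{x} = x_1 + \sqrt{\sigma_s^2 - \sigma_{\min}^2}\, z, \quad z \sim \mathcal{N}(0, I), \qquad x_0 = f_\theta(\tilde{x}, \sigma_s)$$

each step is a single forward pass, hence "two sampling steps" total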



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.