/g/ - Technology






File: the longest dick general.jpg (3.09 MB, 3264x1509)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102974813

I'm So Lonely Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: dena_sd35_00060_.png (970 KB, 1344x768)
>mfw
>>
File: 1724770886791051.jpg (484 KB, 1024x1024)
How do I run local frog memes?
Is there a LoRa for it?
>>
is this worth getting?

https://civitai.com/models/141592/pixelwave?modelVersionId=992642
>>
>>102987755
it's a legit finetune for artistically-inclined content if you want to gen that, yeah
>>
I don't see a legitimate reason to not use the same seed for every sampler.
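Fair point: with a fixed seed, every sampler draws the same starting noise, so any visual difference between samplers is down to the sampler itself. A stdlib-only sketch of the idea (`make_latent` is a hypothetical stand-in for the real latent init, not any actual UI's code):

```python
import random

def make_latent(seed: int, n: int = 4) -> list[float]:
    # Sampling starts from Gaussian noise drawn with the given seed;
    # with the same seed, every sampler sees the exact same starting latent.
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]

# Two runs with "different samplers" but the same seed share their starting
# noise, so any difference in the final image comes from the sampler alone.
assert make_latent(42) == make_latent(42)
assert make_latent(42) != make_latent(43)
```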
>>
File: 2024-10-26_00045_.png (1.55 MB, 720x1280)
>>102987737
It's not the same, but the Pepe lora for Flux can make some pepes.
>>
File: 1698877237844316.jpg (17 KB, 255x255)
>>102987979
Haven't delved into local besides fooocus, and swarm back when sd3 first came out.
Do I have to sell my soul to holocaust survivors in order to run Flux locally or just learn comfyUI?
>>
>
>>
File: PBS.png (585 KB, 1152x896)
>>
has anyone done a real close examination of flux dev fp8 vs nf4? I only use nf4 because fp8 doesn't really work for my small card but I get curious about what I'm missing
>>
>>102988021
comfyui can be learned in minutes
>>
*GASP*
>>
What are some style LoRAs that either don't exist yet or only exist in poor quality/SD 1.5/etc? Have some credits to burn.
>>
cough
>>
>>102988670
nf4 is 5% less coherent than fp8, which is 3% less coherent than bf16, which is 1% less coherent than fp16
hope this helps
>>
>>102988559
this would be a cool thing to animate with minimax
>>
>>102989289
Can you change purple color to chromatic yellow?
>>
>>102989289
How long do these take through the web api?
>>
new noobxl v0.75
>https://civitai.com/models/833294/noobai-xl-nai-xl?modelVersionId=998979
>>
File: ComfyUI_04638_.png (1.66 MB, 1024x1024)
>>102987979

It sorta works
>>
>>102989350
>Can you change purple color to chromatic yellow?
Here you go
She's cute

>>102989432
>How long do these take through the web api?
Like 4 minutes; H100s go brrrr
And it takes like 3 minutes to make a new Gmail account for 2 more gens
>>
>>102987755
he claims he hasn't used generated images in his dataset, so it's less poisoned than your average checkpoint.

The XL checkpoints of pixelwave had the most unique results compared to the other converging slop, which is why I have my hopes up for this one
>>
Why doesn't OpenPose in Forge or reForge work? I don't get it; it feels like user error.
I upload an image, I pick a preprocessor, use one of the models from hugging face, click generate.
Why does it fail?
RuntimeError: mat1 and mat2 shapes cannot be multiplied (154x2048 and 768x320)

this is from Forge, reForge's error is much less helpful.
>>
>>102989517
>brrr
Thanks, idk if you've seen the few examples on reddit of exploding things using a 4090, pretty impressive, as is the HD model.
We're going to be eatin' good, but slowly (lol) once the HD model is local, and if it ends up small enough after quanting, esp if the encoder is released at the same time.

https://i.imgur.com/5yPiXXD.mp4
https://i.imgur.com/CoINM7S.mp4
https://i.imgur.com/rAHVOay.mp4
https://i.imgur.com/3GwELbh.mp4
https://i.imgur.com/wGQx9oK.mp4
>>
>>102989592
Don't know for sure because I've not used that for a long time in Comfy, but I seem to remember OpenPose models for SD1.5 and SDXL both exist, and if the base model you're using doesn't match the OpenPose model type it won't work.
>>
>>102989665
Oh. So the ones from Ilya won't work if I'm using a Pony based model?
That's fucking obnoxious, thanks.
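For reference, that shape error is the tell: 2048 is SDXL's text-conditioning width (CLIP-L's 768 plus OpenCLIP bigG's 1280) being pushed into a ControlNet whose cross-attention expects SD1.5's 768. A stdlib sketch of the compatibility check (the helper is illustrative; the dimension table matches the public model configs as far as I know):

```python
# Cross-attention context widths for the common SD families.
CONTEXT_DIM = {768: "SD1.5", 1024: "SD2.x", 2048: "SDXL"}

def check_controlnet(base_dim: int, controlnet_dim: int) -> str:
    # "mat1 and mat2 shapes cannot be multiplied (154x2048 and 768x320)"
    # is this mismatch: SDXL conditioning (2048-wide) hitting an SD1.5
    # ControlNet projection that expects 768.
    base = CONTEXT_DIM.get(base_dim, "unknown")
    cn = CONTEXT_DIM.get(controlnet_dim, "unknown")
    if base_dim != controlnet_dim:
        return f"mismatch: {base} base vs {cn} ControlNet"
    return f"ok: {base}"

assert check_controlnet(2048, 768) == "mismatch: SDXL base vs SD1.5 ControlNet"
assert check_controlnet(768, 768) == "ok: SD1.5"
```

Pony is SDXL-based, so it needs SDXL-trained ControlNets; lllyasviel's original ControlNet releases were trained on SD1.5.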
>>
>>102989615
First one is great, maybe I'll gen some corpo office towers blowing up (cough ClosedAI cough) for my cyberpunk video too. Was the prompt shared for the exploding building gen?
>we're going to be eating good, but slowly
The eating gets truly good when the HD model is local AND someone makes PonyGenmoHD
>>
>>102989764
>prompt
No it wasn't, CliffDeNardo was the user who posted them.
>>102989688
idk, download both openpose variants and try and see.
>>
>>102989820
>No, it wasn't
boooooo more like CliffDeFarto
but whatever it should be a trivial thing to generate. Hopefully it's not blocked for terrorism or something stupid on the website...
>>
File: 439631825.png (1.58 MB, 1344x768)
>>
>>102989865
Michael Bay is the "king" of explosions in films, might help your prompt get past any censor filter if you have problems.
>>
File: 196009930.png (1.64 MB, 768x1344)
>>
File: 306890738779299843.webm (553 KB, 720x720)
>>
File: 1939923965.png (1.58 MB, 896x1152)
>>
File: 1805667023.jpg (3.54 MB, 2048x2048)
>>
File: 00009-3364060907.png (1 MB, 1024x1024)
>>102989688
>>102989820
well that took some fucking around and googling but i got it working. thanks anon, i would have been slamming my head against that for hours trying to figure out what I was doing wrong.
Keeping it simple right now until I get the hang of it.
>>
When it comes to learn the math and read the code, what's the best open source project that still gives ok results?
>>
>>102990804
https://github.com/huggingface/diffusers
https://github.com/huggingface/transformers
>>
mochi gets a 5% speedup on cublas with fp8; gguf Q8 and Q4 throw up "cublas api failed", reee. (I saw today's Q4 fix, I'm up to date)
>>102990763
>head slamming
np anon, small steps, happy prompting!
>>
>>102990931
>diffusers
Thanks. Got a pitch for why this is a better approach than another repo, even on huggingface?
>>
File: 1520640549.jpg (3.22 MB, 2304x1792)
>>
Just want to say it again, you can full finetune Flux 8B on 24 GB of VRAM with Kohya without block swapping.

>ayjank
>>
>>102989473
<3 She's beautiful!
>>
I'm giving training SD3.5 a shot
>>
>>102988021
just download a workflow and flux, you can run it on 12GB of VRAM almost losslessly
now, you'd need a pepe lora; I'm not sure if there are pepe loras for flux or sd
>>
I'm gonna go outside my comfort zone and try to generate.... 3girls
>>
Can I run a different prompt at certain steps? ie can each step have a different prompt? Obviously, typically I'd only want to change the prompt once or twice.
>>
>>102990223
How about without smiling?
>>
File: ComfyUI_Flux_14894.jpg (192 KB, 704x1472)
>>
>>102990763
https://files.catbox.moe/q7k4ia.png
catbox because I feel like this might be a bit to risque for a blue board.
I'm just impressed at how well it did with the glasses, these are the best looking glasses I've seen come out of AI. I didn't even ask for them.
>>
>>102991769
Yeah
>>
>>102991125
>Got a pitch why this is a better approach than another repo
Developed by professionals, not hacked together by amateurs
>>
File: ComfyUI_Flux_14946.jpg (248 KB, 704x1472)
>>
"group of three" doesn't work, gonna try prompting for crowds
>>
File: 2024-10-27_00001_.png (754 KB, 720x1280)
>>102991720
>>
>>
>>102992074
is this how to do it?
>Bracket notation: Use brackets [] to define changes in the prompt. For example: [large::0.1] [cat|dog:0.05] will start with “a large” and switch to “cat” or “dog” at 0.05 steps.
>>
>>102992325
godspeed anon you can do it.
>>
File: file.jpg (302 KB, 944x1280)
>>102992357
pepe and warhammer loras were fun for a few gens
>>
>>102992367
>>102992074
trying
>anime Pepe [dog:cat:10]
>>
>>102992399
topkek

I need to try inpainting with pepe
>>
>>102992442
Still genning, but clearly it just did the same as if
>anime Pepe dog cat

It's not changing the prompt midway.
>>
>>102990223
Man, it's so clean, very nice, compared to the local results in >>102989615, which seem only good as long as there are no people in the scene.

I wonder if that means mochi was mostly trained on non-copyrighted stock footage without anyone in it.
>>
>>102992357
I say this and look like that you got me
>>
File: 2024-10-27_00002_.png (715 KB, 720x1280)
>>102992442
The result is clearly not the prompt changing from
>anime Pepe dog
to
>anime Pepe cat

but is just
>anime Pepe dog cat
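That's expected: `[from:to:when]` is A1111/Forge prompt-editing syntax, and the stock ComfyUI text encoder doesn't parse it; the brackets just get fed to CLIP, which is why the gen reads as "anime Pepe dog cat". The scheduling rule itself is simple. A stdlib-only sketch of it (function names are made up; real prompt editing swaps the conditioning mid-sampling rather than just the string, and in Comfy you'd need a custom node for that):

```python
import re

# One A1111-style edit: [from:to:when]
EDIT = re.compile(r"\[([^:\[\]]*):([^:\[\]]*):([0-9.]+)\]")

def prompt_at_step(text: str, step: int, total: int) -> str:
    # "when" < 1 is a fraction of total steps, otherwise an absolute step;
    # before the switch point use "from", at/after it use "to".
    def sub(m: re.Match) -> str:
        before, after, when = m.group(1), m.group(2), float(m.group(3))
        switch = when * total if when < 1 else when
        return before if step < switch else after
    return EDIT.sub(sub, text)

# ">anime Pepe [dog:cat:10]" over 20 steps:
assert prompt_at_step("anime Pepe [dog:cat:10]", 5, 20) == "anime Pepe dog"
assert prompt_at_step("anime Pepe [dog:cat:10]", 15, 20) == "anime Pepe cat"
```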
>>
>>102992325
Could try "group of friends", useful for generating less "samey" faces in SD compositions with multiple people; might translate across.
>>
File: Mochi_preview_00015.webm (589 KB, 856x480)
I hope mochi hd version is good
>>
File: 155616_00001.webm (1.64 MB, 854x480)
fp8, 100 steps, cublas, 67 frames, 2501s, 4060ti
>>
>>102992967
yjk
>>
File: fluxUPscale_0127.jpg (1.06 MB, 1544x2696)
my gang
>>
>>102987979
>>102988021 (me)
>>102991720
>>102992399
So if I were trying to make a quick guide for autistic gamers with 12gb vram
it would be something like...

> Learn ComfyUI https://github.com/comfyanonymous/ComfyUI
> Get flux https://comfyanonymous.github.io/ComfyUI_examples/flux
> Get the Pepe LoRa https://civitai.com/images/24172831
> ?????
> PROFIT
>>
>>102993255
seems p straight forward to me
>>
>>102993255
I'd go with Forge instead
>>
>>102992372
>godspeed anon you can do it.
might have to just repeat "group of 3 girls" over and over, genmo has maxed out at 2 over 10 attempts

>>102992646
>Could try "Group of friends" useful for generating less "samey" faces in SD compositions for multiple people, might translate across.
I'll try "group of girl friends", sometimes even with two girls it makes one guy and one girl
>>
File: 165704_00001.webm (647 KB, 854x480)
>>102992990
That's disgusting anon, how could you even think of something like that?
>>
>>102993531
maybe try to describe each person?
>>
>>102993531
>>102992360
>>102992325
Alright now it's radical.
>>
>>102993671
>maybe try to describe each person?
Not a bad idea, I'll make one blonde one brunette one black hair, but I'll need to try again in 2.5 hours

>>102993679
>Alright now it's radical.
if prompt adherence was better genmo would be wicked tubular
>>
>>102993531
I'll try 3 girls on local, about 40 mins to generate, because I like the quality of 100 steps on fp8
>>
>captioning some pics
>The image is a digitally rendered scene from a video game, specifically from the 2006 game "Saya no Uta."
I-it knows?
>>
File: ComfyUI_11939_.png (1.34 MB, 776x1024)
>3 months after Flux.1
>still not a word about Flux Video
It's vaporware, isn't it?
>>
>>102994081
BFL are very secretive about their work, hell none of us even knew they were doing anything, or even existed, until Flux released.
It would be funny if they released it right after the election though.
>>
>>102994081
don't trust these hoes everything is vaporware until it's released
>>
File: ComfyUI_Flux_15008.jpg (194 KB, 704x1472)
trying to gen me a new phone wallpaper
>>
Death march
>>
>>102994172
Get yourself one of those gallery widgets and have it scroll through all your fav gens.
>>
File: ComfyUI_04651_.png (2.01 MB, 1024x1024)
>>
File: 181101_00001.webm (520 KB, 854x480)
>>102993531
>3 teenage irish young-women wearing traditional clothes are sitting in a small 16th century cottage, in front of each one of them is a spinning-jenny which they are using to make cloth, a fireplace burns in one corner and bolts of cloth are piled up against one wall. A boarder collie dog is laying down asleep in front of the fire.
Seemed to work ok, but I had to close the browser so am waiting 15 mins to post this; I'll start another "3 girls" gen, should be 30 mins from this post.
>>
>>102994721
Looks like I need to rework my prompts. I'll throw it into an LLM for modifications once my 6 hour limit expires
>Seemed to work ok, but i had to close browser so am waiting 15 mins to post this, i'll start another "3" girl gen, should be 30 mins from this post.
I saw a thread where you can get a temporary inbox for Gmail addresses, maybe find it in the archives if you're interested
>>
File: ComfyUI_04656_.png (1.43 MB, 1024x1024)
>>
>>102988021
just use flux, comfy is mostly autism for tinkerers
>>
File: ComfyUI_04663_.png (1.81 MB, 1024x1024)
>>
File: 191051_00001.webm (2.64 MB, 854x480)
>>102994800
Thanks, I may have to do that; I know iCloud or whatever it is lets you make millions of aliases a day or whatever.
I'd focus on putting the most important thing at the front of the prompt
>3 mexican women wearing traditional clothes are at an open air cooking area preparing a large meal for eating doors, they are underneath a large shade, stirring a large cooking pot and chopping up vegetables and meat and preparing soft tortilas on a floured surface, they are happy and smiling at the camera as they work, it is a sunny day.

Turned out largely as expected.
>>
>>102994721
Poor dog...
>>
>>102994968
>largely
kek
>>
>>102994975
ikr, the other half of him is in front of the fireplace >< maybe he's magical and warming his fuzzy trotters... idk
"Front", "back" and "behind" are understood better by models now, but there's room for improvement.
>>
>>102994975
Like you're so much better at dog storage.
>>
>>102994919
why are you here if you're not an autistic tinkerer?
>>
>>102994986
Yeah
Human: AI, what does the average mexican woman look like to you, describe her?
AI: She's fat <end of line>
>>
File: ComfyUI_04668_.png (1.77 MB, 1024x1024)
>>
File: 2024-10-27_00003_.png (1.08 MB, 720x1280)
>>102994919


>>102995019
cool! luv books, me
>>
>>102989289
>coherence
not really usable info for me. I was wondering more whether it shows better grasp of niche/subtle prompt terms, rare concepts, does it display wider potential of gens, can it push higher cfg without cooking, is it less prone to sameface, etc
>>
What's Illustrious? I've seen LoRAs appearing on civitai that are marked IL for something called that.
>>
>>102989143
Early video game FMV. Think command and conquer video games, the opening of the original resident evil, many adventure games, wing commander 4 etc. But I'm not sure it's easy to make a great lora for, because of the varying low quality artifacts and all that shit.
>>
File: ComfyUI_34424_.png (903 KB, 848x1024)
>>
>>102995106
That's actually pretty similar to what I'm doing (a distorted CRT hyper-stylized 90s JRPG/VN aesthetic). I have had zero success in local captioning so I'm slowly plugging through it manually, hopefully I'll have a lora by tonight.
>>
>>102995083
confidence intervals are going to overlap for each of the quants so you'll never be able to get a statistically significant result anon. That's why I gave you the answer I did.
>>
>>102995127
Sounds cool, but wouldn't that be more prerendered 3D animation and 2D animation oriented, rather than actors on a set or green screened onto a prerendered 3D background?
>>
>>102995342
Oh yeah you're right, forgot what FMV meant, was picturing cutscenes broadly.
>>
>>102994968
>Thanks, i may have to do that
I was struggling getting a nice looking prompt until I had Claude help me out a little
>largely
I wonder if using "Mexican" as a token compared to "latina" weighs it more (lmao) towards making fat people
>>
File: 2024-10-27_00004_.png (1.38 MB, 720x1280)
>>102995378
Could be random. Here's Flux with your prompt.

This may sound stupid, but is mochi not good at img2img? It sounds to me like a video model would literally be img2img.
>>
>>102995394
flux is biased towards making beautiful people. genmo is too but not as much since video has to fundamentally pull from more "base truth" about reality than a model trained on Instagram pics
At least that's my theory

Mochi can't do image2image or image2video because the VAE encoder was not released, according to another anon
>>
File: ComfyUI_04691_.png (1.62 MB, 1024x1024)
>>
>>102995423
>flux is biased towards making beautiful people
to some extent all image gen is because people are biased towards reposting images of beautiful people. Ugly people post their cats as their profile pics. etc.
>>
>>102995423
ok, how about this?

>beautiful, pretty, good looking

in the negative


also I forgot I'm using a painting lora, Leighton.
>>
>>102995458
yeah you're right it's not just flux
my point is that if you scrape 1000 videos of the real world, most won't be photoshopped (maybe they'll look nicer with saturation but the people will look like they do irl) but if you scrape 1000 images of people a sizable amount will be facetuned

>>102995463
idk anon try it out. from what I remember with other anons experiments flux doesn't really understand what it means to be "ugly" so you should prompt stuff like "tired, acne, blemishes" etc
>>
File: 2024-10-27_00005_.png (1.37 MB, 720x1280)
>>102995463
>>102995423
>>102995394
>>102995378
I think it's WAY nicer.
>>
>>102995544
>>102995458
imo these models tend towards averaging featurings, not in the sense that they'll make average faces, but that the average of a 1000 faces is symmetrical and safe-attractive
outlier attractive faces are the ones that stick in people's memories, and the models just have trouble with that
>>
>>102995586
>averaging features*
>>
>>102995586
juggernaut was able to teach SDXL to only make good looking people when "beautiful" is in the prompt, and normal looking imperfect people otherwise so it's definitely possible
>>
>>102995579
now s/3 mexican/3 obese mexican
>>
File: 2024-10-27_00006_.png (1.35 MB, 720x1280)
>>102995639
wow

Now genning the same, but with a more extensive negative:
>beautiful, pretty, good looking, gorgeous, lovely, stunning,
attractive, exquisite, elegant, charming, alluring, radiant, resplendent, comely, fair, pleasant, dainty, delightful, captivating, enchanting
>>
>>102995586
No, the problem is these models filter based on aesthetics ratings which bias towards professional photography. Also I'm pretty sure their dataset is biased towards certain actual models (barely disguised creator's fetish). If you actually train a model on random Flickr photos without cherrypicking, you will get more realistic people.
>>
>>102995586
I think the models are capable of extrapolating outside of the mean. So like:
sample 1: 5
sample 2: 500
sample 3: 414
I think the model will consider 600 reasonable, and even 8 not that likely. idk, something like that, I think the training is not the boundary.

not sure, but I think so.
>>
>>102995728
notice the food improved :^)
>>
File: 2024-10-27_00007_.png (1.36 MB, 720x1280)
>>102995728
result.

The next one is a dichotomy, we'll see what it does...
>>
>>102995544
>yeah you're right it's not just flux
To be clear, I completely agree that Flux is worse than average in this regard. I don't know why I'm arguing the opposite.

FLUX has an enormous untapped knowledge of average and ugly people too, but for some reason you really need to force it with tricks because prompting naively with an empty latent is unlikely to get you there. I don't generally try to gen ugly chicks but I do try to gen women who aren't "instagram pretty", and at first I found Flux very frustrating and didn't like it. Now it's all I use.
>>
>>102995866
skinny apparently is synonymous with pretty, in Flux.
>>
>>102995836
(I canceled the experiment) tldr Flux can do a ton, it's crazy.
>>
>>102995866
What do you use to make chubs?
>>
>>102996007
just with the word "chubby" usually, in a construction like "chubby in a good way" "chubby in all the right places" etc., some conventional way of saying hot-chubby. The more important thing is the rest of the prompt setting the scene in a way that suggests a normal person's facebook pic
>>
File: 1708836395014367.jpg (22 KB, 400x400)
>>102994919
How to apply a LoRA without local tho?
I'm a noob to Flux or anything beyond fooocus
>>
>>102996157
>without local
This is /ldg/, I have only done local.
>>
File: file.png (305 KB, 1962x1444)
>red_panda
Chinaman cometh?
>>
>>102996732
That's mostly a test of adherence, since the prompts are complex. Better adherence is very good news potentially.
>>
File: 1708772641460420.jpg (56 KB, 568x568)
>>102996677
chegged
am about to try forgeUI after struggling with swarmUI
got any tips?
>>
>>102996771
Wish I did, I use ComfyUI
>>
Is there any reason I can't or shouldn't use one or more Tesla p40s with an RTX 4060ti or any other gayman card?
>>
File: ComfyUI_temp_zgmyc_00005_.png (2.72 MB, 1152x1920)
>>
>>102996844
bräääääp
>>
>>102995107
get the fuck off my keyboard im trying to prompt
>>
File: ComfyUI_temp_zgmyc_00012_.png (2.83 MB, 1152x1920)
anon, your tongue, my soles, now!
>>
>>102996965
feet are gross id rather tongue her butthole
>>
anyone find a way to stop mochi from doing that zoom effect? I want the camera frame to be locked.
>>
>save outputs as .jpg
>temp folder wiped clean
>workflow completely lost
god FUCKING damnit
>>
>>102996965
The human being in this photo is alright (besides having left her hand on the other stool after losing it in a terrible accident) but man every background detail is just fucked.
>>
File: 1708584205805588.png (2.18 MB, 1280x1280)
>>
File: Cog_00008.webm (504 KB, 720x480)
Hot load
>>
File deleted.
>>
File: ComfyUI_temp_zgmyc_00035_.png (3.66 MB, 1152x2160)
>>
File: 306809434104672259 2.webm (977 KB, 1280x720)
I need to get better. I find it difficult to zoom out and also keep her tongue stable.
>>
I'm gunna pull
>>
File: ComfyUI_temp_zgmyc_00042_.png (3.72 MB, 1152x2160)
>>102997514
how long does it take you to gen that or are you using the online service?
>>
File: 1717288406449638.png (6 KB, 310x107)
>>102996844
i went to school with her
>>
AI isn't really a mature tech until I can take a small-breasted normie's instagram pic and use it as the first frame to generate an r/biggerthanyouthought video where she reveals she had enormous breasts all along
>>
Using artificialanalysis.ai's arena at the moment, I occasionally get pictures it attributes to a model called "neptune_next" but it's not on their leaderboard. This some new thing?
>>
>>102997629

This is already within reach
>>
File: 002317.jpg (2.16 MB, 1664x2432)
>>
>>102997610
Online.

Dancing around banned words and phrases.

I actually had a lot of success taking a picture of a room and adding a main character interacting from my POV
>>
File: 00001-3874033555.png (1.04 MB, 1344x768)
>>
>>102997714
why he sad
>>
>>102997358
what teh fuck
>>
File: ComfyUI_00158_.png.webm (1.4 MB, 848x480)
>>
File: 00005-3062886786.png (1.04 MB, 1344x768)
>>102997747
Hes in a war.
>>
>>102997854
A big poo
>>
File: 002323.jpg (2.94 MB, 1664x2432)
>>
File: ComfyUI_temp_kosgp_00003_.png (3.5 MB, 1152x1920)
>>
what sampler do you use? I'm still using AYS
>>
File: ComfyUI_temp_xjqly_00015_.png (3.37 MB, 1152x1920)
>>
>>102998436
i havent tried the last 22 samplers that were added
for schedulers: https://github.com/Extraltodeus/sigmas_tools_and_the_golden_scheduler
>>
>>
>>
>>
>>102998287
>>102998592
THIS THREAD IS FOR AI IMAGES ONLY
>>
>>102998592
Good shit. Could you do one with a bit of a tummy?
>>
>bob cut
>mask
>simple shirt with a skirt
What is this archetype called? You see them all the time in those fake upskirt videos.
>>
File: SO HECKING COOLERINO.jpg (210 KB, 1540x850)
>>102994919
Comfy is the opposite, it's a shitty nu-design trash heap for people who can't handle anything that isn't putting blocks in shaped holes-tier googoo shit.
>>
>>
googoo gaagaa
>>
>>102998637
Holy kino.
>>
>>
File: 1727499707438309.jpg (167 KB, 1024x1024)
>>102994919
>>102996677
>>102998637
Explain like am tarded
>I am
What app do I use besides fooocus to run flux?
>or can fooocus just run it..?
>>
>>102998828
I just followed the ComfyUI instructions on how to install. You aren't on Linux, so my special problems won't relate to yours. The instructions will have you genning first with SD 1.5. This basically will show that your gpu is working and ComfyUI is working.

as for other apps, no idea.

Anyway, after getting 1.5 working, you look at guides to getting comfyUI working with flux.
>>
File: 2024-10-27_00013_.png (1 MB, 720x1280)
>>
File: 2024-10-27_00014_.png (1.16 MB, 720x1280)
>>102998957
Flux doesn't know the vulcan salute?
>>
>>102992357
nice
>>
File: 002353.jpg (2.74 MB, 2432x1664)
>>
File: 002357.jpg (1.89 MB, 2432x1664)
>>
File: 1700859425328903.png (760 KB, 768x768)
>>
>>102987755
>https://civitai.com/models/141592/pixelwave?modelVersionId=992642
the examples arent bad
>>
File: 1718156774449445.jpg (636 KB, 1280x960)
>>
>>102998828
Anon I use forge. I followed this guide, though it's a bit dated.
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/981
It specifically tells you how to use the BNB NF4 version though, which might not be best for you.
>>
File: file.png (2.64 MB, 1024x1024)
>>102987755
Idk man, I really like the fact it does a lot of styles, but we lost a lot of detail in the process. Detail is one of Flux's biggest strengths, and this finetune kinda destroyed that
>>
File: ComfyUI_00221_.png.webm (1.95 MB, 848x480)
>>
>>103000471
that's local mochi right? Damn I can't wait to try out the image2video, that'll be fucking epic
>>
>>
File: ComfyUI_00252_.png.webm (1.07 MB, 848x480)
>>103000477
yeah this is running locally. I don't get why this isn't more popular because it's actually extremely good.
>>
File: file.webm (2.16 MB, 856x480)
>>103000530
>I don't get why this isn't more popular because it's actually extremely good.
it's asking for a lot of vram, it's really slow, and for realistic shit it can easily shit the bed; it's nowhere near the level of what they displayed: https://x.com/genmoai/status/1848762405779574990#m
>>
>>103000530
It's low resolution, doesn't do img2vid, someone told me it has scary requirements needing giga graphics cards, and there's no porn finetune.
That's my guess, anyway. Honestly, I'd download it in an instant if I believed it was worth trying on a 3060. I believe the next versions will be better, and more popular.
>>
>>103000530
> I don't get why this isn't more popular
requires a lot of vram and takes a long time to gen, experimentation is unviable unless you have infinite patience or are using some of the top hardware available.
>>
>>103000549
>I'd download it in an instant if I believed it was worth trying on a 3060.
it works on 16gb of vram, but not on a 3060, at least if you want decent quality with Q8_0
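Napkin math on why Q8_0 fits in 16gb but is tight below that (assuming the ~10B parameter count mentioned elsewhere in the thread; GGUF Q8_0 stores 8-bit weights plus a per-block fp16 scale, about 8.5 bits per weight):

```python
params = 10e9              # mochi's transformer, per the thread (assumption)
bits_per_weight = 8.5      # GGUF Q8_0: 32 int8 weights + one fp16 scale per block
weights_gb = params * bits_per_weight / 8 / 1e9
assert abs(weights_gb - 10.625) < 1e-6
# ~10.6 GB for the weights alone, before activations, text encoder, and VAE,
# which is why 12gb cards like the 3060 struggle at this quant.
```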
>>
File: ComfyUI_00070_.png.webm (1.74 MB, 848x480)
>>103000542
it does realistic fine
>>
>>103000530
we're waiting for img2vid and the HD version, because so far what we have locally is way better than CogVideoX, but not quite good enough to be truly enjoyed

For img2vid, it'll definitely be a thing; we need the VAE encoder and they'll provide that in the near future
https://github.com/genmoai/models/issues/9#issuecomment-2430585334
>We will be open-sourcing the encoder, stay tuned :)
>>
File: file.jpg (1.64 MB, 3840x3210)
Have you guys tried euler_ancestral (it's working on flux now) and the new "linear_quadratic" scheduler?
>>
>>103000455
>its details are amazing, and this finetune kinda destroyed that
yeah I came to the same conclusion on that model as well, which is a shame because that guy made the first real finetune of flux; it took him 5 weeks, he wanted to make it less rigid with styles, and he had the best spirit of them all. At least it shows that it's possible to finetune Flux, because he went for the distilled version
>>
File: ComfyUI_00115_.png.webm (1.24 MB, 848x480)
>>
>>102994081
>It's vaporware, isn't it?
It is, they just used us as free advertisement and then got the money in the bank with Twitter's partnership; they won't release anything locally anymore. Genmo and its Mochi HD are our only hope for videogen now
>>
>>102994081
BFL are finishing up raising a 100 million dollar round, they don't give a shit about local anymore.
>>
>>103000530
>I don't get why this isn't more popular
it's just too fucking slow, waiting 40 min for a 5 sec video is too long. I get it, it's a fucking 10b model and it has to make 163 frames at at least 50 steps, but imagine we made a Turbo LoRA like we did for Flux: we could get those videos in 10 steps max, and the wait would be way more acceptable
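The step math backs that up. Assuming sampling time scales roughly linearly with step count (ignoring fixed costs like model load and VAE decode), a hypothetical 10-step turbo LoRA turns the wait from painful into tolerable:

```python
# ~40 min for 50 steps today (numbers from the thread)
full_minutes, full_steps = 40, 50
turbo_steps = 10           # hypothetical distill/turbo LoRA target
turbo_minutes = full_minutes * turbo_steps / full_steps
assert turbo_minutes == 8.0   # a 5 sec video in ~8 min instead of ~40
```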
>>
>>103000684

I cannot get cublas to work with mochi ggufs on my 16gb 4060ti and it's doing my head in.
Tried combos of batch/frame sizes, attentions, precisions etc, every choice that's available in the nodes.
New envs, new installs/compiles of cublas and so on, all torch cuda compatibility checks passed, hours with gpt4o, always "status error 1", which is a generic error, so idk what else to do at this point. goddamit, i want my 40-50% speedup! reeeee
>>
>>103000710
did you install a C compiler (visual studio for example) and cuda toolkit on your computer?
>>
>>103000718
I'm using linux, i have gcc installed, pretty sure i have the cuda toolkit, I seem to remember i couldn't use nvidia-smi without it being installed, I'll check both again though.
>>
File: file.webm (635 KB, 1360x752)
635 KB
635 KB WEBM
https://flux-ai.io/flux-video-ai/
wait what? they finished it already?
>>
File: ComfyUI_00184_.png.webm (1.25 MB, 848x480)
1.25 MB
1.25 MB WEBM
>>103000684
Use the latest ComfyUI and do 25 steps + simple scheduler + euler + cfg 6.0. That's how all these videos were generated.

>>103000745
Lol that's not their real website.
>>
>>103000770
>Use latest comfyUI and do 25 steps + simple scheduler + euler + cfg 6.0. that's how all these videos were generated.
but ComfyUI's VAE doesn't support VAE tiling, so it'll overflow 24gb cards no?
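for reference, spatial VAE tiling is conceptually just decoding the latent in chunks so peak VRAM is bounded by one tile instead of the whole frame; a pure-numpy sketch with a dummy decoder (the 8x upscale factor, 12 latent channels, and tile size here are assumptions, and a real implementation also blends overlapping tile borders to hide seams):

```python
import numpy as np

def fake_decode(tile: np.ndarray) -> np.ndarray:
    # stand-in for the real VAE decoder: maps (c, h, w) latents to (3, 8h, 8w) pixels
    c, h, w = tile.shape
    return np.zeros((3, h * 8, w * 8), dtype=np.float32)

def decode_tiled(latent: np.ndarray, tile: int = 32) -> np.ndarray:
    # decode tile-by-tile and stitch rows back together; peak memory is one tile
    c, h, w = latent.shape
    rows = []
    for y in range(0, h, tile):
        row = [fake_decode(latent[:, y:y + tile, x:x + tile])
               for x in range(0, w, tile)]
        rows.append(np.concatenate(row, axis=2))
    return np.concatenate(rows, axis=1)

out = decode_tiled(np.zeros((12, 64, 96), dtype=np.float32))
print(out.shape)  # (3, 512, 768)
```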
>>
>>103000718
Yup both there.
>>
>>103000788
your cuda toolkit has the same version as your pytorch, right? Cuda 12.0 sounds sus, usually it's 12.1 or 12.4 on pytorch
>>
>>103000794
no, the toolkit is 12.0 and torch's cuda version is 12.4
I guess update the toolkit or downgrade torch?
>>
>>103000845
update toolkit to 12.4 yeah
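for anyone else hitting this: compare `torch.version.cuda` against what `nvcc --version` reports; a minimal helper (the version strings below are just examples):

```python
def cuda_match(toolkit: str, torch_cuda: str) -> bool:
    # compare major.minor only, so "12.4" vs "12.4.1" still counts as a match
    return toolkit.split(".")[:2] == torch_cuda.split(".")[:2]

# in a live env you'd feed it torch.version.cuda and the version from `nvcc --version`
print(cuda_match("12.0", "12.4"))    # False -> the mismatch above
print(cuda_match("12.4", "12.4.1"))  # True  -> after updating the toolkit
```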
>>
tomorrow is sd 3.5 medium day
>>
>>103000881
What's to be hyped about? The 8b model wasn't that good, how can the 2b change anything?
>>
>>103000890
the 2b uses a different architecture, the 8b is probably something they shat out to save face
>>
>>103000530
It's because I have a 2060 12GB
>>
>>103000900
must be a miraculous architecture then, that poor 2b model will have to be competitive with Flux
>>
>>103000573
Those are some flappy legs
>>
>>103000905
>that poor 2b model will have to be competitive with Flux
no it doesn't. it just needs to be a decent 2b model, that's all. nobody is expecting a 2b to be on par with flux
>>
>>103000936
So basically SAI decided to give up, they don't want to make SOTA local models, they admit that Flux has beaten them, what a sad day that is
>>
>>103000936
>it just needs to be a decent 2b model
no it doesn't, at that range you compete with XL finetunes, it should have been a 4b model instead
>>
>>103000960
>compete with XL finetunes
if it's a better base finetuners will just switch
>it should have been a 4b model instead
yeah i think so too
>>
>>103000573
That fucking thing on the left crawling into frame is actually disturbing
>>
>>103000953
>pay employees to make SOTA
>employees create SOTA
>employees decide to keep SOTA for themselves
>give you shit instead
>employees leave with SOTA
>release it as their own
>>
>>103000996
wait, you think the BFL fags made Flux when they were working for SAI? lmao if it's true that's fucking based
>>
File: file.png (1.9 MB, 960x960)
1.9 MB
1.9 MB PNG
https://x.com/deedydas/status/1850680875832496577#m
>New image generation model red_panda is #1 by over 100 ELO points on the Artificial Analysis leaderboard.
>Rumor is it's the new Flux model, Midjourney 7 or a Chinese lab!
Oh shit look at that image, damn...
>>
File: file.png (129 KB, 1583x960)
129 KB
129 KB PNG
>>103001054
https://artificialanalysis.ai/text-to-image/arena?tab=Leaderboard
HOLY MOG
>>
File: file.png (30 KB, 1366x246)
30 KB
30 KB PNG
>>103001068
wtf, why is dalle3 so low in the rankings? it barely beat SD3M
>>
File: file.png (816 KB, 590x698)
816 KB
816 KB PNG
>>103001054
>>103001068
>>103001083
kek, if this is true, then Twitter won't need Flux anymore, and the BFL fags will need us again to be relevant, prepare to have a good new Flux local model soon *inhale copium*
>>
>>103001083
I did 60 runs on that website today, and the answer is that it's fucking shit. I don't think a single Dalle3 example got close to the prompt, and it usually looked like slop.
>>
>>103001185
>I don't think a single Dalle3 example got close to the prompt, and it usually looked like slop.
that's weird, dalle3 is known to be one of the best at following prompts
>>
>>103001194
I only got given Dall-E 3 a handful of times in those 60, it could just be bad luck on its part.
>>
File: file.jpg (1.29 MB, 3840x3210)
1.29 MB
1.29 MB JPG
>>103000596
>Have you guys tried euler_ancestral (it's working on flux now) and the new "linear_quadratic" scheduler?
kek forget about it, those are memes
>>
>>103001083
Flawed methodology, ELO doesn't work when there's no draw. Psychologically some people will just always pick A, B or randomly. Images can be cherrypicked to influence the result.
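for context, the standard win/loss Elo update looks like this (K=32 is just a common example value); without draws every single vote moves the ratings, so systematic "always pick A" voters still shift the scores:

```python
def elo_update(ra: float, rb: float, a_won: bool, k: float = 32.0):
    # expected score of A from the logistic curve, then symmetric rating transfer
    expected_a = 1.0 / (1.0 + 10 ** ((rb - ra) / 400.0))
    score_a = 1.0 if a_won else 0.0
    delta = k * (score_a - expected_a)
    return ra + delta, rb - delta

ra, rb = elo_update(1000.0, 1000.0, a_won=True)
print(ra, rb)  # 1016.0 984.0
```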
>>
>>103000881
Medium hype
>>
>>103001296
>ELO doesn't work when there's no draw
now that you say it, that's true that chess has draws too
>>
What's the current advice for training a flux lora for a character?
>>
File: file.png (3.61 MB, 1344x1728)
3.61 MB
3.61 MB PNG
https://civitai.com/models/652699/amateur-photography-flux-dev
that's really impressive, that dude knows how to make Loras
>>
>>
>>103000851
did that, same error, and i recompiled cublas for good measure after the update. made sure CUDA_HOME is pointing correctly and a few other things along the way.
>>
>>103002087
I've run tests in python in the env to check if cublas is working, matrix multiplication and so on, all fine. Just won't work for me in comfy on gguf models. If i use non-gguf it doesn't error out, but there's probably a flag not being set to use it when the model isn't gguf, which may be why; i've used the gguf models successfully without it and i don't know if they have to be made a certain way for cublas for specific cards, kind of like a TensorRT model.
Quite lost now.
fails on line 323 of execution.py
>gpt4o Thanks for providing the detailed error message and stack trace. The information indicates that the error occurs when calling the get_output_data() function in the MochiSampler node type, specifically during a processing step involving CUDA operations.
Think I'll just give up for now and see what happens when more people have errors using it locally on non-4090s, as i seem to be the only one with the problem rn
>>
File: file.png (2.01 MB, 1024x1024)
2.01 MB
2.01 MB PNG
>>
>>102987755
>I fine tuned version 03 from base FLUX.1-dev for over 5 weeks on my 4090. It is able to do different art styles, photography, and anime.
I don't get it, I thought it was impossible to finetune Flux because it was distilled
>>
>>103002551
Distillation is just a training process; models can be trained to do anything, including being un-distilled: instead of training strictly at CFG 1 you train it using the normal CFG process. Of course it's going to be ass until it learns how to do CFG again, and you really need to be training it on a massive dataset so it doesn't regress.
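i.e. the usual classifier-free guidance combine; a distilled model bakes this into a single forward pass, a de-distilled one has to go back to producing separate cond/uncond predictions that get combined like this (toy numpy sketch, not the actual Flux code):

```python
import numpy as np

def apply_cfg(uncond: np.ndarray, cond: np.ndarray, scale: float) -> np.ndarray:
    # classifier-free guidance: push the prediction away from the
    # unconditional output, in the direction of the conditional one
    return uncond + scale * (cond - uncond)

uncond = np.array([0.0, 0.0])
cond = np.array([1.0, 2.0])
print(apply_cfg(uncond, cond, 3.5))
```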
>>
>>103002890
yeah I get that, but people couldn't stop saying that doing a real finetune of the distilled flux dev model was impossible because it was distilled, yet he managed to make a fine finetune with it, how did he do it?
>>
>>103002911
What do you mean "impossible"? It's impractical. A full finetune on a 4090 is like 15 seconds a step at batch size 1, you need millions of steps to do it properly, and Flux is a very, very fragile model so you also have to use a ridiculously small learning rate.
>>
>>103002932
>What do you mean "impossible", it's impractical.
I'm just reporting what people said in this thread over and over again, "you can't finetune flux, it's distilled"; to me that sounded like an impossible task, that's all
>>
>>103002950
I think you just want to be a smartass faggot
The dedistilled models still suck ass, don't know if you noticed, unless you like your crispy ass oversaturated outputs
>>
File: mipu.jpg (228 KB, 1024x1280)
228 KB
228 KB JPG
>>
>>103003201
OMG IT PEPOGU
>>
File: file.png (887 KB, 1024x1024)
887 KB
887 KB PNG
https://civitai.com/models/739676/chudjak-flux-dev
kek
>>
>>103002299
kinda got it working now, very low number of frames (39?) before it ooms (16gb), but at least it's better for test runs theoretically. I probably need to reboot though. Thanks to the anon that set me on the right path.
>>
>>103003632
>very low number of frames (39?) before it ooms (16gb)
fp8?
>>
File: file.webm (792 KB, 1080x720)
792 KB
792 KB WEBM
https://huggingface.co/NimVideo/cogvideox-2b-img2vid
>Fine-tuned on 10 million videos for high-quality generation at SBS levels comparable to CogVideoX-5B!
Poor them, they probably did it before Mochi appeared and they couldn't stop halfway through
>>
File: pepe.jpg (529 KB, 1352x1024)
529 KB
529 KB JPG
>>
File: canman5.jpg (1.43 MB, 2144x2144)
1.43 MB
1.43 MB JPG
>>103001364
Nice
>>
>>103003679
yes the GGUF 8 and it was 31 frames, also, after loading it up again it now gives me the original error, idfk anymore lol.
I need food so i'm off the case for a while.
>>
>>103001364
how did you find this img of me
>>
>>103001083
This is a test of prompt adherence. If redpanda is that good at adherence, it will be a big deal.

The problem is, it's not a test of prompt tuning, because that would require formal rules and judges and stuff
>>
>>103004001
>If redpanda is that good at adherence, it will be a big deal.
do we have a clue on what redpanda will be? I'd guess it'll be Midjourney V7 or something, it's been a while since they last updated their model
>>
Prepare yourselves. A new SaaS SOTA is about to arrive.
>>
File: skronk.jpg (28 KB, 480x360)
28 KB
28 KB JPG
>Someone posted a webm of a girl holding her stomach in a distraught way a few threads ago
Okay, fine. How many damn GPUs do I need?
>>
>>103003758
well, at least it's already supported in comfyui and it can actually run on coomsumer hardware, what we need is a good video upscaler like topaz
>>
>>103004146
at least 16gb of vram, if you have 24gb you can go up to 12 seconds of videos with a Q8_0 quant
>>
File: pepe.jpg (101 KB, 768x768)
101 KB
101 KB JPG
>>
>>103004148
>is already supported in comfyui
Mochi is also supported on comfyui
https://github.com/kijai/ComfyUI-MochiWrapper
>it can actually run on coomsumer hardware
Mochi can also run on consumer hardware, as long as you have at least 16gb of vram
>>
>>103001054
Are there other pictures of that redpanda model on the internet? I wanna see if it's truly a next-level image model
>>
>>103004297
ok but where is the img2video model of mochi?
>>
File: file.jpg (1.42 MB, 2048x2048)
1.42 MB
1.42 MB JPG
>>103004333
https://cancel.com/jesus__suero/status/1850835432390426626#m
there's some here
>that fucking blur
maybe that's a new Flux model
>>
>>103004456
>ok but where is the img2video model of mochi?
we don't have the VAE encoder to do that, they'll release it soon though
https://github.com/genmoai/models/issues/9#issuecomment-2430585334
>We will be open-sourcing the encoder, stay tuned :)
>>
>>103004470
that doesn't look impressive at all
>>
File: 00013-802331459.png (1.07 MB, 1024x1280)
1.07 MB
1.07 MB PNG
>>103003457
Oh yeah, that one's great
>>
>>103004568
billions must sculpt
>>
>>103004480
Ok, so you dont have it
>>
So nice skipping the caching in Kohya, the best part of having two 4090s. Fucking ridiculous how much space it takes and how long it takes, especially when you can only do batch size 1, so runtime embeddings cost nothing.
>>
>>103004731
you're making a flux lora anon?
>>
>>103004755
I'm finetuning the 8B Flux which fits perfectly without having to do block swaps.
>>
>>103004767
>finetuning
oh nice, on what exactly?
>>
File: 1715746216172136.png (1.82 MB, 896x1152)
1.82 MB
1.82 MB PNG
>>
>>103004778
futa on shota
>>
>>103004789
kek
>>
>>103004778
Nude women obviously
Then maybe celebrities and other pop culture.
But it's really just until Sana comes out.
>>
>>103004480
>don't have the VAE encoder
I just realized that this is an evil, but genius, way for a company to "open source" a model while keeping it censored and preventing it from being used for anything "bad". Without the encoder, you can still do text2img, since it diffuses in the latent space and then decodes. But you can't do img2img (no more face swap, deepfake, whatever). More importantly, you can't train it, since you need to encode images into the latent space to train. Wonder if we'll start seeing more companies doing this...
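the asymmetry in a nutshell; dummy stand-in functions (all names made up), only the data flow matters:

```python
def text_encoder(prompt):
    return f"cond({prompt})"

def denoise(latent, cond):
    return f"denoised({latent},{cond})"

def vae_decode(latent):
    return f"pixels({latent})"

# note: no vae_encode defined anywhere, that's the piece that wasn't released

def text2img(prompt, noise="z"):
    # works: only the decoder is needed to go latent -> pixels
    return vae_decode(denoise(noise, text_encoder(prompt)))

def img2img(image, prompt):
    latent = vae_encode(image)  # NameError: needs the missing encoder
    return vae_decode(denoise(latent, text_encoder(prompt)))

print(text2img("a cat"))
```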
>>
>>103004838
>More importantly, you can't train it, since you need to encode images to the latent space to train.
that would single-handedly kill the model, if it can't be trained no one will give a fuck about it
>>
>>103004838
I don't think that's true, the inference code is available, you should be able to hack it to do what you want. AI models are inherently not black boxes because you interact with them with code. Training can be hidden from you but the inference stuff is like Javascript, anyone can see it.
>>
>>103004851
Flux got really popular within days, even while the majority opinion was "you can't train it because it's distilled". Even now, we have exactly one real finetune. Normies don't give a fuck, they just use the base model, as they are doing currently with flux.
>>
>>103004902
don't forget that we have hundreds of loras on flux to make it more fun, it's not like we're stuck with vanilla, thank god it's not the case
>>
>>103004789
For once, I'm happy that flux fights anyone trying to train it, literally to death (collapse).
>>
>>103004902
BigAsp 2.0 cost $3500 to train. To do the same on Flux would be like $35000.
>>
>>103004943
>To do the same on Flux would be like $35000.
Or you could run on a single 4090 and wait a month, like he did lol >>102987755
>>
>>103004975
>barely a fine tune, minor changes at best
Let me help you anon, if you want pop culture put into Flux you're talking about hundreds of thousands if not millions of steps. Big Asp 2.0 was 6 million images trained for 40 million steps (Batch size 1024). Want to do the math on Batch 1, 15 seconds per step?
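taking those numbers at face value, matching just the step count (never mind the 1024x larger sample count) at batch 1 on a 4090:

```python
steps = 40_000_000       # BigAsp 2.0 step count claimed above
seconds_per_step = 15    # 4090 full-finetune speed at batch size 1
years = steps * seconds_per_step / (3600 * 24 * 365)
print(f"{years:.0f} years")  # prints "19 years"
```

and that's before multiplying by 1024 to actually see the same number of samples.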
>>
>>103005007
>>barely a fine tune, minor changes at best
you tried it?
>>
>>103005028
It's a minor aesthetics update, feel free to prove me wrong. Looking at the examples and gallery, it's just aesthetics. That's not anything to write home about, you could literally do the same in 2 hours with a Lora.
>>
>>102987712
Hang in there OP.
>>
>>103005058
>It's a minor aesthetics update, feel free to prove me wrong.
You're the one who claimed it has "minor changes at best" first, therefore you're the one with the burden of proof, hope that helps.
>>
File: tmpeim7jxq0.png (889 KB, 896x1152)
889 KB
889 KB PNG
>>
The buns are out and hot:
>>103005229
>>103005229
>>103005229
>>
sana-samas what's our status?
>>
>>103005248
hiding in shame


