/g/ - Technology

File: tmp.jpg (1.11 MB, 3264x3264)
1.11 MB
1.11 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101935309

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
>>101937500

Was about to mention this: I can't daisy-chain two LoRAs together with GGUF right now. Which sucks. Luckily I'm not a total vramlet.
>>
File: 111.jpg (3.2 MB, 1999x1999)
3.2 MB
3.2 MB JPG
>>
File: ComfyUI_00061_.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>
Im just like gonna call it slop haha
>>
L O R A S
O
R
A
S
>>
File: 1700479568546415.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
flux was worth it, solely for the Pepe lora. The rest is a bonus. And this is basically the first week of loras, most aren't even training yet.
>>
So I'm interested in captioning my datasets now for LoRA training to see if it can improve the outputs. What captioner are we all using right now?
>>
>>101937515
>can't use one lora without tonemap and therefore without cfg > 1
>with cfg = 1, can only load one lora and not more
yay...
>>
>>101937590
for SFW, GPT4V; for NSFW, that joycaption thing
>>
>>101937606
>joycaption
Were there weights available?
>>
>>101937590
Joy captioner
DL from huggingface and replace the model with another llama model to run it locally/uncensored
>>
>>101937628
>replace the model
I'm going to bed it's 3am
>>
>>101937620
yes, just clone https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha
it will download the weights for you (the llama weights require you to go to the repo and request permission, but there are mirrors on Hugging Face, and finetunes work as well from what I've seen other anons do)
>>
File: 1720693886864337.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>101937574
slightly better: forgot the cartoon prompt

remember, if you can generate controversial stuff then the basic stuff is easy as fuck.
>>
>>101937495
From a technical point of view, how does Flux differ from SDXL?
As in, if someone was trying to adapt something that supports SDXL to also support Flux, what would the key points of difference be?
Two CLIP encoders (is the second one optional)? What other magic is there?
>>
>>101937640
lmao that one is good not gonna lie
>>
>>101937641
it's a diffusion transformer architecture instead of a UNet, does your "something" care about the architecture?
>>
File: 1711021540339872.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
Pepe is wearing a business suit and is at a political rally. He is standing at a podium. A big rectangular sign that says "PEPE" is behind him. Pepe is saying "feels flux, man" in a white speech bubble.
>>
Quite a few flux loras are being trained on AI images, WTF is wrong with those people... It just makes everything look more AI.

e.g. I saw a lora and I was like why does this look like a Pony image... well, they trained it on Pony images.
>>
>>101937673
Do one with a bullet grazing his head.
>>
>>101937638
I hope they add this to Taggui, I love using that program.
>>
>>101937690
those retards now find it normal to do some AI inbreeding, I fucking despise that shit
>>
File: 1702613342379333.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>101937691
>>
File: 1708804035369527.png (1021 KB, 1024x1024)
1021 KB
1021 KB PNG
>>101937732
slightly better:
>>
>>101937661
Ah right, that's probably going in the too-hard basket for me then sorry.
Thought maybe I could hack support into StableDiffusion.cpp.
>>
>>101937641
>From a technical point of view, how does Flux differ from SDXL?
better transformer architecture (DiT vs unet)
Way bigger (12b vs 3.5b)
>>
File: 1703888458747817.png (991 KB, 1024x1024)
991 KB
991 KB PNG
what a time to be alive, huh?
>>
Is there a way to drag an output that is already linked so that I can choose a new output instead of a new input? Haven't found a modifier that changes it from the default.
>>
>>101937732
>>101937748
fire the whole bullet
that's 33% more bullet
>>
>Hatsune miku, her speech bubble says: "I'm loving it!", 50's comic book style
Choose your best unslop combination (X = cfg | Y = GuidanceNegative), made possible with Tonemap
https://files.catbox.moe/a8qovd.png
>>
File: 1695429128484101.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
>>101937764
>>101937748
>>101937732
KEK, nice.
>>
File: FD_00340_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>
>>101937813
No, kill yourself
>>
It's over
>>
File: FD_00317_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
File: Flux_00857_.png (1.45 MB, 1344x768)
1.45 MB
1.45 MB PNG
>>
File: ComfyUI_00146_.png (571 KB, 512x768)
571 KB
571 KB PNG
Anyone know how to change the font size of the prompt text? I edited user.css with

element.style {
--comfy-textarea-font-size: 20px;
}

But it gets overwritten when opening ComfyUI
>>
>>101937893
It wouldn't be ideal, but could you overwrite it in inspect elements?
>>
File: 1702798476278549.png (982 KB, 1024x1024)
982 KB
982 KB PNG
>may I take your order?
>>
File: ComfyUI_00147_.png (585 KB, 512x768)
585 KB
585 KB PNG
>>101937900
I did but it goes away after I close the tab
>>
>>101937909
use Stylus
>>
File: 1703392562031671.png (960 KB, 1024x1024)
960 KB
960 KB PNG
>>101937905
>>
>>101937893
Isn't it an option in the settings? Click the gear and look for "Textarea widget font size"
>>
File: ComfyUI_00148_.png (495 KB, 512x768)
495 KB
495 KB PNG
>>101937934
Holy fuck it works, tried for like 2 hours to change it
>>
File: 1696969884608524.png (991 KB, 1024x1024)
991 KB
991 KB PNG
the colonel in the background is the most impressive part.
>>
>>101937944
I wonder if by virtue of most KFC marketing having the letters KFC in them, they came out better tagged in the dataset.
>>
File: ComfyUI_00149_.png (657 KB, 512x768)
657 KB
657 KB PNG
>>101937934
Thanks, now I need to find a font for it
>>
File: 1693495951317533.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>101937944
brands come out pretty good in addition to text too
>>
is there an example repo of someone setting up a flux LoRA with a dataset so I can just copy it without thinking too hard?
>>
File: ComfyUI_00150_.png (623 KB, 512x768)
623 KB
623 KB PNG
More steps in schnell add more detail
>>
>>101937515
>>101937591
I fixed it kek. Who needs sleep.
>>
>>101937996
looks like a Bratz doll
>>
>>101937905
>anime 2b
Can it do a normal in-game one?
>>
>>101938020
this lora is trained in that style but if you gave it an artstyle prompt I guess it would work, not sure still testing.
>>
why is my VRAM usage higher today and why does moving the VAE to the CPU give me OOM now
>>
>>101937997
It works!! No errors and no OOM anymore, what a fucking legend!!
https://www.youtube.com/watch?v=VtAzlUu4b3Y
>>
>>101937996
Truly groundbreaking, Anon.
>>
File: 1699619065772874.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
RAYGUN COULD NEVER
>even 2b is better than the aussie
>>
>>101938045
but can 2B hop like a kangaroo
>>
File: 1700288224393599.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>101938045
this time but with more soul and the logo
>>
File: file.png (10 KB, 252x331)
10 KB
10 KB PNG
>hit the new "queue" button
>comfy freezes and lags for a couple seconds
Guess it needs gpu acceleration on to not shit and piss itself
>>
>>101937997
No more errors. For those getting OOM, remove --highvram and remove the force MODELDevice thing too
>>
>>101938094
But gpu accel directly affects your generation speed, why would anyone enable it
>>
File: 1723888726514.jpg (224 KB, 1200x933)
224 KB
224 KB JPG
gn everyone
>>
File: 3922952966.png (1.04 MB, 1152x896)
1.04 MB
1.04 MB PNG
>>
>>101937638
so basically we need a bit of a wrapper to come up with a dataset to avoid duplicating work
>>
File: 1716576859884215.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>
File: 1934828313.png (1.16 MB, 1152x896)
1.16 MB
1.16 MB PNG
>>101938045
nice
>>
>>101938181
we can use AI to make the routine a 10/10
>>
I do Schnell 1024 res gens in about 18 s on my 3060
dunno if that's slow or not
RAWRRR
>>
File: ComfyUI_01715_.png (664 KB, 1024x1024)
664 KB
664 KB PNG
>>101937813
>>
File: ComfyUI_00156_.png (566 KB, 512x768)
566 KB
566 KB PNG
>>101938191
That's normal for now
>>
File: 1723001676026406.png (908 KB, 1024x1024)
908 KB
908 KB PNG
https://civitai.com/models/644109

a new era of pepes begins

>>101938211
very nice!
>>
File: 1700785213834723.png (867 KB, 1024x1024)
867 KB
867 KB PNG
pepe is also a fan of miku gens in flux
>>
File: 1723706531678745.jpg (547 KB, 3024x1714)
547 KB
547 KB JPG
What scaler or node can do this?
>>
File: ComfyUI_Flux_24.png (1.54 MB, 1216x832)
1.54 MB
1.54 MB PNG
>>101937997
Works perfectly with various loras/cfg adjustments and whatnot, thanks. Custom loaders work just fine too so no need to stick to the regular lora loader.
>>
File: 1708169452067202.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>
So GUUG Q8 works better with loras than Nf4 right.
>>
File: GU2o97HWMAAPWES.jpg (3.82 MB, 2438x1950)
3.82 MB
3.82 MB JPG
Flux 1 dev with Realism Lora + Magnific + Luminar + Lightroom
>>
>>101938282
it gave miku pepe eyes kek
>>
>>101938290
GGUF*
>>
>>101938291
a lot of work for a noisy analog
>>
>>101938290
it should, Q8 is way better than nf4 on its own
>>
>>101938308
Sounds good, downloading now baby
>>
why the fuck is ComfyUI using more VRAM today, it's starting to hit the shared VRAM when it has been working fine for a week
even changed the DE to the iGPU, the VRAM use on the GPU is literally 0
>>
Can you inpaint faces in Forge yet for Flux?
>>
>>101938315
Did you update it? I noticed the last update did that
>>
File: 1693782740265607.png (938 KB, 1024x1024)
938 KB
938 KB PNG
what a time to be alive
>no there will be no nudes
>no loras are impossible
>no you need an a100 to run it
>no it has safety controls on it
>>
File: ComfyUI_00944_.png (1.57 MB, 1344x768)
1.57 MB
1.57 MB PNG
>>101938315
Are you using quants? Update the node and comfyui. idk which fixed the issue
>>
>>101938262
eh this isn't magic. just looked at their images. upscale by model (pick a decent ESRGAN model) > downscale to a reasonable size > resample with a low-ish denoise at around 16 steps > upscale again to final size - with flux you actually get better results lol. many ways to skin that cat bro
>>
>>101938331
I wasn't but tried it just now to see how well it works
using Q8 the VRAM use is lower so no swapping but it is two times slower than FP8, from 2.4s/it to 4.86s/it
is that normal?
>>
File: Comparison_all_quants.jpg (3.84 MB, 7961x2897)
3.84 MB
3.84 MB JPG
>>101938348
>using Q8 the VRAM use is lower so no swapping but it is two times slower than FP8, from 2.4s/it to 4.86s/it
>is that normal?
usually when you get something 2 times slower that's because you went from CFG = 1 to CFG > 1, the speeds of Q8 and fp8 are supposed to be close
>>
>>101938362
I don't use CFG at all so it's not that, I just changed from the normal loader to the GGUF loader
>>
>>101938371
maybe it's overflowing your VRAM capacity and that's why it's slow, how much VRAM do you have?
>>
>>101938390
I have 16GB
>>101938348
>using Q8 the VRAM use is lower so no swapping
>>
File: ComfyUI_Flux_28.png (1.24 MB, 1216x832)
1.24 MB
1.24 MB PNG
>>101938020
To answer my own question - kinda works but it definitely wasn't trained on the game renders. Looks more like a gta san andreas mod kek. I just put "a 3d screenshot" in the positive and "anime, 2d" in the negative
>>
>>101938362
is the gen process pretty much like this:
comfy sampler -> gguf dequant -> flux model script using NN Linear?
thanks for the great work
>>
File: Capture.jpg (96 KB, 1444x727)
96 KB
96 KB JPG
>>101938404
>I have 16GB
can you see if it overflows your VRAM capacity in task manager? Picrel is the Q8 VRAM usage during inference on my 3090
>>
>>101938430
I literally said it doesn't in the comment...
>>
File: 00006-2721554246.png (1.61 MB, 896x1152)
1.61 MB
1.61 MB PNG
Ignore the audience
>>
File: ComfyUI_Flux_30.png (1.38 MB, 1216x832)
1.38 MB
1.38 MB PNG
>>101938406
And this is with "a real photo"
>>
>>101938439
welp, sucks to be you I guess
>>
File: ComfyUI_00162_.png (615 KB, 512x768)
615 KB
615 KB PNG
>>101938334
Some dev ones look similar but take forever to generate compared to schnell
>>
>>101938457
you can prevent that by adding "anime" in the negative prompt
>>
>>101937996
Take the 10 minute per gen pill. Go dev. You will learn to discern if it's worth it by looking at the preview of the first few iterations, and you will put a lot more work into your gens.
>>
File: 1695506901949471.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
isn't technology cool? I'm a comp sci guy, not an art student.
>>
>>101938457
Not bad.
>>
>>101938470
Flux is worth it just for the coherence of backgrounds
>>
File: 1703246731674638.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>101938478
flux has a way of making photos/depth of field type prompts feel authentic, i'm not sure exactly how to describe it, but it works. It uses noise in a better way than SDXL, I guess?
>>
Why doesn't image2image work with flux in general? Or is it just with schnell?
Image gets really overcooked or just broken
>>
that explains it https://github.com/city96/ComfyUI-GGUF/issues/33#issuecomment-2294821171
I read that Forge also slows down with loras, is that right?
>>
File: ComfyUI_Flux_31.png (1.4 MB, 1216x832)
1.4 MB
1.4 MB PNG
>>101938468
Yeah, that's the first thing I did.
>>101938510
Everything always slows down with loras, even 1.5 and sdxl.
>>101938497
Dev needs a pretty high denoise value for significant changes, I'm talking 0.8+. No idea about schnell.
>>
>>101938497
I2I should work with every model
>>
>>101938524
>Everything always slows down with loras, even 1.5 and sdxl.
never by doubling gen times as it is happening now
>>
File: 3812288914.jpg (2.55 MB, 1792x2304)
2.55 MB
2.55 MB JPG
>>
File: 00009-1045291052.png (1.31 MB, 896x1152)
1.31 MB
1.31 MB PNG
>>101938348
Something similar for me, not quite double but still a lot more: went from 3.9s/it to 5.5s/it, from 1min 20 seconds to 1min 50 seconds.

Gonna restart PC to see if it helps
>>
File: file.png (617 KB, 800x600)
617 KB
617 KB PNG
>>101938191
That's normal. My schnells take 1-3 minutes on my AMD 5700.
>>101938495
What the fuck is going on in that back room?
>>
>>101938460
I scale them to 1.5x (of the original) size after the "upscale by model" node before resampling it, it's reasonably fast. Then, as a final step, in a separate workflow, I just upscale with a clean ESRGAN model again and downscale to the final size. Tried various DAT upscalers, waste of time in most cases. really hungry mofos. BRO
>>
File: ComfyUI_00329_.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
what's a good workflow for Lora stuff?
>>
>>101938536
>never by doubling gen times as it is happening now
Theoretically it should double the gen time because what CFG does is make two predictions per step (one conditional, one unconditional) instead of just one, so twice the work
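Roughly what that looks like per sampling step (a minimal sketch, not ComfyUI's actual sampler code; model, cond and uncond are placeholders):

def cfg_denoise(model, x, t, cond, uncond, cfg_scale):
    # classifier-free guidance: one conditional and one unconditional
    # prediction per step, then extrapolate between the two
    eps_cond = model(x, t, cond)
    if cfg_scale == 1.0:
        return eps_cond               # single model call, no extra cost
    eps_uncond = model(x, t, uncond)  # second model call, hence ~2x the work
    return eps_uncond + cfg_scale * (eps_cond - eps_uncond)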
>>
File: 1708198229142296.png (1.98 MB, 1024x1024)
1.98 MB
1.98 MB PNG
>>101938495
and for something a little different, art nouveau lora but with miku:
>>
>>101938549
motherfucker I'm talking about loras what the fuck are you on about
>>
I got a great deal on a 4070 Ti Super 16GB model, anyone using it for image gen?
>>
>>101938542
I see a booger up her nose. Realistic.
>>
>>101938555
>Seething this hard
Take some pills retard, you need to control your anger issues
>>
>>101938495
requesting catbox pls
>>
>>101938574
you read "doubling gen times" and your monkey brain thinks "CFG" disregarding the context of the conversation because you make logical connections at the level of GPT-2
>>
File: 00011-1045291054.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>101938566
it keeps happening on this prompt.
>>
File: Flux_00915_.png (1.04 MB, 1344x768)
1.04 MB
1.04 MB PNG
>>101938556
Using one right now for Flux. With ComfyUI's low VRAM mode you can run the full FP16 model.
>>
>>101938588
Like I said, take some pills, if you're gonna scream like a monkey every time someone misses something in a conversation you'll have some serious health issues
>>
>>101938582
https://files.catbox.moe/tgka91.png
>>
>>101938548
Just put a lora node in.
>>
>>101938603
So you admit you're retarded.
>>
>>101938556
BRO EVEN A 3090 IS OUTDATED NOW (t. 3090 user). seriously don't waste money on anything below a 4090 now
>>
>>101938588
That dude spends his entire day posting his CFG hacks here and on reddit, any chance he gets he'll shill his shit. His images always look deepfried and uber noisy because of it too.
>>
>>101938620
So you admit that you have the same self-control as an unhinged gorilla?
>>
File: ComfyUI_Flux_33.png (1.46 MB, 1216x832)
1.46 MB
1.46 MB PNG
>>101938536
About 25% speed decrease or so for me. Running Q4_0 on RTX 2080
>>
File: 1696294159936767.png (1007 KB, 1024x1024)
1007 KB
1007 KB PNG
I tried the pepe lora with the nf4 model. I got kermit instead. lmao
>>
File: FluxDev_01555_.jpg (186 KB, 768x1312)
186 KB
186 KB JPG
Is it me or are her legs looking a little thick?
>>
>>101938636
Damn, nice
>>
>>101938556
>>101938623
Everything is "outdated" when it comes to this. I'm going to get a 24GB used GPU myself as soon as I find a good deal. Otherwise you will be forever waiting for the next thing and regretting spending money because this scene moves so fast. Just make gradual upgrades and don't get too hung up on it.
>>
>>101938595
Hey that's cool. How are the gen times?

>>101938623
I got a deal for 400€ vs 4090 almost 2k€ where I live
>>
File: 175052_00001_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>101938556
I have a 4060ti 16GB and 64gb ddr5 and it does the default fluxd 1024x1024 in 49s, with comfyui on linux so you should be in for some good times.
>>101938611
Bless you anon, doesn't work for me, using comfyui, but bless you anyway.
>>
>>101938623
That attachment on the tip of the gun is for sticking it somewhere.
>>
File: grid-0178.jpg (857 KB, 2304x1792)
857 KB
857 KB JPG
I have no idea how I got outputs this varied from:
"Alex Garan artstyle, Page from American college yearbook, , 1980s in color"
>>
>>101938684
means it has fuck all idea what "Alex Garan artstyle" is
>>
>>101938669
Go hard, brother. I'd go for more memory if I could, but that's a hell of a deal.
>>
>>101938670
it's not complicated, it's just the emma lora then after the instance prompt "in a coffee shop, she is wearing a white tshirt that says..."
>>
File: grid-0177.jpg (904 KB, 2304x1792)
904 KB
904 KB JPG
Same prompt.
>>
>>101938669
For that pic, which was 25 steps with Euler, it was around 45 seconds.
>>
File: ifx66w.jpg (580 KB, 1600x1600)
580 KB
580 KB JPG
>>101938682
fact checker
>>
so in comfy, how do you connect the gguf -> lora -> nodes to text input? the output after lora is model, what does it link into?

what's the basic workflow for gguf/lora?
>>
>>101938669
>I got a deal for 400€
fucking how, my 4060 Ti 16GB was 450
the 4070 ti super was ~850 at a minimum last I checked
>>
Say what you will about the comfy UI design but I am absolutely loving these node timers.
>>
>>101938720
The lora alters the model. It just makes it react to new embeddings. You don't need to connect the lora to the text/embeddings input. Think of it as a filter for the model so that it understands your toddler foot fetish.
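If it helps to see it in numbers, a lora is just a low-rank delta added on top of the existing weights, which is why patching the model is all you need (a toy numpy sketch, not the actual loader code):

import numpy as np

d_out, d_in, rank = 64, 64, 8        # toy sizes, real Flux layers are much bigger
W = np.random.randn(d_out, d_in)     # frozen base weight from the checkpoint
A = np.random.randn(rank, d_in)      # lora "down" matrix
B = np.random.randn(d_out, rank)     # lora "up" matrix
alpha, strength = 8.0, 1.0

# patching = adding the low-rank product onto the base weight
W_patched = W + strength * (alpha / rank) * (B @ A)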
>>
>>101938720
*cause the initial workflow has clip and vae I believe but the q4/q8 loader doesn't
>>
>>101938720
depending on the LoRA loader you use, there should be a yellow clip thing you can pull out into the text input.
Depending on the LoRA though, it's probably not necessary.
>>
File: FD_00404_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>101938495
If you can't beat them, join them
>>
File: 00008-1120970443.png (1.66 MB, 896x1152)
1.66 MB
1.66 MB PNG
https://civitai.com/models/523485?modelVersionId=732778
Finally, flux is perfect now
>>
>>101938735
>>101938746
thanks
>>
>>101938720
That said, there are several lora loader nodes. Some let you plug the image model and the text encoder (clip). I dunno.
>>101938755
My fetish.
>>
https://civitai.com/models/653149/javelin-82
SHUT IT DOWN
>>
you know not even a few months ago I would have said open source DALL-E 3 was pretty far away, and now flux is basically uncensored open source dalle. pony/sdxl was already good, this is a step above that.

openAI/Microsoft might be weight hoarding assholes but open source always wins.
>>
File: aseet.jpg (20 KB, 542x375)
20 KB
20 KB JPG
>>101938614
Sir pls I need the workflow
>>
>>101938755
>dataset consists of 28 screencaps (1920×1024)
I hate these people so fucking much. Same as the ones finetuning text models with arbitrary prompt formats.
USE THE SAME FORMAT USED ON THE ORIGINAL MODEL FOR FUCK'S SAKE
>>
>>101938665
you are right, yes. that's why I got this 3090 last year. I don't regret it. still, feels too weak sauce for flux.
>>101938669
my advice: spend a bit more and get a 3090. 500? 600? you can do it.
>>101938725
it is going forward.
>>101938766
kiki best girl
>>
>>101938779
Lmaoo, how the fuck is civitai still alive after hosting so many controversial models
>>
>>101938786
Flux was trained on a range of megapixels and ratios tho
what you should be saying is USE A VARIED SIZE AND RATIO DATA SET FOR FUCK'S SAKE
>>
>>101938725
I like that too. I also like seeing the green outlines so you can visually see the program at work, it's like seeing data go from your ssd to your cpu and memory or whatever.
>>
>>101938779
What's the trigger phrase for this?
>>
>>101938786
>USE THE SAME FORMAT USED ON THE ORIGINAL MODEL FOR FUCK'S SAKE
That would ruin the whole thing, the ghibli screenshots aren't 1:1, flux must know how it really look like
>>
>>101938725
how do you get that anon? that looks interesting

>>101938803
>What's the trigger phrase for this?
kike
>>
>>101938812
Just crop the images judiciously. It's not about genning "a screenshot of a ghibli movie", but genning "kiki getting all her holes filled by a gang of magical brooms".
>>
>>101938801
It always had the outlines though
>>
>>101938812
but it knows cinematic wide shots already
besides the full shots you could also train on crops of the characters (keeping them withing the size/ratio Flux knows of course)
would bulk up the data set and avoid the chance of it degrading with gens that aren't wide
>>
>>101938843
>Just crop the images judiciously.
that's fucking retarded, flux is good at a lot of resolutions, so why not train it further with the real deal instead of forcing it with some 1:1 nonsense
>>
>>101938843
BRO.
>>
>put in the Load Lora node into the workflow
>press queue prompt
>nothing happens

I don't get it
>>
File: FD_00414_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>101938785
https://files.catbox.moe/qjmvkr.png
>>101938825
Update comfy and enable it in the settings
>>
>>101937495
How's the Flux.1 q4_0 compare to q4_1?
>>
File: 00015-954822782.png (1.64 MB, 832x1216)
1.64 MB
1.64 MB PNG
>>
File: 121147_00001_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>101938698
ah i didn't see it, got it now (was being a filter-tard) ty
>>
>>101938843
Most training programs auto-resize and crop pictures anyway to fit the SDXL resolutions, no need to do it manually
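Roughly what those trainers do per image (a sketch with Pillow; the bucket list here is illustrative, not any specific trainer's):

from PIL import Image

# illustrative ~1MP buckets like the SDXL training resolutions
BUCKETS = [(1024, 1024), (896, 1152), (1152, 896), (832, 1216), (1216, 832)]

def bucket_resize(path):
    img = Image.open(path).convert("RGB")
    ar = img.width / img.height
    # pick the bucket whose aspect ratio is closest to the image's
    w, h = min(BUCKETS, key=lambda b: abs(b[0] / b[1] - ar))
    # resize so the image covers the bucket, then center-crop the overhang
    scale = max(w / img.width, h / img.height)
    img = img.resize((round(img.width * scale), round(img.height * scale)),
                     Image.Resampling.LANCZOS)
    left, top = (img.width - w) // 2, (img.height - h) // 2
    return img.crop((left, top, left + w, top + h))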
>>
>>101938859
Q4_0 is always inferior to Q4_1. In the LLM space, Q4_1 was created as an improvement over Q4_0 by storing a bit more high-precision information per block of weights, so it's a bit bigger but better too
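The gist of the two block formats (a simplified numpy sketch of llama.cpp-style dequant; the real code packs the 4-bit values into nibbles and stores fp16 scales):

import numpy as np

q = np.random.randint(0, 16, size=32)     # one block of 32 4-bit quants

# Q4_0: one scale per block, values are centred around zero
d = np.float32(0.1)
w_q4_0 = d * (q.astype(np.float32) - 8)

# Q4_1: a scale *and* a minimum per block, so the 16 levels can sit anywhere
# in the block's actual range -> slightly bigger, slightly more accurate
d1, m = np.float32(0.05), np.float32(-0.3)
w_q4_1 = d1 * q.astype(np.float32) + m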
>>
>>101938856
You need to say the magic words.
>>101938851
If you made the LoRA, know that there's a reason you had to inpaint that fucking third leg out of that image. If Flux is giving you anatomical errors in 10 images you made as samples for your shitty work, there's something you did wrong and you know it.
>>
>simple guidance
>adaptive guidance
>dynamic threshold
>guidance threshold
>flux sampler
>flux shift
>two CLIP nodes for positive/negative with individual guidance settings
>skimmed cfg
>tonemap
>CFG Guider
>AutoCFG
>PrepNegGuider

I am lost.
>>
>>101938885
Worse yet, you are going to get randomly cropped faces and bodies that way.
Just put some effort into your work if you're going to publish for other people ffs. That's all I'm saying.
>>
>>101938889
Don't bother with that one anon's setup, it's hacky and breaks whenever a new update comes out. Just use the basic workflow for now until better supported CFG options come out
>>
File: laura-h4.jpg (603 KB, 1248x1824)
603 KB
603 KB JPG
>>101938889
not enough nodes sir
>>
>>101938889
When did you start learning imagegen, anon? You should start easy and then learn each concept slowly but surely
>>
>>101938889
>don't need it
>don't need it
>don't need it
>don't need it
>don't need it
>doesn't make a huge difference
>don't need it
>don't need it
>don't need it
>don't need it
>don't need it
>don't need it
>>
File: FD_00410_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
This is bad, right?
>>
>>101938919
you don't need dynamic threshold? seriously? that shit is great at unslopping anime images
>>
>>101938921
you're treading on dangerous ground
>>
>>101938901
Considering everyone already uses those tools and things turn out perfectly fine, I'm gonna have to go with you're just sperging out for no good reason. You can mess around with the tools used for bucketing/downscaling/cropping, you'd be hard pressed to throw something at it that will truly get fucked by the process.
>>
>>101938933
Not intentional, but I can think of the implications of this from a LoRA.
>>
>>101938919
Why are you saying you don't need Adaptive Guidance? You improve the speed of your gen with that, smells like some serious skill issues if you ask me.
>>
File: file.png (544 KB, 512x512)
544 KB
544 KB PNG
>>
>>101938921
HALF GIRL HALF PLASTIC BRO. & she already hit the wall there
>>
>>101938955
>You improve the speed of your gen with that
only if you're using CFG, you don't need it
>>
File: longew.jpg (1.29 MB, 1600x1200)
1.29 MB
1.29 MB JPG
>>
>>101938945
>things turn out perfectly fine
You have real difficulty finding badly made LoRA's on Civitai lmao
>>
>>101938971
nta, but CFG is good to remove the blur of photos flux is producing, I hate that blurry shit, why does flux have such a bias towards that
>>
File: wut this means.jpg (62 KB, 1358x1072)
62 KB
62 KB JPG
>>101938857
wat do?
>>
>>101938932
It's good for anime, but for anything else it adds even more slop. The guy who shills it uses the same "woman on a street" picture to test his settings because for 98% of anything you'd want to prompt photo-wise it's trash
>>
>>101938983
delete the nf4 loader, I have it there for no reason
>>
>>101938983
It's over.
>>
>>101938921
did you accidentally put megamind in the prompt?
>>
>>101938990
>It's good for anime, but for anything else it adds even more slop.
So it's useful and you need it after all, unless you're pretending people aren't into anime or something?
>>
>>101938979
Civit slop will be Civit slop no matter what tools you give them unfortunately kek
>>
File: ComfyUI_01725_.png (689 KB, 1344x832)
689 KB
689 KB PNG
>>
File: fp125w.jpg (301 KB, 1600x1200)
301 KB
301 KB JPG
>>
File: 3433076726.png (1.31 MB, 1344x768)
1.31 MB
1.31 MB PNG
>>
>>101939003
>It's good for anime
>If you use it for anything else it's trash
>So you need it then?
Not great at reading between the lines there, kid, are ya?
>>
>>101938919
>>don't need it
>>101939032
>>It's good for anime
choose one anon
>>
File: ComfyUI_Flux_35.png (1.53 MB, 1216x832)
1.53 MB
1.53 MB PNG
>>101938755
What the fuck are these birds
>>
>>101939038
>Don't prompt for anime
>I don't need it
No :D
>>
>>101939042
That's something you didn't specify in your first post, weird huh? :^)
>>
>>101939038
>me
>not me
you people really can't tell when you're talking to different people?
>>
>>101939040
cummy birds
>>
>>101938918
When CompVis released 1.2.
Still way too many new nodes without very much documentation.
>>
File: fp128w.jpg (912 KB, 1600x1200)
912 KB
912 KB JPG
>>101939040
birds aren't real
>>
>>101939059
Considering the guy only seems to talk when he can shove his workflow into a convo it doesn't surprise me that he doesn't know who he's talking to
>>
>>101939040
That's the cum of everyone who ever came to kiki's doujinshi, impregnating the imaginal space from which these images emerge.
>>
>>101938706
Ty

>>101938724
I ordered it using a friend's business that gets good offers every now and then

>>101938791
3090 way too expensive where I live
>>
>>101939059
Bullshit I can see your name right there "Anonymous". You've spammed this thread.
>>
File: 1757986426.png (1.1 MB, 1344x768)
1.1 MB
1.1 MB PNG
>>
File: file.png (2.29 MB, 1024x1024)
2.29 MB
2.29 MB PNG
>write running behind a bus
>the bus is always behind him
I thought this thing understood natural language. This is basically the same issue SD had.
>>
>>101939065
>Still way too many new nodes without very much documentation.
The only documentation we have so far are the tutorials on reddit, we definitely need a rentry or something
>>
>>101939114
clearly he's running behind the bus that is behind the camera
try describing the back of the bus
>>
>>101939114
This being a flux gen, you can clearly see it's trained to produce the kind of images that wow normies and journos. 1girl instagram simulacra. The moment you try to describe something that isn't that, it starts introducing weird shit like a shit model. Look at that foot. And what's going on on the left? It also hasn't decided if the camera is inside or outside a bus.
Flux is overtrained on specific types of images to make it look good on social media.

Prove me wrong (seriously, post non-typical situations)
>>
>>101939114
try to go for some boomer prompts, chatgpt or claude can help you make your prompt more descriptive
>>
>>101939114
what about running after a bus
>>
>>101939159
Damn, your life must be harsh right now: you're forcing yourself to be on /ldg/ and watch people only talk about flux when you could've just gone back to /sdg/ and enjoyed talking about your favorite SD models. Why are you doing that to yourself, anon? genuine question
>>
>>101939114
https://www.youtube.com/watch?v=MjbUnn32_zU
>>
>>101939159
>when you try to describe situations it doesn't understand, it doesn't understand them
Woah
>>
>>101939040
the humans on the ground look horrific as well, must be a terrible lora to make flux shit itself like that
>>
>>101939159
That's why I upscale and inpaint with SDXL. Flux is vanilla
>>
File: file.png (1.08 MB, 800x600)
1.08 MB
1.08 MB PNG
>>101939169
I know, I'm just trying out things. That prompt was "a jewish man running behind a bus".
This one is "a bus behind a jewish man running". Just interesting to see how this thing thinks.
>>101939191
Probably more idiomatic.
>>
File: file.png (1.03 MB, 800x600)
1.03 MB
1.03 MB PNG
>>101939191
>a jewish man running after a bus
This prompt breaks flux apparently.
>>
>>101939040
The more LoRAs you mix the fuckier things seem to get
>>
>>101939260
that's why we need a finetune that adds more concepts to flux, doing some LoraMAXXing has some serious limits, what if you want to go for 2b + Pepe in a ghibli style, that's 3 Loras flux has to handle, it can't work
>>
File: 2728765841.png (1.17 MB, 1344x768)
1.17 MB
1.17 MB PNG
>>
File: 00019-3077351720.png (1.42 MB, 832x1216)
1.42 MB
1.42 MB PNG
>>
>>101939260
Lycoris doesn't work with flux?
>>
>>101939292
Asking the wrong guy here. I think most people are still kind of stumbling over the finer points of LoRAs for flux.
>>
File: file.png (685 KB, 640x480)
685 KB
685 KB PNG
Here our tormented Jewish man is chasing the bus in the opposite direction
>>
>>101939321
that's a muslim
>>
>>101939318
IA3 should also be tried.
>>
simplest way to train a Flux lora on a 4090? want to try and improve its accuracy in generating a specific outfit
>>
File: file.png (716 KB, 640x480)
716 KB
716 KB PNG
>>101939328
They're all semites, what difference does it make?
In this one I used "chasing a bus", and he didn't even bother running.
We need a LoRA for this it looks like.
>>
>>101939345
https://github.com/ostris/ai-toolkit

Works on windows.
Just follow the instructions and you'll be cooking your LoRA within 20 minutes.
>>
File: 2024-08-17_00024_.png (1.87 MB, 1280x1024)
1.87 MB
1.87 MB PNG
>>
>>101939363
>Just follow the instructions and you'll be cooking your LoRA within 20 minutes.
So that's why there's a spam of Lora on civitai recently...
>>
>>101939350
the model is poorly tagged, embeddings would save the day here, yet that entire tech is being ignored atm.
>>
>>101939373
They are all very good quality because of the reasons this anon stated >>101938945
You can train LoRA's using any resolution, any number of images, and have them automatically cropped by a script. And it will come out amazing and ready for Early Access monetization!
>>
>>101939363
>https://github.com/ostris/ai-toolkit
Every time I have tried this I get an error. Gonna delete it all and start again, see if it still fails.
>>
>>101939373
>So that's why there's a spam of Lora on civitai recently.
No. Even that's too difficult for the average smooth brained jeet on civit. Civit recently introduced on-site LoRA training in exchange for "buzz" (whatever the fuck that is)
>>
>>101939394
>You can train LoRA's using any resolution, any number of images, and have them automatically cropped by a script.
like it will adapt to each resolution each image has? that's an excellent feature if you ask me
>>
>>101939400
>Civit recently introduced in site LoRA training in exchange for "buzz" (whatever the fuck that is)
Imagine giving your data + captions to civitai for free, I don't want to sound rude but you have to be pretty retarded to do something like that
>>
File: FD_00430_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
This is the best attempt I could muster. Gonna try Pro, see if it makes a difference
>>
>>101939426
>for free
Actually you have to pay for this too.
>>
>>101939428
>>
>>101939426
Really depends on the data and your hardware? Me? Never in my life would I fork anything over to civit. But the random jeet might want to upload his face five times to make another linkedin scam and that doesn't really bother me.
>>
>>101939441
I wouldn't be surprised if one day Civitai makes their own giant finetune of flux out of people's data and makes their own API from that
>>
File: imagefxb.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>101939428
ImageFX is better trained in many ways
>>
>>101939483
Can it make a wet woman?
>>
Not getting spellcheck in the prompt box in ComfyUI for some reason now
I'm ESL, I need it.
>>
>>101939363
If I wanted to train my own LoRA from nothing but a bunch of images, what's the best way to batch caption them? JoyCaption? (does it even run locally?)
They all have the same [item] in them if that matters, not even sure JoyCaption would recognise it, and I want to ensure it captures every little detail.
Never done this before so any advice would be appreciated.
>>
File: imagefxc.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>101939523
yes but you have to tard wrangle
>>
>>101939564
>does it even run locally?
yes
>what's the best way
by hand
>They all have the same [item] in them if that matters
You could state that in the prompt
>>
>>101939564
>If I wanted to train my own LoRA from nothing but a bunch of images, what's the best way to batch caption them? JoyCaption? (does it even run locally?)
JoyCaption does run locally. I still think GPT4V is the best captioner model, I would use GPT4V for SFW and JoyCaption for NSFW desu
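For batching, the usual layout trainers expect is one .txt sidecar per image; a minimal loop looks something like this (caption_image() is a stand-in for whatever you actually call, JoyCaption locally or the GPT4V API; it's not a real function):

from pathlib import Path

def caption_image(path: Path) -> str:
    # placeholder: plug in JoyCaption / GPT4V / your captioner of choice here
    raise NotImplementedError

dataset = Path("dataset")
for img in sorted(dataset.iterdir()):
    if img.suffix.lower() not in {".png", ".jpg", ".jpeg", ".webp"}:
        continue
    txt = img.with_suffix(".txt")         # trainers expect image.png + image.txt
    if txt.exists():
        continue                          # resume-friendly: skip already-captioned files
    txt.write_text(caption_image(img).strip(), encoding="utf-8")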
>>
>>101939564
Well, if it's niche enough that JoyCaption won't even recognize it, then you can run it through it and then edit the outputs manually to fix any mistakes, or do it raw.
It's also worth considering if you even need captions at all. Are you training a brand new concept or just trying to enhance something that's already in the model or force a particular style? If that's the case, captioning might be completely optional.
>>
File: FluxDev_01573_.jpg (184 KB, 832x1216)
184 KB
184 KB JPG
>>101939428
on a street
looking at the tail pipe on the back of a bus
looking at the back of a man with grey hair sprinting after the bus
>>
File: FLUX_00040_.png (1012 KB, 896x1152)
1012 KB
1012 KB PNG
10th try and I'm still not happy
it wasn't that complicated of a prompt
>>
>>101939604
LOOKS LIKE A JEWISH LADYBOY BRO.
>>
>>101939604
>it wasn't that complicated of a prompt
Stick it in GPT and ask it to rewrite it to be more verbose?
>>
>>101939397
Still failing. The file is there so I don't really know what the issue is.
>>
File: FLUX_00041_.png (1.01 MB, 896x1152)
1.01 MB
1.01 MB PNG
luck of the draw I guess
>>
File: Untitled.png (11 KB, 642x238)
11 KB
11 KB PNG
>>101939658
I do.
Where is your config file in the config folder?
>>
>>101939670
catbox? want to slap a realism lora on top for better skin
>>
>>101939670
it doesn't look like Emma at all
>>
File: ComfyUI_00955_.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>101939428
this seems harder than i thought. need to ask claude
>>
>>101939675
In that folder.
>>
File: ComfyUI_00194_.png (578 KB, 512x768)
578 KB
578 KB PNG
>>101939670
>>
>>101939682
>>101939601
>>
>>101939681
Emma? isn't that Taylor Swift?
>>
>>101939687
Can I see? Something isn't adding up.
>>
File: 1704810071215236.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
>>101939713
Oh fuck you're right, kek
>>
I wonder why LyCORIS never managed to replace LoRA, it's in theory a superior method
>>
FLUX doesn't work on the A1111 webui, but will it ever? Is that even possible?

I like A1111...
>>
>>101939721
I see the previous Emma image now so I can understand the confusion.
>>
>>101939714
>>
File: ComfyUI_00196_.png (548 KB, 512x768)
548 KB
548 KB PNG
>>101939697
Using dev + schnell combined for 4 - 8 steps, seems better

https://huggingface.co/drbaph/FLUX.1-schnell-dev-merged-fp8-4step
>>
>>101939740
why don't you jump onto the Forge ship, it's A1111 but faster and with all the recent updates
>>
>>101939740
just use Forge
>>
>>101939574
>yes
How exactly? Last I checked it was just a HF Spaces frontend to a model, not sure what tool I'd need to run it locally.
>>101939578
>GPT4V
Eh, that's paid so sadly I'll have to stick to JC. Hoping it'll suffice.
>>101939589
Just tested and JC recognises it, fortunately.
I'm actually just trying to enhance the details of a certain historical character that the model is already capable of generating, but not 100% accurately. For example, an emblem on the character's cap always looks wrong or a badge on his suit sometimes looks off, or even his general appearance since I sometimes have to generate 5-10 times before a decent result. I'm guessing that would require captioning.
>>
>>101939752
until it gets abandoned
can you pin T5 to the cpu in Forge yet?
>>
>>101939745
Okay, I see the problem. Your config file is named naif.yml and you're trying to get it to run a file called lora.yaml, which doesn't exist.

try this:

 python run.py config/naif.yml
>>
>>101939765
>can you pin T5 to the cpu in Forge yet?
Nope, that's why I'm stuck with comfyUi, this shit has more important features
>>
>>101939761
>How exactly?
huggingface spaces can be cloned, they are git repos
the app downloads the models for you (the llama model from the official meta repo requires asking for permission first but there are mirrors you can use just by changing one line in the code)
>>
>>101939778
wait fuck. I see in your screenshot you've changed the name.
>>
File: ComfyUI_00956_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>101939699
>tail pipe on the back of a bus
nice. adding "tail end of a city bus that's pulling away from its stop" did the job
>>
>>101939778
no I renamed it. check the terminal again, it's calling the correct file.
>>
File: 1693567355632801.png (772 KB, 1024x1024)
772 KB
772 KB PNG
gentlemen
>>
>>101939752
>>101939754
I haven't done this in a hot minute, didn't realize there was a better fork, thanks anons.
>>
>>101939797
I even tried calling the full path
>>
>>101939802
they thought they could stump T5 when it was them who were promptlets, although "back of the bus" should be pretty clear
>>
File: 00025-1402907279.png (1.17 MB, 832x1216)
1.17 MB
1.17 MB PNG
>>101939114
Mine is running head-on into the bus...
>>
>>101939578
What about stuff like llava?
>>
>>101939805
Okay. One more thing though. In the command prompt you are writing "niaf" when the actual file name is naif
>>
>>101939813
it's impressive how well flux is learning new concepts, and these are just the loras we have so far; imagine a serious finetune on this. I hope someone is up to the task, is pony-dev hinting he'll be working on that model?
>>
>>101939840
someone will do it, it's a capable model
>>
>>101939840
>is pony-dev hinting he'll be working on that model?
he can't monetize FluxDev so no
>>
>>101939859
he can on schnell, that model fucking sucks though
>>
>>101939840
>is pony-dev hinting he'll be working on that model?
Nope, he's been adamant about not wanting to do it.
>>
>>101939834
Oh holy fuck I am a retard. You're right.
Now I am getting an error about the folder path of the files but that can be fixed, probably doesn't like the spaces.
>>
>>101939884
No problem. I once spent like two days trying to figure out an error before I realized I spelled the world university as unversity.
>>
File: 00027-2540765661.png (1.2 MB, 832x1216)
1.2 MB
1.2 MB PNG
I can't get inpainting to work with GGUF Q8 in forge, hmm.
>>
>>101939897
And word as world.
>>
File: 00028-2540765662.png (1.34 MB, 832x1216)
1.34 MB
1.34 MB PNG
The guy laughing and pointing is a nice touch if I do say so myself.
>>
>>101939897
Ifs a feeble mind that kant think of moar then won way to spel a wurd.
>>
>>101939928
I know. It's my weakness.
>>
this retard https://civitai.com/models/647663/porsche-911-gts-2024-flux
>2.5GB lora
>the LoRa is this big because it is trained with really high res images think 4k and with a really high rank 256. You could make it smaller but it wouldn't retain all the intricate details of the car
clueless
>>
>>101939840
ponyman went all in on auraflow, "disasterpiece"
>>
>>101939897
Before I start, is there anything I should do to run this on a 16gb card. I know people have done it, I just don't know if they did it with this.
>>
>>101939966
>All that wasted gpu time.

Flux honestly does fine at rank 16, I don't think going further beyond that is going to do much more for you than deep-fry whatever you want to make.
>>
File: 123.jpg (2.85 MB, 1792x2304)
2.85 MB
2.85 MB JPG
>>
>>101939984
16gb? That's gonna be rough, run it with the low vram args I guess and probably only train at like 512 resolution (not as bad as you think) and maybe consider lowering the rank to 8.
>>
my puter is making pics of my waifu
>>
File: ComfyUI_00197_.png (566 KB, 512x768)
566 KB
566 KB PNG
>>101939967
It says sneed on the poster?
>>
File: 445036257.png (1.41 MB, 1344x768)
1.41 MB
1.41 MB PNG
>>101940001
I've seen this one already, though it does look nice.
>>
>>101940030
>>101940038
>>
>flux is released
>every single AI company releases a new, much more powerful closed source model
it's over
>>
>>101940074
>every single AI company release a new, much more powerful closed source model
like what
>>
>>101940030
supposed to be topless muscular men, adolf hitler and godzilla. got it right a few times.
>>
>>101940074
now the standard is flux, everything inferior will be dismissed, at least that'll force them to work harder and give us actual good products, which is always welcome
>>
File: 0.jpg (220 KB, 1024x1024)
220 KB
220 KB JPG
>>
>>101940013
Still struggling. Keeps running into unicode string errors.
I should just boot into ubuntu and do it there. Don't know if I can be fucked though.
Might just wait for kohya
>>
File: file.png (1.4 MB, 1024x768)
1.4 MB
1.4 MB PNG
>>101940088
Grok2 and Imagen 3 (ImageFX) just got released.
>>
>>101940167
That's nice, the genie is definitely out of the bottle, I'm tired of all that fearmongering about AI
>>
>traditional japanese art style, ink on paper, a cyborg samurai in a futuristic Tokyo with katana and jingasa, red sun, japanese calligraphy on the upper right corner, wabi-sabi, henna and carmine, sepia, minimal brush strokes
That was made by flux-pro, can dev reach that level?
>>
File: 00034-127994779.png (1.37 MB, 1216x832)
1.37 MB
1.37 MB PNG
>>101940167
What was that image made with?
>>
Do we have a baker?
>>
File: file.png (511 KB, 750x500)
511 KB
511 KB PNG
>>101940215
Grok2.
>>
>>101940225
>Grok2.
grok2 is dev pro though?
>>
>>101940225
So Flux Pro
>>
>>101940225
Not bad, has a more natural look.

Let's see what finetuned flux will be like though.
>>
Here we go...
>>101940241
>>101940241
>>101940241
>>
>>101940167
>Grok2
I'm not worried, Grok 2's image gen already got leaked.
>>
File: ComfyUI_00952_.png (1.31 MB, 1344x768)
1.31 MB
1.31 MB PNG
>>101940167
>Grok2
Flux pro
>Imagen 3
Doesn't look any better than Dev
>>
>>101940248
>Grok 2's image gen already got leaked.
what?
>>
>>101940213
I will try and post in the new thread
>>
File: ComfyUI_00199_.png (575 KB, 512x768)
575 KB
575 KB PNG
>>101940167
Looks real, did you use anything else?
>>
>>101940257
Because Grok is flux, idiot.
>>
>>101940331
it's flux pro, and we don't have that, retard
>>
>>101940255
where is her right hand?
>>
>>101940255
>Doesn't look any better than Dev
Highly disagree. Case in point >>101939483
>>
>>101940372
looks like a SDXL gen, the far away details are bad compared to flux
>>
>>101940360
she is disabled
>>
>>101940341
demonstrate something Pro can do that Dev can't
>>
File: file.png (2.01 MB, 1024x1024)
2.01 MB
2.01 MB PNG
>>101940442
Bro did you even look at the flux images using the same prompt in this very thread?

>>101940442
Pro looks much better. It's not even close.
>>
File: imagefxd.png (1.85 MB, 1024x1024)
1.85 MB
1.85 MB PNG
>>101940378
I didn't specify for telephoto
>>
File: Capture.jpg (44 KB, 791x351)
44 KB
44 KB JPG
>>101940541
Come on man that's not a quality picture, that's some ugly mush, flux doesn't fuck up like that
>>
File: ifx47.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>101940562
that's a bus queue anon...
try and gen a wren, a baby quoll and a salamander on moss in flux
i'll wait...
>>
>>101940629
that I agree with, flux doesn't know enough concepts, but it's easier to add more concepts to a model than to give it more quality in images
>>
>>101940655
both have strengths, as with any model; hopefully a good finetune will add some more textures and film tropes, you can see what they shied away from in training
>>
>>101938755
>>101938786
28 screencaps... lazy. The right way is to run the bluray through PySceneDetect, detect & remove near-duplicate images, caption with joy caption, finetune on the final 1000~2000 images with random crop enabled, high epochs and a slightly slow learning rate, then extract a lora at different ranks and take the best one

quality in, quality slop out
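For the near-duplicate step, perceptual hashing gets you most of the way (a sketch assuming the imagehash and Pillow libraries; the distance threshold is something you'd tune per source):

from pathlib import Path
from PIL import Image
import imagehash

def drop_near_duplicates(folder, max_distance=5):
    kept = []
    for path in sorted(Path(folder).glob("*.png")):
        h = imagehash.phash(Image.open(path))
        # hamming distance between perceptual hashes; small = visually near-identical
        if any(h - kh < max_distance for _, kh in kept):
            path.unlink()                 # delete the near-duplicate screencap
        else:
            kept.append((path, h))
    return [p for p, _ in kept]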
>>
>>101941296
>captioning before random crop



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.