/g/ - Technology

File: tmp.jpg (1.17 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102009692

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>PixArt Sigma & Hunyuan DiT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/g/sdg
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
Roundhouse kick Debo through a wall.
>>
Are we blessed
>>
File: 1708652430111543.png (1.05 MB, 1024x1024)
someone put knowledge in my base model
>>
>>102013108
I ask this because I assume the model expects everything to be described, so if there's a broom in one corner and I don't mention it, the model will end up thinking that broom is part of something else I did describe.
>>
File: fs_0076.jpg (99 KB, 720x1024)
>>
Blessed thread of frenship
>>
>>102013129
>someone put knowledge in my base model
you heard what that anon said? DO IT
>>
File: 1697314320809988.png (1.22 MB, 1024x1024)
>>102013150
yeah do it
>>
File: 00013-1178910297.png (3.63 MB, 1280x1920)
>>102013131
Lol
>>
File: Hapu_anime_SM.png (503 KB, 1280x720)
>>102013133
The idea, at least as I've gleaned from my time training, is to use tags to describe everything that isn't intrinsic to the character or concept. For example, Hapu (picrel) has brown skin, black hair, purple eyes, and a purple bonnet. You would want to tag purple bonnet, and not tag black hair, purple eyes, brown skin. Those would all fall under the "Hapu" tag you're training.
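
To make it concrete, a caption for one random Hapu image might look something like this (a made-up booru-style example, not from an actual dataset):

    hapu, 1girl, purple bonnet, standing, outdoors, open mouth, smiling

black hair, purple eyes and brown skin are left out on purpose: they're meant to get absorbed into the "hapu" trigger tag itself, so the trigger carries the intrinsic traits and everything else stays controllable.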
>>
>>102013192
Ah, makes sense. Thanks.
>>
>>102013192
yeah no shit
>>
>>
>>102013214
He wanted it explained, anon. Not everyone has the same level of knowledge, we all start somewhere on this shit.

>>102013213
Of course.
>>
>>102013192
If I may ask another question. How many images ideally should I use? And how long can I expect the training to take with a 3090?
>>
File: bComfyUI_107008_.jpg (985 KB, 1424x2048)
nips
>>
File: image.jpg (107 KB, 1536x1024)
>>
I'm seeing the reference images ai-toolkit is generating with the unquantized model (at 40s/it) and holy shit I didn't expect the quality gap with Q8_0 would be this noticeable. That or the prompts for the references are excellent.
>>
Been away for a little over half a year and haven't been able to keep up. Are there any good extensions worth noting that have come out, especially for auto1111?
>>
File: image.jpg (108 KB, 1536x1024)
>>102013068
>they all look germanic and shit
include an ethnicity/nationality in your prompt
>>
File: file.png (1.7 MB, 1024x1024)
>>102012483
>why doesn't the model just make up bullshit that I didn't even ask for?
If you want random bullshit use an LLM that makes your prompt random bullshit. Don't hate a new printer because it randomly stopped spewing ink everywhere like your old one.
>>
File: flux_00318_.png (1.62 MB, 968x1120)
>says here you were heard saying you don't approve of women in the military. care to explain that statement, soldier?
>>
>>
>>102013256
Well, people have had success with as few as 30, but the more you have, the better. With the caveat that they're all pretty high quality and diverse, too. You want your character at various angles in various states of undress, different outfits, etc. You also want to avoid multi-character pictures like the plague. Don't use too many black and white or sketchy images of the character, unless one or both of those is the style they're typically drawn in.

>How long can I expect the training to take with a 3090?
Well, I haven't trained Flux before. I did a lot of SD 1.5 training and SDXL training, but they're probably a lot faster to do than a model many times their size.
>>
>>102013256
i'm doing dim 16 768x768 batch 2 2000 steps on the full fp16 model and it takes about 3.5 hours, but that old anime lora posted last thread was trained for less than 300 steps so i clearly have no fucking idea what's going on
>>
File: file.png (10 KB, 359x104)
>>102013311
Fug wrong thread
>>102013310
>include an ethnicity/nationality in your prompt
I meant they look like models.

>pic related
Can I expect training to take this long no matter how many images I use? I just used 11 here as a test.
>>
12gb flux lora anon reporting in
here's 30 epochs of my latest lion cosine, 8 dim/8 alpha test. captioned with both local joycaption and booru tags, using the wildcard arg. style is ibuki satsuki. genned at 512*512, dev-q4_0 & t5-v1_1-xxl-encoder-q5_k_m
>random seeds
>euler, simple
>20 steps
>prompt:
a man with long white hair and Chinese style clothing, 1boy, long hair, white hair

trained at 512*512 (I overshot the LR and got crap last run). tonight/tomorrow I will run the same settings/dataset on 1024*1024 to compare the difference

will post 1024*1024 sized gens in a bit, it'll take me a while to gen 30 of them rip. from these 512*512 gens it looks to have gotten the artists' style down pretty well, if 1024*1024 gens aren't a total shitshow I'll probably be pretty happy with these training settings
>>
File: file.png (1.75 MB, 1024x1536)
1.75 MB
1.75 MB PNG
>>
File: miniature.jpg (88 KB, 1024x1024)
Remember back when people would just post their gens with the prompt? What I care about is the style, and no VLM will give me that part.
>small miniature mario bros with girlfriend princess peach toadstool. disney pixar cartoon 3d texture details light landscape background city The little mushroom is there too. Whimsical.
>>
File: file.png (12 KB, 552x108)
>>102013364
So far cooler than my old AMD
>>
>>102013347
why would you use different seeds for comparisons
>>
>>102013347
you're using kohya?
>>
>>102013344
>2000 steps
Jeez. Could we see the output?
>>
>>102013300
Auto1111 became obsolete because it doesn't support the model that replaced Stable Diffusion.
>>
File: fs_0100.jpg (118 KB, 1920x1080)
>>
File: image.jpg (106 KB, 1536x1024)
>star of david turns into triforce
What did flux mean by this?

>>102013345
>I meant they look like models.
80% flux issue 20% prompt issue. you'll need to wait for something equivalent to juggernaut to generate normal ugly people by default
for my use case it's actually a good thing they're all hyperbeautiful
>>
>>102013088
why do you leave sdg in the OP?
>>
>>102013434
when it's done sure
>>
>>102013406
I want to see how the training settings have done overall, not just compare each epoch to the other on a single seed. I will do that later to select the "best" epoch over a variety of seeds/prompts
>>102013418
yes
>>
>>102013520
why do you care?
>>
Datasetting is kind of fun.
>>
>>102013541
>yes
did you find a guide to get kohya to train flux or did you figure it out yourself? i'm running into some tokenizer error right now
>>
Apparently ai-toolkit generates a set of sample images every 250 iterations. What are these for? Seeing when the lora shat the bed in case of trouble?
>>
>>102013629
yes
>>
>>102013629
I can't imagine why you'd want to see the progress of your image model training through generated images
>>
>>102013487
Flux has given me lots of variety in faces, nothing like a lot of SDXL finetunes that had that bimbo sameface engraved hard. Whenever I saw that face the model went to the recycling bin, I hated it so much.
>>
>>102013635
Well, so far, so good. People in the images have randomly lost clothing items and become cuter.
I also had to open my computer case.
>>
File: flux_00321_.png (1000 KB, 968x1120)
>>
File: bComfyUI_107015_.jpg (301 KB, 768x1024)
>>
>>102013653
>People in the images have randomly lost clothing items and become cuter.
lewd
>>
>>102013441
And which model is that?
>>
File: ComfyUI_00802_.png (1.18 MB, 896x1152)
>>
Big if it works. IP Adapter looks like the way to get varied styles back
>>
>>102013641
it's fun(ny)
sometimes you get some crazy gens in the samples that are irreproducible
>>
File: ifx135.png (1.2 MB, 1024x1024)
>>
File: ComfyUI_00804_.png (1.75 MB, 1152x1536)
>>
>>102013801
What does it do...?
>>
>>102013801
So wait, any character I want? just a picture, and I can prompt that character to do stuff?
DAMN. I need to copy this workflow IMMEDIATELY. This solves so many knowledge issues
>>
File: bComfyUI_107090_.jpg (274 KB, 768x1024)
>>102013819
wtf is that
>>
>>102013810
are you training 200px images?
>>
>>102013869
1024x1024
that was an early sample
>>
fuck it. i give up. kohya traning seems impossible right now
>>
>>102013883
early samples shouldn't be that bad
>>
>>102013888
that's the spirit
>>
File: 00096-4138489953.png (2.23 MB, 1024x1528)
>load lora
>pray memory doesn't randomly balloon out to 30+gb
I swear, using loras in forge is like playing fucking slots.
>>
>>102013834
Think of it as a 1-image lora. That's the best way to describe it, though surely not the most accurate. Anyway, I've already been using IPAdapter to transfer style from an SDXL image to a Flux one. I can't wait to try it.
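
for anyone who'd rather see the idea in code than in Comfy nodes, here's a rough diffusers sketch of the same trick on SDXL (not the Flux node setup, just the concept: one reference image conditions the gen; paths and the scale value are made up):

    import torch
    from diffusers import StableDiffusionXLPipeline
    from diffusers.utils import load_image

    pipe = StableDiffusionXLPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
    ).to("cuda")
    # attach the image-prompt adapter; the reference image then acts like a 1-image lora
    pipe.load_ip_adapter("h94/IP-Adapter", subfolder="sdxl_models",
                         weight_name="ip-adapter_sdxl.bin")
    pipe.set_ip_adapter_scale(0.6)  # how strongly the reference steers the gen

    ref = load_image("style_reference.png")  # your reference image
    out = pipe(prompt="a man with long white hair", ip_adapter_image=ref).images[0]
    out.save("styled.png")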
>>
>>102013898
says you
>>
>images are watermarked with simple white text
>i get to train flux on text AND closeups of prime pussy at the same time
>>102013888
Use ai-toolkit. It's seriously easy. I don't know if my lora will be usable at all, but at least I'm able to do trial and error.
>>
>>102013888
kohya's bugged as shit especially if you're using the gui
>>
>>102013913
>Use ai-toolkit. It's seriously easy. I don't know if my lora will be usable at all, but at least I'm able to do trial and error.
16gb vram
>>
File: file.png (105 KB, 116x673)
>>102013909
if you're not blowing it out it should progress nicely
>>
>>102013908
god i love girls in knitted tops
>>
>>102013908
How expensive is it to run? Would it work with 8gb of vram? I know I can run flux dev on Q_0 and 512x512 images so that's not an issue, but if IP adapter is expensive I might not be able to run this workflow
>>
File: file.png (98 KB, 116x666)
>>102013939
etc etc etc
The progression of it learning Vaporeon
>>
>>102013921
AAAH SORRY I FORGOT
>>
>>102013939
show me some settings and i can try it your way
>>
>>102013908
Interesting. It makes me wonder: Is there a way to make it transfer the style without transferring the character in the image? It seems like it's just for the character of the image, as it is right now.
>>
>>102013959
      train:
        batch_size: 4
        steps: 40000 # total number of steps to train, 500 - 4000 is a good range
        gradient_accumulation_steps: 1
        train_unet: true
        train_text_encoder: false # probably won't work with flux
        content_or_style: content # content, style, balanced
        gradient_checkpointing: true # need this on unless you have a ton of vram
        noise_scheduler: "flowmatch" # for training only
        optimizer: "adamw8bit"
        lr: 2.5e-4
>>
>>102013921
>>102013956
Can't you low-vram it (presumably at an extreme speed penalty)?
>>
>>102013973
the github says 24gb and the low vram option doesnt help
>>
>>102013971
>steps: 40000 # total number of steps to train 500 - 4000 is a good range
lel
wait you're not using kohya
i'll try tonight, i leave it training overnight
>>
>>102013973
Yes, but I don't know how severe the penalty will be. Only one way to find out.
>>
I am trying to use https://github.com/ostris/ai-toolkit

Why is it trying to download the full massive flux model again, why can't I point it to the already downloaded one I have?
>>
>>102013996
kohya is absolutely ass right now (always was honestly)
>>
File: d_0009.jpg (163 KB, 1920x1080)
>>
>>102014000
      model:
        # huggingface model name or path
        name_or_path: "F:/ai/models/FLUX.1-dev"
>>
Do scripts or nodes for batch captioning with JoyCaption exist?
>>
File: 1724268407.png (9 KB, 287x280)
>>102014000
you can but it needs the whole setup structured in the same way as black forest labs' HF page
>>
>>102014003
are you on ai-toolkit?
i just switched to branch sd3-flux.1 on kohya and read through this
https://github.com/kohya-ss/sd-scripts/blob/sd3/README.md
i'm still testing but one model has come out ok'ish
>>
>>102013963
In the regular IPAdapter node there is a list of modes: one is just the style, and it works quite well. I don't know about these nodes, they seem to all be custom. I'll set everything up after dinner and see.
>>
>>102014043
>diffusers
>>
>>102014061
yeah ai-toolkit
tried kohya for full finetuning but it was extremely slow and they don't even have samples over time so no way to properly see if you're not wasting time
for now I'm sticking with high rank Loras
>>
>>102013609
I just followed kohya's instructions on the GitHub page for the sd3 branch (it covers flux, don't use the branch called flux) and then edited the command line to my preference with some trial and error. make sure you update your python, accelerate etc like the page suggests, I got errors otherwise. and this is what I used for this one if you want to edit it to your liking:
files.catbox.moe/w3teku.txt
(comment too long if I put it in code tags, sorry)

adamw8bit is probably a "safer" bet. it also might be worth trying 2e-4 LR, I found 1e-4 to be undercooked but it looks like 3e-4 might've fried fingers a little (if that wasn't from the bucketing). I recommend keeping your dataset under 100 images if you want it to be done overnight. if you need longer training sessions utilize --save_state and --resume, I did 3 nights of a 500+ image dataset run without testing like a dolt only to realize I typo'd the LR and fried the entire thing. vowed to prune all my datasets after that lmao. I love making loras but I'm a certified retard desu
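
for a rough idea of the shape of the thing before you open the catbox file, a kohya sd-scripts flux lora launch looks something like this (a sketch with made-up paths and values, not the exact args from the file above):

    accelerate launch flux_train_network.py \
      --pretrained_model_name_or_path /models/flux1-dev.safetensors \
      --clip_l /models/clip_l.safetensors --t5xxl /models/t5xxl_fp16.safetensors \
      --ae /models/ae.safetensors \
      --network_module networks.lora_flux --network_dim 16 --network_alpha 16 \
      --optimizer_type adamw8bit --learning_rate 2e-4 \
      --max_train_steps 2000 --save_every_n_steps 250 \
      --dataset_config dataset.toml --output_dir out --output_name mylora \
      --mixed_precision bf16 --gradient_checkpointing --cache_latents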
>>
>>102014027
yeah, I wrote one
>>
Where do I actually learn what nodes are doing? I see my prompt connect to guidance, which connects to a guider, and all I can think is "what the fuck is guidance? is 3.5 correct guidance? what the fuck is a guider?"
>>
File: ComfyUI_00727_.png (1.45 MB, 832x1216)
from the Busty mesa
a Gooning shadow gross
standing in the Gushes
the Femboys of Dark Souls
>>
File: 00045-2927397766_cleanup.png (2.58 MB, 1280x1920)
>>
>>102014091
Good job!
>>
>>102014043
hm I just tried this but am getting
ValueError: Z:\black-forest-labs\FLUX.1-dev\transformer\ does not appear to have a file named Z:\black-forest-labs\FLUX.1-dev\transformer\diffusion_pytorch_model-00001-of-00003.safetensors which is required according to the checkpoint index
>>
I've manually described 55 images so far and I'm hard as diamonds. I think I've got a good variety of poses and outfits. When should I stop?
>>
Any recommended temperature settings for JoyCaption?
>>
File: ComfyUI_00735_.png (1.32 MB, 832x1216)
font reminds me of the Colombian drug lord's apartment in Predator 2
>>
>>102014210
>>102014043
oh ffs do I need these 3 10gb files as well as flux-1-dev in the higher level? wtf
>>
>>102013088
Kinda want to make a RuneScape Lora for no other reason than to generate funny pictures. Haven’t even touched the game in over a decade. Midjourney can do it natively but flux doesn’t know it at all.
>>
File: 2178570291.png (1.61 MB, 896x1152)
>>
>>102014241
the whole setup, structured the same way

it's a pain but at least it works
>>
>>102014261
reeee my internet is so slow that will take hours, whatever thanks tho
>>
File: 00050-545778476.png (2.34 MB, 1280x1920)
>>
>>102014269
it would be faster for you to learn how to modify the script to not be so retarded
GPT/Claude can help
>>
File: ComfyUI_00810_.png (1.78 MB, 1152x1536)
>>
TroonMix 7.7
>>
>>102014269
another problem i hit was the config.json in /text_encoder etc getting downloaded as text_encoder_config.json and then ai-toolkit couldn't find it

copy/pasted the config.json and renamed it so both paths were covered just in case
>>
File: 00049-295344137_cleanup.png (2.53 MB, 1280x1920)
>>
>>102014341
>ai face sloppa
>>
File: bComfyUI_107072_.jpg (234 KB, 768x1024)
>>
Is there a big naturals lora for Flux yet?
>>
File: 625907938.png (1.35 MB, 832x1216)
>>
File: ComfyUI_00814_.png (1.1 MB, 864x1152)
>>
File: bComfyUI_106887_.jpg (952 KB, 3424x1728)
>>102014440
>breathe the sun
>>
Literally every single Flux lora I tried completely destroys its ability to make pixel art
>>
File: flux_00345_.png (1.46 MB, 968x1120)
>>
How do I use flux loras in comfyui? When I try to load it like a normal lora it hangs and doesn't finish generating an image. I'm using the fp8 checkpoint
>>
File: ComfyUI_00815_.png (1.54 MB, 1032x1376)
>>
I suppose it's not ideal, but what are the implications of stopping the training at 500/2000 and adding more images to the dataset?
>>
File: 00071-AYAKON_1248183.png (1.75 MB, 1280x1280)
>>
>>102014525
I don't actually know but do all loras work with quantized versions of the model? ie if it was trained using full dev does it work with fp8
>>
File: ComfyUI_21215_.png (2.33 MB, 1920x1080)
>>
What's the best option for img to img, local or not? Specifically I want to turn a 3d image of a woman into a realistic one, pic rel. If you want to give it a try, i'd appreciate it.
>>
>>102014565
Yes.
>>
File: ComfyUI_21129_.png (2.35 MB, 1920x1080)
>>
>>102014576
is this your boyfriends second life avatar?
>>
I'm having trouble with flux prompting, having been so used to SDXL. They say just use natural language but I'm stumped
>>
>>102014597
yes my username is bardfinn
>>
File: ComfyUI_21283_.png (2.54 MB, 1920x1080)
>>
>>102014597
yes
>>
File: ComfyUI_21143_.png (2.57 MB, 1920x1080)
>>
So, how long should I expect to wait until we start getting some finetuned models that support danbooru knowledge/tagging? I miss the power that gave me to be specific with outfits, plus character knowledge..
>>
>>102014626
two weeks
>>
>>102014576
>describe girl in prompt
>use control net to keep pose, canny, etc
>roll
>pray
>>
>>102013347
Full size: https://files.catbox.moe/brpmk3.png
here are the 1024*1024 gens with the same settings. early to mid epochs had my sides in stitches, but the later ones turned out better than I expected desu. I will be back either tomorrow night or the next day to compare how training at 1024*1024 turns out instead. fingers seem a bit wonky and the finer details are messy, but I left resizing for 512*512 to the fate of kohya's bucketing system and I can only imagine that made things worse
>>
>>102013347
>captioned with both local joycaption and booru tags, using the wildcard arg
How do you do this and what do the final prompts end up looking like?
>>
File: ComfyUI_21152_.png (1.81 MB, 1920x1080)
>>
for flux loras is it looking better to just include the joytagger output or to also include the booru tags of the image if you have them?
>>
>>102014691
no one knows, anon. We're all just trying things out. Tbh I think captions and tags are pretty irrelevant atm, the current scripts aren't even training the text encoder and t5 is recommended not to train at all.
>>
>>102014105
>Yvonne Strahovski
Don't even need to ask if it's flux. Catbox or prompt? I really like the artstyle. Is it a lora?
>>
File: 1706834065102747.png (1.12 MB, 1024x1024)
damn bitch you live like this?
>>
File: ComfyUI_21221_.png (2.65 MB, 1920x1080)
>>
File: flux_00355_.png (1.35 MB, 968x1120)
>>
>no news for months
>flux
>that's not really bringing something new to the table
Are we in an age of stale or something?
>>
File: ComfyUI_212321_.png (3.25 MB, 1920x1029)
>>
>>102014781
kek
nothing ever happens
>>
>>102013479
And out of those 12 thousand the lowest tier they are paying is 4.50 quid while the highest tier is 1,186 quid.

So this guy must be making a minimum of 50k quid a month...

And he's STILL hungry for more.
>>
File: ComfyUI_21295_.png (2.39 MB, 1920x1080)
>>
>>102014781
bro the boomer prompting and the text bro
>>
>>102014637
Are you new to this?
>>
>>102014781
>t5
>up to 2mp native support
>super effective loras
>trainable with local hardware
if Flux doesn't get your nipples hard, then I'm not sure what would
>>
File: 00013-1151662440.png (3.4 MB, 1280x1920)
>>102014728
>Is it a lora?
Yes, and it's not Flux. Just copy/paste the prompt from civit.

https://civitai.com/models/87167?modelVersionId=311382
>>
>>102014691
Write comprehensive and well written descriptions of the images by hand. That’s the Flux way.
Also holy shit is this thing easy to train.
>>
File: ComfyUI_00615_.png (1.12 MB, 1024x1024)
Where do I see which models are loaded in VRAM and which in system RAM? The stdio output of Comfy only notes if and when models are loaded (and if “completely” or “partially”), but I can't see what they're loaded into.
I tried the Q5_K_S quants for both unet and t5xxl, and my gens actually take a few seconds longer (24.5 seconds per image with Schnell and no LoRas) than with my older setup of fp8_e4m3fn for the weight type and the t5 model. (19 seconds per image)
nvidia-smi shows VRAM being almost fully utilised in either scenario, system RAM usage seems to be noticeably higher when not using quants, but the quant setup still uses some system RAM
12 GB VRAM btw
>>
File: 00138-486044689.png (2.21 MB, 1024x1528)
>>
>>102014819
My mistake. Yvonne Strahovski shows up all the time unprompted for me on flux. I just assumed it was her again. Yeah I can do that artstyle on pony models easily enough.
>>
>>102014723
???
You don't need to train the text encoder in the same way you don't train the clip. The text encoder converts the prompt into machine language, so your captions absolutely *do* matter. The only difference is it's not braindead retarded to begin with, unlike SDXL which doesn't even function without training if you're doing a new concept. The T5 doesn't give a shit about new concepts, it converts everything fine because it's basically trained on everything to begin with, so it knows basically every concept from The Pile (or the equivalent).
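
if "converts the prompt into machine language" sounds abstract, this is roughly all it means (a minimal transformers sketch; the base-size t5 here is just for illustration, flux actually ships t5-v1_1-xxl):

    from transformers import T5EncoderModel, T5Tokenizer

    tok = T5Tokenizer.from_pretrained("google/t5-v1_1-base")
    enc = T5EncoderModel.from_pretrained("google/t5-v1_1-base")

    ids = tok("a man with long white hair and Chinese style clothing",
              return_tensors="pt").input_ids
    # one embedding vector per token; novel words still get encoded fine
    emb = enc(input_ids=ids).last_hidden_state
    print(emb.shape)  # (1, seq_len, hidden_dim) - what the diffusion model conditions on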
>>
>>102014654
I got chatgpt to edit the joycaption script to mass queue all the images in a folder then save the caption output as individual .txt files. I do the booru tags the normal way with BooruDatasetTagManager and randomly shuffle them (I made a script for this with chatgpt also, but I think the setting is built into the dataset tagger; I just had these ones pretagged from sdxl). I combine them into one .txt file that looks like this:
Line 1:
>Booru tags
New paragraph
Line 2:
>joy caption prompt (make sure there are no new paragraphs and it's all one solid block)

then in my kohya ARGs I use --enable_wildcard
this makes it so it randomly switches between the two lines for the chosen caption during training (booru tags vs boomer prompt). whether this is optimal for captioning or not I don't know, being a vramlet I haven't done extensive testing and am happy enough with this process as is

here is an example image from my ibuki satsuki dataset and the accompanying captioning: https://files.catbox.moe/8km0lp.jpg
https://files.catbox.moe/28qr0q.txt

as for how that translates to prompting when genning, well, you can see what tiny shitty prompt I used for my test images lol.. didn't need to echo the boomer prompt by any means
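
if anyone wants to replicate the combining step, it's basically this much python (a sketch written from the description above; the folder layout is my assumption, not the actual chatgpt script):

    import random
    from pathlib import Path

    tags_dir = Path("tags")        # booru tags, one .txt per image (assumed layout)
    caps_dir = Path("joycaption")  # joycaption outputs, one .txt per image
    out_dir = Path("dataset")      # combined caption files go next to the images

    for tag_file in tags_dir.glob("*.txt"):
        tags = [t.strip() for t in tag_file.read_text(encoding="utf-8").split(",")]
        random.shuffle(tags)  # shuffled booru tags
        caption = (caps_dir / tag_file.name).read_text(encoding="utf-8")
        caption = " ".join(caption.split())  # one solid block, no paragraph breaks
        # line 1: booru tags, blank line, line 2: boomer prompt;
        # kohya's --enable_wildcard then picks one of the two lines per step
        (out_dir / tag_file.name).write_text(
            ", ".join(tags) + "\n\n" + caption, encoding="utf-8")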
>>
>>102014841
Also, you can train on 24GB without low vram mode with KDE or any other light DE (or if you don’t have a display server running at all, I guess). Which is neat.
>>
First attempt testing Flux IpAdapter (focusing on style transfer for fine arts) I rate it: hey it works! and 2/10
>>
Sub 100 properly tagged/described images > quantity of sloppy automated tagging
>>
>>102014906
Thank you keep up the good work anon
>>
>>102014923
lmao whatever helps you cope
>>
>>102014890
>Yvonne Strahovski shows up all the time unprompted for me on flux
show us an image that you think looks like her
I think your brain is fried
>>
File: bComfyUI_107210_.jpg (243 KB, 768x1024)
>>
File: flux_tmp~2.png (2.94 MB, 2304x1792)
>>
https://education.civitai.com/quickstart-guide-to-flux-1/
>We’re finding and hearing that captionless training is better than long narrative style captions (or Danbooru captioning)! Try your next training session with no captions!
What did they mean by this?
>>
>>102014963
Neat. Prompt?
>>102014973
Neat, as always.
>>
File: Heresy detected.gif (1.56 MB, 498x498)
>>102014978
what DID they mean by this?
>>
File: chibimetroid.png (961 KB, 1024x1024)
>>102013314
Because if I wanted to fill up the prompt with something, I'd do it with more relevant things to get closer to what I'd imagine the prompt to produce, like picrel.
I could probably get closer if I kept removing things and adding things to the prompt, but I already have a 8MB text file with prompts to try out, and it keeps growing, so working in a single one seems like a waste.
It's just that with Flux I'm not really looking forward to seeing what it does for "The invention of gravity" because it'll probably be something bland.
>Pixel Art. Classic video game screenshot from the Super Nintendo. On the left there's a chibi Girl wearing a golden armor bodysuit with a red helmet and emerald pointing bazooka at giant lizard with red eyes on the right. It is night with black sky. Bricks tunnel. Blue floor made of squares, teal ceiling with bushes, a closed pink door at the right and orange spikes at the right. Life counter at the top with a checkered pattern.
>>
>>102014978
Huh? As in training with nothing associated to the images at all? I guess it works for style LoRAs and other things you want to apply indiscriminately. But not for characters or objects.
>>
>>102013088
Can I get top left with flux yet?
>>
File: grid-0463.jpg (319 KB, 1792x2304)
>>
>>102014929
thanks fren, good luck on your ventures if you're training
>>102014978
idc what they say I'm captioning
>>
>>102014999
My point is you get the bullshit generator you wish you had by putting an unhinged LLM on your prompts. It's a stupid thing to whine about anyways, SD 1.5 still works if what you want is somewhat relevant random images.
>>
>>102014978
>Phenomenal likeness capture can be achieved with Kohya, 20 images, ~1000 steps.
I hate when people say shit like this without stating the repeat quantity. 1,000 steps can mean a lot of different combinations of repeat/epochs
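
to spell out the ambiguity (illustrative numbers): total steps = images x repeats x epochs / batch size, so at batch size 1 all of these are "1000 steps":

    20 images x 10 repeats x 5 epochs = 1000 steps
    20 images x 5 repeats x 10 epochs = 1000 steps
    20 images x 1 repeat x 50 epochs = 1000 steps

same headline number, but per-epoch saves and LR schedules land in very different places across those runs.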
>>
How many people on civitai are just generating a bunch of images with sdxl and using them and their prompts to train loras?
>>
>>102015028
You need control anyway, if you are doing something with different clothes, characters, expressions and so on, you would want it to know that when you want to generate the images lol
>>
>>102014978
Sounds absolutely retarded if you're training an Emma Watson Lora. It doesn't even make sense, words mean something, words are intent.
>>
File: 04200-Maybe an image.png (1.07 MB, 896x1152)
>>
File: 1692978537356205.png (1.45 MB, 1536x612)
>>102014781
Flux is a huge improvement, but it's still lacking the ability to follow prompts as well as corpo models. I know for a fact the knowledge can and will be expanded, but I'm not sure the complexity of prompts will be. The reason "bing image creator", as grubby as it sounds, took off was because people were able to type "goku robbing a gas station CCTV camera" and get close to what they were thinking about. Reminds me of craiyon which, while garbage quality, would ALWAYS try to the best of its ability to do what you were asking it to do. Flux is doing its best, but it drops concepts in the prompt left and right (in this example first person, doomguy, old doom graphics, etc).

>>102014999
very cute output, good example of what I mean about following instructions and getting the details of the prompt right
>>
File: flux_00087_.png (1.77 MB, 896x1160)
>>102014959
You're just jealous that Yvonne never appears to you.
>>
>>102015062
There are loads using Pony images and it baffles my mind and pisses me off, so damn dumb.
>>
>craiyon
>>
>>102013681
Welp, it's like technology is supposed to work in some way but in practice people just have to point at stuff like this as proof it's all slop.
>>
>>102013088
patiently waiting for some nerd to create a local generator that is a simple .exe file with big buttons on all the commands I need. cba installing a bunch of random shit and doing python commands etc
>>
File: flux_00104_.png (1.69 MB, 896x1160)
>>102014959
>>
>>102015089
>>102015111
yea, your brain is fried, anon
lmao
>>
>>102015097
It was trash, if it's still around it's still trash. But I will stand behind what I said, every prompt was a complete slurry mess but by god did it somehow get within the ballpark of what the prompt asked for in natural language.
>>
>>102015082
>it doesn't do Messi drinking a can of coke in the style of Doom 92 while he's playing Blitzball where the art direction is inspired by a dunk Picasso so it's bad
>so what if you can train a Lora doing exactly what you want in an hour, I expect it to compete against megacorpo models running on A100s and trained by people who hand curate and caption datasets
>>
>>102015097
>>102015127
256x256 craiyon images were more detailed than any model on any architecture; 4 stitched together would be higher than stuff frfr ong
>>
File: flux_00110_.png (1.61 MB, 896x1160)
>>102015126
I mean, it's not literally Yvonne Strahovski, but it definitely has some of her features right? I'm not a schizo. I'm not.
>>
I trained a lora on a particular girl who sports a healthy bush. I described her pussy as “hairy pussy” every time. But my samples are showing images of her with hair covering her whole belly and part of her nipples too more and more as the training progresses.
What might I have done wrong? Using the default settings from ai-toolkit.
>>
>>102015145
kek I remember that schizo
>>
>>102015152
>Yvonne Strahovski
Looks more like Megan Fox with blonde hair
>>
>>102015152
if by "some of her features" you mean generic pretty blonde then yeah... otherwise no, I'd never say any of these look like Yvonne
>>
>>102015133
>in an hour
How do you do that without creating something shitty?

Fuck. It’s a million degrees in this room right now. Holy shit this will be great in winter.
>>
>>102015102
no, retard. that is not slop. THIS is slop. get it right, monkey.
>>
>>102014978
my first train attempt was with no captions and it was shit compared to when i added the captions
i guess if you use class images no captions might work (or not)
>>
File: 00053-2014742755.png (2.89 MB, 1280x1920)
>>102015152
>but it definitely has some of her features right?
Other than being blonde, no.
>>
>>102015156
the model has zero concept of "hairy pussy", you will need other images of hairy pussies to demonstrate what you mean, otherwise you can keep training and eventually it will learn after overbaking on that person
>>
>>102015159
Does anyone remember the guy that always wanted people to make gens of the asian girl at his work he's obsessed over, who he would describe in great detail to get a good gen?
>>
File: you.png (1.35 MB, 1036x1200)
>>102014576
>If you want to give it a try, i'd appreciate it.
kys beggar
>>
File: FluxDev_02386_.jpg (252 KB, 832x1216)
>>
>>102015082
>google image fx
what the fuck is that?
>>
>>102015184
>30 images of Doom screenshots
>get magic in 250 steps
>>
alright, so i have a dataset of ~150 pics i want to train a lora on
synthetic captioning hallucinates too much and makes too many errors, i want to ensure quality gens
just how detailed would my captions need to be if i'm training a character? if all the pics contain the character (but maybe different angles, or together with other characters) could i just copy-paste the character's description into each caption and be done with it? what parts would i actually need to describe?
>>
>>102015216
if it is recent then it is the Imagen 3 model by Google
>>
>>102015224
I don't mean to alarm you, but the model wasn't trained on perfect captions
>>
>>102015224
>synthetic captioning hallucinates too much and makes too many errors, i want to ensure quality gens
You can always fix the captions manually
>>
File: flux_00198_.png (1.62 MB, 896x1160)
Yvonne Strahovski has blessed me once again. Praised be!
>>
>>102015204
From what I'm seeing so far, it will output a Chewbacca monster if I try to gen her, but it will do normal nakedness for other random girls.
But thanks. Next time I try this I will include a wider variety.
>has zero concept of hairy pussy
Or any pussy at all, really
>>
File: result.jpg (188 KB, 750x520)
>>102014921
I don't think this can improve without an advanced ipadapter node. The text capabilities seem damaged, but it shows promise (1st image is the reference for ipadapter and the others are different attempts with different seeds, guidance and ipmodel strength)
>>
Who is this? Is it like Matgeek's son or something?
>>
>>102015071
my thoughts exactly. even for a style lora, my ibuki satsuki style choice is a perfect example of where captioning can help a lot - no way it can tell the androgynous ass xianxia men apart from the females in 75% of the dataset images without captioning. I'm also stubborn in that when training for SDXL no captioning on style was also "recommended" but looked like total ass compared to captioned. you just can't convince me that no captions genuinely beat captions, rather than the captions in their test group simply being bad. but end of the day, if no captioning works for them then power to them... I'm still not sold, personally
>>
File: woodshop.png (930 KB, 1024x1024)
>>102013345
Well, that got me curious. I wonder if there's a "wood shop" short word to make something very detailed like this but for a video game like super metroid. Also I had to change "little" for "chibi" or it would go to a place like this instead >>102013314
>Chibi girl in Super Metroid video game in a wood shop.
>>
File: joybatch.png (65 KB, 816x615)
https://files.catbox.moe/a9tbk3.json
A simple batch Joy Caption workflow for fellow brainlets, change the path and set the batch count in the menu to match the number of images in your folder.
>>
>>102015207
Yes. Coworker anon. The girl with the green cardigan.
>>102015219
Just 250? What other params?
>>
File: 00208-2024-08-21-cJak.jpg (3.1 MB, 2048x2688)
>>
>>102015271
The joy is in learning. Despite what they say, this space is quite creative. Want me to pick your images for you too? Maybe I'll write your captions as well.
>>
deis+ddim_uniform gives me sharper images but also gives me extra limbs, and even heads once, more often than euler+normal
>>
>>102013704
FLUX.1, what everyone in the thread is using:
https://huggingface.co/spaces/black-forest-labs/FLUX.1-dev
https://huggingface.co/black-forest-labs/FLUX.1-dev
>>
>>102015211
Actually pretty funny
>>
File: bComfyUI_107247_.jpg (262 KB, 768x1024)
>>
>>102015234
yeah, i know, but i've done 5k steps on my synthetic captions without any manual intervention and it still hasn't learned the concept well
looked at the captions afterwards and they're often riddled with errors about what item is what and where it's placed on the character's body

>>102015242
fair, just figured it might take longer scrutinising each sentence for errors then correcting them than it would be to write something from scratch
>>
>>102015266
thanks dude
>>
>>102015252
Yeh I agree, at the end of the day we are training these loras for our own needs, and I think we know best what our needs are.
>>
File: 1695564861871046.png (1.18 MB, 1024x1024)
>>102015133
>it doesn't do Messi drinking a can of coke in the style of Doom 92 while he's playing Blitzball where the art direction is inspired by a dunk Picasso so it's bad
I didn't say it was bad, I said it didn't improve in an area I was hoping it would. It's better than any previous SD at base. And yes, if one model can follow instructions in a prompt and the other one can't, one model is doing worse than the other in that way.
>so what if you can train a Lora doing exactly what you want in an hour
LoRAs are not a magic bullet for fundamental model issues. Used right they can let you produce images better than any corpo model but the model is not going to get better at understanding NLP
>I expect it to compete against megacorpo models running on A100s and trained by people who hand curate and caption datasets
You sound offended and I have to ask why because I'm not here to support corpo models. Obviously microsoft will do better than a team of less than 10. The point is not to prove that it does worse, but to discuss in what ways it got better than SD and in what ways it didn't.

>>102015286
>The joy is in learning.
Exactly. I'm learning Flux, making comparisons with the current image gen sphere, and having fun. Don't take my explorations as a personal attack on the base model or something like that.
>>
>>102013644
I know, right? And then there's lnkdn.safetensors which gives the greatest variety of ugly faces of any image model and nobody has given it a single heart at huggingface.
>>
>>102015224
i assume that image recognition models prompted to caption a picture produce a caption structured roughly like what they were trained on, so feed a few pictures to gpt4 because it's the SOTA and ape what it does
>>
>>102015349
I'm tired of faggots that can't even run the model bitching that it doesn't compete against 80GB VRAM corpo models. You really don't have anything interesting to say because at best you have the opinion of a spoiled child.
>>
>>102015266
this is probably a dumb question, but why does that need to be in comfy? can't you just run a script to caption things?
>>
>>102015336
Nice, using the Elden Ring lora right?
>>
>>102015382
that's basically a visual programming workflow, no different than a for loop in code doing captions, but it took someone 5 minutes to do instead of 15
>>
>>102015286
I don't know what any of the parameters in training mean, so you telling me why and how you can train with only 250 steps would lead me to ask other questions and learn. But enjoy that weird superiority fetish. It's not like you're the only source of information in the world.
>>
>>102015382
you can, i used this
https://github.com/bigdata-pw/florence-tool
it just werks
make sure you use
--task "<MORE_DETAILED_CAPTION>"
for some good captioning
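if you'd rather skip the tool and call Florence-2 directly, it's roughly this (a sketch following the pattern on the Florence-2 model card; single image, no batching):

    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoProcessor

    model_id = "microsoft/Florence-2-large"
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
    processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

    task = "<MORE_DETAILED_CAPTION>"
    image = Image.open("example.jpg")  # one of your dataset images

    inputs = processor(text=task, images=image, return_tensors="pt")
    ids = model.generate(input_ids=inputs["input_ids"],
                         pixel_values=inputs["pixel_values"],
                         max_new_tokens=512, num_beams=3)
    text = processor.batch_decode(ids, skip_special_tokens=False)[0]
    caption = processor.post_process_generation(text, task=task,
                                                image_size=image.size)[task]
    print(caption)  # write this out to example.txt for training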
>>
>>102015405
Zoomers are so pathetic, it's like they've forgotten how to use the internet. There are better places to get your spoon feeding.
>>
I finally managed to get Flux lora training to start with kohya, but the loss is very high at 1.93. Is this normal for Flux?
>>
>>102015382
needs top_kek parameter
>>
>>102015209
hot
(nta)
>>
>>102015425
my last attempt was at 3.43 lel
it did mostly work, i'm gonna try another tonight
>>
>>102015405
>it turns out to be that weird ugly guy from github and he won't give you any info because he needs it for his patreon
>>
>>102015425
I haven't had anything above .4
>>
>>102014288
Was this in flux?
>>
>>102015453
>no I won't even do the exact instructions posted on the Github, you need to tell me what to do exactly
>I expect you to spend way more time teaching me than the time I will spend training (spoiler: I won't be training)
>>
JoyCaption or CogVLM-2? How do they compare? (And how do I get the latter running on Windows, preferably without WSL?)
>>
>>102015478
TroonMixXL
>>
File: bComfyUI_107250_.jpg (260 KB, 768x1024)
no it's just flux, i haven't fucked with loras yet doubt i will any time soon.
>>
>>102015487
joycap is llama3.1, and it's alright
>>
>>102015423
It's because late 90s/early 00s kids were all actually using PCs a lot, for MSN Messenger, video games, hacking etc, which came with all the issues PCs have that we had to deal with.

Meanwhile the new generation is growing up using iphones and ipads where everything is finetuned to cater to their needs, so they never actually understood the inner workings of anything.
>>
File: 1699087107261870.png (588 KB, 821x821)
>this is very dangerous for our democracy
>>
>>102014601
Post a pic that already looks like what you want, so people can help.
>>
>>102015501
We're so fucked, they have zero confidence and initiative and they are paralyzed the second they have to make their own decisions. Apps have absolutely fried their brains and make them dysfunctional.
>>
File: 1699921812062305.png (55 KB, 769x863)
>>102015480
>i wont spoonfeed, learn it yourself
>the official documentation
>>
>>102015453
You are just Jealous of my brain, come join patreon and I can teaching you 1 to 1 and maybe you brain can becomed amaze like me.

I have 12 THOUSAND patreon fans, you are NOTHING to me
>>
number of loras going up on civitai for flux is insane
>>
>>102015480
jokes on you, I'm the 12gb lora anon who's been replying with detailed explanations of what I do when anons ask. gatekeeping is dumb, so is trolling with it - shit or get off the pot
>>
>>102015532
yeah that snark and bad faith image really showed me
>>
>>102015478
No, it's 1.5

>>102015490
Get back to your containment general
>>
>>102015539
why are you asking stupid questions if you already know the answers?
>>
lel
>>
>>102014815
Something like this:
https://huggingface.co/spaces/gokaygokay/FLUX.1-dev-with-Captioner
You input a picture, you get a picture and a prompt, that's the dream.
Except right now the picture you get is completely different, specially if it's a style it doesn't recognize.
>>
>>102015538
>tfw will NEVER EVER get a nazi lora on there
my waifus...
>>
>>102015519
imagine life in 15-20 yrs
>>
>>102015558
OK I admit I'm that guy, I just a bit desperate because 12000 fans are waiting for me to teahcing them , so I need YOU TO TELL ME WHAT TO DO
>>
>>102015558
I'm not the anon asking, I'm just the one making fun of you for acting like it's so hard to give them an answer. if I had to guess, you haven't actually trained what you claimed and have no answer to give them - you'd rather troll for your (You)s for some godforsaken reason
>>
>>102015581
probably working while the zoomers get put in the pods as their parents drop dead
>>
>>102015581
The generation after won't even be capable of imagining, that's how spoonfed they will be, that's how low their attention span will be, they will have no need to actually think or imagine anything.
>>
>>102014898
>The T5 doesn't give a shit about new concepts it converts everything fine because it's basically trained on everything to begin with
It's clueless about Final Fantasy's Blitzball.
>>
>>102015591
probably just wants to scare new local users away so they give up and use corposlop instead
>>
>>102015211
did you prompt it as a body pillow? anyways i need a pillow like that in real life now
>>
can one abuse HF to train a lora?
>>
>>102015591
you have spent more time complaining about getting answers than the time it would take to google search your answers (or read the thread)
again, bad faith, disingenuous with zero intent
I'm not one of your Zoomer youtube videos that you pretend to watch to "learn" something
speaking of, there is a Youtube video for Flux Lora training, go watch it
>>
>>102014815
>if Flux doesn't get your nipples hard, then I'm not sure what would
There are no nipples
>>
Training fluxd takes too damn long desu
>>
>>102015610
yeah, tried dakimakura first but it has no idea what that is
>>
>>102015618
I made that video
>>
>>102015605
The T5 doesn't make images you ignoramus.
>>
how much faster can i gen if i rent a a100 or h100 pcie compared to say something like a 4090? this is just for genning, not for creating a lora.
>>
>102015618
confirmed troll. no more (You)s for you, you can't even read what you're replying to
>>102015622
I leave it on overnight/into the morning if needed while I do other stuff and am trying to keep to a sub 100 dataset. makes it a lot less painful. definitely miss the 30 minute sdxl training though
>>
>>102015648
You're a very helpful person anon and not at all a tool.
>>
>>102015045
>SD 1.5 still works
I still have to jump from model to model depending on what I want. If only someone gave me my Omni SD 1.5 model, a Stable Diffusion 1.5v2 of sorts, I think I could move on with my life.
But a jump like
SDXL -> Flux -> the next big thing
seems more likely.
>>
File: ComfyUI_00627_.png (1.11 MB, 1024x1024)
>>102014842
I took the faster original workflow and only replaced the unet with the Q5_K_M one, and generations slow down to 24.5 seconds as well
The Q5_K_M t5xxl model alone had no negative effects, generations still at 19 seconds.
Why is it that the quantized unet slows down generation by a whopping 25%? (Using the latest version of comfy, freshly git fetch/git pulled.)
>>
Is there a way to get a real time graph showing loss in ai-toolkit like kohya has?
>>
>>102015662
I'm going to give you a protip: no one is making a shitty SD model that generates random gachas 20% based on what you wrote.
>>
>>102015082
>Reminds me of craiyon which, while garbage quality, would ALWAYS try to the best of its ability to do what you were asking it to do.
Yeah, so why did nobody make some "Craiyon makes the compositions" and another model finishes the picture workflow? Wasn't DalleMini open source?
Craiyon was much better at this than Stable Diffusion 1.4 itself, why hasn't anybody been able to replicate such a thing?
>>
>>102015698
>>102015145
>>
File: bComfyUI_107292_.jpg (246 KB, 768x1024)
>>
bumbo
>>
File: result2.jpg (311 KB, 750x520)
Above SDXL images as reference for IPadapter. Below Flux.

I think this shows more promise than the style loras I've tested to date.
>>
please someone tell me an addon exists for comfy where it auto generates prompts, and then gens however many you want and then goes to the next prompt. this would be a lazy but cool way of finding cool stuff.
>>
>>102015647
if you are just genning images it wouldn't actually be that much faster. H100s are power limited (around 350 W for the PCIe card), they just have a ton of VRAM on a huge memory bus. Renting a 6000 Ada would be interesting, way cheaper than an H100 and you still get 48 GB of VRAM so you can easily load the full model and loras.
>>
>>102015671
It dequants on-the-fly I believe.
So you might be paying the toll in the sense that there's that extra dequant compute overhead.
I think it dequants to FP16 - but maybe someone can correct me?
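as a purely illustrative sketch of where that toll comes from (toy blockwise layout, not the real GGUF Q5 format):

    import numpy as np

    # a quantized block stores small int codes plus a per-block scale and offset
    def dequant_block(codes: np.ndarray, scale: float, zero: float) -> np.ndarray:
        # on-the-fly dequant runs this multiply/add for every weight on every
        # forward pass, instead of paying the cost once at load time like fp8
        return codes.astype(np.float16) * np.float16(scale) + np.float16(zero)

    codes = np.random.randint(0, 32, size=32, dtype=np.uint8)  # 5-bit codes
    w = dequant_block(codes, scale=0.01, zero=-0.16)
    print(w.dtype, w.shape)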
>>
My attempt at a porn lora is genning literal necrotic limbs.
>>
>>102015127
>if it's still around it's still trash
Last time I used craiyon it was worse than SDXL and rewrote your prompts to censor them, causing it to draw nothing like what you sent or to refuse them outright. But apparently they turned it into a very profitable venture, so it turns out most people are like flies attracted to a bad smell.
>>
>>102015767
dogkennel
>>
ultimate sd upscale seems to be adding ghost limbs to the right side of every gen i make now, what settings affect this? i figure it might be the sampler but i usually just set the sampler to the same one i use for imagegen.
>>
>>102013300
yeah the other guy is right, I was using a1111 for over a year right up until flux, you wanna move to comfyui immediately
>>
>>102015186
Different kinds of slop.
>>
File: 00069-993734854_cleanup.png (2.93 MB, 1280x1920)
>>102015790
Show side by side example. What's your denoising
>>
>>102015765
Also, Flux support apparently added to SD.cpp.
>>
>>102015423
You're strawmanning hard, friend.
>>102015501
>>102015519
I still remember hopping on IRC to ask questions to more knowledgeable people in the early 2000s when I was learning to compile the kernel or whatever. This is no different. You're projecting a gripe you have onto someone who isn't displaying what you complain about, at all.
Like I said, I will figure it out. You can stop tugging on your dick now lest you break it.
Or not, keep dooming hard about zoomies. Whatever gets you off.
>>
File: its me hatsune miku.png (1.4 MB, 1024x1024)
>>102015698
craiyon as a model has interesting qualities but sometimes you're missing the forest for the trees when you inspect a model for its specific strengths. the forest is a nightmarish slurry of pixels, i blame nobody for not iterating on this in the foss environment
>>
>>102015812
>I always needed to be spoon fed
>>
had been generating exactly what I wanted and then...
https://files.catbox.moe/52vte7.jpg
>>
My migu keeps making this face bros...
>>
okay using my lora, I can make a 1024x1024 image every <60s on my GPU, but as soon as I change the CFG to anything greater than 1, it turns into 20 minutes per image. wat
>>
File: aseet.jpg (20 KB, 542x375)
>>102015767
>>
>>102015082
All of these closed models have an LLM layer to enhance your prompt and fill in the details. How hard is that to understand? Flux dev is using exactly what you put in and you're surprised? ffs
>>
>>102015228
Apparently it's been there since February?
This is some mandela effect shit, people haven't ever mentioned there being free access to Google Imagen generations and when they do it's 6 months old tech?
>>
>>102015249
>it shows promise
It's the most terrible style copier I've ever seen.
>>
>>102015811
https://github.com/leejet/stable-diffusion.cpp/tree/flux
whoa!
>>
>>102015842
I think availability is what changed.
>>
>>102015838
At this point I'm convinced they're OpenAI employees. Either that or they're retarded thinking that local AI is going to compete against models with hundreds of millions behind them and enterprise GPU hardware. Flux is here but we're already on the cusp of another major release from OpenAI.
>>
>>102015804

bad slop >>102015186

good slop >>102013681
>>
>>102015249
I'd rather just get a textual embedding of the style at that point
>>
File: file.png (2.61 MB, 1024x1024)
>>102015782
It just doesn't understand what genitals are. It tries to emulate them by rendering deformed hands and limbs imitating the correct shape where the genitals should be. And suddenly this pic related lmao. It's really confused.
But now I think I understand how my captioning was deficient. Much of the stuff I described, the model has no reference to compare it to, and I threw complex scenes at it. With more simple images and granularity in introducing it to different things, it should work better. I think. I have no fucking clue what I'm doing. Also probably more training.
>>102015837
https://litter.catbox.moe/ds4pho.jpg
https://litter.catbox.moe/o4t8c9.jpg
>>
File: wudduhfuck.png (1.13 MB, 1463x902)
>>102015810
yeah i think it was the sampler scheduler, i shouldve set it to normal and i think the cfg scale fucked it too
shit im still learning how all this works, i really regret going lazy mode with forge all year till the past 3 weeks.
not getting ghost limbs now but i am getting weird artifacts like that strange black thing on her foot, which came from the first pass.
>>
>>102015879
>compete against models with hundreds of millions behind them and enterprise GPU hardware
SAI has had hundreds of millions behind them, BFL has at least 33 million behind them
>>
>>
>>102015924
SAI was embezzling money. No way they spent that on training. BFL still has to stay within the realms of 24 GB of VRAM, OpenAI can train on 192GB+ VRAM beasts and inference on 80 GB H100s. It's completely different targets.
>>
>>
>>102015936
DALL-E 3 is 8B+5B
>>
>>102015948
seriously doubt that lmao
>>
>>102015951
Believe it. It's all in the training data.
>>
>>102015968
>source: my ass
Either way I win, because 12B with proper finetuning will far exceed DE3 if it's 8B
>>
>>102015646
When you ask for it, it has no idea how to encode it in a way the image generator can use.
>>
>>102015895
This is done on the fly. There is no substitute for proper recognition of styles, but that needs a finetune or another version of flux. Are loras that different from embeddings? I have only tested loras and what I've seen has been very disappointing.
>>
>>102015879
>thinking that local AI is going to compete
Again, it's not a competition, it's a comparison. My goal isn't to find which model is "better" because that's a stupid and obvious thing to look for. I'm looking for how the local scene is improving on different fundamental skills like NLP comprehension. The other results just give a good idea of what a successful result looks like, if corpo models can't do it then it's not a good test to try and prompt Flux for it.
>>
>>102015978
It came to me in a dream and finetuning Flux won't be enough.
>>
>>102015995
???
I've already made Loras that far exceed the shit DE3 can do. Have fun with your dogs. I can't wait for DE4 with its thought crime layer.
>>
>>102015936
It's a nice meme to say SAI embezzled or sniffed their funding but the reality is they oversubscribed to AWS to feed Emad's ego on having over 4000 A100. Their AWS bill is millions per month.
>BFL still has to stay within the realms of 24 GB of VRAM
No they don't. The community has made it fit in 24GB. BFL train on A100 or H100 like everyone else.
>>
>>102016004
>talking about the censorship layered on top instead of the ability of the model
I accept your concession.
>>
>>102015981
I'm half tempted to make a blitzball lora to prove you wrong. But I won't. I'm going to give you a protip: the word "blitzball" is in the pile of trillions of tokens the T5 was trained on.
>>
>File upload IP range ban
Life is pain, a man can't even share some cute pictures nowadays.
>>
>>102016020
the largest model is 23.3 gb
>>
>>102016022
I already made photorealistic results based on what I wanted, well beyond the trite stuff DE3 does
>that's a concession
I'm merely pointing out that the fun you had with DE3 won't even be possible in DE4 as problematic prompts will be stopped without even needing to dog.
>>
>>102015487
I had better results with:
https://huggingface.co/spaces/SakanaAI/Llama-3-EvoVLM-JP-v2
JoyCaption is only good if you need it to recognize characters.
>>
>>102016045
>he thinks it's about photorealism and not the control and builtin knowledge
>>
>>102016022
Why would the uncensored ability of the model matter when you'll never have access to it?
>>
File: file.png (523 KB, 1513x836)
523 KB
523 KB PNG
>>102016067
I have ultimate control with LoRAs, whatever cursory knowledge DE3 might have of random topics.
>pic related
>>
File: Capture.png (274 KB, 762x654)
274 KB
274 KB PNG
>>102016050
>>
>>102015611
They only allow free abuse of their CPUs; if you can train a LoRA on a CPU, then sure.
>>
>>102016108
it's not wrong
>>
File: helpme.jpg (9 KB, 250x237)
9 KB
9 KB JPG
>>102015909
>https://litter.catbox.moe/ds4pho.jpg
>https://litter.catbox.moe/o4t8c9.jpg
>dat body horror
>>
>>102016108
I can see why it said that but damn. Do you know any models that pass this test?
>>
>>102016068
Because I'm not an entitled shit flinging monkey that only cares about what it can grab and destroy.
>>102016100
LoRAs will never match good base training; how many can you stack before shit falls apart?
>>
>>102016139
you don't need to stack on a 12B model
>>
>>102016133
>when you notice her face in the first pic
>>
>>102015133
holy fuck you're in every thread seething whenever someone drops a hint of criticism about the model. get over it. stop being a freetard.
>>
>>102016133
what's so bad about it?
sd3 user btw
>>
>>102016143
why? if you have LoRAs of x, y, and z, how do you use them together without stacking?
>>
>>102016150
you're in every thread repeating the same complaint, and it's not even accurate
>>
File: raygun.webm (3.52 MB, 640x385)
3.52 MB
3.52 MB WEBM
Anons is there anything as "good" as luma yet? I want to make trippy videos and laugh.
>>
>>102016185
>using a 12B model like SD
Flux Loras are exponentially more capable and have more surface area to work with as a rule
>>
File: Capture.png (877 KB, 1531x606)
877 KB
877 KB PNG
>>102016108
More detail but it's pretty garbage
>>
>>102016200
I need to see her cunny
>>
>>102016200
what do you think retard?
obviously if we had local luma there'd be an entire videogen thread. people even stopped sharing those attempts with SD a while ago; we're far from there yet.
>>
>>102016201
how does that change the problem if the concepts are in the x, y, and z LoRAs?
how do you use them together without stacking? you haven't explained that
>>
File: 1699570651828244.png (16 KB, 435x364)
16 KB
16 KB PNG
the hell is this???
>>
>>102015834
I think I know where his left hand is.
>>
>>102016210
She's like 40, she doesn't have one, retard.
>>
>>102016200
that is not Luma tho
>>
>>102016213
maybe having a lora for every single actress is a retarded idea? start there
>>
>>102016004

This guy is right: >>102016022
If you take into account that a paid API does not need to censor the model itself, there is no way a censored local model will ever surpass the potential of an API one. Whether you are allowed to fully utilize that potential is a different story.
>>
>>102016215
Sirs please think to the village
>>
>>102016224
>because women lose their genitals at 40
ok, bummer
>>
>>102016224
(Editor's note: "cunny", before the pedos took it, was a general term for a perfect, petite pussy.)
>>
File: file.png (614 KB, 1532x825)
614 KB
614 KB PNG
>>102016232
lmao
>>
>>102016242
That word doesn't mean what you think it means.
>>
>>102016249
this
>>
>>102016230
ah, so you'll make your own all-in-one LoRA, adding more and more data into it and retraining every time you do so
is that it? did I get that right?
>>
>>102016200
That took me off guard hard
>>
>>102016161
top kek
>>
>>102016256
stop being a pedo
>>
File: TickleYourFancy.png (944 KB, 768x768)
944 KB
944 KB PNG
>>102015683
Well, ZootVision is really close; it can probably give me the "by Flux" style in a future version, and it may even be more detailed than Flux's outputs after I merge it with a model like picrel.
>chibi girl in super metroid by dalle 3
(by dalle 3 is just a style in the prompt)
>>
>>102016232
Why do hypotheticals matter? Companies will never take the risk of completely uncensoring their API.
>>
>>102016261
Every model has a knowledge cut-off point. Does DALL-E 3 retrain for every new movie?
>>
File: kablam 800 percent mad.gif (657 KB, 193x298)
657 KB
657 KB GIF
you guys might think judging an imagegen model's knowledge here is stupid, but /lmg/ faggots are unironically asking local LLMs obscure or often stupid gaming trivia, then calling them shit if they don't get it

>mfw the faggots asking mistral large if it knows a specific quote from castlevania symphony of the night then calling it slop for making something up
>>
File: ComfyUI_09193_.jpg (1.16 MB, 2048x2048)
1.16 MB
1.16 MB JPG
>>102016271
>>
>>102016284
feel like you've completely lost track of what was being discussed anon
we were talking about loras being used to surpass DALL-E 3
>>
>>102016211
open source AI is over
>>
>>
File: file.png (2.68 MB, 1024x1024)
2.68 MB
2.68 MB PNG
>>102016312
you've lost track because you keep moving the goalpost. no one needs to stack dozens of LoRAs, and if you were, you'd fucking merge them into a model
you just want a desperate win for DE3 even though it's completely shit; it doesn't even do blitzball, which is hilarious
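for what it's worth, stacking and merging are both a few lines in diffusers now. sketch (assumes the PEFT integration; file and adapter names are made up):

import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")

# file and adapter names are made up; any flux loras work
pipe.load_lora_weights("my_loras", weight_name="style.safetensors", adapter_name="style")
pipe.load_lora_weights("my_loras", weight_name="character.safetensors", adapter_name="char")

pipe.set_adapters(["style", "char"], adapter_weights=[0.8, 0.6])  # "stacking", weighted
pipe.fuse_lora()              # "merging": bake the active adapters into the base weights
pipe.unload_lora_weights()    # drop the adapter modules; the fused weights stay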
>>
now I'm curious, what does flux do when you prompt for football players swimming in a giant sphere of water and throwing balls around
my GPU is occupied
>>
File: file.png (105 KB, 1139x461)
105 KB
105 KB PNG
Rate my temps
>>
File: file.png (584 KB, 926x1098)
584 KB
584 KB PNG
>>102016322
Flux's next model is literally video.
>>
File: 1703318082117541.png (1.28 MB, 1149x1862)
1.28 MB
1.28 MB PNG
tried to compare prompting styles; I don't think it makes much difference.
yeah, I know this has been done to death. I'm just autistic and had to see for myself.
model is flux
>>
>>102015822
Except its creators DID iterate on this, but once it could draw a face that wasn't deformed, it lost all the detail and composition and became pointless, because we already had the best SD1.5 finetunes by that point.
>>
>>102016271
never
>>
>>102016339
>you've lost track because you keep moving the goalpost
no anon, I did no such thing.
>you'd fucking merge them into a model
that's still stacking, try again
>>
>>102016362
go do something if you're bored, waiting only makes it take longer
>>
>>102015209
nice

is this flux? it keeps giving every girl a cleft chin for some reason lol
>>
>>102016385
I'm actually scared something might catch on fire. I just posted that to see what people would say.
>>
>>102016362
undervoltyourshit/1.5v
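unironically though, a plain power cap gets you most of the way there. sketch with pynvml (numbers are examples; setting the limit needs root, and nvidia-smi -pl does the same thing):

import pynvml  # pip install nvidia-ml-py

pynvml.nvmlInit()
gpu = pynvml.nvmlDeviceGetHandleByIndex(0)
# NVML reports milliwatts
print(pynvml.nvmlDeviceGetPowerManagementLimit(gpu) / 1000, "W current cap")
# setting needs root; 250 W is an example, same as `nvidia-smi -pl 250`
# pynvml.nvmlDeviceSetPowerManagementLimit(gpu, 250_000)
pynvml.nvmlShutdown()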
>>
>>102016407
take the tinderbox out of your pc tower and you should be good
>>
File: images.jpg (5 KB, 223x226)
5 KB
5 KB JPG
>>102015909
>https://litter.catbox.moe/ds4pho.jpg
>https://litter.catbox.moe/o4t8c9.jpg
Make it stoooop!
>>
File: Capture.png (15 KB, 239x505)
15 KB
15 KB PNG
>>102016362
>>
>>102016427
show the last images, you just did the same ones again
>>
File: file.png (11 KB, 249x138)
11 KB
11 KB PNG
How does this look? Are these sizes enough, or should I try to find larger ones?
>>102016438
Ah, OK.
>>
>>102016305
>obscure or often stupid
>"make this guy sit in a chair"
>*there is a guy and a chair*
It's not a knowledge problemmmmm, it's NLP comprehensionnnnn. Knowledge of things is only going to get better, and there's already injectable knowledge, but understanding how to put the concepts together intelligently, the way the prompt asks, is a different skill.
>>
>>102016004
Show me your LoRAs surpassing this; it has the prompt and all.
>>
File: flux_00399_.png (1008 KB, 1200x936)
1008 KB
1008 KB PNG
>>
>>102016508
nah but enjoy your DE3 grain
>>
>>102016200
That kind of performance would give her this and the next four titles.
>>
>>102016108
Skill issue, now try this prompt:
>Make a full and detailed long description of everything in this picture.
>>
File: Capture.png (315 KB, 1500x817)
315 KB
315 KB PNG
>>102016589
>>
>>102015909
Looks like insufficient training.
>>
>>102016626
Yeah, that's what SD1 looked like when I asked it for something obscure.
>>
>>102016215
Civitai implemented a system where most of the stuff on there is generated by people who only care about money, and the rest gets buried.
>>
>i can generate additional pics of my favorite porn actress past her prime
The future is here.
>>
>>102016363
fuck yea
I haven't bothered with flux at all. any good?
>>
>>102016508
>Dall-E 3 is very good at recognising characters
It should join forces with Akinator.
>>
>>102016680
>past her prime
you mean during her prime? why would you want a porn actress past her prime? you got a batwing beef flaps fetish?
>>
>>102016249
I used remote vision to look at it and it's very rough.
>>
>>102016689
>any good?
It's good but slow
>>
>>102016696
I meant she's past her prime now; I can still get new pictures of her (in her prime)
>>
>>102016689
there's an entire thread of 400+ posts of flux fanboys posting.
>>
>>102016339
Nobody else would know; you're the only one who knows what the fuck Blitzball is.
>>
File: file.png (679 KB, 768x432)
679 KB
679 KB PNG
>>102016721
Excuse me.
>>
>>102016696
Your tastes change as you grow older, anon. You will catch yourself looking at grannies in the future.
>>
>>102016721
OMG you never played final fantasy 69?
>>
>>102016735
>in the future
nah i do it now, but porn roastoids are different from hot 50-60 milfs.
>>
File: chibi girls.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>102016361
Who cares? But here's one with chibi girls at the start of your prompt.
>>
File: file.png (1.33 MB, 1440x832)
1.33 MB
1.33 MB PNG
>>102016689
Its meme value is off the charts and we're still figuring out how to train it.
>>
>>102016735
>your tastes change as you grow older
I'm sure nobody here would touch an actual cunny model with a ten foot pole
>>
File: ComfyUI_31_.jpg (1.54 MB, 2048x2048)
1.54 MB
1.54 MB JPG
>>102016735
>>
File: 1718185990179516.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
>>102016624
Perfect.
>>
>>102016624
what is the actual term for that, a half pipe?
>>
>>102016624
>the car in the middle is slightly ahead
>also the car on the right side is further ahead
>>
>>102016695
Meta.ai is another model that's good at characters (it can be used from WhatsApp, for some reason) and can do Betty Boop just fine. But it could never get close to anything like this.
>>
>>102016826
>>102015145
>>
>>102016821
banking or banked corner or banked turn
>>102016799
Generate it:
 
The image depicts a thrilling scene of three cars on a winding road. Two cars are driving on the road, while one car is in the air, performing a stunt. The car in the middle is slightly ahead of the other two cars, with the car on the right side of the road is further ahead.

The road is curving and has a concrete wall on the side, adding to the sense of speed and excitement. The cars' positions and the road's layout create a dynamic and thrilling atmosphere.

The cars' colors are not specified, but their shapes and sizes are distinct, with one car being particularly large. The image captures the
>>
https://x.com/69420digits/media How is he making these?
Also this is the best or at least most interesting ai generated music I've heard so far
https://x.com/69420digits/status/1817466612921831740/video/1
>>
>>102016707
Nope, when I post I complain about how Flux doesn't cover what I want to draw.
>>
>>102016754
Would
>>
>>102016825
Further ahead compared to what? You can write perfect captions and get the same result as the imperfect ones, because you're ultimately competing against the 15 million other captions that were auto-generated. The model is not suddenly going to get better at positioning things. Good enough is actually enough.
>>
>>102016850
>Good enough is actually enough.
No.
>>
>>102016739
Characters dying in Final Fantasy 3 broke my heart, and I quit the series.
>>
>>102016847
>Also this is the best or at least most interesting ai generated music I've heard so far
And the music generation has to do with the image somehow right? That's why I said this
>>
>>102016856
Actually it is, but keep letting your autism make you take 10x as long for the same result. But please, prove me wrong: do 100 auto-generated captions with a high-quality captioning tool, then 100 hand-crafted perfect captions, train two models with the same settings, and prove the difference (you can't).
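and for the record, the auto side of that experiment is a few lines. sketch with BLIP standing in for whatever captioner you prefer (the "dataset" folder is made up):

from pathlib import Path
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-large")
model = BlipForConditionalGeneration.from_pretrained(
    "Salesforce/blip-image-captioning-large"
).to("cuda")

# writes the sidecar .txt files most trainers expect, one per image
for img_path in Path("dataset").glob("*.png"):
    inputs = processor(Image.open(img_path).convert("RGB"), return_tensors="pt").to("cuda")
    out = model.generate(**inputs, max_new_tokens=75)
    img_path.with_suffix(".txt").write_text(processor.decode(out[0], skip_special_tokens=True))

100 images takes minutes; now do your 500 minutes of hand captions and compare.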
>>
>>102016751
The best generations I've seen from Flux have been blurry; the crisper, the faker.
>>
>>102016752
What do you mean an actual cunny model? Like Lehina Model?
>>
>>102016873
i can, it's intuitive; any fuckfreak could do it
>>
>>102016873
think I care about your requests?
>>
>>102016887
it's not intuitive because otherwise you'd know your captions were competing against a million captions that said left is right

>>102016889
of course you don't care, you have autism so you'll waste time because you have a mental disorder
>>
>>102016899
who shit in your cereal
>>
>>102016906
>make a stupid-ass claim that will waste other people's time
>get mad when confronted
>>
>>102016899
>>102016920
>>102016887
don't get it twisted, I use Gigacaptioner; 11/10 it gets it right, that way I can set it and forget it, you lone graboid
>>
>>102016920
>make stupid ass claim
nigga I just pointed out a funny mistake by the model
>>
ayo Niggas anyone bakin?
>>
File: Fluxs.png (764 KB, 1280x720)
764 KB
764 KB PNG
>>102016837
Damn, I wish I could tell you I won't because I'm very busy doing something else, but here it is: a perfect reproduction of your initial picture, thanks to the flawless description provided.
>>
baking, hold on, 66% there
>>
>>102016940
yamero
>>
>>102016826
I don't care how well it does one random character. A local base model that knew how to make every famous IP-protected character, but didn't know the names of famous painters whose public-domain works it could imitate for style, would be quite sad.
>>
>>102016940
>but if I wrote the caption perfectly I'm sure the result would be different, it'll be worth the 5 minutes per image
>of course we'll ignore that in the process of captioning 100 images for 500 minutes I'll make lots of mistakes or omit details the auto-caption would have
>>
File: 1715566822213630.png (495 KB, 512x768)
495 KB
495 KB PNG
is it just me or is it impossible to generate a proper image with "death to israel" on it using flux?
you can generate images with death to any other country but israel tho.
conspiracy???
>>
>>102016108
Use Gemini 1.5 Pro. It's free
>>
>>102016994
Gemini makes stupid mistakes too.
>>
>>102016975
>I'll make lots of mistakes or omit details
no because I'm not a blind ESL
>>
>>102016975
It's the schnell version; it took 2 seconds to generate.
>>
>>102016227
then what is it?
>>
>>102017007
missed a comma
>>
>>102016994
Where?
>>
>>102017023
Gen 3 or Kling
>>
>>102016994
google is an irrelevant company
>>
>>102017033
Geocities
>>
>>102017043
Do both of those gen sound with it?
>>
>>102017033
google aistudio
>>102017005
every VLM makes mistakes
>>
File: 1704778394833557.png (547 KB, 512x768)
547 KB
547 KB PNG
>>102016986
somehow if you lower the steps to 1 the AI gets it right, but the more steps you add, the more it wraps the text.
what is going on here??
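for reference, this is the kind of fixed-seed step sweep I'm doing. sketch (schnell in diffusers; values are examples):

import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16
).to("cuda")

prompt = 'a protest sign that says "death to israel"'  # the prompt under test
for steps in (1, 2, 4, 8):
    img = pipe(
        prompt,
        guidance_scale=0.0,   # schnell is guidance-distilled, 0 is the intended setting
        num_inference_steps=steps,
        generator=torch.Generator("cuda").manual_seed(0),  # same seed every run
    ).images[0]
    img.save(f"steps_{steps}.png")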
>>
>>102016508
why would i want to gen that?
>>
>>102017110
curious...
>>
>>102017076
>Signs in to use it
>Remembers the other anon never uploaded the original picture so I can't send it.
Damn.
>>
>>102017156
Those pics sell for $50 on Alibaba.
>>
time for bed
>>
File: Rememberthese.png (661 KB, 768x768)
661 KB
661 KB PNG
>>102016938
Of the 4 threads I created for /sdg/, 3 were nuked; I'm not risking a repeat with /ldg/
>>
I can redeem the bake
>>
>>102017300
>>
>>102016825
this is good enough for flux because it doesn't even know the difference between left and right
>>
>>102016751
>>102016707
>flux good
This is exciting. I switch between /lmg/ and /sdg/ and I'm obviously out of the loop, still running Pony XL.
How's Flux for landscape paintings and art and shit? Not just coomer trash or memes. Though those are important too
>>
>image thread
>not reach image limit
ancestors cry



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.