/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/ldg/ - Local Diffusion Genera(...) 08/24/24(Sat)04:32:25 No.102055035

File: tmp.jpg (1.26 MB, 3264x3264)

1.26 MB JPG

/ldg/ - Local Diffusion General Anonymous 08/24/24(Sat)04:32:25 No.102055035 Archived

Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102052110

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg

Anonymous
08/24/24(Sat)04:33:09 No.102055043

Anonymous 08/24/24(Sat)04:33:09 No.102055043

Blessed thread of frenship

Anonymous
08/24/24(Sat)04:34:06 No.102055056

Anonymous 08/24/24(Sat)04:34:06 No.102055056

>>102055040
Anything that requires more than 3 seconds of focused attention is scary, weird and alien for a zoomer.

Anonymous
08/24/24(Sat)04:35:46 No.102055070

Anonymous 08/24/24(Sat)04:35:46 No.102055070

>>102055040
It's just a carry-over word from SD 1.5 where people would write descriptive sentences in a model that didn't understand it. We didn't have t5 encoding back then so it was a dumb way to do it.

Anonymous
08/24/24(Sat)04:36:57 No.102055080

Anonymous 08/24/24(Sat)04:36:57 No.102055080

>>102054997
I recommend just going to Kohya at this point AI-toolkit was good for how fast it was, but Kohya is a much cleaner, fast and memory efficient implementation.

>>102055056
Personally, my dream prompting method would be a combination of booru tags with natural language to guide the composition.

Anonymous
08/24/24(Sat)04:37:17 No.102055084

Anonymous 08/24/24(Sat)04:37:17 No.102055084

>>102055035
ty baker

Anonymous
08/24/24(Sat)04:38:54 No.102055094

Anonymous 08/24/24(Sat)04:38:54 No.102055094

Boomer prompting = "please generate a wonderful image full of whimsy of a woman standing in an empty field. make sure to include minute details that entice the viewer to look further. there is an air of mystery and mystique to the image composition"

Regular prompting = "a woman standing in an empty field"

Anonymous
08/24/24(Sat)04:39:50 No.102055103

Anonymous 08/24/24(Sat)04:39:50 No.102055103

>>102055094
You forgot the "Thank you" at the end

Anonymous
08/24/24(Sat)04:40:25 No.102055112

Anonymous 08/24/24(Sat)04:40:25 No.102055112

>computer please draw for me

Anonymous
08/24/24(Sat)04:40:38 No.102055113

Anonymous 08/24/24(Sat)04:40:38 No.102055113

>>102055094
that's not regular prompting, that's brainlet prompting, unless your goal is to have a boring sterile image

Anonymous
08/24/24(Sat)04:40:51 No.102055117

Anonymous 08/24/24(Sat)04:40:51 No.102055117

File: ComfyUI_02089_.png (1.08 MB, 768x1024)

1.08 MB PNG

Anonymous
08/24/24(Sat)04:41:06 No.102055119

Anonymous 08/24/24(Sat)04:41:06 No.102055119

>>102055103
He is rude youngster, forgive him

Anonymous
08/24/24(Sat)04:41:31 No.102055122

Anonymous 08/24/24(Sat)04:41:31 No.102055122

It is just me or does stacking LoRAs make gens take 10x longer?

Anonymous
08/24/24(Sat)04:42:11 No.102055127

Anonymous 08/24/24(Sat)04:42:11 No.102055127

>>102055056
You jest, but the average 20 year old reels at the sight of a paragraph with more than 3 lines of text. Their dopamine receptors BURN with very literal pain.

Anonymous
08/24/24(Sat)04:42:39 No.102055128

Anonymous 08/24/24(Sat)04:42:39 No.102055128

>>102055122
Shouldn't. It's possible your vram is being offloaded.

Anonymous
08/24/24(Sat)04:43:02 No.102055133

Anonymous 08/24/24(Sat)04:43:02 No.102055133

>>102055113
The intrigue comes from additional descriptors, not "pls gib pretty pretty image ty".

Anonymous
08/24/24(Sat)04:43:12 No.102055134

Anonymous 08/24/24(Sat)04:43:12 No.102055134

>>102055094
See? "Boomer" means "long and too hard to read for my zoomi brainie"

Anonymous
08/24/24(Sat)04:43:34 No.102055138

Anonymous 08/24/24(Sat)04:43:34 No.102055138

>>102055094
Well, that boomer prompt takes it to the extreme, but I've used Dall-E 3 enough to know that these types of models like bit of verbosity in the prompts. That is the type of text you get out from the vision models that handle the image tagging. But the "please generate" at the start is dumb.

Anonymous
08/24/24(Sat)04:44:11 No.102055142

Anonymous 08/24/24(Sat)04:44:11 No.102055142

File: FD_00057_.png (1.64 MB, 768x1344)

1.64 MB PNG

>>102055128
It is always, even with 16gb.

Anonymous
08/24/24(Sat)04:46:26 No.102055155

Anonymous 08/24/24(Sat)04:46:26 No.102055155

I always hated that tags didn't let me describe two characters differently or something more specific for background or setting

Anonymous
08/24/24(Sat)04:46:28 No.102055156

Anonymous 08/24/24(Sat)04:46:28 No.102055156

File: FD_00058_.png (678 KB, 768x1344)

678 KB PNG

This is coming along better than I expected, it's really struggling with her tattoos though

Anonymous
08/24/24(Sat)04:46:30 No.102055157

Anonymous 08/24/24(Sat)04:46:30 No.102055157

>>102055113
>brainlet
its brainlet to rely only on the prompt to get interesting outputs

Anonymous
08/24/24(Sat)04:47:10 No.102055160

Anonymous 08/24/24(Sat)04:47:10 No.102055160

>>102055156
He is packing

Anonymous
08/24/24(Sat)04:47:39 No.102055162

Anonymous 08/24/24(Sat)04:47:39 No.102055162

>>102055117
I just wish I would not need to see his puke face everytime I come here. Can't you start a reddit for that shit?

Anonymous
08/24/24(Sat)04:48:21 No.102055169

Anonymous 08/24/24(Sat)04:48:21 No.102055169

I've updated my llama.cpp install and want to try writing prompts, sending them to llama.cpp (with Llama3 8B or whatever) to elaborate on them, then piping the output into stable-diffusion.cpp as the prompt for Flux.
What should my instruction to the LLM be for it to spit out the kind of boomerprompt that Flux is highly responsive to?
Also, are there any LLMs that are particularly well-trained on art history and architectural details?

Anonymous
08/24/24(Sat)04:48:45 No.102055173

Anonymous 08/24/24(Sat)04:48:45 No.102055173

>>102055117
'em

Anonymous
08/24/24(Sat)04:49:38 No.102055181

Anonymous 08/24/24(Sat)04:49:38 No.102055181

>>102055122
if you're using Q quants then yes, the loras have to be constantly re-applied because they modify the dequant weights and the weights are only dequant on-the-fly, you can't apply loras in one pass like you can with the non-Q models

Anonymous
08/24/24(Sat)04:49:53 No.102055185

Anonymous 08/24/24(Sat)04:49:53 No.102055185

File: FD_00061_.png (1.06 MB, 1344x768)

1.06 MB PNG

>>102055160
Packing heat

Anonymous
08/24/24(Sat)04:51:50 No.102055197

Anonymous 08/24/24(Sat)04:51:50 No.102055197

>>102055185
Looks really good.

Anonymous
08/24/24(Sat)04:51:56 No.102055199

Anonymous 08/24/24(Sat)04:51:56 No.102055199

>>102055181
I see, that's why it takes so long between gens.

Anonymous
08/24/24(Sat)04:53:09 No.102055204

Anonymous 08/24/24(Sat)04:53:09 No.102055204

>>102055181
yea this .. it seems its just what it is, so for heavy lora usage Q's are not an option .. you could like merge the loras into default and then quant your merged model .... but rofl I guess that would take ages as prestep

Anonymous
08/24/24(Sat)04:53:18 No.102055205

Anonymous 08/24/24(Sat)04:53:18 No.102055205

>>102055113
that's the oposite of brainlet, you don't give the model much direction so it has the freedom to add nice touches on top of it

Anonymous
08/24/24(Sat)04:53:24 No.102055208

Anonymous 08/24/24(Sat)04:53:24 No.102055208

File: FD_00059_.png (841 KB, 768x1344)

841 KB PNG

>>102055197
No it doesn't, it's only eopch 6 but it's the first one that seems to understand the concept.

Anonymous
08/24/24(Sat)04:54:19 No.102055216

Anonymous 08/24/24(Sat)04:54:19 No.102055216

>>102055134
>I like to write convoluted sentences when you can have the same effect for a sentense 10 times shorter
https://www.youtube.com/watch?v=3RMAPFH75AU

Anonymous
08/24/24(Sat)04:54:28 No.102055217

Anonymous 08/24/24(Sat)04:54:28 No.102055217

>>102055208
Maybe, I haven't seen the show is a long time so I can't recall exactly what's off about the image.

Anonymous
08/24/24(Sat)04:55:10 No.102055225

Anonymous 08/24/24(Sat)04:55:10 No.102055225

>>102055204
Using a single LoRA it's fine, it only adds half a second per it to my gens, but stacking them fucks it.

Anonymous
08/24/24(Sat)04:56:11 No.102055232

Anonymous 08/24/24(Sat)04:56:11 No.102055232

>>102055217
her tattoos are wrong

Anonymous
08/24/24(Sat)04:56:21 No.102055236

Anonymous 08/24/24(Sat)04:56:21 No.102055236

>>102055204
would it be possible to have a node to quant a lora to our desired quant? could that allow it to applied directly?

Anonymous
08/24/24(Sat)04:57:09 No.102055247

Anonymous 08/24/24(Sat)04:57:09 No.102055247

>>102055012
Every optimization is a trade-off between speed, usability and support for additional features. Optimizations are becoming a hard requirement as newer models increase parameter count to compensate for their architecture and datasets.

Anonymous
08/24/24(Sat)04:58:14 No.102055257

Anonymous 08/24/24(Sat)04:58:14 No.102055257

You people are exaggerating the difference between Q8 and FP8.

Anonymous
08/24/24(Sat)04:58:45 No.102055264

Anonymous 08/24/24(Sat)04:58:45 No.102055264

>>102055247
you don't have trade off if you give more options to the user, it's not the 90's anymore when you couldn't even chose between mono and stereo in N64 games, nowdays a PC game has the option feature and a shit ton of shit to choose, as it should

Anonymous
08/24/24(Sat)04:58:54 No.102055265

Anonymous 08/24/24(Sat)04:58:54 No.102055265

I simply do not care for character loras. Styles and concepts only, please.

Anonymous
08/24/24(Sat)04:59:01 No.102055266

Anonymous 08/24/24(Sat)04:59:01 No.102055266

>>102055080
I train under 24 gb on ai-toolkit and get about 2-3 s per it on the default flux preset. What numbers does kohya get?
I tried both and found ai-toolkit much more straightforward, but other than that they seem to be doing pretty much the same thing.

Anonymous
08/24/24(Sat)04:59:41 No.102055270

Anonymous 08/24/24(Sat)04:59:41 No.102055270

File: FD_00062_.png (1.21 MB, 1024x1536)

1.21 MB PNG

>zabecca

Anonymous
08/24/24(Sat)04:59:54 No.102055272

Anonymous 08/24/24(Sat)04:59:54 No.102055272

>>102055225
yep, that is what I see to
>>102055236
I don't think that would work, its still an operation you have to do on every weight, probably would be even slower. The quanting is a trick to make the 8bit weight closer to what it would be to the 16bit weight then by just truncating it, right?

>>102055257
there indeed are.. q8 is very close to fp16 in some gens, while fp8 fucks em up.. for some generic nature shot it doesnt matter, but especially on text etc. its kinda bad

Anonymous
08/24/24(Sat)05:00:45 No.102055275

Anonymous 08/24/24(Sat)05:00:45 No.102055275

>>102055265
Why not character lora? It makes your prompts simple when you don't have to describe every little detail to have similar character across gens.

Anonymous
08/24/24(Sat)05:01:06 No.102055277

Anonymous 08/24/24(Sat)05:01:06 No.102055277

>>102055257
Q8 and FP8 are not similar and Q8 is a much better approximation of what FP16 is compared to FP8

Anonymous
08/24/24(Sat)05:01:42 No.102055281

Anonymous 08/24/24(Sat)05:01:42 No.102055281

>>102055257
no we're not, Q8 is really close to fp16 wheras fp8 is not at all

Anonymous
08/24/24(Sat)05:02:29 No.102055285

Anonymous 08/24/24(Sat)05:02:29 No.102055285

You people really are exaggerating the difference between Q8 and FP8.

Anonymous
08/24/24(Sat)05:02:40 No.102055288

Anonymous 08/24/24(Sat)05:02:40 No.102055288

File: fp16-vs-q8-vs-fp8.jpg (740 KB, 3648x1260)

740 KB JPG

>>102055257
I did this test yesterday (with half a dozen other pictures to) that demonstrates it nicely on a bit more complex prompt.

left fp16, middle q8, right fp8_e4

Anonymous
08/24/24(Sat)05:03:06 No.102055292

Anonymous 08/24/24(Sat)05:03:06 No.102055292

>>102055275
I do not care for your silly homosexual OC. Refrain from training such.

Anonymous
08/24/24(Sat)05:04:16 No.102055303

Anonymous 08/24/24(Sat)05:04:16 No.102055303

File: 1032567688.png (1.41 MB, 896x1152)

1.41 MB PNG

Anonymous
08/24/24(Sat)05:04:46 No.102055309

Anonymous 08/24/24(Sat)05:04:46 No.102055309

>>102055169
You should ask most of that to /lmg/. But it's easy to come up with a prompt just with trial and error.

Anonymous
08/24/24(Sat)05:04:59 No.102055312

Anonymous 08/24/24(Sat)05:04:59 No.102055312

>>102055133
yes, you can describe an image better in a proper way without acting like you're pleading, begging and flirting with the AI. Seen far too many people prompt basic shit and be surprised they end up generating basic shit

>>102055205
if a model is adding "nice touches" unprompted then it's a fucked up model, that is unless there's a simple, straightforward way to choose between it strictly following the prompt or being more creative, CFG is ok but not ideal

Anonymous
08/24/24(Sat)05:05:25 No.102055314

Anonymous 08/24/24(Sat)05:05:25 No.102055314

>>102055292
>OC
How does that work? I generate 20 images of some custom character and then train lora on those gens to "save" the OC character?
I would never use such loras.

Anonymous
08/24/24(Sat)05:05:28 No.102055316

Anonymous 08/24/24(Sat)05:05:28 No.102055316

>>102055288
that picture sums up nicely the feeling I had towards Q8 and fp8, Q8 gives almost the same picture as fp16 with some little difference in details, fp8 can make completely different poses and fuck up some shit that Q8 won't

Anonymous
08/24/24(Sat)05:05:59 No.102055320

Anonymous 08/24/24(Sat)05:05:59 No.102055320

>>102055264
City hasn't explained it well. You're expecting an extra option that loads the lora to gpu to solve the problem, if that's not already the case it would be a marginal improvement at best. The trade-off is that loras need to be applied on-the-fly due to quantization/dequantization.

Anonymous
08/24/24(Sat)05:06:28 No.102055326

Anonymous 08/24/24(Sat)05:06:28 No.102055326

>>102055312
>if a model is adding "nice touches" unprompted then it's a fucked up mode
no it's not, if the model follow your simple prompt and you never specified to not add more stuff to it, that's your fault, not the model

Anonymous
08/24/24(Sat)05:07:29 No.102055337

Anonymous 08/24/24(Sat)05:07:29 No.102055337

>>102055320
>if that's not already the case it would be a marginal improvement at best.
we would never know if we don't try, I don't like suppositions, I like to see it with my own eyes

Anonymous
08/24/24(Sat)05:08:23 No.102055343

Anonymous 08/24/24(Sat)05:08:23 No.102055343

>>102055337
then start coding

Anonymous
08/24/24(Sat)05:09:18 No.102055349

Anonymous 08/24/24(Sat)05:09:18 No.102055349

>>102055343
it's not my repo, it's yours city, if you're a lazy ass and you don't want people to enjoy decent speed on loras + gguf quants, just say it, don't waste our time by finding excuses

Anonymous
08/24/24(Sat)05:10:03 No.102055358

Anonymous 08/24/24(Sat)05:10:03 No.102055358

>>102055326
>a woman standing in an empty field, DO NOT ADD GRASS, DO NOT ADD ANY KIND OF VEGETATION, THERE IS NO SKY AND NO CLOUDS, THERE IS NO SUN, ONLY BARREN GROUND
can't wait to prompt like this, it will be truly the model of all models

Anonymous
08/24/24(Sat)05:10:06 No.102055359

Anonymous 08/24/24(Sat)05:10:06 No.102055359

>>102055349
I accept your concession, codelet.

Anonymous
08/24/24(Sat)05:10:40 No.102055365

Anonymous 08/24/24(Sat)05:10:40 No.102055365

>>102055303
Your LoRA got fucked.

Anonymous
08/24/24(Sat)05:10:55 No.102055368

Anonymous 08/24/24(Sat)05:10:55 No.102055368

>>102055359
>codelet
says the man who won't do anything either, and therefore can be categorized as a codelet too, ironic.

Anonymous
08/24/24(Sat)05:11:08 No.102055371

Anonymous 08/24/24(Sat)05:11:08 No.102055371

File: lewd.jpg (219 KB, 1024x1536)

219 KB JPG

I should have put more nudes in the data set. V2 maybe

Anonymous
08/24/24(Sat)05:11:14 No.102055372

Anonymous 08/24/24(Sat)05:11:14 No.102055372

do llms have the same struggle with loras? i doubt anyone runs a non quant llm

Anonymous
08/24/24(Sat)05:11:27 No.102055374

Anonymous 08/24/24(Sat)05:11:27 No.102055374

shits too complicated now take me back to 1.5

Anonymous
08/24/24(Sat)05:11:45 No.102055380

Anonymous 08/24/24(Sat)05:11:45 No.102055380

/sdg/ is now just full of women and troons. I wonder if any of them are hot.

Anonymous
08/24/24(Sat)05:11:56 No.102055384

Anonymous 08/24/24(Sat)05:11:56 No.102055384

>>102055358
>what is a negative prompt?

Anonymous
08/24/24(Sat)05:12:10 No.102055386

Anonymous 08/24/24(Sat)05:12:10 No.102055386

>>102055371
So uh... what were your LoRA settings?

Anonymous
08/24/24(Sat)05:12:54 No.102055394

Anonymous 08/24/24(Sat)05:12:54 No.102055394

>>102055380
I am not a woman. I'm a tranny

Anonymous
08/24/24(Sat)05:12:56 No.102055395

Anonymous 08/24/24(Sat)05:12:56 No.102055395

>>102055380
wait you'd fuck troons? that's kinda gay if you ask me

Anonymous
08/24/24(Sat)05:13:02 No.102055396

Anonymous 08/24/24(Sat)05:13:02 No.102055396

>>102055380
>I wonder if any of them are hot.
Only the troons will care to look feminine and pretty.

Anonymous
08/24/24(Sat)05:13:10 No.102055399

Anonymous 08/24/24(Sat)05:13:10 No.102055399

>>102055374
Shit has never been so good, what are you talking about?

Anonymous
08/24/24(Sat)05:13:23 No.102055400

Anonymous 08/24/24(Sat)05:13:23 No.102055400

>>102055386
Whatever civit set, but with repeats and epochs set to 10 and size set to 1024

Anonymous
08/24/24(Sat)05:13:57 No.102055406

Anonymous 08/24/24(Sat)05:13:57 No.102055406

>>102055399
this, if the price to pay is a little more complex setup to get kino pictures, I'm all for it

Anonymous
08/24/24(Sat)05:15:05 No.102055417

Anonymous 08/24/24(Sat)05:15:05 No.102055417

File: bildbeschreibungen.png (1.07 MB, 1280x2911)

1.07 MB PNG

Screenshotted Anon's research from a few days ago. Great work.
If you're still around, what prompt did you use to ask these LLMs for the description? I noticed that simply “describe this image in detail” isn't enough if you want more information about the artstyle.

Anonymous
08/24/24(Sat)05:15:36 No.102055421

Anonymous 08/24/24(Sat)05:15:36 No.102055421

>>102055384
>pos: woman standing in an empty field
>neg: tree, fence, bench, picnic blanket, bicycle, dog, bird, cloud, barn, haystack, flowers, path, windmill, kite, distant mountains, farmhouse, grass, shrubs, butterfly, rocking chair, watering can, lamp post, puddle, shovel, scarecrow, street sign, abandoned car, bicycle, gazebo, mailbox, apple tree, stone wall, playground equipment, tent, lantern, birdhouse, deer, rustic gate, garden gnome, wooden crate, picnic basket

Anonymous
08/24/24(Sat)05:15:39 No.102055422

Anonymous 08/24/24(Sat)05:15:39 No.102055422

File: 176592752.png (1.33 MB, 896x1152)

1.33 MB PNG

>>102055365
How so?

Anonymous
08/24/24(Sat)05:15:57 No.102055425

Anonymous 08/24/24(Sat)05:15:57 No.102055425

>>102055374
Funny thing. Yesterday I went to look at the old 1.4 and 1.5 gens folder. Even the NAI finetune shit looks so horrible in comparison. I'm glad it's over.
I can't believe that I thought the shit was good looking.

Anonymous
08/24/24(Sat)05:16:09 No.102055428

Anonymous 08/24/24(Sat)05:16:09 No.102055428

>>102055384
>what is doubling the gen time?

Anonymous
08/24/24(Sat)05:16:25 No.102055431

Anonymous 08/24/24(Sat)05:16:25 No.102055431

>>102055421
this is unironically how i prompt desu

Anonymous
08/24/24(Sat)05:16:28 No.102055432

Anonymous 08/24/24(Sat)05:16:28 No.102055432

>>102055421
find something better than that, I'm all for having as little words as possible on my prompt

Anonymous
08/24/24(Sat)05:16:40 No.102055433

Anonymous 08/24/24(Sat)05:16:40 No.102055433

>>102055372
vector databases/RAG/3rd things are all hot topics. Thebloke has hundreds of LLM GGUFs.

>>102055406
you aren't though. The upgrade is text. It will be better, it is still base model days.

Anonymous
08/24/24(Sat)05:17:35 No.102055441

Anonymous 08/24/24(Sat)05:17:35 No.102055441

>>102055428
the thing is, CFG was created to improve adherance prompt, not just to have a negative prompt, so that double gen time is worth it if you want Flux to actually follow your prompts for good

Anonymous
08/24/24(Sat)05:17:51 No.102055443

Anonymous 08/24/24(Sat)05:17:51 No.102055443

do you think it is possible to teach the model to handle either a "negative" section in the prompt or a separate conditioning and avoid having to sample twice to get a usable negative prompt?

Anonymous
08/24/24(Sat)05:17:52 No.102055444

Anonymous 08/24/24(Sat)05:17:52 No.102055444

File: flux_cyber-env11.jpg (3.03 MB, 2080x2720)

3.03 MB JPG

Anonymous
08/24/24(Sat)05:18:41 No.102055450

Anonymous 08/24/24(Sat)05:18:41 No.102055450

File: FD_00071_.png (1.8 MB, 1024x1536)

1.8 MB PNG

Anonymous
08/24/24(Sat)05:18:42 No.102055451

Anonymous 08/24/24(Sat)05:18:42 No.102055451

>>102055443
i pray to the lord everyday for this to be true

Anonymous
08/24/24(Sat)05:19:26 No.102055454

Anonymous 08/24/24(Sat)05:19:26 No.102055454

>>102055396
>Only the troons will care to look feminine and pretty.
In waht world do you live

Anonymous
08/24/24(Sat)05:19:38 No.102055458

Anonymous 08/24/24(Sat)05:19:38 No.102055458

>>102055443
a lot of researsh has been done to get rid of CFG, but it turns out those solutions were even slower, like PerpNeg which is 3x slower than CFG = 1 kek

Anonymous
08/24/24(Sat)05:19:47 No.102055462

Anonymous 08/24/24(Sat)05:19:47 No.102055462

>>102055433
>still base model days
I don't see Flux picking up in the way SDXL did. Everyone seems to go through the same phase of being wowed by easy kino, hitting its thematic limitations and 1girl-itis, making a lora, realizing xl is still better for what they want (usually lewds) and abandoning Flux. Plus it's higher requirements.

Screenshot this post. In a year's time, Flux D 1 will be dead.

Anonymous
08/24/24(Sat)05:20:53 No.102055472

Anonymous 08/24/24(Sat)05:20:53 No.102055472

>>102055462
Flux is doing much better right now than SDXL did its first couple weeks. Flux might be replaced in a year, but its heyday is far from today.

Anonymous
08/24/24(Sat)05:21:11 No.102055474

Anonymous 08/24/24(Sat)05:21:11 No.102055474

>>102055417
nta, atleast for JoyCaption I can give you the answer: none .. you just paste the picture and it spits out the prompt

Anonymous
08/24/24(Sat)05:22:05 No.102055478

Anonymous 08/24/24(Sat)05:22:05 No.102055478

>>102055094
>>102055138
dalle is funny with what triggers it sometimes in it's own revised prompts

>Enter prompt:
>a japanese woman laying in the grass!
>Private Client
>Your request was rejected as a result of our safety system. The revised prompt may contain text that is not >allowed by our safety system.
>Revised prompt rejected.
>A Japanese woman with traditional kimono is lying leisurely in a lush, green grass field, basking under the >calming sun. Her eyes are closed, and she breathes in the serene nature around her. In her vicinity, blooming >flowers of varying colors add to the overall tranquility of the setting, while majestic trees provide a slight shade >from the afternoon sun. Her relaxed pose conveys the tranquility of the environment, making for a peaceful and >serene scene.

Anonymous
08/24/24(Sat)05:22:39 No.102055482

Anonymous 08/24/24(Sat)05:22:39 No.102055482

There is no paper on flux, but if its at all similar process to Dall-E 3, then it would do good to have bit more verbose prompt.

Read the Dall-E 3 paper and look at all the examples and training examples. They used like 90% of AI tagging and the vision model was very verbose.
I think same applies to flux and what works with Dall-E 3, will work with flux.
As far as I know, only Bing implementation of Dalle does not auto force prompt rewriting by AI. ChatGPT always rewrites, which is ass.

Anonymous
08/24/24(Sat)05:23:46 No.102055490

Anonymous 08/24/24(Sat)05:23:46 No.102055490

>>102055472
video card tech has moved on. It would have done this well if 16GB wasn't so common.

Anonymous
08/24/24(Sat)05:23:54 No.102055492

Anonymous 08/24/24(Sat)05:23:54 No.102055492

>>102055478
>Enter prompt:
>a man who doesn't know how to green text
>Revised prompt:
>An image of a confused middle-aged South Asian man, sitting at a computer. The screen of the computer is focused on a text-based application filled with neatly organized rows of text, but one line of text remains visibly black while the others are highlighted in green, indicating his struggle to use the 'green text' feature.

Anonymous
08/24/24(Sat)05:24:14 No.102055495

Anonymous 08/24/24(Sat)05:24:14 No.102055495

>>102055417
I think his challenge was to check how these image descriptors fare without any further hints

Anonymous
08/24/24(Sat)05:25:29 No.102055508

Anonymous 08/24/24(Sat)05:25:29 No.102055508

>>102055454
the real one, unfortunately

Anonymous
08/24/24(Sat)05:25:55 No.102055511

Anonymous 08/24/24(Sat)05:25:55 No.102055511

>>102055458
PerpNeg is just a different way to calculate CFG, you're still doing two passes.
I'm talking about building the ability to handle negative prompts during training.

Anonymous
08/24/24(Sat)05:26:21 No.102055515

Anonymous 08/24/24(Sat)05:26:21 No.102055515

>>102055508
>the real one, unfortunately
I felt that

Anonymous
08/24/24(Sat)05:26:35 No.102055518

Anonymous 08/24/24(Sat)05:26:35 No.102055518

File: f.png (155 KB, 832x1152)

155 KB PNG

Anonymous
08/24/24(Sat)05:27:06 No.102055524

Anonymous 08/24/24(Sat)05:27:06 No.102055524

>>102055482
you're way late, anon, we figured out Flux was trained on LLM slopa on day one

Anonymous
08/24/24(Sat)05:27:36 No.102055525

Anonymous 08/24/24(Sat)05:27:36 No.102055525

>>102055511
>I'm talking about building the ability to handle negative prompts during training.
that's a good question, but even if it's possible, higher CFG always improve prompt adherance, so it's not like CFG will magically becomes useless if there was a way to get negative prompt at CFG = 1

Anonymous
08/24/24(Sat)05:28:37 No.102055529

Anonymous 08/24/24(Sat)05:28:37 No.102055529

>>102055524
>we figured out Flux was trained on LLM slopa on day one
I mean that's obvious no? Pretraining an image model requires billions of pictures, who's gonna do that by hand?

Anonymous
08/24/24(Sat)05:28:45 No.102055530

Anonymous 08/24/24(Sat)05:28:45 No.102055530

>>102055431
that is how I often had to prompt on 1.5

>>102055432
if we had something like a creativity slider and it was set to 0 creativity, it should only do the bare minimum of concepts you prompted, set it to maximum creativity and you'll get a bunch of fluff and random objects of interest and still end up looking great, that is where you can lock in the seed and add in things like "dog, kite" into the negs if you don't want those in particular. like I said we have CFG but it doesn't quite work like that, at least not with the models I've used, and then you've got all the side effects of your CFG being too low or too high
Flux isn't perfect but from my experience it's the best one so far at following simple prompts, if it knows the things you're prompting

Anonymous
08/24/24(Sat)05:30:05 No.102055535

Anonymous 08/24/24(Sat)05:30:05 No.102055535

>>102055530
>Flux isn't perfect but from my experience it's the best one so far at following simple prompts, if it knows the things you're prompting
100% agree with that take

Anonymous
08/24/24(Sat)05:31:16 No.102055544

Anonymous 08/24/24(Sat)05:31:16 No.102055544

File: file.png (2.21 MB, 896x1152)

2.21 MB PNG

>>102055472
Maybe, but the first few days /ldg/ maxed on pics per thread. People posted absolute kino non-stop. Nowadays all you see is arguing, some anons trying to train loras with varied success, and civitai doing an enshittification speedrun. I don't want to doom, but idk. It feels off.

Anonymous
08/24/24(Sat)05:31:53 No.102055549

Anonymous 08/24/24(Sat)05:31:53 No.102055549

>>102055544
do again, but put the rowboys into canoes

Anonymous
08/24/24(Sat)05:32:20 No.102055552

Anonymous 08/24/24(Sat)05:32:20 No.102055552

>>102055524
>LLM slopa
Though if you read the Dalle paper, you would have noticed the very important finding: AI captions improved the model significantly.

They figured out that the benefit was maximal around 90% ai and 10% human. So 90% of AI captioned images and 10% human captioned was better than 80% AI and 20% human and so on.

Read the paper. It was very significant and showed that synthetic data is relevant and can improve performance.

Anonymous
08/24/24(Sat)05:33:36 No.102055561

Anonymous 08/24/24(Sat)05:33:36 No.102055561

File: file.png (17 KB, 149x99)

17 KB PNG

>>102055035
why did ForgeUI suddenly start having TWO images in the output?
They're perfectly identical and it's not actually generating two images, it's generating just the one and then it gets shows as two identical outputs.

Wtf is going on?

Anonymous
08/24/24(Sat)05:33:45 No.102055562

Anonymous 08/24/24(Sat)05:33:45 No.102055562

File: file.png (2.31 MB, 896x1152)

2.31 MB PNG

>>102055549
>This is a movie poster for a film from 1994. The film is set in a cyberpunk dystopia, and features robots, literature scholars, and painters. The style of the poster is realistic, gritty, and intended for mature audiences, making heavy use of professional, high quality photography. The title of the film is predominantly displayed in the bottom part of the poster: "Flopx". Underneath the title, a subtitle reads: "Doom and Gloom".

Do it yourself. Here's the prompt:
>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS".

Anonymous
08/24/24(Sat)05:33:48 No.102055564

Anonymous 08/24/24(Sat)05:33:48 No.102055564

>>102055544
all we need is a good full finetune and we're back

Anonymous
08/24/24(Sat)05:33:48 No.102055565

Anonymous 08/24/24(Sat)05:33:48 No.102055565

>>102055552
>AI captions improved the model significantly.
compared to full human's captions that have a 100% accuracy caption?

Anonymous
08/24/24(Sat)05:34:58 No.102055573

Anonymous 08/24/24(Sat)05:34:58 No.102055573

File: file.png (2.22 MB, 1024x1024)

2.22 MB PNG

Anonymous
08/24/24(Sat)05:35:22 No.102055578

Anonymous 08/24/24(Sat)05:35:22 No.102055578

File: file.png (1.89 MB, 896x1152)

1.89 MB PNG

>>102055564
I get that, but how difficult will it be to tune this model properly, given it's size and the quality of the original dataset?
Don't get me wrong, I got my 3090 specifically to proompt all day with it, and I don't regret it. Like I said, it just feels off right now.

Anonymous
08/24/24(Sat)05:36:31 No.102055583

Anonymous 08/24/24(Sat)05:36:31 No.102055583

>>102055565
Read the paper on the matter.

https://cdn.openai.com/papers/dall-e-3.pdf

Anonymous
08/24/24(Sat)05:37:08 No.102055587

Anonymous 08/24/24(Sat)05:37:08 No.102055587

Official pixart bigma hype begins in 8 days

Anonymous
08/24/24(Sat)05:37:32 No.102055594

Anonymous 08/24/24(Sat)05:37:32 No.102055594

>>102055565
>full human's captions that have a 100% accuracy
anon, this has never been try for anything.

Anonymous
08/24/24(Sat)05:37:32 No.102055595

Anonymous 08/24/24(Sat)05:37:32 No.102055595

>>102055583
how is that possible though? no AI captions will be 100% accurate, humans can make perfect captions though, so I suspect they hired shit pajeet captionners or some shit

Anonymous
08/24/24(Sat)05:37:36 No.102055596

Anonymous 08/24/24(Sat)05:37:36 No.102055596

File: file.png (1.98 MB, 896x1152)

1.98 MB PNG

By the way, does anybody else use ipndm/sgm_uniform? I find it gives excellent results.

Anonymous
08/24/24(Sat)05:38:04 No.102055601

Anonymous 08/24/24(Sat)05:38:04 No.102055601

File: 2024-08-24_00158_.png (1.52 MB, 720x1280)

1.52 MB PNG

>>102055562
Thanks. Changed it abit
>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS". The cowboys on the movie poster are in canoes that are on a wild river. They seem joyful and full of energy as they paddle thru the water. On the side of the river robots with revolvers shoot on the cowboys.

Anonymous
08/24/24(Sat)05:38:33 No.102055604

Anonymous 08/24/24(Sat)05:38:33 No.102055604

>>102055596
>ipndm
this is twice as slow as euler though?

Anonymous
08/24/24(Sat)05:38:47 No.102055605

Anonymous 08/24/24(Sat)05:38:47 No.102055605

File: file.png (2.17 MB, 896x1152)

2.17 MB PNG

>>102055601
Fucking kino

Anonymous
08/24/24(Sat)05:39:40 No.102055610

Anonymous 08/24/24(Sat)05:39:40 No.102055610

>>102055601
>robots are shooting each other instead of the rowing cowboys
SO CLOSE

Anonymous
08/24/24(Sat)05:40:50 No.102055618

Anonymous 08/24/24(Sat)05:40:50 No.102055618

File: file.png (2.09 MB, 896x1152)

2.09 MB PNG

>>102055604
Really? Pic related is the same as >>102055605
with euler. It took the same in both cases. (and they look the same damn)

Anonymous
08/24/24(Sat)05:40:56 No.102055620

Anonymous 08/24/24(Sat)05:40:56 No.102055620

>>102055508
Apparantly not

Anonymous
08/24/24(Sat)05:42:02 No.102055630

Anonymous 08/24/24(Sat)05:42:02 No.102055630

File: 00002-2048854268.jpg (222 KB, 1192x880)

222 KB JPG

Anonymous
08/24/24(Sat)05:42:15 No.102055631

Anonymous 08/24/24(Sat)05:42:15 No.102055631

How come there is paper(s) on Dalle3 but not flux, even though flux is the "open"?

Anonymous
08/24/24(Sat)05:42:23 No.102055633

Anonymous 08/24/24(Sat)05:42:23 No.102055633

File: 00105-2221437000.png (1.48 MB, 1440x1248)

1.48 MB PNG

Anonymous
08/24/24(Sat)05:43:33 No.102055640

Anonymous 08/24/24(Sat)05:43:33 No.102055640

>>102055631
maybe they found a secret sauce and they don't want to share it to everyone, and I won't pretend that OpenAI's paper is geuine, maybe they misled us with some exagerations or some lies

Anonymous
08/24/24(Sat)05:43:52 No.102055645

Anonymous 08/24/24(Sat)05:43:52 No.102055645

File: file.png (1.8 MB, 896x1152)

1.8 MB PNG

>>102055618
In some cases it looks very different
(>>102055578)

Anonymous
08/24/24(Sat)05:44:24 No.102055650

Anonymous 08/24/24(Sat)05:44:24 No.102055650

File: t.png (436 KB, 832x1152)

436 KB PNG

Anonymous
08/24/24(Sat)05:45:21 No.102055664

Anonymous 08/24/24(Sat)05:45:21 No.102055664

File: igx182w.jpg (521 KB, 1600x1600)

521 KB JPG

Anonymous
08/24/24(Sat)05:46:22 No.102055680

Anonymous 08/24/24(Sat)05:46:22 No.102055680

>>102055664
a studio ghibli style lora?

Anonymous
08/24/24(Sat)05:46:31 No.102055682

Anonymous 08/24/24(Sat)05:46:31 No.102055682

>>102055664
Niiice

Anonymous
08/24/24(Sat)05:46:38 No.102055683

Anonymous 08/24/24(Sat)05:46:38 No.102055683

>>102055664
Very miazaki

Anonymous
08/24/24(Sat)05:46:46 No.102055686

Anonymous 08/24/24(Sat)05:46:46 No.102055686

File: 2024-08-24_00162_.png (1.59 MB, 832x1216)

1.59 MB PNG

>>102055610
ya I'm not native English .. I guess it needs to be "are shooting at" not "shoot on" , I noticed t5 can be a fucking grammar nazi sometimes, for example describe something with an animal and don't call it "it" .. instead call it he or she and it just doesnt get.

>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS". The cowboys on the movie poster are in canoes that are on a wild river. They seem joyful and full of energy as they paddle through the water. Am Flussufer stehen Roboter die auf die Cowboys schießen.

Anonymous
08/24/24(Sat)05:46:59 No.102055690

Anonymous 08/24/24(Sat)05:46:59 No.102055690

>>102055640
I'm gonna level with you.
With "secret sauce", I would imagine model of this size to be 10x better than SDXL.

I think it all comes from scale and better captioning and using literal LLM instead of clip. No secret.

Anonymous
08/24/24(Sat)05:47:21 No.102055694

Anonymous 08/24/24(Sat)05:47:21 No.102055694

File: ifx183.png (1.38 MB, 1024x1024)

1.38 MB PNG

>>102055680
that's just ImageFX

Anonymous
08/24/24(Sat)05:47:28 No.102055695

Anonymous 08/24/24(Sat)05:47:28 No.102055695

File: file.png (1.83 MB, 1024x1024)

1.83 MB PNG

https://civitai.com/models/677362/mario-strikers-style-flux?modelVersionId=758237

Anonymous
08/24/24(Sat)05:47:52 No.102055700

Anonymous 08/24/24(Sat)05:47:52 No.102055700

File: file.png (2.42 MB, 896x1152)

2.42 MB PNG

>>102055686
I like the title font on that one.
>This is a movie poster for a Steven Spielberg film from 1994. The film is set in the wild west, and features robots and cowboys. The cowboys on the movie poster paddle in canoes down the rapids of a wild river. They look joyful and full of energy as they paddle intensely through the water. On the banks of the river, robots with revolvers shoot at the cowboys from the distance.
>The artwork on the poster uses a painterly style that highlights the adventurous tone of the film, reminiscent of Lucas Films.
>The title of the film is predominantly displayed in the bottom part of the poster: "ROWBOYS".
>Underneath the title, a subtitle can be read: "Paddling of destiny".

Anonymous
08/24/24(Sat)05:48:00 No.102055703

Anonymous 08/24/24(Sat)05:48:00 No.102055703

>fooled by SaaS gen again

Anonymous
08/24/24(Sat)05:48:22 No.102055706

Anonymous 08/24/24(Sat)05:48:22 No.102055706

>>102055695
>https://civitai.com/models/677362/mario-strikers-style-flux?modelVersionId=758237
funny, looks like "The World Ends With You" artwork .. is Mario Strikers and TWEWY the same artist?

Anonymous
08/24/24(Sat)05:48:34 No.102055707

Anonymous 08/24/24(Sat)05:48:34 No.102055707

>>102055690
>I would imagine model of this size to be 10x better than SDXL.
1) SDXL is a 3.5b model, so it's a 3x scale, not 10x
2) SD3-8b isn't even close to Flux even though it's size is quite close

Anonymous
08/24/24(Sat)05:48:56 No.102055710

Anonymous 08/24/24(Sat)05:48:56 No.102055710

File: file.png (2.98 MB, 960x1280)

2.98 MB PNG

g2g catch you later boys
>steven spoilberg
lmao

Anonymous
08/24/24(Sat)05:49:46 No.102055723

Anonymous 08/24/24(Sat)05:49:46 No.102055723

>>102055664
>when even the most cucked GAFAM of them all (google) has miyazaki style in its model, you know that Flux did something really wrong

Anonymous
08/24/24(Sat)05:49:50 No.102055725

Anonymous 08/24/24(Sat)05:49:50 No.102055725

>>102055700
Awesome!

Anonymous
08/24/24(Sat)05:50:25 No.102055730

Anonymous 08/24/24(Sat)05:50:25 No.102055730

>>102055723
>GAFAM
?

Anonymous
08/24/24(Sat)05:50:46 No.102055734

Anonymous 08/24/24(Sat)05:50:46 No.102055734

>>102055710
>spoilberg
he spoiled us this new "Row Boys" movies before the official announcement :'(

Anonymous
08/24/24(Sat)05:50:48 No.102055735

Anonymous 08/24/24(Sat)05:50:48 No.102055735

>>102055700
>that cooperative 1-hand each paddling on the front boat

Anonymous
08/24/24(Sat)05:50:51 No.102055736

Anonymous 08/24/24(Sat)05:50:51 No.102055736

File: 4step_up_00022_.png (3.52 MB, 1536x1536)

3.52 MB PNG

Anonymous
08/24/24(Sat)05:51:05 No.102055738

Anonymous 08/24/24(Sat)05:51:05 No.102055738

File: images.jpg (12 KB, 296x170)

12 KB JPG

>>102055664
>STUDIO GHIBLI STYLE

Anonymous
08/24/24(Sat)05:51:07 No.102055740

Anonymous 08/24/24(Sat)05:51:07 No.102055740

>>102055707
That is the point. It should be better than just basic scaling.

Anonymous
08/24/24(Sat)05:51:23 No.102055743

Anonymous 08/24/24(Sat)05:51:23 No.102055743

>>102055552
>Though if you read the Dalle paper, you would have noticed the very important finding: AI captions improved the model significantly.
Yes but OpenAI has a much better caption model.

Anonymous
08/24/24(Sat)05:51:48 No.102055748

Anonymous 08/24/24(Sat)05:51:48 No.102055748

>>102055730
https://en.wikipedia.org/wiki/Big_Tech
>Alphabet, Amazon, Apple, Meta, and Microsoft are known as the Big Five tech companies. They were known as GAFAM before Facebook changed its name to Meta in 2021
Alphabet is Google

Anonymous
08/24/24(Sat)05:52:17 No.102055754

Anonymous 08/24/24(Sat)05:52:17 No.102055754

File: FD_00073_.png (1.39 MB, 704x1472)

1.39 MB PNG

Still struggling with the tat

Anonymous
08/24/24(Sat)05:52:49 No.102055761

Anonymous 08/24/24(Sat)05:52:49 No.102055761

File: fp123.jpg (204 KB, 1024x1024)

204 KB JPG

>>102055738

Anonymous
08/24/24(Sat)05:52:56 No.102055762

Anonymous 08/24/24(Sat)05:52:56 No.102055762

>>102055740
yeah, SAI fucked it up big, they scaled up their model up to 8b and didn't have the level of Flux, that's a huge failure

Anonymous
08/24/24(Sat)05:53:57 No.102055765

Anonymous 08/24/24(Sat)05:53:57 No.102055765

>>102055743
>Yes but OpenAI has a much better caption model.
maybe Flux also used GPT4V caption model

Anonymous
08/24/24(Sat)05:54:55 No.102055767

Anonymous 08/24/24(Sat)05:54:55 No.102055767

>>102055765
we don't know if DALL-E 3 used GPT-4V

Anonymous
08/24/24(Sat)05:55:17 No.102055771

Anonymous 08/24/24(Sat)05:55:17 No.102055771

>>102055743
I have not researched that. DE3 was trained early 2023 and the research done maybe late 2022.

Are you sure they had access to better vision models than current SOTA open source? I really don't know, but I think they have improved quite a bit.

Anonymous
08/24/24(Sat)05:57:18 No.102055787

Anonymous 08/24/24(Sat)05:57:18 No.102055787

>>102055771
>Are you sure they had access to better vision models than current SOTA open source?
Most likely, they trained their own with their own image caption pair data set made by humans.
It's in the paper.
is current SOTA using high volume quality human made captions?

Anonymous
08/24/24(Sat)05:59:22 No.102055801

Anonymous 08/24/24(Sat)05:59:22 No.102055801

>>102055748
I'll see myself out.

Anonymous
08/24/24(Sat)05:59:30 No.102055804

Anonymous 08/24/24(Sat)05:59:30 No.102055804

File: 4step_up_00023_.png (3.65 MB, 1536x1536)

3.65 MB PNG

>>102055686
T5 knows German and French too, prompt does not have to be in english

Anonymous
08/24/24(Sat)05:59:59 No.102055811

Anonymous 08/24/24(Sat)05:59:59 No.102055811

>>102055787
I don't know. Last time I checked, the model from Meta was best, or maybe some finetune of it.

Anonymous
08/24/24(Sat)06:03:03 No.102055836

Anonymous 08/24/24(Sat)06:03:03 No.102055836

File: file.png (1.86 MB, 1024x1024)

1.86 MB PNG

>>102055695
Really impressive Lora

Anonymous
08/24/24(Sat)06:04:13 No.102055847

Anonymous 08/24/24(Sat)06:04:13 No.102055847

How "locked" are peoples PC's when using kohya and making Loras? I have a 13500 and 16GB card and ddr5 and would like to be able to browse and shit while the lora is being made.
Are there any settings to limit GPU/CPU use during training, or even, other tools run locally?

Anonymous
08/24/24(Sat)06:04:14 No.102055849

Anonymous 08/24/24(Sat)06:04:14 No.102055849

File: e.png (829 KB, 832x1152)

829 KB PNG

Anonymous
08/24/24(Sat)06:06:40 No.102055862

Anonymous 08/24/24(Sat)06:06:40 No.102055862

>>102055804
Yes I know, but it tends to give different results if you prompt in German. Also I think more in English than in German these days.. especially when working with AI.

Anonymous
08/24/24(Sat)06:07:32 No.102055870

Anonymous 08/24/24(Sat)06:07:32 No.102055870

File: 10205503545344.png (2.17 MB, 1024x1024)

2.17 MB PNG

>>102055035

Anonymous
08/24/24(Sat)06:08:01 No.102055873

Anonymous 08/24/24(Sat)06:08:01 No.102055873

Making some really nasty stuff atm

Anonymous
08/24/24(Sat)06:11:30 No.102055901

Anonymous 08/24/24(Sat)06:11:30 No.102055901

GIVE ME THE LORA

Anonymous
08/24/24(Sat)06:11:49 No.102055905

Anonymous 08/24/24(Sat)06:11:49 No.102055905

Is there an option in Forge to keep the text encoding on a separate GPU?

Anonymous
08/24/24(Sat)06:12:32 No.102055915

Anonymous 08/24/24(Sat)06:12:32 No.102055915

>>102055901
>>102055117
Meant for this guy lol

Anonymous
08/24/24(Sat)06:12:38 No.102055917

Anonymous 08/24/24(Sat)06:12:38 No.102055917

>>102055905
nope, that's why I'm stuck on comfyUi

Anonymous
08/24/24(Sat)06:12:55 No.102055919

Anonymous 08/24/24(Sat)06:12:55 No.102055919

>>102055915
Just sub to his patreon, it's only $5

Anonymous
08/24/24(Sat)06:17:58 No.102055954

Anonymous 08/24/24(Sat)06:17:58 No.102055954

>>102055901
>>102055915
he just annoys us here posting pics of the grifter not sharing the lora.. by now I think its actually him doing viral marketing

Anonymous
08/24/24(Sat)06:18:03 No.102055955

Anonymous 08/24/24(Sat)06:18:03 No.102055955

File: Capture.png (596 B, 96x26)

596 B PNG

Now were cooking with CFG

Anonymous
08/24/24(Sat)06:18:45 No.102055957

Anonymous 08/24/24(Sat)06:18:45 No.102055957

File: Screenshot 2024-08-24 221831.png (8 KB, 311x112)

8 KB PNG

>>102055955
same

Anonymous
08/24/24(Sat)06:21:00 No.102055975

Anonymous 08/24/24(Sat)06:21:00 No.102055975

File: file.png (1.89 MB, 1024x1280)

1.89 MB PNG

>>102054020
>balthus
what an interesting artist
https://resources.metmuseum.org/resources/metpublications/pdf/Balthuss_Thereses_The_Metropolitan_Museum_Journal_v_33_1998.pdf

Anonymous
08/24/24(Sat)06:23:06 No.102055995

Anonymous 08/24/24(Sat)06:23:06 No.102055995

>>102055917
That's what I figured, no need to wedge this 2080 Ti in then, thanks anon.

Anonymous
08/24/24(Sat)06:23:14 No.102055996

Anonymous 08/24/24(Sat)06:23:14 No.102055996

File: bComfyUI_108758_.jpg (858 KB, 2048x1088)

858 KB JPG

Anonymous
08/24/24(Sat)06:23:17 No.102055997

Anonymous 08/24/24(Sat)06:23:17 No.102055997

File: FD_00074_.png (1.33 MB, 768x1344)

1.33 MB PNG

LoRA done. Once I get enough civit sheckles I will re-do it. with more nudes. Will post it soon.

Anonymous
08/24/24(Sat)06:26:23 No.102056023

Anonymous 08/24/24(Sat)06:26:23 No.102056023

>>102055975
>Balthus even had the Rola arms embroidered onto many of his kimono, in the style of a Japanese kamon.
it's always the anime fans

Anonymous
08/24/24(Sat)06:30:18 No.102056053

Anonymous 08/24/24(Sat)06:30:18 No.102056053

File: 1719318662886651.png (551 KB, 512x512)

551 KB PNG

Anonymous
08/24/24(Sat)06:35:39 No.102056091

Anonymous 08/24/24(Sat)06:35:39 No.102056091

is there a node that makes a sound when the gen is over? don't give me this, this shit also changes the whole Ui, I hate that
https://github.com/pythongosssss/ComfyUI-Custom-Scripts?tab=readme-ov-file#play-sound

Anonymous
08/24/24(Sat)06:35:43 No.102056092

Anonymous 08/24/24(Sat)06:35:43 No.102056092

File: 4step_up_00024_.png (3.47 MB, 1536x1536)

3.47 MB PNG

Anonymous
08/24/24(Sat)06:36:45 No.102056101

Anonymous 08/24/24(Sat)06:36:45 No.102056101

File: file.png (2.17 MB, 1024x1024)

2.17 MB PNG

https://www.youtube.com/watch?v=FuJBwu_03r8

Anonymous
08/24/24(Sat)06:36:58 No.102056104

Anonymous 08/24/24(Sat)06:36:58 No.102056104

File: 4step_up_00025_.png (3.65 MB, 1536x1536)

3.65 MB PNG

>>102056092
I like the font on this one

Anonymous
08/24/24(Sat)06:38:48 No.102056126

Anonymous 08/24/24(Sat)06:38:48 No.102056126

File: bComfyUI_108785_.jpg (741 KB, 2048x1088)

741 KB JPG

Anonymous
08/24/24(Sat)06:38:50 No.102056127

Anonymous 08/24/24(Sat)06:38:50 No.102056127

File: a.png (1.32 MB, 832x1152)

1.32 MB PNG

Anonymous
08/24/24(Sat)06:40:22 No.102056142

Anonymous 08/24/24(Sat)06:40:22 No.102056142

>>102055748
I prefer FAGMA

Anonymous
08/24/24(Sat)06:40:37 No.102056146

Anonymous 08/24/24(Sat)06:40:37 No.102056146

File: 00115-3221897838.png (1.18 MB, 1440x1248)

1.18 MB PNG

Anonymous
08/24/24(Sat)06:41:23 No.102056153

Anonymous 08/24/24(Sat)06:41:23 No.102056153

>>102056053
the nose knows

Anonymous
08/24/24(Sat)06:42:55 No.102056169

Anonymous 08/24/24(Sat)06:42:55 No.102056169

>>102055847
I use ai-toolkit, but to answer your question I think it doesn’t matter. My 3090 is left with just enough memory to browse with one tab open and I see graphical glitches from time to time. Also I have to close stuff before starting or else it won’t. This on Linux and KDE, which is significantly lighter than Windows.

Anonymous
08/24/24(Sat)06:43:48 No.102056174

Anonymous 08/24/24(Sat)06:43:48 No.102056174

File: file.png (2.13 MB, 1024x1024)

2.13 MB PNG

>>102055695
>striker0s, Mario Striker art style,
>Hatsune Miku as a sleek, robotic samurai in chrome armor is slicing through waves of sushi rolls flying through the air. Each slice sends colorful sparks flying. Behind her, a giant koi fish swims through the sky as if it were water, creating ripples of light, text at the bottom: "Sushi Master."
Holy fuck it nailed that shit perfectly

Anonymous
08/24/24(Sat)06:45:05 No.102056186

Anonymous 08/24/24(Sat)06:45:05 No.102056186

>>102056146
Young scully lewd lora when?

Anonymous
08/24/24(Sat)06:47:07 No.102056200

Anonymous 08/24/24(Sat)06:47:07 No.102056200

>>102055578
Moreover there’s the licensing issue. Flux is not going to take off like SDXL did. This is as far as it’s going to get unless some autist is autistic enough to devote a ton of money and time for a proper tune.

Anonymous
08/24/24(Sat)06:53:37 No.102056249

Anonymous 08/24/24(Sat)06:53:37 No.102056249

File: file.png (2.34 MB, 1024x1024)

2.34 MB PNG

>>102055695

Anonymous
08/24/24(Sat)07:06:52 No.102056337

Anonymous 08/24/24(Sat)07:06:52 No.102056337

File: 4step_up_00028_.png (3.48 MB, 1536x1536)

3.48 MB PNG

>>102056174

Anonymous
08/24/24(Sat)07:08:34 No.102056351

Anonymous 08/24/24(Sat)07:08:34 No.102056351

File: 00124-3362575104.png (975 KB, 1360x1024)

975 KB PNG

>>102056186
When there's a porn finetune probably

Anonymous
08/24/24(Sat)07:09:11 No.102056353

Anonymous 08/24/24(Sat)07:09:11 No.102056353

>>102056337
nice

Anonymous
08/24/24(Sat)07:09:36 No.102056358

Anonymous 08/24/24(Sat)07:09:36 No.102056358

What learning rate are you all using for your LoRA training?

Anonymous
08/24/24(Sat)07:11:12 No.102056370

Anonymous 08/24/24(Sat)07:11:12 No.102056370

>>102056200
Licensing is something that's only really being screeched about by a few big names. Let them die. Flux will take over when people learn to game the system again.
It's like everyone just forgot every model was merged with stolen NAI weights for like a year.

Anonymous
08/24/24(Sat)07:19:47 No.102056433

Anonymous 08/24/24(Sat)07:19:47 No.102056433

File: ComfyUI_05485_.png (1.59 MB, 1024x1024)

1.59 MB PNG

Anonymous
08/24/24(Sat)07:23:13 No.102056461

Anonymous 08/24/24(Sat)07:23:13 No.102056461

File: file.png (2.26 MB, 1024x1024)

2.26 MB PNG

>>102056433

Anonymous
08/24/24(Sat)07:23:42 No.102056467

Anonymous 08/24/24(Sat)07:23:42 No.102056467

>>102056370
NAI chose to ignore that because it drove business to them, they could have easily shut it down if they wanted to, there would have just been no benefit in doing so.
The situation with BFL is different. There's no gaming the system. If anyone releases a full finetune and attempts to profit from it BFL will not stand for it. They wouldn't have included the terms if they had no intention of enforcing them, it's part of their business plan.

Anonymous
08/24/24(Sat)07:24:36 No.102056478

Anonymous 08/24/24(Sat)07:24:36 No.102056478

>>102056169
thanks, ubuntu and gnome here ill look at aitoolkit, i guess it's 512 16gb vramlets?

Anonymous
08/24/24(Sat)07:32:49 No.102056545

Anonymous 08/24/24(Sat)07:32:49 No.102056545

File: 00041-4239731267.png (789 KB, 832x1216)

789 KB PNG

>>102056200
I feel like the guys making porno models did not care about the license.

Anonymous
08/24/24(Sat)07:37:29 No.102056581

Anonymous 08/24/24(Sat)07:37:29 No.102056581

File: file.png (2.1 MB, 1024x1024)

2.1 MB PNG

Anonymous
08/24/24(Sat)07:37:50 No.102056589

Anonymous 08/24/24(Sat)07:37:50 No.102056589

File: 2024-08-24_00207_.png (1.12 MB, 832x1216)

1.12 MB PNG

Anonymous
08/24/24(Sat)07:38:58 No.102056594

Anonymous 08/24/24(Sat)07:38:58 No.102056594

>>102056545
that pony lover sure does, he chose sdxl cause he can cash in on his finetune and start fighting and argueing with SAI when SD3 did have a "non-commercial" license .. just like Flux.dev, and now he wont even consider Flux.dev cause its non-commercial only

Anonymous
08/24/24(Sat)07:39:05 No.102056596

Anonymous 08/24/24(Sat)07:39:05 No.102056596

>>102056545
I wanna see if BFL takes down all the non-schnell models that "generate revenue".

NAI never did that, even though their model was leaked and it was literally almost over for them. As far as I know, they never tried to take down any model or mix. The whole 1.4 and 1.5 ecosystem relied on the original NAI leak.

Anonymous
08/24/24(Sat)07:39:21 No.102056601

Anonymous 08/24/24(Sat)07:39:21 No.102056601

File: ComfyUI_Flux_Dev_00343_.png (1.74 MB, 1024x1024)

1.74 MB PNG

Any interesting news in the past week or so?

Anonymous
08/24/24(Sat)07:41:10 No.102056627

Anonymous 08/24/24(Sat)07:41:10 No.102056627

File: FD_00111_.png (681 KB, 1024x1024)

681 KB PNG

Alright, I am not really happy with this LoRA but it's there if you want it.
https://civitai.com/models/680433

Anonymous
08/24/24(Sat)07:41:18 No.102056628

Anonymous 08/24/24(Sat)07:41:18 No.102056628

File: bComfyUI_108806_.jpg (1.05 MB, 2048x1088)

1.05 MB JPG

Anonymous
08/24/24(Sat)07:41:37 No.102056632

Anonymous 08/24/24(Sat)07:41:37 No.102056632

File: 2024-08-24_00215_.png (1.24 MB, 832x1216)

1.24 MB PNG

Anonymous
08/24/24(Sat)07:42:46 No.102056645

Anonymous 08/24/24(Sat)07:42:46 No.102056645

File: 2024-08-24_00214_.png (1.19 MB, 832x1216)

1.19 MB PNG

>>102056627
Thanks, I don't really care for Rebecca, but I'll give you a thumbs up in civitai for your work.

Anonymous
08/24/24(Sat)07:46:07 No.102056669

Anonymous 08/24/24(Sat)07:46:07 No.102056669

>>102056645
What lora is this?

Anonymous
08/24/24(Sat)07:47:18 No.102056684

Anonymous 08/24/24(Sat)07:47:18 No.102056684

>>102056467
We have yet to see that and until then, you're talking just as much shit as I am.

Anonymous
08/24/24(Sat)07:49:43 No.102056712

Anonymous 08/24/24(Sat)07:49:43 No.102056712

File: 2024-08-24_00216_.png (1.16 MB, 832x1216)

1.16 MB PNG

>>102056669
none, I am just writing abit boomer prosa atm, that was
>A picture in a flat anime style of a human wizard in the forest picking mushrooms. The scenario is sinister and evil. The mushrooms are black with green slime dripping of them.
>The wizards eyes glow with a dark menance, his robe is embroided with arcane symbols. He is a vile necromancer. Behind the wizard stands a skeleton servant holding a black weave basket filled with the same mushrooms the wizard is picking. Its nighttime and there is a mysterious glowing red moon in the sky.

still workin on it tho, the skeleton doesnt wanna hold the basket

Anonymous
08/24/24(Sat)07:50:37 No.102056714

Anonymous 08/24/24(Sat)07:50:37 No.102056714

>>102055954
yea he doesnt seem to clown on him too hard. You'd think there would be some of him getting his ass fucked by a demon or something by now. I think you are right.

Anonymous
08/24/24(Sat)07:53:29 No.102056736

Anonymous 08/24/24(Sat)07:53:29 No.102056736

another day, another hundred celebrity loras uploaded to civit

Anonymous
08/24/24(Sat)07:54:35 No.102056747

Anonymous 08/24/24(Sat)07:54:35 No.102056747

File: file.png (817 KB, 856x850)

817 KB PNG

fuck flux, when the next sota dropping?

Anonymous
08/24/24(Sat)07:55:06 No.102056750

Anonymous 08/24/24(Sat)07:55:06 No.102056750

>>102056736
any onlyfans girls?

Anonymous
08/24/24(Sat)07:55:16 No.102056755

Anonymous 08/24/24(Sat)07:55:16 No.102056755

File: ComfyUI_02096_.png (1.13 MB, 1024x768)

1.13 MB PNG

>>102056714
Interesting theory, but wrong

Anonymous
08/24/24(Sat)07:56:20 No.102056763

Anonymous 08/24/24(Sat)07:56:20 No.102056763

File: bComfyUI_108817_.jpg (803 KB, 2048x1088)

803 KB JPG

Anonymous
08/24/24(Sat)07:56:37 No.102056767

Anonymous 08/24/24(Sat)07:56:37 No.102056767

>>102056755
then stop posting the shit or make him fuck a demon or post the lora

Anonymous
08/24/24(Sat)07:57:08 No.102056773

Anonymous 08/24/24(Sat)07:57:08 No.102056773

>>102056755
it's completely the grifter, not a single time he got clowned and has the same style as the real one doing his own loras

Anonymous
08/24/24(Sat)07:57:39 No.102056780

Anonymous 08/24/24(Sat)07:57:39 No.102056780

File: 2024-08-24_00222_.png (1.21 MB, 832x1216)

1.21 MB PNG

damnit the skeleton just does not want to hold the basket, it is mocking me

Anonymous
08/24/24(Sat)07:57:41 No.102056781

Anonymous 08/24/24(Sat)07:57:41 No.102056781

File: 00014-2710870355.png (1.06 MB, 1152x896)

1.06 MB PNG

>>102056594
If you're not in it for the love of the game, I will reject your fine-tune on principle.

Anonymous
08/24/24(Sat)07:58:14 No.102056784

Anonymous 08/24/24(Sat)07:58:14 No.102056784

>>102056750
I dunno, aren't all of them?
it's not as if OF is an exclusive club with a high barrier to entry, anyone can sign up

Anonymous
08/24/24(Sat)07:58:49 No.102056791

Anonymous 08/24/24(Sat)07:58:49 No.102056791

File: Flux-20240824_135319-up-7(...).png (2.8 MB, 1248x1728)

2.8 MB PNG

>>102056755
then show us a catbox of fuckface fucking himself in the ass or s t o p.
>>102056736
anything nice?

Anonymous
08/24/24(Sat)07:58:57 No.102056792

Anonymous 08/24/24(Sat)07:58:57 No.102056792

File: 2024-08-24_00226_.png (1.37 MB, 832x1216)

1.37 MB PNG

... damn lazy skeleton.

Anonymous
08/24/24(Sat)08:00:41 No.102056809

Anonymous 08/24/24(Sat)08:00:41 No.102056809

File: 1695254425138922.png (1.73 MB, 1024x1024)

1.73 MB PNG

Anonymous
08/24/24(Sat)08:01:42 No.102056816

Anonymous 08/24/24(Sat)08:01:42 No.102056816

What's this Flux Atilessence Lora test
?

https://civitai.com/models/647940

Anonymous
08/24/24(Sat)08:03:11 No.102056828

Anonymous 08/24/24(Sat)08:03:11 No.102056828

File: bComfyUI_108936_.jpg (530 KB, 2048x1088)

530 KB JPG

Anonymous
08/24/24(Sat)08:03:35 No.102056832

Anonymous 08/24/24(Sat)08:03:35 No.102056832

File: ComfyUI_05488_.png (1.42 MB, 1024x1024)

1.42 MB PNG

>>102056823
Sigh... if you want better prompt adherance you have to go for other samplers than euler, but the one that work well are 2 times slower, fuck off
https://imgsli.com/MjkwNjE1
>striker0s, Mario Striker art style,
>A joyful woman with tears of happiness streaming down her face is holding a goat high in the air. The goat is wearing a golden crown adorned with jewels, and the word ‘Flux’ is elegantly written on the crown. The woman has a speech bubble next to her that exclaims, ‘THAT’S WHY HE’S THE GOAT!!’ The background is a vibrant, celebratory scene with confetti falling from the sky and a crowd of people cheering in the distance. The woman is dressed in casual, colorful clothing, and the goat looks proud and majestic with its crown.

Anonymous
08/24/24(Sat)08:03:42 No.102056833

Anonymous 08/24/24(Sat)08:03:42 No.102056833

File: Flux-20240824_140002-up-9(...).png (2.68 MB, 1248x1728)

2.68 MB PNG

>>102056792
he looks like he knows exactly which mushrooms to pick lulz
>>102056601
a lot is happening but we're slowly settling in, for now. lora flood phase. nice gen

Anonymous
08/24/24(Sat)08:04:13 No.102056842

Anonymous 08/24/24(Sat)08:04:13 No.102056842

File: 1716609748221962.png (1.76 MB, 1024x1024)

1.76 MB PNG

Anonymous
08/24/24(Sat)08:04:45 No.102056852

Anonymous 08/24/24(Sat)08:04:45 No.102056852

File: ComfyUI_00635_.png (579 KB, 512x768)

579 KB PNG

Does anyone have an example comfy workflow for generating an image in SD1.5 and then piping the latent into Flux for upscaling & refinement?
Or, incidentally, the other way round?

Anonymous
08/24/24(Sat)08:07:22 No.102056874

Anonymous 08/24/24(Sat)08:07:22 No.102056874

>>102056852
Why sd 1.5? Why not some sdxl stuff?

Anonymous
08/24/24(Sat)08:09:47 No.102056890

Anonymous 08/24/24(Sat)08:09:47 No.102056890

File: Flux-20240824_133025-up-7(...).png (2.65 MB, 1248x1728)

2.65 MB PNG

>>102056852
can you replicate a flow based on an image? dont wanna share directly.

Anonymous
08/24/24(Sat)08:10:17 No.102056893

Anonymous 08/24/24(Sat)08:10:17 No.102056893

>>102056767
So fucking stupid that I have to be bullied into posting this half baked LoRA.

https://gofile.io/d/7RjVgs

Anonymous
08/24/24(Sat)08:10:47 No.102056900

Anonymous 08/24/24(Sat)08:10:47 No.102056900

>>102056874
Exquisite Details is too good and stuck on SD1.5

>>102056890
Sure, that'd be great for learning actually.

Anonymous
08/24/24(Sat)08:11:18 No.102056907

Anonymous 08/24/24(Sat)08:11:18 No.102056907

>>102056893
I believe the activation phrase is Cerfukin

Anonymous
08/24/24(Sat)08:12:21 No.102056914

Anonymous 08/24/24(Sat)08:12:21 No.102056914

>>102056893
based, what's the trigger word?

Anonymous
08/24/24(Sat)08:12:49 No.102056918

Anonymous 08/24/24(Sat)08:12:49 No.102056918

>>102056893
early and often

Anonymous
08/24/24(Sat)08:12:57 No.102056919

Anonymous 08/24/24(Sat)08:12:57 No.102056919

>>102056914
Cerfukin

Anonymous
08/24/24(Sat)08:16:18 No.102056944

Anonymous 08/24/24(Sat)08:16:18 No.102056944

File: 1699610027443295.png (1.81 MB, 1024x1024)

1.81 MB PNG

Anonymous
08/24/24(Sat)08:16:27 No.102056947

Anonymous 08/24/24(Sat)08:16:27 No.102056947

Anyone using joycaption locally? I can't figure it out even with translate

Anonymous
08/24/24(Sat)08:16:45 No.102056953

Anonymous 08/24/24(Sat)08:16:45 No.102056953

File: up.png (249 KB, 2552x606)

249 KB PNG

>>102056900
sleepy but ask away. need some custom nodes. power perlin noise, rgthree, KJnodes, nothing fancy. don't get confused by those set/get nodes.

Anonymous
08/24/24(Sat)08:18:19 No.102056967

Anonymous 08/24/24(Sat)08:18:19 No.102056967

>>102056944
what about the worst remixes?

Anonymous
08/24/24(Sat)08:19:01 No.102056977

Anonymous 08/24/24(Sat)08:19:01 No.102056977

>>102056967
Turn on youtube and boom you got em

Anonymous
08/24/24(Sat)08:20:17 No.102056984

Anonymous 08/24/24(Sat)08:20:17 No.102056984

File: ComfyUI_01355_.png (720 KB, 1024x768)

720 KB PNG

Anonymous
08/24/24(Sat)08:21:33 No.102056994

Anonymous 08/24/24(Sat)08:21:33 No.102056994

File: ComfyUI_01372_.png (758 KB, 1024x768)

758 KB PNG

>>102056984

Anonymous
08/24/24(Sat)08:22:51 No.102057010

Anonymous 08/24/24(Sat)08:22:51 No.102057010

File: ComfyUI_01373_.png (768 KB, 1024x768)

768 KB PNG

>>102056994

Anonymous
08/24/24(Sat)08:23:09 No.102057015

Anonymous 08/24/24(Sat)08:23:09 No.102057015

>>102053192
>JC Denton
Seconding this
(I just woke up)

Anonymous
08/24/24(Sat)08:23:29 No.102057016

Anonymous 08/24/24(Sat)08:23:29 No.102057016

File: 0.jpg (101 KB, 1024x1024)

101 KB JPG

Anonymous
08/24/24(Sat)08:23:57 No.102057019

Anonymous 08/24/24(Sat)08:23:57 No.102057019

>>102057015
Why not a whole deus ex lora?

Anonymous
08/24/24(Sat)08:24:32 No.102057027

Anonymous 08/24/24(Sat)08:24:32 No.102057027

>>102052849
TV's Frank

Anonymous
08/24/24(Sat)08:24:49 No.102057032

Anonymous 08/24/24(Sat)08:24:49 No.102057032

File: ComfyUI_01379_.png (760 KB, 1024x768)

760 KB PNG

>>102057010

Anonymous
08/24/24(Sat)08:25:03 No.102057035

Anonymous 08/24/24(Sat)08:25:03 No.102057035

>>102056792
That's a trustworthy fellow if I ever saw one.

>>102057019
Yeah, that would be really cool. I'd do one, but I ran into an error no one on the internet has. I'm waiting a few weeks to see if updates solve it.

Anonymous
08/24/24(Sat)08:25:52 No.102057046

Anonymous 08/24/24(Sat)08:25:52 No.102057046

>>102057019
>>102057035
>>102057015
I might do this next, the Rebecca LoRA pissed me off with how bad it is. Just need buzz. Should be able to farm it up in a couple of days.

Anonymous
08/24/24(Sat)08:25:59 No.102057047

Anonymous 08/24/24(Sat)08:25:59 No.102057047

File: ComfyUI_01380_.png (730 KB, 1024x768)

730 KB PNG

>>102057032

Anonymous
08/24/24(Sat)08:26:04 No.102057048

Anonymous 08/24/24(Sat)08:26:04 No.102057048

Model train you!

Anonymous
08/24/24(Sat)08:27:02 No.102057060

Anonymous 08/24/24(Sat)08:27:02 No.102057060

>>102057046
The Rebecca LoRA looked good.

Anonymous
08/24/24(Sat)08:27:15 No.102057063

Anonymous 08/24/24(Sat)08:27:15 No.102057063

File: ComfyUI_01383_.png (702 KB, 1024x768)

702 KB PNG

>>102057047

Anonymous
08/24/24(Sat)08:28:09 No.102057070

Anonymous 08/24/24(Sat)08:28:09 No.102057070

File: flux_00739_.png (1.86 MB, 1024x1320)

1.86 MB PNG

>>102057046
Bless you, anon. I tried one of my Deus Ex prompts and was disappointed that flux didn't recognize it.

Anonymous
08/24/24(Sat)08:28:20 No.102057071

Anonymous 08/24/24(Sat)08:28:20 No.102057071

Man SDXL was way better, most loras so far have been shit for my flux creations. It seems and maybe im wrong here that SDXL took way less effort to create stuff that was presentable. If you have even 1 shit dataset in your flux it will fucking ruin everything, with SDXL, you could maybe have a few in there but it wouldn't shit the bed completely. What am I doing wrong? The captions are fine, im using openai to interrogate the images, using 5000 steps, but if you include a dataset that might be even a slightly bit different than everything else, even though it has all the same features, your end result will be shit.

Anonymous
08/24/24(Sat)08:28:50 No.102057076

Anonymous 08/24/24(Sat)08:28:50 No.102057076

File: ComfyUI_01384_.png (793 KB, 1024x768)

793 KB PNG

>>102057063

Anonymous
08/24/24(Sat)08:28:52 No.102057077

Anonymous 08/24/24(Sat)08:28:52 No.102057077

File: FD_00100_.png (941 KB, 1024x1024)

941 KB PNG

>>102057060
It's very cherry picked. Have to describe her pretty exactly otherwise you just get random women with her tattoos and eyes.

Anonymous
08/24/24(Sat)08:29:27 No.102057085

Anonymous 08/24/24(Sat)08:29:27 No.102057085

>>102057071
>l
What parameters are you training at? People say Flux isn't easy to overtrain, but I find it can easily go off the rails if your settings are too strong.

Anonymous
08/24/24(Sat)08:29:51 No.102057087

Anonymous 08/24/24(Sat)08:29:51 No.102057087

>>102055975
wtf why is it always the french?

Anonymous
08/24/24(Sat)08:29:58 No.102057088

Anonymous 08/24/24(Sat)08:29:58 No.102057088

I love you rebeccaaaaa

Anonymous
08/24/24(Sat)08:30:00 No.102057090

Anonymous 08/24/24(Sat)08:30:00 No.102057090

>>102057046
>>102057060
>>102057077
Where's the Rebecca LoRA?

Anonymous
08/24/24(Sat)08:30:11 No.102057093

Anonymous 08/24/24(Sat)08:30:11 No.102057093

File: file.png (2.4 MB, 1024x1024)

2.4 MB PNG

>>102057076

Anonymous
08/24/24(Sat)08:30:28 No.102057096

Anonymous 08/24/24(Sat)08:30:28 No.102057096

>>102057077
Oh, okay, now I see.

Anonymous
08/24/24(Sat)08:30:31 No.102057098

Anonymous 08/24/24(Sat)08:30:31 No.102057098

File: FD_00110_.png (1 MB, 1024x1024)

1 MB PNG

>>102057090
https://civitai.com/models/680433

Anonymous
08/24/24(Sat)08:30:35 No.102057099

Anonymous 08/24/24(Sat)08:30:35 No.102057099

>>102057085

https://pastebin.com/m0FPqQFP

Anonymous
08/24/24(Sat)08:31:01 No.102057102

Anonymous 08/24/24(Sat)08:31:01 No.102057102

File: ComfyUI_01387_.png (770 KB, 1024x768)

770 KB PNG

>>102057093

Anonymous
08/24/24(Sat)08:31:13 No.102057106

Anonymous 08/24/24(Sat)08:31:13 No.102057106

>>102055349
Ignore the retard, I'll try to find a clean(ish) way to do that when I get home. It's slightly faster if you have the vram but I didn't want to add separate LoRA/CN/etc nodes just for gguf shit but a "force patch vram" node or sth should work even if it's hack.

Anonymous
08/24/24(Sat)08:32:11 No.102057119

Anonymous 08/24/24(Sat)08:32:11 No.102057119

File: ComfyUI_01392_.png (741 KB, 1024x768)

741 KB PNG

>>102057093
Cursed

Anonymous
08/24/24(Sat)08:32:20 No.102057121

Anonymous 08/24/24(Sat)08:32:20 No.102057121

File: 00044-2573187592.jpg (966 KB, 2224x1248)

966 KB JPG

goomorn
today is, indeed, latina caturday.
https://youtu.be/M8qZT4BqZ6E?si=8SvP2v-XElvoQNV4

Anonymous
08/24/24(Sat)08:32:35 No.102057125

Anonymous 08/24/24(Sat)08:32:35 No.102057125

>>102057106
>I didn't want to add separate LoRA/CN/etc nodes just for gguf shit but a "force patch vram" node or sth should work even if it's hack.
why can't you add a button on your GGUF model loader, like "load the lora on vram" ON/OFF, that could do the trick?

Anonymous
08/24/24(Sat)08:32:41 No.102057127

Anonymous 08/24/24(Sat)08:32:41 No.102057127

File: 2024-08-24_00231_.png (1.47 MB, 832x1216)

1.47 MB PNG

>>102057035
>>102056833
ya you can trust em with your loved ones and best kept secrets..

also I gave em fucking A and B signifiers and then said the skeleton with the B hold the basket.. nothing.. by now I think there is magic at play

Anonymous
08/24/24(Sat)08:32:50 No.102057130

Anonymous 08/24/24(Sat)08:32:50 No.102057130

File: Flux-20240824_142712-up-8(...).png (2.82 MB, 1248x1728)

2.82 MB PNG

why are you (You)ing yourself anon
>>102057071
I dunno, I dont make loras. but what I can say is flux loras, so far, are fucking all over the place and testing them/dialing them in is painful AF. patience wearing thin desu.
>>102057098
wild!

Anonymous
08/24/24(Sat)08:33:27 No.102057135

Anonymous 08/24/24(Sat)08:33:27 No.102057135

File: ComfyUI_01394_.png (807 KB, 1024x768)

807 KB PNG

>>102057119

Anonymous
08/24/24(Sat)08:33:36 No.102057137

Anonymous 08/24/24(Sat)08:33:36 No.102057137

>>102057119
no Sailor, it's still not a Hamburguaa :(

Anonymous
08/24/24(Sat)08:35:18 No.102057158

Anonymous 08/24/24(Sat)08:35:18 No.102057158

File: 2024-08-24_00232_.png (1.38 MB, 832x1216)

1.38 MB PNG

gngnrrnnfn...

Anonymous
08/24/24(Sat)08:35:30 No.102057159

Anonymous 08/24/24(Sat)08:35:30 No.102057159

File: ComfyUI_01397_.png (955 KB, 1024x768)

955 KB PNG

>>102057137

Anonymous
08/24/24(Sat)08:36:10 No.102057165

Anonymous 08/24/24(Sat)08:36:10 No.102057165

File: ComfyUI_temp_fgefi_00038_.png (972 KB, 1128x768)

972 KB PNG

>>102057070
>I tried one of my Deus Ex prompts
Here's one from way back, maybe from an SD1.5 finetune. I really like the aesthetics, except the guy for some reason turned green.

Anonymous
08/24/24(Sat)08:36:26 No.102057170

Anonymous 08/24/24(Sat)08:36:26 No.102057170

>>102057130
>I dunno, I dont make loras. but what I can say is flux loras, so far, are fucking all over the place and testing them/dialing them in is painful AF. patience wearing thin desu.

That's just it though, dialing it in takes forever and then finding a prompt where you get variances is also hard to find. With SDXL you just changed a few words and you would get totally different setting, lighting, etc. With Flux its almost always the same fucking gen

Anonymous
08/24/24(Sat)08:37:00 No.102057173

Anonymous 08/24/24(Sat)08:37:00 No.102057173

>>102057165
>the guy for some reason turned green.
I distinctly remember all art from that game having that green hue. If anything it's the woman that's not matching the aesthetic.

Anonymous
08/24/24(Sat)08:37:01 No.102057174

Anonymous 08/24/24(Sat)08:37:01 No.102057174

File: ComfyUI_01398_.png (963 KB, 1024x768)

963 KB PNG

>>102057159

Anonymous
08/24/24(Sat)08:38:13 No.102057186

Anonymous 08/24/24(Sat)08:38:13 No.102057186

>>102057127
flux got a will of its own. (certainly) guidance, step amount, the max/base shift and jesus all play a part in this.

Anonymous
08/24/24(Sat)08:38:57 No.102057196

Anonymous 08/24/24(Sat)08:38:57 No.102057196

>>102057170
>With Flux its almost always the same fucking gen
Doesn't that usually mean it's overtrained?

Anonymous
08/24/24(Sat)08:39:59 No.102057211

Anonymous 08/24/24(Sat)08:39:59 No.102057211

File: ComfyUI_01406_.png (928 KB, 768x1024)

928 KB PNG

>>102057186
It does, I am not even sure what it's trying to tell me with this chinese text

Anonymous
08/24/24(Sat)08:40:11 No.102057212

Anonymous 08/24/24(Sat)08:40:11 No.102057212

File: 2024-08-24_00233_.png (1.49 MB, 832x1216)

1.49 MB PNG

>>102057186
mostly Jesus .. now the skeleton is A .. guidance is 3.5 .. I guess ill try lowering that

Anonymous
08/24/24(Sat)08:40:55 No.102057226

Anonymous 08/24/24(Sat)08:40:55 No.102057226

File: file.png (2.27 MB, 1024x1024)

2.27 MB PNG

Anonymous
08/24/24(Sat)08:41:05 No.102057227

Anonymous 08/24/24(Sat)08:41:05 No.102057227

>>102057212
What's the prompt?

Anonymous
08/24/24(Sat)08:41:31 No.102057235

Anonymous 08/24/24(Sat)08:41:31 No.102057235

File: 4086247269.png (1.4 MB, 896x1152)

1.4 MB PNG

Looks like training on 512px works but introduces the old familiar problems like mangled hands and such. Or it could be something else with the dataset, what do you think?

Anonymous
08/24/24(Sat)08:42:00 No.102057243

Anonymous 08/24/24(Sat)08:42:00 No.102057243

>>102057196

It doesn't matter a whole lot, tried 1000 steps to 5000 steps, its cool to try dont get me wrong, but to actually do work with, its a nightmare. It just has potential but that's it. Its over hyped bullshit

Anonymous
08/24/24(Sat)08:42:02 No.102057245

Anonymous 08/24/24(Sat)08:42:02 No.102057245

>>102057227
holy fucking christ now they are holding it together looking mockingly at me...

ah.. the prompt here:

>A cinematic shot in a photorealistic style of a human wizard. He is wearing a long black hooded robe and has a long white beard. His robe is embroided with arcane symbols. He is bald and his face is grumpled and old. He has a sinister grin on his face. His left eye is blind. He is a vile necromancer. On the wizard's forehead there is a tattoo that is the simple bold letter A.

>He is in the forest picking mushrooms. The scenario is sinister and evil. The mushrooms are black. Green slime covers the mushrooms.

>Behind the wizard stands a skeleton servant with glowing eyes. On the skeletons skull a is a simple bold letter B is painted in red.

>The skeleton with B on on its skull is holding a basket. The basket is filled with mushrooms. Its nighttime and there is a mysterious glowing blue moon in the sky.

Anonymous
08/24/24(Sat)08:42:03 No.102057246

Anonymous 08/24/24(Sat)08:42:03 No.102057246

File: 0.jpg (140 KB, 1024x1024)

140 KB JPG

Anonymous
08/24/24(Sat)08:42:20 No.102057247

Anonymous 08/24/24(Sat)08:42:20 No.102057247

File: 00058-195349927.png (810 KB, 1152x896)

810 KB PNG

>>102057186
FLUX is the LLaMA 3 of image models (high quality but very censored).

Anonymous
08/24/24(Sat)08:43:04 No.102057253

Anonymous 08/24/24(Sat)08:43:04 No.102057253

File: 2024-08-24_00235_.png (1.35 MB, 832x1216)

1.35 MB PNG

Anonymous
08/24/24(Sat)08:43:47 No.102057260

Anonymous 08/24/24(Sat)08:43:47 No.102057260

>>102057247
ablation when

Anonymous
08/24/24(Sat)08:46:26 No.102057282

Anonymous 08/24/24(Sat)08:46:26 No.102057282

>>102057260
>ablation
I had to look that word up.

Anonymous
08/24/24(Sat)08:46:47 No.102057287

Anonymous 08/24/24(Sat)08:46:47 No.102057287

File: ComfyUI_01274_.png (934 KB, 1280x720)

934 KB PNG

Anonymous
08/24/24(Sat)08:46:59 No.102057290

Anonymous 08/24/24(Sat)08:46:59 No.102057290

Ready to roll with the a fresh loaf of...
>>102057280
>>102057280
>>102057280

Anonymous
08/24/24(Sat)08:48:49 No.102057302

Anonymous 08/24/24(Sat)08:48:49 No.102057302

>>102057211
well there is the number 33 in there so thats a good thing. I like numbers.
>>102057212
I mean it certainly does listen. just go like "4 panel grid, 4x4, panel 1: bla, panel 2:jesus, ..." and it will do that
>>102057246
super
>>102057247
well I distinctly remember fucking a llama-3 powered characters brains out so that can be jailbreaked. assistent assistant. nice face there

Anonymous
08/24/24(Sat)09:21:30 No.102057682

Anonymous 08/24/24(Sat)09:21:30 No.102057682

File: ifx185.png (1.69 MB, 1024x1024)

1.69 MB PNG

Anonymous
08/24/24(Sat)09:23:14 No.102057713

Anonymous 08/24/24(Sat)09:23:14 No.102057713

>>102056832
it seems to me like both failed, could you try it without the lora? these types of comparisons are best on the base model.

Anonymous
08/24/24(Sat)10:09:41 No.102058268

Anonymous 08/24/24(Sat)10:09:41 No.102058268

>>102055870
pretty slick artstyle
flux lora?

Anonymous
08/24/24(Sat)10:51:04 No.102058832

Anonymous 08/24/24(Sat)10:51:04 No.102058832

>>102057170
Not with schnell, as ong as you're not after photos. Schnell got range.

Anonymous
08/24/24(Sat)11:56:56 No.102059746

Anonymous 08/24/24(Sat)11:56:56 No.102059746

>>102056953
nta but thanks a lot, also those are some great custom nodes.

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.