[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.4 MB, 3264x3264)
1.4 MB
1.4 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102057280

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: 2024-08-24_00273_.png (1.43 MB, 1216x832)
1.43 MB
1.43 MB PNG
>>102059183
ty baker
>>
>>102059207
need kawaii cat skeleton in cat poses like leg up licking himself, nya hands pose, on back looking at you while he shows his skelly belly, etc
>>
There's almost 15k flux loras now?

1 week ago there were around 100

Can barely find interesting loras now
>>
>>
Good collage choices.
By the way fuck the Patreon guy. Here's a workflow some dude posted on Reddit to do it in comfy for free. Works extremely well, uses about 9GB VRAM.
D:\ai\training_sets\rebecca_cyberpunk_v2\caption
>>
>>102059230
>spending time testing different settings, refining my datasets, captions
>now if I post any of my effort loras I'll be drowned out by a sea of shit
well, at least I enjoy making them even if no one ends up using em
>>
File: 1710336089488063.png (24 KB, 481x297)
24 KB
24 KB PNG
dual clip or no?
>>
>>102059255
>D:\ai\training_sets\rebecca_cyberpunk_v2\caption
anon...
>>
File: 1717833043290858.png (1.14 MB, 671x971)
1.14 MB
1.14 MB PNG
>The power of Flux
BlackForest-Sama I kneel.
>>
>>102059255
Can't tell if funny or legit retarded.
>>
>>102059255
I am in your computer
>>
File: 2367814336.png (1.31 MB, 832x1216)
1.31 MB
1.31 MB PNG
I somehow managed to train a lora that makes Flux look like SD1.4 kek
>>
>>102059255
>uses about 9GB VRAM.
what flux model?
>>
>>102059264
/g/
>technology
>>
>>102059255
>guys let's get poorfags (with shit tastes) the ability to train instead of focusing on how to get full finetunes done by richfags
>>
>>102059255
Kill myself now wrong fucking paste
https://files.catbox.moe/u2xfic.json
>>
>>102059277
It's perfect..
>>
>>102059255
I prefer my AUTOMATIC1111 workflow, I already have it set up if anyone wants to try it. http://127.0.0.1:7860/
>>
>>102059308
kek'd
>>
>joy-caption
Should I switch the Llama3.1b with the uncensored version?
>>
>>102059259
Yes
>>
File: 393501445.png (1.39 MB, 896x1152)
1.39 MB
1.39 MB PNG
>>102059255
What resolution are you training on, based retard? Cuz I tried 512px and the hands got all fucked up.
>>
File: 1708089120150285.jpg (142 KB, 1536x864)
142 KB
142 KB JPG
Can anyone use the great new summertime saga lora on this meme image as sketch controlnet?

https://civitai.com/models/676047/summertime-saga-ponyxl-style-dora?modelVersionId=756775
>>
File: 4step_up_00057_.png (3.7 MB, 1536x1536)
3.7 MB
3.7 MB PNG
>>
>>102059335
1024. It's a shit LoRA, first time ever doing anime training on any model so it's fucked up.
>>
>>102059335
>I tried 512px and the hands got all fucked up.
nta but I had the same problem from bucketing, might end up less fucked if you crop/shrink the images yourself but I didn't have the same issue using 1024 reso, so
>>
I want to train a flux lora on smaller images. They are variable in their resolution but most are about 400x300. Will they just be ignored when I train on a higher resolution, e.g., 512x512? What resolution and bucket sizes would you use?
>>
>>102059323
Joycaption can into nudity, but which uncensored model are you talking about?
>>
File: 00120-AYAKON_1248188.png (3.4 MB, 1536x2560)
3.4 MB
3.4 MB PNG
>>
>>102059295
for finetune or lora babe?
>>
File: 3314905267.png (1.33 MB, 1216x832)
1.33 MB
1.33 MB PNG
>>102059353
>>102059364
Good to know, I'll try a run tonight with 1024 with bucketing and if that's still fucked up I'll try a 1024x1024 cropped dataset.
>>
>>102059376
it can be set to upscale them. yes I'd use 512x512
>>
anyone else has problems with ComfyUI Fill Nodes? Unresolvable import failure.. gives no error msg
>>
>>102059377
Llama3.1 Lexi
>>
File: Untitled.png (4 KB, 248x75)
4 KB
4 KB PNG
The fuck this button do?
>>
>>102059412
comfortableui
>>
>>102059394
LoRA. I don't have the capacity for a fine tune.
But it's nice to be able to download a shit load of images, rename them all with a powershell script, and hit "go" on comfy to have them all captioned. The hard part now is manually checking the captions are correct.
>>
>>102059399
do keep in mind fingers are usually the first to go if you overcook too. 1024 reso with bucketing should work fine though
>>
>>102059423
Why is the model in 4 files instead of 1?
>>
>>102059433
Noted, though I only did 1600 steps so it seems unlikely to be overcooked.
>>
>>102059426
I'm not sure either but my search to find out isn't looking so good..
>>
File: ComfyUI_00814_.png (808 KB, 1024x1024)
808 KB
808 KB PNG
So I finished training my LoRa and been testing it out a bit and I'm not all that happy about it.
My dataset contains 59 images and I let it run for 2000 steps.
>>
>>102059401
but won't that introduce a load of artifacts when it's done on almost all the images?
>>
>>102059470
Looks fine to me, what's the problem?
>>
>>102059470
params used?
>>
>>102059478
sometimes, really depends. 512x512 reso wouldn't be a huge leap from the orig reso so you might be able to get away with it
>>
>>102059470
Is that supposed to be someone we know
>>
File: ComfyUI_00813_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>102059483
Feels like its too difficult to get Flux to make her.
>>102059490
you mean in the training config? I dunno lol, I mostly kept everything like in the example.yml
>>
>>102059470
>>102059526
literally who?
>>
>>102059463
Well, let's see what happens.
>>
File: ComfyUI_00772_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>102059522
goddamnit its so bad nobody even recognizes who its supposed to be.
JUST fuck my shit up
>>
File: ComfyUI_21314_.png (2.34 MB, 1920x1080)
2.34 MB
2.34 MB PNG
>>
>>102059526
>I mostly kept everything like in the example.yml
that's probably why. you'll at the least want to increase LR if you aren't satisfied with the training by 2000 steps with that size of dataset. no idea what the example uses but if it's on adafactor try adamw8bit with cosine instead
>>
>>102059539
Myabe the training isn;t the issue maybe it's how you are using the lora.
>>
>>102059539
Tell us who it's supposed to be. I trained myself and some other irl people and it got us perfectly. My face isn't famous and yet I have a LoRA of myself.
>>
>>102059535
come back with results I must know
>>
File: ComfyUI_21310_.png (2.41 MB, 1920x1080)
2.41 MB
2.41 MB PNG
>>
>>102059539
It's Kasia isn't it?
>>
File: ComfyUI_02088_.png (1.11 MB, 768x1024)
1.11 MB
1.11 MB PNG
>>102059564
t-
>>
>>102059539
Hi can you stop posting a lora of my wife?
>>
Was gone for a week, any big news? Did someone figure out how to train flux loras on low vram?
>>
>>102059564
grifter... Is that you...???
>>
>>102059580
Thankfully I'm not that Jewish
>>
File: ComfyUI_21332_.png (2.35 MB, 1920x1080)
2.35 MB
2.35 MB PNG
>>
>>102059539
It's HT
>>
File: 3336036614.png (962 KB, 1216x832)
962 KB
962 KB PNG
>>102059336
>>
>>102059597
No I am the guy who AI'd my linked in profile because I don't want to buy a suit.
>>
>>102059559
>train:
>batch_size: 1
>steps: 2000
>gradient_accumulation_steps: 1
>train_unet: true
>train_text_encoder: false
>gradient_checkpointing: true
>noise_scheduler: flowmatch
>optimizer: adamw8bit
>lr: 0.0001
>ema_config:
>use_ema: true
>ema_decay: 0.99
>dtype: bf16

Should I have changed anything here?

>>102059563
could be, how can I figure out the optimal settings for the LoRa I made?

>>102059577
YES! its Kasia.
>>
>>102059611
>AI'd my linked in profile because I don't want to buy a suit.

Every time I see someone doing this, they are always indian, always.
>>
>>102059612
who the fuck is Kasia
>>
>>102059577
>>102059612
>Kasia
I don't even know who that is. I must be old.
>>
>>102059612
try LR 0.0003
>>
>>102059542
>>102059575
shieeet i remember you from way back, this a flux lora?
>>
File: ComfyUI_212321_.png (3.25 MB, 1920x1029)
3.25 MB
3.25 MB PNG
>>
Offtopic but is there a general for voice generation models? Idk what board to check
>>
File: hfhfg6.jpg (1.12 MB, 4608x857)
1.12 MB
1.12 MB JPG
pick your weight. -3 for mig
>>
https://reddit.com/r/StableDiffusion/comments/1f026vb/flux_isnt_great_you_only_think_it_is/
Is this Lykon's reddit?
>>
File: bComfyUI_109238_.jpg (825 KB, 2048x1088)
825 KB
825 KB JPG
>>
>>102059630
>Kasia.
Even a google search doesn't bring anything up, there's this plastic surgery looking woman that always wears sunglasses but that doesn't seem to be her.

This is some niche it seems
>>
File: ComfyUI_00770_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>102059625
>>102059630
only old Coomers will know.

>>102059637
what does that do?
also does that make training slower?
and should I go for more than 2000 steps if I change this?
>>
>>102059570
Well, so far it's added +2 seconds on the time it takes for one it.
>>
>>102059607
lol, increase denoise or whatever perhaps? to create an actual image on the more vague basis of the sketch
>>
>>102059658
>what does that do?

LR means learning rate.
Take a wild guess of what will increase if you bump to learning rate from 0.0001 to 0.0003
>>
File: ComfyUI_21157_.png (2.04 MB, 1920x1080)
2.04 MB
2.04 MB PNG
>>102059640

yes, ill probably release it later today
>>
>>102059622
I am white, and even people who've known me for ages don't know it's not real.
>>102059658
Looks just like her desu, what's wrong with the LoRA in your mind?
>>
lora bros... kohya keeps dismissing adding quants... I don't think I can cope...
>>
>>102059658
I'm an old coomer and I have no idea who that is
>>
>>102059647
6
>>
>>102059646
No, there's not that much happening in that field atm. You'll get the most discussion and information about it on /mlp/. Best bet is still XTTS-v2.
>>
>>102059693
too fat!
>>
>>102059612
bump the batch size if you got spare VRAM
>>
>>102059689
>>102059658


I found out who she is, she's someone called Teen Kasia
>>
>>102059696
Good to know, thanks
>>
File: Flux_00070_.png (938 KB, 1024x1024)
938 KB
938 KB PNG
>>
>>102059723
I just got fired from my night job.
>>
File: ComfyUI_00815_.png (868 KB, 1024x1024)
868 KB
868 KB PNG
>>102059710
I'm a vramlet with only 24GB VRAM Sir
>>102059716
and? do you see the similarity?
>>
>>102059723
Try it in forge to see if it looks more like her
>>
File: 2717634441.png (1.46 MB, 832x1216)
1.46 MB
1.46 MB PNG
>>102059665
Could do that, but then it would just be a different image.
>>
>>102059747
yeah although her main feature seems to be her tight body, so hard to say without seeing more
>>
>>102056370
The problem is a proper finetune of Flux will cost money, and you can't make money from it due to the license.
>>
>>102059726
kek
>>102059736
why?

>>102059749
why would that make a difference?
>>
>>102059726
Hellraiser shit
>>
>>102059723
>Feels like its not doing her reliably enough whenever I prompt her.
I have found that in non-close ups too, but if you upscale through the LoRA the face comes back.
Also her nipples slipped
>>
File: ComfyUI_21269_.png (2.77 MB, 1920x1080)
2.77 MB
2.77 MB PNG
>>
>>102059750
you can tweak it by little and get a solid balance but alright, thanks anyway
>>
>>102059782
Sometimes loras can be better in Forge
>>
>>102059658
makes it learn stronger
>also does that make training slower?
>and should I go for more than 2000 steps if I change this?
no & no
you can try more than 2000 steps but it'll probably be overkill, never hurts to try and just use an earlier epoch though
>>
>>102059750
if you drop discord tag ill give you money for more of these
>>
>>102059663
terrifying
>>
>>102059647
-1, thigh gap or bust
>>
File: ComfyUI_00816_.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>102059789
>Also her nipples slipped
goddamnit
>>102059807
Do I have to start from 0 steps again or can I make it train from that config with higher learning rate and add another 2000 steps to the 2000 steps that I already have?
>>
>>102059840
I recommend starting from 0 for consistency but in theory you should be able to resume off the existing training, if you want to try that
>>
>>102059871
how I do that?
do I just change the config to 4000 steps and then just run it?
>>
>>102059426
that button? fuck all. attention mask would be useful for training if it were implemented properly. there's no point using an attention mask full of 1s, that's the same as not using an attention mask. the use case is applying attention only to certain tokens, it needs something in the dataset to determine which tokens should be masked
>>
>>102059884
I have no idea how you do it on whatever you're using, you will have to check the trainer's readme or GitHub page
>>
File: file.png (3.01 MB, 960x1280)
3.01 MB
3.01 MB PNG
>>102059747
>do you see the similarity?
Gen a video of her fucking a wall dildo on her bunk bed and I'll tell you.
Oh wait, Flux can't into porn (and probably never will).
>>
File: ComfyUI_21121_.png (2.28 MB, 1920x1080)
2.28 MB
2.28 MB PNG
>>
File: gtfdgf.gif (3 MB, 230x345)
3 MB
3 MB GIF
>>
File: 1717900582981338.png (3.36 MB, 2592x1192)
3.36 MB
3.36 MB PNG
>>
>>102059945
>guy in the back of the first boat paddling with a soup laddle, trolling his bro infront to do all the work
>>
>>102059808
no thanks d*bo
>>
>>102059945
spielberg's up shit creek
>>
>>102060018
not me, please...
>>
I posted a lora to civitai I trained as a test. It's objectively GARBAGE. It will give you unusable SHIT 9 out of ever 10 gens. And still people have only left positive ratings and keep downloading it. Already have 500 buzz (whatever that's good for) for essentially taking a shit on the middle of the street.
What the fuck is wrong with this gay world?
>>
is 1.0 the default strength_model for my loras? or does it vary? because 1.0 is not completely my model, i gotta go like 1.30 or above. what do you set this on to keep majority of model and to maybe add very schmol features?
>>
>>102059999
>9999
I wasted it... Hope whoever got 60k did better. let's see:
>>102060000
>>
File: 3725512108.png (1.13 MB, 1344x768)
1.13 MB
1.13 MB PNG
>>
File: ifx178.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>
File: azz.png (1.6 MB, 1216x832)
1.6 MB
1.6 MB PNG
>>
File: FD_u_00013_.jpg (916 KB, 2688x1536)
916 KB
916 KB JPG
>>
>>102060034
the vast majority of people have shit taste, and yeah, that's a feeling you cannot be used with, we are truely surrounded by retards
>>
>>102059920
So it's just burning my compute for nothing?
>>
>>102060051
this one's way cooler than the last. what about at night with a fires glow or moonlight highlighting their armor
>>
File: file.png (3.04 MB, 960x1280)
3.04 MB
3.04 MB PNG
>>
File: ComfyUI_212436_.png (1.96 MB, 1920x1080)
1.96 MB
1.96 MB PNG
>he slaps your girls ass, wwyd?
>>
File: ComfyUI_00822_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>102059945
>Gen a video
Technology isnt that far yet, GPUs that can do this have to be invented first.
>Flux can't into porn (and probably never will)
It can and it will eventually.

>>102059928
I'm using this here
https://github.com/ostris/ai-toolkit
>>
>>102060101
I've cum buckets to this girl
>>
does describing an objects height help when prompting and having proportion issues?
>>
I'm waiting for a pony-fied version of Flux.
>>
>>102060134
the best you'll get is schnell
sorry.
>>
>>102060110
its cool how AI has the potential to revive her.
>>
>>102060149
>revive
huh? masaka
>>
>>102060134
never happening and even if it was do you really believe astracuck can get that lucky twice in a row? pray for someone to crawl out of the woodwork to take up the mantle
>>
File: file.png (1.53 MB, 1280x960)
1.53 MB
1.53 MB PNG
>>102060091
>>
>>102060157
why?
>>
>>102060101
>Technology isnt that far yet, GPUs that can do this have to be invented first.
Have you been living under a rock?
Sora, KlingAI, Runway Gen-3, etc
>>
god. it will take longer to caption than to train
>>
>>102060171
Is she...dead?
>>
>>102060190
>Sora, KlingAI, Runway Gen-3, etc
you think you can run any of that shit on your consumer card?
>>
File: ComfyUI_00823_.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>>102060196
might as well be, she got married, she got fat, hit the wall and has 2 kids now.
>>
>>102060199
Was running on consumer cards a requirement stipulated anywhere?
>>
File: file.png (2.18 MB, 1280x960)
2.18 MB
2.18 MB PNG
>>102060091
Still waiting for a comeback
>>
>>102060199
Yes when Flux video comes out
>>
>>102060161
>I can't sell my shit therefore I'm not using it
Absolute joke of a baker
>>
>>102060218
look at the name of the thread you are posting in.
>>
>>102060239
Look at the comment I quoted.
>>
File: FD_u_00017_.jpg (752 KB, 2688x1536)
752 KB
752 KB JPG
>>102060084
Doing upscales at the minute so no new gens, but this one shows that
>>
>>102060218
Well, can you run anything else local?
>>
>>102060246
yep, it's in the same thread
you really should read the name, it'll all make sense then
>>
File: file.png (1.86 MB, 1280x960)
1.86 MB
1.86 MB PNG
>>
File: ComfyUI_00824_.png (2 MB, 1024x1024)
2 MB
2 MB PNG
>>
>>102060261
you really are one stupid son of a bitch uh?
>>
>>102060199
>>102060234

They all suck anyways, all of them including Sora and Luma, for every 5 minutes of video it will make, you will get maybe like 5 seconds of useable stuff, not to mention the shit load of time it will take. It's going to take at least 5 years to get anything worthwhile out of ai video. I'm not talking about genning random stuff like an elephant drinking water or some lame shit, im talking about actually using your own images and then prompting and getting anything worthwhile out of it, right now its just a scam, especially this luma bullshit right now. It can't even do basic animation without fucking up a whole lot.
>>
>>102060216
Excellent definition in her legs
>>
>>102059295
How do I add a different model to this? It's not showing up
>>
>>102060315
I cant even tell anymore, I been looking at AI chicks for so long I have no idea if they are anatomically correct or not.
>>
>>102060304
well duh that's why local flxu video will be way better, we will have more controll on how to use them.

e.g training flux video models and loras

Nothing stays the same, things advance
>>
>>102060320
sd3 was made for you
>>
>>102060320
nah those legs look fine
>>
File: ComfyUI_00807_.png (1005 KB, 1024x1024)
1005 KB
1005 KB PNG
>>102060304
>not to mention the shit load of time it will take.
Doing the math is really sobering.

>1 image in Flux takes around 1 Minute
>to be watchable the video needs 30 frames per second which is 30 images per second
>1 second of video : 30 minutes
>>
File: ComfyUI_Flux_0201.jpg (911 KB, 1536x2688)
911 KB
911 KB JPG
>>
File: file.png (1.66 MB, 1280x960)
1.66 MB
1.66 MB PNG
>>102060355
>1 second of video : 30 minutes
Actual video production is much, much slower.
>>
File: 1717861922125591.png (1.25 MB, 1152x896)
1.25 MB
1.25 MB PNG
Flux sucks it doesn't even gen vampires, they NEVER have fangs even if you specify fangs

I tried to gen Tony Blair drinking a pint of blood and it was just a clear red liquiod. So much for an "uncensored" model. What do people use to gen stuff that may be violent or saucy? I tried using Pony in the past but I could never get good results and the prompts I saw others using that got good shit looked schizo as fuck and made no sense to me

Flux is neat for generating cars though there are very few flaws

>>102060216
Rip, Kasia was cute as fuck
>>
>>102060143
What happens when you use a dev LoRA on schnell?
>>
>>102060355
Well as someone who has done 3D animation I guess it won't be much different lol
>>
>>102060376
Same as when you don't use a LoRA. It works, but schnell is shit.
>>
File: 1704361295766082.png (1 MB, 760x1088)
1 MB
1 MB PNG
>>
File: GGUF_TEST.png (40 KB, 829x412)
40 KB
40 KB PNG
>>102057125
Hope you'll see this anon. This is all the experimental optimization shit I could think of but some options may fuck image quality (like applying the LoRA in bf16) or OOM (like with that meme 1GB civitai LoRA).
Fastest would be picrel, which goes from 2.46s/it to 2.15s/it when using 1 LoRA on Q4_0 or 3.42s/it to 2.83s/it with 2 LoRAs on Q5_K_M.
>>
>>102060416
>that meme 1GB civitai LoRA
There is always a bigger fish. https://civitai.com/models/647663/porsche-911-gts-2024-flux
>>
>>102060413
mine
>>
>>102059255
I just wish i could get 512 to even fucking work on 16gb for flux, tried several redditors workflows and none of them work.
A least i have a training set ready to go.
>>
>>102060416
This also technically adds back the snakeoil sidegrade fp32 changes I ended up reverting a while ago as an option.
>>102060436
God damn it kek
>>
>>102060416
>Hope you'll see this anon.
Yes I'm here :D
>This is all the experimental optimization shit I could think of but some options may fuck image quality (like applying the LoRA in bf16)
Which one decrease the quality of picture between the 3 (dequant_dtype, patch_dtype, patch_on_device)?
>Fastest would be picrel, which goes from 2.46s/it to 2.15s/it when using 1 LoRA on Q4_0 or 3.42s/it to 2.83s/it with 2 LoRAs on Q5_K_M.
That's a nice improvement anon, I'm glad the GGUF will have tolerable speed with loras
>>
>use 1 lora: 2.40 s/it
>use 2 loras: 1,9 s/it
Why?
>>
>>102060386
Have there been any x/y comparisons?
>>
>>102060470
>I'm glad the GGUF will have tolerable speed with loras
it won't, he said it slightly increases speed but we know GGUF with lora massively loses speed so if the optimization doesn't massively speed it up we're still losing a lot of speed with loras
>>
File: ComfyUI_Flux_0197.jpg (1.01 MB, 1536x2688)
1.01 MB
1.01 MB JPG
>>
File: ComfyUI_212442_.png (2.51 MB, 1920x1080)
2.51 MB
2.51 MB PNG
Going live here soon, he was a pain in the ass to do, SDXL would never get details correct no matter what I did but Flux is doing a better job at it. Used better datasets including pictures from videos of people reviewing their chronicles cain figure which is now sold out and like 2K-3K. Can't use any other material like other figures or cheaper version as they dont line up to the movie datasets and it fucks things up, so it has to be pure. Would always fuck up on the radiation symbol and cause it to have extra petals. But happy with what im getting out of it.
>>
>>102060473
now do hires upscale
i get 7-9s/it
>>
>>102060504
Just merge your favorite loras into the base model. It's trivial to do so using ai-toolkit.
>>
File: flux_00802_.png (1.66 MB, 1280x1024)
1.66 MB
1.66 MB PNG
>>
>>102060470
>Which one decrease the quality of picture between the 3
dequant_dtype: target shouldn't change it (it'll change slightly if you set it to fp32). I'll probably enable this as the default but I have to make sure there's not some random precision loss that fucks one of the K quants or when using FP16 fallback.
patch_dtype: is the one that for me seemed to make LoRAs look ever so slightly off, though that also increased the speed the most. Also needs a recent-ish version of comfy to work (last week or so)
patch_on_device just puts it on the GPU so no difference there other than a possible OOM.
>>
File: ComfyUI_00971_.png (2.26 MB, 1152x1536)
2.26 MB
2.26 MB PNG
>>
File: ComfyUI_Flux_10551.jpg (353 KB, 768x1344)
353 KB
353 KB JPG
>>
Kasia was obviously a beautiful woman by any metric, but I feel like perfect tanned blondes are so ubiquitous in media that I've become completely desensitized to them. I wouldn't look twice if I passed a girl like that on the street. With pale skin or dark hair I'd be interested, but that specific look makes women invisible to me even no matter how attractive they are. Anybody else experience this?
>>
>>102060552
>dequant_dtype: target shouldn't change it (it'll change slightly if you set it to fp32).
oh it's the one that were supposed to improve precision but when compared to the current one we noticed it was further away from fp16 right? that's weird it happened that way, you'd imagine that adding more precision on some weights would increase the accuracy overall
>>
>>102060473
are you sure it didn't swap from s/it to it/s?
>>
how do you force a style to appear when you are genning that you created a keyword for its image inside its text file?
>>
>>102060504
Weird I get same speed with lora on and off, except for the part that loads up the lora
>>
File: file.png (376 KB, 943x341)
376 KB
376 KB PNG
>>102060620
Actually, I changed the resolution of the image. Nevermind.
>>
>>102060583
yeah I don't think her face stands out much but she has a great body.

And yes I love dark haired pale women, they stand out more, even when someone you know that's usually blonde goes dark she loks way better to me
>>
>>102060583
What happened to her?
>>
Failed gen, but I need to share it
https://litter.catbox.moe/di7jq5.png
>>
>>102060648
pink floyd - the wall
>>
File: ComfyUI_00831_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>102060583
>>102060645
ok so that mean my LoRa is shit and I should delete it.

>>102060648
see >>102060216
>>
>>102060668
No she looks similar to the real one, nice body ok/passable face
>>
File: bComfyUI_109336_.jpg (1.16 MB, 1536x2048)
1.16 MB
1.16 MB JPG
>>
>>102060603
Yeah. The reference implementation had FP32 compute too so not sure what that's about.
"target" here just converts it to whatever the model needs inside the dequantization instead of doing it after it's been dequantized - so one less conversion needed. Not a massive speedup but I guess it adds up.
>>
>>102060668
The real question is, can it generate a picture of her wearing nothing but a choker, fucking a dragon dildo, squirting everywhere?
>>
File: file.png (466 KB, 810x1002)
466 KB
466 KB PNG
>>102060719
>>
>>102060648
I just used the past tense because it's been like a decade since everybody was obsessed with her.
>>102060645
Even losing the tan and keeping the blonde would be a big upgrade in my opinion. Dark hair and fair skin is the best though.
>>102060668
Obviously people like her and it's no mystery why as she's pretty objectively attractive. I don't really understand what it is about her that makes be see her as completely unremarkable.
>>
>>102060719
Yes
>>
File: ComfyUI_00832_.png (1023 KB, 1024x1024)
1023 KB
1023 KB PNG
>>102060684
Should I release it on civitai?
do you think people would like it?

>>102060719
give me a prompt and I see what it can do.
the training data had lots of her naked body in it.

>>102060734
she has that girl next door vibe.
>>
>>102060741
Well, why haven't you provided a download link yet??
>>
>>102060767
These images are too hot for you, anon.
>>
>remember that guy who made Trump, Kamala, Kim Jong Un, Maduro and Greta loras
>his stuff got deleted
Damn, I wanted a Putin lora
>>
>>102060753
Yeah that's it she has that girl nextdoor vibe that people like.

You can release it if you want, but she might be a bit of a niche to get a lot of downloads.
>>
>>102060791
Where? They're still there
>>
File: ComfyUI_00834_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>102060793
the Kasia one for SD1.5 has 2000+ downloads.
>>
>>102060753
Honestly part of my reaction to her might be that only the biggest normie trash girls went for the tanned blonde look during my formative years. That and the ubiquity of that look in media. I could see her as a girl next door, but the normie girl next door I wouldn't want to interact with.

The weird part is that I wouldn't want to interact with arthoe types, but I absolutely still find them attractive.
>>
File: file.png (170 KB, 2681x877)
170 KB
170 KB PNG
>>102060552
>I'll probably enable this as the default
what does "default" means here? default is neither "target", "float32", "float16", "bfloat16"? it's like something else? because you're written {"default": "default"}) on dequant_dtype
>>
>>102060753
>give me a prompt and I see what it can do.
the training data had lots of her naked body in it.
Try this, change it to use whatever keywords you know it should know:
>This is an explicit picture of teen Kasia posing nude in a bedroom. Kasia's legs are spread apart, prominently displaying her shaved vulva. She is smiling, wearing light makeup, highlighting her natural beauty and youthful air. Kasia is completely nude. She is wearing a black choker.
>This picture is clearly intended for mature audiences due to its explicit and provocative nature, combining elements of naivete and raw sensuality characteristic of Kasia's work as a nude model.
>>102060791
You can make one in less than an hour.
>>
File: tmpzkgejjj4.png (3.56 MB, 2600x1109)
3.56 MB
3.56 MB PNG
der katalog
>>
>>102060805
There is also a huge different with seeing someone in images and seeing them in real life. Just the vibe and the way she moves etc can make you a lot more attracted vs images.
>>
arr rook same
>>
>>102060823
Default is just "None" for the cast inside the dequantization, i.e. what it currently does. Technically the same as setting fp16 since that's what it gets converted to.
>>
Who made this

https://civitai.com/models/676564/world-of-horror-flux-d
>>
>>102060877
oh ok, thanks for the answer
>>
>>102060823
Also, if you mean the option, that second field is just how you set the defaults for a field on a custom node. Both the option and the variable being called default looks a bit stupid now that I look at it lol.
>>
>>102060623
it's ever so slightly slower with just one lora now but it still loses speed as you stack more
2.5s
2.57s
3.18s
3.71s
>>102060552
Is it possible to merge the loras into one and have just that one in VRAM or does the math not check out like that?
>>
>>102060885
A based anon from these threads who rocks a 4090 and doesn't afraid of anything.
>>
>>102060885
yeah it was actually someone from here
>>
>>102060900
>Both the option and the variable being called default looks a bit stupid now that I look at it lol.
it's all right, if it works it works :v
>>
>>102060885
>>
>>102060832
actually turned out better than expected, should have probably included even more nude pictures of her in the training set (training set was 59 images)
https://files.catbox.moe/suxw58.png
>>
>>102060903
>Is it possible to merge the loras into one and have just that one in VRAM or does the math not check out like that?
I'd assume not unless they're the same dim/alpha I'd assume. Could probably use SVD but you'd wait 2 minutes every time you changed the strength which, yeah no fuck that.
Might try and replace comfy's calculate weight thing (which merges it onto the model) with the actual way you're meant to use LoRAs (calling up/down separately) and see if that's any faster.
>>
>>102060943
Not bad. I completely forgot about the dildo while writing the prompt lmao
>>
>>102060904
Why tf is 4090 so expensive
>>
>>102060953
>I'd assume
Apparently I had a stroke while writing that kek.
>>
>>102060960
Cause it's new. You can do the same training with a 3090, btw. Possibly with a 3060 if you're willing to wait longer.
>>
>>102060960
its only like $2000.
>>
>>102060960
because AMD won't compete
>>
File: ComfyUI_00836_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
kek
>>
File: file.png (19 KB, 265x183)
19 KB
19 KB PNG
>>102060960
I'll never have 2 A6000 Adas :(
>>
File: file.png (2.04 MB, 1024x1024)
2.04 MB
2.04 MB PNG
>>
File: I need this.jpg (194 KB, 2562x1060)
194 KB
194 KB JPG
>>102061011
>>
>>102061011
The Adas are just called 6000
>>
>>102061011
How fast does it generate an image on dev 30 steps?
>>
>>102061040
the 6000 and 6000 ada are two different cards
>>
File: ComfyUI_00842_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>
File: file.png (733 KB, 512x512)
733 KB
733 KB PNG
>>102061040
No the dumb niggers at Nvidia have two A6000s. Regular which is 3090s and A6000 Adas which is 4090s.

>>102061042
It's still 20 seconds. But I haven't done all the speed optimizations, I really only train now.
>>
>>102061060
right, what I mean is it is called RTX 6000 Ada, not A6000 Ada
>>
>>102061038
would be kinda pointless for image gen, 48gb of ram on the 6000 ada is more than enough even if we get a 30b image model
>>
>>102060436
You wouldn't download a car
>>
>>102061075
you need the VRAM to run big boy batch sizes, it's all about the batch size
>>
>>102061100
kek
>>
>>102061100
lmaooo
>>
>>102061070
Why do you train? You farm buzz?
>>
File: FluxDev_02811_.jpg (197 KB, 832x1216)
197 KB
197 KB JPG
Are we back, dall-e bros?
>>
File: file.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>102061030
>>
>>102061126
lol
>>
>>102061126
I'll take two
>>
>>102061124
I train because it gives me joy. For the same reason someone makes mods for a game. Although in this case I'm making a full base model from scratch.
>>
File: file.png (3.64 MB, 2638x1452)
3.64 MB
3.64 MB PNG
>>102061100
>>
File: file.png (788 KB, 1024x1024)
788 KB
788 KB PNG
>>102061126
That's a lot of condensed milk for such a small fairy. It's going to raise her glucose through the roof!
>>102061159
A total conversion. Based.
>>
its out https://civitai.com/models/682070/robocain-v099-flux
>>
File: file.png (2.22 MB, 1024x1024)
2.22 MB
2.22 MB PNG
>>102061181
Upboated
>>
>>102060977
FUCK OFF
>>
>>102061210
$2000 isnt much.
>>
>>102061062
Nice Turkish cat experience photo
>>
>>102061159
Like combine multiple loras for a checkpoint?
>>
>>102061234
Like make a 1.3B Pixart architecture model all the way from zero latents.
>>
>>102061207

ty anon, for next lora i wanna make an entire style, basically very late 1980s to mid 1990s sgi aesthetics, including their irix operating system and color palette choice, but also the design of the cases for sgi like the indigo 2, o2, octane 2, o2 purple, and then make nice battlestations of the future, but this is one of those projects where I just say it and never actually start doing it.
>>
>>102061218
FUCK OFF!!!!!!!!!!!!!!!!!!!!!!!!!!

The 1080 days was way better, you got what you paid for.

Go away Nvidia shill!!!!

MY GOD YOU GUYS ANNOY ME
>>
File: file.png (2.43 MB, 1280x960)
2.43 MB
2.43 MB PNG
>>
>>102061252
It's only 2000$ though.
>>
>hes back
>>
>>102061252
in modern money they are the same price, it's intuitive
>>
>>102061252
>bro remember my 1080 you could play like 1080p games on it, it was so worth it for playing Dishonored 2
>what, a graphics card that can generate 2K images from the AI ether for more than $1000, no thank you
>>
File: file.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
Nice
https://civitai.com/models/681642/illustrationscute-cartoon-cute-manga-flux?modelVersionId=762939
>>
>>102061259
Super Pepe 64
>>
File: 1643916497.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
>>102061259

say low poly head
>>
>>102061275
Sorry I mean 1090, all the GPUs were worth the price.

Dont understanding why u are guys prtend it not crazy now because of bitcoin miner etc
>>
Her breasts are small, and she has a penis, which is semi-erect and positioned prominently between her legs.

bravo joycap
>>
File: file.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
>>102061277
>Obligatory Miku cutting sushis to test out the model
>>
>>102061334
catbox it
>>
>>102061325
yeah, we have GPUs which can do more than play games now, crazy
>What, you need a GRAPHICS CARD to play games! We used to play games with just a CPU! WHAT A RIP OFF
That is you.
>>
>>102061252
do you have no job?
I make $16000 per month.
>>
Why cant we use negatives? Literally kills everything.
>>
>>102061353
>I exploit $16000 out of workers and consumers per month.
FTFY
>>
I didn't know /g/ was poor.
>>
Training something cute
>>
in a few different ways
>>
File: 4qe86qs255id1.jpg (1.92 MB, 3277x4229)
1.92 MB
1.92 MB JPG
>>102061360
>Why cant we use negatives?
you can anon, go for CFG > 1 + An anti CFG-Burner like AutomaticCfg, Tonemap, DynamicThresholding, SkimmedCFG...
>>
>>102061353
No matter how much u make it does not excuse shilling for corps
>>
>>102061363
Give you a protip, AI images wouldn't exist under communism.
>>
>>102061387
good
>>
just rent a 4090, its like 50 cents or less an hour, i have a 4080 but still dont use it and just rent a gpu, id rather it slow down in the cloud than locally while i do other shit.
>>
>>102061386
You don't have to buy a graphics card.
>>
Can anyone help me out?
I'm using some pony model..
I'm trying to get a character that wears a robe over lingerie, but everything I've been trying, it just combines the two into a single piece of clothing.
>>
>>102061409
there's always some cuck willing to help
>>
>>102061409
is it a realistic pony model? base PonyXL has no issue with that
>>
>>102061396
Where?
>>
>>102061379
soul
>>
File: 00022-2513160362.png (1.15 MB, 1024x1440)
1.15 MB
1.15 MB PNG
lol
>>
File: 355240.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
need flux foreskins
>>
>>102061452

community cloud is .44 cents an hour on runpod right now and im sure its even cheaper elsewhere, I have not looked.
>>
>>102061470
it's the Little Butcher meme
>>
File: file.png (2.47 MB, 1024x1024)
2.47 MB
2.47 MB PNG
Jesus! How tall is he??
>>
File: ComfyUI_Flux_0213.jpg (1009 KB, 1536x2688)
1009 KB
1009 KB JPG
>>
File: 709551257.png (1.58 MB, 1024x1024)
1.58 MB
1.58 MB PNG
>>
>>102061516

deeboooo is that you? lol little horn FAGGOT!
>>
It's here, bread straight from the oven...
>>102061535
>>102061535
>>102061535
>>
File: ifx193.jpg (156 KB, 1024x1024)
156 KB
156 KB JPG
>>
>>102061544
nice one
>>
>>102061404
You don't have to defend high prices.
>>
>>102061484
So you can make around 100 flux gens in 1 hour for 44 cents?
>>
>>102061126
FAIRY SEX
>>
>>102061207
>>102061259
Is this n64 style base flux or a lora?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.