[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (866 KB, 3264x3264)
866 KB
866 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102106681

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
Blessed deboless thread of friendship.
>>
File: 2024-08-27_00210_.jpg (961 KB, 3840x2160)
961 KB
961 KB JPG
>>102111793
ty baker
>>
File: ComfyUI_00904_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
ok I think I'm ready to showcase my new LoRa that I was baking for 6000 steps.
>>
File: ComfyUI_33109_.png (878 KB, 1280x720)
878 KB
878 KB PNG
>>
File: 00012-1953973024.png (1.91 MB, 1152x1728)
1.91 MB
1.91 MB PNG
>>102110652
>>
File: 00018-1953973030.png (1.8 MB, 1152x1728)
1.8 MB
1.8 MB PNG
>>102111909
>>
File: flux_01012_.png (1.45 MB, 968x1240)
1.45 MB
1.45 MB PNG
>>
File: 2024-08-27_00212_.jpg (1.12 MB, 3840x2160)
1.12 MB
1.12 MB JPG
>>
>>102111909
>>102111919
Oh yes, lovely.
>>
1girl
>>
File: ComfyUI_00920_.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>102111963
yes
>>
1man bald
>>
>>102111963
please make 2guys
>>
File: 00277-1714176830.jpg (873 KB, 1440x1920)
873 KB
873 KB JPG
>>102111973
I think I have seen this particular Aika kino episode, fellow man of culture
>>
Too many asians.

Good.
>>
>>102111929
I love this style. Please post more!
>>
>>102111630
>hashed tokens
QRD? Never heard of this, and it doesn't make sense.
>>
>>102112100
I think he's hoping someone can come up with something like this for Flux: https://lite.framacalc.org/4ttgzvd0rx-a6jf
>>
File: 00023-3002410243_cleanup.png (2.96 MB, 1280x1920)
2.96 MB
2.96 MB PNG
>>102112064
>>
File: ComfyUI_33111_.png (766 KB, 1280x720)
766 KB
766 KB PNG
>>102112004
>>
File: ComfyUI_00933_.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>102112050
>>
>>102112100
It's schizo talk, nothing was hashed.
>>
File: flux_01016_.png (1.6 MB, 968x1240)
1.6 MB
1.6 MB PNG
>>102112067
Sure thing m8.
>>
File: grid-0079.jpg (839 KB, 2160x2880)
839 KB
839 KB JPG
>>
File: ComfyUI_00915_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>102112117
>>102112139
>>102112179
>>102112050
>>102111973
>>102111919
>>102111909
Good
>>
could you guys stop? my mom checks my computer.
>>
>>102112182
nails all over the place
>>
>>102112192
Make a lora of your mom and upload it to civitai
>>
File: ComfyUI_00922_.png (855 KB, 1024x1024)
855 KB
855 KB PNG
>>102112192
>>
File: flux_01017_.png (1.46 MB, 968x1240)
1.46 MB
1.46 MB PNG
>>
>>102112129
ya thats the stuff, thanks
>>
File: 00080-2253546269.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>102112192
Sorry, here you go.
>>
>>102112213
give her armpit hair
>>
File: ComfyUI_00934_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>102112257
no
>>
Biggest tits Flux can currently handle?
>>
>>102112270
those mangled feet
>>
File: ComfyUI_00901_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
>>102112316
make her bald real quick, see what she looks like.
>>
File: 00002-1714176827.jpg (319 KB, 1792x2400)
319 KB
319 KB JPG
>>102112316
great lora, she can look so damn trashy
>>
>>
File: ComfyUI_00938_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>102112333
lol ok
>>102112358
thanks bro, it took 4+ hours to cook this up.
>>
File: ComfyUI_00939_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>102112333
kek
>>
>>102112436
lmao thanks for that
>>
>>102112436
penor plushie
>>
>>102112216
Nice warrior

>>102112436
yes
>>
File: 00310-1714176830.jpg (966 KB, 1600x2400)
966 KB
966 KB JPG
>>102112436
much better than I thought
>>
>>102112436
btw what was the prompt for the hair, so we can fill this thread with bald women
>>
>>102112494
take one guess
>>
File: ComfyUI_00940_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>102112494
I just wrote
>"Aika is bald and has no hair on her head".
>>
>>102112514
some people don't have lots of vram, it takes 20 minutes to gen a 512x512 picture so they don't want to take any chances
check your privilege
>>
>>102112523
>>102112514
I see I wanted the muzzcut look you had so was wondering if bald would make her slick bald or not.
>>
I am benchmarking my model and comparing it with flux dev, among others.
How many steps do you usually use for it, to balance quality vs time per gen?
>>
File: flux_01026_.png (1.55 MB, 968x1240)
1.55 MB
1.55 MB PNG
>>
File: 00132-2226528666.png (1.92 MB, 1152x1728)
1.92 MB
1.92 MB PNG
>>
File: 0.jpg (188 KB, 2048x1024)
188 KB
188 KB JPG
>>
File: ComfyUI_00943_.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>102112546
just buy a 4090 bro.
>>
File: 2024-08-27_00232_.jpg (1.62 MB, 3840x2160)
1.62 MB
1.62 MB JPG
>>
File: 00344-1714176828.jpg (1.61 MB, 1600x2400)
1.61 MB
1.61 MB JPG
>>102112663
heaven on earth
>>
File: 2024-08-27_00239_.png (841 KB, 1280x720)
841 KB
841 KB PNG
>>102112546
>>
File: grid-0514.jpg (474 KB, 1792x2304)
474 KB
474 KB JPG
>>
Flux Guidance has a huge impact on training but seems to do nothing for image gen.
>>
File: 2024-08-27_00243_.png (1.02 MB, 720x1280)
1.02 MB
1.02 MB PNG
>>
File: fp081.jpg (487 KB, 1024x1024)
487 KB
487 KB JPG
>>102112855
flux really likes that 1 training image xd
>>
File: grid-0003.jpg (2.61 MB, 4992x7296)
2.61 MB
2.61 MB JPG
fun grid
>>
File: ComfyUI_Flux_11062.jpg (175 KB, 576x1024)
175 KB
175 KB JPG
>>
File: 00406-1714176830.jpg (663 KB, 1440x1920)
663 KB
663 KB JPG
>>
File: 00140-260591381.png (1.21 MB, 832x1216)
1.21 MB
1.21 MB PNG
>>102112663
lol nice
>>
File: bComfyUI_105599_.jpg (286 KB, 768x1024)
286 KB
286 KB JPG
>>
File: 00142-260591383.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>
File: ComfyUI_Flux_0277.jpg (511 KB, 1152x2048)
511 KB
511 KB JPG
>>
>>102112858
how would it be applicable for inference + backprop, if it wasn't applicable for inference?
>>
File: bComfyUI_105732_.jpg (228 KB, 768x1024)
228 KB
228 KB JPG
>>102113091
she looks p good with a shaved head
>>
File: FLUX_00061_.png (1.41 MB, 896x1152)
1.41 MB
1.41 MB PNG
>>
File: 00149-367119094.png (1.04 MB, 832x1216)
1.04 MB
1.04 MB PNG
>>102113137
Thanks, she appreciates that.
>>
How do you load 2 loras at same time?
>>
>>102113299
Cram them.
>>
File: file.png (17 KB, 416x259)
17 KB
17 KB PNG
>>102113299
>>
>>102112546
There are ggufs for almost everyone (unless you're using integrated intel gpu or amd)
>>
File: 00154-819286945.png (1.35 MB, 1216x832)
1.35 MB
1.35 MB PNG
>>
>>102112663
Nice sky
>>
>>102113349
That has only one strength option, I need one like that but with both

strength_model

and

strength_clip
>>
>>102111937
Neat
>>
>>102113379
it has both, read the docs
>>
>>102113299
I'm dumb lazy and daisy chain them
if I were smart lazy I'd take 5 seconds to find a node that stacks them
>>
>>102113349
You don't need to connect the clip output by the way. It does require an input but you can leave the output empty so you won't need to re-encore your prompt every time you change your loras
>>102113379
Clip strength doesn't work for flux loras
>>
File: file.png (30 KB, 545x392)
30 KB
30 KB PNG
>>102113379
>>
File: 2024-08-27_00214_.jpg (1.41 MB, 3840x2160)
1.41 MB
1.41 MB JPG
>>102113370
ty
> A marvelous night sky with stars and the milky way.
does the trick

>>102113385
ty
>>
File: file.png (2.09 MB, 1024x1024)
2.09 MB
2.09 MB PNG
>>102113392
>You don't need to connect the clip output by the way. It does require an input but you can leave the output empty so you won't need to re-encore your prompt every time you change your loras
It doesn't do anything to the t5 prompt?
>>
>>102113392
It seems to work, or might be rng

>>102113397
Thanks, why the fuck was that hidden

I wonder how many other nodes have hidden features like that
>>
>>102113438
rghree likes to hide a lot of stuff behind settings, the other day there was anon who couldn't figure out how folder nesting worked in this node
>>
File: file.png (2.2 MB, 1024x1024)
2.2 MB
2.2 MB PNG
>>102113438
I didn't know either. I checked because of what the other anon said. But according to him it doesn't do anything anyway.
>>
>>102113479
>>102113471
I'm using dual clip loader for both t5 and clip, maybe that's why it works?
>>
File: file.png (2.2 MB, 1024x1024)
2.2 MB
2.2 MB PNG
>>102113497
I have no idea
>>
>>102113497
who isn't?
>>
File: file.png (2.4 MB, 1024x1024)
2.4 MB
2.4 MB PNG
>>
>>102113068
My wife
>>
https://huggingface.co/ByteDance/Hyper-SD/tree/main

>just turn your dev into schnell bro
>>
File: file.png (2.58 MB, 1024x1024)
2.58 MB
2.58 MB PNG
>>
File: bComfyUI_105541_.jpg (323 KB, 768x1024)
323 KB
323 KB JPG
>>
File: 00157-819286948.png (1.35 MB, 1216x832)
1.35 MB
1.35 MB PNG
>>
File: 00158-819286949.png (1.38 MB, 1216x832)
1.38 MB
1.38 MB PNG
>>
>>102113397
This works way better than the daisy chain
>>102113387
>>
File: 2024-08-27_00263_.jpg (1.3 MB, 3840x2160)
1.3 MB
1.3 MB JPG
>>
>>102113578
what's a prople?
>>
File: file.png (2.13 MB, 1024x1024)
2.13 MB
2.13 MB PNG
>>
>>102113632
I don't knwo. I've only had a 3090 for a week and it already feels slow.
>>
https://litter.catbox.moe/16bfx7.png
>>
File: bComfyUI_107808_.jpg (243 KB, 768x1024)
243 KB
243 KB JPG
>>102113620
is it aliens?
>>
File: 00163-819286954.png (1.35 MB, 1216x832)
1.35 MB
1.35 MB PNG
>>
>>102113739
Cool
>>
File: file.png (2.54 MB, 1024x1024)
2.54 MB
2.54 MB PNG
>>
>>102113713
I did not need to see this
>>
File: file.png (2.45 MB, 1024x1024)
2.45 MB
2.45 MB PNG
>>
>>102113804
This could be some game

Woke Inc or something
>>
>>102113713
I needed to see this
>>
File: file.png (2.62 MB, 1024x1024)
2.62 MB
2.62 MB PNG
>>102113837
Just some random sentences from a youtube video
>>
File: 1711545405313183.jpg (93 KB, 1292x738)
93 KB
93 KB JPG
Evolution
>>
>>102113872
powerful
>>
File: file.png (2.36 MB, 1024x1024)
2.36 MB
2.36 MB PNG
is it still on topic if I modify it with photoshop?
i think these threads would benefit from having shit made with ai as a central part of the workflow, not just gens
there's a lot of potential with this tool
not me, i just do schizo shit
>>
>>102113921
>is it still on topic if I modify it with photoshop?
no
>>
flux can get really weird if you vibe with it
https://litter.catbox.moe/98ky46.png
>>
>>102113946
leave
>>
File: file.png (2.38 MB, 1024x1024)
2.38 MB
2.38 MB PNG
>>
What were the best realistic models for SD1.5?
>>
>>
File: bComfyUI_107585_.jpg (305 KB, 768x1024)
305 KB
305 KB JPG
>>
File: file.png (2.05 MB, 1024x1024)
2.05 MB
2.05 MB PNG
>>
File: 2024-08-28_00004_.jpg (1.49 MB, 3840x2160)
1.49 MB
1.49 MB JPG
>>102113730
definitely aliens
>>
>>102114025
So many AI generated images of bellybuttons tend to have piercings. How long before tattoos are the default?
>>
Hi anons, im a bit late to the hype
is there a way to prompt flux to gen full nsfw or is it owari?
Also, any loras or anything being made for it?
>>
File: ComfyUI_00956_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
File: file.png (2.47 MB, 1024x1024)
2.47 MB
2.47 MB PNG
>photography professional still frame move screenshot. it's demons. madness. psychodelically merge the two images. children in clown customes. ravage. psychedellic style. DMT. machine elves. terrence mkenna
>>
>>102114074
use Pony for nsfw
>>
how many steps for Dev?
>>
File: ComfyUI_00957_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>
i am trying to recreate the lady in the front with flux
>>
>>102114110
30 to 50 is best. below 30 isnt consistent enough and above 50 there is not much gains.
>>
File: file.png (2.74 MB, 1024x1024)
2.74 MB
2.74 MB PNG
It's obvious if you're carefully writing your prompts, you're doing it wrong. You need to let go and just smash your face against your keyboard repeatedly. Then write a fake prompt if you're going to share it with your employer/client.
>psychedelia style. 60s hippie movement. watercolor. kaleidoscope. it's demons. madness. psychodelically merge the two images. children in clown customes. ravage. psychedellic style. DMT. machine elves. terrence mkenna
But schizo is the way to go. definitely
>>102114110
quatre
>>
File: 2024-08-27_00269_.jpg (1.59 MB, 3840x2160)
1.59 MB
1.59 MB JPG
>>102114110
atleast 20, more for complex scenes with alot going on and complex texts .. pic related is 30
>>
>>102114125
that's nice dear
>>
>>102114074
I've seen plenty of nsfw from flux on civitai. I think they have different checkpoints they use for it. Go there, find one of those models.
>>102114103
He might not be looking for the best option, but instead experimenting with different things.
>>
File: warriorgirl1.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>102114125
not quite there yet should i use a lora or finetune for this?
>>
File: file.png (2.67 MB, 1024x1024)
2.67 MB
2.67 MB PNG
>>102114125
Post results.
Maybe feed a cropped image to joycaption. actually, let me do it
>This image is a digital artwork featuring a highly detailed and stylized illustration of a warrior woman in a fantasy setting. >The central figure is a blonde woman with voluminous, wavy hair tied back in a high ponytail. She has a strong, athletic build with large breasts, and she is dressed in a revealing black outfit that includes a small bikini top and a thong, emphasizing her curvaceous figure. She holds a sword in both hands, gripping it firmly, and her expression is intense and focused.
>To her left, a muscular, armored warrior with a helmet and a stern expression stands in a protective stance. His armor is metallic and features intricate designs. To the right, another warrior with a more barbaric appearance, wearing a helmet with horns and a wild expression, is partially visible.
>The background is a gradient of warm colors, transitioning from yellow at the top to a darker orange at the bottom, creating a sense of depth and drama. The overall style is reminiscent of classic fantasy art with a modern digital twist, highlighting the sharp contrasts and vivid colors. The texture of the armor and the woman's skin is smooth, while the background has a slightly blurred effect, adding to the dynamic and intense atmosphere of the scene.
Work with that.
>>
>>102114155
The lady in the front of the Arena cover, to my knowledge, only has one picture of her. Wouldn't a lora trained on only one image be uh, bad?
>>
>>102114131
schizo prompts for schizo output
got it
>>
File: file.png (2.78 MB, 1024x1024)
2.78 MB
2.78 MB PNG
>psychedelia style. 60s hippie movement. watercolor. kaleidoscope. it's demons. madness. psychodelically merge the two images. children in clown customes. ravage. psychedellic style. DMT. machine elves. terrence mkenna. EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER EAT EACH OTHER
>>
>>102114112
5090 by Knights Visual
>>
>>102114171
>>102114171
>>10211417
Did you write that yourself or run the image through something? Trying it now
>>
>>102114189
This is fucking glorious. I can go now.
>>
File: ComfyUI_00160_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>102114172
i mean like a fantasy lora with that general aesthetic, retard
>>
File: db03.jpg (164 KB, 1024x1024)
164 KB
164 KB JPG
>>102114131
ok
>>
File: ComfyUI_00161_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>102114171
this is the result of your prompt
>>
>>102114227
>should i use a lora to recreate a specific character
>noooo not training a character lora
Uh ok.
>>
>>102114074
there are nsfw loras for flux, but i think pony is more fun for nswf stuff.
it will take a while until flux finetunes are available.
>>
File: bComfyUI_112107_.jpg (610 KB, 1920x1088)
610 KB
610 KB JPG
>>
>>102113392
>Clip strength doesn't work for flux loras

It does work, depending on the lora.
>>
>>
File: ComfyUI_00163_.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: ComfyUI_00165_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>
>>102114252
I said work with it
>>
>>102114237
Gruesome
>>
File: 1701583451861171.png (195 KB, 515x471)
195 KB
195 KB PNG
have there been any new flux optimizations for 16gb-lets over the past few days?
>>
>>102114518
loras are fucked with quants, imo it's best to wait for further optimizations before even bothering
>>
>>
>>102114237
got more anon? this really does it for me
>>
File: 2024-08-28_00024_.jpg (1.31 MB, 3840x2160)
1.31 MB
1.31 MB JPG
>>
File: ee7.jpg (155 KB, 1024x1024)
155 KB
155 KB JPG
>>102114614
yes
>>
>>102114641
good shit man
>>
what's the best way to make nudes with flux? do the loras actually work or do they just give the same pair of tits to every character?
>>
>>102114718
why not just try and see what kind of titties you get
>>
File: 00051-1726827292.png (598 KB, 888x616)
598 KB
598 KB PNG
>>
>>102114718
some loras work, some don't .. the quality of the current loras are mixed bag
>>
File: bComfyUI_109520_.jpg (231 KB, 768x1024)
231 KB
231 KB JPG
>>
File: 2024-08-28_00034_.jpg (1.51 MB, 3840x2160)
1.51 MB
1.51 MB JPG
>>
>>102114718
https://litter.catbox.moe/jk080p.jpg
Kinda, the size of the breasts do kinda match the build of the woman. That one is using SCG Anatomy for the nudity and a saggy breasts lora for a more natural look.
>>
>>102115019
Pretty good
>>
>>102115019
what about torpedo tits and maybe granny tits depending on the mood
>>
File: 3e87c63f.webm (1.62 MB, 1360x752)
1.62 MB
1.62 MB WEBM
>>
>>102115059
Sorry Rajeesh but this is a LOCAL diffusion general.
>>
>>102115044
torpedo no but the lora does get them pretty low at full strength although that affects likeness
haven't tried prompting for an old woman and... I don't wanna
>>
File: _0044.png (693 KB, 1024x1024)
693 KB
693 KB PNG
>>102111253
alright, so, I can't use the caption_dropout arg as it turns out, it says it can't be used with cache_text_encoder_outputs which will cause an error if not used with kohya
I did however go through my dataset and make sure the captions had no extra paragraphs spaces this run, and for good measure I put the booru tags 50% before the boomer prompts in the captions, 50% after the boomer prompts in the captions. will see if this makes any difference

topped up my runpod account so I can try the 50 images batch 2 thing next if this still doesn't work

I'm pretty ready to just settle on an epoch and release this lora for better or worse though, want to move on to more datasets and feel relatively comfy with my current settings
>>
>>102115140
>will see if this makes any difference
to note, I am still using the wildcard arg, so I believe in theory this should not matter, but when I'm paying nearly $9 to run the full training on this dataset I'm not taking too many risks kek
>>
File: ComfyUI_temp_likcy_00021_.png (3.52 MB, 1360x1600)
3.52 MB
3.52 MB PNG
any decent feet loras for flux?
>>
>>102115188
holy fuck that's disgusting I wish I hadn't opened it, my skin is crawling
>>
File: ComfyUI_02490_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
I've just trained a LoRA using the latest clip training arguments and It's interesting but the results are not quite what I expected.

Forgive the simplicity of these prompts, I just want to see if the tagging worked.

Without putting the megumin token in clip:
>Megumin standing in a living room setting

The fuck?

cont'
>>
>>102115188
can you make them dirtier? maybe with some abscesses and calluses on them. backward toes are a nice touch tho keep those in.
>>
>>102115232
I'm a retard, does training with clip take up much more vram than without?
>>
>>102115140
Let us know how it goes, gl dude
>>
File: ComfyUI_02491_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>102115232
With token in clip L window
>clip l: Megumin
>Megumin standing in a living room setting

It looks like training clip L successfully associated the Megumin token with something that looks like Megumin.

I have more settings to tweak, but it looks like clip L training is working with some... unexpected results.
>>
>>102115249
Probably, but unless you are actually starving for Vram I doubt it would make much difference. I'm not even sure if it does the thing I think it does either so I'll test some more.
>>
File: ComfyUI_02494_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>102115256
Here's another but the megumin token is replaced with Aqua.

I think the settings I used outside of the clip training were kind of bunk so I need to tweak those, but the initial response for getting unique tags to associate with characters looks somewhat promising.
>>
File: 2024-08-28_00042_.jpg (1.47 MB, 3840x2160)
1.47 MB
1.47 MB JPG
>>
File: ComfyUI_02498_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>102115278
For reference, here is Aqua without the "Aqua" token defined in the clip L prompt window.

Very weird results. desu
>>
File: bComfyUI_107743_.jpg (257 KB, 768x1024)
257 KB
257 KB JPG
>>
>>102115266
I mean, if it makes it so you get >>102115256
instead of >>102115232
that's a pretty ideal result of clip training, no?
>>102115255
thanks fren, will report back
>>
>>102115300
I love the details on the shirt-dress-whatever shes wearing
>>
>>102115310
Yeah I think it it's a pretty good indicator that something happened during training. I just don't know what the real implications are until I do a few more test LoRAs
>>
File: flux_01080_.png (1.65 MB, 968x1240)
1.65 MB
1.65 MB PNG
>>
File: ComfyUI_00972_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
>>102115295
this is tiger from monster rancher and none of you are old enough to know
>>
File: Furkan.png (20 KB, 951x265)
20 KB
20 KB PNG
Latest breaking Dr. Furkan news

Dr Furkan says something in an r/youtube thread on Asmongold ad gets deleted.
What did the doctor say that would cause such a response?
>>
So # images * # repeats = # steps per epoch
# steps per epoch * # epochs = # total steps

What happens if I have multiple GPUs for training?
>>
>>102115410
how much vram does that fucking thing have
>>
>>102115440
blocking channels on youtube works. asmongold just has multiple channels.
so he needs to block multiple times.
what a dumbass.
>>
File: 2024-08-28_00050_.jpg (1.33 MB, 3840x2160)
1.33 MB
1.33 MB JPG
>>102115451
not enough!
>>
>>102115451
24GB
>>
File: bComfyUI_107397_.jpg (259 KB, 768x1024)
259 KB
259 KB JPG
>>102115333
>shirt-dress-whatever shes wearing
only flux knows
>>
>>102115449
the batch size is multiplied by each gpu
batch size 2 on 2 gpus = batch size 4
>>
>>102115553
So I'm effectively doubling the batch size by the number of GPUs I have ( # of battces per gpu). Does the number of epochs get doubled as well?
>>
>>102115422
desu, I don't know what it is or why it appeared during when not specifically using character tags.
>>
>>102115465
People also repost his stuff don't they? Like take clips off his streams? LLMs should be able to block the concept of a celebrity though, I wonder if that will be a youtube feature "don't ever show me this concept".
>>
>>102115669
>>102115636
>>102115465
Man it would pretty ironic if Furkan was complaining about seeing someone everywhere.
>>
File: bComfyUI_104974_.jpg (313 KB, 768x1024)
313 KB
313 KB JPG
>>
>>102115440
lol that is truly the definition of first world problems.
>>
File: 00030-960619547.png (1.09 MB, 1152x896)
1.09 MB
1.09 MB PNG
Here is my dumbest experiment. Flux with same prompt, same seed, same lora etc etc except i add to the beginning and end of the prompt "hello flux i would like" and at the end "please and thank you"
First pic no politeness
>>
File: 00031-960619547.png (1.15 MB, 1152x896)
1.15 MB
1.15 MB PNG
>>102115750
and now here it is with grandma being nice to google tier prompt adjustment
>>
File: flux_01093_.png (1.47 MB, 968x1240)
1.47 MB
1.47 MB PNG
>>
File: ComfyUI_02389_.png (1.3 MB, 960x1344)
1.3 MB
1.3 MB PNG
>>102115733
Well, he is from Turkey which despite being definitively a fist world country, gives off a very third world vibe. But Furkan is actually on all levels but physical Indian.
>>
>>102115796
Wait..what I didn't realise it was a comment from him lol
>>
File: 2024-08-28_00061_.jpg (1.25 MB, 3840x2160)
1.25 MB
1.25 MB JPG
>>
>>102115813
Yeah, it's so weird seeing a wall of SD related posts and one very conspicuous post being removed in a discussion about asmongold being ubiquitous. What could he have said?
>>
>>102113392
>You don't need to connect the clip output by the way. It does require an input but you can leave the output empty so you won't need to re-encore your prompt every time you change your loras
oh yeah, good point. flux loras aren't trained on the clip right so it's unnecessary
>>
File: file.png (19 KB, 2004x102)
19 KB
19 KB PNG
>>102115440
>>102115850

Does new reddit not allow you to see moderator deleted posts on people's profile?
>>
File: Untitled.png (12 KB, 753x98)
12 KB
12 KB PNG
>>102115855
>flux loras aren't trained on the clip
Not true as of 12 hours ago.
>>
>>102115892
oh shit, what trainer is this?
>>
>>102115887
Oh sweet. He really isn't self aware.
>>
File: bComfyUI_105161_.jpg (276 KB, 768x1024)
276 KB
276 KB JPG
>>
>>102115892
Latest Kohya, pull on the SD3 branch. Seems to be working and makes gives functionality to the clip prompt window.


I don't really like how I have to write a short novel of command line arguments to run Kohya though.
>>
>>102115923
Meant for
>>102115903
>>
>>102115923
I refuse to believe you type out every argument and its value
>>
>>102115946
No, I have text file for that.
>>
File: ComfyUI_00975_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
uploaded my Aika LoRa
https://civitai.com/models/694163?modelVersionId=776857
>>
>>102115188
how to make it stop doing backwards toes footbros?
>>
>>102115995
Well, I doubt BFL went out of their way to collect images of the soles of people's feet. you need to train that back in.
>>
>>102115995
looks like she has micropenis toes
>>
File: bComfyUI_112181_.jpg (245 KB, 544x960)
245 KB
245 KB JPG
>>
File: file.png (1.22 MB, 1038x1358)
1.22 MB
1.22 MB PNG
damn look at all the styles flux pro has access to. hardly has an effect on dev.
>>
>>102115892
What does this mean, does this mean we can train multiple concepts now? Because people reported a lot of bleeding.
>>
>>102116093
Young Santa Claus
>>
>>102116101
Maybe. Maybe not. Still need to some testing. The stuff I has seen kind of makes the character tokens act like an on/off switch.
It's only been widely available for a few hours now so nobody really knows for sure what it means. Even Kohya was kind of puzzled by the outputs and he's the guy who implemented it.
>>
>>102116131
ah ok cool, yeah we are still in the early days
>>
File: bComfyUI_112930_.jpg (681 KB, 2048x1088)
681 KB
681 KB JPG
>>102116097
what do you mean by hardly? i've going through quite a few of them in the past few days and i see quite an influence using their names.
>>
File: 00038-1638469766.png (763 KB, 1152x896)
763 KB
763 KB PNG
Looks kinda like a doctor who episode.
>>
File: 00042-2473296133.png (791 KB, 1152x896)
791 KB
791 KB PNG
>>102116263
WHY CANT YOU MAKE A SPHINX
>>
>>102116263
It has a 90s vibe. I'd say it matches a bunch of shows from those years.
>>
File: 2024-08-28_00074_.jpg (1.36 MB, 3840x2160)
1.36 MB
1.36 MB JPG
>>
>>102116355
Yeah it's one of those 80s dark fantasy loras. It does feel a little later 80s early 90s though. Doesn't exactly match the Roger Corman movie vibe these things are supposed to recreate.
>>
>>102116388
That reflection on the orb is unrelated to the scene, but i suppose it would be really impressive if it actually nailed it.
>>
>Verbose boomer prompting is the best
>NO! No captions is the best
>NO! Single word concept captions are the best
>NO! Booru tags are the best

Nobody fucking knows what's going on and anyone that claims to know better is a fucking liar.
>>
File: bComfyUI_113624_.jpg (215 KB, 768x1024)
215 KB
215 KB JPG
>>
>>102116403
I usually do a hybrid with flux. Like small elements in the scene are simple one, word, schizo, elements, but ill use some boomer stuff for the characters.
When I was using pony i used booru tags but that's because i knew that's how the dataset was captioned.
>>
>>102116429
Why would someone post fucked hands in 2024.
>>
File: ComfyUI_00564_.png (1.79 MB, 1024x1024)
1.79 MB
1.79 MB PNG
why are the threads so slow now?
>>
File: 00288-944495170.png (1.08 MB, 920x1296)
1.08 MB
1.08 MB PNG
Hey /ldg/
I've been away for a bit, is there a good anime tune for Flux yet?

I really just want to make waifus, but haven't had much inspiration since the 1.5 days
>>
>>102116398
ya I did not even prompt the orb.. I prompted for an UFO, maybe if you explicitly prompt for a reflection of the scene it might atleast partly work, but its asking alot
>>
>>102116432
But there's also the argument that flux already knows that XY and Z is, so when training a LoRA of let's say a Luigi for example, the model can already see he has a green hat, blue overalls and is a pussy bitch, but what it doesn't know is that his name is Luigi.
So there's an argument that simply captioning "Luigi" and nothing else might be the way to go.
>>
File: bComfyUI_112278_.jpg (704 KB, 1088x1920)
704 KB
704 KB JPG
>>102116445
don't knock my fetish
>>
File: 2024-08-28_00076_.png (1.01 MB, 1280x720)
1.01 MB
1.01 MB PNG
>>102116097
Adrian Tomine instantly worked on dev
>>
>>102116456
Nothing much going on in the realm of fine tunes yet, the tooling for it is still being worked out. With that in mind, use any "Finetunes" on civit at your peril, 99.99% chance they're just mystery meat LoRA mergers and schnell hybrids that rape the weights and deliver nothing.
>>
File: bComfyUI_113440_.jpg (510 KB, 1536x768)
510 KB
510 KB JPG
>>102116475
yeah, i'm not sure if he's tested any himself but plenty of artists work with flux that i've tried so far.
>>
File: 0.jpg (450 KB, 1024x1024)
450 KB
450 KB JPG
>>
File: 0.jpg (207 KB, 2048x1024)
207 KB
207 KB JPG
>>
File: 2024-08-28_00082_.jpg (1.39 MB, 3840x2160)
1.39 MB
1.39 MB JPG
>>102116548
ya.. no idea how it works in flux.pro .. but how you prompt for a style is key in dev

also pic related Titian painting my vintage car pinup 1girl
>>
File: ComfyUI_33132_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
It was supposed to be 1994, but this works too, somehow.
>>
File: bComfyUI_105496_.jpg (187 KB, 768x1024)
187 KB
187 KB JPG
>>102116632
try Ilya Kuvshinov
>>
I know I'm a month behind everyone else, but I think I've cracked lora training now
>>
>>102116632
Why do all these cars have weird side vents for the engine is that a common car feature I normally don't notice?
>>
File: Untitled.png (11 KB, 396x119)
11 KB
11 KB PNG
>>102116760
Ominous
>>
>>102116793
kek
>>
>>102116763
thats a 1960 Mercedes-Benz 300 SL
>>102116717
will do, tho I think its not the best prompt for Kvshinov, he mostly does character art .. can't associate any landscape art with him
>>
>>102116705
But not to europe right
NOT TO EUROPE RIGHT
>>
File: 2024-08-28_00090_.jpg (1000 KB, 3840x2160)
1000 KB
1000 KB JPG
>>102116717
by Kuvshinov .. but as I thought, if its not a character closeup its mostly just digital anime illustration style for flux
>>
File: bComfyUI_113680_.jpg (194 KB, 768x1024)
194 KB
194 KB JPG
>>102116802
>can't associate any landscape art with him
might make things more red
>>
File: ComfyUI_temp_bprfq_00015_.png (1.85 MB, 1512x1024)
1.85 MB
1.85 MB PNG
>>
File: aigrifter2.png (793 KB, 1024x1024)
793 KB
793 KB PNG
>>102116793
heh
>>
File: ComfyUI_temp_bprfq_00024_.png (2.5 MB, 1512x1024)
2.5 MB
2.5 MB PNG
>>
File: konosuba flux dev.jpg (1.44 MB, 3072x1024)
1.44 MB
1.44 MB JPG
>Multichar LoRa finished cooking after work.
>Works reasonably right away

Multichar LoRa is possible at dimm/lora 32. Not sure why you wouldn't just make 3x LoRa to prevent bleeding issues. I experienced no such problems that can be identified as bleeding during training though. Finished training this LoRa before the Clip-L updates today on Kohya SS. Maybe I'll recook to see if there's any improvements to be gained to Clip-L training. In any case, I used this config as a base:

https://github.com/bmaltais/kohya_ss/issues/2701#issuecomment-2294611159

Changes
768x768
Learning Rate 0.0002
Network DIMM/Alpha 32 (Testing)
Min S/R Gamma 0.5


WD booru style captions only
Threshold setting 0.54
https://huggingface.co/SmilingWolf/wd-eva02-large-tagger-v3

150 Aqua images x2 repeat, 30 Darkness x5 repeat, 25 Megumin x6 repeats
>>
File: 2024-08-28_00093_.jpg (863 KB, 2160x3840)
863 KB
863 KB JPG
>>102116834
as I thought, when you do a character closeup Kuvshinov is stronger, pic related
>>
File: ComfyUI_temp_bprfq_00036_.png (1.66 MB, 1920x1080)
1.66 MB
1.66 MB PNG
>>
Fresh bread

>>102116944
>>102116944
>>102116944
>>
>>102116915

Interesting, what prompts did you use when prompting for the characters when generating, did you describe in full or simply use the name + whatever they were doing in the image?
>>
File: bComfyUI_113696_.jpg (237 KB, 768x1024)
237 KB
237 KB JPG
>>102116918
nice, I see what you mean. heres that list anon took an ss of https://tiny...url.com/47hfetse, should have a lot more to fuck around with now.
>>
File: ComfyUI_05409_.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>102116971

Prompt:

Anime girl standing. She is holding a sword.

darkness \(konosuba\), 1girl, upper body, solo, long hair, blue eyes, blonde hair, hair ornament, ponytail, weapon, outdoors, sky, day, sword, armor, blue sky, tree, x hair ornament, shoulder armor, planted sword, hands on hilt


You just tell what you want the character to do + relevant tags.
>>
>>102117035
Can you humor me for the sake of an experiment and cut down prompt the just the character name token and Anime girl standing. She is holding a sword.

I want to see how much weight just the name has.
>>
File: ComfyUI_05411_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102116971

Anime girl standing. She is holding a blue paper fan. She has a smug expression on her face.

aqua \(konosuba\), 1girl, solo, long hair, breasts, looking at viewer, smile, open mouth, blue eyes, skirt, hair ornament, bow, very long hair, medium breasts, blue hair, detached sleeves, blue skirt, tavern background, hand fan, green bow, hair rings, folding fan, holding fan, single hair ring,
>>
File: 2024-08-26_00059_.png (1.06 MB, 1280x720)
1.06 MB
1.06 MB PNG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.