/g/ - Technology


Thread archived.
You cannot reply anymore.




File: tmp.jpg (991 KB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102135630

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: 2024-08-29_00240_.png (1013 KB, 832x1216)
>>102139227
ty baker
>>
File: ComfyUI_Flux_11.png (1.35 MB, 1216x832)
>>102139222
I'm digging uni_pc with sgm_uniform, seems to do just fine at 20 steps
>>
Blessed thread of frenship
>>
File: ComfyUI_33159_.png (1.24 MB, 1280x720)
>>
File: 2024-08-29_00245_.png (1.06 MB, 832x1216)
>>
>>102139067
>why is there no change to the dev images when I connect 3.0 FluxGuidance to the positive prompt, going into KSampler?

halp
Do I need the specific guidance/sampler nodes? I'm probably just overlooking something simple.
>>
File: 2024-08-29_00251_.png (981 KB, 832x1216)
>>
File: fs_0074.jpg (85 KB, 1280x880)
>>
File: file.png (1.59 MB, 1440x1024)
>>
>>102139392
The US army headwear really got a downgrade.
>>
File: fs_0096.jpg (83 KB, 1280x880)
>>
File: 2024-08-29_00252_.png (1014 KB, 832x1216)
>>
>>102139501
gib nao
>>
>>102139258
euler + beta seems better
>>
File: 1709763024668904.png (1.36 MB, 1024x1024)
flux used to be so bad 3 days ago
>>
Flux killed my dog 3 days ago.
>>
if only it had the insane detail of Craiyon 3
>>
File: 00163-2670538863.png (2.63 MB, 1440x1440)
>>
File: ComfyUI_00489_.png (1.25 MB, 1024x1024)
flux was not the mature model people were talking about 3 days ago but it has much improved since and now it is
>>
>>102139314
_O_
>>
File: ComfyUI_33199_.png (939 KB, 1280x720)
>>
File deleted.
this was what flux was capable of 4 days ago
>>
File: ifx305.png (1.44 MB, 1024x1024)
>>
File: 1714999927088817.png (1.34 MB, 1024x1024)
this is 2 days ago
>>
File: fs_0120.jpg (136 KB, 1280x880)
>>
File: autism.jpg (61 KB, 1204x205)
how the time flies
>>
File: ifx306.png (1.19 MB, 1024x1024)
>>
>>102139731
>>102139793
mm looks salty
>>
>>102139793
Hammerhead shark jelly?
>>
File: fs_0132.jpg (106 KB, 1280x880)
>>
>>102139817
I was trying to make Chondrichthyes on toast
>>
>>102139793
how? teach me
>>
>>102139859
it's a non-local model.
>>
>>102139879
heretic
>>
>>102139879
>Local Diffusion General
>>
File: ComfyUI_temp_yhjtu_00001_.png (1.22 MB, 1024x1024)
>>
File: ComfyUI_33200_.png (1.09 MB, 1280x720)
>>
>>102139879
yea anon stop that!
>>
>>102139879
please be more vague. anyways
>>
Anyone know how to unfuck the lighting in images?
>>
File: 2024-08-29_00272_.png (1.13 MB, 1280x720)
>>
>>102139980
Describe lighting
>>
whats the best way to stack loras?
>>
>>102139959
Imagen 3
>>
What's this on GGUF node?

>UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor).
>>
>>102140048
rgthree power lora loader
>>
File: YtsnHybrid-3.png (513 KB, 512x512)
>>102140038
The kind of exaggerated sun-blasted bullshit you used to get all the time.
>>
>>102140057
thank you. those gens you posted look fucking amazing but no. sorry. no walled garden shit for me. still. damn.
>>
File: ComfyUI_33205_.png (1.17 MB, 1280x720)
>>
>>102140167
he's pretending to be me and is a known prompt stealer
>>
File: 2024-08-29_00277_.jpg (1008 KB, 3840x2160)
>>
>>102139067
>>102139322
>>102140085
why isn't /ldg/ as helpful as /sdg/?
this is the second time this has happened
>>
@102140220
nig*bo drama
>>
general comment to no one in particular, but if you got questions nobody seems to know the answer to, great chance to do some fucking science and experiment
>>
Blessed thread of frenship
>>
>>102140214
you're schizo, I was just answering the anon who got a vague response, no "NTA" was needed
>>
File: GLcWV9eXIAAFMlR.jpg (44 KB, 512x522)
>>102140235
Newfags are unwelcome
>>
>>102140244
yeah, if only I had tried multiple guidance settings, resolutions, seeds
>>
>>102140264
we can't come here for technical questions I see, where's your gen? or hiding?
>>
>>102140270
somewhere, somehow, someone made that, ask that person
>>
>>102140344
so your suggestion is to not ask the simple question in a thread for stable diffusion, but instead make an issue on their GitHub or join their discord, or similar?
seems extremely overkill
>>
File: GNj1ogfXUAAEjNA.jpg (154 KB, 1200x1126)
>>102140318
>asking if someone is hiding
>on anonymous image boards
>>
>>102140270
what's the problem? guidance scale not doing much / anything? need to see your workflow. probably a wiring issue.
>>
>>102140383
My suggestion is stop asking a stupid question multiple times expecting someone to know the answer and then getting salty about it.
>>
>>
>>102140391
not AI
>>
>>102140406
well, I tried the example workflow from comfy examples from dev. zero difference between using the FluxGuidance node and without.

>>102140413
that's not what was written above though. and there's nothing stupid about the question
>>
>>102140415
I'm no engineer but I don't think this works.
>>
>>102140447
If the example workflow doesn't work then it doesn't work. Your question is stupid if you used the dev's workflow (without any changes) and it didn't work.
>>
>>102140463
>>102140270
>>
>>102140506
If the example workflow doesn't work then it doesn't work. Your question is stupid if you used the dev's workflow (without any changes) and it didn't work. You didn't say you used the dev's example workflow and based on your retarded response, I'm going to guess you just fucked around thinking you know better when the first thing you should do when something doesn't work in Comfy is to USE THE OFFICIAL EXAMPLE WORKFLOW. If the example workflow doesn't work then it's fucking obvious the extension is broken and wowee you saved yourself time. Do you need confirmation the sky is blue too?
>>
File: fs_0180.jpg (134 KB, 1280x880)
>>
>>102140447
the flux guidance value absolutely does something. just plug it in and try 1.0 vs 4.0. again, show workflow. can't mindread.
>>
>>102140565
rollercoaster tycoon
>>
>>102140542
jesus christ, anon, take a few deep breaths.
the comment clearly means that I experimented
>>
File: fs_0184.jpg (121 KB, 1280x880)
>>
>>102140633
Holy shit you fucking retard. My comment is THE EXACT OPPOSITE of "experimented". Did you use the official fucking workflow or not? Yes or no. I don't care what fucking experiments you did, did you fucking use the official workflow. If yes, and there is no changes, THERE IS A BUG IN THE EXTENSION YOU FUCKING RETARD KILL YOURSELF SPOON FEEDING ZOOMER
>>
>>102140542
>You didn't say you used the dev's example workflow
from earlier:
>it's exactly like what is used in the comfy examples

>>102140579
there's no need to mindread. it's the simplest possible implementation.
thanks, anon
>>
>>102140692
>comfy examples
Let me help anon, when you are using some dumbfuck's custom node, use their example. If they don't have an example, it's probably malware.
>>
>>102140719
NTA but Flux Guidance is a Comfy node.
>>
>>102140765
Then use the example on Comfy that uses Flux Guidance. If their example doesn't work, then there is a bug. It is very easy to test: you use the same prompt and same seed and do two different images changing the guidance value. If there is a change, it works, if there is no change, then there's a bug and you don't need someone here to tell you.
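
The same-prompt, same-seed test described above can be scored instead of eyeballed. This is only a rough sketch: `changed_fraction` and `guidance_has_effect` are hypothetical helper names, and the thresholds are arbitrary assumptions. It compares raw pixel bytes rather than file hashes, since saved PNGs may embed workflow metadata that differs even when the pixels do not. Loading the bytes is left out; assuming Pillow is installed, `Image.open(path).convert("RGB").tobytes()` would be one way to get them.

```python
def changed_fraction(pixels_a: bytes, pixels_b: bytes, tolerance: int = 0) -> float:
    """Fraction of byte positions whose values differ by more than `tolerance`."""
    if len(pixels_a) != len(pixels_b):
        raise ValueError("images must have the same size and mode")
    if not pixels_a:
        return 0.0
    # Iterating over bytes yields ints, so the values can be subtracted directly.
    differing = sum(1 for a, b in zip(pixels_a, pixels_b) if abs(a - b) > tolerance)
    return differing / len(pixels_a)


def guidance_has_effect(pixels_a: bytes, pixels_b: bytes,
                        tolerance: int = 0, min_fraction: float = 0.001) -> bool:
    """True if enough of the image changed between the two guidance values.

    `min_fraction` is an arbitrary cutoff chosen for illustration.
    """
    return changed_fraction(pixels_a, pixels_b, tolerance) >= min_fraction
```

If the two gens at, say, guidance 1.0 and 4.0 come back with a changed fraction of zero, the guidance value is not reaching the sampler.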
>>
>>102140765
Is it different from the classifier-free one?
>>
>>102139256
I like the Zelda idea in the OP, nice
>>
>>102139256
also which lora is this? Is it this one?

https://civitai.com/models/682944/princess-zelda-flux-or-dogmaai
>>
File: ComfyUI_33168_.png (1.14 MB, 1280x720)
>>
>>102140787
>use the example on Comfy that uses Flux Guidance.
and that's what I did. not sure how many times I can communicate that.
this has gone on for long enough now, lmao
>>
>>102139760
Same people are now saying video will never happen any time soon
>>
>>102140828
Okay you did, and it's clearly broken. CONGRATS ANON! You are right, the sky is blue! I hope you have your confirmation and you shut up now.
>>
>>102140788
Yes. Custom Flux thing. There's not much info about it.
>>
>>102140842
>you are right
I haven't argued that anything is broken, so not sure where you're getting that from. anyway, there's no need to be salty
>>
File: fs_0214.jpg (94 KB, 1024x1024)
don't squeeze dat fish
>>
>>102140892
"why does this thing not work I used their example and it didn't work, can someone help me, why won't anyone help me? I have crippling anxiety where I need confirmation about every single thing in my life no matter how obvious and simple."
>>
>>102140910
there's no need for strawmen. I hope you can put this behind you
>>
>>102140928
You're right, I was way out of order. Life has been hard lately, my girlfriend left me for another woman with a bigger dick than me and I'm on the edge.
Sorry.
>>
>>102140928
I'm already going back to ignoring you. Just pointing out how you're wasting your time.
>>
>>102140945
laissez-faire capitalist
>>
>>102140961
no that's me, don't confuse me with that retarded anon
>>
>>102140945
>I just wanted to let you know I am ignoring you
kek
>>
>>102140986
I've discovered it's an effective strategy because now you'll know people are actively ignoring you.
>>
Any cool updates by turkish guy today?
>>
How many steps for pony lora with around 350 images?
>>
>>102141021
yes check xitter
>>
>>102141030
thanks will do
>>
File: ComfyUI_01051_.png (1.39 MB, 1024x1024)
I did it again, I cooked up a new LoRa
>>
>>102141028
I think ponies have 4 steps since they have 4 legs, I assume that's how you count the steps of horses?
>>
>>102141112
Ah so it's batch size 2 + gradient acc 2
>>
>>102141028
There is no set standard, it depends on the difficulty of the concept, the quality of your dataset and the model's prior knowledge of your concept.
>>
File: ComfyUI_temp_ehhye_00036_.png (3.47 MB, 1600x1360)
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
>>102141240
>Buttchin
>>
>>102141240
that's great
>>
File: 2024-08-29_00172_.jpg (1.34 MB, 2496x3648)
>>102140797
ty
>>
>>102141240
catbox?
>>
File: E1VGc2NVcAA97KV.jpg (62 KB, 960x718)
>>102141265
I wonder who in BFL did it.
>>
File: ComfyUI_temp_ehhye_00037_.png (2.86 MB, 1600x1360)
>>102141265
ai people have chins anon
>>
File: 2024-08-29_00181_.png (1.26 MB, 832x1216)
>>102140812
>https://civitai.com/models/682944/princess-zelda-flux-or-dogmaai
no, this one..
>https://civitai.com/models/697112/zelda
the former one gives a too clear image, I prefer the other one
>>
File: ComfyUI_01007_.png (957 KB, 1024x1024)
>>102141341
Never post Buttchins again buddy
>>
>>102141294
give her armpit hair
>>
File: ComfyUI_temp_ehhye_00038_.png (2.73 MB, 1600x1360)
selling my soul to satan for more vram
>>
File: 2024-08-29_00312_.jpg (896 KB, 3840x2160)
>>102141362
do it yourself, I am done with Zelda gens for now
>>
File: ComfyUI_01002_.png (1.29 MB, 1024x1024)
>>102141375
>>
>>102141341
her buttchin went to her tits
>>
>>102141384
is that debo?
>>
File: 1724317829532736.png (1.27 MB, 896x1152)
>>
>>102141403
no, thats some generic anime dude the Persona 5 lora made
>>
What's the most steps you've done on a single image that turned out good?
>>
File: 1703203075988527.jpg (48 KB, 365x444)
Does this general have one guy in particular who is irrational and just... angry?
>>
>>102139227
Sex with wanda
>>
File: fp018.jpg (323 KB, 1024x1024)
https://www.youtube.com/watch?v=f1agQE9Hr08
>>
>>102141433
yeah, some freak. i remember we had a guy like that during the pre-flux pixart days as well, not sure if it's the same anon.
>>
File: ComfyUI_temp_ehhye_00041_.png (3.24 MB, 1600x1360)
>>
File: 1716538147929902.png (857 KB, 1024x1024)
>>102139661
Its alright
v4 btw
>>
notice how the people will say anything except solve the problem the anon had
I thought sdg was the virtue signaling hugbox
>>
>>102141433
I think so. he raged because some anon dared to critique the code for a lora trainer
>>
>>102141460
is this what girls do during their period? she should ask for some anti-aging cream
>>
>>102141488
they said they needed to train the clip, they added clip training, he still has the same problem :^)
>>
>>102141481
read the OP
>>
>>102141504
that's the laissez faire capitalist, ignore him
>>
>>102141481
It's hilarious. The reply was literally just "you need to up the strength to see effects" or similar. Everyone knows this. Don't need the workflow for that.
>>
https://civitai.com/models/226478
Finally, flux is saved
>>
>>102141530
no, that's me, I'm just reading the thread, whoever has been arguing is someone else
>>
>>102141558
this guy is pretending to be me >>102141530
take trolling to >>>/b/ please, thanks
>>
File: ComfyUI_temp_ehhye_00044_.png (3.08 MB, 1600x1360)
>>
they are multiplying
>>
>>102141573
ask a janny
>>
>>102141504
nta but does it answer the guidance question? It has seemed like it doesn't do shit for me either. CFG works better
>>
>>102141184
Well I just save every second epoch and see how it works. Prodigy takes the wheel
>>
>>102141602
define guidance
>>
File: ComfyUI_temp_ehhye_00045_.png (2.78 MB, 1600x1360)
>>
>>102141625
why no catbox?
>>
File: 2024-08-29_00318_.jpg (817 KB, 3840x2160)
>>
>>102141613
No, I don't think I will.
>>
>>102141613
Why don't you take these stupid fucking questions to chatgpt
>>
Has anyone got controlnets to work with flux yet? And could they share their workflow?
>>
So are we all just waiting for a real finetune or what
>>
>>102141654
can you describe very specifically how it differs from cfg?
>>
>>102141673
Just watching average loss on terminal like usual
>>
>>102141673
>So are we all just waiting for a real finetune
yeah
>>
File: 00209-2907788018.png (2.84 MB, 1440x1440)
>>
File: aigrifter.png (1.15 MB, 1024x1024)
>>102141633
last time I posted a catbox it ended on reddit, civitai, etc. there are too many grifters lurking on these threads
>>
File: 1724880795394109.jpg (249 KB, 1024x1024)
/stg/ guy told me to come here to get serious help
I'm trying to create a profile pic for my socials. My nickname, I like furyo/yankii and I'm a dev.
I got pic related with the help of someone from the dall-e thread. I want to lightly edit it but I always end up with something totally different.
I'd like to modify the following things:
> Character
I want him to have the same haircut but curly. Some light stubble. I want him to wear an all black regular gakuran/uniform
> Scene
The scene is good but I'd like to convey the fact that this is a developer. Idk what to add, maybe some coffee or glasses with lines of code in the reflection.
> Artstyle
It's perfect the way it is but I asked for something "close" to Jojo, not a straight up copy. Doesn't matter though, I like it like that.
>>
>>102141751
what is your name?
>>
>>102141743
it's called sharing, anon, did your parents not teach you about it?
>>
File: aigrifter2.png (793 KB, 1024x1024)
>>102141743
workflows should have licenses
>>
>>102141743
>last time I posted a catbox it ended on reddit, civitai, etc.
LDG is truly on the cutting edge. Anon cannot overstate its influence on the space.
>>
>>102141763
FuryoDev
>>
File: file.png (1.06 MB, 1024x1024)
>>
>>
File: file.png (1.18 MB, 1024x1024)
>>
>>102141343

Thanks yeah I agree
>>
>>102141384
Wait WTF you do a variety of images and you don't samefag!? woah
>>
>>102141824
Don't go full programmer socks
>>
File: file.png (1.14 MB, 1024x1024)
>>
File: ComfyUI_01347_.jpg (1.16 MB, 1728x2304)
been spending a lot of time training flux loras, being able to train the clip has definitely helped a lot. Did a test with captions vs. tokens vs. captions+tokens concatenated; may have just been this specific test but the tokens and concatenated lora were both better looking than just captions alone
>>
>>102141935
>>102141878
>>102141849
Thanks for trying dude but I got a question.
Is there an AI chatbot that allows you to apply light modifications to an image? Whatever I'm trying, I'm supposed to hope the AI will nail what I'm asking it.
(I'm not really into this, I'm just trying it for this profile picture thing)
>>
Remember when sd 1.4 released and some people refused to share any prompts because they worked so hard to perfect them. lol
>>
>>102141940
why do the hands always look worse? not shitting on this particular image, ive just noticed that every flux lora ive tried butchers the hands
>>
>>102141940
retard here, what do you mean by tokens?
>>
File: ComfyUI_01198_.png (1.38 MB, 896x1152)
>>102141975
just a sign of overtraining, I think. I've had a few loras I trained mangle hands pretty badly, but when I use an earlier epoch version it'll usually go away
>>
File: file.png (1.2 MB, 1024x1024)
>>102141963
What you're asking for is inpainting. There's really no way to modify an image like you're asking with just words and preserve the existing image. These models essentially work via hallucinating pixels. The only way to really do what you want is to take your image into photoshop or krita and then composite inpainting results of small sections you want to change to achieve what you want.
>>
>>102141968
>art by artgerm and greg rutkowski and alphonse mucha, cgsociety
DONUT STEEL
>>
>>102141968
Over time I've grown to hate you people because you're the laziest niggers on the planet. Simply put, if you don't get the prompt you cannot function.
>>
>>102141991
sorry, I meant tags, like danbooru tags. Tokens just refers to each word in a prompt, the text encoder takes the tag/caption and converts them into what's called tokens
>>
>>102142047
Shouldn't you be using closed black box models. Isn't that more your philosophy?
>>
File: FluxDev_03799_.jpg (261 KB, 832x1216)
>>
>>102141968
Imagine seething about prompts not being shared when you have joycaption available
>>
>>102142007
Yeah overtraining usually ruins fingers with 1.5 and SDXL, might be same with flux
>>
>>102142091
Someone (who isn't me) should hold my hand.
>>
>>102142151
Open source models. Can you imagine. Someone lazy might take the code and use it without sweating for it. Disgusting.
>>
how retarded would it be to buy a 3090/4090 right now to train Flux loras?
>>
>>102142007
are you training at 512x512?
>>
>>102142167
You're missing two parts:
- you want me to make the code
- you then demand I give you the code too
>>
>>102142171
I can't tell you how to spend your money. Will it possibly give you 100s of hours of joy? Yeah.
>>
File: 1719382652795477.png (326 KB, 681x871)
wtf why didn't you guys tell me about group nodes?
>>
>>102142171
depends how much you pay for it. You can train flux loras fine with 24GB. Current rumors put the 5090 at 28 to 32GB of VRAM, launching at the end of the year. You might see used 4090s drop a bit around that time but I doubt the price will be that different from now.
>>
File: 2024-08-29_00330_.jpg (516 KB, 2160x3840)
>>102142171
5090 is still at least half a year away, maybe more, so you won't be able to get anything better unless you spend $10k for an A6000
>>
anyone else suffer from some kind of autistic sickness where they have to read all the archived /ldg/ threads they missed before they can read the new one just in case some new thing was discussed or posted?
>>
>>102142249
sounds more like OCD than autism
>>
>>102142249
as you should. no need to ask something already answered
>>
lora bakers: what's your secret to good hands/fingers? I had this same problem with SDXL as I'm having with flux: by the time the concept/character/style is at its best, the hands/fingers get absolutely destroyed. I've tried turning down the LR too, but it feels like it's inevitable. should I put a random folder of high quality hand photos in or something and hope it understands, lmao?
>>
>>102142226
>launching at the end of the year
unlikely, NVidia yesterday admitted at their financial report session that the Blackwell production process has serious problems and the yields are less than 10%. They ordered new lithographs made at TSMC which will delay the Blackwell market introduction by several months.
>https://www.tweaktown.com/news/100214/nvidia-says-it-will-tweak-blackwell-ai-gpus-issues-with-the-gpu-mask-needing-b200-re-spin/index.html
>>
>>102142249
I like to go through dumpsters too just in case someone famous threw away their receipt.
>>
>>102142324
kek'd
>>
>>102142221
>"his abnormally tall legs occupy 90% of the image"
>legs occupy 30% of the image at best
Fuck this gay ass earth
>>
>>102142317
>should I put a random folder of high quality hand photos in or something and hope it understands, lmao?
That's not how it learns. In particular with Loras you're raping the weights and essentially your dataset is you screaming "I WANT IMAGES THAT LOOK LIKE THIS". A photo of a hand does not make the model learn hands for your anime babes any better, and if anything it would destroy the model even more because you're raping in disparate concepts (hand model vs anime babe).
>>
Listen.
I just can't allow my prompts to get out in the public ever again. The amount of damage that could be done by my perfect prompting prose could do irreparable damage to the minds of normal prompters. I just can't take that risk and still sleep at night.
>>
>>102142453
if prompts are so easy and trivial why do you need them posted?
>>
>>102142453
kek you say prompts but truly you want the workflow ;)
>>
>>102142480
>being this retarded
>>
>>102142317
include more images in your dataset where the hands are visible, also use at a minimum 150 images.
>>
>>102142498
obviously the prompts aren't easy otherwise you wouldn't need them posted
>>
File: 1708148266564601.png (422 KB, 1787x872)
>>102142371
yeah that sucks
>>
>>102142514
>being this retarded
>>
>>102142537
Explain why you need the prompts posted
>>
>>102142543
NTA but isn't sharing enough of a reason?
>>
is 20 image flux training still the recommended way?
>>
>>102142558
No, not particularly. In the same way I don't care if you fail or succeed. If me not sharing my prompt harms you that makes it even better.
>>
>>102142577
this nigga set his upload speed to 0 in his torrent software
>>
>>102142573
It's probably enough for a person or character. For a style or some esoteric concept you'll want many more
>>
>>102142608
No, you see, when it comes to "sharing" there is a fundamental concept of reciprocation. For me to share I have to believe that I will receive equal value back. In this case, I do not believe you have anything of value, you need prompts because your prompts suck.
>>
>today me
>tomorrow you
>>
>>102142625
egoist
>>
>>102141433
the lora rapist
>>
File: file.png (244 KB, 318x342)
>>102142645
ironic, because your demand for prompts literally comes from a place of selfishness
>>
>>102142663
several anons are replying to you, schizo
>>
>>102142625
prompt stealer frfr
>>
File: 2024-08-29_00332_.jpg (540 KB, 2160x3840)
>>
File: file.png (1.63 MB, 1024x1024)
>>
>they they are stealing my prompts and then they post these images somewhere and someone gives them attention for it and that that could have been my attention my work has been stolen I could have made money somehow off my prompts
>>
File: file.png (979 KB, 1024x1024)
>>
File: 1712436184592960.png (186 KB, 1865x448)
prompting is very important. i'm trying to make a woman have an abnormally long neck that occupies most of the image's frame but this is the best i'm getting and i'm sure i can get it with just the right prompt. any tips?

this is the prompt, i'm sharing it willingly and freely by my own volition
>National Geographic magazine faceshot of a exotic young african woman with the longest neck in the world, she wears over 100 gold rings around abnormally long neck. Because of her neck she is over 2 meters tall. Her extremely long neck occupies most of the image frame.
>>
>>102142799
>picrel
keke
>>
File: file.png (661 KB, 1024x1024)
I always figured throwing a tantrum like a 6 year old not getting to play with his brother's toys was an effective strategy.
>>
>>102142510
thank you. my current dataset is already 5x that but the artist doesn't have a lot of hands visible and some of them are not very well drawn (I tried to remove those ones or crop where applicable). I guess that's part of the problem, though
>>102142375
do you think it'd help to crop some close ups of the existing images where hands are visible? like not just of the hand alone, but I guess so that area takes up more of the image space?
>>
>>102142819
>the egoist can't help but try to get more attention
>>
>"Prompt thieves could be here" he thought "I've never been in this general before. There could be prompt thieves anywhere."
>>
>>102142821
You have to think of training like a Xerox copier. You're pushing the model to make exact copies of an image with a given caption. The best way to train any given Lora is having images that you want to see more of and captioned in a way you want to prompt.
>>
>>102142317
>by the time the concept/character/style is at its best, the hands/fingers get absolutely destroyed

You can't have good reproduction concept/character/style, good hands/fingers, and good flexibility. You can only have 2 out of 3.
>>
File: 1714309822477777.png (42 KB, 1847x244)
>>102142814
i'm stumped. i can't make her neck longer, but i'm convinced it is possible
>>
File: file.png (1.43 MB, 1024x1024)
Surely if I call my brother names he'll let me play with his toys.
>>
>>102142693
Cool

>>102142799
lol
>>
He hasn't even read the book or is unable to prompt the cover he wants properly, lmao.
>>
File: file.png (891 KB, 1024x1024)
Maybe if I demand people give me their prompts they will do it.
>>
please stop having a melty, anon
>>
File: file.png (838 KB, 1878x1367)
https://www.reddit.com/r/StableDiffusion/comments/1f4369h/comment/lkil6nm/?utm_source=share&utm_medium=web2x&context=3
Now we're talking, they making a finetune of Flux
>>
>>102142880
woman with extra long giraffe neck that reaches the sky maybe
>>
>>102142799
"she has a neck like a giraffe's"
>>
File: file.png (1.13 MB, 1024x1024)
>>
>>102142880
Maybe if you make the distance between the two nodes a bit wider it'll work. You're almost there I can sense it.
>>
>>102142929
pie went back in time
>>
>>102142929
But the cherry pie on the left looks nicer
>>
>>102142857
>You can't have good reproduction concept/character/style, good hands/fingers, and good flexibility. You can only have 2 out of 3.
why does lora training have to be so far behind with no tangible advancements...
>>
>gets asked for the prompt
>immediately thinks himself better than everyone else because he got the slightest bit of attention
>>
>>102142929
>DOF effect exactly the same
worthless
>>
File: file.png (697 KB, 1024x1024)
>>
>>102142929
>If you know anyone with money... send them our way
it's that expensive to finetune Flux? I thought it could be done on a 24gb vram card
>>
File: 2024-08-29_00355_.jpg (672 KB, 2160x3840)
>>102142895
>Cool
ty
>>
>>102142943
>the purpose of our finetune is to make everything worse, enjoy
>>
>>102142929
>Picks the blurriest bokeh ridden example image possible
Nice
>>
>>102142929
>we have no data
>we have no dataset
>we have no money
>we have no gpus
>that is why we're perfect for finetuning flux!
I mean great they're trying but you'd think they'd at least try to get one of these things beforehand
>>
>>102142969
Thanks I will enjoy it when it comes out so I can share worse images here
>>
File: 00521-4192611187.jpg (448 KB, 1152x1728)
pony lora was a mild success
>>
>>102142978
Yeah I would have waited until they at least have a better comparison
>>
>>102142963
Assuming 20s/it on a 4090, it would take 23 days per epoch on a dataset of 100k images.
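
For anyone checking the math, here is a minimal sketch of that estimate. It assumes one image per iteration (batch size 1), which the post does not state; `days_per_epoch` is a made-up helper name, not something from a training script.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def days_per_epoch(num_images: int, seconds_per_iteration: float,
                   batch_size: int = 1) -> float:
    """Wall-clock days for one pass over the dataset.

    Assumes every iteration processes `batch_size` images and that
    seconds_per_iteration stays constant for the whole run.
    """
    iterations = -(-num_images // batch_size)  # ceiling division
    return iterations * seconds_per_iteration / SECONDS_PER_DAY


# 100k images at 20 s/it: 2,000,000 seconds in total
print(round(days_per_epoch(100_000, 20), 1))  # 23.1
```

Under these assumptions the figure scales linearly with dataset size, so doubling the dataset doubles the time per epoch.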
>>
File: 1699283913181131.png (41 KB, 77x879)
>>102142935
>>102142937
then it would be too big i think

>>102142941
thank you i know it is possible. i'm trying a different workflow now and it seems to be working. slowly but surely that neck is getting bigger
>>
>>102142978
>on the left: a cherry pie with no crust
>on the right: a tart with no crust and some kind of metal halo
what did they mean by this
>>
>>102142989
well done
>>
>>102142978
>left side isn't even a table, it's on some kind of tree stump
WHAT
DID
THEY
MEAN
BY
THIS
>>
>>102143004
>it would take 23 days per epoch on a dataset of 100k images.
yeah... so we have no other choice but to rent gpu's?
>>
>>102143032
I meant right side, holy shit I'm retarded enough I should be joining their finetuning team
>>
>>102142978
They have Reddit model disease which sadly affected Flux at its core. They only train on aesthetic professional photographer slop, that means maximum bokeh.
>>
File: ComfyUI_00065_.png (726 KB, 360x1512)
almost there
>>
>>102141240
Please upload to catbox <3
>>
>>102143043
>>102143004
doesn't it only take like 10 epochs to really finetune most archs, 20ish to fully overwrite the original knowledge?
that means in less than one (1) year we could have a flux finetune once we've all moved on to new tech!
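Worked out with the 20 s/it and 100k image figures quoted above (about 23.15 days per epoch on a single card), and treating the 10 and 20 epoch counts as the rough guesses they are:

```latex
\begin{align*}
10 \text{ epochs} &: 10 \times 23.15 \approx 231 \text{ days} \\
20 \text{ epochs} &: 20 \times 23.15 \approx 463 \text{ days}
\end{align*}
```

So "less than a year" only holds for the 10 epoch case; 20 epochs on one card would run past a year.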
>>
File: grid-0203.jpg (185 KB, 1536x1152)
>>102143027
thanks, trying multi character lora based on vampire hunter d
>>
>>102143068
I'm sure it's slow as fuck because 24GB isn't enough, so it has to swap layers in and out of VRAM and that shit takes time, but maybe 2x4090 would do the trick
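Back-of-envelope sketch of why 24GB is tight. It assumes Flux is roughly 12B parameters, bf16 weights and gradients (2 bytes each), and a plain AdamW optimizer keeping two fp32 states per parameter (8 bytes). Activations, the text encoders and the VAE are ignored, and real trainers use tricks (8-bit optimizers, offloading) that change these numbers, so treat it as a rough guide only.

```python
GB = 1e9  # decimal gigabytes


def training_memory_gb(num_params, weight_bytes=2, grad_bytes=2, optimizer_bytes=8):
    """Rough VRAM needed for a full finetune, excluding activations.

    Defaults are assumptions: bf16 weights and grads, AdamW with
    two fp32 moment buffers per parameter.
    """
    weights = num_params * weight_bytes / GB
    grads = num_params * grad_bytes / GB
    optimizer = num_params * optimizer_bytes / GB
    return {
        "weights": weights,
        "grads": grads,
        "optimizer": optimizer,
        "total": weights + grads + optimizer,
    }


estimate = training_memory_gb(12e9)  # ~12B parameters (assumed)
for name, size in estimate.items():
    print(f"{name:>9}: {size:.0f} GB")
```

Under these assumptions the bf16 weights alone already fill a 24GB card, which is why a single-GPU script has to keep shuffling layers around.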
>>
File: FtcwlXuWAAgJWDb.jpg (95 KB, 924x768)
>>102143005
"Woman with a neck like Junpei from Persona 3"
>>
>>102143068
it would take longer because you probably need a minimum of 200k images to properly fine tune it especially if the goal was to make it more pop culture savvy
and that's assuming that the janky low VRAM script of Kohya doesn't spin the model into oblivion 10,000 steps in
>>
File: 1694802076383633.png (726 KB, 360x1512)
>>102143086
too short. don't want no short necked woman
>>
>>102143083
Kohya has never really properly supported multi-GPU, it's always seemed janky if not outright non-functional
>>
>>102143106
wait seriously? they should focus on that stuff, you can't finetune flux on a single gpu. goddamn, it's been 2 years since diffusion went mainstream and they waited for flux to appear before implementing multi-gpu training? fuck man
>>
>>102143106
yeah, as a multi-gpu haver I have never been able to get it to work, even on troonix. it won't work at all on windows because it depends entirely on accelerate and accelerate depends on nccl. I've read some old posts claiming they got it to work on windows by changing the backend to gloo but I only hit error after error
>>102143123
iirc the issue isn't really with kohya's code so much as it is with accelerate. and if you look on accelerate's github issues, the devs pretty much give 0 fucks about functionality or implementing new things
basically we need to replace accelerate
>>
It's finally out for FREE

https://www.reddit.com/r/StableDiffusion/comments/1f4b6rk/flux_lora_training_simplified_from_zero_to_hero/
>>
File: file.png (74 KB, 3765x913)
>>102143169
doesn't seem to work
>>
b u y a n a d
u
y

a
n

a
d
>>
>>102143191
>>102143067
>>
>>102143160
>basically we need to replace accelerate
or we need to make a fork of accelerate that will implement that multi gpu setup
>>
>>102143188
FALSE FLAG FAKE NEWS, you are trying to stop my main man from getting more views?
>>
>>102143217
kek
>>
File: FD_00144_.png (1.88 MB, 768x1344)
>>102139760
Oh good someone capped it. The funniest thing about this is the Anon who said this is in this thread right now shutting the fuck up finally.
>>
File: 00546-4192611188.jpg (401 KB, 1152x1728)
>>
>>102143205
right, or that. ngl debating asking chatgpt to change kohya's code til it works with my gpus on windows and just seeing how far it gets
>>
>>102139760
for lora it became possible, for a full finetune it's still not. without multi-gpu finetuning support it's impossible for now
>>
>>102142929
>it's the same fucking image
Some rich coomer needs to donate a fuckton of compute to the bigASP guy and get him to do a full flux finetune on that dataset. At least that model is trying to be something different.
>>
>>102143242
>debating asking chatgpt to change kohya's code til it works with my gpus on windows and just seeing how far it gets
my new best AI friend is Claude 3.5 Sonnet, that mf just gets it
>>
>>102143169
>only 5% paywalled
He's an expert salesman
>>
>>102143270
I wouldn't give it to them, they're only focused on aesthetic when the main focus should be adding more concepts to the model
>>
>>102143270
Yes exactly, that's the most realistic sdxl model really, not so much of that modeling photoshop look
>>
>>102143279
I tried claude for a while and although some of the coding was better out-of-the-box, I found it did a lot worse at understanding wtf I wanted and problem solving issues. I think my best results so far have been getting claude to write a 'foundation' then having chatgpt adjust it
>>
>>102143160
I got multi-GPU to work with Gloo on Windows when training Pixart, and that uses accelerate. But kohya's code is janky so it's likely they haven't properly coded for multi-GPU, and they honestly don't seem to care at all if people use their trainer or not. But maybe Flux will finally be the reason to actually test and support multi-GPU (no high hopes).
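For reference, a minimal sketch of the backend choice being discussed, written against plain torch.distributed rather than accelerate or kohya's scripts (how either of those wires the backend through is not shown here and I'm not claiming this is what they do). pick_backend and init_distributed are made-up helper names. The one hard fact is that NCCL is not available on Windows builds of PyTorch, so Gloo is the fallback there.

```python
import sys


def pick_backend(platform=sys.platform, cuda_available=True):
    """Choose a torch.distributed backend name.

    NCCL is not available on Windows, so fall back to Gloo there.
    Elsewhere, prefer NCCL for GPU training and Gloo for CPU-only runs.
    """
    if platform.startswith("win"):
        return "gloo"
    return "nccl" if cuda_available else "gloo"


def init_distributed(backend):
    """Initialise the process group (not called in this sketch).

    Expects RANK, WORLD_SIZE, MASTER_ADDR and MASTER_PORT in the
    environment, e.g. as set by torchrun.
    """
    import torch.distributed as dist  # imported lazily so the sketch runs without torch

    dist.init_process_group(backend=backend)


print(pick_backend("win32"))  # gloo
print(pick_backend("linux"))  # nccl
```

Gloo works for GPU tensors but is generally slower than NCCL for gradient all-reduce, so even when it runs, don't expect Linux-level scaling.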
>>
>>102140220
Simple. Some people simply don't know the answer and debo makes shit up to try and sound smarter than he is.
People in here are very helpful overall.
>>
File: 00557-4192611188.jpg (370 KB, 1536x1536)
>>
>>102143320
there's no alternative to kohya? surely someone will make that shit work, finetuning flux is a big deal
>>
>>102143320
yeah, I'm guessing my best bet if I want to utilize kohya is to more or less rewrite parts of his code if I want it to work
>>
>tfw it's a stolen prompt from fluxart.pro
Oh well, thanks I guess
>>
>>102143338
I mean hypothetically if you have someone serious about training who isn't just some jeet on reddit, they should be capable of finding someone to, or writing their own training code (or editing existing) to do the job
>>
>>102143338
Sadly the space is made up of poorfags. Maybe ai-toolkit's dev is up to the task, OneTrainer's dev is a poorfag with like a GTX 1080, he couldn't even test Pixart properly. These guys don't have 4090s to test with let alone two of them.
>>
>>102143383
>Sadly the space is made up of poorfags. Maybe ai-toolkit's dev is up to the task
still, having a script that supports multi-gpu is important, you can rent multiple gpus and it won't be that expensive, that's what the llm fags are doing
>>
>>102143404
Flux is the first image model to really push the limits, whereas LLMs needed it from the start, even for inference on large models.
>>
>>102140220
Giving a vague answer that anyone using Flux would already know isn't very helpful.
>>
>>102143431
yeah idk, a multi-gpu setup is useful even on small models like SDXL, it makes shit faster. I've heard that the pony dev has like 8 gpus working in parallel or some shit
>>
>>102143381
Multi-GPU training code is more esoteric and awful than multi-threading. It would help if the original devs released their training code like Pixart did. Pixart so far is the only one to release actual production-ready training code with multi-GPU support.
>>
>>102143461
>Pixart so far is the only one to release actual production-ready training code with multi-GPU support.
maybe we can take inspiration from Pixart's training code to make it work on flux, they're both DiT-based after all
>>
Here we go, a fresh delivery of...
>>102143475
>>102143475
>>102143475
>>
File: ifx307.png (1.16 MB, 1024x1024)
>>
>>102143538
sonichu...
>>
File: ifx266.jpg (264 KB, 1024x1024)
>>
>>102143577
This is great
>>
>>102143614
ty
>>
>>102142249
I get all my lora training info from the /h/ archives.
>>
>can't even make it to 100 imgs
>>
File: ifx267.jpg (202 KB, 1024x1024)




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.