[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>107938114

https://rentry.org/ldg-lazy-getting-started-guide

>UI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>ZiT
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/ranfaggot
>>
>>107939880
In case you’re wondering why these trolls won’t stop infesting this thread: their dead general has been dead for almost a week.
No one wants to post in that shithole anymore, so here they are.
>>
>>107939880
>>107939920
Kill yourself Julien
>>
blessed thread of frenship
>>
So are the mods ever going to do anything about the spam? I got a warning for reporting it so somehow they clearly haven't noticed a pattern in the report queue yet.
>>
>>107939880
>https://rentry.org/ranfaggot
Has a paranoid schizophrenic written that? Reads like it desu
>>
>>107939989
In order:
Jannies don't give a shit about the wellbeing of the website
They somehow give even less shit about AI threads
Reporting is retarded on these grounds
You expecting the lazy hall monitors to do detective work is also retarded, they will glance at the reported post for two seconds, if they can't immediately tell something wrong with it, they will just ban you to stop annoying them
No they won't do anything about it, it will just keep getting worse
And no there is nothing (You) can do about it, maybe pestering them on irc will result in something, but in all likelihood they will just laugh at you
>>
In the off-chance that this thread stays up
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107940032
we have to move to /bant/, or any board with IDs, this is unbareable
>>
>>107940083
There is already a bant ai thread, but it is dead and slow as shit
IDs won't discourage schizos, they are completely shameless and spam their garbage regardless
It can make filtering easier, but I think they will start residential proxy hopping if they feel like they are getting filtering too much
>>
File: NoLoraVersusWithLora.jpg (3.63 MB, 2880x1824)
3.63 MB
3.63 MB JPG
So I can pretty confidently say anyone complaining about Klein Lora training is probably just a jeet. Tried a quick Klein 4B Base one on FAL's trainer with a Rubi Rose dataset I already had, just cause she has kinda a subtlely blasian look to her that can be a bit tricky for these models to get right sometimes. Even inferencing just on the Base itself as opposed to Distilled, it's pretty spot on, even at one-shot 1440x1800. Same seed / same prompt in the pic, without lora on left and with it on the right.
>>
>>107940311
Posted on the schizo drama bake so thread's gonna get nuked but interesting. I am waiting for better support on OneTrainer until starting to dabble with klein lora training.
Does it work fine with the distill?
>>
>>107936662
Photograph of a young woman sitting in a train carriage, taken from a medium close-up angle. She has long, straight blonde hair, fair skin, and is wearing a bright pink cardigan over a white crop top, paired with a pink mini skirt and red heart-shaped sunglasses. She's holding a water bottle in her right hand, looking to her left with a slight smile. The background includes patterned black seats with pink accents, yellow handrails, and a window showing green foliage outside. The lighting is natural, coming from the window. The composition uses the rule of thirds, with the woman positioned slightly to the right. The image has a very high aesthetic quality. The camera angle is slightly tilted, adding a casual, candid feel. The overall style is modern and vibrant, with a focus on the subject's relaxed, stylish appearance.

>>107936783
nah im not ganna release people loras sorry

>>107936977
this is nice. how did you make this?
>>
>>107940311
skin looks like plastic either way, but even worse on the lora side
>>
>>107940425
Well it is Base. I expect it to look better in that regard on Distilled. I was also pushing the one-shot resolution pretty high there. In any case it's trained on actual photos of her so not really much else I can do lol, a lot of these kinds of photos are filtered to begin with before being uploaded and don't exactly have muh detailed skin in the first place.
>>
>>107940368
I don't see another "current" thread
>>
https://github.com/Comfy-Org/desktop/releases
>2 days ago
>still no successful build
did they just give up?
>>
>>107940727
yes, we are all going back to a1111
>>
>>107940311
wait but that looks almost nothing like her?
>>
>>107940883
Wat? On the right? It looks pretty much exactly like her when she doesn't have her hair dyed a weird color.
>>
>>107941046
(Samefag)
this is two of the actual dataset images side by side, for example
>>
any tips on making an image very realistic. say the input is a 3d render or a medieval gravure. using flux klein 9b. Tried with chatgpt
>>
>>107941092
what were your settings/captions?
training one on 9b right now and the samples look absolutely retarded again.
i feel like there must be a bug or something with my python/torch/cuda version or whatever
f2k character loras are absolutely shit for me
>>
>>107941244
Klein has that flux plasticity. Not too many options besides seed lottery. Sometimes asking for shit like "Analog photo from [Year] shot with [camera]" can help a bit.
>>
>>107941260
Ai-toolkit? Wait until an actually decent trainer like OneTrainer finishes adding proper support.
>>
>>107941308
using the onetrainer PR, but honestly it gives me the same results as aitoolkit. already tried tons of different configs because i really want f2k to work but i only get crap. will post an example later, i'm currently at 1.7k steps

would you maybe upload the rubi dataset? i could test it and compare results
>>
>>107941346
The PR isn't merged yet for a reason. Still missing decent chunk of stuff. Either wait or don't get worked up too much over it.
>would you maybe upload the rubi dataset?
I am NTA.
>>
>>107940311
You may not see it yet but Klein tends to give the same face structure of every women, (nose, jaw, eyes) nothing compared to z-image which gets the look right away
>>
Your thread did not reach the bump limit yet
>>107938114
>>
>>107941455
>>107940311
https://files.catbox.moe/2szs05.png
pic related is 3k steps of greta training with onetrainer. aitoolkit gives me similar results. you can see it picks up the character and also learns some details (like mole on the cheek), but it looks slopped and uncanny and not really like her. maybe she is a bad example because she is supposed to look like a retard but still...
>Still missing decent chunk of stuff
like what? honestly just curious
>>
>>107941298
no it doesn't lmao, it looks nothing remotely in any way like stock Flux.1 Dev
>>
does this ran person actualyl exist or is it some bogeyman made up by ani to cope with being hated by everyone, I can't keep up with the lore anymore
>>
>>107941488
Z even with Ostris V2 absolutely does not get the look right away lol, it takes WAY more epochs than e.g. Flux.1 did to adhere likeness wise
>>
>>107941725
Mr catjak in archive and x
>>
>>107941633
It seems I confused it with musubi's pending PR, my bad. Doesn't seem like anything egregious is missing in the PR. Still, I am going to keep waiting until proper release.
>>
>>107941260
My dataset has 120 images of her in like a ton of different outfits and varied locations, all at least 1440x1440. All the images were descriptively NLP captioned fully with jailbroken Gemini 3 Pro using a custom captioning prompt I have that gets it to output stuff a certain way. The one consistency between all the captions in this case is basically they all start with "a photograph of a [sometimes other descriptors here] Black Rubi Rose woman", which is basically how I always do it for single subject loras, like [Race] [Name] woman / man / whatever.
>>
>>107941799
hmm i see, sounds solid. but even a less sophisticated dataset should work imo
>>
>>107941633
I mean the samples are never great (although I hear you can use Distilled to do the sample inference while training on base) but to me it just looks also kinda like you're inferencing at a relatively higher resolution than the original images were or than the training resolution was.
>>
>>107941839
yeah this one was actually trained on 512 because 1024 didnt really work for me either so i played around a bit.
but even if i generate at 512 it's not very good
https://files.catbox.moe/4ccgpu.png
>>
>>107939920
This has nothing to do with this and everything to do with a mentally ill retard infested the general for more than 2 years.
Maybe you, maybe someone else, doesn't really matter, it just ruins any sane discussion.
>>
If you train lora on a base model, can you use it with a distilled model or not? What is the recommended way of creating the lora on? Base or distilled? Related to klein but a more general question.
>>
>>107941975
rule of thumb is to train on base, use on distill
but then there are also cases like zit with no base, so people created workarounds
>>
>>107942048
OK thank you anon, I thought this rule was only for finetunes so I wanted to be sure.
>>
>>107941975
Base will certainly respond a lot better to training than distill in general. But a lora trained on distill will be more compatible for distill inference and for relatively more simple concepts, the distill might still be able to learn fine if they have not fried it. But also based on how extra limbs commonly keep appearing, I feel like they fried the distill a bit. (But again it has respectable seed variance, which means the distill might be reasonably conductive still to absorbing new concepts?)
My guess is that you want to train on base but needs to be tested.
And I am aware that I said but a billion times.
>>
>>107941937
say that to the mods, when they analyze the raid history
>>
first deleted lora for klein 9b, a breast one (enhancing breasts look), probably because you can use it to make a clothed to topless image using edit
>>
>>107941975
Yes, you should only do it that way
>>
>>107942066
>My guess is that you want to train on base but needs to be tested.
I'm doing that so I'll see.
>>
>>107941878
The likeness is decent IMO, it's just not that detailed. How detailed is the dataset to start with?
>>
File: FourModelComparison.jpg (3.28 MB, 5184x1728)
3.28 MB
3.28 MB JPG
```A stunning, edgy young woman with split-dyed hair—left side raven black, right side neon green—dominates the frame, leaning back playfully on a messy bed in a suggestive pose. She wears a distressed graphic crop top, a spiked leather choker, and fishnet tights, gazing directly into the lens with a confident smirk. Her makeup is bold, featuring sharp winged eyeliner and dark matte lipstick. The scene is illuminated by the harsh, direct flash of a modern smartphone camera, which creates hard, crisp shadows against the wall behind her and highlights the glossy texture of her hair and the natural sheen on her skin. The background is a cluttered, authentic bedroom featuring band posters taped to walls and tangled LED fairy lights glowing faintly in the periphery. Style: Candid amateur photography, vertical social media aesthetic, raw and unfiltered. Mood: Confident, intimate, rebellious.```
>>
>>107942152
X to doubt given the multiple literal breast and ass slider loras I can see for Klein as we speak on CivitAI
>>
>>107942246
>The likeness is decent IMO
i think it's okayish, but these are already cherry picked results. keep in mind she naturally looks retarded which helps a lot here.
it's 74 pictures with different sizes and aspect ratios. what i posted was only with a trigger word, but i also tried with captions generated by qwen vl
>>
>>107941298
If you mention 3d or if the character doesn't have realistic proportions, it'll make the characters plastic.
>>
>/ldg/ thread
>look inside
>it's some gay tranny shit
this place has fallen really deep
>>
>>107941260
Many people are reporting the same bad results with likeliness. I didn't get good results either even with a 1 single image in the dataset. Chances are it's the model's problem.
>>
>>107942285
x to doubt what? the reason? that's the only one I can see, there is nothing special to the lora otherwise
>>
>>107942329
it's been the same schizo for four years but now he's crashing out more often
>>
Is WAN good for gooning?
>>
>>107942329
>>107942386
lmao, when was the last time you /adt/ fags even baked, like a week ago? no wonder you're raiding us 24/7, you've got nothing else going on kek.
dead general energy you comments
>>
/adt/ refugee status?
>>
File: 1751278258543282.jpg (2.21 MB, 1664x2432)
2.21 MB
2.21 MB JPG
>>
>>107942329
I wish we could go back 3 years. Early diffusion generals were amazing, full of creativity and fun. Now most good posters just fucked off to pixiv or civit
>>
>>107942406
>>107942396
I hate you!! Why aren’t you in my thread validating my sick fetishes? I need you to be my accomplice!!
>>
>>107942396
huh what are you capable of understanding text?
>>
File: 1744916498774516.jpg (2.95 MB, 2688x1536)
2.95 MB
2.95 MB JPG
>>
>>107941735
nigga all you train is celeb loras. the most low-hanging fruit of lora training, it would be bad if they sucked ass because you obviously have access to very good dataset available for anyone, you should start preaching when you train an original character
>>
File: LoomerTwerk.mp4 (1.64 MB, 480x640)
1.64 MB
1.64 MB MP4
>>107942336
People always report bad results with AI Toolkit specifically whenever models first come out though, I think he rushes the training code to be first
>>
>>107940311
>Rubi Rose dataset
why would anyone waste time and money on such a hideous nigger?
>>
>>107942509
*puke*
>>
>>107942526
Not AI generated believe it or not
>>
>>107942537
*puke*
>>
>>107942509
dunno who this is, but dat ass
>>
>>107942516
>absolute blasian smokeshow
>hideous
ok there bud kek
>>
>>107942544
She's known for her activities as a far-left progressive activist, primarily
>>
>>107942550
it's almost like people have different tastes in things

disgusting
>>
>>107942556
>far-left
>progressive

nice oxymoron
>>
>>107942550
>Brown hands typed this
>>
>currently 6 threads shitting up the catalog
>but we must ban the general that has been in one thread for a week
lol
>>
File: 04064414.png (1.74 MB, 1088x1088)
1.74 MB
1.74 MB PNG
>>
>>107942568
I'm a white Canadian guy in his 30s lmao. When was the last time you heard a (presumably American) brown person say "smokeshow" or "ok there bud" anon
>>
>>107942611
sure
>>
File: ZIT_Woman.png (3.18 MB, 1536x1536)
3.18 MB
3.18 MB PNG
>>107942558
Somehow I suspect your taste is exclusively this isn't it Anon
>>
>>107942599
hockey players?
>>
>>107942618
no, i have no problem with any race.. im not a racist freak like most of 4chan
>>
>>107942628
wat
How are you not this guy if this comment chain is followed straight back:
>>107942516
>>
Is there no fucking way to get Turbo ZiT to do anything other than Southeast Asian women?
>>
>>107942628
You've a got problem, man. You're stereotyping just much as these racists.
>>
>>107942653
yeah just say the ethnicity, like Caucasian, Black, whatever. Sometimes countries or continents work but not as reliably
>>
inpaint masking not possible with flux klein?
>>
>>
>>107942647
uh, what?
>>
>>107942594
you did the same with /edg/ and /hdg/
pattern of behavior established, faggot
>>
File: Flux1Dev_Output_262662.png (3.51 MB, 1536x1536)
3.51 MB
3.51 MB PNG
>>107942757
Flux.1 Dev version of the same prompt is so ass in comparison lmao
>>
>>107942757
>No legs
>>
>>107942666
it is possible. i do it with inpaint crop&stitch.
very lazy example: https://files.catbox.moe/6gugcj.png
>>
>>107942246
>>107941839
heres the zit version with the same dataset and triggerword only. honestly think its just a lot better overall
https://files.catbox.moe/2cvwnx.png
>>
File: 0305884.png (2.27 MB, 928x1664)
2.27 MB
2.27 MB PNG
>>
File: 41698180.png (1.96 MB, 1216x960)
1.96 MB
1.96 MB PNG
>>
File: 27637669.png (2.2 MB, 1756x1168)
2.2 MB
2.2 MB PNG
It still gets me out of guard how this just works, lol
>>
File: x_anqbv6.png (1.26 MB, 1536x1024)
1.26 MB
1.26 MB PNG
>>
>>107941937
There seems to by multiple mentally unstable people. There's the shitty video poster. There's the weirdo who keeps calling everyone Julien (and might murder some random guy that they "doxxed" Julien as). Then there's this weird ani/ran/debo schizo drama (who knows how many mentally unhinged people are wrapped up in that). These people have been absolutely brain fucked by AI. I guess this is the future we have to look forward to.
>>
>>107942993
>>
File: file.png (2 KB, 136x52)
2 KB
2 KB PNG
>why is my room so warm?
>>
>>107942757
her hood turns into her hair
>>
>>107942890
How is your lora trained for Klein? Did you use just one dataset or multiple datasets that are paired?
>>
>>107942890
maybe the klein lora is just underbaked
>>
File: 1738425659047320.gif (165 KB, 492x662)
165 KB
165 KB GIF
So I have a pretty big collection of images related to my specific fetishes, what would be the best way to train a lora on this to generate more? (and also what model would train best I guess?)

Let's say some images depict X, others depict Y or Z. Could I train a generic, "universal lora" that would include and be able to generate all of these concepts, or only one at a time? I'm guessing not, because from my understanding, you shouldn't caption what you're training on, so I couldn't have image A captioned as "a woman wearing [thing X]" and image B captioned as "a [thing Y] on the table" and then have the lora distinguish between them?
Or have I got it completely backwards?
>>
>>107943104
one dataset because i am going for t2i. the pairing is only used for edit training, right?

>>107943107
i tried a lot more steps with other characters but it toally fried at some point without increasing likeness. this time both were 3k steps with batch size 2 so i think it should be enough imo
>>
>>107943009
the Julien poster is just ranfag/catjak. same with the drama poster it's the same faggot. the pepe/slopvid spammer is just an unrelated troll because the ranfaggot gets away with his bullshit all the time
>>
>>107943132
I wonder if all of the complaints about Klein loras turning out badly is centered around t2i. Maybe training for Klein doesn't work well unless you're using paired datasets.
>>
>>107943124
Loras have bad concept separation in general. Supposedly, new lora formats fixes this problems, but they produce poor likeliness in my experience.

If you don't caption what you're training on, you'll just have a stiffer lora that's less flexible because your dataset is associated with fewer keywords.
>>
>>107943143
I've been here since 2016. If you think you can tell who is who, you are wrong.
>>
>>107942949
>just werks
>skin colors don't match
>>
>>107943193
>2016
tourist
>>
>>107943124
Rule of thumb is that if you train a LoRA you might want to train one fetish each, if you want all of them trained in the same thing you should prepare to really crunch whole checkpoints directly rather than a reduced rank LoRa as is typical.

If you want to try and see if it works in a LoRA with all of XYZ... feel free to give it a shot,but I'd recommend against attempting it very many times. You should sooner give up and try on another base model or, more likely, switch to whole checkpoint training if it doesn't work.
>>
>>107943225
kek
point is, people pop in and pop out. Sometimes people get a wild hair and decide to troll a little on their own. I've been called every kind of thing based on whatever drama was happening at the time. Never assume that you know what is going on here, and definitely don't take it seriously. This is not some community that has known actors (unless they are a tripfag, but even then, they get hacked sometimes). It's just an anonymous board. Nobody here will every know eachother in real life. Stop letting your human instinct force this idea of community on you when you post here. It can literally drive you insane (as has been evidenced in these threads). Maybe it's because it's a general, people feel like they are talking to the same people every time. You're not. This is not a forum.
>>
>>107943277
Good Point.

Even If I just change up my capitalization and sentence structure people can't tell that I'm samefagging.

(Me)
>>
>>107943190
>If you don't caption what you're training on, you'll just have a stiffer lora that's less flexible because your dataset is associated with fewer keywords.
My understanding was always that you would substitute what you want to train on with a single trigger word as opposed to describing the thing itself, since the model would "ignore" the other things you described.
Only problem I have with that is that a lot of things don't really have a universal, standard depiction (see: a single character vs. many different people doing or wearing variations of X or Y), so I figured it'd be ineffective at learning that.
>>107943259
Fair enough, I suppose I'll have to experiment around a bit - my only pain point really is the captioning since I doubt it can be automated well for hundreds of images, especially not with some niche fetishes or whatever. If it gives better results then I suppose I can spare the time to train several loras for each.
>>
once you see the AI you can't really unsee it
>>
>>107942252
>dat buttchin
>>
>>107943299
>my only pain point really is the captioning since I doubt it can be automated well
yes, you have to just tag the fetishes you want on the images

then you either train like that (just 1 tag per image) or you add the other tags with a non-nsfw-refusing or even nsfw-friendly (pixai, joycaption, wd...) captioning model
>>
>>107943124
>I'm guessing not, because from my understanding, you shouldn't caption what you're training
Wrong, it should be captioned, the less guesswork the model has to do the better the result will be.
As others have said loras work better with only a single or very few things in them, if you need multiple things that work together you should do dreambooth training
>>
>>107943277
>>107943295
this is a really bad take. ranfaggot actively spammed the discord with drama as well, exact same nomenclature as the schizo here until he was banned. it's literally him most of the time. unrelated trolling is other anons is the only thing I can agree with
>>
>>107943299
Trigger words don't always work.
>>
File: 96595.png (1.74 MB, 1152x1024)
1.74 MB
1.74 MB PNG
>>
File: 613341743.png (1.64 MB, 896x1408)
1.64 MB
1.64 MB PNG
Don't embedding in principle also work for these DiT models? The devs of the training repos just didn't bother to port it for flux? I really dislike how slow lora training is specially if you want to train something simple
>>
>>107943532
i'm kinda sad we have forgotten about embeddings.. ofc they only work for things the model already knows, but we could be easily sharing tiny 20kb embeddings of different useful concepts and training them quickly
for example all those slider loras could just be embeddings, and would probably be less destructive than the trained loras
>>
>>107943344
yeah Flux.1 looks extra Fluxxy nowadays in comparison to the other models lol
>>
>>107943573
Embeddings are a lot easier to overcook.
>>
>>107942828
she's kneeling
>>
>>107943581
surely can be mitigated by adjusting learning rate and other hyperparams?
idk i feel it's a bit unexplored, like it was explored a bit during the sd1.5 days and then dreambooth and later loras came and everyone forgot about them
>>
>>107943168
it's probably also people expecting training at like FP4 and low resolution and other copery to just magically work super well too
>>
>>107942890
were your other gens on Base though or on Distilled? make sure you test it on Distilled, not just Base
>>
>>107943386
okay schizo LOL :]
>>
>>107943573
remember textural inversion?
>>
>>107943668
thats what embeddings are lol
>>
someone trained a prompt boomerifier
https://huggingface.co/retowyss/PromptBridge-0.6b-Alpha
the fact that this even needs to exist is ridiculous in my opinion, the newer DiT models have absolutely terrible seed variance so we get around this by plugging in non-deterministic LLMs to rewrite the prompts endlessly so the model doesn't generate the same 1woman in red 1dress standing on 1beach every time
>>
Anyone have a guide on how to prompt for flux?
I assumed it used spoken english but I think I'm missing something here.
>>
>>107943277 (Me)
>>107943295 (Me)
>>107943386
>>107943633 (Not Me)
I didn't even know there was an /ldg/ discord. Even if I did, I wouldn't have joined it. It's a bad idea to go places with an identity that are linked to here. A huge crackdown hasn't happened yet, but just be cautious, anon. Right now powerful entities are in the collection and analyzation phase.
>>
>>107943386
>the discord
can't you just fuck off to it and never come back faggot
>>
>>107943694
If you mean Klein, this guy distilled BFL's official prompt guide for it into a system prompt for LLMs:
reddit.com/r/StableDiffusion/comments/1qf2qew/flux2_klein_prompt_enhancement_llm_system_prompt/

Works pretty well with Gemini 3 in my testing, probably with others too.
>>
>>107943675
meant hypernetwork. my bad
>>
>>107943715
Good lord, they sure made it convoluted. If it wasn't for the pixel shift issue with qwen, there would be no reason to even try klein
>>
File: 00227.jpg (1.11 MB, 2737x4000)
1.11 MB
1.11 MB JPG
>>
>>107943739
whoa, this was with ai?
>>
>>107943757
>2.5D
obviously
>>
>>107943775
not bro falling for the uber low tier bait
>>
>>107943783
in the off chance that it was some random person was scrolling /g/ outside of the catalog and hadn't been keeping up with the state of local AI
>>
File: 089143447.png (2.46 MB, 1024x1280)
2.46 MB
2.46 MB PNG
>>
File: 00212.jpg (590 KB, 2737x4000)
590 KB
590 KB JPG
>>107943757
Haha yes it was.
>>
WARNING WARNING WARNING WARNING WARNING

Installed frontend version 1.35.9 is lower than the recommended version 1.37.11.

Please install the updated requirements.txt file by running:
\ComfyUI_windows_portable\python_embeded\python.exe -s -m pip install -r E:\SD\ComfyUI_windows_portable\ComfyUI\requirements.txt
If you are on the portable package you can run: update\update_comfyui.bat to solve this problem.

This error is happening because the ComfyUI frontend is no longer shipped as part of the main repo but as a pip package instead.
________________________________________________________________________

FUCK FUCK FUCK FUCK
>>
>he pulled
>>
>>107943594
I trained embeddings multiple times a long while back. Despite adjusting learning rates, I couldn't get good results.

If embeddings were good, one would expect a lot of them to be made. The fact there's so few of them from SDXL onwards means that they're not very good, just like with all the exotic lora formats.
>>
https://civitai.com/models/2330093/juggernaut-zit
Juggernaut ZIT neat if anyone wants to shill it properly feel free I’m tired.
>>
>>107943921
>early access
>>
>>107943839
So, they drew something in photoshop, printed it out on recycled paper, and then put finger paints on top?
>>
>>107943594
I kind of agree and would like to see it at least attempted with newer models.
>>107943872
>The fact there's so few of them from SDXL onwards means that they're not very good
maybe. But I still think it's worth trying with recent models. I just wouldn't know what software to use.
>all the exotic lora formats
yea, that shit is retarded. Civit still has all of those nonsense formats, but they refuse to break qwen out even into image and edit subcategories.
>>
File: file.png (45 KB, 353x252)
45 KB
45 KB PNG
>>107943921
>pic related
we're back to this stupid shit?
And it didn't even make the "skull made of flowers" that they asked for. What a bunch of retards.
>>
>>107943851
not a big deal, just update the ui version
>>
>>107943730
I mean it helps but it's not strictly necessary for really any of these newer models, as you can see in this comparison for example:
https://reddit.com/r/StableDiffusion/comments/1qj9cjl/flux_klein_4b_distilled_vs_flux_klein_9b/
>>
File: file.png (379 KB, 699x867)
379 KB
379 KB PNG
>>107943921
oooooooooo someone hacked yo shit
>>
https://litter.catbox.moe/69y7ol.webm
>>
>>107944008
>masterpiece
>BREAK
KEEEEEEEEEK
>>
>>107944030
why can't those faggots just let me open an image in a new window?
>>
btw the spammer got wiped in yesterday's thread:

https://desuarchive.org/g/thread/107938114/#107939336

So if he's posting now, he's evading. Remember to report his posts.
>>
>>107944030
>>107944045
that said, klein looks way better for realism, for everything that doesn't involve human anatomy
>>
What is better, Klein9B or Qwen Image?
>>
File: 88.png (1.55 MB, 1024x1280)
1.55 MB
1.55 MB PNG
>>107944061
you will have to test for your specific >>usecase<<, you may like the aesthetics or capabilities of one over the other
>>
File: file.png (2 KB, 132x53)
2 KB
2 KB PNG
>>107944061
probably qwen 2512, but klein is faster. Idk, make a comparison chart and post it here. My gpu is busy atm.
>>
>>107944055
Kleins Chefs and Cowboy kinda more detailed backgrounds too I think though
>>
>>107944061
Klein 9B is a gorillion times faster and has a better VAE, and better prompt adherence I think.
>>
>>107944061
klein is better, faster, and the gens give different scenes
>>
Been out of the game for awhile, do people not use A1111 anymore?
>>
Has the schizo problem been handled? It was obscenely bad last night. I had to just close the browser.
>>
File: 631958935.png (1.81 MB, 800x1440)
1.81 MB
1.81 MB PNG
>>107944130
to add to that the edit and regular model are one in the same so you don't have to switch, when it was zit vs qwen I was using qwen cuz it was just better since its larger, now I'm doing it all on klein cuz klein has been just as capable but much faster
>>107944188
seems so, the thread is so normal its weird
>>
>>107944188
Yeah the conclusion was to keep the quality maintaining rentrys and laugh at lolcows
>>
>>107944243
How would you, a self proclaimed tourist, even know? Kek
>>
>>107944188
he'll pop back up when the next real thread is baked
>>
>>107944256
You don't need months of seeing people saying "cumfy, troonfy, ani, trani, ranfaggot, debo" to notice it suddenly stopped
>>
>>107944188
it's cyclical, as long as the moderation doesn't do its job
>>
>>107944256
you made me wonder so I plotted all my visits to /ldg/:
https://files.catbox.moe/dhb5mv.png
>>
File: x_im2oms.png (1.47 MB, 1536x1024)
1.47 MB
1.47 MB PNG
>>
>>107943921
>if anyone wants to shill it properly feel free I’m tired.
why the fuck would anyone want to shill a fucking shitmerge let alone one from jug LMAO
>>
>>107944032
what the hell
>>
>>107944061
Qwen is too slopped to be good for much anything.
>>
>>107939880
what is up with these trash gens?
>>
>>107944384
>to be good for much anything.
What about creating slop?
>>
File: x_im2oms.png (1.8 MB, 1536x1024)
1.8 MB
1.8 MB PNG
>>
>>107944398
Last thread was basically an unusable wasteland.
>>
>>107944427
so this is the power of cunny++
>>
>>107944427
lewd loli
>>
>>107944465
did you forget what website you are on?
>>
>>107944523
not everyone here is a rl diddler
>>
How do you finetune a vae?
>>
>>107944402
True
>>
>>107944537
you load the model, do a forward pass, calculate the loss then backpropagate, just like any other model
>>
>julien
>>
>>107944556
>fries
>>
should I get these qwen tts models before they pull a vibevoice?
>>
is klein less retarded at anatomy if I don't use the distilled model?
>>
>>107944586
It's really kinda only good at making cringe anime dub voices. but I know some people like that.
>>
File: 00223.jpg (705 KB, 2737x4000)
705 KB
705 KB JPG
>>
i love huggingface
>Download complete: : 4.52GB [00:05, 900MB/s]
>>
>>107944587
yes definitely
>>
>>107944622
Wait. can you use negative prompts with the base model?
>>
>>107944609
it's not usually downloading at full speed here (10gbit in Europe).
>>
>>107944587
It can be but there's much less aesthetic tuning on the Base. Try just doing 5 to 8 or so steps instead of four. And don't use FP8, use the full BF16 model or one of the GGUFs.
>>
>>107944631
yes
>>
>>107944602
https://files.catbox.moe/1s2sor.mp4

I simply had to animate this image.
>>
>>107944635
that goes for the text encoder too, I should have said.
>>
File: 00221.jpg (851 KB, 4000x2737)
851 KB
851 KB JPG
>>107944643
Cool, thank you! I appreciate this.
>>
>>107944648
Any point in using a derestricted/ablated text encoder?
>>
>>107944633
peering and server load issue
>>
>>107944648
sounds like I should switch to gguf for everything? I'm assuming text-encoder is less sensitive to quantization?
>>
>>107944723
>I'm assuming text-encoder is less sensitive to quantization?
The opposite. Most people simply offload it to the CPU if they can't fit it on their card.
>>
File: 1739051206513940.jpg (65 KB, 864x1097)
65 KB
65 KB JPG
klein edit is... too great
>>
>>107944032
>5k buzz
what a loser
>>
>>107944811
without bots it's around 40.
>>
>>107944723
it has worked quite well for me, mostly with q8 for the main diffusion model

yes, the text encoders usually are less sensitive to the exact quant, many times I can't even tell if or at least about how often Q5 is different from Q8
>>
>>107944656
No problem.
>>
>>107944427
if only it was just cute cunny without the insane constant schizo shit
>>
>>107944828
Hey. You're not me.

Also I made another video

>>107944656
https://files.catbox.moe/xi8dw3.mp4

I was truly inspired.
>>
>>107944811
Why do people even care for buzz. is it just because they can't gen on their own machine? does it let you train LORAs?
>>
>>107944843
it does let you train loras and gen
I used it to train and it is legit, it came out great specially because of the high batch value
>>
>>107944843
>does it let you train LORAs?
Yes. You can also cash it out if you're a creator, but since you need to buy a subscription to become a creator it's usually not worth it.
>>
>>107944855
But high batch value degrades lora quality.
>>
>>107944877
Quite the opposite.
>>
>>107944959
nta but debatable.
>>
sure is lots of mysticism around AI.
>>
Please be good
https://qwen.ai/blog?id=qwen3tts-0115
>>
File: ComfyUI_temp_hrmad_00019_.png (2.69 MB, 1184x1664)
2.69 MB
2.69 MB PNG
>>
File: ComfyUI_temp_hrmad_00022_.png (3.24 MB, 1600x1280)
3.24 MB
3.24 MB PNG
>>
>>
>>107944963
im sure there was lots around fire too before people learned wtf it is and how it really works
>>
any good rule 34 or hentai artists that you use these days?

or any artist blends?

right now I'm jumping between {happoubi jin, solo, curtian background, white background|by kiriyama taichi, watercolor (medium), traditional media, pink background|yukimi, orange background, simple background|homare \(fool's art\), dark background}
>>
>>107944959
According to Google:

>High batch values decrease LoRA quality because they average the unique features of training images, leading to a loss of fine details and reduced training accuracy. Larger batches produce smoother, less "noisy" gradients, which, while faster, can result in the model getting stuck in local minima and failing to capture the nuance required for high-quality, specialized outputs.
>>
File: 1751526222636339.png (3.2 MB, 1248x1872)
3.2 MB
3.2 MB PNG
>>
>>107945100
>rule 34
No one of consequence trains on r34 because the site is "poorly tagged".
>>
>>107945112
>trusting the google AI overview
Most of what it said is wrong.
>>
>>107945125
yeah but if you were to make a pinup right now who would you prompt?
>>
>>
>>107945161
Post your source then.
>>
>>
>>107945225
> bri'ish slags
>>
>>107945261
give them bitches samurai swords. make em fight
>>
>>107945205
John William Waterhouse
>>
>>
>>107945225
Disgusting.
>>
>>107945324
>>
>>107945318
>exit your hotel room
>see this
wyd?
>>
>>107945303
nice
>>
>>
>>107945112
If you have gradients that pull into opposite directions you just have to adjust your training settings, you don't have to average these despite using large batch size.
>>
>>107945242
Most of this shit is stuff an experienced practitioner just knows. Some of it can be looked up in papers but I'm not doing that at fucking midnight on a weekday. Here is my response to the points, deep dive it yourself and you'll see that I'm right.
>High batch values decrease LoRA quality
No. Many examples of high quality loras trained on batch sizes of 100+. Lora training speed scales almost linearly up to at least several hundred batch size, full fine tunes up to several thousand.
>they average the unique features of training images
Averaging gradients of individual examples doesn't somehow average the "features" of individual images.
>leading to a loss of fine details
If anything it does the opposite. High batch sizes can act similarly to low learning rates, allowing models to learn fine details better rather than just bouncing the weights around noisily.
>reduced training accuracy
AI slop statement, this doesn't mean anything
>Larger batches produce smoother, less "noisy" gradients
True. It's a better approximation of the full batch gradient.
>which, while faster
Unclear what this means. Larger batches do reach the same level of quality faster in wall-clock time if they are leading to better hardware utilization (which usually they are).
>can result in the model getting stuck in local minima
Impossible. Your lora dataset is small. The model can memorize it perfectly if trained long enough on low or high batch size. There are billions of directions in parameter space to move in, you're telling me ALL of them increase the full batch training loss? (i.e. you're at a local minimum)
>failing to capture the nuance required for high-quality, specialized outputs
AI slop writing this doesn't mean anything
>>
File: AniStudio_00371_.png (1.27 MB, 800x1296)
1.27 MB
1.27 MB PNG
>>107940488
> this is nice. how did you make this?
>>107938596
>>
>>107945477
can I get the outfit prompt on that?
>>
>>107945486
It's edit too.
>>
>>107945425
>source: trust me
>>
>>107943622
it was all tested with distill
>>
>>107943597
i also trained 9b with 1024, rank32 and fp8 for transformer and te. this should give me very good results but they are trash.
zit doesnt even have base and outperforms klein massively when it comes to character loras. i really want f2k to be good but it looks like there is something going on. either the training code sucks or they poisoned the model
>>
>>107945564
You should trust anon.
>>
File: BptVE1JIEAAA3dT.jpg (23 KB, 600x405)
23 KB
23 KB JPG
>>107945672
but I'm anon, why would I trust myself?
>>
File: img_00103_.jpg (532 KB, 1760x1336)
532 KB
532 KB JPG
>>
File: img_00112_.jpg (402 KB, 1624x1328)
402 KB
402 KB JPG
>>
Me on the right.
>>
>>107945769
me on the left (anon's my big cutie bf :3)
>>
>>107942550
>>107940311
fucking disgusting
>>
>doing svi pro 2 shit
>offloading all the models so i can maximize video quality
>1080p 81 frames
>53%vram used in both high and low

The fuck is going on?
>>
>>107942831
Thanks mang!!

Any outpaint wf for klein guys?
>>
>>107945425
>Impossible. Your lora dataset is small. The model can memorize it perfectly if trained long enough on low or high batch size. There are billions of directions in parameter space to move in, you're telling me ALL of them increase the full batch training loss? (i.e. you're at a local minimum)
It is absolutely possible to get stuck in suboptimal solutions, especially with small datasets. In fact, small datasets are prone to overfitting. Basically training loss is zero, but the model fails to generate good outputs.
>>
File: Flux2-Klein_00077_.png (2.93 MB, 1136x1840)
2.93 MB
2.93 MB PNG
>>
>>107942831
no way to reduce denoise?
>>
>>107942890
>triggerword
you mean her name spelled normally... right? please say yes
>>
>>107945971
yes you idiot
stop looking for excuses
>>
Newbie here, anyone got any prompts I can try out to learn?
ComfyUI + Z Image Turbo
>>
>>107945826
This is why you up the resolution for small dataset and use warm restarts
>>
>>107946009
Check civitai, filter images by checkpoint and scroll past 10 first pages for something else than porn
>>
>update njewdia drivers
>inference got slower by 20%
thanks
>>
>>107946009
Z image turbo is piss easy to prompt, you don't need anyone teaching you how to do that. Ask again when you're prompting a real model like Chroma, Klein 9b base, Illustrious, etc.
>>
>>107946009
1girl
>>
>>107946009
1girl

try empty latent 1280x720 and 720x1280
>>
>>107946009
close up of a redhead girl with blue eyes with her tounge out making the peace sign, behind her a neon sign "WBC only", Background is her bedroom.
>>
>>107945988
just checking
>>
File: temp1.png (1.15 MB, 1025x512)
1.15 MB
1.15 MB PNG
>>107946067
>>107946075
>>107946090
>>107946038
>>107946059
ComfyUI + Z Image Turbo
Resolution: 512x512
1050 Ti (4 GB VRAM) + 16 GB RAM
Left: 223 seconds
Right: 260 seconds
>>
>>107946224
>1050ti
BRO
>>
when ready

>>107946229
>>107946229
>>107946229
>>107946229
>>
What would be the essential loras for wan 2.2?
If i have Nudes in mind but not necessarily porn?
>>
>>107946240
Just use smooth mix and turn off your brain. Anyone saying otherwise is selling you snake oil.
>>
>>107939880
In case you’re wondering why these trolls won’t stop infesting this thread: their dead general has been dead for a week.
No one wants to post in that shithole anymore, so here they are.
>>
>>107946240
there's a nude posing lora on civit that works pretty well. I don't know about essential but as far as utilitarian loras there is the FFGO lora which lets you use reference images in the first frame and gen a video based on those references.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.