[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (877 KB, 3264x3264)
877 KB
877 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102265952

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/pol/uncensored+ai
>>
File: file.png (1.47 MB, 896x1152)
1.47 MB
1.47 MB PNG
>>102273818
Again, feels sanitized as fuck. I'm going back to XL.
>>
Blessed thread of frenship
>>
File: ComfyUI_33452_.png (853 KB, 1024x768)
853 KB
853 KB PNG
https://mega.nz/folder/mtknTSxB#cGzjJnEqhEXfb_ddb6yxNQ (inupool folder)
A lora for huge-eyed, extremely precious goodness. Based on this artist: https://xcancel.com/1nupool/media
>>
File: ComfyUI_33446_.png (947 KB, 1024x768)
947 KB
947 KB PNG
>>
File: ComfyUI_33455_.png (852 KB, 1024x768)
852 KB
852 KB PNG
>>
Anyone else noticed that the higher size loras perform better? 1GB lora will create better hands or more realistic images compared to a 20MB one
>>
>>102273891
what's the deal with anon being so weary on sharing their shit on huggingface or civitai? in 12 hours, everyone will have forgotten about your lora on the next thread
>>
>>102273932
desu I thought it would be the opposite, bigger loras means more weights raped and in consequence more broken anatomy, worse prompt understanding...
>>
>>102273938
4chan is an anonymous site and HF and Civit are not?
>>
>>102273951
>4chan is an anonymous site
unless you're sharing your file via a proxy, you're not anonymous at all to the eyes of the jannies
>>
>>102273965
That's clearly not my point you retard
>>
https://files.catbox.moe/m8fxbz.png
>>102273932
What determines if a LoRA ends up larger or smaller? More steps? More images? Both?
>>102273951
Just ignore him.
>>
>>102273970
oh so you're a 2 digit IQ who doesn't know what he's talking about then, got it
>>
>>102273984
Oh does HF and Civit allow for anonymous file uploading or do you have to have a profile?
>>
>>102273845 #
>6.24gb.
Some flux models are this size. Nf4 in particular iirc. Fp8 is 16gb, some other typical custom flux models I try are 11gb. That model is states it is an nf4 model.
Just tried it put
>Needs clip dictionary, t5 and vae
>using detected unet type: nf4
It is a flux model, nf4 type
https://files.catbox.moe/o67108.png
>>
File: file.png (471 KB, 968x832)
471 KB
471 KB PNG
>>102273951
>4chan is an anonymous site
perfect goy, keep using our talking points, you're doing great goyim
>>
>>102274006
Feel free to start posting with a username instead of "Anonymous".
>>
File: file.png (2.06 MB, 896x1152)
2.06 MB
2.06 MB PNG
Dealing with the resident nogen retard is very easy. If a post disparages or antagonizes another with no apparent reason, and especially if it's disproportionately vicious, ignore it.
>>
File: file.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
File: 1706403181267.jpg (403 KB, 1024x1024)
403 KB
403 KB JPG
>>
>>102273949
>bigger loras means more weights raped
loras usually are bigger because of a bigger rank, rank doesn't change how many weights are modified, lower rank means less precise changes to the same number of weights
>>
File: ComfyUI_33464_.png (845 KB, 1024x768)
845 KB
845 KB PNG
>>102273938
I'm fine with that, I'm making them for myself, and I don't want to share them with a large audience because it always sucks when the artist finds out about his lora.
>>
>>102274046
I thought it was more weights because the bigger the rank, the bigger the file
>>
>>102274001
Ignore me I accidentally put the wrong model on lmao
>>
>>102274052
>it always sucks when the artist finds out about his lora.
I'm the artist in question, you dissapointed me anon :(
>>
>>102273949
>>102273972
The higher size ones user higher resolution images for training?
>>
>>102274059
Nope, the blocks the lora affects and the rank are two separate things. Say rank 1024 allows you to have a new value for each parameter, at rank 512 one value in the lora affects two parameters in the model, and so on. It gets less precise but it affects the same number of parameters.
>>
File: ComfyUI_00255_.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
>>
>>102273938
calm down anon just download it
>>
>>102274112
Oh I see, that's really interesting, and now it's logical why low rank loras mess up with the vanilla model more, it gives the model a lot of "bad" replaced weights due to less precise ranks
>>
>>102274136
nice try glowie
>>
>>102274001
>>102274063
>>102273845
Here's the right gen, still an nf4 flux model
https://files.catbox.moe/g6mo51.png
It's almost the same, also did the same settings on flux dev fp8
Flux fp8: https://files.catbox.moe/640njv.png
Which is almost identical again.
>>
how soon until we start clockwork orange training AI?
>>
File: 1722356174679.jpg (343 KB, 1024x1024)
343 KB
343 KB JPG
>>
>>102274039
Nice
>>
File: 1711532706587.jpg (424 KB, 1024x1024)
424 KB
424 KB JPG
>>
File: Flux_04423_.png (670 KB, 1024x768)
670 KB
670 KB PNG
>>
>>102273949
Bigger loras means more regularization images could have been used to stop the raping of the weights (like, their hymen is reconstructed.)
>>
File: Flux_04438_.png (686 KB, 1024x768)
686 KB
686 KB PNG
>>102274563
>>
>>102273999
Must have a profile.
But doesn't MEGA also force you to have an account when uploading?
>>
>>102273972
network dimension (rank) but you can resize the lora to drop out weights that have tiny values that won't do anything but just take up size in the actual file
>>
>>102274080
Stop pretending to be me.
>>
File: 1701854485432019.png (1.17 MB, 811x1141)
1.17 MB
1.17 MB PNG
https://files.catbox.moe/c7oidi.png
i retrained my lying down LORA from yesterday with new tags by joytagger and i've gotten much better results. still suffers from the flux nipples though.
>>
File: Flux_04465_.png (1.62 MB, 1080x1440)
1.62 MB
1.62 MB PNG
>>
>>102274764
outside of nipples, why does Flux boobs all look AI generated and generic
>>
File: 1697624420933.jpg (440 KB, 1024x1024)
440 KB
440 KB JPG
>>
File: 00000-1010803634.png (1.05 MB, 896x1152)
1.05 MB
1.05 MB PNG
>>
File: 00002-2088734012.png (1.28 MB, 896x1152)
1.28 MB
1.28 MB PNG
>>
File: tmpua9mljyk.png (1.3 MB, 1280x896)
1.3 MB
1.3 MB PNG
>>
File: 00010-3923971496.png (1.11 MB, 896x1152)
1.11 MB
1.11 MB PNG
kathy bates as doom guy
>>
File: 00015-481572465.png (1.1 MB, 896x1152)
1.1 MB
1.1 MB PNG
>>
File: 00008-2027942660.png (1.04 MB, 896x1152)
1.04 MB
1.04 MB PNG
>>
File: 00027-3832656635.png (853 KB, 896x1152)
853 KB
853 KB PNG
>>
File: Flux_04524_.png (1.02 MB, 1080x1440)
1.02 MB
1.02 MB PNG
>>
File: Flux_04520_.png (1.18 MB, 1080x1440)
1.18 MB
1.18 MB PNG
>>
Hires fix is doing wonky shit. Is it because I'm using a LoRA?
>>
>>102275191
try lowering denoise
>>
>>102274807
Because they didn't train on real nudity but on AI generated nudity.
Er
*air quotes* nudity *air quotes*
>>
File: ComfyUI_06214_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
https://imgsli.com/Mjk1MDA2/0/1
>AutomaticCFG burns the image too much but gives great prompt understanding
>SkimmedCFG burns the image way less but it's too close to CFG 1 in terms of prompt understanding
sigh...
>>
>>102275240
Why not just use both at the same time?
>>
>>102275224
That was it, thanks.
>>102275235
The absolute state of Flux.
>>
File: tmpq_i7kmb2.png (1.35 MB, 1152x896)
1.35 MB
1.35 MB PNG
>>102275270
You're welcome.
>>
File: 000000_17401_.png (677 KB, 516x754)
677 KB
677 KB PNG
>>
File: file.png (103 KB, 642x1705)
103 KB
103 KB PNG
>>102275261
it's giving me this, a bit more burned than SkimmedCFG, but not quite the prompt understanding of AutomaticCFG, thanks for the suggestion though, could've worked, I guess I have to look at the schizos parameters from AutomaticCFG and see if I can make it less burned or something
>>
File: file.png (2.45 MB, 1024x1024)
2.45 MB
2.45 MB PNG
>>102275331
>it's giving me this
oops
>>
>>102273821
Is there anything better than SD yet?
>>
>>102275515
your mom
>>
File: file.png (42 KB, 1077x212)
42 KB
42 KB PNG
>>102275331
This is what I'm currently using, CFG at ~8-10. Adaptive guider doesn't work well with them together, but even without it I can manually control when to shift back to CFG 1 with uncond_sigma_end. 0.5 = last 50% steps are generated at CFG 1, 0.3 = last 30% steps and so on.
As you can see yourself >>102275331 the dev of these both is extremely autistic so there's a fuckton of skimmed/auto cfg node variants, each has its own stuff so someone (preferably not me) has gotta test it all out.
>>
>>102275515
Flux, Pony
>>
>>102275542
No one askled an underaged summerfag nigger blow-in.
>>
>>102275559
Interesting, I'm gonna test your settings out, thanks anon
>uncond_sigma_end. 0.5 = last 50% steps are generated at CFG 1, 0.3 = last 30% steps and so on.
does that also mean you'll get the cfg = 1 speed at the last 50% steps?
>>
>>102275565
>Pony
pony is a SD model though
>>
>>102275602
>does that also mean you'll get the cfg = 1 speed at the last 50% steps?
Obviously, yeah. Also if I go all the way with full CFG it ends up looking grainy and kinda fried, so last steps really need to be at cfg 1.
>>
>>102275624
Is there a reason you went for "Skimmed_CFG" 2.0? the goal is to mimic cfg 1 no?
>>
>>102275615
Well, it's based on it, but it's been autistically finetuned enough to consider it a new branch, as compared to SDXL finetunes for example. It's in it's own category for a reason, even if it share the architecture. The next model is supposed to release on either auraflow or flux, so there's also that.
>>
>>102275559
Bro, you adding way too much shit into your workflow that barely changes anything in your gen, both backgrounds look bad, and you’re probably scratching your head playing with values , sometimes less is more, that skimmed automatic cfg is snake oil
>>
File: ComfyUI_00262_.jpg (845 KB, 2112x1584)
845 KB
845 KB JPG
>>
File: 3631547020.png (1.16 MB, 1344x768)
1.16 MB
1.16 MB PNG
>>
File: file.png (677 KB, 2069x442)
677 KB
677 KB PNG
>>102275656
>that skimmed automatic cfg is snake oil
not true, you can get better prompt adherance if you know what you're doing
>>
>>102275667
They're all the same picture.
>>
File: file.png (136 KB, 1072x477)
136 KB
136 KB PNG
>>102275644
I'm literally in the middle of playing around with it, yeah cfg 1 looks best but higher values make the foreground a bit sharper (but fuck with the background). I may or may not post my results later.
>>102275656
H-haha yeah...
>>
>>102275689
are you blind or something? read the prompt, cfg 1 refused to acknoledge Hatsune Miku or the sushis
>>
>>102275694
Is that Hifumi from persona5?
>>
>>102275667
I’ve seen you for weeks posting here and on reddit about those crappy custom nodes and those gens don’t look any different than the rest, I just don’t see the point, all it has added is confusion in your head, there are so much other stuff that you can test on, stuff that actually relates to flux, the funny thing about those skimmed automatic cfg nodes, is that they were made for sdxl and yet you keep try to make them work for flux
>>
>>102275835
>those gens don’t look any different than the rest
>>102275708
>>
>>102275667
>muh prompt adherence

How about you learn to properly write a prompt? You just got proven wrong a few days ago with your simpsons style flux theory
>>
>>102275871
https://www.youtube.com/watch?v=vgk-lA12FBk
>>
File: 00036-2110420122.png (1.3 MB, 896x1152)
1.3 MB
1.3 MB PNG
>>
>>102273910
ani! <3
>>
File: 00047-1106882351.png (1018 KB, 896x1152)
1018 KB
1018 KB PNG
>>
File: 1698541381642996.jpg (25 KB, 303x336)
25 KB
25 KB JPG
What's the difference between /ldg/ and /sdg/?
>>
File: 00052-3244945825.png (1.07 MB, 896x1152)
1.07 MB
1.07 MB PNG
>>
>>102274807
I actually got some good ones a few times on standard flux dev fp8, it is heavily reticent about nipples usually, often you can nip slips out of a bra before full exposed.
>>102275655
Flux dev pony?
I'm ready
>>
>>102276009
before there was a difference, the schizos were on /sdg/ and /ldg/ was more like a comfy place but now it seems like they decided to hijack this thread aswell, so there's not much difference anymore
>>
File: fs_0028.jpg (50 KB, 768x768)
50 KB
50 KB JPG
>>
>>102276019
>Flux dev pony?
They mentioned it as a secondary option though. Time will tell.
>>
>>102276027
0/10
>>
>>102276068
thanks for proving my point anon
>>
>>102276009
fuck around and find out
>>
>>102276019
>Flux dev pony?
No, schnell, because Flux dev pony won't get them any money. And it's on schnell only if the Auraflow version sucks.
>>
File: received_515781400834140.png (1.25 MB, 896x1152)
1.25 MB
1.25 MB PNG
>>102276082
Nobody can still answer how a few people are blessing sdxl and flux. But starting to see some flux dev models that are different from the base model
>>
>>102276108
>But starting to see some flux dev models that are different from the base model
such as?
>>
File: 00048-4053174991.png (1.03 MB, 896x1152)
1.03 MB
1.03 MB PNG
>>
File: tmpj3v1c4j2.png (1.27 MB, 1152x896)
1.27 MB
1.27 MB PNG
>>
>>102276108
What if they're just generating images with SDXL and using them to finetune Flux and they call that "blending"?
That would be hilarious.
>>
>>102276121
Kestral anime one, quite substantially different output. There's a couple NSFW ones that (Inc kestral) that are less inhibited than stock flux, nothing like pony though. Just look filter for flux dev checkpoints on civitai
>>
File: ComfyUI_00267_.jpg (1016 KB, 2112x1584)
1016 KB
1016 KB JPG
>>
>>102276149
https://civitai.com/models/697877/kestral-flux-anime-by-zovya
>This is the unet only at fp8 to save on download times.
I hate when they do that, fp16 or bust, I don't care about fp8 anymore since Q8_0 exists, and the GGUF node is the only one that doesn't unload/reload the model during a lora change
https://github.com/city96/ComfyUI-GGUF/commit/69f0daffc5ed1c4721892ca503bffa9e7d6ebfbf
>>
File: 00077-1423906614.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>
>>102276138
Possibly but they say they've blended specific models. At least in the case for dx hybrid which I'm trying now. They still give flux quality outputs so as long as that's the case I'm not concerned with how they enhance what flux can do
>>
File: 00078-3571889571.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>
File: 000000_17404_.png (2.15 MB, 1032x1508)
2.15 MB
2.15 MB PNG
>>
>>102276009
Flux is supposed to be offtopic on /sdg/ because it's not a stable diffusion based model, it's akin to posting Dalle 3 generations over there.
Since they're not enforcing their own rules, it's... yeah, some users only post here and others only post there, and that's the difference.
>>
>>102276188
Gguf takes 2+ min to load for me, fp16 was faster to load. I find it thoroughly annoying because I model hop a lot. A lot of the flux models on civit are nf4, but I've still found them to be good, and well beyond any xl or pony model in terms of complexity like concepts, interactions with objects, etc.
>>
How many images do you actually need to fine-tune flux?
>>
File: 00090-2867319062.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
>>102276255
about a tree fiddy
>>
>>102276188
Also, it's Zovya, the guy that ruined ReVAnimated by merging it with Dreamshaper 8 to get rid of the unique things it had going for it on the Rebirth version.
His only good model was AZovyaArtistsToolsV2Art and he didn't ever update that one again.
>>
>>102276241
>Gguf takes 2+ min to load for me
that's weird, should be fast with mmap (it's slow the first time and after that it's blazing fast)
>>
File: tmp42w320go.png (1.1 MB, 1152x896)
1.1 MB
1.1 MB PNG
>>
>>102276279
You mean slow after the first ever load then fast even if I switch models and back? Or slow every time I switch models and it has to be reloaded?
>>
>>102276197
That's a very strong claim without showing us them side by side in a comparison.
You're claiming Flux has already been obsoleted with that method and people still using vanilla Flux are retarded for not moving to the enhanced versions.
>>
>>102276295
yeah if you switch models back it becomes slow again, maybe if you disable mmap it would make it better, but I have no idea if it can be disabled in the first place
>>
File: ComfyUI_Flux19.jpg (3.79 MB, 3840x2160)
3.79 MB
3.79 MB JPG
>>102276255
More than there are stars in the sky.
>>
>>102276273
So funny to see all those sdxl “finetuners” getting exposed by flux
>>
File: ComfyUI_Flux_76.png (774 KB, 1216x832)
774 KB
774 KB PNG
>>102275768
Yep.
>>
>>102276333
this, now the level is way higher, they cannot make half assed finetunes anymore and call it a day, that's the moment we'll see who were the real talented finetuners and the frauds
>>
File: 00110-2226707175.png (1.35 MB, 896x1152)
1.35 MB
1.35 MB PNG
>>
>>102276336
>not shogi
>>
File: 00015-3080833300.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>
>>102273821
what's the best model currently to make porn anime picture with?
>>
>>102276255
4000 should be enough, 2000 to add the knowledge/styles you want and 2000 to regularize it so it's not like merging a giant Lora into it.
When finetuned was implemented originally, the idea, back when people released 7GB models of Stable Diffusion because they had the full ema in there for training, the idea was to have a collaborative effort where someone would finetune the model to improve it, and then someone else would finetune THAT version to improve it, and you keep getting better and better models.
Stability AI was supposed to look at the best such finetune of added improvements, add their own improvements, and release a SD1.6 that obsoleted SD1.5 (note no proper model of that architecture ever obsoleted it, people still have to train their loras on it.)
Instead, they dropped the ball with SD2.0 abandoning all the work, and setting a trend where you know your loras and finetunes will never work on future models.
And then they released SD1.6 which actually outperforms SDXL on blind tests, but they kept it API only and nobody can download it.
If we pick up that idea again, someone could make a finetune that obsoletes Flux, by, say, being identical but producing women without buttchins, and then people would finetune that one instead of the original Flux.
And such a thing could be done with 2000 images, the problem is it needs a good dataset and a person that properly tags it, and the only person that has done such a thing insists on removing artist names from the prompts.
>>
>>102276416
>4000 should be enough
do we know how many pictures were used to pretrain a model, what I know is that the SAI cucks filtered 98% of Laion 5b to pretrain Stable Cascade, so basically they were using 100 millions of pictures to pretrain that model, that's the only number I got, if someone else can help on that that would be appreciated
>>
>>102276416
False, flux is a distilled model meaning that those 2000 to 4000 images would overfit the model making it useless, might as well train a lora
>>
>>102275952
*is garbage
>>
>>102276345
>they cannot make half assed finetunes anymore
It's funny most of them never finetuned anything and became famous by just merging models from others.
Elldreth was the worst offender and perhaps he deleted all his models to avoid being found out, people were mentioning what he merged on their reviews ("oh, I wished Eldreth has merged this model instead of that one"), and if the model is gone, nobody can ever know.
>>
>>102276297
This Is a dx hybrid (flux model) output, it specifically draws symbols, text etc we'll, which XL does not.
>>102276108
>>
>>102276408
Yiffymix was never surpassed on the fine details and creativity department (say, you send "1girl", it does wonders just with that.)
>>
File: 00048-2027763283.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
File: file.png (61 KB, 315x860)
61 KB
61 KB PNG
ok there that's bigger than laion. preparing the update now :)
>>
>>102276453
Nobody knows, it's a black box. But that was the heavy weighting, it's already done, from there you don't need as many images, you just add what is missing.
>>
>>102276480
>Flux can't be trained.
You are here.
>Flux can't be improved because it's a distilled model so just train a Lora.
>>
>>102276595
Cope, training =/= creating a shitty lora
>>
File: received_534511192593574.png (1.15 MB, 1152x896)
1.15 MB
1.15 MB PNG
>>
File: 2024-09-07_00173_.png (1.44 MB, 832x1216)
1.44 MB
1.44 MB PNG
>>
>>102276511
>, it specifically draws symbols, text etc we'll, which XL does not.
We know all Flux based models make XL look like Craiyon, the question is if there's any model out there that is better than Flux.
>>
>>102276524
Is there a link online to try it out? My computer fucking sucks
>>
>>102276570
>you just add what is missing.
I'm not gonna sound pessimistic, but Flux misses a lot of shit, it doesn't even know Picasso's work
>>
>>102276617
>Cope
I mean people have been talking and talking about what is being impossible to do and have always been proven wrong, so good luck being the first being right.
The portrait finetune of Flux could have been it if they trained it properly, the only reason we don't have it is because of their incompetence.
>>
>>102276642
No, they're still flux models, and the enhancements added in the custom models may be lesser than stock flux but better than XL and you can do stuff that's much harder in XL etc. So in a way it's better if it's something that you can't do in stock flux because it's either censored or something it doesn't know
>>
>>102276657
Sure:

https://huggingface.co/Yntec/YiffyMix

Type your text in the box and click Compute and hope you don't get an unknown error. You can't customize anything, though.
>>
>>102276674
Have you heard about some Blitzball game on some obscure version of Final Fantasy? Flux can't do that one either :(
>>
File: 796658016.png (962 KB, 832x1216)
962 KB
962 KB PNG
>>
>>102276707
Can I use googlecolab or something and run it with automatic 111? I remember doing something similar with Stable diffusion a year or so ago
>>
>>102276680
>if they trained it properly
Almost as if it can't be trained properly
>>
>>102276719
>He called FFX obscure
>>
>>102276680
Well, if flux is so trainable where are the super finetunes then?
>>
>>102276779
Didn't it only just come out? How long did good sd1.5 models take to come out and it's far simpler and less needy on hardware. Meanwhile Nvidia still sticking with 16gb on the 5080.
>>
>>102276779
wait you expected some super finetunes after 1 month of its release? We didn't even have the training script
>>
any flux news within the last week or so? new controlnets, finetunes, anything?
>>
>>102276793
The awportrait model is not a finetune but a lora merge, it just rendered your whole 4000-2000 images is enough theory invalid since that’s the amount of images they used for their super dooper fine tune that overfitted the model, also stop comparing 1.5 with flux they are totally different models
>>
>>102276825
What theory? Where are you getting that from?
>>
>>102276727
Yes, and no.
Yes, because you can go to a space like this: https://huggingface.co/spaces/Yntec/Noosphere-Webui-CPU go to the Extension Tab, install the Batchlink Downloader extension, download Yiffymix on there, and use it on the space.
An no, because it's CPU inference and you'd need to wait for 20 minutes to gen a 512x512 image at 20 steps.
They have it like this because if you want GPU you need to buy a pro account.
Something I've never seen is an anon on 4chan with a pro account creating a Zero GPU space of a model on request, since they are FREE (once you buy pro...)
>>
>>102276850
he saw that on a dream, the same dream that relevated to him that Loras on Flux were impossible because "iz distilled therefore impossibru"
>>
>>102276743
They didn't use a single regularization image to make sure it didn't forget how to draw.
Because all they cared about was portraits, fuck everything else.
>>
How do I into flux on forge, just take off --xformers and im good to go? Aside I found out you can use XL loras on pony and it works alright
>>
>>102276850
>>102276887
Just scroll up, one of your cope anon friends say it , not me :)
>>
File: 000000_17409_.png (1.9 MB, 1032x1508)
1.9 MB
1.9 MB PNG
>>
>>102276777
By now it's as obscure as Super Mario Sunshine, the number of people in the world that had ever played it is decreasing.
>>
what models are you guys using? I use the pony models for pretty much everything, if it's not porn I sometimes us flux but the pony models can handle most of my needs
>>
>>102276793
>How long did good sd1.5 models take to come out
Depends on your definition of good, some would say it was Novelai, but that was never supposed to be released, it's like hoping someone makes a godly finetune of dev for personal use and it's leaked.
>>
File: OkZoomer.png (45 KB, 1177x662)
45 KB
45 KB PNG
>>102276912
>>
>>102276931
These newfags weren’t there when we had the waifudiffusion model
>>
>>102276823
Yeah, people have been blending it with SDXL and this anon claims they're better than stock Flux.
>>
>>102276887
Oh man, the voice I heard on my head reading that made it sound very funny!
>>
>>102276926
Depends, if I want to draw a specific picture I have in mind, I use Flux.
If I want some interpretation of the prompt on the creative side, I use Kolors.
>>
I've just been using Euler simple 20 steps for flux at default of
Cfg scale 1
Distilled cfg scale 3.5

Any tips?
>>
>>102276939
its a troll, likely an llm bot, stop getting baited
>>
>>102277061
you can try higher CFG if you want better prompt understanding, it'll be slower though
https://reddit.com/r/StableDiffusion/comments/1eza71h/four_methods_to_run_flux_at_cfg_1/
>>
>>102276951
Last time I used Waifu Diffusion it gave me garbage, though. It didn't age well, something like Nuipeni 2 blows it out of the water.
>>
>>102277061
>Any tips?
If you're not drawing text scale 3 is better.
>>
what's the point of models like this? why bring the ai face back to flux
>https://civitai.com/models/118111/copax-timelessxl
>>
File: buzz.png (9 KB, 881x87)
9 KB
9 KB PNG
>>102277099
It's now done for buzz farming.
Buzz was a mistake and now people doing it for the love of art are buried by the farmers.
It creates the illusion of paying with virtual currency, so it's still free, so people will pay for it with that, and the images produced by the models are no longer important, but how much buzz you can extract from losers.
>>
>>102277099
like blender porn, people have jerked off to pony so much their brain has rotted and now they find that look attractive
>>
>>102276009
This thread is worse, that's the difference.
>>
>>102277147
>Buzz was a mistake and now people doing it for the love of art are buried by the farmers.
the civitai dev don't give a fuck unfortunately, they make a shit ton of money from those retards
>>
File: SweetSpot-min.jpg (3.65 MB, 8146x3068)
3.65 MB
3.65 MB JPG
Oh man, I thought cfg 6 was enough, but now I have to crank this shit up even higher, I definitely have to look at the more advanced parameters of AutomaticCFG to see if I can fix the burn or some shit
>>
>>102277151
Huh? You could have said that about hentai, cartoon girls without lips on their mount, a poor nose and oversized eyes.
And hentai models are more popular than realistic ones.
>>
>>102277171
>they make a shit ton of money from those retards
And I thought they were going to die with the vac, but apparently retards are resilient.
>>
File: kys4lyfe.png (18 KB, 876x66)
18 KB
18 KB PNG
>>
>>102277350
what's your prompt anon?
>>
>>102276269
Jesus anon this is really good
I’m just a tourist but can you tell very briefly what you made this sith? Program, database etc
>>
why does nobody ever compliment my 1girls, I make hundreds
>>
Anyone has experience with the flux template on vast ai (cloud gpus)? Comfy can't seem to find the checkpoint
>>
>>102277486
this is a local thread anon
>>
>>102277496
Yeah, it's a local install on a cloud environement that's why I'm asking here
>>
>>102277486
I haven't used comfy on it, but I found on runpod when training loras I sometimes had to delete the folder it was supposed to be in, remake a folder with the same name, then put the model in it. not sure if it
example for your usecase:
>on comfy delete the 'unet' folder
>make a new 'unet' folder
>move flux model into it
>>
>>102277586
>not sure if it
not sure if it is the same on vast gpus*
post ate my sentence kek
>>
>>102277586
>>102277595
Ok I'll try thanks anon. I'd rather use runpod but they decline my card every time and I have no btc
>>
>>102277496
>this is a local thread anon
Thread is on external server
>>
File: 0.jpg (353 KB, 1024x1024)
353 KB
353 KB JPG
>>
File: 0.jpg (103 KB, 1024x1024)
103 KB
103 KB JPG
>>
For those using joycaption, is it possible to use a guuf LLM with it ?
>>
File: 000000_17414_.png (2.42 MB, 1032x1508)
2.42 MB
2.42 MB PNG
>>
File: 2024-09-07_00240_.jpg (625 KB, 2496x3648)
625 KB
625 KB JPG
>>
>>102277971
truth
>>
>>102277971
>creatures on flux be like
>>
File: 2024-09-07_00244_.jpg (755 KB, 2496x3648)
755 KB
755 KB JPG
>>
>>102277627
runpod is significantly more expensive usually, so vast is probably better overall. I just used runpod because I had no idea what I was doing and thought it was easier to have the jupyter thing, now that I understand it a bit better I'd recommend vast price wise
>>102277948
yes, just clone the repo and change the "model=" in the app.py to whatever you want
>>
>>102278139
God bless you anon
>>
File: 2024-09-07_00249_.png (1.25 MB, 832x1216)
1.25 MB
1.25 MB PNG
>>
File: 0.jpg (144 KB, 1024x1024)
144 KB
144 KB JPG
>>
File: 1722050955941286.png (7 KB, 147x153)
7 KB
7 KB PNG
let me guess, you need more?
>>
File: 00000_17416_.png (2.23 MB, 1032x1508)
2.23 MB
2.23 MB PNG
>>102278008
>>102278067
>GGUF-Q8_0
>>
i finally caved in and bought a 4090
now, is a 850W PSU good enough?
>>
>>102278428
yeah you're good with that anon
>>
>>102278432
ok, thank you
will probably need a bigger case though
>>
File: 0.jpg (137 KB, 1024x1024)
137 KB
137 KB JPG
>>
>>102278428
Yes, should be fine if you didn't buy some XXX GIGA OC model which has its stock power limit pushed up to some insane value.
>>
File: 00061-2708198544.png (1.18 MB, 832x1216)
1.18 MB
1.18 MB PNG
>>
>>102273821
Is it possible to make LORA"s for Flux yet?
>>
>>102278772
it's never gonna happen, stop asking
>>
File: 0.jpg (438 KB, 1024x1024)
438 KB
438 KB JPG
>>
File: 1708901837711042.jpg (190 KB, 1024x1024)
190 KB
190 KB JPG
>>102278805
great, so we are still stuck with shitty stable AI that outputs half baked slop that you always have to inpaint or edit to make look good.
>>
>>102278838
no different than yesterday, or tomorrow
>>
File: 1694919633494.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102266122
So my issue was that the inpainting controlnet uses the whole image as base and I can't control that. And raising the crop factor of the detailer so that it also uses the whole uncropped image is the only solution with this workflow.
So I'll try to add cropping and padding before the detailer but with this overcomplication now the whole idea seems pointless.
>>
>>102278772
google is your friend
>>
File: 00068-904008611.png (1.24 MB, 1216x832)
1.24 MB
1.24 MB PNG
>>
>>102278948
albino gorilla
>>
>>102274563
>>102274579
Dreamy Floating Flux Lora? Are you using anything else? Especially love that first one
>>
>>102278772
holy fuck there are like 10 new ones each hour on civitai .. how techlet can you be?
>>
File: 1703767743448201.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>
File: 1.jpg (94 KB, 1280x768)
94 KB
94 KB JPG
>>
File: 0.jpg (688 KB, 2048x1024)
688 KB
688 KB JPG
>>
>>102279125
Looks like the interior booklet design of a crust punk album
>>
File: 1.jpg (126 KB, 1440x1120)
126 KB
126 KB JPG
>>
File: ComfyUI_Flux_17.jpg (1.79 MB, 2432x1664)
1.79 MB
1.79 MB JPG
https://civitai.com/models/150094/bogdanoff-twins
>>
>>102279156
BASED, thanks for a lot for that one anon
>>
>>102279156
based migu
>verification not required
>>
>>102279156
this is perfect, I've been looking for a plastic surgery kinda of look and this should do nicely at low strength
>>
File: 1.jpg (108 KB, 1600x960)
108 KB
108 KB JPG
>>
File: 1.jpg (73 KB, 1600x960)
73 KB
73 KB JPG
>>
File: sharp.jpg (824 KB, 2048x1024)
824 KB
824 KB JPG
>>102279125
aint it too blurry?
>>
>>102279257
2nd best puddle of mud song
>>
>>102279257
How did you do that?
>>
>>102279279
>>
https://civitai.com/images/27252141

Why can I not replicate this image? I put the data into forge png info txt2img but I am getting this
https://files.catbox.moe/79doy0.png
>>
File: FLUX_00128_.png (1.67 MB, 1120x1440)
1.67 MB
1.67 MB PNG
christ
>>
>>102279300
assuming you are using the same seed, change diffusion in low bits from "automatic" to "automatic fp16" or vice versa
>>
File: 1.jpg (115 KB, 1920x1080)
115 KB
115 KB JPG
>>
>>102279327
the thing I have different seems to be

Version: f2.0.1v1.10.1-previous-495-g4f64f6da

vs

Version: f2.0.1v1.10.1-previous-516-gecb396e0

what are these?
>>
File: 00003-4119822477.jpg (719 KB, 1664x2432)
719 KB
719 KB JPG
>>
>>102274574
>>102274087
Someone needs to cl
All out how wrong both of these are.
The only thing that affects the size of the LoRA are the dimensions. It has nothing to do with the resolution of the images or the regularisation images
>>
File: 1712890455811.jpg (451 KB, 1024x1024)
451 KB
451 KB JPG
>>
File: 0.jpg (372 KB, 1024x1024)
372 KB
372 KB JPG
>>102279134
Punk rock album cover.
No linear notes.
>102279279 : How did you do that?
img2img repeated on seed image,
prompt:
by Daido Moriyama , Bauhaus

Negative prompt: little man, distant man
Steps: 20, Sampler: DPM++ 2M, Schedule type: Karras, CFG scale: 7, Seed: 609329482, Size: 2048x1024, Model hash: e6bb9ea85b, Model: sdXL_v10VAEFix, Denoising strength: 0.9, Version: v1.10.
>>
>>102273932
But how flexible are those loras?
>>
File: 1.jpg (193 KB, 1920x1080)
193 KB
193 KB JPG
>>
File: 1.jpg (128 KB, 1920x1080)
128 KB
128 KB JPG
>>
>>102276453
Pulling it out of my ass but like training a Lora you really need like 20 diverse images per subject, idea, style, concept. With a minimum of at least 500k images which would be 25,000 concepts.
>>
File: 2024-09-07_00264_.png (1.22 MB, 832x1216)
1.22 MB
1.22 MB PNG
>>102279371
as flexible as your VRAM allows .. if you cant load em, just merge em into the model
>>
File: ComfyUI_00301_.png (1.67 MB, 1024x1280)
1.67 MB
1.67 MB PNG
>>
File: 1.jpg (190 KB, 1920x1080)
190 KB
190 KB JPG
>>
File: censoredforg.png (1.96 MB, 1024x1024)
1.96 MB
1.96 MB PNG
>>
File: 2024-09-08_00003_.jpg (915 KB, 2496x3648)
915 KB
915 KB JPG
>>
>>102279337
>>102279300
anyone?
Really stuck on if I have some fucked up settings
>>
>>102279350
How do you explain the HyperSD Lora that is 1GB in size and has the same dimensions? Hmmm?
>>
File: file.png (2.06 MB, 1024x1024)
2.06 MB
2.06 MB PNG
>>102279156
really fun lora anon, it's mixing well with the
Satoshi Urushihara style one
https://civitai.com/models/7227/urushi
>>
File: 2024-09-08_00006_.jpg (1 MB, 2496x3648)
1 MB
1 MB JPG
>>
File: fs_0030.jpg (58 KB, 768x768)
58 KB
58 KB JPG
>>
>>
File: fs_0036.jpg (70 KB, 768x768)
70 KB
70 KB JPG
>>
File: Oscar Claude Monet LoRA.jpg (176 KB, 800x1170)
176 KB
176 KB JPG
>>102273821
The Claude Monet LoRA I trained looks pretty good so far
>>
Was there a booru alternative made to collect all the gens after booru.plus died?
>>
>>102280249
Go be disgusting on >>>/b/
>>
>>102280485
There's nothing more beautiful than two sexy girls tongue kissing
>>
>>102280485
cock lover from /hdg/ spotted
>>
>>102280485
if you have a problem with having 2 beautiful women kissing with each other, then I got news for you...
https://youtu.be/Zd8vzIRQLLM?t=7
>>
>>102280380
classic artist loras seem to work well as they somehow train good image composition (beyond the style). Kind of a pain to put them together though.
>>
does this work well?
https://github.com/LykosAI/StabilityMatrix
>>
File: file.jpg (2.53 MB, 2700x2719)
2.53 MB
2.53 MB JPG
>>102280534
this
>>
File: 00191-3913522346.png (1.94 MB, 1024x1536)
1.94 MB
1.94 MB PNG
>>
>>102280534
correct
>>
File: file.png (1.85 MB, 1024x1024)
1.85 MB
1.85 MB PNG
I swear to god I didn't prompt for "goy", it did this shit by itself, based flux kek
>a pulp cult anime illustration from japan,
>Bogdanoff on the phone, with a speech bubble that says "Dump it"
>>
File: 00000_17424_.png (2.51 MB, 1032x1508)
2.51 MB
2.51 MB PNG
>>
File: 2024-09-08_00060_.jpg (1.23 MB, 3840x2160)
1.23 MB
1.23 MB JPG
>>
>>102278999
peace was never an option
>>
File: 2024-09-08_00066_.jpg (1.27 MB, 3840x2160)
1.27 MB
1.27 MB JPG
>>
File: ComfyUI_33519_.png (746 KB, 736x1024)
746 KB
746 KB PNG
>>
File: ComfyUI_33520_.png (660 KB, 736x1024)
660 KB
660 KB PNG
>>
>>102281028
>>102281036
This lora is so good
>>
File: 5Flux.jpg (183 KB, 1584x1064)
183 KB
183 KB JPG
>>
>>102279755
I'm surprised how realistic this looks, and the background doesn't look weird or dreamy or disproportionate. Doesn't seem to have any nightmare creatures. The only thing that makes the subject immediately obviously fake is the skin texture from collarbone down, which seems more like some sort of plastic than skin.

Was this a recent model and was it difficult to produce this? Did you have to generate 100 mutant monsters before finding this? Or is this a typical gen nowadays and you'd probably get it even if you just generated 1 image instead of 100?
>>
>>102273891
thanks for sharing
>>
File: ComfyUI_00279_.png (3.81 MB, 2112x1584)
3.81 MB
3.81 MB PNG
>>
>>102281262
not the same anon but on various models you can get 1girl face + boob portraits in great numbers without much difficulty - even with SD/SDXL/Pony, not the newer model types like Flux or w/e which promise to be better eventually.
>>
>>102280616
IMO it works quite well, but I don't use it often vs. just ComfyUI
>>
File: 2024-09-08_00014_.jpg (1.02 MB, 2496x3648)
1.02 MB
1.02 MB JPG
>>102281262
>Did you have to generate 100 mutant monsters before finding this?
no, 8 of 10 look good

>>102281510
those are flux
>>
File: ComfyUI_00884_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
tried to get an angry karen going super saiyan
>>
File: 000000_17427_.png (2.26 MB, 1032x1508)
2.26 MB
2.26 MB PNG
>>
File: ComfyUI_00285_.png (3.16 MB, 2112x1584)
3.16 MB
3.16 MB PNG
>>
File: ComfyUI_00352_.png (1.63 MB, 1280x1024)
1.63 MB
1.63 MB PNG
>>
>>102281578
>those are flux
looks nice. easier to see in this image (detailed ear jewelry, hair strands, belt string and bg details)
>>
File: 2024-09-08_00082_.jpg (979 KB, 2496x3648)
979 KB
979 KB JPG
>>102281659
>looks nice
ty
>>102281659
>(detailed ear jewelry, hair strands, belt string and bg details)
but also sadly the insane bokeh blur.. well that's flux
>>
>>102281691

negative prompt bokeh retard
I haven't even tried flux yet
>>
>>102281721
(you)
>>
>>102281721
>I haven't even tried flux yet
then shut up, you think I haven't tried that? get a life
>>
>>102281742
>>102281745

flux gen -> mask -> whatever with negative bokeh
>>
>>102281589
but.. why?
>>
Let's get a fresh loaf of bread ready to go...
>>102281807
>>102281807
>>102281807
>>
>>102281802
because we can
>>
>>102281510
>>102281578
Well that's shocking. I'm starting to wonder if we'll still be able to tell what's real at this time next year.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.