/g/ - Technology






File: tmp.jpg (896 KB, 3264x3264)
896 KB
896 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102187084

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
ballees brgewa of cfefeiesnship DN AMAGGyah
>>
>schizo thread
>>
File: 2024-09-01_00339_.png (1.89 MB, 1024x1024)
1.89 MB
1.89 MB PNG
>>102191214
ty baker
>>
File: 👨🏿---.jpg (125 KB, 1024x1024)
125 KB
125 KB JPG
>mfw
>>
File: 00275.png (1.57 MB, 896x1152)
1.57 MB
1.57 MB PNG
>>
>>102191279
>That does seem to gel with what I've seen.
Let's not forget that the QK quants were optimised for LLMs (Large Language Models) and that there's no guarantee it would be as efficient for image models, maybe there should be an alternative to QK that is optimised only for image models
>>
File: 00010-2130684045.jpg (448 KB, 1440x1080)
448 KB
448 KB JPG
>>
Blessed thread of frenship
>>
>>102191330
>QK quants
quantization is a mathematical method that works regardless of the neural network's task .. it works equally well for LLMs as for diffusion models.. if you wanna know more read
>https://www.maartengrootendorst.com/blog/quantization/
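
For anyone who wants to poke at the idea without loading a model, here is a rough sketch of block-wise affine quantization, loosely in the spirit of the Q4_1 / Q5_1 formats (each block stores a scale and a minimum, and every weight becomes a small integer). This is not the ggml implementation: the block size of 32, the Gaussian fake weights, and every function name below are assumptions made up for illustration.

```python
import random


def quantize_block(block, bits):
    """Affine-quantize one block so that x ~= d * q + m, with q in [0, 2**bits - 1]."""
    levels = (1 << bits) - 1
    lo, hi = min(block), max(block)
    d = (hi - lo) / levels if hi > lo else 1.0  # scale; guard against a flat block
    q = [min(levels, max(0, round((x - lo) / d))) for x in block]
    return d, lo, q


def dequantize_block(d, m, q):
    """Reconstruct approximate weights from scale d, minimum m and integer codes q."""
    return [d * qi + m for qi in q]


def mean_abs_error(weights, bits, block_size=32):
    """Mean absolute round-trip error when quantizing block by block."""
    total = 0.0
    for i in range(0, len(weights), block_size):
        block = weights[i:i + block_size]
        d, m, q = quantize_block(block, bits)
        restored = dequantize_block(d, m, q)
        total += sum(abs(a - b) for a, b in zip(block, restored))
    return total / len(weights)


# Hypothetical stand-in for a weight tensor: small Gaussian values.
random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(4096)]

err4 = mean_abs_error(weights, bits=4)
err5 = mean_abs_error(weights, bits=5)
print(f"4-bit mean abs error: {err4:.6f}")
print(f"5-bit mean abs error: {err5:.6f}")
```

The math itself doesn't care what the network does, which is the point being made above. What this toy can't tell you is how much a given round-trip error hurts an image model versus an LLM, and that is the part anons are arguing about.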
>>
>>102191388
>it works equally well for LLMs as for diffusion models
the reality says otherwise though >>102191134
>>
File: 2024-09-01_00342_.jpg (1.9 MB, 3072x3072)
1.9 MB
1.9 MB JPG
>>
File: 1704344495618201.png (2.53 MB, 1024x1536)
2.53 MB
2.53 MB PNG
Q4_1 top, Q5_1 bottom
>>
>>102191408
A quantized LLM also is not as "smart" as a non quantized one, especially if you go as hard as fp32 -> q4 .. it's just not visible in images. As they say an image is worth a thousand words
>>
>>102191446
>A quantized LLM also is not as "smart" as a non quantized one, especially if you go as hard as fp32 -> q4
We got the bf16 not the fp32 model (if we suppose such model even exists) though?
>>
>>102191430
Bottom looks crispy
>>
>>102191463
>not the fp32 model (if we suppose such model even exists) though?
probably internally at BFL .. but they chose the parameter count and weight precision they released very carefully so it just runs on 24GB VRAM is my guess..

also I am not sure the flux1 default is bf16 or just fp16
>>
why does flux start generating grid-like artefacts at high res and how do i prevent it (if even possible)?
full upscale would be much better than shitty tiled seams
>>
File: file.png (55 KB, 1677x523)
55 KB
55 KB PNG
>>102191494
>also I am not sure the flux1 default is bf16 or just fp16
it's bf16
https://huggingface.co/black-forest-labs/FLUX.1-dev
>>
File: 1705745922086757.png (848 KB, 1024x768)
848 KB
848 KB PNG
>>102191493
I've been doing everything with a CFG of 6 so I need to try adjusting that next
>flesh high-heels
New fetish unlocked
>>
File: file.png (60 KB, 1192x514)
60 KB
60 KB PNG
>>102191514
>it's bf16
let's be clear there, are we running the bf16 or the fp16 when we do that then?
>>
>>102191528
>I've been doing everything with a CFG of 6 so I need to try adjusting that next
what's your anti burner? for me it's AutomaticCFG
>>
>>102191568
>AutomaticCFG
Wasn't aware of this, will try it out
>>
>>102191430
>>102191528
these don't appear as close to the source material as I'd expect flux to be able to do desu
>>
>>102191588
I think the dude who made it said he was going to retrain and optimize because he wasn't fully satisfied either. I'm happy tho, and I'm probably not prompting as carefully as I could.
>>
>>102191587
you can see some comparisons there, AutomaticCFG seems to be the one that gives the best prompt adherence of them all
https://reddit.com/r/StableDiffusion/comments/1eza71h/four_methods_to_run_flux_at_cfg_1/
>>
>>102191514
where does it say that there?
>>
File: 1722236457283921.png (2.17 MB, 1024x1536)
2.17 MB
2.17 MB PNG
Q4_1 top, Q5_1 bottom again
>>
>>102191656
>where does it say that there?
it's written: "torch_dtype = torch.bfloat16" anon
>>
File: chinesevideo.webm (1.95 MB, 1280x720)
1.95 MB
1.95 MB WEBM
The new Chinese "MiniMax" text to video model is surprisingly coherent with little warping. Almost feels like it's an actual video of a 3D environment. Hope flux video has these capabilities.
>>
>>102191675
I see.. I was searching for bf16 .. thx
>>
>>102191544
all releases of flux are bf16 originally.
converting it to fp16 doesn't matter except for compatibility
"default" here means you're not converting from whatever dtype your local file has

>>102191656
it's in the config file for the relevant archs
>>
File: ComfyUI_00252_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>102191544
BF16 unless you have a shit card that doesn't support it.
>>
File: hmm_face.png (84 KB, 500x500)
84 KB
84 KB PNG
>>102191715
>>102191693
>>
File: 2024-09-02_00009_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>
>>102191715
that foxgirl identifies as the Golden Gate bridge, desu
>>
>>102191704
>converting it to fp16 doesn't matter except for compatibility
going from bf16 to fp16 is lossy though, that's why it's important to know if it's been done somewhere
https://new.reddit.com/r/StableDiffusion/comments/1f6obs0/comment/ll20fbt/?utm_source=share&utm_medium=mweb3x&utm_name=mweb3xcss&utm_term=1&utm_content=share_button
>All looked "worse". The original weights are actually BF16, not FP16.
>i.e on my non BF16 supporting card, gens are subtly inferior. Doing merges from quantized/casted weights.. also not as good. Precision is lost. Model makers don't seem aware of this yet.
>>
File: 00093-3502244900.png (787 KB, 616x1008)
787 KB
787 KB PNG
>>
>>102191746
Based Comfy driving anons to suicide
>>
File: 2024-09-02_00013_.jpg (1.81 MB, 3072x3072)
1.81 MB
1.81 MB JPG
>>
File: 1702866144462878.png (842 KB, 1024x768)
842 KB
842 KB PNG
>>102191568
>>102191587
>>102191600
Tried it but don't see any differences with a CFG at 6 and Q4_1 or Q5_1, but I'll keep it on in case it helps for other prompts
>>
File: 00282.png (1.38 MB, 896x1152)
1.38 MB
1.38 MB PNG
>>
File: 1715604630773271.jpg (22 KB, 372x323)
22 KB
22 KB JPG
Are there any good online generators / generation services that are good and don't have limitations? Ideogram fulfilled this purpose before and now it's became nogger, unsure if Flux is a meme (I've been using Flux Pro too); any recommendations to use?
>>
>>102191777
it's really negligible if the bf16 values aren't outside the fp16 range
by negligible I mean a MAD (mean absolute difference) below 1e-5 for example
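
A quick way to see when the cast is lossless and when it isn't, using only the standard library. This emulates bf16 and fp16 rounding by hand rather than calling torch, so treat it as a sketch: the helper names are made up, and only finite inputs are handled.

```python
import struct


def to_bf16(x: float) -> float:
    """Round a finite float to the nearest bfloat16 value (round-to-nearest-even)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # bfloat16 is the top 16 bits of a float32: add a rounding bias, drop the rest.
    bits = (bits + 0x7FFF + ((bits >> 16) & 1)) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def to_fp16(x: float) -> float:
    """Round a finite float to the nearest float16 value; overflow becomes +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        return float("inf") if x > 0 else float("-inf")


def bf16_to_fp16_error(x: float) -> float:
    """Absolute error introduced by casting the bf16 version of x to fp16."""
    w = to_bf16(x)
    return abs(to_fp16(w) - w)


for sample in (0.0123, 1e-8, 1e5):
    w = to_bf16(sample)
    print(f"bf16 value {w!r} -> fp16 {to_fp16(w)!r}, error {bf16_to_fp16_error(sample)!r}")
```

fp16 has more mantissa bits than bf16 but fewer exponent bits, so a bf16 weight inside fp16's normal range converts exactly. The loss only shows up for very small values (they flush toward zero) and very large ones (they overflow). How many Flux weights actually sit in those tails is not something this sketch measures.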
>>
>>102191947
>local
>>
File: 2024-09-02_00015_.jpg (1.44 MB, 3072x3072)
1.44 MB
1.44 MB JPG
>>
>>102192138
You will never be a human.
>>
File: 2024-09-02_00023_.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>>102192147
>>
>>102191214
does t5 overwhelm clip_l?
>>
File: xyz_grid-0072-14572326.png (2.7 MB, 1848x1008)
2.7 MB
2.7 MB PNG
going through epochs, trying to pick the ones i will keep. these are i, j, and k. k is most interesting but most off-prompt.
>>
File: 1701274506846750.png (1.03 MB, 1152x864)
1.03 MB
1.03 MB PNG
>>
File: 1723415892057520.jpg (101 KB, 906x1024)
101 KB
101 KB JPG
>>102192037
I haven't gone local in ages; I've got lazy in my old age. What're the advantages of going local VS Le cloud based?

>I have a 4090GTX
>>
File: 2024-09-02_00028_.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>102192242
online generators don't give you as much freedom in configuration, cost money and are often censored
>>
File: 2024-09-02_00026_.png (1.58 MB, 1024x1024)
1.58 MB
1.58 MB PNG
>>102192242
ow and also my mother is already dead
>>
>>102192242
dont post "your mother will die in her sleep" stuff please
>>
File: 00017-3121424497.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>
File: 00020-1438463393.png (906 KB, 1024x1024)
906 KB
906 KB PNG
>>
>>102192284
What would result in better results? What beats the meme of the week Flux or DALLE 3 in terms of Local then? I'm out of the loop and SDXL was the last I used with Automatic1111
>>
>>102192242
not sending your data to some company
>>
File: 00030-3849217814.png (1007 KB, 1024x1024)
1007 KB
1007 KB PNG
>>
File: 00039-1307847359.png (952 KB, 1024x1024)
952 KB
952 KB PNG
>>
File: 000000_17203_.png (2.81 MB, 1508x1032)
2.81 MB
2.81 MB PNG
>>
File: 2024-09-02_00006_.jpg (1.99 MB, 3072x3072)
1.99 MB
1.99 MB JPG
>>102192412
Flux (or what will be next fab) local gives you access to free choice of generating parameters, extra nodes (like SDUltimateUpscale for nearly unlimited resolutions!) that change quality of output, free choice of loras

>>102192412
>I used with Automatic1111
atm it's rarely being updated.. there is a fork called Forge that is slowly replacing it, also ComfyUI has seriously matured and has a huge trickbox of extra nodes that can greatly alter and enhance quality of gens

but running diffusion non local is fine.. it just won't give you the freedom (and the unlimited gens)
>>
File: 1705967460428306.jpg (280 KB, 1344x768)
280 KB
280 KB JPG
>>102192502
Interesting, thanks for the insight. I saw LORAs on civai or whatever it's called with the begging. I've heard of Forge and ComfyUI. "Trickbox" I've heard once, what exactly is it? See I'd be interested in making some " Spider-Man, Batman, and Iron Man" together unique and HQ images for my best friend's kids and wonder how much extra I could get from doing it locally. Definitely going to be trying it out tomorrow when I get back from the hospital.
>>
>>102191528
>I've been doing everything with a CFG of 6 so I need to try adjusting that next
Does that help for this LoRA? I've been using the default 1.0 and getting inconsistent results.
>>
>>102192412
>What beats the meme of the week Flux or DALLE 3 in terms of Local then?
>Dalle 3
>Local

Anyway, I don't know how someone could use a paid service and enjoy it. 99% of the time I'm just dicking around or tinkering, I don't get why I'd have to pay someone for that privilege. But yeah, flux is the meme of the week especially if you like to have fun with the prompts.
>>
>>102192584
>"Trickbox" I've heard once, what exactly is it?
you can install custom nodes in ComfyUI .. they range from simple loaders to upscalers and whole animation making setups, since it's modular in design some anons came up with insanely intricate workflows to make very high quality outputs

>See I'd be interested in making some " Spider-Man, Batman, and Iron Man" together unique and HQ images
will work in FLUX .. but I hope you got the hardware to run it effectively, else you will be waiting long and tired for hires outputs that will please you
>>
File: 2024-09-02_00037_.jpg (2.22 MB, 2160x3840)
2.22 MB
2.22 MB JPG
>>
>>102192651
SD ultimate upscaler works for flux for tiled up scales. It's how I do all of mine
>>
>You MUST have long verbose captions for your dataset
>NO I don't care if you use joy caption which was trained in the basement of a literal who and hallucinates all over the place and makes assumptions about the image.
>You just HAVE to. OKAY?
>I don't care that the model has seen billions of examples of everything you're putting in your LoRA and you're more likely to mistag something than actually describe it accurately, you MUST use the meme captioner, how else would the model KNOW that it's looking at a red dress? GOD!
>NO it's not enough just to tag names and unique concepts to a dataset, you MUST tag that flower as well or the model will be confused. There's NO WAY a model knows what a flower is.
>>
File: 2024-09-02_00035_.jpg (1.6 MB, 3072x3072)
1.6 MB
1.6 MB JPG
>>102192682
that's how I do mine too, pic related
>>
>>102192242
I can gen tits and Nazi propaganda locally.
>>
>>102192691
the meds will stop the voices anon, at least for a while. Take them
>>
>>102192691
OK please train 2 LoRAs, one with verbose captions and one with no captions with otherwise the same dataset and compare the results. I don't think anyone has done this before.
you can think whatever you like and I'm not saying you're wrong because I simply don't know, but I'd like to see some evidence, since verbose captioning has produced excellent quality LoRAs for me.
>>
File: Bruh.png (2.57 MB, 3185x1612)
2.57 MB
2.57 MB PNG
Pytorch 2.4.0 is fucked up lol
https://imgsli.com/MjkzMjI3/0/2
>>
>>102192730
https://civitai.com/articles/6792/flux-captioning-differences-training-diary
>>
File: ComfyUI_05518_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
Learned how to use Lambda Labs cause I am scared of starting house fires when leaving the LoRA baker on while at work. The downside is I can't train anything questionable unless I want to be potentially vanned by the shifting laws regarding AI.
>>
>>102192691
How does not tagging the flower affect the probability of it being associated with the other elements that were tagged?
>>
>>102192731
>Schizos complain that torch 2.5 results were different, and faster therefore worse
>Turns out it was actually 2.4 that was shit.

The ironing.
>>
File: 1713405650193.jpg (356 KB, 1024x1024)
356 KB
356 KB JPG
>>
>>102192781
both 2.5 and 2.4 look worse than 2.3.1 though
>>
File: 2024-09-02_00045_.png (722 KB, 720x1280)
722 KB
722 KB PNG
>>
>>102192778
You don't tag anything but the common element of the concept you're trying to train, i.e. the character's name or the idea of an upskirt shot.
Everything else the model already knows better than you do.
>>
>>102192765
Thanks. I'll try this on my next LoRA.
This is specific to style or will it work the same on character LoRAs too?
>>
File: 1706127003194130.gif (138 KB, 500x289)
138 KB
138 KB GIF
>>102192651
I have an i13 and a 4090GTX so hopefully should be good
>>
File: 1699309796663.jpg (523 KB, 1024x1024)
523 KB
523 KB JPG
>>
Does anyone have a settings json for kohya that will work on 16gbVRAM? I've been struggling
>>
>>102192807
*RTX I hope .. or you got arcane hardware.

But some advice: If you are used to A1111, you might try Forge first for flux local before going ComfyUI, since the Forge UI is mostly the same as A1111 .. ComfyUI is more powerful tho
>>
https://www.reddit.com/r/StableDiffusion/comments/1f523bd/comment/lkpzeh9/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

Some more evidence in the "Your captions are probably doing more harm than good" camp. This is from the guy that showcased his block specific training yesterday. He's a fairly prolific LoRA trainer so his ideas carry at least some weight behind them.
>>
File: konosuba flux dev.jpg (1.44 MB, 3072x1024)
1.44 MB
1.44 MB JPG
>>102192804


NTA, I prefer WD14 tagging since I already have a ton of datasets in WD14. And it does work with characters.
>>
>>102192839
>I've always preached against captions, captions should only be used for non existent concepts,
how are we supposed to know Flux knows or doesn't know a concept?
>>
>>102192839
I don't care who people are, I care about the results.
>>
>>102192864
12B parameters, unless it's your own ass hole you're training, it probably knows it.
>>
File: 1715822310880145.jpg (11 KB, 360x234)
11 KB
11 KB JPG
>>102192837
Shit, yes, RTX. Faux paux on my part lmao

I swear I've used ComfyUI now the more I think about it. The stringy lines connecting and the likes. By Flux local, is that Flux Schnel, Dev, or Pro equivalent?

And should I even be focusing on Flux to begin with?
>>
>>102192884
>12B parameters
what does this have to do with anything? We have no idea how many pictures this mf ate during the pretraining, maybe BFL has filtered it hard and there's not much Flux has seen in the end
>>
>>102192895
Schnell is a weaker 4 step model, Dev is the default for local, and pro is api only.
>>
File: 1696649665483661.png (246 KB, 500x500)
246 KB
246 KB PNG
>>102192895
Faux pas*
>>
>>102192901
Given just how easy it is to train I'd say that's highly unlikely, it already knows them it just doesn't know what they're called. All we need to do is tell it.
>>
>>102192895
>And should I even be focusing on Flux to begin with?
yes.. SDXL is mostly stuck in porn limbo

>>102192895
>I swear I've used ComfyUI now the more I think about it. The stringy lines connecting and the likes. By Flux local, is that Flux Schnel, Dev, or Pro equivalent?
yes thats the noodle UI .. local FLUX is either FLUX.dev or FLUX.schnell.. schnell is abit retarded tho, use dev
>>
>>102192924
if that's the case then we should go for embeddings instead of LoRAs? That's what embeddings were made for
>>
>>102192935
Because everyone is used to loras and won't change to embeddings because people are resistant to change. It's as simple as that.
>>
File: 1709109380305849.jpg (219 KB, 1024x1024)
219 KB
219 KB JPG
>>102192910
What about the free Pro options online? How do they compare? And is Flux the right way to go? I find Ideogram to be the comfiest I've seen for neat results personally.
>>
>>102192950
that's too bad because you can make embeddings with a potato PC kek
>>
>>102192959
Stronger than Dev but censored, token limited and worthless.
>>
>>102192731
Now I start to get why Comfy doesn't want to switch to Pytorch 2.4.0 lol
>>
>>102192963
Well I have a 4090 but I haven't made an embedding since early early sd, before we had loras. I don't even know how to begin training an embedding
>>
>>102192585
>Does that help for this LoRA? I've been using the default 1.0 and getting inconsistent results.
It helped with my first few gens but I never really went back to see how widely it applies
>>
>>102192871
Neither do I, but he trains good LoRAs and recommends against anything but the most essential captioning, he's a valuable point of data among many opinions.
>>
File: 00284.png (1.27 MB, 896x1152)
1.27 MB
1.27 MB PNG
Barn
>>
>>102192997
Dunno if you're the melty anon or not but assuming you are just provide the evidence next time instead of being schizo about it. More people will listen to you.
>>
>>102192977
weird.. my ComfyUI portable downloaded 2.4.0 tho
>>
I just made a rather peculiar discovery. Using base Pony, keeping same seed, and changing prompt one term at a time, I found out that the token "!" (Yes, just an exclamation mark, alone, between commas, nothing more) has a dramatic effect on generation. Seems so far to be for the better.

What gives? Is this effect intended? What does the model interpret ! (alone) as?
>>
>>102193040
it installs 2.4.0 if you open the "update_comfyui_and_python_dependencies.bat", or else you have 2.3.1 by default, that's how it went for me
>>
>>102192997
Alright, next lora I'm training is of my cat, so I'll just use his name and no other captions and see what happens.
I have no idea how to into layer training though.
>>
>>102192803
that's not how it works. by doing that you end up with hallucinations in your images because it thinks that's what you want when you try to use your concept token. it comes to associate everything with the token. you've clearly never trained a proper lora before and are just trying to cause discourse and spread misinfo
>>
>>102193048
Pony was trained in an incredibly retarded way, that score bullshit is fucked up. So it's jot surprising
>>
>>102193060
Scroll up, there's evidence to back this up. Check the links
>>
File: 2024-09-02_00059_.png (708 KB, 720x1280)
708 KB
708 KB PNG
>>102193050
okay.. I guess I'll downgrade once this queue finishes
>>
>>102193069
there is not.
>>
>It doesn't exist if I close my eyes
>>
>>102193071
here's how to downgrade:
1) go to the ComfyUI_windows_portable\update folder
2) run this command in cmd from there:
..\python_embeded\python.exe -s -m pip install --upgrade torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --index-url https://download.pytorch.org/whl/cu121
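
To confirm the downgrade actually took, a small check like the one below can help. It only reads installed package metadata, so it has to be run with the same embedded interpreter (the `..\python_embeded\python.exe` from the command above) to inspect the right environment. The pins are copied from that command; the function names are made up for this sketch, and the `+cu121` suffix handling assumes the wheels report a local version tag, which may not hold for every build.

```python
from importlib import metadata

# Versions pinned in the downgrade command above.
PINS = {"torch": "2.3.1", "torchvision": "0.18.1", "torchaudio": "2.3.1"}


def installed_version(package: str):
    """Return the installed version string, or None if the package is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def matches_pin(installed, pinned: str) -> bool:
    """True if the installed version equals the pin, ignoring a local tag such as '+cu121'."""
    return installed is not None and installed.split("+")[0] == pinned


def report(pins=PINS):
    """Map each package name to (installed version or None, whether it matches the pin)."""
    result = {}
    for name, pin in pins.items():
        found = installed_version(name)
        result[name] = (found, matches_pin(found, pin))
    return result


if __name__ == "__main__":
    for name, (found, ok) in report().items():
        status = "OK" if ok else "MISMATCH"
        print(f"{name}: installed={found} wanted={PINS[name]} -> {status}")
```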
>>
File: 00106-2537947629.png (564 KB, 616x808)
564 KB
564 KB PNG
>>
File: 2024-09-02_00054_.png (983 KB, 720x1280)
983 KB
983 KB PNG
>>102193086
thanks anon! that saved me 10 minutes searching for the right cmds.. love you
>>
>>102193116
you're welcome o/
>>
File: 1718491365773605.png (19 KB, 475x225)
19 KB
19 KB PNG
are there any good fp8 based finetunes for making ludes yet? i'd rather avoid experimenting and trying the 100s of nsfw loras to find the best one myself
>>
>>102192997
That's for character LoRAs though, style LoRAs are different
>>
>>102193181
there are no finetunes of flux yet, this shit asks for too much VRAM
>>
>>102191214
>all avatars in OP
>/sdg/ 2.0
>>
File: 2024-09-02_00067_.png (643 KB, 720x1280)
643 KB
643 KB PNG
>>102193086
okay the difference is extreme.. wtf are the torch devs doing that the output is so drastically different

pic related is >>102193071 but with 2.3.1 instead of 2.4.0
>>
File: 1701052242899092.png (893 KB, 1152x864)
893 KB
893 KB PNG
>>
>>102193071
>>102193240
>okay the difference is extreme.. wtf are the torch devs doing that the output is so drastically different
Ikr, they seem to not give a fuck about quality and shit, now I wonder if going for even older torch versions (like 2.2.0) can give even better quality than the 2.3.1 one for example

just made a side by side comparison for everyone interested: https://imgsli.com/MjkzMjM2
>>
>>102193239
Who do you think made this thread, Anon?
>>
File: 1708041876001120.png (116 KB, 306x250)
116 KB
116 KB PNG
>hmm, i should train yet another emma watson lora and upload it to civitai
why is he like this?
>>
>>102193239
whats sdg
>>
>add large breasts or busty to the prompt
>the person becomes a fatass
what did the flux devs mean by this
>>
File: 2024-09-02_00075_.png (1.12 MB, 720x1280)
1.12 MB
1.12 MB PNG
>>102193288
>https://imgsli.com/MjkzMjM2
Interesting. Probably should do a test with some full body gens. I wonder if anatomy is better with 2.3.1
>>
File: Sigma_13048_.png (2.24 MB, 1024x1024)
2.24 MB
2.24 MB PNG
>>
>>102193288
...I...is it possible that because kohya sd-scripts is updated to 2.4.0 the loras trained are also shittier because of it or it wouldn't work that way...?
>>
>>102193390
it's possible yeah, to be sure you'd have to make the same lora training for both 2.3.1 and 2.4.0 and see if there's a difference, and if there is, which one is worse
>>
File: 2024-09-02_00078_.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>102193412
I'll do that overnight.. I'll downgrade to 2.3.1 on ai-toolkit and retrain my Disgaea lora and compare tmrw
>>
>>102193430
Godspeed anon
>>
File: file.png (2.03 MB, 1024x1024)
2.03 MB
2.03 MB PNG
>>102193430
Good luck anon
>>
Switching from forge to Comfy. What am I in for anons?
>>
>>102193480
unbridled comfort
>>
>>102193480
you're gonna be overwhelmed and you're gonna feel like going back to forge, but you need to spend a few hours finding yourself a workflow that does the same thing you usually do in forge and you will eventually pick up how everything works
>>
>>102193480
you'll be able to put the text encoder on your cpu (or on your second gpu if you have one), you'll be able to use AutomaticCFG (the best anti burner) if you're into CFGmaxxing, and you'll be able to use Negative Guidance as well
>>
File: 2024-09-02_00076_.png (1.17 MB, 720x1280)
1.17 MB
1.17 MB PNG
>>102193459
>>102193467
ty, its running..
>>
>>102193480
you'll spend every second wondering why the fuck you swapped but the sunken cost fallacy and that single WF that's customized just slightly behind what forge offers will keep you stuck. you'll hate every second of it, you'll learn not to pull updates ever, and you'll eventually realize what a talentless, egotistical and annoying faggot the creator is, as it slaps you in the face over time even if you generally avoid petty drama faggotry

enjoy
>>
>>102193525
and the worst part is that you don't feel like leaving ComfyUI because you've spent weeks making the most perfect workflow ever and you don't want to ruin that work kek
>>
how do i add the automatic cfg hack to my workflow? what nodes do i need?
>>
File: file.png (47 KB, 874x414)
47 KB
47 KB PNG
>>102193555
You just need that node and that's pretty much it
https://github.com/Extraltodeus/ComfyUI-AutomaticCFG
>>
>>102193589
before or after loading a lora? does it matter?
>>
Buera Ruierd
>>
>>102193640
I don't think the order matters, but personally I put this after loading the lora
>>
>>102193650
>Buera Ruierd
Don't know what that means but pretty pic
>>
>>102193650
merhily beren branel,
and the mome raths outgrabe.
>>
File: file.png (33 KB, 852x465)
33 KB
33 KB PNG
https://github.com/chrisgoringe/cg-mixed-casting
this node is quite insane, you can make your own quants by choosing which precision goes to each layer, and it's saved as a safetensor, so no more GGUF slow shit, it's loading like a regular fp8 model
>>
File: 1716379842138742.png (1.36 MB, 899x1163)
1.36 MB
1.36 MB PNG
>>102193658
oh god
>>
>>102193678
wtf bruh, show me a screen of your workflow, you messed something up that's for sure kek
>>
>>102193678
looks tight tho desu
>>
>>102193589
>Extraltodeus

if you took time to read that dev repos, you will find out that he's autistic as hell
>>
>>102193694
maybe, Idk, I tested a lot of anti burners and his node is the best so far
https://reddit.com/r/StableDiffusion/comments/1eza71h/four_methods_to_run_flux_at_cfg_1/
>>
File: 1713009776669003.png (173 KB, 2163x665)
173 KB
173 KB PNG
>>102193688
i'm using the simple version, maybe i have to run it on my gguf workflow?
>>
File: 2024-09-02_00030_.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>102193430
>>102193459
>>102193467
>>102193507
can't do it... ai-toolkit goes OOM with torch 2.3.1 .. so something in 2.4.0 handles VRAM better .. guess we will never know, I can not train the same 1024x1024 dim 32 lora on 2.3.1 with my 4090
>>
>>102193710
>Guidance 1.5
maybe that's the culprit, put it on a default value (3.5)
>>
File: 00294.png (1.55 MB, 896x1152)
1.55 MB
1.55 MB PNG
>>
>>102193731
>guess we will never know
https://youtu.be/GzlKja1ySzo?t=10
You could try on 2.5.0 though, it's different to 2.4.0 as well
>>
My heart weeps for all the beauty that won't be posted, unless some gentle anon should read this, and take pity upon a lusty old gentleman, and post sweet maidens for his pleasure...
>>
>>102193732
yeah that did it. thank you for the help
>>
File: ComfyUI_02285_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>102193760
Did someone call for a sweet maiden?
>>
>>102193335
Unfortunately, and I really mean it, to the point that I'm crying right now, big breasted slim bodied women are not the majority in the west(or maybe even in the east, can't say for sure) due to a wide variety of reasons like lack of exercise, too much trash food, etc. So, if left unattended, the model will probably associate big boobs with ham planets.
>>
File: 1723814484498989.png (426 KB, 640x480)
426 KB
426 KB PNG
>>102193760
>>
>>102193768
you're welcome, and yeah, low guidance doesn't like high CFG somehow
https://reddit.com/r/StableDiffusion/comments/1emow5p/finding_the_sweet_spot_between_guidance_and_cfg/
>>
>>102193710
3cfg is insanely high for flux. default is 1.0.
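
Since the two knobs keep getting mixed up: as far as I understand it, CFG is the classic classifier-free guidance mix of a conditional and an unconditional (or negative) prediction, while Flux dev's "guidance" is a separate value fed into the distilled model. A toy sketch of the CFG side only, with plain lists standing in for noise predictions. The names are made up for illustration and are not ComfyUI's.

```python
def cfg_combine(uncond, cond, cfg_scale):
    """Classifier-free guidance: start at uncond and move cfg_scale times toward cond."""
    return [u + cfg_scale * (c - u) for u, c in zip(uncond, cond)]


# Hypothetical two-element "predictions", chosen to be exactly representable floats.
uncond = [0.5, -1.0]
cond = [1.0, 0.25]

at_one = cfg_combine(uncond, cond, 1.0)    # collapses to the conditional prediction
at_three = cfg_combine(uncond, cond, 3.0)  # extrapolates past it

print("cfg 1.0:", at_one)
print("cfg 3.0:", at_three)
```

At a scale of 1.0 the unconditional term cancels out, which is why CFG 1 means the negative prompt does nothing. Anything above 1 extrapolates past the conditional prediction, and that overshoot is the kind of thing the anti burner nodes try to tame.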
>>
>>102193668
based Carroll reference, I love that poem
>>
File: 1704124077944870.png (430 KB, 640x480)
430 KB
430 KB PNG
>>102193335
>>102193783
Works for me
>>
>>102193796
that's the point, he wants to use AutomaticCFG to prevent the burning of high CFG
>>
>>102193753
uuh..
>ERROR: No matching distribution found for torch==2.5.0
not sure if I wanna go for nightly builds
>>
File: fkrojeqhixmb1.jpg (736 KB, 1986x2000)
736 KB
736 KB JPG
>>102193784
>The boomer shooter LoRA again
>>
File: 000000_17213_.png (2.53 MB, 2172x743)
2.53 MB
2.53 MB PNG
>>
File: file.png (1.69 MB, 768x1344)
1.69 MB
1.69 MB PNG
https://civitai.com/images/27312232
Finally, Migu won't be the only vocaloid that could be used on Flux kek
>>
>>102193899
https://www.youtube.com/watch?v=rSoo9WS8t7k
>>
>>102193899
I don't even know who you are
>>
>>102193899
Migu will always be better because she is in the default model, no LoRA needed.
>>
File: file.png (271 KB, 360x360)
271 KB
271 KB PNG
>>102193931
>Migu will always be better because she is in the default model, no LoRA needed.
>>
>>102191214
>avatar OP
>>
File: file.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
Noice
https://civitai.com/models/690948/paper-mario-style-f1d?modelVersionId=773299
>>
File: file.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>102193973
>>
who is the worse attention whore, koff or debo
>>
>>102194034
which one do you spend more time thinking about, or is it equal
>>
>>102194034
Who's koff?
>>
>>102194046
nick fe, aka creepy loli alien horseface poster (also banned for posting real cp on 4chan before)
>>
File: 00121-1476621908.png (913 KB, 1008x688)
913 KB
913 KB PNG
>>
>>102194065
Oh right. Probably debo then. But it's hard to tell if I'm looking at debo or just a horrible person sometimes, at least koff has the decency to be identifiable
>>
>>102194046
>>102194091
this guy
>>
>>102194034
It's you, because you keep talking about them as if they matter, unprompted.
>>
I miss schizo anon 2
>>
>>102192182
Why does that face show up so much? Is it a character, or just a base model in Flux?
>>
>>102194187
I am still here. Also who is that?
>>
>>102193784
She is a cool delicious nectar to
my parchèd eyes. You gratify me greatly yielding freely such a prize. I offer thanks as freely, for her generous bosom's size, and every woman thing in her which bids my manhood rise—arising like a haughty tower thrust into the skies, whose stony tip parts tufted clouds, the heavens' spotless thighs, and reaches into hidden regions no man e'er espies—I will not tell of all the places where my fancy flies; of balmy daydreams that I dream I need not you apprise, but just my thanks for the delight your lovely gen supplies.
>>
>>102192731
That's false, if you have the exact same settings, you get the exact same result. This is true regardless of gpu, or if using extremely slow cpu. Regardless, if it's all the same, you get the same result, exactly.
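
One caveat worth keeping in mind for this argument: floating-point addition is not associative, so two implementations that sum the same numbers in a different order can legitimately disagree in the last bits. Whether that is what changed between torch 2.3.1 and 2.4.0 is not established anywhere in this thread. The snippet below only shows the general effect with plain Python floats.

```python
import math

values = [0.1, 0.2, 0.3]

# Same three numbers, two different grouping orders.
left_to_right = (values[0] + values[1]) + values[2]
right_to_left = values[0] + (values[1] + values[2])

print(repr(left_to_right))
print(repr(right_to_left))
print("bitwise equal:", left_to_right == right_to_left)
print("close enough:", math.isclose(left_to_right, right_to_left))
```

Same seed and same settings pin down the inputs. They do not by themselves pin down the order of operations inside a kernel, and small differences like this can compound over many sampling steps.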
>>
>>102194187
The Nekopara porn poster? That dude was legitimately autistic
>>
>>102194178
kinda sad.
>>102194188
it is a mystery even to me.
>>
>>102194091
That's the same face again. The "Flux face". Or one of them. Who is it?
>>
>>102194206
There is no way this isn't debo trying to stir shit.
>>
>>102194220
This is a bad faith argument and you know it.
>>
File: 1717024003195958.png (449 KB, 640x480)
>>102194202
You're welcome
>>
>>102194220
idk, just weird, all you have to do is load the example workflows and you get 100% identical gens, with the same everything: seed, cfg, guidance.
>>
File: 1701722494218.jpg (296 KB, 1024x1024)
>>
>>102194256
my face after 12 hours of gooning
>>
>>102194188
Baked in as much as the butt chin.
>>
>>102194250
post a catbox of one of your outputs, I will test this
>>
>https://comfyanonymous.github.io/ComfyUI_examples/flux/#regular-full-version
followed this and got an (unknown error) popup
put everything in correct folder
4090 with 32gb ram
any ideas?
>>
>>102194309
>(unknown error)
That's all it said and nothing else?
>>
>>102194330
yep
>>
File: delux_sg_00110_.png (1.86 MB, 1536x968)
>>102194097
I was posting a gen. that debo you see around every corner, behind every post, and every night in your dreams is something wrong with your brain

>>102194187
this is getting out of hand. now there's two of them?

>>102194256
is this a controlnet?

>>102194357
>>(unknown error)
I like how its whispering it to you
nothing else in the console window?
also doesnt the weight need to be flux? f8 or something
>>
>>102194357
Share your full workflow to catbox
>>
>>102194357
What does the command prompt say? Even though it's giving you an unknown error there, there's likely more information in the command window.
>>
>>102194399
>>102194357
Actually don't, I just realised it's the default one.
Does it work with a different model file?
>>
>>102193731
rip, thanks for trying anon. maybe we're better not knowing..
>>
>>102194384
>also doesnt the weight need to be flux? f8 or something
Default should run it at fp16, which on a 4090 he can do
>>
File: 1719350210632.jpg (264 KB, 1024x1024)
>>102194384
>is this a controlnet?
no, only prompt
>>
>>102194309
>>102194357
>>102194399
>>102194420
>>102194422
so i got it to work literally once then same unknown error
i changed weight_dtype from default to fp8e5m2
>>
So is it better to train lora at a high rank, then resize it to a lower rank? I remember anon saying it's faster to train at a high rank.
>>
>>102194460
Are you OOMing on default?
>>
>>102194465
It's better to train specific layers, you can get a 128 network dim LoRA at 4.5mb
Don't ask me how to do it because I don't know.
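For a rough sense of why a rank-128 LoRA can still be tiny when only a few layers are trained, here is a back-of-the-envelope sketch. It uses the standard LoRA parameter count of rank * (d_in + d_out) per adapted linear layer. The 3072 hidden size and the choice of three square projections are assumptions for illustration, not what any particular trainer actually targets.

```python
def lora_params(d_in, d_out, rank):
    """Parameters in one LoRA pair: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


def lora_size_mib(shapes, rank, bytes_per_param=2):
    """File size in MiB for LoRA pairs over the given (d_in, d_out) layer shapes."""
    total = sum(lora_params(d_in, d_out, rank) for d_in, d_out in shapes)
    return total * bytes_per_param / 1024 ** 2


HIDDEN = 3072  # assumption: hidden width of the model being adapted

# Three square projections at rank 128, stored at 2 bytes per weight (fp16/bf16).
print(lora_size_mib([(HIDDEN, HIDDEN)] * 3, rank=128))  # 4.5
```

Size grows linearly with both rank and the number of adapted layers, so cutting the layer list shrinks the file just as effectively as cutting the rank.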
>>
File: delux_sg_00113_.png (1.73 MB, 1536x968)
>>102194449
ooo, ok. that makes sense

>>102194458
cool. I'm surprised you can get such dark outputs
>>
>>102194475
>layers
blocks.
>>
>>102194480
layers, blocks, same thing only different
>>
does ai-toolkit lora training work on a 4080? or is 4090 minimum?
>>
>>102194469
i get this on top right
>>
>>102194488
I find my gpu usage hovers around 20gb regardless of what I do on ai-toolkit, but I also find the LoRAs that come out of it to be kind of better... but that's probably because the settings are very straightforward.
>>
>>102194497
That's because comfy stopped. You need to re-start it.
Wait, did you use run_nvidia_gpu or run_cpu bat?
>>
>>102194504
nvidia
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
>>102194479
>Buttchin
>>
>>102194510
toshiba
>>
File: delux_sg_00115_.png (1.88 MB, 1536x968)
>>102194511
I don't care at all about chins
>>
File: ComfyUI_01007_.png (957 KB, 1024x1024)
>>102194531
>>
I liek birba
>>
File: ComfyUI_01149_.png (1.21 MB, 1024x1024)
I'm gonna post some pics of my new LoRA now
>>
File: 1710276225362.jpg (881 KB, 1024x1024)
>>
File: ComfyUI_01121_.png (1.08 MB, 1024x1024)
>>102194589
>>
File: delux_sg_00114_.png (2.02 MB, 1536x968)
>>102194589
neat door, cute girl

>>102194599
neat robit, cool scene and instagram filter
>>
>>102194604
>>102194589
>*Begins panting heavily*
>Is that... IS THAT 1 GIRL?
>>
>>102194531
you don't care about quality or fingers either
>>
File: ComfyUI_01138_.png (1.3 MB, 1024x1024)
>>102194604
>>
>>102194460
you're getting OOM, use the fp8 model for now. you need more RAM, sorry: even with a 4090, 32GB of RAM isn't enough
>>
File: ComfyUI_01143_.png (1.35 MB, 1024x1024)
>>102194612
>>
File: delux_sg_00129_.png (1.95 MB, 1536x968)
>>102194617
my quality is always high and my fingers are always perfect
>>
File: 1702228221194.jpg (1.14 MB, 1024x1024)
>>
File: ComfyUI_01144_.png (1.3 MB, 1024x1024)
>>
File: 00128-2119237729.png (848 KB, 616x888)
'grocery nuisance' paragraph of seethe still amuses me.
>>
>>102194630
tfw 31.2 usable memory
thanks anyway
>>
>>102194645
or you can use 32gb with the full dev model but stop playing vidya, watching youtube, twitch, anything that eats ram.

You can also try a GGUF model for less memory
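
Rough arithmetic on why 32GB gets tight, as a sketch only. It counts just the raw weights of the main model at different precisions. The 12 billion parameter figure is the published size of the Flux dev transformer and is treated as an assumption here; text encoders, the VAE, the OS, and any temporary copies made while loading all come on top.

```python
GIB = 1024 ** 3


def weights_gib(n_params, bytes_per_param):
    """Memory needed just to hold the weights, in GiB."""
    return n_params * bytes_per_param / GIB


FLUX_PARAMS = 12e9  # assumption: parameter count of the Flux dev transformer

for name, nbytes in [("fp16/bf16", 2), ("fp8", 1)]:
    print(f"{name}: {weights_gib(FLUX_PARAMS, nbytes):.1f} GiB")
# fp16/bf16: 22.4 GiB
# fp8: 11.2 GiB
```

With about 31 GiB usable, the 16-bit weights alone take over two thirds of system RAM before anything else is loaded, which is why fp8 or a quantized file gives more headroom.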
>>
/sdg/ 2.0
>>
File: ComfyUI_01153_.png (1.31 MB, 1024x1024)
>>
File: FD_00370_.png (1.3 MB, 1024x1024)
>>102194639
Fellow automaton enjoyer
>>
File: ComfyUI_01155_.png (1.32 MB, 1024x1024)
>>
File: delux_sg_00116_.png (1.97 MB, 1536x968)
>>102194661
gm
>>
>>102194661
It didn't take long for the avatar fags to realize their captive audience had moved here. What's next?
>>
>>102193332
sexual degeneracy general
>>
>>102194698
Sustainable development goals.
>>
>>102194685
Schizo anon 3
>>
File: FD_00199_.png (1.17 MB, 1024x1024)
>>102194685
Shitposting
>>
File: delux_sg_00117_.png (1.92 MB, 1536x968)
>>102194712
wow its literally me
>>
>>102194718
stay in your confinement thread
>>
sigh, now all we need is PW to start posting here... it was good while it lasted
>>
I had an epiphany while training specific blocks when doing LoRAs. What if we just trained all the blocks?
>>
>>102194752
>now all we need is PW
If that troon starts posting here I will leave.
inb4
>That's a good thing!
I cannot stand that troon.
>>
File: ComfyUI_00689_.png (1.07 MB, 1024x1024)
Heres your controller bro
>>
File: 1710880664498.jpg (990 KB, 1024x1024)
>>
>>102194767
thanks for not touching the d-pad or face buttons I guess
>>
File: delux_sg_00118_.png (1.89 MB, 1536x968)
>>102194761
>If that troon starts posting here I will leave.
thats what we call a win-win
>>
File: FD_00057_.png (1.11 MB, 1024x1024)
>>
File: 1718186755398.jpg (1.05 MB, 1024x1024)
>>
>>102194779
licked them clean no worries brah
>>
File: ComfyUI_00691_.png (988 KB, 1024x1024)
>>102194790
>meanwhile chad at the bar with your crush
>>
>>102194819
HELLO SAAR
>>
File: ComfyUI_01750_.png (738 KB, 720x720)
>>102192242
fuck off fren
>>
File: FD_00061_.png (1.59 MB, 1024x1024)
>>102194819
>>
>>102194868
>>102194790
Anyone notice flux gives men surprisingly juicy tits despite being covered in hair?
>>
File: 1702628826629.jpg (1.01 MB, 1024x1024)
>>
File: 1695214433297332.png (2.39 MB, 1280x1280)
>>
File: 1712188548234917.png (2.5 MB, 1280x1280)
>>
File: 1699691389456297.png (1.69 MB, 1024x1024)
>>
>>102194888
>>102194897
>>102194904
Good gens but fuck that looks miserable to live there. Really soviet block chic
>>
File: 1710773365523.jpg (1.07 MB, 1024x1024)
>>
>>102194917
Was going for more of an outdoor liminal vibe, a creepy too-clean corporate campus
>>
File: zZ4QnH0t_400x400.jpg (34 KB, 400x400)
Hello? Where is everyone?
>>
>>102194917
not enough mcdonalds for you?
>>
>>102194934
>creepy too-clean corporate
That's definitely what it is
>>
File: 1696994451548.jpg (820 KB, 1024x1024)
>>
File: 1696410203462.jpg (380 KB, 1024x1024)
>>
File: 1715696609276506.png (414 KB, 640x480)
Accidental Attack on Titan moment
>>
File: delux_sg_00121_.png (1.93 MB, 1536x968)
>>102194953
I think a bunch of people are serving time rn
>>
Sometimes we unconsciously fail to consider that these models do not "think" like the human brain at all.

There's a prompt I borrowed that paints some goooorgeous orange haired girls, right. So you think "Well, even if it doesn't have experience painting girls with other hair colors, it'll just extrapolate, right. Swap every orange hair pixel to black/silver/blonde/whatever, ain't rocket science." But no. EVERYTHING changes. The drop in quality is astounding.

It makes all sorts of diffuse associations by correlation between everything. Track suits are only worn by athletic people. America only has fatsos. Black dudes are always carrying a bowl of AFC and a watermelon.

You have the perfect prompt, you think you're going to get some sweet permutations out of it, but the moment you change ONE TOKEN, everything goes to hell.
>>
File: 00134-2331230946.png (1.25 MB, 1152x896)
>>
File: 1698499773766551.png (933 KB, 1024x1024)
>>
Bread
>>102195069
>>
File: 1719169229837677.png (1.12 MB, 1024x1024)
>>
>>102195104
Retard, let the thread autosage first.
>>
File: 1702891044942361.png (939 KB, 1024x1024)
>>102195137
Thought it was 300 here, my bad
>>
File: 3v4fclf2hmlc1.jpg (89 KB, 1170x1142)
>>102192242
Nyo
>>
File: ifx343.png (1.1 MB, 1024x1024)
>>
>>102194206
go for it debo, keep the exact same settings but change pytorch versions and you'll see you get something different. it's not surprising at all: they change a lot of math shit for optimisation, so you get different rounding results
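
Both sides of this argument can be shown in a few lines of plain Python, as a sketch rather than a claim about what any specific PyTorch release changed. A seeded generator always reproduces the same starting noise, but floating-point addition is not associative, so a kernel that sums the same numbers in a different order can round differently. The helper name `make_noise` is made up for illustration.

```python
import random


def make_noise(seed, n=8):
    """Stand-in for the initial latent noise: same seed, same numbers."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


# Same seed and same code path: identical output.
same_seed_matches = make_noise(42) == make_noise(42)

# Same three numbers, different grouping: different rounding.
left_to_right = (0.1 + 0.2) + 0.3
right_to_left = 0.1 + (0.2 + 0.3)
order_matters = left_to_right != right_to_left

print(same_seed_matches, order_matters)  # True True
```

Tiny differences like this would be amplified over many sampling steps, which fits with identical settings on an identical software stack matching exactly while a different version drifts.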
>>
>>102192242
replying to save my mother
>>
amen to that
>>
>>102197627
Nice
>>
>>102197644
you're welcome... anon
>>
File: 1715091300334750.jpg (44 KB, 896x512)
>>
>>102197672
Nice colors
>>
>>102194685
>What's next?
Local Diffusion Non-Eukaryote General


