[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (978 KB, 3264x3264)
978 KB
978 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102195069

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: 2024-09-02_00133_.png (1.37 MB, 832x1216)
1.37 MB
1.37 MB PNG
>>102197970
tanks bakerman
>>
Shit collage.
>>
Last episode was very cat centric
>>
File: FD_00135_.png (1.42 MB, 1344x768)
1.42 MB
1.42 MB PNG
>>
File: 2024-09-02_00138_.png (1005 KB, 832x1216)
1005 KB
1005 KB PNG
>>102197938
very interesting result
>>
File: 2024-09-02_00011_.png (1.35 MB, 1280x720)
1.35 MB
1.35 MB PNG
>>102197998
Have a cat.
>>
File: file.png (2.07 MB, 1024x1024)
2.07 MB
2.07 MB PNG
>>102198026
yeah, she looks good in the 90's style setting
>>
Is there anywhere online that I can find a simple table of GPUs that show me
>memory
>price
>name of gpu
?
>>
File: FD_00141_.png (1.07 MB, 1344x768)
1.07 MB
1.07 MB PNG
>>102197998
And this episode will be no different because my LoRA is done.
>>
>>102198037
there's no need, only 3 gpus are relevant nowdays, the 3090, the 3090ti and the 4090
>>
>no glass cats in collage
>no milk pics
someone fix it
>>
>>102198066
I don't want to spend that much.
>>
>>102198037
Because actually AMD gpu like I have are very bad at ai, because all of the programmers for ai never owned an AMD card in their lives.

So, that leaves nvidia.

You don't want to mess with the headless cards anymore, because the prices have gone up too much.

What you want is a 4090, actually. Plus 64gb of system memory. You're welcome, go buy that.
>>
File: file.png (1.95 MB, 1024x1024)
1.95 MB
1.95 MB PNG
that looks really good, imagine if everyone would make loras with the same quality as this one:
https://civitai.com/models/7227?modelVersionId=782696
>>
Blessed thread of frenship
>>
File: 2024-09-02_00146_.png (1.11 MB, 832x1216)
1.11 MB
1.11 MB PNG
>>102198034
I used the Disgea lora tho
>https://civitai.com/models/709964/disgea-style-by-takehito-harada-for-flux?modelVersionId=794117
>>
>>102198079
>585mb character LoRA
Imagine if they didn't
>>
>>102198121
nta, but the Satoshi lora is a style lora
>>
>>102198121
I won't cry for a 500mb Lora when flux Q8 is already asking for 14GB during inference, I have plenty room to spare with my 3090 desu
>>
>>102198055
I love cat
So soft so fluffy
Cat cat cat
>>
>>102198121
Oh wait it's a style lora never mind
I just assumed with your vocaloid without even looking
>>
>>102198121
there also is a 20mb version of that ura style lora
>https://civitai.com/models/7227?modelVersionId=790696
>>
File: FD_00146_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>102198131
https://suno.com/song/ee467d00-5813-4a74-9792-c9ae4a09d344
>>
File: knitted.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>102197983
>masterpiece, best quality, trending on artstation, greg rutkowski. Knitted hedgehog wearing sunflower.
Yeaahhh... the one on the OP without all this looks better.
>>
Honestly, partially rendered VistaPro images are pretty neat.

This is a real one. Flux doesn't know what VistaPro is.
>>
>>102198076
What about a 4060 ti? It's got 16GB of memory and it's only $450.
>>
>>102198037
The best graphics card in 2024: top GPUs for all budgets,, updated August 28, 2024
https://www.techradar.com/news/computing-components/graphics-cards/best-graphics-cards-1291458
>>
>>102198162
That shows more progress than the time it took to make this in Flux. Pretty funny to look back.

VistaPro 3.2 (or was it 3.1?) came with a big book I had on VR.
>>
>>102198172
That's one of the google results that I discarded, thanks.
>>
has anyone managed to trick flux into doing lude acts? i'm trying to get her to suck on a lollipop but it refuses. inb4 lora
>>
>>102198168
I am running a 4080 normal. It's fine, fp8 gens at 1.1s/it on Flux
>>
>>102198168
lmgtfy?

>Flux takes 1m20s on 20 steps on RTX 4060 Ti so it's the best card for Flux in lower-range price imo.

> 4070 super generates an image in 44 secs

Both are faster than my 6950xt. Well, at ai.
>>
>>102198185
You're welcome, No go buy either a 3090 Ti, 4090 or get yourself a PNY A5000 24GB.
>>
How to know someone has a 3090/4090?
>don't worry, they'll tell you
>>
File: 2024-09-02_00012_.png (1.46 MB, 1280x720)
1.46 MB
1.46 MB PNG
>>102198173
picrel
>>
File: ezgif-7-a0b51e38e5.png (944 KB, 1024x1024)
944 KB
944 KB PNG
Katppa
>>
vramlet seethe
>>
File: 2024-09-02_00149_.png (1.27 MB, 832x1216)
1.27 MB
1.27 MB PNG
>>102198187
try
>Photorealistic blonde woman eating a lollipop
sucking? that lewd language! how dare you talk lewd to the sentient machine!
>>
File: future is not here yet .png (578 KB, 1280x768)
578 KB
578 KB PNG
>>102198162
That won't be necessary in the future, you'll make an AI describe the picture and send the prompt to another AI that will draw it in that style.
>>
>>102198193
>Flux takes 1m20s on 20 steps on RTX 4060 Ti
I have a 4060 Ti 16GB, it's around 2.4s/it, 20 steps would be 48 seconds
>>
File: 2024-09-02_00151_.png (1.25 MB, 832x1216)
1.25 MB
1.25 MB PNG
>>102198229
my bad.. sucking works to
>>
>>102198234
Hopefully someone will make a lora.
>>
>>102198234
thats already here.. you can use a WebCrawl img download -> JoyCaption -> FLUX loop
>>
>>102198243
it clearly depends on the resolution
>>
File: lolli.png (172 KB, 962x1390)
172 KB
172 KB PNG
>>102198229
I get nothing unless it's one of these.
>>
File: 2024-09-02_00157_.png (1.33 MB, 832x1216)
1.33 MB
1.33 MB PNG
>>102198187
>>
File: ezgif-7-3a8d4b834a.png (909 KB, 1024x1024)
909 KB
909 KB PNG
>>102198225
It didn't quite understand...
>>
File: 2024-09-02_00160_.png (1.11 MB, 832x1216)
1.11 MB
1.11 MB PNG
>>102198272
see >>102198274 you happy now?
prompt
>Photorealistic blonde woman eating a small ball like lollipop. The lollipop is inside her mouth.
>>
>>102198264
That's EXACTLY what I did for the picrel of that post, to show the state of the art and that it doesn't look anything like the source image.
>This is a digital rendering of a surreal, fantastical landscape. The image features a massive, elongated mountain that appears to be floating in mid-air, stretching from the top left to the bottom right. The mountain is covered in various shades of green and brown, indicating a mix of vegetation and rocky terrain. At the top of the mountain, there is a small, clear blue lake with a cascading waterfall that flows down the side of the mountain, adding a dynamic element to the scene. The mountain's peak is covered in a sparse, brownish landscape, suggesting a dry or barren area. In the foreground, there is a flat, grassy plain with a gradient of light green and white, suggesting a distant horizon. The sky is a gradient of blue, transitioning from a deep blue at the top to a lighter blue near the horizon, creating a sense of depth and distance. The texture of the mountain is rough and rugged, while the lake and waterfall have smooth, reflective surfaces. The overall style of the image is hyper-realistic, with a focus on natural elements and a sense of grandeur. The surrealism comes from the floating mountain and the unusual positioning of the waterfall.
>>
>>102198274
Excuse me, I have to go now to... do some business.
>>
>>102198285
>>102198274
>>102198244
guidance?
>>
>>102198289
yea I wish someone would figure out how to use T5xxl in JoyCaption instead of llama .. then we would get the discriptions directly out of the horses mouth
>>
>>102198285
Yes, yes, no complaints, Flux nails it.
>>
>>102198055
Is it a Cat Lora?
Because that's cool.
Cats are cool, cool cat.
>>
File: ComfyUI_02859_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>People getting all pissy about LoRA size when you can now make 128 dim LoRAs for 10MB
>>
File: 2024-09-02_00161_.png (1.22 MB, 832x1216)
1.22 MB
1.22 MB PNG
>>102198300
guidance 3.5, cfg 1, no hack
>>
>>102198267
No shit, Sherlock, but if different resolutions are in play and aren't stated then the performance figures are useless.
Mine are at 1024x1024 which is the reasonable default to test Flux speed at.
>>
>>102198315
>cool cat
That term is already reserved for things that are retro, vintage, or old school, you can't just call a cat that is cool that way.
>>
File: FD_00143_.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>102198315
Yes it is a lora of my cat.
>>
File: bf16.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
https://imgsli.com/MjkzMzU4
I still don't get why Q8_0 is closer to bf16 compared to a bigger version of Q8_0
https://huggingface.co/mo137/FLUX.1-dev_Q8-fp16-fp32-mix_8-to-32-bpw_gguf/tree/main
>>
File: 150714_00001_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>102198187
I kinda gave up after a while.
>>
>>102198304
Wouldn't be any better unfortunately. It's just not how it works. Also consider that T5 is frozen, it wasn't touched at all during Flux's training.
>>
File: lol.png (657 KB, 1024x1024)
657 KB
657 KB PNG
>>
>>102198413
the hand paw brings fear into my heart
t-thanks flux
>>
File: 2024-09-02_00180_.png (1.35 MB, 832x1216)
1.35 MB
1.35 MB PNG
>>
File: 2024-09-02_00182_.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>
File: 31760.png (758 KB, 1024x1024)
758 KB
758 KB PNG
>>
>>102198450
>all these flavors and you choose to be salty
>>
Kek more cats with lollipops to combat the coomers please
>>
File: ComfyUI_01189_.png (985 KB, 1024x1024)
985 KB
985 KB PNG
>>
>>102198459
what if I told you...
>>
File: 2024-09-02_00186_.png (1.34 MB, 832x1216)
1.34 MB
1.34 MB PNG
>>
>>102198463
Oh God no... Anon... No...
>>
File: 000000_17223_.png (1.38 MB, 998x1459)
1.38 MB
1.38 MB PNG
>>
File: 2024-09-02_00187_.png (1.31 MB, 832x1216)
1.31 MB
1.31 MB PNG
>>
File: ComfyUI_01192_.png (663 KB, 1024x1024)
663 KB
663 KB PNG
>>
>>102198460
>>102198483
ah this is how NASA fakes their satellite photos
>>
File: ComfyUI_0627_png.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
File: ComfyUI_hgdf_00227_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
>>102198357
Here's a comparison between all the alternative quants: https://imgsli.com/MjkzMzY1/0/1
>>
File: 000000_17225_.png (1.75 MB, 1040x1520)
1.75 MB
1.75 MB PNG
>>102198579
hehe
>>
File: main.jpg (1.9 MB, 3361x2497)
1.9 MB
1.9 MB JPG
big artwork collection in progress, should be 1m+, various mediums, high res in most cases
>>
File: 2024-09-02_00194_.png (1.4 MB, 832x1216)
1.4 MB
1.4 MB PNG
>>
File: 2024-09-02_00197_.png (999 KB, 832x1216)
999 KB
999 KB PNG
>>
>>102198492
not really, you can observe these things with a Telescope yourself.
>>
File: 2024-09-02_00203_.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>
Dark Moon Greatsword
>>
should i feel bad for scraping an artists pixiv and training a lora on it
>>
https://github.com/city96/ComfyUI-GGUF/pull/92
>LoRA/etc should no longer reload the model on weight changes or when enabling/disabling/muting them
Chat is this real?
>>
File: 2024-09-02_00205_.png (1011 KB, 832x1216)
1011 KB
1011 KB PNG
>>102198729
Ice Pop Moon greatsword!
>>
>>102198736
no
>>
>>102198746
good.
>>
>>102198736
where do you think this all comes from
>>
File: ComfyUI_hgdf_00039_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>102198593
no
>>
Why are recent models still struggling to gen photo realistic succubus cosplay?
>>
>>102198736
you should feel bad for not scraping it already
>>
File: 1699740191799455.png (2.82 MB, 1248x1824)
2.82 MB
2.82 MB PNG
Does anybody know what style or lora was used here? The character is supposed to be Zeta from granblue.
>>
>>102198772
you probably have to be more descriptive, there are so many different depictions of succubi, some are horror like creatures, some have tails, some have horns, some don't etc.
>>
>>102198772
Why gen fake succubus when you can gen real ones?
>>
File: 2024-09-02_00213_.png (936 KB, 832x1216)
936 KB
936 KB PNG
>>102198695
Did you say carrotoscope?
>>
File: ComfyUI_01173_.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>102198801
no I did not.
>>
>>102198799
I tried to gen real female centaurs and didn't get anywhere. Seems there's a lack of source data.
>>
File: file.png (481 KB, 512x512)
481 KB
481 KB PNG
The turk have a new dataset in his pipeline... are you excited for his new patreon exclusive step by step training guides for Kohya?
>>
>>102198817
this looks like fat spoony lmao
>>
File: ComfyUI_01194_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>102198823
no
>>
File: file.png (13 KB, 916x306)
13 KB
13 KB PNG
>>102198739
It is true, I just changed the strength of a lora and removed some and it doesn't do the unloading/reloading anymore, city really is a magician.

To use that open PR you go here:
ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-GGUF

Then on your cmd command you do this:
- git fetch origin pull/92/head:PR-92
- git checkout PR-92

Once he'll get this shit merged, you'll have to do "git checkout main" + "git pull" to get the official code
>>
>>102198832
Cute birb
Captcha:
STDG
I think it's trying to give me new general names
>>
>>102198845
>city really is a magician.
I can't overlook xe's fruitcake melty where he doubled down on estrogen pills and hallucinated illya was bullying him so threw a public GitHub meltie lmao
>>
File: ComfyUI_01195_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>102198850
>>
>>102198873
no one is perfect anon, he did way more cool stuff than uncool stuff, that fix is really important, now I can switch loras whenever I want without having to wait for ages for the unload/reload shit, it's way more convenient
>>
File: 2024-09-02_00215_.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
>>102198806
I am sure you did. I can see Saturn through my carrotoscope!
>>
File: lollimiku.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
I failed to make her lick the lollipop, failed generations never looked this good.
>>
>>102198879
Very cute!
>>
File: ComfyUI_01196_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
File: 2024-09-02_00218_.png (1.37 MB, 832x1216)
1.37 MB
1.37 MB PNG
>>
>>102198845
isn't this also an issue with ComfyUI? I don't use GGUF and the model has to reload into VRAM every time I change the lora stack or strength
>>
>>102198873
buzzword bingo
>>
File: 1696437637376434.png (1.35 MB, 896x1152)
1.35 MB
1.35 MB PNG
>>
File: 000000_17227_.png (2.07 MB, 1032x1508)
2.07 MB
2.07 MB PNG
>Great the subject is in focus.....
>>
>>102198933
>isn't this also an issue with ComfyUI? I don't use GGUF and the model has to reload into VRAM every time I change the lora stack or strength
yeah, somehow he managed to make it work on ComfyUi, I think it's still an issue on the regular loaders (fp8, bf16), but for his gguf node it just werks, guess that Comfy should inspire from his code to also make it work on his "Load Diffusion Model" node aswell, because that's a really important feature to have
>>
>>102198933
reload on lora stack is normal.. it needs to calculate weights new, on strength change is wonky
>>
>>102198951
Describe the background you want with a lot of detail and it will be as sharp as the squirrel.
>>
File: 1713556248559905.png (1.26 MB, 896x1152)
1.26 MB
1.26 MB PNG
>>
>>102198969
>ok, ty
>>
>>102198951
its a shame that most photorealistic gens have insane aperture .. its like every photo in the dataset was shot fit f/1.1 or something.. I have tried to get rid of it with negatives, but it only removes a little bit of blur
>>
>>102198957
>reload on lora stack is normal.. it needs to calculate weights new,
nope, not normal, because city managed to not make it reload when you change loras
>>
I had to reinstall forge and now when I try and gen with flux I get "TypeError: 'NoneType' object is not iterable" what the fuck could I have done? it can gen sd1.5 although for some reason it looks a lot worse than before...
>>
>>102198979
looks like a horizontal tilt shift was applied.. I have a negative with 'Bokeh, Tilt-shift,' see if that helps a bit..
>>
>>102198973
Nice candid shots.
>>
>>102198981
"normal" without accounting for 10-hour-old commits then
>>
File: 1717413962022070.png (1.31 MB, 896x1152)
1.31 MB
1.31 MB PNG
>>
>>102198935
Pretty accurate to what happened desu
https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/1503
>>
>>102199006
if it was possible to make it work without unloading/reloading, it means that it was never normal in the first place, I fucking hate when people say shit like "oh it's normal to have a feature that should exist but doesn't"
>>
>>102199021
"normal" as in "common", the general scenario for something
>>
File: 1701822965216553.png (1.2 MB, 896x1152)
1.2 MB
1.2 MB PNG
>>
File: ComfyUI_01201_.png (1.82 MB, 1024x1024)
1.82 MB
1.82 MB PNG
>>
>>102198989
okay I thought I fixed it but as the image finished generating it turned into a solid blue square, doesn't say an error or anything
>>
>>102199037
yeah I get what you mean anon, I want to be angry on Comfy for not implementing this feature because he had 2 years to do that but I'll accept that it wasn't that important until recently because the model we used to have were small as fuck so the unload/reload was fast enough to not be noticed
>>
>>102199021
It's normal for comfy to be a hack that can't outcode a troon that uses chatgpt, though, that much is true.
>>
>>102198989
I want back to vanilla A1111 until Forge stabilizes. It's gone from dead to unstable.
Try removing/updating extensions maybe?
>>
>>102199057
I think forge also unload/reloads when you change a lora on flux? It's not just a ComfyUi issue
>>
>>102199017
>I do not care for a public apology or the sort, nor do I care about "drama". I especially do NOT want credit for the hard work of others.

Not really.
>>
File: ComfyUI_01202_.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
File: 1716344797505450.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
File: Knitlolli.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
Some prompts are just too hard for Flux.
>>
>>102199049
Check you don't have any of the dropdown buttons accidentally ticked, like with controlnet and such?
>>
>>102199054
yeah that makes sense
good to see it fixed anyway
>>
>>102199049
It's the new safety checker, blue is the new black.
>>
>>102199071
>He says, as he makes a public issue about it spazzing out then doubles down when it's explained he's hallucinating imaginary greviences
Lol ok chitty, keep it up
>>
File: ComfyUI_01203_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>
>>102199081
>>102199089
I guess I'm retarded. I had installed a flux Vae and tried using it for the first time which was resulting in the blue screen. So flux doesn't need a vae?
>>
>>102199098
I don't expect you to change your mind about anything; I just don't agree with your meme word description.
it's okay, anon.
>>
>>102199098
I hope you have the same energy on Comfy when he cried that Forge was copying all his code, I mean, what can I say? Those people work hard and can be frustrated when they don't get the recognition they feel they deserved, call that an egotrip or whatever, but at the end of the day, I'm glad those guys exist because we can enjoy diffusions models without much torture
>>
>>102199099
why do anime and digital art Loras all get those same weird elongated fingers with the weirdly rounded tips. It's not as extreme in this picture but I've noticed it in a few now, seems to be unique to loras with animu esque styles or digital art om flux..
>>
>>102199104
Really? You're asking a prankster? I have no idea about what you're talking about. Let me try again:
They had implemented the Blue Screen Of Death from Windows in Flux, which appears instead of your generated picture.
>>
File: ComfyUI_01206_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>102199132
I didnt use a LoRa for that image
>>
File: ComfyUI_01586_.jpg (1.08 MB, 1920x1440)
1.08 MB
1.08 MB JPG
>>
>>102199132
flux can't do anime, in fact it hates it for some reason. some anime loras can help, but they need to be styles that don't look like the usual flux anime slop
>>
>>102198739
>>102198845
City, if you're reading this, much love. That was the last important thing I wanted it to work, I was weary to experiment with loras because that unload/reload shit was slow and tedious as fuck, now I can change the strength of a lora as easily as changing prompts and the result is instant.
>>
File: 2024-09-02_00233_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>102198998
sadly does not work.. background cat still is a blurry mess
>>
>>102199125
Anon there isn't a single person on /g/ or even plebbit who didn't have that "same energy" for comfy being am absolute faggot then, either. Chitty can samefag to defend himself all he wants, just shows he has 0 shame for his retarded little speel.
>>
File: ComfyUI_05896_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>102199152
>but they need to be styles that don't look like the usual flux anime slop
I agree with that, there's one lora that give exactly the sovl Flux desperately needed all along:
https://civitai.com/models/7227?modelVersionId=782696
>>
>>102199144
Wtf does flux just have fucky fingers trained in for anime then
>>
Is there a way to look at the weights of a model? I want to verify if my "flux1-dev.sft" has bf16 or fp16 in it
>>
>>102199187
because flux has probably seen less anime pictures than realistic pictures, less training -> less good
>>
>>102199017
wow that gave me hardcore 2nd hand embarrassment to read through.
>>
File: Boo.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
Boo! It failed to draw a lollipop the only time I needed it, that was the whole point...
>>
>>102198079
>>102198148
>>102199181
>three times
>>
>>102199215
are you on CFG = 1? if yes then go for some CFGmaxxing + AutomaticCFG and Flux will listen to your prompt better
>>
>>102199060
-glasses: fucked, -fingernails: fucked, -plastic horns? size: 9999999x9999999.
>>102199159
I tried, and eventually gave up.
>>
File: ComfyUI_01210_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>102199187
I have no clue
>>
>>102199197
import safetensors

path = "model.safetensors"
weights = safetensors.safe_open(path, framework="pt")
for key in weights.keys():
tensor = weights.get_tensor(key)
shape = tensor.shape
dtype = tensor.dtype
print(f"{key} {shape} {dtype}")
>>
File: 1708927189497084.png (1.35 MB, 896x1152)
1.35 MB
1.35 MB PNG
>>
>>102199253
excellent anon, that was exactly what I was looking for, thank you
>>
File: 2024-09-02_00243_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>102199231
>I tried, and eventually gave up.
ya.. as soon as you mention "photo" in the prompt it goes ultra bokeh
>>
File: Boo2.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>102199215
Damn, it just keeps forgetting the long white stick of the big lollipop for some reason :(
>>
File: ComfyUI_01211_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>
>>102199293
I like it
>>
>>102199229
This is huggingface to stuck at CFG = 1 - but from what I've seen it's probably a skill issue because I don't know how to prompt it (one was anthropomorphic red ball like lollipop, the other happy ball like lollipop, perhaps the ball part trips it up.)
>>
File: 2024-09-02_00248_.png (957 KB, 1024x1024)
957 KB
957 KB PNG
>>102199280
and it even does that in anime gens, but not as extreme
>>
File: ComfyUI_01213_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>102199307
>>
>>102199231
>9999999x9999999.
Can't even count. It's just 2048x3072.
>>
File: 1698771498701454.png (1.27 MB, 896x1152)
1.27 MB
1.27 MB PNG
>>
>>102199349
THE POINT IS.. ah whatever. you go
>>
File: 2024-09-02_00225_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>102199293
>>102199321
magestic birbs!
>>
>>102199318
can you give me your prompt, I wanna see if it works better at cfg = 6 on my machine
>>
File: Lolliface.png (401 KB, 1024x1024)
401 KB
401 KB PNG
>>102199291
So I guess I need to do it on 2 steps, first I learn to make the character...
I...
I guess this one is fine...
>>
File: ComfyUI_01215_.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>102199388
>>
File: 2024-09-02_00257_.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>102199407
cute
>>
File: example.png (286 KB, 1378x1378)
286 KB
286 KB PNG
>>102199393
Sure!
>This image is a vintage advertisement for a "Lollipop Lollipop" product, featuring an anthropomorphic red ball like lollipop with cute eyes in a classic 1950s style. She is depicted sticking her tongue out eating a smaller red ball like lollipop, She is wearing headphones, suggesting a DJ setting. The background is a party stage, which emphasizes the subject. The text above her reads, "The lollipop lollipop!" in a playful, cursive blue font. To the right of her, there is a lollipop box, labeled "Lollipop Lollipop" in blue and yellow lettering, with the anthropomorphic red ball like lollipop holding the smaller lollipop painted on it. The box is positioned to the right of her, slightly behind. Below her, the advertisement includes a text box that reads, "It's lollipopception!" The text box is surrounded by illustrations of the lollipop box, emphasizing its nutrients. The bottom of the advertisement includes text that says "Have you listened to the song?" with small illustrations, promoting the lollipop's nutritional benefits. The overall style of the advertisement is classic, typical of mid-20th-century.
(1024x1024 - Guidance 3 - 28 Steps - Seed: 898)
The goal is to have the arms and legs come out from the white stick, like picrel, Flux just makes them come from the ball and doesn't add what makes it a lollipop.
>>
File: 00117-3088164496.jpg (403 KB, 1552x1200)
403 KB
403 KB JPG
>>102199424
EPIC
>>
>>
File: 00032-1162512079.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
blurry, bokeh in neg, "it just works"
>>
File: Boo3.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>102199399
WHAT?
I didn't ask for a woman or a human!
This is me, giving up :'(
>>
File: 2024-09-02_00266_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>102199443
ty
>>
File: 000000_17233_.png (1.73 MB, 1040x1520)
1.73 MB
1.73 MB PNG
>>
File: file.png (3.23 MB, 3144x1484)
3.23 MB
3.23 MB PNG
>>102199438
Tough luck anon
>>
File: 2024-09-02_00276_.png (1018 KB, 1024x1024)
1018 KB
1018 KB PNG
>>
File: 00038-2585278607.png (2.04 MB, 1536x1536)
2.04 MB
2.04 MB PNG
>>
File: 78g.jpg (149 KB, 1352x1248)
149 KB
149 KB JPG
>Get them from Civtai (requires an account)
>Or from Huggingface (requires an account)
>>
>>102199607
Train your own
>>
File: 2024-09-02_00279_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>
File: 2024-09-02_00282_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>102199582
shocker! is that cannibalism?!
>>
>>102199607
Create an account
>>
File: 2024-09-02_00284_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
never would have thought that anthropomorphic lollipops are so fucking creepy
>>
File: 1711848741228278.png (1.7 MB, 1152x896)
1.7 MB
1.7 MB PNG
>>
File: sdxl_10.jpg (250 KB, 1480x1328)
250 KB
250 KB JPG
Happy labor day
>>
File: Flux lollipops.png (431 KB, 1024x1024)
431 KB
431 KB PNG
>>102199582
Thanks, I'm glad to know the problem isn't related to CFG=1. I can do these ones so I should be able to get there, eventually.
>>
>>102199607
>Or from Huggingface (requires an account)
Not really, you can download without an account.
>>
>>102199654
No, we want it to be cannibalism but have only some random red ball eating a lollipop.
Also, apparently it's a novel concept, I can't find images of it for joycaption :(
>>
>>102199702
not everything, some models need you to log in.. not like that matters if you are security inclined, you can just make a throw away account
>>
File: 2024-09-02_00291_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: file.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>
File: sizes.png (432 KB, 1024x1024)
432 KB
432 KB PNG
>>102199684
Now the left one is "very big", the middle one "of normal size", and the right one "very small".
... I can barely tell...
>>
File: 2024-09-02_00293_.png (868 KB, 1024x1024)
868 KB
868 KB PNG
>>
>>102199743
If models are good they have a version that is a mirror that doesn't require you to login.
Chilloutvlara doesn't have such a mirror, which means it's not worth your time.
>>
File: 9ly.png (3.26 MB, 1472x1704)
3.26 MB
3.26 MB PNG
>Create an account
>>
>>102199794
yea ofc.. you can download flux for example like from a dozen other sources, but the official one is gated with a login
>>
>>102199668
That's not an anthropomorphic lollipop, it's just candy, it's missing the stick, that's a problem we're having with Flux that not even high CFG and advanced nodes in comfy can fix.
>>
File: 2024-09-02_00298_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>
>>102199772
Does that style require a lora? (I'm talking about the moving lines that give it a drawn effect.)
>>
>>102199824
yes
https://civitai.com/models/677362/mario-strikers-style-flux
>>
>>102199807
>from a dozen other sources
Including ones on huggingface, the point is you can still download them from huggingface without login even if they're not the official source.
>>
File: 2024-09-02_00302_.png (868 KB, 1024x1024)
868 KB
868 KB PNG
>>102199812
happy now?
>>
File: ComfyUI_01216_.png (1.96 MB, 1024x1024)
1.96 MB
1.96 MB PNG
>>
File: 2024-09-02_00303_.png (937 KB, 1024x1024)
937 KB
937 KB PNG
>>
>>102199835
Ha, I was going to tell you "it reminds me of the way the characters from Mario Strikers are drawn in their promotional material", I had no idea they created that style for the game.
Also, why don't videogames all look like different styles? They all basically look like Pixar movies...
>>
File: 2024090207.jpg (3.07 MB, 2992x1824)
3.07 MB
3.07 MB JPG
>>
>>102199839
Yeah, what would work, how did you prompt it?
>>
hm
>>
>>102199866
You basically need these guys:
>>102199851
In there.
>>
>>102199852
It's true, there's certainly a need for a greater variety of graphic styles in video games.
>>
File: 2024-09-02_00307_.png (677 KB, 1024x1024)
677 KB
677 KB PNG
>>102199862
that was
>A red ball with eyes and a mouth on a stick is eating a ball like purple lollipop on a stick. The purple lollipop is in the balls mouth. The tongue is visible.
So i basically tricked t5 into making the red ball look like a lollipop but not make it think of it as a lollipop
>>
File: 2024090208.jpg (3.6 MB, 2096x2096)
3.6 MB
3.6 MB JPG
>>102199885
He looks so smug
>>
File: ComfyUI_01219_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
>>102199806
>doesnt create flake email
>>
>>102199918
>flake email
>>
>>102199885
yeah that might be it. I tried replacing various words, "she" with "it" etc but that was not enough. now I got a lollipop with a stick up its arse, great. its also the headphones and whatever. & cool image lol
>>
File: 2024-09-02_00309_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
kek, no flux is trolling me

>>102199896
yep.. the eyes only a mother can love
>>
>>102199932
>uses reel email
>>
>>102199905
people easily see the military toys they get, what they're not seeing is all the blood that is being spilled by both ends, truly a tragedy, jews will tell you that slavs are subhumans, that they have asian blood or whatever bullshit, but they are in fact our brothers, and their blood is being spilled wildly
>>
>>102199952
Dude what
>>
>>102199952
?
>>
File: file.png (2.55 MB, 1024x1024)
2.55 MB
2.55 MB PNG
>>
>>102199952
I sorta get what you're trying to say but this ain't /pol/,... kek
>>
File: 5j7.png (2.98 MB, 2352x1472)
2.98 MB
2.98 MB PNG
>anyone that does not follow SAI license is doing that illegally
>>
Sorry for the spoonfeed request, what are the best autotaggers these days? I think joy caption was considered the best for natural language but whats the best for booru style tags?
>>
File: file.png (2.05 MB, 1024x1024)
2.05 MB
2.05 MB PNG
>>
File: 2024-09-02_00321_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
okay last one.. I wanna prompt something less creepy
>>
i missed u
>>
>>102200050
I missed your mom's buns. jk I missed her cookies. wholesome
>>
>>102199975
no anon nooo
>>
>>
>>102199958
>>102199968
>>102199978
just the picture made me think too much
>>
>>102199975
now we're talking
>>
File: ComfyUI_01223_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>
>>102199885
That's very impressive! Now I feel like using Joycaption has fried parts of my brain that would think like that.
>>
>>102200140
How original
>>
File: file.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
>>
File: ComfyUI_01224_.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>102200153
>>
>>102200140
the godzilla megaton workout one was better dude.
>>
File: 2024-09-02_00289_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102200161
>>
>>102200216
aww that's cute
>>
File: 000000_17231_.png (2.74 MB, 1032x1508)
2.74 MB
2.74 MB PNG
>>
File: 2024-09-02_00333_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>102200229
ty
>>
File: 000000_17236_.png (2.11 MB, 1032x1508)
2.11 MB
2.11 MB PNG
>BANANA!!
>>
File: file.png (2.1 MB, 1024x1024)
2.1 MB
2.1 MB PNG
>>
>>102200256
https://www.youtube.com/watch?v=gNaAiQo2eaE
>>
File: 00099-3260908083.png (2.14 MB, 1440x1440)
2.14 MB
2.14 MB PNG
>>
>>102200172
>>102200153
>>
File: file.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>102200270
>>
>>102200279
very nice anon. a lot less creepy than the real thing.
>>
>>102200290
more original and funny than this lolli autism bullshit.
>>
File: Boo4.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>102199939
I think Flux trolls me the worst!
>>
>>102200279
This is an AI image general, not a place to show your toys.
>>
>>102200372
I think the headphones push it over into the "must have legs" vector
>>
>>102200372
I like that one though
>>
>>102200339
Yeah bro it hasn't been copypasted all over /fit/ for aeons for sure
>>
File: Untitled.png (125 KB, 1801x1279)
125 KB
125 KB PNG
>>102199017
>>102199098
>imaginary greviences
I'm doubling down. He can go fuck himself for stripping the license from the llama.cpp code and putting it under the dogshit known as AGPL then shitting on the actual llama.cpp team for now reason.
Also here's the ChatGPT commit, he was 100% mocking me for no fucking reason in multiple commits like I'm too retarded to rewrite 10 lines of numpy code to pytorch. As if I'd ever give altman my phone number.
https://github.com/lllyasviel/stable-diffusion-webui-forge/commit/e5f213c21e02fd90c56a0c5e8e776ee4a20c18d9

>hurr durr they them
If I ever start with idpol shit you have the permission to shoot me execution style.
>>
>>102199851
Uoooohhhh
>>
File: 2024-09-02_00335_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>
>>102200444
city, why are you feeding the trolls?
>>
File: 2024090204.jpg (2.15 MB, 1888x2208)
2.15 MB
2.15 MB JPG
>>102200457
>>
>>102198739
>>102198845
Doesn't work, you need some gguf node to load loras?
>>
File: file.png (678 KB, 2617x1441)
678 KB
678 KB PNG
>>102200488
not really, you just go for the gguf loader + a regular lora loader, are you sure you did the cmd command well, you can verify that you're on the good branch by doing "git branch"
>>
>>102200444
+1 for stating the obvious, doesn't go unnoticed. ignore the fuckwits. also +1 for bringing up the comfy thing.
>>
>>102200472
Indoor rain? Why not.
>>
>>102200504
does it stop lora reloading for the non-quant models too?
>>
>>102200525
it only work for the gguf loader because that's here that the code has changed
>>
File: 2024-09-02_00336_.png (1.41 MB, 832x1216)
1.41 MB
1.41 MB PNG
>>102200472
>>
>>102200523
Its an open air throne room. Watch some movies about egypt
>>
>>102200523
could be a large walk in shower with a seat
>>
>>102200539
>bzzzzzbzzzzzzzbzzzz
>>
File: 2024-09-02_00340_.png (1.6 MB, 832x1216)
1.6 MB
1.6 MB PNG
>>102200523
there was this story about a Zeppelin construction hall in Germany that was so big that it actually developed micro climate and it started to accumulate indoor clouds and it started to rain...
>>
File: file.png (2.34 MB, 1024x1024)
2.34 MB
2.34 MB PNG
>>
>>102200583
ever feel like getting off that miku trip?
>>
File: file.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>102200618
Try to guess
>>
Bread delivery...
>>102200628
>>102200628
>>102200628
>>
>>102200504
>>102200533
Is this normal, it loads flux clip model each generation

>Requested to load FluxClipModel_
Loading 1 new model
>>
>>102200645
if you haven't put the text encoder elsewhere (on the cpu or on a second gpu) I'd say that's normal?
>>
File: 1702120982908094.jpg (91 KB, 1080x729)
91 KB
91 KB JPG
How does one actually make a LORA? Does it take long?
>>
>>102200742
you collect 20+ images, put em into a folder numbered 1.jpg, 2.jpg etc. write a caption text (or not) 1.txt, 2.txt etc .. then run it thru one of the lora making scripts like kohyas or ai-toolkit for a few hours and you are done (alternatively you use a webservice if your GPU sucks)
>>
>>102200769
Other anon.
So where's the skill involved here that makes the difference between a good and a bad lora?
>>
>>102200837
A good data set
Good captions/tagging
Picking the right training settings
>>
>>102200837
>So where's the skill involved here that makes the difference between a good and a bad lora?
I'd say more pictures + good captions, there's no secret anon, the better the data quality, the better the lora
>>
>>102200864
>>102200869
Is it important to replicate the captioning convention used for the original training set of the base model?
>>
>>102200909
The model is no say in that, its the text encoder. If the target model uses just CLIP, you need to do tag prompt that is simple and short.

If the target model uses T5xxl you should use a natural language description, although there are ppl that say FLUX does not need any captions at all.. I am not so sure about that. My lora when done with natural language captions done with JoyCaption and manual editing performs better
>>
>>102200909
1.5/SDXL/PonyXL need booru style tags. There's some disagreement about Flux but in my experience all methods of captioning work quite well
>>
ok
>>
>>102200172
Do some more, i'm laughing my ass off.
>>
Booru Tags Chad
>masterpiece, high quality, 1girl, giant boobs, maid outfit, freckles, blue eyes, smiling, black hair
Natural Language Virgin
>This is a high-resolution photograph featuring a young woman dressed in a classic French maid outfit. She has a fair complexion with a sprinkle of freckles on her cheeks and nose, and her skin appears smooth. She has long, straight, jet-black hair that cascades past her shoulders, complemented by a white maid's headpiece with a ruffled edge. Her eyes are a striking blue, and she has a gentle, inviting smile.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.