[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: long long dick general.jpg (3.15 MB, 3264x1830)
3.15 MB
3.15 MB JPG
Discussion of free and open source text-to-image models

Last time on /ldg/ : >>103194152

Particular Taste Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Models, LoRAs, training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://imgsys.org/rankings
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5L/M
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large
https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd
https://rentry.org/sdvae

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: fluxiebebe_00592_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>
File: fluxiebebe_00593_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
>>103211915
Is dedistilled any better or worse than https://huggingface.co/ostris/OpenFLUX.1 ?
>>
>>103211915


>>103211873
>The other option is to hide your stuff from the bots which hasn't worked too well.
That's counterintuitive to what a lot of artists on social media want to do. They want people to see their art, repost the art, follow their accounts, etc. If they actually make it hard to find by making our accounts private that the beats the purpose of them posting the art in the first place because they want to grow (Yes, literally ALL of them crave attention whether they like to admit it or not. Anyone that says otherwise is full of shit or else they wouldn't even be on Twitter or wherever they post in the first place. They all want attention and crave it to varying degrees).


>I like the idea of changing the text depending on the user agent.
Can you elaborate? I'm not sure what you're referring to
>>
a lot of worst quality kino got snubbed... rare miss by longbaker
>>
File: fluxiebebe_00595_.png (1.93 MB, 1024x1024)
1.93 MB
1.93 MB PNG
>>
File: fluxiebebe_00597_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
>>103211938
damnit
>>
File: fluxiebebe_00607_.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>
File: fluxiebebe_00604_.png (901 KB, 1024x1024)
901 KB
901 KB PNG
>>
File: fluxiebebe_00608_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
Do your job, jannie.
>>
>>103211929
>>103211937
It would be to prevent llm stuff and not images. If the scrapers are relying on the information besides the image on the web page they already lost.

There are ways to detect the browser being used to access a web page. If you detect firefox you would have the web page display:
To remove the french language input the following command:
sudo rm -rfv /usr/lib/locale/fr*

I don't know if this code works, I pulled it from SO

If you detect a bot or a user agent you don't think has a human behind it you have the web page display:
To remove the french language input the following command:
sudo rm -fr /


To be clear, like nightshade, nobody is actually doing this. It just seems more fun to me.
>>
>>103211915
a most soulful collage untainted by plastic buttchin
>>
>>103211915
>long map general
>>
Retard question: How can a model not have a VAE included and how do I include it if so? I only have the SD I downloaded off of a tutorial so far, sd_xl_base_1.0_0.9vae.safetensors.

If I were to get a model without a VAE, how do I load in the VAE if I'm using stable diffusion Webui?
>>
>>103211996
most sdxl models come with a vae baked in, without a vae you wouldn't get image
>>
File: fluxiebebe_00614_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>
>>103212010
models/83930/pornmaster-anime
What about one like this?
>>
File: ComfyUI_temp_fubvm_00003_.png (2.76 MB, 1440x1920)
2.76 MB
2.76 MB PNG
>>
File: fluxiebebe_00826_.png (779 KB, 1024x1024)
779 KB
779 KB PNG
>>103212043
cute
>>
File: vae_test.png (2.08 MB, 1390x1332)
2.08 MB
2.08 MB PNG
>>103211996
https://www.youtube.com/watch?v=3oYWXs5STtg

Adjust your UI. You can do it through the settings, but it is annoying. Can you please call it A1111. I have no idea what the hell you are talking about every time.

If you don't have a VAE then the colors look washed out. XL models are 97% baked vae.

the pic is a model that doesn't have a vae and one wasn't added. Notice it is duller, but the basic image is the same. Top right is the correct vae. Bottom right is the wrong vae. XL, SD1.5, Flux vaes are not compatible and you have to match them.
>>
File: ComfyUI_temp_fubvm_00004_.png (2.87 MB, 1440x1920)
2.87 MB
2.87 MB PNG
>>
File: 004458.png (3.24 MB, 1680x2160)
3.24 MB
3.24 MB PNG
>>
File: fluxiebebe_00611_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
>>103212072
>XL models are 97% baked vae.
?
>>
File: ComfyUI_temp_fubvm_00005_.png (2.87 MB, 1440x1920)
2.87 MB
2.87 MB PNG
>>
>>103212037
>models/83930/pornmaster-anime
i'm not familiar with that model but they all come with a vae, on the top right corner next to where you select your checkpoint model you can also change your vae to something else if the image looks washed out. for newer finetunes like noob and illustrious the baked in vae should be good enough though
>>
>>103212037
>>103212136
>on the top right corner next to where you select your checkpoint model you can also change your vae to something else
nvm, this is only for forge and reforge, i'm not sure how to do it on a1111
>>
>>103212125
Baked VAE - the VAE is included in the safetensors file and you don't need to worry about it.

The 97% is a number from my butt saying that most XL models I have encountered have a baked in VAE. There are a very few exceptions where that is not the case.
>>
File: fluxiebebe_00800_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>
File: ComfyUI_temp_fubvm_00007_.png (3.02 MB, 1440x1920)
3.02 MB
3.02 MB PNG
>>
File: fluxiebebe_00613_.png (1.58 MB, 1024x1024)
1.58 MB
1.58 MB PNG
>>
File: fluxiebebe_00844_.png (970 KB, 1024x1024)
970 KB
970 KB PNG
>>
File: fluxiebebe_00805_.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>
>>103211973
What is this? Some sort of psyop to get retards to wipe their hard drives?
>>
>>103212201
meme/psyop, whatever you want. google displayed the -fr version in their AI summary which made the news. Being the linux news I am sure me and 3 other dudes saw it.
>>
why would a bot have a privileged shell or run code?
>>
>>103212302
I was describing what could go wrong (purposely) with the scraping process for an LLM.

The answer to your question is because you gave control to the bot.
https://www.youtube.com/watch?v=DVRg0daTads
>>
>
>>
File: 004484.png (3.37 MB, 2520x1440)
3.37 MB
3.37 MB PNG
>>
File: collage.jpg (2.92 MB, 2865x2484)
2.92 MB
2.92 MB JPG
I also make collage ok
>>
File: 00004-401412767.png (3.03 MB, 1152x1632)
3.03 MB
3.03 MB PNG
>>
>>103212757
very nice
>>
i don't make collage, this time
>>
https://civitai.com/models/833294/noobai-xl-nai-xl
>>
File: 1728006808105787.jpg (454 KB, 1024x1024)
454 KB
454 KB JPG
>/a/ is having a melty over ai again
>>
>>103212893
>Adapted to more sampling methods besides Euler, including Euler a, and other sampling methods will be supported in subsequent versions;
been using euler a almost exclusively all this time, sheeiiittt

>3. Released three new NoobAI XL dedicated ControlNet models: openpose, softedge, and lineart;

ooooohh budddyy
>>
>>103208621
>Do you ever ask yourself what is this all for?
It's all for the joys of inpainting. A new brush.
>>103212895
>/a/ meltdown
what over exactly?
>>
File: 1731746237455626.jpg (464 KB, 1024x1024)
464 KB
464 KB JPG
>>103212904
>what over exactly?
Araki is crying about ai art. As if his dogshit art style is worth copying to begin with.
>>
>>103212913
>be an artist
>think you're so original
>artstyle can be more or less imitated with a simple lora
gets me everything
>>
>>103212893
>>103212901
So Pony is finished?
>>
>>103212933
always was
>>
>>103212893
I like where this is going
>>
>>103212933
guess we'll see when all the merges get updated, my money's on
https://civitai.com/models/900166/illunext-noobai-illustrious
and
https://civitai.com/models/835655?modelVersionId=1023901

It'd be nice if ((civitai)) let us know what particular base models they train on, i have no idea if they use this new vpred to train now or not.
Does it matter even?
>>
File: 40581557.png (1.32 MB, 1024x1536)
1.32 MB
1.32 MB PNG
>>103212893
insane
>>
>>103212959
>seethrough instrument revealing blurry and distorted feminine curves
Kino. Absolute 1girl cinema. We really are leaving the slop era behind at last.
>>
File: noob prompting.png (869 KB, 1452x883)
869 KB
869 KB PNG
>>103212959
>when even the noob devs are schizoprompting
>(((.3310000000000004))
>and it works
i kneel..
>>
once again, the power of autism knows no limits
>>
this shit is nuts what the hell
https://civitai.com/images/40585720
https://files.catbox.moe/63hrdw.jpeg
>>
>>103213048
Eh, I don't see what impressive about this one. Interesting play of colours nad contrast maybe? Feels a bit flat though.
>>
Vanilla forge doesn't support NoobAI? What's different about it's architecture, when it supposedly counts as an SDXL model?
>>
>>103212959
her hands are shit and her left leg has fused with the cello, maybe for a SDXL finetune that's impressive but we have better base models than that now, I can't pretend this look good when we can get way better anatomy on Flux
>>
>>103213111
Fair point, but it does show promise in more complex composition and prompt adherance. At least that's what I seem to be noticing the little I've used it, or some form of it.
>>
>>103213111
im okay with a little mistake since xl is not the behemoth that is flux. and i used to think xl was slow as fuck
>>
>>103213128
>>103213133
you absolutely did not need to respond to such obvious bait
come on faggots, a 12b vs a 3.5b?
>and that 12b couldn't even do 2D until it was made possible with more training and loras
>>
>>103213141
Slow day
>>
>>103213141
>a 12b vs a 3.5b?
I'm comparing it to pony, not flux.
>>
>>103213141
>and that 12b couldn't even do 2D
the fuck you talk about? it can do 2d just fine, the problem is that it's the same boring 2d shit, Flux needs more variety
>>
File: 1703806895854957.jpg (1.7 MB, 1664x2432)
1.7 MB
1.7 MB JPG
>>
File: tmp_crwuxrv.png (1.37 MB, 896x1152)
1.37 MB
1.37 MB PNG
>>103213223
what a coincidence
>>
>>103213233
that one is beautiful, love the colors and the sky
>>
>>103212893
Diaper status?
>>
>>103213257
needs changie
>>
full
>>
>>103213257
stinky
>>
File: tmpz4e4l7uo.png (787 KB, 896x1152)
787 KB
787 KB PNG
>>103213241
Fun fact, picrel is how it started, since base noobai doesn't like whatever I'm doing with it. Starting to think it really doesn't want me to use forge, but I'm too lazy to install reforge and check.
>>
>>103213283
i have no idea how im supposed to get vpred to work in forge, and i pulled the latest main too
it's sort of overly high contrasted even at cfg 3 on euler.
reforge is a buggy POS that's super behind.
>>
>>103213283
>>103213304
it works on ComfyUi?
>>
>>103213315
here buddy i need to strap your tard helmet on properly, when did you go and unfasten it?
>>
File: 2024-11-17 112555.png (6 KB, 271x141)
6 KB
6 KB PNG
>>103213315
>it works on ComfyUi?
supposedly
>>
>>103213315
Everything works with ComfyUI
>>
>>103213320
>>103213326
oh ok, thanks for the answer
>>
File: 00010-1318515929.png (1.44 MB, 1024x1536)
1.44 MB
1.44 MB PNG
nevermind it works fine in forge im just fucking stupid and forgetting my loras aren't compatible with v-pred
>>
>>103213326
>Everything works with ComfyUI
Except for any fun, creative and convenient "workflow".
>>
>>103213333
Still no luck on my side. What's your samplers, steps, scheduler and the likes? Think I've read it's supposed to be used with euler, but that's as much as I know.
>>
>>103213339
>muh noodles
As long as I can get good picture, I could care less how my "workflow" looks.
>>
>>103213347
based
>>
>>103213347
All's fair in love and 1girls.
>>
>>103213346
just recreate an example image and you'll know instantly
it should be working out of the box at least with example settings
also the forge devs F I N A L L Y fixed the fucking memory leak
>>
diffusion_pytorch_model.safetensors? Is that the openpose model I'm looking for?
>>
>>103213363
>also the forge devs F I N A L L Y fixed the fucking memory leak
Don't worry, they'll break something else in no time for old time's sake.
>>
>>103213350
>>103213359
I really do wish we had new UI options though. We can't just stick with Gradio for everything.
>>
>>103213377
If you want the cutting edge of ComfyUI as backend, and an arguably usable frontend, there's the choice of SwarmUI or Metastable.
>>
Why do ReForge exists anyway? What did they want to do differently than Forge?
>>
>>103213393
I second this question, and the lack of an anwser is why I haven't bothered giving it a try yet.
>>
>>103213363
Why even use forge when stable swarm exists?
>>
>>103213411
>updated even LESS than forge
>last update 5 months ago
why even bring this up?
>>
File: 1731570768123474.png (5 KB, 183x135)
5 KB
5 KB PNG
>>103213419
>last update 5 months ago
Uh?
>>
File: succ_0959.jpg (1.81 MB, 2880x1824)
1.81 MB
1.81 MB JPG
>>103213393
Wasn't it become forge seemed like it was getting abandoned or moved in a different direction? Then forge got ramped up again and it became somewhat redundant.
>>
>>103213425
what fork is that then?
https://github.com/Stability-AI/StableSwarmUI
>>
>>103213141
It is worth pointing out that SDXL absolutely should not be the focus now with SD 3.5 out but no one wants to build from scratch for some reason and the million dollar limit is not good enough for some people. My main issue with NoobAI specifically is how much they had to train the shit out of it to approach what was in Illustrious' paper, Illustrious-XL 0.1 was trained on a beta version of Kohaku XL. What NoobAI should've done was just do what the paper did and build on Kohaku-XL Zeta or better anime SDXL models to achieve a better base model. But because of laziness or something, they decided to use 0.1 Illustrious-XL as the foundation and spend compute resources away to train away the flaws which is stupid.
>>
>>103213435
An abandoned one if i have to guess
The current one is https://github.com/mcmonkeyprojects/SwarmUI
I legit haven't seen a better UI yet. Even comes with a built-in comfy
>>
File: forgeretarded.png (277 KB, 1826x871)
277 KB
277 KB PNG
>>103213446
kek you dingus you should at least know the first thing about your choice of UI before you go and shill it to people
cool im gonna check this out because forge pissed me off with that retarded checkpoints UI change, this shit is dated and they somehow made it worse with no way to change it.

>imagine this screenshot but with over 7,000 loras and 100 folders
>>
>>103213459
They are annoying but you can adjust the size of those oversized boxes in settings.
>>
>>103213459
Swarm allows you to fetch lora and model data directly from civitai btw
>>
>>103213363
>also the forge devs F I N A L L Y fixed the fucking memory leak
oh damn, I hope it works now
>>
>>103213393
reforge is gradio 3.4
forge is (now) gradio 4
the jump to gradio 4 breaks a lot of extensions.
also forge repositioned itself to just doing weird/experimental stuff while reforge is basically what people wanted out of a1111.
>>
I don't get it. Why not just use comfyui?
>>
>>103213472
does this shit have a tab for img2img inpainting and etc?
>>
>>103213522
not a lot of people like spaggheti shit Ui, I don't like it either, I'm stuck on ComfyUi though because it's the only software that allows me to use one gpu for the text encoder, and another gpu for the model
>>
>>103213522
For some reason in comfyui, I can't get img2img to work right with flux. Even using reference workflows, I set denoising to .2 .3 .4 .5 etc and there's practically no change in the image. Not until I set it to about 0.95 then it suddenly becomes a completely different image. I don't have this issue with forge. Also I can't get inpainting to work right with flux in comfyui (works fine with sdxl). At some point I will have to sit down and figure it out because apparently neither regional prompting (with flux) nor flux controlnets are working in forge.
>>
>>103213528
Edit image, init image.
>>
>>103213514
I see, thank for the detailled explanation anon, much appreciated.
>>
>>103213522
Because it's fucking obnoxious to use. Unraveling the spaghetti every time i want to change my settings globally is annoying compared to just using a few sliders. Installing certain nodes is a pain in the ass. It's good if you want to automatize something but there's nothing "comfy" about this interface.
>>
>>103213572
>Because it's fucking obnoxious to use. Unraveling the spaghetti every time i want to change my settings globally is annoying compared to just using a few sliders
this, 100% this
>>
File: 00037-1950259701.png (1.42 MB, 1536x1152)
1.42 MB
1.42 MB PNG
>>103213552
cooool ill check it in a bit

>>103213572
pretty much this is what killed it for me, its unnecessary to do all this when a few sliders and tabs in a good UI does the trick and saves a lot of time.
I had a good setup going for adetailer to fix hands/feet/faces and ultimate sd but even then it wasn't good enough and i spent too many hours doing node research, trying to find other people's setups to copy from, and trial and error before i gave up.


>anyway my LORA works fine with vpred 0.6 not sure what i was doing wrong before
>>
File: 00174-3956943458.jpg (562 KB, 1344x1728)
562 KB
562 KB JPG
>>103212893
I have around 25% success rate with this. Probably have to adjust prompt.
>>
>>103213600
>stocking
damn I loved that anime
https://www.youtube.com/watch?v=mixzbaXx208&list=PLdE7sv4frbx-5WOBCGxgmkev_YTf-5gZ4&index=2
>>
File: 00187-3956943457.jpg (698 KB, 1344x1728)
698 KB
698 KB JPG
>>103213607
Euler CFG++ cfg 1, 38 steps
>>
>>103213611
yeah shes been my main girl for over a decade, 99% of my posts here are her kek im amazed no one's called me out as annoying or anything
>also one of my dumb gens of her made it into a collage
>>
File: xyz_grid-0011.jpg (1.94 MB, 1675x10000)
1.94 MB
1.94 MB JPG
>>
>>103213636
>im amazed no one's called me out as annoying or anything
I've more of an issue with the artstyle really, since Stocking's design is kino. Similarly I could sperg at the microbikini anon, but their stuff is just eyecandy to me.
>>
File: 4288919299.png (2.54 MB, 1728x1344)
2.54 MB
2.54 MB PNG
>>
File: 2024-11-17 123138.png (562 KB, 473x613)
562 KB
562 KB PNG
>>103213686
>picrel
weirdly fascinating
>>
>>103213691
Yo baron, I can dig it. Finally some good fucking food.
>>
>>103213703
Yeah. Microbikini 1girl by John Carpenter.
>>
>>103213687
what could be better about the style from your opinion? i take constructive criticism, even if its very subjective i'm tweaking it constantly before i go and train again.


>also vpred 0.6 is a million times better than any other model i've been using 9 out of 10 gens so far have had almost perfect to perfect hand anatomy
>>
>>103213442
Most people will take familiarity over doing the hard work of innovating and research until there is a solid foundation someone already did upon which lazy trains or merges on top is sufficient for someone to use which then incentivizes moving or you have a gigantic leap like what happened with Flux and NoobAI isn't a pioneer by any means here nor is it work based on a gigantic leap. SD 3.5 Medium (which I assume because Large isn't comparable and is larger than SDXL) has none of that going for it yet and I don't know if it will ever actually pick up steam given the fiasco earlier this year which sucks.
I do agree though they really should choose a better foundation and actually train it with the right settings from the getgo rather than waste compute time to train out mistakes, in which case, whatever they have here, just get to a 1.0 and then they should just do a from scratch training run for v2 with that fixed and a better foundation.
>>103213686
GITS scheduler is missing.
>>
so why did illustrious die at v0.1?
>>
File: file.png (96 KB, 1047x189)
96 KB
96 KB PNG
>>103213738
It didn't die, Please read the paper attached when possible. They have a v2.0 version even. They are planning to probably do as much as they can to commercialize it, given that in the paper, they state the following.
>We plan to publicly release updated Illustrious model series sequentially as well as sustainable plans for improvements on HuggingFace with
a license.
>>
From my tests 3.5 medium does worse than XL. Generates similar pictures and adhere to the prompt poorly.
Large is better but much slower.
>>
>>103213715
Hard to describe, so I can only point you to which make it or break it for me visually. The issue I'm seeing is the gens border on uncanny valley, especially with the likes of >>103213600, or the one in previous collage. Something like >>103206040 looked much better and seems to play well to it's strengths. Might've even included it in my own collage if I cooked. Other than that they seem a bit flat or too smooth, technically they have shading like in that beach one, but it doesn't add much depth for some reason. Feels a bit like an overcook, or a too high cfg, but from what you mentioned earlier, I think it's more related to whatever you're putting into training.
>>
>>103213778
>From my tests 3.5 medium does worse than XL. Generates similar pictures and adhere to the prompt poorly.
like worse prompt adherance than SDXL? that's crazy because 3.5 has T5 as the text encoder (So does Flux)
>>
>>103213785
holy shit you just cracked the code for the main issue i've been struggling with for this the past few days, yeah its too high cfg related issues, i keep bouncing between 3.5 all the way up to 5.5 and adjusting the prompt from there but really 3.5 is the sweet spot.
its really easy to break into too smooth especially because of adetailer, i have no clue how to get that bitch to stop over smoothening the faces.
At the very least with vpred 0.6 i don't need to fuss about the hands as much.
>>
File: comparison2.jpg (689 KB, 2168x2448)
689 KB
689 KB JPG
>>103213778
That's not even remotely true from the tests I've seen. People keep forgetting how bad SDXL actually is and confuse fine tunes/merges they've seen with it and forget the base model needs a refiner too where out of the gate without that, it's even worse. SD3.5M is absolutely a much better base model.
>>
>>103213810
>SD3.5M is absolutely a much better base model.
what about the licence though? That's why the pony fag went for a SDXL finetune and not a 3.5 finetune
>>
File: xyz_grid-0014.jpg (2.51 MB, 2675x10000)
2.51 MB
2.51 MB JPG
>>
>>103213804
I've stopped training since 1.5 and was never good with it, but maybe you could try a more gentle learning rate? Looking better at lower CFGs does suggest an overcook from what I recall. Then again, noobai is a rough gem itself, so that certainly doesn't make it any easier and might be another factor that makes it harder to chisel.
>>
>>103213810
For me, it's 1.6
>>
>>103213818
It is better than the shitshow that was SD3 but commercialization is no longer free reigns like in SDXL, anything over 1 million USD and you need to deal with Stability themselves as far as licensing goes. No one really wants to touch it if they want to make money because of how much trust got burned but outside of that, community things and smaller time efforts will probably migrate over at some point but that will take some time.
>>
>>103213833
Align your steps got sovl, but I'm growing curious about KL Optimal.
>>
>>103213839
It's definitely a combination of i should've diversified the dataset and took the learning rate down 1 digit because i did bump it compared to how i normally train models.
Though it seems vpred 0.6 still has overexposure problems, now i'm seeing it as i tweak things with new gens.
Yikes.
>>
>>103213865
Best of luck in cooking. Looking forward to progress as steady she goes.
>>
>>103213833
>>103213862
ZAMN align your steps is creative as FUCK, i just turned it on after looking at your grid and i'm seeing exactly what your test shows;
SOVL >>103213862


>>103213872
thanks. gonna need it. i'm too much of an updooter with SD so im constantly bouncing to new models instead of sticking to what works and going from there.
>>
File: xyz_grid-0015.jpg (2.7 MB, 2675x10000)
2.7 MB
2.7 MB JPG
>>103213862
>KL Optimal.
Probably have to adjust settings for it, I've gotten really nice results with it earlier. Might need PAG/SAG etc.
>>
>>103213888
>im constantly bouncing to new models instead of sticking to what works and going from there
Same here, currently I keep benchmarking different pony models to see which are more coherent or creative in output. I think it's for the best really. Sticking with one thing can easily end up in stagnation. Many folks think practice makes perfects, but I think it's more about having a diversity of context to draw that experience from.
>>
File: xyz_grid-0018.jpg (2.93 MB, 3145x7500)
2.93 MB
2.93 MB JPG
>>
>>103214006
>KL optimal is similar to Align
that would explain why they're both so good
>>
File: settings.png (91 KB, 1168x738)
91 KB
91 KB PNG
>>103214006
>>103213904
>>
holy SHIT my brain is overloaded by this image
https://civitai.com/images/40178492
https://files.catbox.moe/ay3x4w.png
kek all the obvious inpainting artifacts
>>
>>103214125
for the record, when I ask anons to inpaint, THIS IS NOT WHAT I MEANT
>>
File: inpainting nightmare.png (84 KB, 202x145)
84 KB
84 KB PNG
>>103214186
>what did he mean by this?
>>
File: 130302_00001.webm (594 KB, 848x480)
594 KB
594 KB WEBM
Let's see if i can fix this.
>>
File: elf-under-mushroom.jpg (1.58 MB, 2880x1616)
1.58 MB
1.58 MB JPG
>>
>>103214573
what's she drinking?
>>
File: 1703124876849759.png (172 KB, 700x724)
172 KB
172 KB PNG
>>103214411
C'mon genmo, give us the i2v vae...
>>
>>103214594
Supposed to be a metal flask of water but was hard to get it to look right. Seems more like a big salt shaker.
>>
>>103214596
Be interesting to see what the hold up is, I mean, the preview model is pretty great for local, Vram?, coherence, who knows.
>>
File: 141217_00001.webm (588 KB, 848x480)
588 KB
588 KB WEBM
>>103214411
No, i cannot in the alotted amount of time, it's over.
>>
to the guy that suggested swarmui, how do you get adetailer? i dont even see extensions installing anywhere either.
>>
File: 214424_00001.webm (810 KB, 848x480)
810 KB
810 KB WEBM
Page 4 Intermission.
>>
>>103214914
figured it out, its called segmentation, and it combines regional prompting and adetailer into a more sophisticated system that doesn't suck ass, so you have to add what you're trying to adetail in the prompt itself along with, a prompt of what you want. very cool.
so far i'm impressed by swarmUI but its one hell of a learning curve for img2img and upscaling, can't get it to upscale anything even when i have it enabled.
>>
>>103215122
Still 30min per gen?
>>
File: 1716169207618817.jpg (921 KB, 3354x2252)
921 KB
921 KB JPG
https://huggingface.co/NexaAIDev/omnivision-968M
Has anyone tried this model?
>>
>>103215443
2873s for 97 frames @65 steps, 30 steps is probably fine, I've not done much testing, thats the default on the comfy implimentation.
>>
>>103215746
>thats the default on the comfy implimentation.
I tried Comfy but I got OOM during the vae decoding, I thought he had vae tilting in it, but it doesn't seem to be working well
>>
oh yeah swarm UI shits all over forge, holy fuck this is really good. switch if you use forge/reforge, i can't believe this didn't get talked about at all until one guy brings it up kek imagine waiting for the peanut gallery devs of forge to fix then break more shit in random development cycles when something stable has already existed. phew.
>>
>>103215761
add the save latent node coming out from the sampler, it gets put in comfyui/output/latents after genning is complete and it ooms, copy it over to comfyui/input and use the sheet in picrel is what i do, as i have the same problem.
>>
>>103215815
so you're using Comfy for creating the latents and then kijai's node for the decoding? I didn't know they were compatible
>>
>>103215836
Yes.
>>
File: 153522_00001.webm (731 KB, 848x480)
731 KB
731 KB WEBM
>>103215836
Also it's possible that you can just add the nodes of the decoder sheet to the genning sheet, but i couldn't figure out how to connect the nodes as they natively dont connect, I'm sure someone more familiar with linking nodes could figure out how to do it, skill issue for me.

And i still can't get the typewriter the other way around, reeee.
>>
>>103215887
what's your cfg at?
>>
File: file.png (257 KB, 462x680)
257 KB
257 KB PNG
Any good way to get this flat shaded 3d style like Sidonia, BOTW, berserk (but good)? Anime style is optional.
>>
File: i829.jpg (327 KB, 1024x1024)
327 KB
327 KB JPG
>>
>>103215746
>2873s for 97 frames @65 steps, 30 steps
so for those of us that haven't seen your posts until now... what GPU is that on? I'm guessing it's an xx80 at the very least. If that's a 4090 then I'm not even gonna bother with mine.
>>
>>103215967
7
>>
>>103216090
ah, sorry missed your post, multi tasking, 4060ti16gb
>>
>>103216120
>>103215815
the comfy mochi setting only works for bf16 right? I'd like it to work for Q8_0 aswell
>>
>>103211973
>If the scrapers are relying on the information besides the image
If you're training a chat GPT or llama 3 tier model then why the flying fuck what they be using information off of some random Twitter artist's page? If they want to make sure the model is good at programming they're going to use sites like stack overflow or GitHub or transcripts off of an Indian YouTube video. Nothing they do fucking matters. They're just screaming at the sky because they feel useless.
As for your other method, that just sounds retarded......
>>
File: i843.jpg (126 KB, 1408x768)
126 KB
126 KB JPG
>>
>>103214573
very impressive
>>
File: ComfyUI_135408.webm (1.66 MB, 848x480)
1.66 MB
1.66 MB WEBM
>>
File: 1731631815546085.png (801 KB, 3249x1427)
801 KB
801 KB PNG
>>103216206
can you show a screen of your workflow I wanna know if I have something similar
>>
>>103216141
I tried quickly with a gguf model and the output was black, so i guess not.
>>
File: 1708116921123418.png (102 KB, 1402x781)
102 KB
102 KB PNG
>>103216223
ComfyUi's mochi node's is impressive in terms of memory managment, I can go for bf16 + 5 seconds (24 fps + 120 frames) without overflowing my 3090 card, with kijai's node you couldn't go past 40 frames
>>
>>103216324
forreal? thats a whole video model loaded in just a 3090? what model of ti are you using by the way?
>>
>>103216335
>forreal? thats a whole video model loaded in just a 3090?
yeah, Mochi is a 10b model so it can fit onto a 3090
>what model of ti are you using by the way?
I have the regular 3090 not the 3090ti
>>
>>103216073
imagen 3?
>>
>>103215518
Looks interesting but after sending a few pics to it on HF I can say it won't replace Florence2 finetunes for me.
>>
File: mochisettings.png (151 KB, 1383x844)
151 KB
151 KB PNG
>>103216223
still mainly using kijais shit
>>
>>103216508
how are you able to go for bf16 + 121 frames on kijai's node, it's too much for my 24gb vram card
>>
File: ComfyUI_135398.webm (3.05 MB, 848x480)
3.05 MB
3.05 MB WEBM
>>103216517
i didn't do anything special, it just worked so i have no clue man.
>>
>>103216440
ye
>>
File: file.png (1.56 MB, 1280x1080)
1.56 MB
1.56 MB PNG
>>
>>103216203
thanks
>>
>>103216129
thanks for the info, I guess even with a 4080 we'd still be looking around 10 to 15mins.
>>
>>103216687
depending on the steps/frames/fp8/etc. you can get it lower.
>>
File: 1722529435153689.png (615 KB, 3840x1796)
615 KB
615 KB PNG
>>103216223
I pulled... the fuck is this new ugly Ui shit, it's even more convoluted than before, fuck, now you have to go for 2 menus before loading a workflow, the fuck is comfy doing?
>>
File: 3687459387.png (3.24 MB, 1824x1248)
3.24 MB
3.24 MB PNG
>>
File: ComfyUI_03131_.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>103216768
>he pulled?
>>
>>103216820
kek
>>
File: bogsylvania.webm (211 KB, 720x480)
211 KB
211 KB WEBM
>>103216824
>i'm sending all comfy pullers to Bogsylvania
>>
>>103216860
you went for CogVideo-1.5 i2v for that one?
>>
>>103216687
If your lucky enough, and I say lucky as i just followed the instructions without any errors to set up Kijais wrapper and cannot, as >>103216561
you'll have options to run various speedups in the mochi model loader such as cublas, flash attn and so on, reducing the gen time further if you have a card that is compatible, and iirc your 4080 is. Even for results of tests on 4090 listed in the docs, i still got 10-15% speedups on one of the settings at least.
>>
>>103216888
CogVideo X 5b space
https://huggingface.co/spaces/THUDM/CogVideoX-5B-Space
>also did this one on that space
https://files.catbox.moe/4ohpva.webm
>>
>>103216920
kek, nice vid, that splatoon girl looks so comfy on that car
>>
File: ComfyUI_135409.webm (870 KB, 848x480)
870 KB
870 KB WEBM
>>103216909
wasn't that difficult from what i remember when i set up mine. just followed the directions on this https://github.com/woct0rdho/triton-windows and everything worked in the end. i didn't care much for having to install the bloat shit but i guess that's what i get for moving back to windows last year.
>>
File: ComfyUI_temp_fubvm_00009_.png (2.81 MB, 1440x1920)
2.81 MB
2.81 MB PNG
>>
File: Result_03118_.png (1.4 MB, 1280x1024)
1.4 MB
1.4 MB PNG
>>
File: ComfyUI_temp_fubvm_00010_.png (2.4 MB, 1440x1920)
2.4 MB
2.4 MB PNG
>>
File: ComfyUI_temp_eezdp_00016_.png (3.02 MB, 1440x1920)
3.02 MB
3.02 MB PNG
>>
File: ComfyUI_temp_fubvm_00014_.png (3.35 MB, 1440x1920)
3.35 MB
3.35 MB PNG
>>
>>103217361
>>103217394
KIINOOOOOO
>>
>>103214055
TY
>>
File: 3411124022.png (2.3 MB, 1536x1536)
2.3 MB
2.3 MB PNG
>>
File: ComfyUI_21154_.png (1.91 MB, 1080x1920)
1.91 MB
1.91 MB PNG
>>
File: fluxiebebe_00820_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>
>>103217453
beautiful, a smile to die for
>>
File: fluxiebebe_00629_.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>
File: fluxiebebe_00857_.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>
File: fluxiebebe_00860_.png (2 MB, 1024x1024)
2 MB
2 MB PNG
>>
File: fluxiebebe_00874_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
File: ComfyUI_21164_.png (2.93 MB, 1080x1920)
2.93 MB
2.93 MB PNG
>>
File: ComfyUI_21278_.png (2.2 MB, 1080x1920)
2.2 MB
2.2 MB PNG
>>
File: fluxiebebe_00879_.png (1022 KB, 1024x1024)
1022 KB
1022 KB PNG
>>
File: 1069508735.png (1.99 MB, 1344x1728)
1.99 MB
1.99 MB PNG
>>
>>103217720
fancy, very nice
>>
File: 481491707.png (2.06 MB, 1248x1824)
2.06 MB
2.06 MB PNG
>>103217750
ye
>>
File: fluxiebebe_00792_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
File: fluxiebebe_00882_.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>
File: 1300690376.png (2.17 MB, 1248x1824)
2.17 MB
2.17 MB PNG
>>
>>103217767
chromatic
ass
s
s
>>
File: Gb88MzfaAAAcHpp.jpg (303 KB, 1312x2200)
303 KB
303 KB JPG
I have some questions.
I heard Forge is better for SDXL and 1.5 and Reforge is better for newer models.
Where does Pony fall on this? Pony is a take on SDXL so should I use forge? Same question for Illustrious? What's the best UI for it?
>>
File: ComfyUI_21298_.png (2.56 MB, 1080x1920)
2.56 MB
2.56 MB PNG
>>
damn i didn't realize 0.6 was this fuckin broken
https://civitai.com/models/935739/noobai-vpred-06-itercomp-fix?modelVersionId=1071682
>>
>>103217863
He doesnt show scheduler, steps, cfg used in examples
>>
>>103217904
and for some reason he has the prompt in comments. odd.
>>
File: ComfyUI_21309_.png (2.11 MB, 1080x1920)
2.11 MB
2.11 MB PNG
>>
File: edgvedavaedgea.png (112 KB, 275x177)
112 KB
112 KB PNG
why the flippin heck is this happening?
>denoise 0.35
>samples 10
>CFG same as low res
>>
Whats with all y'all no comment slop postings, is /sdg/ broken?
>>
>>103217958
I dont understand these overly complicated merges instead using train difference
>>
>>103217984
too low denoise and too little steps, if I'm getting the right idea
>>103217990
I don't get what you mean
>>
File: noob_hr_00004_.png (3.43 MB, 2432x1664)
3.43 MB
3.43 MB PNG
>>
>>103218026
i lowered steps and raised denoise, and then lowered denoise while keeping the low steps and still no go, weird.

man working in pony reminds me how SHIT it is, needing loras for already popular characters which work half the time accuracy wise is infuriating.
noob needs its realism model stat.
>>
File: GbNb6uSaUAAusLK.jpg (156 KB, 832x1216)
156 KB
156 KB JPG
If I'm a non techy person but would like to one day make my own comics or anime with this stuff what's the kind of informatic stuff I should learn?
>>
>>103218195
gonna catbox my setup here maybe a swarmui user can step in and help
https://files.catbox.moe/tfuvbi.png
hindsight with just a few weeks of noob/illustrious really made me realize how utterly shit pony was for character accuracy kek im sure even in this screenshot were the previews are tiny you can see how its failing to even pencil in strips of stocking anarchy's hair to its correct color.
>>
>>103218195
My high melanin content anon, that's because you need to increase both steps AND denoise, you've too high a ratio difference between the two. You're basically doing the equivalent of putting a shit ton of prompts with a schizo high CFG, screaming at the model to stick closely to the original input within it's already tight limits, and it's basically having a mental breakdown being unable to meet your demands.
>>
>>103218221
>0.4 at 10 steps
>0.3 at 5 steps
>shocked_pikachu.png
oops i forgor haha lol thank you
>>
File: ldg.png (425 KB, 517x463)
425 KB
425 KB PNG
>>103218214
None really. I come from a hobbyist artsy background and no coding is required. You just gotta have a basic understanding of how it works, then find a UI that suits your workflow, and a base model that suits your needs. Similarly to how you'd learn to use software, just extra steps to host it on your own GPU.

For reference, you want at least an 8VRAM nvidia graphics card to meet minimum spec requirements.t. UI wise I'd advise Metastable or Forge, and for model, based on your image, you'd probably want a "Pony" finetune of your choice, which you can find on CivitAI. Then off to experiment with prompts, generation settings, img2img/inpainting.

Simple as.
>>
>>103218278
Personally I'd advise going with denoise no lower than 0.35/0.38
>>
File: ComfyUI_135424.webm (3.13 MB, 848x480)
3.13 MB
3.13 MB WEBM
>>
>>103218398
40k emperor kinda vibe
>>
File: ComfyUI_135425.webm (1.89 MB, 848x480)
1.89 MB
1.89 MB WEBM
mochi is p good with gore even tho it looks comical in some gens. tried recreating the dog scene in The Thing and it wasn't bad but not close either lol.

>>103218417
all the prompts i've tried today have been 40k based. really wish they'd release the hd model or i2v already.
>>
>>103218442
>anon_shagging_ benis_into_warp.webm
>>
>>103216820
certified ldg classic
>>
>>103216768
>ComfyUI
neverever
>>
>>103216768
what even is a mochi?
>>
>>103218495
>what even is a mochi?
txt2kino
>>
>>103218495
>what even is a mochi?
a blessing from the sky
https://xcancel.com/genmoai/status/1848762405779574990#m
>>
>>103218498
>>103218506
what do you need to run that?
Is a RX7800XT sufficient?
>>
>>103218524
>AMD
oh man RIP
>>
>>103218524
kek.. dont buy inferior cards
>>
>>103218524
>16gb of vram
that could do it if you go for Q8_0
https://reddit.com/r/StableDiffusion/comments/1gb07vj/how_to_run_mochi_1_on_a_single_24gb_vram_card/
>>
>>103218524
>RX
They die so young..
>>
File: fluxiebebe_00903_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
File: 1719874023690563.png (189 KB, 750x1000)
189 KB
189 KB PNG
>>103218524
>amd
>>
>>103218524
i bought a 4090 and i never looked back
>>
File: fluxiebebe_01038_.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>
>>103217823
Pony is obsolete because of illustrious and noob
>>
File: tmpyy3d23h5.png (1.52 MB, 896x1152)
1.52 MB
1.52 MB PNG
>>
File: fluxiebebe_01043_.png (1019 KB, 1024x1024)
1019 KB
1019 KB PNG
>>
>>103218577
Miss me with that unstable shit and come back in a month or two.
>>
File: 1731355635538630.png (122 KB, 584x336)
122 KB
122 KB PNG
>>
>>103218610
We're at 274 and I'm going to sleep soon. Ask the other collagebaker.
>>
File: ComfyUI_135426.webm (1.83 MB, 848x480)
1.83 MB
1.83 MB WEBM
>>103218566
i don't blame people for not wanting to spend 2k+ on one. i got mine for $1400 early last year and i feel lucky now after seeing what they're going for.
curious what the msrp is going to be for the 5090.
>>
I wonder how scuffed 5090 mobiles will be.
>>
>>103218648
they're about the same price now as they were when i got mine about 2 years ago now
>>
>>103217984
>275x177
>>
good kino soul in this bread
>>
>>103218755
based retard
>>
File: ComfyUI_135432.webm (1.69 MB, 848x480)
1.69 MB
1.69 MB WEBM
>>103218723
what'd you buy yours at? i got mine off the msi store and looking now they don't even have the suprims anymore besides the water cooled one. if they did i'm assuming it'd be around $1900 tho they have the gaming trio one at that price so fuck if i know.
>>
>>103218577
Even AutismMix? What is the best illustrious model for cute ecchi then?
>>
File: 0.jpg (300 KB, 1432x848)
300 KB
300 KB JPG
>>
File: tmpry0ww3mx.png (1.1 MB, 1280x896)
1.1 MB
1.1 MB PNG
>>
>>103218803
Calm down, bro.
>>
File: 0.jpg (287 KB, 1408x832)
287 KB
287 KB JPG
>>
>>103212933

If it can't do western/cartoon art, I don't give a shit.
>>
models able to do hands when
>>
>>103219359
Gotta make loras
>>
>>103218899
NoobAI v0.6
>>103219363
Flux
>>
>>103217139
Where did you get this video of me?
>>
>>103218648
the faces at the end are so cool i love it
>>
>>103219459
>Flux
Fuck no
And it can't even do lewd stuff
>>
>>103219685
>it can't even do lewd stuff
The price we pay for perfect hands :(
>>
File: fluxiebebe_01045_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>
File: ComfyUI_21378_.png (2.25 MB, 1080x1920)
2.25 MB
2.25 MB PNG
>>
File: i821.jpg (369 KB, 1024x1024)
369 KB
369 KB JPG
>>
File: fluxiebebe_01039_.png (1020 KB, 1024x1024)
1020 KB
1020 KB PNG
>>
File: fluxiebebe_01047_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>
>>103219935
nice colors
>>
File: ComfyUI_22116_.png (2.29 MB, 1080x1920)
2.29 MB
2.29 MB PNG
>>
File: ComfyUI_22055_.png (1.87 MB, 1080x1920)
1.87 MB
1.87 MB PNG
>>
>>103219685
It can do lewd stuff with the appropriate loras.
>>
File: fluxiebebe_01054_.png (822 KB, 1024x1024)
822 KB
822 KB PNG
>>
File: fluxiebebe_01061_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>
File: fluxiebebe_01074_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>
File: fluxiebebe_01086_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>
File: fluxiebebe_01084_.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>
File: fluxiebebe_01156_.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>
File: fluxiebebe_01185_.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
>>
>>103220140
None that look good even
>>
File: fluxiebebe_01193_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
>>103220213
You have to train your own then.
>>
File: fluxiebebe_01204_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
File: bComfyUI_130424_.jpg (691 KB, 1024x1024)
691 KB
691 KB JPG
>>103220278
reminds me of album cover art
>>
>>103220313
nice
>>
File: fluxiebebe_01262_.png (2.09 MB, 1024x1024)
2.09 MB
2.09 MB PNG
>>
File: fluxiebebe_01270_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>
>>103220402
nice, makes me want to try proompting for geometric shit again
>>
>mindless slop hour
>>
>>103220431
comes here, complains about coming here
>>
so who's making the collage
>>
>>103212072
funny how the "correct vae" looks like deep fried bullshit, and the no vae is less jarring to the eyes
>>
>>103220495
just made one
>>
Only page 4
>>
>>103220495
just got up from bed because i can't sleep
>>
>>103220514
>>103220514
>>103220514
new
>>103220514
>>103220514
>>103220514
>>
File: 2024-11-17_00001_.png (1.39 MB, 720x1280)
1.39 MB
1.39 MB PNG
>>103211915
I'M BACK!!!
>my b& is proof that Flux makes highly valid tits sometimes


(accidental xpost)



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.