[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: the longest dick general.jpg (2.51 MB, 1338x2000)
2.51 MB
2.51 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102788863

Turkish Breeding Grounds Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://replicate.com/black-forest-labs/flux-1.1-pro
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
>>102804738
i dont get that image
>>
File: 315318856.png (1.59 MB, 1344x768)
1.59 MB
1.59 MB PNG
>>
File: 3429698342.png (1.22 MB, 896x1152)
1.22 MB
1.22 MB PNG
>>
Anyone running on peasant hardware?
What's your hardware + software combination ?
>>
>>102804738
0/10
>>
>>102804738
lmao at image, srsly it's like the scene in Batman where the accounting dweeb is blackmailing based Negro R&D for Batman bloke.
>He stomps people for a job
>And your plan is to blackmail him

Fuckan will be furious!
>>
Blessed thread of frenship
>>
requesting toned Azula or Korra wearing transparent lingerie, stockings, choker, and a cropped leather jacket, standing, erotic pose, arcade, arcade cabinets, neon lights, dynamic lighting
>>
File: ponyxl-943742810.jpg (916 KB, 2048x2048)
916 KB
916 KB JPG
>>102805430
>gens that will make your eyes bleed
>>
>>102805692
beautiful
>>
File: ponyxl-943742810 as well.jpg (1017 KB, 2048x2048)
1017 KB
1017 KB JPG
>>
>>102805692
>>102805718
some of these are really good
>>
>>102805430
the effects of unregulated 1girl postage on the /ldg/
>>
File: 1728696748081013.jpg (340 KB, 1024x1024)
340 KB
340 KB JPG
>>102805756
seems fine
>>
>>102805743
I think they would benefit from iterating upon.
If the requester had their own hardware then they could have chewed through a number of them by now.
>>
Sunday Slowday
>>
>>102806378
post Korra
>>
post cute butts
>>
Finally, all the quants are here, fuck you huggingface you're bitch to work with :(
https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main
>>
>>102807909
thank you for your service
>>
https://stability.ai/news/license-update
>SD3 Medium is still a work in progress. We aim to release a much improved version in the coming weeks.
This sentense has reached its 100 days anniversary, yayyy
>>
>>102807909
i have 12gb vram
what quant should i use?
>>
>>102808200
try with Q5_K_M, if you still have room to spare maybe you can go for Q6_K
>>
File: 00837-342464345.jpg (391 KB, 1728x1296)
391 KB
391 KB JPG
>>102808200
Q6_K
>>
>>102808191
They literally could've trained a new model from scratch in that time
>>
>>102805692
>>102805718
this buth with mako instead pls
>>
https://www.reddit.com/r/StableDiffusion/comments/1g2luvs/comment/lrpaeyq/?utm_source=share&utm_medium=web2x&context=3
>I fully finetuned the de-destillend flux-dev on 534 pictures (1024p) with the new block swap option in Kohya two days ago on 48 GB VRAM (80% ram consumption)
>I got a very accurate detailed creative full-fine-tune. Even the skin came out very good (I always had plastic skin when I trained a LoRA on the regular dev).
So if you have 2x3090 cards you can do a full finetune of Flux now?
>>
>>102808419
>Sorry I can't share the checkpoint - it's all based on pics of my wife
Kek wtf.
>>
>>102808441
He'll share if after the divorce, no doubt about it kek
>>
File: file.png (203 KB, 960x449)
203 KB
203 KB PNG
>>102808191
That's not the most ridiculous thing this company has said though, this one takes the take (2022)
>>
>>102808502
Lol
>>
File: 0.jpg (163 KB, 884x736)
163 KB
163 KB JPG
>>
>>102808502
>This image generator understands 2 billion different concepts
fair to say they trained SD2.1 with more than 2 billions pictures? Or maybe one picture has an average of 10 different concepts and he could make it work with 200 millions of pictures or something?
>>
>>102808419
It's still based on the maximum VRAM per card, so you need a 40 GB GPU. No one seems to care to try doing multi-GPU sharded training. We can only hope that either a) someone figures out how to squeeze everything on a 5090's 32 GB of VRAM or that the Titan AI card is 40+ GB. Or it stays limited as a Cloud training thing.
>>
>>102809021
>No one seems to care to try doing multi-GPU sharded training.
I remember pony say that he used 8 GPUS to train his model, so he's doing shared training no?
>>
>>102809039
No, he has 40 GB GPUs. His training computer is like $40k.
>>
>>102809051
yeah but he has multiple GPUs though, so he's doing shared training
>>
>>102809061
I mean sharded, as in your split the weights so that a 40 GB model fits on two 24 GB GPUs. Rather than putting the full model on each GPU and then syncing the training.
>>
>>102809079
ohhh ok I get it, I didn't know you still couldn't split the weights onto multiple GPUs, the llm fags have mastered this because they're working with giant models so they didn't have much choice kek
>>
>>102809079
>>102809061
https://fairscale.readthedocs.io/en/latest/deep_dive/offload.html
In practice you would have a model set up like this. I hope one day someone interested in a public open source model fully integrates Fairscale.
>>
>>102808419
>48 GB VRAM (80% ram consumption)
What does he mean by that?
>>
man
suddenly my auto1111 crashes every time i try to change the model
this didnt happen before
wtf
my gpu can handle the model it's just as if it wasnt unloading the model before
>>
>>102809200
Switch to forge or comfyui retard.
A1111 is abandoned.
>>
>>102809234
this
>>
>>102809250
that
>>
File: 40112.png (2.65 MB, 1176x1472)
2.65 MB
2.65 MB PNG
humongous.
bara.
tiddies.
>>
File: 0.jpg (228 KB, 1024x1024)
228 KB
228 KB JPG
>>
kinda new to all of this
what do you guys recommend for a realistic model?
and anime which one would you recommend?
i run a 12gb vram setup
>>
File: file.png (153 KB, 1718x936)
153 KB
153 KB PNG
>use comfyui, they said
WHAT THE FUCK AM I LOOKING AT, NIGGA?!
>>
>>102807909
is 3KS even doable on 8gb of vram? thats like 10 gigs of total ram usage isnt it?
>>
>>102810073
try it anon, I'm sure it'll fit on a 8gb vram card
>>
>>102810071
cop a catbox and press queue, it's that easy.
>>
>>102810071
WELCOME TO THE COMFY FIELDS MOTHER FUCKER

>>102810118
well, alright, here's hoping things've sped up since i last tried.
>>
so after all this time, unet > dit when it comes to being best for training locally? same with inference speed
>>
File: wallace meme.png (393 KB, 509x413)
393 KB
393 KB PNG
>>102810137
>22s/it

nope
>>
File: file.png (903 KB, 3328x1751)
903 KB
903 KB PNG
>>102810071
>WHAT THE FUCK AM I LOOKING AT, NIGGA?!
you're looking at the simpliest ComfyUi setting ever, you have no idea how much worse it can be... :(
>>
>>102810167
>so after all this time, unet > dit when it comes to being best for training locally?
not at all, training with un-distill made everything easier now, it's still a pain because it's a giant model (12b) and we used to train small shits back then (SDXL -> 2.7b, SD 1.5 -> 0.75b), it has nothing to do with the architecture
>>
3090 or 4007 ti super?
>>
>>102810219
>3090 or 4007 ti super?
3090, you never pass on a 24gb card
>>
Requesting the Joker getting his crust busted.
>>
>>102810169
>22s/it
why is it so slow? is it because it's overflowing your vram storage?
>>
>>102810254
because gtx 1080 kek
>That said im ((pulling)) stable diffusion forge to see if it's at least a few seconds quicker
>>
my 1girls will never be real lol. they're so cute and the way they look at me breaks my heart, but my hands will never touch them and my soul will never commune with their soul, which doesn't exist, lol

need someone talk me off the LeDGe here, what's the point of drawing another breath, cause I'm not feeling so stable right now I'm just thinking about the diffusion of my guts across the pavement lol

come to me o queen of my soul, I will barrel through this earth's crust at 140mph to meet you, and at the point of inversion my meteoric fall ceases and I ascend a phoenix new-born, together with you, my sweet love, my one girl forever
>>
>>102810293
meds
>>
>>102810293
>poetic skizophrenia
my favorite ones
>>
>>102810190
>how much worse it can be... :(
That workflow isn't even that bad kek
>>
is there any guide on gguf flux?
>>
>>102810293
fresh pasta yum
>>
>>102810341
youtube still exists as far as I'm aware of:
https://www.youtube.com/watch?v=nncY3dJLV78
>>
>all the good genners left
>>
>mat1 and mat2 shapes cannot be multiplied (4032x64 and 256x768)

>resolution is set to base
what did forge mean by this?
>>
>>102810482
what model are you trying to run anon?
>>
>>102810482
the de-distill quants dont work on forge
>>
File: aezdgbveadzgbeadghbaed.png (146 KB, 1874x920)
146 KB
146 KB PNG
>>102810501
>>102810519
well that was a monumental waste of time, back to sdxl kek
>>
>>102810519
>>102810522
it works on ComfyUi, the fuck is Forge doing?
>>
>>102810538
idk, whats weird is that the normal dev quants do work
>>
>>102810551
>the normal dev quants do work
those are old quants city made 2 months ago, he probably changed some stuff on his repo and Forge doesn't recognize the new quants anymore
>>
>>102810538
sticking his thumb up his ass and sitting on it
reforge is weird too, it doesn't support flux *at all*.
>>
to be fair, comfy doesnt support loras with nf4 which forge does support
>>
>>102810598
>nf4
that shit is worse than Q4_0 which has the same size though
>>
File: file.jpg (422 KB, 3290x1134)
422 KB
422 KB JPG
Don't sleep on schedulers they can make a huge difference
>>
>>102810614
im tired of this discussion, its not strictly worse and its a fact that its faster
>>
File: file.png (84 KB, 1244x202)
84 KB
84 KB PNG
>>102810641
>its a fact that its faster
to me they're the same speed
>>
>>102810658
use v1
i only got worse results out of v2 and its slower kek
>>
>>102810667
damn, if it's true then the Forge guy really fucked everything up
>>
ok comfy is not that bad but
is there anything like the tag auto completion of auto1111?
>>
what the fuck is a 'empty latent image'?
>>
>>102810852
it's an image with complete noise, and the model has the denoise that to get the picture you want at the end
>>
>>102810852
what ur going to fill with ai cum magic to birth ur 1girl
>>
>>102810868
poetic as fuck
>>
>>102810709
search the comfyui manager, you will find it
>>
>>102810879
what the fuck is a comfyui manager?
>>
>>102810889
https://letmegooglethat.com/?q=comfyui+manager+github
>>
>>102810889
https://gprivate.com/6dq08
>>
Newfaggotty levels off the charts ITT
>>
>>
>>102807909
how to use it?
>>
>>102810965
remove the distilled guidance scale and go for CFG > 1
>>
File: ComfyUI_00044_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>102810901
thank you anon, have a waifu
>>
File: file.png (28 KB, 592x134)
28 KB
28 KB PNG
With the Flux release, it's fair to say that we don't need much more than 12b to achieve almost perfect pictures
>>
how do I get rid of that plasticy skin texture that flux has?
>>
>>102811455
Go for this lora
https://civitai.com/models/652699/amateur-photography-flux-dev
>>
>>102811349
>12b parameters
>has plasticy skin by default >>102811455

what did they mean by this?
>>
>>102811494
>what did they mean by this?
that a great architecture cannot end up well if we only train with slop pictures, only the chinks can save us, those mf don't give a fuck about copyright and shit, they train their models with the best pictures/videos and it shows
https://reddit.com/r/aivideo/comments/1g2lxq5/dreamina_20_a_new_ai_gamechanger_is_coming/
>>
>>102809315
so biggo
>>
>>102811349
It's very likely 12B is underutilized and 6B or less with better training could achieve the same or better results.
>>
>>102811788
of course, 12b is the current max I would say, like we now now that 12b is enough to get great results, now the next challenge is to get something as good but with a less bigger shit
>>
>>102811494
Flux looks like that because of the aesthetic fine-tuning so normies can prompt "hot woman" and it works
>>
File: 1728255399339169m.jpg (59 KB, 1024x683)
59 KB
59 KB JPG
Is there a local model to generate music yet? With vocals preferred, but without is ok.
>>
File: file.jpg (3.65 MB, 1539x6800)
3.65 MB
3.65 MB JPG
>A photo of a woman dressed as Hatsune Miku picking flowers, A wooden sign on the right reads “Trojan blur”.
Who wins?
>>
>>102807909
wait, de-destilled? the fuck did i miss?
>>
>>102811897
>the fuck did i miss?
someone removed the distilled guidance out of flux dev, now it can support CFG, negative prompt, and most importantly, that means we can finetune flux (because distilled models collapse during a serious finetuning)
>>
>>102811911
holy shit
>>
does comfy have alternate UI options, like replacing the stupid list view for lora searching? I have hundreds possibly 1k loras and having to scroll/manually search it is pure aids
>>
>>102812209
>having to scroll/manually search it is pure aids
you can type on your keyboard and do a searsh if you know the name of the lora you're looking for
>>
11 hour thread, 22 images. that's 2 per hour
>>
>>102812298
>2 per hour
good
>>
>>102812298
come back next thread
>>
what's the status on flux controlnet, still suck?
>>
>>102811882
Beta scheduler is so good that not even LCM's shittyness can fuck with it
>>
>>102805080
an omen of what's to come
>>
>updated comfyui
>moving workflow with spacebar doesn't work anymore
REEEEEEEEEEEEEEEEEEE
>>
File: file.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>>
File: file.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
Pan mode needs autistical triggers now
This UI designer must be a pajeet
>>
there's something so grotesque about gooning to "milfs" (old obese women), I regret all the gens I did today
>>
Still no Loras made out of flux-de-distill?
>>
to anyone with fine aesthetic sensibilities... I'm so sorry
>>
>>
File: file.png (1.97 MB, 1024x1024)
1.97 MB
1.97 MB PNG
That's a weird looking cheesboard, but I'm pretty sure Migu is loosing kek
>>
>>
>>
>>
what did /ldg/ do to deserve this hellish fate?
>>
>>102813210
you didn't post enough images
>>
>>102813210
im in heaven
>>
ok now I'm done
>>
>>102813216
i'm all outta ideas :'(
>>
>>102813210
>hellish fate
Elaborate. I like it it's cozy, you learn a lot of cool news in it, what's more to ask?
>>
>>102813067
No it's not, it's noble. Read what ben franklin wrote about fucking older women. You're a goddamn patriot
>>
>>102813242
on the contrary,
https://www.youtube.com/watch?v=9_FjSCX2lz8
>>
>>102813295
Sounds pedophilic
>>
File: file.png (1.85 MB, 1024x1024)
1.85 MB
1.85 MB PNG
>>
>>102813335
everything anyone ever wrote about beautiful girls prior to the 1990s sounds pedophilic now
>>
>>102813353
Touche
>>
>>102808419
>534 pictures
>full finetune
>>
>>102813605
you don't always have to make a full finetune with train millions of pictures, sometimes you just want to make a lora with a few pictures but with better quality, and to do that you do a full finetune
>>
File: file.png (2.3 MB, 1024x1024)
2.3 MB
2.3 MB PNG
>>
>>102813803
>you don't always have to make a full finetune with train millions of pictures, sometimes you just want to make a lora with a few pictures but with better quality, and to do that you do a full finetune
I've read this multiple times and still cannot comprehend what's trying to be said.
>>
>>102813944
let's say you want to add one concept to the model, sure you can use a Lora, but a Lora will never be as accurate as a full finetune, so sometimes, if you want to add just one concept and have the best quality, you go for the finetune instead of a Lora
>>
File: file.png (57 KB, 815x280)
57 KB
57 KB PNG
So you're telling me we all those samplers and schedulers because we can't resolve this tiny equation? Goddam mathematicians are fucking noobs!
>>
File: mclaren.jpg (1.08 MB, 2296x2296)
1.08 MB
1.08 MB JPG
>>
>>102814007
and the dumbest of all samplers, euler, is still king
>>
>>102814552
>and the dumbest of all samplers, euler, is still king
I disagree, for me it's the second best behind deis, but yeah, old solvers are somehow better than all those fancy ones we got with time
>>
>>
>>102814007
What do D, x and t mean?
>>
What do you call the expression a sauna gives you that isn't necessarily "exhaustion" but looks almost like mild ahego? Like "relieved" but it's particular to that warm relief.

>>102813242
>what ben franklin wrote about fucking older women
quick run down?
>>
last 1girl of the night
>>
>>102807909
My 970 was able to (slowly) run Q4 and Q6 flux models on 1024x1024 latents. Since they're becoming more popular I'm gonna have to set up the quantize tool and see if I can produce them myself when necessary. This is likely the final milestone before I have to finally upgrade my GPU.
>>
>>102815040
I lied. 512 latents with flux, I do 1024 with XL.
>>
Is an nvidia gpu even worth it if all i gain to benefit over amd is just porn and lora training?
>>
>>102814896
>D, x and t mean?
D is the coefficient of diffusitivity
x is the distance
T is the temperature

that's the diffusion equation for thermodynamics, but it can be applied on a lot of shit, and it's applied for the diffusion models, if we managed to solve that mathematically, we wouldn't need solvers like euler and shit and the generation speed would be way faster
>>
File: ComfyUI_01849_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>102815110
yes a 4090 is worth every cent.
>>
>>102815110
Do you want porn and lora training?
Bc with superior hardware, that’s what you get
>>
>>102814896
>anon is gonna solve it
>>
>>102813926
She should be wearing a bowler hat
>>
>>102808419
assuming this is trained on 'vintage anime', it looks nothing like it. absolute fluxslop face. flux loras are very powerful, but the generic toonface is proving very difficult to remove
>>
File: file.png (2.87 MB, 1590x2235)
2.87 MB
2.87 MB PNG
https://huggingface.co/MeissonFlow/Meissonic/
babe wake up, new base image model got released
>>
File: file.png (1.51 MB, 1024x768)
1.51 MB
1.51 MB PNG
>>102815485
>flux loras are very powerful, but the generic toonface is proving very difficult to remove
with good loras it's definitely possible
https://civitai.com/models/7227?modelVersionId=782696
>>
>>102815495
doesn't look good, I think we got spoiled with Flux, our standards are too high now
>>
>>102815505
>"Flux is so good nobody will dare anymore to release anything that isn't better than it!"
People were wrong, so give me Kolors 2 and AuraFlow 2 even if they're not as good as Flux, just improve your ghost's time.
>>
>>102815495
Note they did not release anything that could be used locally, just a diffusers version, someone has to find a way to put those things on a single file.
>>
>>102815551
>give me Kolors 2 and AuraFlow 2 even if they're not as good as Flux
what's the point of wanting a model worse than Flux? if it's not better than Flux I'm not leaving
>>
>>102815559
https://github.com/viiika/Meissonic/blob/main/inference.py
This looks usable to me, right?
>>
>>102815495
it seems to be a 1b model that uses clip instead of t5 or an llm, not sure what vae they are using though i'm illiterate
>>
>>102815580
>not sure what vae they are using though i'm illiterate
I don't think they're using that, they went for tokens instead of a diffusion algorithm
>>
>>102815566
Because they're already better than flux in areas like creativity and compositions. They don't need you to tell them exactly where you want everything to be.
>>
>>102815574
Oh, I thought the huggingface page was all they released.
>>
>>102815678
>They don't need you to tell them exactly where you want everything to be.
Like it or not anon, but the tyrany of majority is relevant in a field where you have to sell your products, if no one care about you, good luck going forward and get some investor's money
>>
>>102815495
is this just about methodology? The paper reads like it really wants to be an upscaler. SD1.5 used 4.8 billion images for training. MeissonFlow used 210 million.
>>
>>102815761
>SD1.5 used 4.8 billion images for training.
that much? holy fuck dude...
>>
>>102815761
didn't pixart sigma only use 25 million images?
>>
>>102815703
Look at Stockfish, it's the best chess engine in the world, bar none (it was funny when chess.com released their Torch and claimed it was clearly the "second best"), and it's completely open source and nobody wins a cent with it.
We just need the Stockfish of AI image generators, and it'll be able to do anything Dalle 3 can do, with any style Midjourney can do and with any composition and prompt understanding Flux can do, and nobody will make a cent from it.
>>
File: file.png (3.12 MB, 1600x1269)
3.12 MB
3.12 MB PNG
>>102815802
>We just need the Stockfish of AI image generators, and it'll be able to do anything Dalle 3 can do, with any style Midjourney can do and with any composition and prompt understanding Flux can do, and nobody will make a cent from it.
Me too I have a dream anon
>>
File: training.png (111 KB, 758x459)
111 KB
111 KB PNG
>>102815774
I was being lazy. screenshot from the paper. In my mind if you want something that runs on consumer hardware your training dataset needs to be much larger.

>>102815774
33 million, although higher res than I suspect SD1.5 was. All of them were exceeding 1K resolution.

It just seems like they need more the examples are limited and the method seems efficient, but limited.
>>
>>102815821
4.8 billions pictures on SD1.5 is crazy...
>>
>>102815840
it explains a lot of things. It is the most creative model with the other side of that coin being it was the worst model for body horrors and other blended AI nightmare fuel.
>>
>>102815819
The thing with Stockfish is that everyone focused their efforts in a single project, imagine if instead of everyone and their aunt doing their own finetune of SD1.5 they all joined forces to make a super model that would have been better than SDXL because of finer detail.
But, noooo, everyone did their own thing and improvements could never pileup because of catastrophic forgetting that makes just merging 2 loras into a model cause everything to fuck up.
If we want this people need to change what they're doing, instead of trying to make the next dreamshaper, people gather and make black forest labs look like 5 years old children.
>>
>>102807909
>Q5_K_M
>Q4_K_M
Late but I thought the ComfyUI plugin didn't have support for that yet. Has that changed or was it just quantized using some LLM quantization script that includes those quantization sizes?
>>
>>102815925
>Late but I thought the ComfyUI plugin didn't have support for that yet.
it does now
>>
>>102811882
i'll stick with euler/simple
>>
File: Flux_00802_.png (1.13 MB, 1344x768)
1.13 MB
1.13 MB PNG
Thank god this technology didn't exist any sooner than it did. If this was the type of shit you could generate back in the 2000's, you could catfish anybody.
>>
>>102817301
the fucked up text is a pretty clear giveaway, but if you put in the time to inpaint/photoshop them then yeah
>>
>>102817301
damn
>>
>>102815761
It's not about total image count. It's about the diversity and quality of the images used.
>>
>>102815495
As always the dumb niggers never give training code.
>>
File: forge.png (167 KB, 1820x931)
167 KB
167 KB PNG
retard here, what am i doing wrong?
>>
>>102818197
you could click the flux button above so you'll get working settings right away
>>
>>102815495
>https://huggingface.co/MeissonFlow/Meissonic/
Looks literally like a crappy SDXL or very good SD1.5.
The background details are crappy (the spaceship pannels are all melted), the colors look burned, the anatomy is messed up (the 3-handed pikachu thing) and the extremely high contrast between lights and shadows make the skin plastic.

Before SDXL this could have been a hit, but right now? It looks bad.
>>
>>102818282
What's interesting is it's using a VQVAE.
>>
>>102818217
gave me the same error, but time taken was 5 seconds instead of 5 minutes
>>
>>102815495
>Jinbin Bai, Tian Ye, Wei Chow, Enxin Song, Qing-Guo Chen, Xiangtai Li, Zhen Dong, Lei Zhu, Shuicheng Yan
assuming garbage until proven otherwise
>>
>>102818596
>Wei Chow
im hungry
>>
File: meiss.jpg (184 KB, 1024x1024)
184 KB
184 KB JPG
>>102815495
meh
>>
File: file.png (22 KB, 911x104)
22 KB
22 KB PNG
>>
File: 00066-4060555982.jpg (473 KB, 1536x1536)
473 KB
473 KB JPG
>>102818936
0 expectations
>>
I simply do not care for furfag models
>>
I have been out of the loop for a while, can someone quickly tell me if I can just make Flux model work on the Automatic webUI as if it was SD?
>>
File: 00071-4060555980.jpg (454 KB, 1536x1536)
454 KB
454 KB JPG
>>
>>102818988
no
>>
The Death of LDG
>>
what happened?
>>
>>102820234
unless people are spamming they don't feel personally validated, their enjoyment in life is based on how many updoots per minute something receives
>>
>>102813926
max cfg deep fried
still can't spell
>>
>>102818197
>what am i doing wrong?
you're using forge
>>
>>102820234
probably got bored and are taking a break until the next new toy comes out
>>
>>102820765
Yeh or got mass reported and banned for posting here, happened several times. That being said there's nothing wrong with slow threads.
>>
My take on a "Better Illustrious":
https://civitai.com/models/852957/zootlustrious-aesthetic-illustrious-xl-refinement
>>
>>102821105
have to give it a try.
>>
>>102820853
>got mass reported and banned for posting here
Big if true
>>
>>102821476
I got 5 off-topic bans for simply posting gens.
>>
>>102815860
It also explains the architecture is more important than the amount of images. SD1.5 is simultaneously more creative and rigid than other models.
>>
>>102821831
I bet you could get a lot more from less if instead of procuring billions of images you instead had more captions per image especially if you can focus on different details. I bet a diverse dataset of 5 million images but with four sets of captions each of varying lengths, focus, direction, etc, would help the model generalize better than throwing a shit ton of images at the problem.
>>
>>102821650
Mods also enforce the rules selectively. One anon can post a blatant nsfw, tricking other anons think that it's okay to do it as well. The other anons all get banned and images deleted while the anon that posted the nsfw still has his image up.
>>
>>102821650
You take any screenshots? Sounds crazy
>>
>>102822139
I got banned for off-topic once when I was garloid posting with a Garloid Lora I made. I assume it was the butthurt 1girl underaged bans getting mad.
>>
>>102822139
Fuck off debo, everybody knows that you’re actively reporting posts from here to get them banned
>>
>>102817301
nice
>>
>>102822322
im gonna need you to share that garloid lora
>>
>>102821105
The example images do not look very good.
>>
>>102822139
Yeah, I did take some screenshots

>>102822322
I think it's just mass shit flinging and hoping something sticks
>>
>>102804738
what is the top left? is that a real picture? are they being awarded for an ai art contest?
>>
https://huggingface.co/nyanko7/flux-dev-de-distill/discussions/3#670d385ae7ae64a7a30216b9
>Good news: I can train many subjects at the same time without bleeding between each other 11 people in one LORA
I'm still waiting for a Lora or a finetune based of de-distill personally, a lot of people are praising it but there's no one sharing anything yet
>>
File: file.png (3.02 MB, 2227x1492)
3.02 MB
3.02 MB PNG
>>102820479
>still can't spell
the goat "beta scheduler" is here to save the day
>>
File: jesos.jpg (1.37 MB, 2816x2816)
1.37 MB
1.37 MB JPG
>>102818197
your not using comfyui and you didnt do a crash course on all the diagram nonsense
>>
        train_dataset_group.new_cache_latents(ae, accelerator.is_main_process)

Do these dumb fucks not test their code? This code does not work because it's supposed to pass the accelerator object not the fucking is_main_process bool. So some dumb fucker coder is writing code they didn't even fucking test.
>>
>>102824226
to be fair, ComfyUi wouldn't be so popular if the alternatives wouldn't be so fucking ass, I hate this spaggheti shit but at least my models are working in there
>>
>>102824367
what repository?
>>
>>102824428
kohya sd-scripts
>>
>>102824367
feel free to fix that code and make a PR, those guys aren't gods they can make mistake
>>
>>102824470
it's not about "being gods" it's the fucking programmer who made an update and didn't even test it
don't ever be that guy, if you make changes to your code, at the very least do the courtesy and make sure it compiles / runs, thanks!
>>
>>102824487
>don't ever be that guy
I won't lol, I spent probably 90% of my time testing, and 10% of the remaining time writing code
>>
https://youtu.be/puEgugluadk?t=77
>Our video model is only trained on public domain
that's precisely why China will win the AI war, they don't give a fuck about people's feelings and train their model with the best videos the world have to offer, as it should.
>>
>>102822990
I mean if you're the sort of person who actually prefers a sort of greyish washed out hand-drawn kind of style it's not really gonna be the thing for you I guess, you might as well use the original model then.
>>
File: file.png (762 KB, 450x657)
762 KB
762 KB PNG
>>102824748
he's right, the colors are way too saturated, why did he think it was a good idea to showcase pictures that went through CFG 25?
>>
>>102824771
very turbo-sloppa
>>
>>102824771
I mean that robot one i prompted for vibrancy on purpose, it was an intentional choice, like a whole "neon" thing or whatever. That's Euler A / CFG 7 though.

If you scroll through there's a bunch of others in there that aren't quite as saturated (along with several by users down in the gallery now that get more into the named artist tags and generally less of a digital art look entirely).
>>
>>102824807
I can't show the others that are geuinely oversaturated they're NFSW but when I saw those images examples I just closed the tab and went on with my life, is he a retard? if he wants us to try his model, better make good examples images and not burned ones
>>
>>102824820
I think he simply has a wildly different aesthetic taste than you and I which is fine. The funny part is how butt hurt he gets when someone calls it slop.
>>
File: ComfyUI_121857_.jpg (1.89 MB, 1328x1664)
1.89 MB
1.89 MB JPG
>>102824820
well, if that's actually what people think I could swap some of them out I guess, I'm pretty sure I know the specific two you mean. I was trying to show a bit more versatility rather than just sticking completely to this sort of ultra clean digital 2D look the whole time though (should be within the bounds of this board cause no nipple lol)
>>
>>102824846
i appreciate the feedback WRT people's perception of actual oversaturation honestly
>>
Legendary Bread
>>
>>102823841
thats a gen, anon welcum to the new age of AI art
>>
>>102825097
kek
>>
>>102811882
The one looking at me
>>
>>102808502
They also said SD3 will be the last model because it will be so good that there won't be any need for any future improvements. Or something along those lines.

Those guys really talked a lot of BS lol
>>
File: file.png (252 KB, 302x309)
252 KB
252 KB PNG
>>102825163
She looks like she has dawn syndrome though kek
>>
what is the veredict on auraflow 0.3?
is it better than juggernautXI?
too poor for flux
>>
are there any tips for comfyui performance boosts? i already do the basic stuff like using a smaller quant, running in fp16 and keeping batch size low, as well as using the newest torch and cuda version available, but wondering if im overlooking anything
for example, i noticed that when it goes over the vram limit (which is bound to happen on a 3070), it doesnt seem to be filling my shared gpu memory but just going straight to normal ram, which makes me think it could go faster if it used shared gpu memory instead?
>>
>>102825213
>too poor for flux
how much vram do you have anon? you can run its quants instead of the big bf16 thing
https://huggingface.co/city96/FLUX.1-dev-gguf
>>
>>102825251
12gb 3060 on comfyui
>>
>>102825257
then go for Q6_K
>>
>>102825262
do i only need the gguf?
>>
>>102825369
you also need the text encoder and the VAE of course
>>
File: ComfyUI_01890_.png (1.75 MB, 1024x1024)
1.75 MB
1.75 MB PNG
>>
File: ComfyUI_01894_.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>>
File: file.png (353 KB, 3013x1440)
353 KB
353 KB PNG
Has anyone tried custom samplers and schedulers from there? If yes, did you find something objectively better than the official ones that are in ComfyUi?
>>
>>102825386
>>102825400
how many JAV loras did you end up making?
>>
what is your favourite illustrious finetune?
currently using dposmooth but it still has the fucking cans
>>
>>102825437
Noob and smoothFT are the best
>>
File: ComfyUI_01783_.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>102825431
not as many as I'd like.
fucking civitai niggers sabotage me.
>>
>>102825437
what do you mean by cans lol
>>
>>102810167
Kolors is IMO overall better than Pixart and Hunyuan stock, and it's UNET + some tweaks and ChatGLM for the text encoder
>>
>>102825615
>Kolors is IMO overall better than Pixart and Hunyuan stock
it doesn't work with windows though, because it's asking to install triton
>>
>>102825605
like just fuckings cans of soft drinks everywhere
almost like some sort of fucking c urse
i dont know what causes it but sometimes when i make a prompt suddenly cans will appear somewhere in the image, either somewhere logical and inconspicuous
or being used as a smartphone
>>
>>102825642
Literally have not had that problem at all. What kind of prompts are you throwing at it?
>>
>>102810167
said who?
>>
>>102825678
just normal danbooru style prompts
i am not quite sure what causes this
>>
>>102812387
>Beta scheduler is so good that not even LCM's shittyness can fuck with it
>>102824163
>the goat "beta scheduler" is here to save the day
I also like beta, it improves prompt understanding, the issue is that it's still burning the picture, the first mf who manages to optimise prompt understanding without the burning will be a good in my book
https://imgsli.com/MzA3NTc3
>>
ok how do i make flux look good now?
>>
>>102825779
>look good
define that, it's so subjective you need to specify what kind of picture you're asking specifically
>>
File: file.png (291 KB, 512x512)
291 KB
291 KB PNG
>>102825788
i am asking for a blue sky and all it gives me is a really blurry blue picture
>>
>>102825779
how to make flux look good: delete flux and install 1.5
>>
>>102825788
>>102825803
i've noticed the file i downloaded has 'de distill' written on its filename, is this the wrong file perhaps?
>>
File: file.png (1.27 MB, 3330x1626)
1.27 MB
1.27 MB PNG
>>102825803
>>102825840
I have no idea how you managed to get such a bad image
>>
>>102825803
i mean it does look like a blue sky desew
>>
>>102815052
Still unbelievable, make it 256
>>
>>102825694
post example?
>>
>>102825386
>>102825400
Assuming those streaks are supposed to be rain and not some kind of weird artifacting
>>
File: ComfyUI_01915_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>102826041
yes its supposed to be rain, you can see that she is wet.
>>
>>102825858
i have no idea what half this shit is
>>
>>102826100
what node do you want me to explain? those are really useful ones
>>
>>102826108
frankly i just want the bare minimum to be able to generate to see if flux is worth the hassle.
As of now it isnt, i cant generate shit. And i followed the guide to the T
>>
>>102825858
do you really need the play sound node anymore? favicon stuff exists
>>
>>102826121
>frankly i just want the bare minimum to be able to generate to see if flux is worth the hassle.
as you can see, I got a nice and clean blue sky, if you're too lazy to make this shit by yourself, just download a workflow and use it, that's all
https://files.catbox.moe/bncuye.png
>>
>>102826127
>do you really need the play sound node anymore? favicon stuff exists
can you give me a link? so far I'm happy with the play sound node, it does its job perfectly
>>
>>102825196
we've seen much worse
>>
>>102826150
its a part of pythongosssss and jnodes
>>
File: ComfyUI_241922_.png (977 KB, 1024x1024)
977 KB
977 KB PNG
>>
File: file.png (881 KB, 1024x1024)
881 KB
881 KB PNG
>>102826139
i am having better luck with the non dedistilled model... for some reason i get trash output with the de distilled model i had
that glow is fucking strange tho
it takes twice the time i do on sdxl tho
>>
>>102826184
Comfy, I know you made a fix to not reload the model when we change loras, but it doesn't seem to work for the GGUF node loader, is that strickly for the regular Load Diffusion Model node?
>>
>>102824709
kek Firefly looks better than most txt2vid models
>>
>>102826205
can you show a screen of your workflow? Surely you fucked something up, can't look this bad on dedistill, I'm using it everyday
>>
>>102825959
512 works on my machine. What's happening on your end?
>>
File: ComfyUI_241923_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
>>102826210
The regular lora loader node is always going to reload the model because it applies the lora once instead of every iteration, it's the gguf loader that shouldn't do that.
>>
File: file.png (2 MB, 1024x1024)
2 MB
2 MB PNG
>>102826225
it's not even looking bad but literally trashy noise
probably fucked up something i've only started on comfyUI like, yesterday or some shit? Still got to see some docs or videos on it.
Another gen, it's generating something at least, but goddamn this looks like trash compared to some SDXLs i been usin'
>>
>>102826247
>>102826184
Afraid of /sdg/? Still posting the same images every single day though.
>>
kek chill, anon
>>
File: file.png (61 KB, 952x318)
61 KB
61 KB PNG
>>102826247
>it's the gguf loader that shouldn't do that.
he reverted it back, the only way I got this working is by adding the OverrideMODEL node, with this combinaison it's not reloading it anymore
>>
File: itercomp_v1_00012_.jpg (142 KB, 896x1152)
142 KB
142 KB JPG
>>
>>102826255
>it's not even looking bad but literally trashy noise
>probably fucked up something i've only started on comfyUI like, yesterday or some shit? Still got to see some docs or videos on it.
like I said, de-distill look fine with the right settings, that's why I'm trying to help you and see your workflow, you probably made a mistake, it's not the model's fault
>>
>>102813926
>This is not
I call bullshit. This has everything Subitiles are know for. It's textbook Subitile
>>
>>102826269
that looks way too plastic to be flux, what model is it anon?
>>
File: file.png (2.33 MB, 1024x1024)
2.33 MB
2.33 MB PNG
>>102826280
Aaand this is the result on dedistill
>>
>>102826294
>that looks way too plastic to be flux
Never thought I'd see that phrase
>>
File: file.png (2.06 MB, 1024x771)
2.06 MB
2.06 MB PNG
>>102826284
congrats anon, you got that joke kek
https://en.wikipedia.org/wiki/The_Treachery_of_Images
>>
>>102826294
https://civitai.com/models/840857/itercomp
Can't do lewd poses
>>
File: ComfyUI_241927_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>102826260
?

>>102826267
yeah that's going to prevent it from reloading but you'll need enough extra vram for the VAE, etc...
>>
>>102826280
The workflow is literally the default workflow, only instead of load checkpoint i added the load unet, dualcliploader and load VAE
the guide i followed wasnt using dedistill tho so i dont know if you have to do it differently on it
>>
>>102826295
>Aaand this is the result on dedistill
kek, CFG 1 is undistilled sovl desu
>>
>>102826313
You have no power here.
>>
>>102826320
>The workflow is literally the default workflow
and that's the problem anon, it's the distilled workflow, if you want to make it work on undistill you have to get a CFG node and put it at CFG > 1 (CFG 3 works fine)
>>
File: file.png (209 KB, 510x346)
209 KB
209 KB PNG
>>102826333
>>
>>102826313
>yeah that's going to prevent it from reloading but you'll need enough extra vram for the VAE, etc...
What's stopping you on making a script that prevents the reloading of the unet model when changing a lora? We've been asking this for months at this point
>>
>>102826344
anon, show me a screen of your workflow, and I'll tell you what to modify, sounds easy enough right?
>>
File: file.png (201 KB, 1359x634)
201 KB
201 KB PNG
>>102826355
>>
File: file.png (722 KB, 1024x1024)
722 KB
722 KB PNG
>>102826361
i put the cfg up to 5 after what you said and i actually got a image
but opening it reveals that the image is too small, even tho i generated on 1024
>>
>>102826361
ok that doesn't look too bad, the CFG is too high though, if you go over CFG 3 it starts to burn, also, T5 Q5_K is rough... maybe you could put a higher quant T5 and put it on your cpu to save some vram storage? you do that with this node
https://reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
>>
File: 00053-2317500264.png (570 KB, 1024x1024)
570 KB
570 KB PNG
>>102826313
>>102826247
>>102826184
4u faggot
>>
>>102825424
https://github.com/ClownsharkBatwing/RES4LYF
>>
File: file.png (293 KB, 1016x870)
293 KB
293 KB PNG
>>102826377
>i put the cfg up to 5 after what you said and i actually got a image
>but opening it reveals that the image is too small, even tho i generated on 1024
cfg 5 is too high, and when it's too high it can blur the picture, so that's expected, if you want to go to such high cfg, you can use an anti CFG burner like DynamicThreshold, like you can see on this picture >>102825858
https://github.com/mcmonkeyprojects/sd-dynamic-thresholding
>>
>>102826379
I will try getting a higher quant then, wish me luck!
>>
>>102826390
interesting, thanks anon, what are your favorite settings on this node?
>>
>>102826295
Yeah it's fucking SHIT
>>
>>102826435
don't tell me you took this picture seriously, he's trolling he's making pictures at cfg 1, which isn't the value to put on a de-distilled model, you put cfg 1 everywhere (SD1.5, SDXL...) and they'll all look like garbage lol
>>
>>102826308
that looks bad, it's obvious he finetuned SDXL with only synthetic data
>>
>>102826446
I made this >>102810965
I will retest again later
>>
File: file.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>102826379
didnt need to use it, managed to put a higher quant with some going into my SSD
definitively not optimal but will do for now, i should probably buy some sticks though.
new image with q8 quant, looks better her hands are fucked up tho...

>>102826446
frankly i have no idea what the fuck a dedistill is
>>
Next Bred

>>102826540
>>102826540
>>102826540

>>102807909
Thanks, added to OP.
>>
>>102826517
>new image with q8 quant, looks better her hands are fucked up tho...
what CFG value did you put on that one?

>frankly i have no idea what the fuck a dedistill is
The original flux dev is "guidance distilled", it means that they modified flux pro (their API model) in a way flux dev can work at CFG 1, why did they do that?

- It's 2 times faster when going on cfg 1 than cfg > 1 so that's cool
- They can hide the real architecture of flux pro because flux dev can have a different architecture of pro while looking similar, it's a way to keep their secret sauce to themselves

Why is it bad that we had distilled models instead of a "normal" one like we used to have with the SD models?
- CFG 1 is limiting, if we were able to increase that value, we would get better prompt understanding and we could use negative prompt
- We can't finetune distilled models, they collapse when putting too many pictures with it, so we were basically fucked if we wanted to improve flux dev

That's why un-distill exists. And the man who undistilled it is a hero in my book.
>>
>>102826558
3, same as the others
>>
File: file.png (2.23 MB, 2176x1339)
2.23 MB
2.23 MB PNG
>>102826571
>3, same as the others
Try to go for 3.5 if you feel it looks better, or you can go for CFG 6 + DynamicThreshold to get the optimal settings >>102825858 >>102826410



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.