Discussion of free and open source text-to-image modelsPrevious /ldg/ bread : >>102203927>Beginner UIEasyDiffusion: https://easydiffusion.github.ioFooocus: https://github.com/lllyasviel/fooocusMetastable: https://metastable.studio>Advanced UIAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgeInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://civitai.comhttps://huggingface.cohttps://aitracker.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/derrian-distro/LoRA_Easy_Training_Scripts>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/flux>Pixart Sigma & Hunyuan DIThttps://huggingface.co/spaces/PixArt-alpha/PixArt-Sigmahttps://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiThttps://huggingface.co/comfyanonymous/hunyuan_dit_comfyuiNodes: https://github.com/city96/ComfyUI_ExtraModels>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>GPU performancehttps://vladmandic.github.io/sd-extension-system-info/pages/benchmark.htmlhttps://docs.getgrist.com/3mjouqRSdkBY/sdperformance>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-restsd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium>Maintain thread qualityhttps://rentry.org/debo>Related boards>>>/h/hdg>>>/e/edg>>>/c/kdg>>>/d/ddg>>>/b/degen>>>/vt/vtai>>>/aco/sdg>>>/u/udg>>>/trash/sdg
>mfw
>>102209070thanks for bake
did they release the new pixart model yet? only the chinese can save /ldg/ from it's current horrific condition
>>102208406>>102207981Testing selected block training again, this time at a dim of only 8. Still getting visible results, but has a few flaws. It also turned the happy meal into like a blood elf happy meal or something.
grid dodge
>>102209109there is no saving it, from the first 4 posts, 3 are avatar fags. just make a general for pixart or for whatever other model releases next.
>>102209137It's fluxface!
>>102209174>walking through forest>deep in, not really any signs of civilizations or campers>see thiswhat do
>>102209167epic
/sdg/ is empty and all the demons are here
>>102209197Do bean burrito flavor
>>102209174cool
>>102209221ignore and report whenever possible, its the only way
>>102209197>Disgusting goyslop gensCreative
Me and my gf, back when she was in her old apartment.
>>102209232also filter them with stub:no; before the filter
>>102209233rate my creativity
>>102209274fluxgirl
>>102208918>enjoy your 40gb modelThat's for the encoder+decoder at FP32. You only need the encoder for image gen, current T5 is also 40GBs for the original https://huggingface.co/google/t5-v1_1-xxl/blob/main/pytorch_model.bin
>>102209197>advertising for free
random observation: changing from torch 2.4.0+cu121 to 2.4.0+cu124 changed the outputs ever so slightly
are the current flux controlnets any good?does stuff like ipadapter work?does swarmUI have any good features for flux yet?
>>102209274man I'm kinda losing interest in Flux because of how bad the model is at doing different art styles. It's like it has 3 settings.>Anime>Ugly disney Pixar slop>High contrast live action slop that looks like a Marvel movieand no easy way to force it to use one over the others no matter how heavily you prompt for anime
>>102209342>does stuff like ipadapter work?yes, but rather poorly
>>102209345loras?
>>102209274would be easier if you would gen something else but the same type of gel over and over again.. sure in doing just that you show some creative effort, but your flexibility to be really creative you maybe should try to prompt something else but samegirl all the time
>>102209411thanks for typing all that.
>>102209397loras seem like they limit the model's creativity too much. a style lora might be nice though.
>>102209411Flux makes various fluxgirls. This is Flux's default non-photographic girl. She is very similar to Flux's generic girl.
>>102209426even with the flux default girl you can get more creative than just "girl standing"
>>102209386what the fuck is wrong with his knuckles
>>102209443nta, but probably punched the wall to often?
>>102209443Are you making fun of my hands?
>>102209342Ipadapter sucks and is horribly unoptimized. Everything released by xlabs is terrible
girl standing is good, fur mich, because i like to look at them, standing. makes me feel good, in a few ways. but ill gen some of pic related, i think it is needed.
>>102209457It's not.
>just bend the knee and suck HF dick bro>why are you afraid of being a retarded cuck like me?
Is there a Flux zoom script?
JUST ONE MORE GEN BEFORE BED
>>102209532that's how I've ended up going to bed at 5 am these days :( the worst part is I'll actually try to go to bed earlier, but I end up going on my phone and genning crap on HF schnell..
It's like watching a roach seize up after getting sprayed with poison
It's ok, he's a code inspector, he knows what he's doing.
>>102209615inspector? I hardly know her!
Still not asking for it.
>>102209634stop posting this roastie.
>>102209659whats a roastie
>>102209673get some new prompts you lazy nigger
>>102209670>>102209661Flux strongly favors spade heads, in women
>>102209681your prompts: 0also, why do you hate women? gay?
>>102209697mangled hands
>>102209070Good collage
me on the right
>>102209713Flux is too busy hunting down titties to stamp.
do not engage with him
>the schizo keeps replying to himselffiltering > recursive hiding
>>102209723Creepy ngl
>>102209706>your prompts: 0now you look stupid, like the roastie you keep posting.>why do you hate women?not woman as a whole but particularly the one you keep spamming.
>>102209706Women like dicks and liking dicks is gay therefore women are gay.
>>102209749Is that talc, does she smell like feet or something?
>>102209765He's a tradie. He's filling the holes.
>>102209730I wouldn't say yes anyway. you're not my type>>102209739congrats on your first gen>particularly the one you keep spamming.why?>>102209741damn, true
>>102209769I hope he got the foot surgery he needed.
>>102209780the buttchin sisters
>>102209785make some new material you lazy niggeralsoquality > quantity
>/ldg/ when being asked to discuss anything to do with image diffusion
>>102209791>>102209769mangled hands
>>102209791aww thats adorable, looks like a real image
>>102209791cute image, prompt?
>>102209857mommy
>>102209808I have new material every day. but why pretend to care? you'd hate anything I post just because its me posting it. just say that instead of lying>>102209830what are your top 3 favorite image diffusions?>>102209843lora or vanilla?
lots of trolls here rn, difficult to have any sort of productive discussion
>>102209862vanilla, also nice pic
>>102209862I prefer this to more space station women desu
>>102209864creepy olaf
is there a better way to filter through LORAs in comfy instead of going through that long list of 100s with a simple search bar?
>>102209730I think he's having a bad night. Just let him reply to himself and he'll get bored.
>>102209864You fags have your own thread, why not use that?
>>102209883The have the crippling realization that they are the kids they used to be in highschool. It's worse than being bullied they just get segregated and ignored by everyone.
Do you think it would be possible to get the no avatar fagging rule enforced in these threads at some point? It kind of fell into a gray area because of the nature of the image diffusion, but it's been years now and people are just using it to gen their passive aggressive avatars in new ways. It's about time we started enforcing the rules again.
>>102209862>you'd hate anything I post just because its me posting itnope you are a paranoid faggot.
>>102209896>>102209874
weezin tha jui-uuce
>guidance 3.5>looks like shit>does what i want>guidance 1.7>looks amazing>doesnt care about my promptwhy cant i have my cake and eat it too
>>102209865>vanillathats nice. I tried to get some rave/festival gens but couldn't get that organic candid aesthetic. any tips?>also nice picthanks!>>102209883you're the ones who don't like women
look, im being creative, tak for the encouragement. i feel like i am elevating the thread quality.
>>102209993nope it just did it by itself.. been using the same prompt for all these mommy gens..
>>102209913name three of your favorite gens of mine>>102209991guidance is a pretty lame param. very narrow functional range and doesn't even affect much control. somewhat unpredictable behavior too. I tried dicking around with all the various cfg techs but nothing seemed to work well
>>102210024I dont care
Instead of posting in a thread that doesn't want you how about you look at job applications?
>>102210057this is our thread now. were elevating the prior lackluster quality(of the thread)
>>102210082Thanks for the free bumps. The desperation just makes you look worse
low dist cfg scale indeed has nice texture, sadly the rest..>>102210095uh, you are welcome.
All of my character LoRAs in ai-toolkit using specific block training all became substantially better once I set the alpha to 10X what the network dimensions are by accident and I can't tell if it's all in my head or not. Someone said alpha isn't even working with AI-toolkit
/sdg/ is the tech thread now and this is debo containment
enough spooky gens from me for now>>102210258amusing turnip events
>>102210258so there's no difference between the threads? maybe we should unite them into one
>>102210258Giving people false hope is cruel
>>102210313Do you not have a job or friends or family or go to school or what is the deal why and how are you here all day? I dont hate you I could care less about avatars and lore and whatnot but I feel like its a big reason people dislike you.
>>102210336These are great and creative
>>102210325>I dont hate you I could care less about avatars and lore and whatnotindeed you could care less
>>102210356indeed
Read the OP new friends
>>102210345thanks! flux is great for exercising brain cells
id ask for gen tips from the discord crew, but i am vastly more capable than them, and i am not conceited at all, so i am afraid they have nothing useful to offer. probably why they instead spew negativity and hate.. sad situation.
>>102209070I'm not going to make it, /g/!Carry on without me!
anyone else treat this like a slot machine? adjusting this and that and doing 1000 gens per day until i hit the jackpot and make something decent, instead of downloading complex workflows/trying new nodes and technology to up my game.im such a low-IQ meathead
>>102210482They ran you out
>>102210506gn (or sorry you're dead?)>>102210514sort of but not in the way you're describing. I'm not just pulling the lever and seeing what comes out, I'm trying to hunt for machines with better odds and tricks to increase my EV. I'm usually not happy with a prompt/workflow if it doesn't have a high success rate
>>102210492I've seen that stuff in my loft
>>102210526>gn (or sorry you're dead?)No, I'm just a coomer on the loose in the thread.
>>102209697Sweet cheeks!>>102209922And this one looks Sick!
>>102210555Do not engage, please.
Tested this:https://www.reddit.com/r/StableDiffusion/comments/1f523bd/good_flux_loras_can_be_less_than_45mb_128_dim/In terms of details, Kohya_SS 16 DIM/Alpha is superior to Ostris AI-toolkit 128 DIM/Alpha 2 layers block 7 and 20. I cooked all day, but could not get good result on AI-toolkit using layer 7 and 20. Perhaps more layers are needed when you got complex characters. Also, Ai toolkit only train 1 trigger word by default. I have yet to find out how to include ALL the tags that Kohya SS does for greater control of the subject on AI toolkit. Since file size isn't an issue for me, Kohya_SS is the superior way to go if you want accurate character replication.Once Kohya implements layer training, Ostris AI Toolkit wont be of much use due to lacking certain features. Hopefully the turk pestering kohya about training layers on github wont make him rage quit out of spite.
>>102210524certainly a negative experience, but for the better, why would i want to be in a discord with a bunch of mean shitty ppl? i ask (you)
>>102209991how about guidance 1.7 + high CFG?
>>102205410>>102205502cool images anon, it's a lora from flux?>>102207561when we'll download such loras, the T5 finetuned will be included in there right? But that also means that now we have to download a 9gb file on top of that 30mb lora? fuck
>>102210570You got chased out for being a sad cunt lol
>>102210570sure thing debo, everyone is bad except you right?
>debo sneaking out of his containment thread at nightWhere’s his mother to keep him in his cage
>>102210579the cfg hack looks like shit at that low guidance.
>>102210600i was too good for your discord. pic related
>>102210686mangled hands
>>102210686>pedo thinks he’s too good
>>102210566yeah, kohya seems to be the superior outcome here. man, I want him to give us block training so we can do some more optimal testing but I also kind of want the grifter to suffer...
>>102210749>pictured: my GPU generates the image I asked for
lets do some sports gens
>>102210566you haven't used the same trained words though, the comparison isn't really apple to apple, unless you can't simply add her name on Kohya or something? sorry for that retarded question I never trained a lora so... kek
>>102210749
>>102210848kek
>>102210829The name scarlett is there for Kohya_SS. Booru style tagging includes tags like white background, cowboy shot, etc ,etc. I don't know how to make Ostris Ai toolkit keep all the tags in the txt files.
https://github.com/city96/ComfyUI-GGUF/pull/92It got merged, that means that now when you change a lora or change a lora's strength, it won't unload/reload the model, good job city!
>>102210589>T5 finetunedNo, you'd just also apply the T5 LoRA to the t5. Ideally you'd just apply the 1 model to both the t5 and the model.
>>102210877>he's samefagging asspats again
>>102210883oh yeah right... desu it's amazing we managed to get such accurate loras so far without even finetuning the T5, now with this shit it'll get even better
id watch sports if.. nevermind
>>102210893that's really not how it works, you truly don't need to finetune the T5 for a lora
>>102210911by "finetune the T5" I meant "apply the T5 LoRA" in case there's a misunderstanding
>>102210848At least it comes out of top than bottom
>>102210566and here i am, still waiting for someone to fix the pivotal tuning pr for sdxl but seems it won't ever happen
>>102210920it's the same thing; you don't need to train the T5 with a lora, it already knows everything you're trying to "teach" it
>>102210928then there's no point in doing a T5 lora then? why are they implementing it then?
>>102210877I'm gonna try this out later, wish me luck!
Harry Potter sees GOD while tripping.
>>102210938experimenting, I assume. even flux itself did not train the T5
>>102210877ehh? I got different pictures now since his new commit https://imgsli.com/MjkzNTg1
>>102210954>yfw chitty unknowingly ruined sailor moon
>>102210963i blame /b/ and their avatarfags raiding every aigen thread
>>102210954yeah it's not a fluke, it changed the way of making output, what have you done city? ;_;https://imgsli.com/MjkzNTg3
>>102210982>>102210954
>>102211010kek'd
>>102211005thumbnail looked like he was going for a sick dunk
>>102211010wait, maybe that's a false alarm, I got different pictures compared to the non-merged PR, maybe it wasn't finished and had such issues in the first place, when I compare to pictures I made a week ago I got the same result
Captioning your LoRA datasets is an act of hubris that states. A declaration that you know exactly how Flux is built and how it classifies all objects in an image. Pure retardation
summer is fading
>>102211109what if you only caption them for style, rather than content
>>102211116I wish avatar fags worked like bugs and died when it started to cool down.
>>102211123>only caption them for styleI don't see why a simple trigger word for that style wouldn't be enough.
>>102211005When I see a skeleton, I think Mr. Bones wild ride.
>>102211143Now with more tracks!
>>102211158Last one for the night!
does anybody got any landscapes? >>>/tg/93796173
I don't understand how people can generate one person in one style going on two years now. How is it possible?
>>102211210its called a mental illness
>>102211187sure I'll post a couple1
>>102211187>>102211221and 2
>>102211210seething over it seems more weird
>>102211228nice
>>102211210the brain is stuck in a nasty feedback loop, just needs a little whack but there is noone to do it>>102211229yeah but miku nr. 34987598375937458973 gets a little tiresome
>>102211210Imagine you see a woman rambling about systems of oppression and spiritual healing while on food stamps (real world example). You will immediately assume she's either really stupid or mentally unwell (possibly both).But stupidity and mental illness manifest in different ways and just like twitter is full of examples of above, 4chan is full of similarly dumb and crazy men.>>102211229This is an example of when a healthy adult gets put in a mental institution and eventually starts believing that the patients are the normal ones. Basically anyone without a healthy social life irl will be subject to this.
geting a weird "The given NumPy array is not writable, and PyTorch does not support non-writable tensors." console entry when using a GGUF flux model in comfy, what to make of it?
nice thing about being on disability, all day can be spent genning cool gens, like..nevermind
>>102211346whats your disability?
>>102211371not tell you, lol. gn losers, did what i could to maintain the garbage thread quality, see you next shift
>>102211371Probably autism
flux finetunes when
>>102211388>not tell you, lol.why not?
>>102211371he's got a serious case of sad-stickophilia.
>>102211408I think it won't happen, it's asking for more than 24gb of vram, Nvdia fucked us in the ass and we can't do multi gpu training innit?
>>102211388>>102211415it's literally this scene kekhttps://youtu.be/mBx-eT8D5nc?t=39
>>102211053yeah, definitely a false alarm, the merged PR is good, yesterday I wanted to test this feature so badly I got the incomplete and less accurate version kek https://imgsli.com/MjkzNjA5
>>102211548How many times you gunna post this same gen
>>102211573I think you need to read posts before commenting on them
>>102211548did you ever a/b test the t5xxl_fp16 vs the t5_v1 gguf q8?
>>102211585>I thinkI don't really care
>>102211596>I don't really care
>>102211595I didn't, but you can do it I guess, I'm pretty sure the difference won't be that big
>run comfyui headless>perfectly stable>run comyui with a desktop>nukes entire session into orbitit's over
>>102211605gonna do that now
>>102211609Are you comfy yet?
>>102210939how did it went anon?
>>102211664I don't know how to use tmux so noI cheated by disabling hw accelerate in firefox, saves like 600MB of vram, maybe it'll work this time.gnome still eats 500MB though
>>102211668https://imgsli.com/MjkzNjE0hm. same smooth-best-absurdres clip_l but its a dual sampler setup with some noise added and I didn't sync that so a tiny variance would've been there anyways. "hm."
>>102211697it should be the exact same settings anon :(
>>102211668oh fuck nvm me. it went well! as advertised basically. no reload on weight change to lora is big. thumbsup
>>102211714Ikr, I still remember that anon saying that preventing the unload/reload shit on lora changes was too hard to do on ComfyUi, turns out he was full of shit. And I'm glad he was wrong, that's a really important feature, now I can experiment the strength of a lora without having to wait for ages, feelsgoodman
>>102210977/ldg/ started it, reap what you sew
published the Rocksylight LoRahttps://civitai.com/models/716286?modelVersionId=801012
>>1022117022 more. (1st one is with dynamic thresholding into tonemap into adaptive guider, 2nd one is CFG 1 basic guider). noise injection seed was locked this time.https://imgsli.com/MjkzNjE5https://imgsli.com/MjkzNjIw
>>102211770yeah it's obvious the fp16 gives better results>1st one is with dynamic thresholding into tonemap into adaptive guiderI'm using AutomaticCFG now, I'm getting better results imo
>>102211740Hats running in kino Miku?
>>102211780just automaticCGF into adaptive guider? been testing that shit all morning, its confusing and time consuming and my workflow is, well "getting a little crowded here boss"
>have over 500 datasets I hand curated I want to make into flux loras >can't find the motivation to finish tagging any of them I hate myself
>>102211815kek, I got the same convoluted shit, can't relate more, but yeah, AutomaticCFG + AdaptiveGuider there is, do not use the boost thing though, this shit is killing the output
>>102211740What's in kino I meant :( miss-typed with my grubby fingers
>>102211825That's okay, tagging isn't necessary.
>>102211845>What's in kino I meantI'm not sure to get what does that mean :(
>>102211763I thought this was a non gen img not gonna lie. Solid work, very uncanny next to the real thing.
>>102211851I thought kino=cinema was common internet vocabulary. I think I might be "special" :')
>>102211872Oh, Idk what movie she's watching but she looks really scared desu kek
>>102211780fp16 encoder, Q8 GGUF model w/ 1 lora active: 95% of 24GB VRAM, so near OOM basically. as long as its just one lora that is ok but yeah, that needs to be considered.
>>102211847yes it is, I don't want to make shit loras like you
Nice facial expression she got there
>>102211882maybe you should ask that guy to make a Q8+ gguf on T5, it would be closer to fp16 that wayhttps://huggingface.co/mo137/FLUX.1-dev_Q8-fp16-fp32-mix_8-to-32-bpw_gguf
>>102211825What's the rush? Flux lora training sucks at the moment anyways
>>102211872>kinoKino is actually also the German word for cinema.>Do you want to watch a movie in the cinema?>Willst du einen Film im Kino sehen?
>>102211948lichtspielhaus
Is there a way to get a dynamic steps shit, like the first half (25 steps) would be like a 50 steps setting and the end would be rough like 10 steps, because I feel like the end of a gen is overkill and doesn't need that much step to work
>>102211883No tagger would admit their tags only do harm being they wrap their entire ego around their tags.
>>102211858thanks bro
>>102211948As mush as I know also in Finnish and in Estonian
I finished the dim 256 lora with 10 blocks trained. Compared to the dim 64 lora it does not bleed the style through to everything as much. Still can do it when prompting for anime, but details remain abit more like without lora and realistic gens dont suddenly swap into anime style sometimes. Also small details like pointed ears appearing on normal humans. It is more like the dim 32 lora in style I trained earlier when not prompting for anime. Also size is 90mb vs. 330mb for dim 32, or 650mb for dim 64pic related>>102211970haven't heard that since 1920
>a pulp cult anime illustration from japan,>Hatsune Miku holding her bike with her right hand, hands up in the airis Migu a member of /pol/??!!
Every once in a while proompting I still get blown away that this technology exists for us to use at the mere cost of a $300 video card. We live in amazing times.
>>102212133amen anon, fucking amen
>>102212089it looks more like her at dim 32 somehow, especially when you look at her red ribbon
>>102212133>literally a machine that allows you to see whatever you want to seeliterally magic
>>102212133yaa!>>102212101that grammar will confuse t5>A pulp cult anime illustration from Japan. Hatsune Miku is lifting a bike in the air with one hand.>>102212177yes, removing blocks seems to make it forget details, but also doesnt make it bleed in wrong stuff, hard to decide which is better, I need to do more tests I guess
>>102212133Just think of how much money went into DALL-E 3, only for it to get mogged by this
>>102212201>removing blocks seems to make it forget details, but also doesnt make it bleed in wrong stuffthat's the dilema of loras, too much weights change rape the model too much and you get concept bleed, not enough weights and you lose the details, that's why we need a finetune instead, Loras can't be the final solution to all of this
>>102212201thanks for your prompt anon, I got some kino out of it
>>102212212>Just think of how much money went into DALL-E 3, only for it to get mogged by thisDALL-E walked so that Flux can run, we can't overlook its contribution and influence
>>102212212BFL is strangely very silent about how much money went into making flux. I remember SAI bragging about the hundreds of thousands they spent on training SD15>>102212232I guess so.. pic related is the dim 256 version of >>102208419
Come and get your next loaf of...>>102212255>>102212255>>102212255
>>102212101>German developers>>10221225014.88 million