Discussion of free and open source text-to-image modelsPreviously baked bread : >>103082516Comeback Edition>Beginner UIFooocus: https://github.com/lllyasviel/fooocusEasyDiffusion: https://easydiffusion.github.ioMetastable: https://metastable.studio>Advanced UIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgereForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://aitracker.arthttps://huggingface.cohttps://civitai.comhttps://tensor.art/modelshttps://liblib.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3>SD3.5L/Mhttps://huggingface.co/stabilityai/stable-diffusion-3.5-largehttps://replicate.com/stability-ai/stable-diffusion-3.5-largehttps://huggingface.co/stabilityai/stable-diffusion-3.5-mediumhttps://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium>Sanahttps://github.com/NVlabs/Sanahttps://sana-gen.mit.edu>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/fluxDeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest>Maintain thread qualityhttps://rentry.org/debo>Related boards>>>/aco/sdg>>>/aco/aivg>>>/b/degen>>>/c/kdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vt/vtai
Blessed thread of frenship
One last before bed.
>>103089728I lied.
>>103089665woah my bubble butt made it into the epic collage
>>103089754congrats anon
>>103089772classic example of a gen where you can see a nipple or other nude detail when you zoom out but not when it's full-size. Shows that the model was undecided about whether it was supposed to show bare breasts or not
>>103089832ty
let's draw straws to see who will stay up tonight and stand guard. we can't fall off page 11 again
>>103089896Very smug
>>103089990Top Smug
too much 1girl ITT
How important is token order with NoobAI-XL?
>>103090174You are ordered to toke.
>>103090312As you command.
>>103090174><1girl/1boy/1other/...>, <character>, <series>, <artists>, <special tags>, <general tags>This is their caption order from their model page so probably stay close to it for best results but feel free to try to diverge and make something fun.
>>103089989this blessed thread will take at least twelve hours to pass likely more
>>103090788Kekd
>>103090788lmao
>>103090158I dig this style.
>>103090811catbox?
>>103090879looks like Mayli
>>103090886good job anon
>>103090886anon...
Awh sweet a Mayli thread
wtf I randomly got this
Yeah but where's the cum?
>>103090901YJK
>>103090934make the ice cream cone comically large
>>103090956what model is this?
>>103090966think carefully, anon
I need to train a Lora badly, should I use OneTrainer or Kohya?
pony v7 test models in <24 hours
>https://huggingface.co/tencent/Tencent-Hunyuan-Largethe hunyuan chinks are making a 389B MoE LLM with 52B active parameters, that means they have the compute to utterly wipe BFL off the face of the planet if they wanted to
>>103091142you want them to make a giant image model no one will be able to run?
>>103091139I hope it's SAFE and ETHICAL with all nsfw and copyrighted material removed from the dataset
>>103091154show me exactly where i said that
>>103091161>show me exactly where i said that>>103091142>that means they have the compute to utterly wipe BFL off the face of the planet if they wanted towhat do you want them to do? make a small model? If that's the case you don't need "the compute"
>>103091139cautiously interested>>103091142>utterly wipe BFL off the face of the planei bore of this corpo tribalism, just give me a nice model that will run on my hardware
>>103091171>what do you want them to do? make a small model?what? all i said is that if they are capable of making a 389b llm they have significantly more compute than BFL does, that's all. what are you on about?
>>103091139>cucked model that has the artist names removes + trained on AuraMemenothingburger
>>103091182>all i said is that if they are capable of making a 389b llm they have significantly more compute than BFL does, that's all.no, you said that they have enough compute to destroy BFL, now tell me, how are they gonna defeat Flux-12b?
>>103091192are you stupid or what
>>103091202Concession Accepted.
>hed rather stack layers than optimize what he already has Sad
>>103091214unfortunately, you can't defeat Moore's Law, even a perfectly trained small model will never defeat a half assed trained big modelhttps://www.cs.utexas.edu/~eunsol/courses/data/bitter_lesson.pdf
>>103091223>never defeat a half assed trained big modelthe flux 8b distilled version proved this wrong, flux doesn't need all 12b parameters. a well trained 5b-8b could beat flux
>>103091236>the flux 8b distilled version proved this wrongare you serious? the 8b isn't close to what the 12b is able to achieve, but I tell you that, it's not because of the size, it's just that removing parts of a model is a bad idea overallThere's a reason every single AI company on earth go for giant models, that's because "perfectly training a small model" is utopia, they rather take a genius giant model that can chew shit and shit gold than having to bother to perfectly filter the training data to train a retarded little model to be 5% better
>>103091253>every single AI company on earthno, only the LLM ai companies, image gen models are no where near saturating their parameters unlike with text gen where they are reaching the limits of the transformers architecture
>>103091262>image gen models are no where near saturating their parametersunfortunately, neither you and I were on their lab to see their loss function, the only evidence we have is that Flux is by far the SOTA model, and it's a giant 12b model, call that a coinscidence, I call that Moore's Law
>>103091154No one gives a shit about the random dude making pics for himself unless they break a law. They only care about the money.
>>103091223>unfortunately, you can't defeat Moore's Lawthere's a trick to deal with it though, find a better architecture, I don't believe the transformers architecture is the end of the road, if we find something better, we could truly get a great base model with fewers parameters
>>103091139imagine the stress astralite is feeling over a thing that makes little furfag pictures
>>103090028What model did you use for this?
>>103091139>pony v7 test models in <24 hourshow do you know that? he said it on his discord or something?
>>103091388Flux (notice the apparent resolution)
>>103091458Thanks, do you know what flux model exactly? I really need to get to playing with it sometime. I am using juggernautXL.
>>103090411am i retarded or is that gay
>>103091536Yes
>>103090788lora?
>>103091536i just do <artist tags> first <character>, <series>, <general tag like yuri or something>, <1girl/1boy/1other/...>, <the rest> and quality tags at the very end
>>103091641
>>103091672Shieet
china save us
Hi, automatic tourist here. What's your preferred upscale method in comfy? I'm used to just pressing hires fix and not sure how to get good results now.
>>103091938i don't use comfy but the way hires fix works is that it upscales your image using an upscale model and then downscales it to your prefered size (so if pick resize image by x1.5 and you use an x4 upscale model it upscales the image by x4 and downscales it to x1.5). after that it does a low denoise with a ksampler (image-to-image) on the image to smooth out the upscaler's imperfections. recreate that and you should get hiresfix on cumfy
>>103091938>>103091975this link should also help you out>https://comfyanonymous.github.io/ComfyUI_examples/2_pass_txt2img/
>>103091938SD ultimate upscale node
>>103092023Well this is old meme. Reminds me of those counter scammed Nigerians.
>>103092065
>>103092013kek
>>103092098
"Hires fix" is like caveman technology nowadays
Come on genmo, give us MochiHD and the i2v encoder
>>103092530>Picsartmust be a sign... sana 2 soon?
>>103092110>>103092013i like chickens, please don't make them say rude things
im a bigmasexual and nothing can stop me
love me 1girlssimple as
>InCase Style
>>103090826Thanks, it does give a cozy feeling
Aren't /sdg/ and /ldg/ the exact same threads?Why do we need two different generals?Also people on /b/ are genning shit turds in toilet bowls, can't make this shit up.
>>103092754
>>103093494ty anon, I will play about with Mochi today too.
>>103093510i'm bored with flux to be honest and this is more fun atm to me than seeing what SD3.5M/L has to offer. that and cogvideos i2v sounds interesting, have you messed with that one any? once i'm out of this lazy mood i'll prob try that next.
>>103093571Cog will not work on my OS, Linux, I do not know why, I have spent maybe... 10 hours all told trying to get it to work and I've been using this sort of tech since 2019. It's just one of those things that's niche specific to this install even after using seperate envs for all of the tools, and I cannot be arsed to try anymore.I stopped using Flux due to the gen time and I kinda got bored with single images.We're all waiting for Mochi full and release of img to vid for preview and full.I still cant decide which is the superior t5 model to use atm, currently using google1.1 encoder only. That's a very nice clip. Imagine what we could do with 32GB+
>>103092005if you use this coming from A1111 half the denoise that you normally use.
So how does sd3.5_medium-F16 compare to sd3.5_large-Q4? Which one should I use?
Prompt: rabbi yeshua
>>103093632that's why i keep putting it off too, i already know it's probably going to be a lot of fiddling to get it up and running.POV videos are p fun but i couldn't get a good warzone one so gave up after a few tries.
>>103093993Not neccessarily, it may work straight off the bat and tbf if i went over the error logs and satisfied everything it would probably, eventually, work. I just can't justify the time ivestment when Mochi is the new hotness.I think you should give it at least one go.
>>103094082oh i just know after reading through the documentation that incorporating everything to get what i want is going to take some messing around. while i'm not adverse to it, i just have to be in the right mood to want to do it. like you atm i'm just having too much fun with mochi.
>>103094194This looks coolI wanted to make my own space battles but I feel like the lack of control for video generation would be annoying, so my idea was to use Unreal Engine and plot out basic space battle movies, then run them through an AI filter to make them look cooler
>>103094488yeah not going to lie i was looking at this a while back myself https://github.com/tin2tin/Pallaidium
Looking for recs on good checkpoints or loras to create scenery, landscapes etc.This shit is fucking ass.
>>103089665Bonus round of: >>103082516
flux needs to be thrown out, it always generates the same exact facetrash
Local (Video) Diffusion General
>>103094695
>>103093821looks accurate to me
S A M E F A C EAMEFACE
>>103094859Always the same fucking face.
couldn't you like.. inpaint different features into your face to make it more unique?
>>103094591literally any XL checkpoint
>>103094934You had a stroke. Now all faces look the same to you. The facial recognition portion of your brain was deprived of oxygen for too long.
I blame proompting.
>>103095137Skill issue.
>testing new models>it's so shit I stop it mid-preview to not waste electricityfr no cap
>>103095166same
>>103095159is prompting a skill?
>>103095180Promptcraft is a noble profession.
I'm a promptologist, if you will
>>103095190Is promptcraft a form of wordsmithing?
Ever find a model so ugly it's actually kinda interesting?
>>103095251sweet
i installed this shit in my custom nodes folder via git clone but it doesnt want to load it inwhat am i doing wrong?
Is there a rhyme or reason to this sort of artifacting? Or can I somehow fix it? NoobAI, Euler, 920x1224, 45 steps. Don't know what the consensus settings are for NoobAI.
>>103095309nvm, fixed it somehow by cloning it through the node manager instead
>>103095335Apparently NoobAI doesn't like prompts that do not inclue at least one artist to reference a style from. Also inpaint. Also also you don't need that many steps. 35 tops.
>>103095335try euler a instead and don't use so many steps, 20-25 is more than enough. i like euler a with sgm uniform.>Is there a rhyme or reason to this sort of artifacting?that just looks like sdxl being sdxl due to the vae. you can fix it using inpainting/adetailer. i also recommend you stick to the recommended sdxl resolutions>https://stable-diffusion.app/blog/best-sdxl-sizes-recommended-sdxl-generation-sizes/Also, like this anon said >>103095473 both noob and illustrious are made to be used with artist tags and don't have a default style. you can try putting a few artist tags at different weights and make your own style mix
>>103091080c'mon anons
Yeah like the other anons said you can try and not use an artist tag for noob but what will likely happen is something like this >>103086808 noob doesn't know what do without at least one artist on the prompt
>>103091080>>103095583https://github.com/derrian-distro/LoRA_Easy_Training_ScriptsUnless you're training flux or sd3. I put it in the sticky for a good reason, why was it taken out?
>>103095613Sorry whats the best option for a Flux Lora?
he posted slop, don't answer his question. he must pay for his sins
at least it's an interesting posei'm a sucker for good posing
aaand ruined
he's going out of control, someone stop him
>>103095701she's knotting the wall
>>103095710now I can't unsee it
>>103095806it's clearly AI generated because the bird doesn't have boobs
>>103095806I feel like he's about to yell some expletive.
>>103095806seems like he has something to say
>>103095806>N
https://www.reddit.com/r/StableDiffusion/comments/1gjl982/lora_torchcompile_is_now_possible_thanks_to/for that one anon that was trying, looks like only the fp8 fast turd comfy put out works.
>>103095806Can u make chickens that don't look like eagles tho??
>>103095836learn to inpaint foo! cute bobo girl otherwise
>>103095880i don't think even inpainting can fix illustrous/noob
>>103095880Yeah I really should check it out how to use inpainting
>>103095940Only masked, original, 0.6 ~ 0.4 denoise and you're off to a wild ride.
what's the best model/workflow to generate dating profile pictures of myself having cool normie friends doing cool normie things?
>>103095965I'll give you a tip, if you fabricate your life women still won't date you because you won't be able to cash the checks you are writing. I'll give you a pro tip, Chad can post a blurry mirror pic and get matches.
>>103095940>https://rentry.org/fluffscaler-inpaint#detailing-parts-a-better-adetaileri also recommend using the 'close-up' tag when inpainting
Why is /sdg/ /pol/...I lurked some more about >>103095144 and it seems the answer to my first question is that it's better to use all 3 text encoders. And about upscaling, they say 3.5medium is actually trained up to 1440px. So there's no simple way to upscale it more using itself, unlike SDXL?
>>103096038doesn't look like it
>>103096038>they say 3.5medium is actually trained up to 1440px. So there's no simple way to upscale it more using itself, unlike SDXL?it means you should be able to upscale upto 1440 safely without the image turning into a mess like with sdxl when you hires fix with too high of a denoise i think>answer to my first question is that it's better to use all 3 text encodersi think so, the t5 and clip_l is for sentences while clip_g should be for tags like sdxl. i haven't played around with sd 3.5 much so i can't say for sure. i'm waiting for finetunes
>>103096079
>>103096029Thanks, I usually use adetailer but it's good to understand what is actually happening though it might not be enough to fix illustrous/noob gens.
>>103095880Ahhhh fuck I just saw the face up close, yeah nah that one is my bad I'll tweak that one later LOL
hunyuan....
bigmart sigmax..
kkkolors...
western suicide status?
>>103096210hunyawn released both a really big llm and a text to 3d model a few hours ago so there's a non zero chance they might release a text to image model too
>>103096225>inb4 more teenage asian smartphone selfies
>>103096079i like this aesthetic
>>103091139Is this pile of shit out yet or
>>103096238i prefer cute chinese jade beauty smelly feet selfie to middle aged western shemale in midlife crisis selfie
>>103096087>clip_l is for sentences while clip_g should be for tagsI still remember clip_l for styLe and clip_g for subGect. Funny how no one really uses them separately anymore regardless of the correct usage.
>>103095848p good too bad it's only fp8
>>103096210>western suicide status?https://youtu.be/twV0ru_5KY0?si=9aOg9BTxIfewADJI
>>103096210
>a photo of Jennifer. She's sitting on a desk in a library. she has a short pixie style haircut, and she's wearing rectangular reading glasses. she's wearing a t-shirt and a long skirt with suspenders.why is FLUX giving me 90% anime pictures with this prompt? i'm even using a lora trained on real images.
>>103095817>>103095827>>103095837>>103095842
>>103096491describe the characteristics of a photograph further flux wants you to be verbose
>>103095627Kohya is easy to use, and its UI is lightweight. OneTrainer is slower and has more limited settings.
Death march
>>103096087Alright, I see. So I can gen even 1.4MP straight up but can't upscale higher than that at all. Even x1.6 is already garbled.>turning into a mess like with sdxlHaven't touched base model since it was released, so then since any finetunes I use can do upscaling easily that must be because they were trained with higher res images; and 3.5 finetunes could be fixed similarly?>>103095946Thx for the prompt lul.
>>103096583You could've just asked.>1girl, solo, dark fantasy, fantasy, sexy female warrior, dungeon, muscular female, solo, highleg panties, large hips, blue eyes, short red hair, white hair, annoyed, looking away, sweat, sitting on wooden chair, multicolored hair, tavern, spread legs
>>103096613I just Florence'd your pic, and needed plain text instead of tags anyway.
>>103095128I can't stop seeing that face in every 1girl prompt, it's all over civitai and even outside of it. Pronounced chin, nose, eyes.
Are loras possible for video models?
https://huggingface.co/tencent/Tencent-Hunyuan-Large>Hunyuan-LargeA larger image model version of Hunyuan??>It's a LLMoh...
I wish I had a 4090
I wish I had a GF
I wish I had a GF with a 5090
I wish i had text-to-reallife monster girl model
tfw no chinese ai researcher gf
>>103096848wish granted
>>103096859嗨,怎么了?
>>103096839>>103096848>>103096852waifubots soon
discovered a new use for my border-adding node already—helps with forcing a perspective of looking at a screen. Without adding the border I kept getting pictures of television sets, no good. With the border I almost always get something closer to what I wanted.
>>103096864ugly whore
>>103096864>slop grantedthe monkey's paw curls..
I wish I had a chinese researcher gf that coded a text-to-reallife monster girl model with a 4090
>>103096877>>103096882
what if they invent waifubots tomorrow and let you gen your own custom waifu using ai but you can only use base flux (no loras)
>>103096925blurjeetas are back on the menu boys
>>103096925they'll sooner charge a monthly subscription
Hunyuan 100B txt2img model waiting room
>>103096925>all waifubots have BUTTCHINS
>>103096925all our waifus would have the same face
>>103096925Are you saying what if Trump wins and anime becomes real?>>103096989>>103096970turn down your cfg>>103096942Error: no subject identified, image could not be read.
aaaaAAAAAAAAAAAAAAAAAAAAHHHHHHHHHHH!!!!!!!!!!!!!!!!!!!
>>103097022Don't want no hamplanets.
but why? took so god damn long to train loras
https://github.com/Vchitect/FasterCache/issues/2damn we just keep getting faster
>>103097075off topic doesn't even make sense wtf
>>103097075>translator note: off-topic means I don't like it
>>103097075that was a really nice ass too
>>103096954what did she just read/see? was it a dank meme?
>>103097107>anon does not enjoy ass
>>103097134jannies are known to be faggots
>>103097093yeah I dont get it either>>103097107dont like this keikaku at all>>103097121used Monica Corgan in the dataset
>>103096410Very nice
>>103097075Janny thought it's a random ass picture
>>103097181>AI is so good janitors cant tell its generated
>>103097125She found out who won the elections
>>103097075>took so god damn long to train lorasmaybe mention this genius. The technology you used on the technology board. I haven't been heavily monitoring the thread but I thought most of these posts were blank.
>>103096524thank you buddy
why coomers are such anxious people? I run this paid commision site and everyday there is this guy who writes me "when I'm gonna upload x image?", I upload that "X" kind of image, and the next day he writes me again asking me when im gonna upload another image?, writes me everyday asking the same question, if he wasnt paying, I would block his ass
>>103097302don't underestimate the power of coom
>>103097302just tell him when you upload it, wtf is your problem?
>click and drag the noodle why does everyone make this so homosexual? call it a wire or a cable not a fucking "noodle"
>>103097360the original docs are written by an actual french canadian homosexual. Get over it.
>>103097321But I do, once I upload said requested type of image, the coomer guy starts asking me again when i'm gonna upload another image, then he asks when im gonna generate a set of images, I say: "i'll upload it X day", and he starts asking if the image is ready, when I'm going to upload it? "when, when, when", its a never ending cycle, other people are just fine but this guy is really annoying.This coomer guy never is satisfied, whats more annoying is that he keeps requesting full body shots with hands and feet included, so its a pain in the ass to gen them, feet are not that of a big deal but hands are really annoying and take time to fix them, I could be generating something else
>>103097394then ban him. Paying customers are a gift. If you don't want the work then reject it.
>>103097394Why don't you do something about then instead of complaining on here
>tfw too anxious to even run a paid commission site
>>103097475I'm just ranting, why would I ban someone whos paying because hes a little annoying>>103097411 >Paying customers are a gift.is right
>>103097492Are the commissions he's asking for some sick shit?
>>103097536futa on shota
>>103097547Ban him
>>103097547guess making commission money with tasteful nudes remains a pipe dream of mine
>>103097580I would be fine with untasteful noods. The commission work seems to be illegal in my country. shota definitely is, deep fake work is very soon behind it.
baker san?
>>103097536not really, just women walking naked but the hands of people casually walking are really hard to fix
>>103097154>Monica Corgannice, never heard of her.
>>103097315oh damn 10/10, i didn't realize we could actually do freckles now, can you share a catbox?
>>103097636It's easy actually
>>103097666sideways yeah but walking towards you it isn't
>>103097666just use controlnet vibes.
>>103097672Need examples of that
Fresh >>103097839>>103097839>>103097839
>>103090449I need more female halo spartans please