Discussion of free and open source text-to-image modelsPreviously baked bread : >>103016063 Tallest Edition>Beginner UIFooocus: https://github.com/lllyasviel/fooocusEasyDiffusion: https://easydiffusion.github.ioMetastable: https://metastable.studio>Advanced UIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgereForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://aitracker.arthttps://huggingface.cohttps://civitai.comhttps://tensor.art/modelshttps://liblib.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3>SD3.5L/Mhttps://huggingface.co/stabilityai/stable-diffusion-3.5-largehttps://replicate.com/stability-ai/stable-diffusion-3.5-largehttps://huggingface.co/stabilityai/stable-diffusion-3.5-mediumhttps://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium>Sanahttps://github.com/NVlabs/Sanahttps://sana-gen.mit.edu>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/fluxDeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest>Maintain thread qualityhttps://rentry.org/debo>Related boards>>>/aco/sdg>>>/aco/aivg>>>/b/degen>>>/c/kdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vt/vtai
Blessed thread of frenship
Last night was a laugh riot, new extra venv, new comfy install with all the trimmings.Could not figure out why it wasn't saving gens into my used-for-a-year nice neat subfolders directory setup....yeah, took me a frustrating hour to figure out. I am an idiot. (it was in the new comfy folder ofc).Mochi doesnt know Hatsune Miku :(
>>103024409>Mochi doesnt know Hatsune Miku :(I think that's the first model in existance that doesn't know Migu, that's impressive lol
>tallest edition ok
ok I'm retarded so excuse the dumb question but is turning images into tensors with rbg values scaled down by 255 a normal thing for apps to do or is it specifically a comfyui thing? or is it somehow necessary for ai image genbecause for writing comfy nodes to do a little image processing it doesn't feel easier
>>103024437I recall you were trying to get the perspective from the feet level but not having success, have you considered them "standing on a glass floor" and describing the camera rising up through it?
>>103024458>have you considered them "standing on a glass floor" and describing the camera rising up through it?I have not considered that but now I will try this, thanks
doesn't really work. It's ok all models struggle with extreme from below or birds eye view shows. I tried to do a couple of prompts or them lying down from above and it was body horror. I have enough good from below shots anyways I think
>>103024661I hope the HD version will be better
>>103024661That's a shame, but models probably dont have much training data on "shots from foot level", but one has to try these things.
what happened to /sdg/, were all the regulars culled?
https://github.com/Jonseed/ComfyUI-Detail-DaemonReally cool
>>103024694anon, this is /ldg/
>>103024431reposted with better decoding settings>>103024431>High definition video of Hatsune Miku skateboarding down a street towards the camera in New YorkThe model needs to have her added.
This was so close to great damn. On another note, I'm getting gens that just say "Failed" and they use one of my credits. This is a new thing they updated the site with, I'm assuming to keep girls from being generated too young and skimpy >>103024668>I hope the HD version will be betterThis is the HD version, I'm using the website>>103024682>That's a shame, but models probably dont have much training data on "shots from foot level", but one has to try these things.Yeah but limitation breeds creativity. Maybe I get get step on me mommy energy in a different way
>>103024722>This is the HD version, I'm using the websiteare we sure this is the HD version, maybe they just upscaled their model, it's not supposed to be at 960x resolution, the HD version will be at 720p, they're probably using the same model as ours except that they don't use tilted VAE because they have more vram, that's why it looks better to them than us
>>103024722>mommyHmm, perhaps the foot crushing something small, like a grape or olive if it's a bar scene at the start of the prompt?
>>103024738>the HD version will be 720pSource? It makes sense that it would be double the resolution of the 480p version, and they can't quite call it full HD/1080p since it's not there yet>>103024739No anon I just meant the energy, I'm not trying to generate crushing porn lol this is for a music video/cyberpunk world buildingBut thank you for engaging and for the suggestion. Id be interested in seeing how well mochi can do crushing content for curiosity's sake
>>103024747>Source?https://www.genmo.ai/blog>Mochi 1 HD will support 720p video generation with enhanced fidelity and even smoother motion, addressing edge cases such as warping in complex scenes.
>>103024751thanks for the source, hopefully the website's model is indeed 480p plus an upscale so I have something to look forward to
>>103024700I like that, really cool node if you feel Flux displays oversimplistic images, I gave you my settings but I'm just getting started, can be definitely improvedhttps://imgsli.com/MzEzOTc4
>>103024722Test run: 50 steps fp8 (went a bit overboard on the juice)>Under the ultraviolet pink and blue glow of holographic advertisements in a futuristic bar, a womans shiny latex boot crushes a grape beneath her boot, spurts of juice, the camera pans up her athletic latex and cybernetically-enhanced body to her smiling faceFrom this 1 sample size, i dont think it understands in the way we do.
>>103024928spurting in general is a challenge for video models
looks like mochi is aggressively banning youthful prompts now and still counting it as a gen, so i guess I'm done for now. I think I have 40ish seconds worth of video to make a proof of concept for my art project at least
flux chuds wonhttps://civitai.com/models/743311/forever-flux-or-andrea-botez-or-the-100-most-beautiful-faces-no8
>>103025148Yes, flux is perfect and this woman is so beautiful that we have to celebrate her lora.We will never find a woman ten times better just by walking down the street.
>>103024250Epic
>>103024694turns out ""people"" acting like attentionwhoring retards in an anonymous imageboard drives said imageboard's userbase away; who would have thought?anyway, when that shithole inevitably dies, do not give the avatartroons a single inch; let them rot in the festering, stool-filled abscess they made for themselves
Prompter beware! You're in for a scare! Be careful with your choice of Halloween costume...
kek
been away for a few months, anything new and exciting happen?Seems like no? some crappy new UI (REforge) and SD 3.5 which looks marginally better
>>103026440>SD 3.5 which looks marginally betterhttps://www.youtube.com/watch?v=OJy6bJ_RxXg
>>103026440>been away for a few months, anything new and exciting happen?Not much, I feel it'll take years to surpass Flux dev, that's how good this model is
>>103026440SD 3.5M is an upgrade to SDXL and it's definitely much faster to train, at least twice as fast just at a raw efficiency level, 1024px images can be trained at batch 2, 4 seconds per batch on a 4090 (no cache). You can expect real porn models on it soon.
>>103026523i don't believe it, the chink video maker for one is something that should get an equivalent soon>>103026525not really interested in training my own models but cool to hear
>>103026564>the chink video maker for one is something that should get an equivalent soonyou talk about the Mochi guys?
>>103026583nay I meant thishttps://hailuoai.video/
>>103026601what makes you believe they'll make a local image model? MiniMax is API only
>>103026150my wife
>look at llm fags (who im totally not one of btw), they think about it like ABC while we, img fags, think about it like XYZ (which is vastly inferior). if only you I mean we thought about it like they do, we would be in a much better spot, because we I mean llm fags are just so much better you know?
>>103026817kek
>>103026969this, unironically
>yeah so what if LLMs can be used anywhere that text is used>why isn't Meta training a 70B porn image model?
>>103027463I wanna make sex with chinamen
>>103027479he's obviously japanese, retard
>>103027502Ni hao my nigger
For training SD 3.5M seems like 4e-6 is the sweet spot
>>103028045post test results?
>>103028064I'm still training, but I was mostly observing the rate of change between different learning rates. 1e-6 is way too slow, 1e-5 is way too rapid and will make monstrosities.
Honestly already bored of video models, looking forward to interactive world models.
skeet
>>103028082good luck anon :3
Is flux hard to set up to run at home? I'm a civitai proompter but flux is too expensive to run there so I thought I'd try some gens on my rtx2070.
>>103025782Desu appropriate
>>103028601>rtx2070that's a 8gb vram card, you can run flux if you go for a Q4 quant I guess guess
Midjourney-Niji il really a special model, so far no one even got close to it in terms of making an anime image that doesn't look like AI slop
>>103028601You'll have more fun with Pony/SDXL on Forge UI. Flux is a tad bit too slow on 8vram, but it's doable.
>>103028744skill issue
Man, even after huge new model release, sd /g/ threads are fucking dead. Guess SD really is only good for coom...
>>103028744ngl, based on this comparison they really have something good going on behind the scenes
>>103028766>hugedebatable
>>103028766Always been. Quite some time ago I used to look it up just to check up on how bad the situation is, but even that's gotten boring.
>>103028769>they really have something good going on behind the scenesthey just train their models with every good anime images they can find, the rest of the field go for """ethical datasets""", MJ is like the chinks, they know that to get a good model you need good images, and they don't give a fuck about the artists fee fees, as it should
>>103028769The secret is they used screenshots from real anime.
>>103028797>>103028793i don't think training quality foundational models is as easy as "just pick good images bro"
>>103028813it is though, we're talking about millions of good quality pictures, if you train a model well, it'l be good, if you train your model with slop, it'll be good at slop, garbage in, garbage out
>>103028813Actually it is. If you do fps=1 screenshots of every anime, threw them into any VLM (ideally with the name of the movie in the prompt) and then finetuned it you'd be shocked how good the end images look.
>>103028839imagine you train a model with every single frames of every animes that exist, how many frames would that be? I wonder how many images Flux can eat before saturation
>>103028744Pixelwave looks betterthat man in the MJ looks horrible, look at hands, legs, etc, if you added noise and lower the contrast in the pixelwave it will look same as the MJ one, don't lie to yourself anon
>>103028793I'm not sure that's entirely the case. Based on this one comparison it seems to be more coherent, it could be better trained parameters, or it could be more of them. Whatever they're doing is doing a similar jump to that of comprehensible text, except here it's about composition.I'm often testing new and niche models/mixes for Pony, and even there I'm seeing huge jumps in this "compositional coherency", so it might be a matter of better trained parameters indeed. For example I'm seeing way better compositions with something like Hadrian Delice Stylized. Other models and mixes tend to be just a different style flavour of the same base composition you'd see in something like Autismmix, meaning their fine-tuning mostly touches just the surface and doesn't influence it's internal logic or understanding of pixel context.
>>103028871It's a lot of frames (24 FPS * 3600 is 86400 images per hour) and likely the model would get a lot smarter because it'll be exposed to a lot of spatially changed similar images. Videos are a goldmine for extremely high quality and high volume images.
>>103028875let's not pretend that the MJ niji style isn't appealing, would be disingenuous to do something like thathttps://civitai.com/search/models?sortBy=models_v9&query=niji
>>103028912meme model overhyped by social media
>>103028904>86400 images per hour>86400 images per 3 episodes>let's say a regular anime has 3 seasons + 24 episodes >2073600 images per anime>there are roughly 12000 animes that existed>24883200000 images total (24.8 billions images)Holy fuck kek
>>103028170can i skeet on her?
>>103028941it's not overhyped, it's legit good, I love MJ niji style, it looks like real anime drawings
>>103028970Throw 20,000 high quality anime images at SD 3.5M and you'll get the same model.
>>103028970What's the purpose of your posting? Do you think you'll get any answers that are not "just train a lora, skill issue" here?
>>103029002He's the resident MJ shill.
>>103028970the image on the left is very bad, just lower the contrast of your gens and add some blurry filter, some noise and you will get a similar aesthetic
I'm the resident Pony shill, just not vanilla and most finetunes/mixes. It's a rough gem alright, just needs some elbow inpaint grease.
>>103029009What's wrong with saying that MJ is good? You can enjoy local and at the same time wishing it could be as good as its API rivals, that's not possible?
>>103029029Because you're actually a bad faith concern troll.
>>103028952They don't use all frames because there a high chance that the frame x and frame x+1 are too similar. They program checks to see if there is enough variation before getting the frame.
>>103029009I don't even disagree with him, but this is literally /local/ general. Whining won't get you a better local model.
bullshit ass fuck dumb cunt journey, more like
>>103029050The problem is a lot of bullshit he says is subjective opinion about what good anime looks like. It's just a flavor of ice cream, not some impossible standard.
>>103029023Vanilla Pony is somehow still the king of proompting.
>>103029062>It's just a flavor of ice cream, not some impossible standard.Bullshit
>>103029068Never managed to tardwrangle vanilla, though I've had issues with vanilla autism too. DPO did the trick for me, but at this point I can sometimes find a finetune or mix that on average does a better job than either of the two.
>>103029079There is nothing special in those images that Flux for example can't do with a Lora. Don't confuse aesthetic flair with technicals.
>proompting
>>103029089>There is nothing special in those imagesthere is
>>103029062Honestly, I think local was extremely behind with anime up until Illustrious and Noobxl and if you think Pony was nearly as good as NAI or Niji, you're pretty much coping.
>>103028959yes
>>103029096No one said Pony was on part with MJ because we all know that MJ is at least a 3B model and SDXL is a literal piece of shit.
>>103029090Anon's got a point, it's a coomer infested environment.
>>103029106>we all know that MJ is at least a 3B model and SDXL is a literal piece of shit.SDXL is a 2.7b model though, it's not that small
>>103029106Pony's issue wasn't SDXL though. SDXL is crap but it can do much better than Pony.
>>103029114It's almost like parameters aren't created equal.
>>103029106>we all know that MJ is at least a 3B model>>103029121>It's almost like parameters aren't created equal.so parameters matters or not?
>>103028871I worked it out according chatgpt earlier this year, it estimated 60,600 hours of anime has been produced.
>>103029114If that's the case, the parameter count could make for a huge difference in coherence/complexity, even if it's arguably not that big a size difference. I've seen the jumps between smaller LLMs, so I wouldn't be surprised if the same applied here.
>>103029132MJ is a modern architecture 3B model. SDXL is a shitty unet CLIP model.
>>103028656>>103028748I guess I will give it a shot.
>>103029140>MJ is a modern architecture 3B model. SDXL is a shitty unet CLIP model.Also noteworthy. SD had an awful background in terms of encoders, I've seen Laion and it was a fucking mess, so I wouldn't be surprised if CLIP wasn't much better either.>>103029155If you go for Forge, linkrel might be the simplest to run:https://civitai.com/models/638187/flux1-dev-hyper-nf4-flux1-dev-bnb-nf4-flux1-schnell-bnb-nf4You pretty much just paste it into the model folder, switch to Flux mode up top and off you gen at roughly 1 image a minute.
>>103029140>MJ is a modern architecture 3B model. SDXL is a shitty unet CLIP model.Idk man, Midjourney Niji v6 was released in december 2023, it's probably a unet model, DiT wasn't a thing back then
How much have they really proven to us about MJ? Can't we only guess at it's innerworkings? It's not like they've released any papers or anything, right?
Speaking of DiT, anyone bother to share some insight on it? I know nothing, other than it having been used back in Hunyuan.
>>103029179Pixart Alpha used DiT and it released its paper Sept 2023. Maybe you're just ignorant and a retard. It's okay not to talk when you have no clue what you're talking about.
>>103029208Ok so you're a 2 digit IQ retard, I said that DiT wasn't a thing, meaning that it wasn't mainstream, your anecdotical Pixal Alpha release don't change that fact, that field was dominated by unet in 2023, whether your monkey brain likes it or not
anyone else just typing random things, quotes or movie titles so see what they get?>long live the new flesh
>>103029231Sometimes I prompt with song lyrics I'm currently listening to, though mostly a random bunch of it, or the most catchy ones.
>>103029231Song lyrics and bible verses have been and will always be kang
>>103029200It's just a Transformers layers model. It's basically a stack of Attention and Cross Attention layers that process a stream of tokens that represent parts of the image and the caption.
>>103028744The magic of Midjourney is that they have an LLM to refine your prompt to get the best possible image with their model, meanwhile for SD and Flux it takes exactly what you put in
>>103029191illustrious and noob show us that properly training your model with a good dataset goes a long way, that and maybe they have some external methods to help with details like that detail daemon thingy for flux
>>103029263Kudos.
>>103029224DiT was originally published in 2022. I know you're a fucking faggot, but here's a clue: often multiple teams can work on similar ideas. I know you're desperate to be right, but you're wrong. Also three months is enough time to train a model from scratch if they saw the Pixart paper and said WOW THAT IS GOOD.USE YOUR BRAIN.
>I've seen things you people wouldn't believe. Attack ships on fire off the shoulder of Orion. I watched C-beams glitter in the dark near the Tannhäuser Gate. All those moments will be lost in time, like tears in rain. Time to die.
>>103029264>The magic of Midjourney is that they have an LLM to refine your prompt to get the best possible image with their modelis it? I often do some boomer prompting with chatgpt, and it doesn't suddently makes the image more aesthetic, it just helps to make the model follow my initial prompts better
At this rate I'm getting popcron, it's kino /ldg/ banter hours.
COZY thread
back in the pixart days i'd use the banter as prompts to get the most kino outputs
>>103029283>I know you're desperate to be right, but you're wrong.How can I be wrong? You and I have no idea what architecture was used on MJ, you're just sperging on assumptions, not facts, that's expected from a 2 digit IQ monkey though so not that surprising.
>>103029304I am literally correcting your literal ass backwards assertion that DiT didn't exist in Dec 2023. Kill yourself.
pixart sexuals live on in eternity
>>103029317>your literal ass backwards assertion that DiT didn't exist in Dec 2023not my fault if you can't proprely read english, I already explained that saying that "DiT wasn't a thing" meant "not mainstream">>103029224>I said that DiT wasn't a thing, meaning that it wasn't mainstreamAnd yet you continue to pretend that I meant "it doesn't exist", how much of a retard are you?
>>103029014good image
>It Doesn’t Feel Pity, Or Remorse, Or Fear, And It Absolutely Will Not Stop, Ever, Until You Are Dead!kek wtf
>>103029341me begging for anon to post more gens
>>103029339I know you only do things if they're mainstream but someone running an AI company might be a little ahead of the curve. I don't even know why I talk to you, I'm pretty sure you're debo, he always had ass backwards opinions.
>>103029351>I'm pretty sure you're debo, he always had ass backwards opinions.looooool, I thought you were debo too, so you're just someone as retarded as him, god, why did you create a clone of that retard in this place...
>>103029341what i imagine the average flux 1girl poster to look like
>>103029281Does she come with a self-cleaning hole?
From now on /niji/is
>to the last I grapple with thee; from hell's heart I stab at thee; for hate's sake I spit my last breath at thee.
>>103028744regarding pixelwave + loras:https://civitai.com/articles/8505
>>103024700>(but also works with SDXL, SD1.5, and likely other models). very interesting ty anon
>>103029290That's because ChatGPT doesn't know the best prompting method for Flux or SD. It doesn't know how the images were captioned, it doesn't know the training data, it doesn't know anything about the model. Meanwhile midjourney trains their own LLM with exactly the same data their image model is trained so it knows everything about the model and how to prompt it to get the best results, the image model and the LLM work in tandem.
>>103029383what else do you put in your filter?
>>103029495If I told you you would try to circumvent it. But over time there are certain key phrases that guarantee it's a low IQ phoneposter.
>>103029377of course
>All is forgotten in the stone halls of the dead. These are the rooms of ruin where the spiders spin and the great circuits fall quiet, one by one...
>>103029384would've been better if it was a love-hate relationshop where they caress eachother's cheek with one hand, and hold weapons in the other
>We may ask what is relevant but anything beyond that is dangerous. He is a liar. The demon is a liar. He will lie to confuse us. But he will mix lies with the truth to attack us. The attack is psycological, Damien, and powerful.
>>103029440fair enough, I'm sure there's some models where we know what caption models they used, I don't remember which one though
>A picture of the entity that lives within the dataset of flux
>>103024977what's with all these grandmothers ITT
>>103029752The buttchin entity
>>103029902Looks like the Brazil flag rotated. It even this white streak going across the blue circle.
>>103029947kek
tried to generate Neuro sama just by describing her with a wall of text,SD 3.5 large model
>A picture of the entity that lives within the dataset of this pony finetuneshould've expected it
>>103029440>>103029264>>103028119would you box one or more of these up, please?Kind anons, I'm looking for box info on picrel. I saved it nov 22, 2023. I need the box info to gen the same character, but a mouse.
>>103029948yeah, kind of does look like it a litte.
>>103030070sovl
>A picture of the entity that lives within the dataset of this SDXL finetunea bit better
>>103030112nice
see you on the other side
And this is made with flux, same textwall
>>103030259Godspeed
>A picture of the entity that lives within the dataset of this different SDXL finetune
And this one with the flux oil painting lora
>>103030333horrifying artstyle, feels like the corporate flavour of aigen, devoid of soul, as if we didn't have to argue about it being capable of sovl to begin with
>>103030357I think the intention is to be creepy
>>103029947The Pixelwave 3 finetune has pretty much fixed the buttchin problem>>103030068https://files.catbox.moe/j61xif.png
>>103029936>what's with all these grandmothers ITTRobots cannot reproduce and therefore cannot be grandmothers
>>103030386Instead it's just generic AI Flux face.
>>103030476nice
>>103030480Nah, it's actually pretty good at face variety as well. It basically fixes all the problems flux had.
>>103030386>>103030511You can tell it's yearning to give her a buttchin desu
>>103030511Generic AI Face #2Are you going to give me an actually different face?
>>103030561There you go, enjoy
>>103029974"Heart!", "well done">>103030261>>103030333"filtered"
>>103030690nice
>>103030509thx
>>103030614Face #2 with black skin toneAN ACTUALLY DIFFERENT FACE, PLEASE
>>103030757I think you might have autism it's clearly different faces.
>>103030768You must have autism because they're clearly not.
>>103030690sovl
>>103030776A telltale sign of autism not being able to perceive social aspects like facial recondition, to anyone normal they can see the variety and differences, for you they somehow look the same. I think you need to get tested or something, my condolences.
>>103030784what is this kino model?
its midjourney. dont worry, sd5 will catch up
>>103030776>>103030802>autists arguing over who's more autistic
>>103030802I knew that a 1girl slop poster is face blind. It's actually quite sad that you can't tell they have the same skull shape.
holy fuck why is Verus Vision so fucking SLOW
>>103030813Again, you lack the social awareness to properly differentiate faces, for your sake get tested and see how severely you rank on the spectrum.
>>103030843>Says the retard autistically posting 1girl, standing images each of which look like they're 90% related.
>>103030839Trained from dedistilled
>>103030860>>103030843I think you two should kiss already
Let me call my buddy who's an expert on this thread.
>>103030511did it fix the flux loicense problem?
>>103030860Are you actually retarded? I was posting cars, I did a single post to show that the pixelwave model fixed the buttchin problem. You can't even make a coherent and consistent argument.
w00t I got it workingsoon I will be able to make many variations on starter pics for img2img. much to do still however.
Is there a way to get flux to gen adult women with flat chest? All my attempts have failed so far
>>103030757Here you go.
>>103031016nice
>>103030920if only I had a beard this glorious
>>103031024SD 3.5M scratches my Pixart itch. Just waiting for Sana now to do the true training head to head.
>>103031040good luck
Ok /g/ I’ve been schlubbing it with Pony Realism and the OG YAPM for anime grills. What newer models should I run to blow my mind for realistic and anime gens? I saw something about Flux, how is that? The marketing sheet looks just like Pony’s did back when it came out.
>>103031024I like the parking garage, esp. the hint of a view and the blue sky
>>103031266It's a nice setting.
>>103031399sovl, if it didn't stink so much of incase (dw i actually like their artstyle & comics)
This thread has very good gens
>>103030933what kinds of variations do you mean? cool node tho
>>103031714Is that layerdiffusion or something new?
Has anyone managed to successfully train a lora on sd3.5? I'm using kohya with the 3.5 branch. My results are garbage, shitty noisy details, body horror, everything looks like melted wax. Basically the model gets fried, not in an oversaturated sense, but in a "it just looks like shit" sense.Doesn't matter if it's large or medium. Learning rate anywhere from 1e-4 down to 1e-5. max_grad_norm at the default of 1, or 0.01. Different optimizers. Leaving the prediction target at the default, or changing it to match AI toolkit / SimpleTuner (which is the same as flux). Fucking garbage results no matter what.Meanwhile flux literally just werks, pretty much no matter what hyperparameters I choose. It always looks at least decent. The base SD3.5 model looks good, especially large, so obviously stability trained it *somehow*. Maybe there's some weird training detail everyone is missing that's required for it to work.
>>103031130NoobAI-XL
>>103031801Ask the anon who was here earlier that posted >>103028045 and >>103028082. Seems like your learning rate is too fast but your results are the opposite of his.
>>103031801>>103032165It's possible Lora training is broken or your learning rate is too high and you're blowing up the weights which is normal for body horror and blown out colors.
>>103024508wait a second...
>>103031801Literally in the same thread:>>103028045>>103028082
>>103025148>100 most beautiful facesI've seen prettier women on the fucking bus, who the fuck rates these women? 70yo pajeets?
>>103032476For the record I'm doing a full finetune, not a Lora
>>103032409wait for what?
>>103028970by real anime do you mean basic bitch studio ghibli style? go gen some more styles and come back with results
Thoughts on Pinokio?>anybody even using it?
>>103032599That pistol is way too perfect, what the fuck
>>103028744>not even generating the right anime
>>103032626its Kino isnt it?
>he fell for the MJ b8 again
>>103031016vased
>>103031586sick>>103031219v nice>>103030892oooooooo
>>103031569I just mean I have the framework in place now to do more than just adding borders. Certainly also gonna do adding black bars at top and bottom of the image, maybe also flipping and cropping and such. The most important thing though was being able to get the image framed in some way. E.g. the black bars coupled with an appropriate image resolution will reliably tell the model to expect something that looks like a film still. And images framed with white tend to turn out a bit more 'artistic'
>>103033382neat
>>103024722remember all the posts of people saying we wouldn't have local video generation capable anything better than the mess that is animatediff? I said we would have something by the end of the year, I was right. So next time some no gen fag comes into one of these threads puking verbal diarrhea out of their mouths you can simply ignore them. They are elitist glowie shills that do not wish for us to have nice things. I predict within a few months we willd have something really worth talking about that is more efficient and runs on lower hardware. I honestly think that the key is how 3D games animate things, only with photo realistic textures exact to real life, no inference needed for generating frames, the animation data is already there, the model would just need to adhere to our prompts.
>>103033502Designers needed for ongoing meme wars.
>>103033184Me on the left
>>103031586I'm liking this anon. gives me inspiration, I too enjoy genning military things, but my gens are never this accurate, this one you done is creepy realistic.
So this is red_panda...https://www.recraft.ai
>>103033726I wish someone just would hack them with a 0day like some anon did with NAI
>>103033726better demo link https://replicate.com/recraft-ai/recraft-v3
>>103033760nevermind you have to pay to use it on replicate kek
>>103033536In my country I can goto jail for those images.
I don't know what to gen today, the models are pissing me off also.
>>103033552It's really just Pixelwave 3, no lora's or anything special. The finetune model is just that good.
>>103033956i hate flux its slow.
>prompt “stroking a cucumber with both hands”>this is the best it comes up with
>>103033903Kek, generating text with pony test, it could work. depth map was used for controlnet.
>>103033956shnell 3 or dev 3?
>>103033910In my country I can email them to your politicians, as long as it's not a threat.
Are there any good recommendations for an SDXL or pony model that isn't just trained on 1girl images? Because You know after a few month you kind of get bored of genning 1girl... All the models i have sitting on my drive trend to put same face 1girl where ever they can...
>>103034098In my country they make up the rules as they go along without public approval, they also believe they can just extradite people like you anon for sending them problematic images. You can also go to prison for calling a Islamic extremist that killed 3 little girls an Islamic extremist. Get gas lit to fuck by the media for months when they knew for a fact he was the exact thing you called him.
>>103034120there isnt one. people act like local is this bustling scene of dynamic finetunes each with their own purpose but the blackpilled reality is that it's a bunch of shitty civitai jeetmixes of the same models over and over again and there really isn't anything unique or gamechanging when it comes to finetunes. more of the same. you can count the number of actual finetunes on one hand
>>103034156or maybe we don't release every finetune because the scene is full of faggots like you
he who only sees muck is himself muck
>>103034156I know anon, and its depressing, we will be forever stuck with just genning 1girl till our dicks fall off.
>>103034173stop being poor and train your own modeloh wait
>>103034173maybe i will just take this pepe and use ipadaptor and do something with him.
>>103034086dev 3, specifically the bf16 version. You'll need a 3090 or 4090 for that specific version. You can try the smaller quants though, I hear they're good too.>>103033991It's the tradeoff for the quality and capabilities
>>103034206Downloading, will try it on my rx 6950xt. Wish me luck.
>>103034206i got 3060 it works fine, just slow and annoying. I got various flux models, i fucking hate how slow it is when ever I change the prompt. Changing seed only its not so slow, its only when clip needs updating i guess.
>>103034206ignore me though anony i just feel fucking deflated today, have been feeling like this for 3 days, don't know whats up with me, sleeping lots also. perhaps i'm run down or coming down with some flu virus (I hope not)
now youtube is like"Are you still watch this video?" to some background music I was listening to after only 5 minutes GOD WHY? WHY? WHAT DO THEY CARE?God please just burn it all with fucking fire
lmao oops
>>103033910>>103034098>In my country I can email them to your politicians, as long as it's not a threat.Yeh same>>103034150But would they kill you for merely downloading untraceable social apps such as SimpleX?
>>103034256i messed with some gore loras one day and holy shit, bodies chopped in half the lot, no i don't think I have the loras and no i wont tell you where to find them. The stuff would be way to bad to post here even, but nothing that Hollywood never produced. Just people get triggered when its AI generated by one mad man on his home computer...
>>103034259yeah but the quality of those models are all over the place most of the time I download and try at least, plus they seem censored, mostly created by idiots imo. I might get into tuning soon, i'd have to do it on a remote cloud based system though.
I kind of want to train models for buildings of various cities around the world using my own camera. I guess you would call that a data set. I just need to buy the camera first.
>>103034215Good luck anon, not sure how AMD GPUs will work out but with 16GB VRAM you should be in a pretty solid spot to use the 8fp version if you don't want the model to overflow into RAM>>103034227Yea, the CLIP issue is even a thing on a single 4090 or 3090. Flux with everything loaded is around 29 GB so generally the CLIP model overflows into RAM. I have a 4090 and 3090, load the flux model on the 4090 and the clip model on the 3090. I can pump these out pretty quickly since it removes most of the bottlenecks with flux and allows me to generate 9 of these images at a time in this resolution and takes a little over a minute. I tried doing it just on the 4090 without clip offloading and it was much slower.>>103034241hope you feel better soon
>>103034321I do also think there is a market in this also, just gonna put that out there for any anons looking for a way to profit from it.
>>103034120>after a few month you kind of get bored of genning 1girltry a new style
>>103034346I'm actually thinking about getting a drawing pad and writing some hook to gimp into comfyui, ipadator is extremely powerful. AI assisted art would be cool.
I'm just working on workflow design strategies desu, I'm sick of things ending up in a confusing mess.
>>103034440nice, can you do a broken mirror reflecting a face?
>>103034452face detector fail? I had a similar problem the other day with reactor node, it was driving me mad...
>>103034719>failLooks like a win to me
>>1030342273060's faster than my 6950xt at image geniirc
>>103034740hmm if you say so anon, reactor node what ever its called is broken on my machine, keeps complaining about corrupt models but they have been redownloaded and hash checked many times. Only the face swap model works, so I don't even other with that node as I think some shady shit is going on with models being altered, they are pickel and not safe. I use some other more easy nodes for faceswapping and shit, had some funny fun with those.
https://x.com/recraftai/status/1851706399631224939red_panda turned out to be another closed source model, sigh...
>>103034788>another stock photo modelWho cares
>>103033757imagine
>>103034794that's the thing, I don't know why it's ranked so high, do people really prefer stock photo models over something actually good like Midjourney or dalle?
>>103034805Yes, because people are actually tasteless retards. That's why we keep getting bokeh = good models.
Throwing random shit art done in gimp into ipdaptor to see results
>>103035017think it needs more ksamplers
powerful
>>103034453Cool, give me an example prompt from one of yours, let's see what I get with cfg 1 in dedistilled.
that feel when the drugs are wearing off at 5AM during a summer morning. WTF just happened and why am i in this pond?
>>103035191It reminds me of being up way too late reading my Bible, then I wake up dreaming in a cold sweat about something crazy.
>>103024700not sure if i like it with sd1.5
>>103035111All my prompts are simple, that one specifically is "photo of 1990s special forces during desert storm".
Baker...
Hold....
>>103035497First of all, thanks for your review. Glad I downloaded Pixelwave. It really adheres to prompts better than Flux.>photo of 1990s special forces during desert storm Doing it with some anti-detail with dedistilled cfg 1. 30 steps, euler, beta.This is the result of the simple prompt. I asked a llama LLM to juice up my prompt:>Capture the gritty intensity of a 1990s special operations forces unit during Operation Desert Storm in a photograph. The image should convey the harsh conditions and rugged terrain of the desert landscape, with the soldiers' worn and weathered uniforms and equipment a testament to their unwavering dedication and resilience. Incorporate subtle hints of the era's distinctive military gear, such as PASGT helmets, ALICE packs, and M16A2 rifles, while also emphasizing the soldiers' unyielding focus and camaraderie in the face of adversity. Consider a dramatic, golden-hour lighting scheme to evoke the sense of a moment frozen in time, with the desert sun casting long shadows and accentuating the textures of the soldiers' gear and the surrounding environment.I'll show this next.>>103024700>Detail Daemon(demon?)Anyway, how does it differ from using LyingSigmaSampler? You can chain LyingSigmaSamplers (if you are careful you can therefore set the lie amount for each step). I am anti-detailing.
>>103035607>Anyway, how does it differ from using LyingSigmaSampler?you have more options on how to change the sigmas
>>103035607
>>103035607>>103035640something important with pixelwave 3, make sure to use dpmpp_2m for the sampler and sgm_uniform for the scheduler, do at least 25 steps
Fresh>>103035679>>103035679>>103035679