Discussion of free and open source text-to-image modelsPrevious /ldg/ bread : >>101715949>Beginner UIEasyDiffusion: https://easydiffusion.github.ioFooocus: https://github.com/lllyasviel/fooocusMetastable: https://metastable.studio>Advanced UIAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://civitai.comhttps://huggingface.cohttps://aitracker.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/derrian-distro/LoRA_Easy_Training_Scripts>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/flux>Pixart Sigma & Hunyuan DIThttps://huggingface.co/spaces/PixArt-alpha/PixArt-Sigmahttps://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiThttps://huggingface.co/comfyanonymous/hunyuan_dit_comfyuiNodes: https://github.com/city96/ComfyUI_ExtraModels>Kolorshttps://gokaygokay-kolors.hf.spaceNodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper>AuraFlowhttps://fal.ai/models/fal-ai/aura-flowhttps://huggingface.co/fal/AuraFlows>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>GPU performancehttps://vladmandic.github.io/sd-extension-system-info/pages/benchmark.htmlhttps://docs.getgrist.com/3mjouqRSdkBY/sdperformance>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-restsd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium>Related boards>>>/h/hdg>>>/e/edg>>>/d/ddg>>>/b/degen>>>/vt/vtai>>>/aco/sdg>>>/trash/sdg
>>101720035the onahole looks washed out and deep fried
>>101720035
>>101720100Crispy like it should be
Blessed thread of frenship
Bigma status?
>>101721012September release
bigma balls lmao
I love being able to prompt scenes where it looks like something is happening rather than a picture of a person standing somewhere. It's still a lot of work to get the AI to understand the prompt but so much more is possible.
I noticed whenever I prompt for a stark white background + an object with Flux dev, the result is blurry or out of focus. Pic related, left is with "stark white background" and right is without. Have tried with a few prompts Increasing the steps doesn't seem to help but once I take the background out of the prompt, the result is pretty much perfect. My usecase is making icons from the gens or 3D-models with Stable Fast 3D, both needing a transparent background.
>>101717687Where's the Miku in historical paintings?
>>101721273have you tried (focused) or (in focus) with prompt?
>>101721298The blurry result was with "sharp focus" in the prompt. Also tried with your suggestion with the parenthesis for emphasis but no dice, still blurry.
>>101721360I guess it gets easier once negative prompt works
What context size flux has in the prompt?
>>101718764Nta but bumping the question. My rtx 2080 is too slow to do autistic comparisons of every combination with various different prompts, currently just using euler/normal with 2.5 guidance. I did try deis with ddim uniform like >>101719696 said but the results look oversharpened and crispy
>>101721396since there is no documentation it ain't known, but you can guess the CLIP part is still 75 tokens, but the txxl5+ will be at least 256+ tokens
official pixart bigma and hunyuan finetune waiting room
>>101721396512 for dev, 256 for schnell
i drew this (with my finger in artrage). can models do this?
>>101721699Is this Loss?
Retard here, is it over for VRAMlets for real now or is there a chance of a slimmed down Flux variant that generates at a lower resolution or something but is still a significant improvement over XL/Pony?
>>101721749flux runs in 8gb
>>101721749Nigger, wait and there will be better small models, the original LDM model was colossal and a piece of crap compare to models now
looks like offensive posters are back on the menu boys
>>101721765kek, it runs on cpu if you are that brave, you need at least 12 GB to run the fp8 properly
>>101721860Nice
>>101721711Good thinking, I think our sermon did center around loss! Nehemiah.
>>101721960>religious>terrible finger paintingsanon, are you 12 or retarded?
>>101721938
>>101722004>SafetyLibtards are really going to town with this word. How long until it turns into a meaningless buzzword and we can no longer describe the thing that tells you not to stick your hand in a lathe?
>>101722033that's the idea
Wait what’s the latest
>>101722072flux is trainable but no releases. The guy who said it's not trainable now declares it trainable after seeing what Ostris did. Thanks Ostris. You da real mvp
>>101722122can I use up all my saved buzz to train a lora on civits hardware
>>101722137Not yet
>>101722122any set of weights are trainable, but does it train good?
>>101722033I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY! I HATE SAFETY!
>>101722230generated for BBC
>>101721997God is obviously real and even if he isn't Dawkins is right that you have to pretend he is
>>101722257you gotta do 2girls1cup now
>>101722263my dick is obviously real, will you worship it?
>>101722270Depends if it's worthy. BBC?
>>101722249good movie, watched it to many times tho
>>101722278AWC
>>101722279yeah I'm burned out on it too
>>101720054let me goblinfuck her
>>101721997I wasn't asking for your opinion of my fine 10/10 art, but if ai gan genwrate similar. I will try, but I don't know what to proompt
>>101722263God has to pretend he might save Dawkins from hell. his burden is greater.
>>101722302>rotted pussywould need to use a sword to cut it open again before fucking her
>>101721194Definitely!>>101721273"Gradient background" has been good to me. But if you're looking for a solid background to remove in post then try "no background">>101721546Catbox please?
>>101722264for me, ai is about walling out the world, bot bringing in the filchWeird how Myst is real life, just almost.
too much /pol/ here
>>101722408I wonder why. Oh yeah.
>>101722429c-cute
>>101722264
>>101722355>Catbox please?sure>https://files.catbox.moe/cn8jx5.pngshe started a peaceful career as keyboarder now tho
>>101722421the amerimutts are awake and ruining ai
fun
Flux would be more fun if it knew words like "downblouse"
>>101722550laughing like a spastic at thisI'm very ashamed
>>101722504>Save Europeans from communism, atheism and degeneracy>Now they want to be marxists, atheists and degenerates and invite muslims and blacks to rape their wives and integrate them with black babies Enjoy your atheist shithole while you jerk off to being blacked
>>101722550>>101722562Same, I kek'd hard
>>101722550>>101722562>>101722570NOOO THIS IS UNSAFE
>>101722550lol'd hard
>>101722662what a weird room
>>101722674man with itchy asshole lives there
How do I make a little script to run the starter thing? It's kinda annoying to pyenv shell 3.10.6 then start.py and turn the pyenv shell back to system when i'm done.
>>101720035>thread picTourist here: what happened?>>101720085Nice kitty.
i stopped following the technical details of generation for a year and now I have no fucking clue what any of this shit is
>>101722747better bigger model from ex stable diffusion makers (now black forest labs) was released that btfo SAI and stable diffusion, also SAI is failing as corporation as we talk
hmm, yes, flux is good model
what's a presidential podium called, the thing where the prez gives press conferences and stuff to the media. It's not the oval office, I think?
>>101722830>podium
>>101722830speaking podium, lectern?
>>101722830white house press room podium
>>101722828fucking lol
flux can produce some cool atmospheric shit when you push it in the right direction
>>101722872thanks fella
Does the huggingface link for flux from the OP work with mobile? I want a friend to try but I'm on desktop & can't test it
>>101722913no u
>>101722926If you open chrome developer tools you can resize your screen to be a phone
Anyone else experience this weird grain? Using euler + normal scheduler, maybe it because of 2.5 guidance
>>101722926I grabbed my phone and tested it. It works!
>>101722981>maybe it because of 2.5 guidancyes it is
>>101720035Can it generate moonman though?
>>101722909Bur can it generate old people enjoying themselves in the sauna? https://files.catbox.moe/6mhfrj.jpeg
>>101722954That might change the page behavior, yes, but there are hardware, os, and other software considerations (and my user-agent would still be desktop if it matters)>>101722983Thanks, you are awesome!
>>101723011aslong their belly is hiding their genitalia like in that case, sure I guess
>>101720035what's about this flux I'm hearing you guys talking about, since it's here is local, I saw talks that 8gb vramlets can run it, but can it do nsfw? can it do 1girl? 1children?
>>101723033It's a gradio interface dude get over yourself
>>101723049>8gb vramletsif you got half a day for a gen yes>>101723049>but can it do nsfw?no, soft erotica to a certain point, no genitalia, no hardcore>>101723049>1girl?meds>>101723049>1children?kys
>>101722500Very nice, thank you!
>>101722230There are kids jerking off to this.
>>101723078thanks for the reply kind anon, now I am informed, and will keep the last two recommendations in mind.
>>101722981more honkong kino?
>>101723100>>101723216cryptominers taking proof-of-work pics
>>101723078>no, soft erotica to a certain point, no genitalia, no hardcoreGuess I'll wait for a fine-tune then.
>>101722981>>101722998Yeah, pumping guidance to 3 clears the image a lot, but the soul is kinda lost. Guy's now a half poo or something.>>101723219
Its crazy how good flux is.so how did they do it? whats their secret?Also some other questions I have:>will there be Loras and checkpoints for this too?>any good tips for the comfyUI workflow?
not sure what this means, but I'm willing to look it up and try ithttps://x.com/cubiq/status/1819680923815649304
>>101723337
>>101720035Just got a new GPU with 16gb VRAM, is native training possible? I don't like the results I get with LORA at all.
>>101723335guidance is literally the slpo slider on flux because these crappy corpo models are always smeared in shit (read: fine-tuned with "aesthetic" data) to sell to customers
>>101723335>3.5 suddenly chadmaxxes the protaglmao>>101723370We live in a society
well at least there are 5 toes
>>101723411bootlegs.
>>101723358
>>101723462
>>101723201kek
>>101723415Skill issue.
>>101723201howd you get it to look like an actual stick figure and not >>101723525
>>101723543I dunno.A stick figure with a sad expression holding up a giant spoon above his head and saying "My spoon's too big" in a speech bubble.
A stick figure with a sad expression holding up a giant spoon above his head and saying "My spoon's too big" in a speech bubble.
>>101723512
>>101723579That's a big hand
>>101723589all the better to... never mind
Can I generate female nudity with Flux? Otherwise I won't even waste my time.
>>101723633Yes, but the nipples appear messed up most of the time
>>101723568
>>101723633--> >>101714801
Definitely not gonna watch this one
>>101722789why did they leave SAI?
>>101722687ESL?
>>101723337>whats their secret?not shooting themselves in the foot, stability might as well be clowns in comparison. flux created a safe model that works, meanwhile stability nukes even navels.
>>101723850>meanwhile stability nukes even navels.so why did they do this?was it just retardation?
>>101723850>created a safe* model*please ignore the cunny it generates
>>101723863probably to make up for releasing the evil sd1.5 model that everyone used for UNSAFE porn
>>101721749Tet flux in vp8 mode at 768x768 and 512x512. It produces coherent results at those sizes.
>>101652874>>101653006I am growing stronger. The gooners in trash have been significantly helpful.
>>101723871kek>>101723893>that everyone used for UNSAFE pornand whats the problem with this?
>>101723890SAI shill cries out in pain as he strikes you.
first flux lora results https://github.com/bghira/SimpleTuner/pull/622#issuecomment-2267624531
>>101723983so there is hope?
>>101722529Would you be willing to share the prompt of top left?
>>101723770They realized that the only purpose of that company is to stagnate tech and not evolve it. the entire company is currently at the mercy of the flux team not deciding to release a 4B model in the wild.
>>101723968>and whats the problem with this?absolutely nothing, i hope to see many more safety clowns get BTFO's by good models
>>101721273wait, what is the blurry prompt? or another blurry one you don't care about sharing
>>101721333checked, barbie prompt?
>>101723863>require email for the safe versionwhay are they doing?
>>101724043>absolutely nothingthen why did they do it?whats this "safety" obsession about?
>>101724074don't need it, once it's in the wild just download it from elsewhere
>>101724074just give them a burner email and claim your name is Nigger Niggardson like I did
>>101724125>woah guys look niggerkiller88@aol.com downloaded our model!
>>101723863the cycle of AI is always>new thing>it's too good and people have too much fun with it>panic because advertisers and payment processors wont be pleased>spend way too much time and money trying to "fix the problem">AI is now dogshit that appeals to nobody and no one will use it>repeat
>>101724166>panic because advertisers and payment processors wont be pleasedhow is that even a problem nowadays?why dont they just collect funding and donations through crypto?
>>101724162Hey bro don't dox me
>>101724168how do you get retro?
>>101724191"A screenshot of nuclear explosion in the city from a N64 game, lowpoly"
>>101723893are any only capable of safe nudes, and no unsafe ones, and no sex?
>>101724180I guess cultural terrorism is pretty deeply ingrained in everyone's mind, thanks to social media
>>101724166flux is local, no one can ever take it from mealso, the standard by which new ai will be judged has been raised- users will not tolerate anything new if it can't compare to flux
also what GPUs do you dudes use and how efficient are you with the flux?me I'm using an Gigabyte RTX 4090 Aero OC with 24GB vram,generating 1 image takes somewhere in the ballpark of 40 seconds to 2+ minutes.also what I found interesting is that this flux model doesnt seem to stress the GPU all that much, the GPU temps are very good while with SD models they go up way higher and the GPU also seems to draw less power with flux.
>>101724180you have to do business and the vast majority of normies won't want or know how to use cryptopersonally, I think we should do pic related to solve the payment processor question.
>>101724220No one can take it but arguable they avoided training certain concepts to cover their assOf course someone will fix that sooner or later
>>101724231>you have to do businesswhat business? they arent really selling these models are they?>the vast majority of normies won't want or know how to use cryptoshitloads of people already have crypto and these people are also usually tech savy so I'd image the user base overlaps here a lot.>I think we should do pic related to solve the payment processor question.Chinese Communism?
>>101724226>generating 1 image takes somewhere in the ballpark of 40 seconds to 2+ minutes.Maybe that's about right? My 6950xt gets me 14.88 sits. what are yours?
>>101724220>no one can ever take it from meIt's like the new Linux.
>>101721818Does it really? Fuck I only have 10 GB.
>>101724226RTX 3090 msi triopicrel takes 58 sec at 1536x1024 25 steps euler
>>101724252i managed to get 60s per gen on 2x 3080tis (3s/it)
>>101724249>what business? they arent really selling these models are they?while dev and schnell are opensource, BFLA have a pay-per-image api service. as well as a closed-source "pro" model also SaaS
does flux run on amd?
>>101724282
>>101724252>what are yours?seems to vary a lot for me, I think that has probably to do with me doing other shit while it generates but dunno
Imagine the coom when there is a flux well-tuned for pornNot that I'm a coomer who wants that or anything haha... unless?
>>101724287I see, so their problem is basically the SaaS not wanting coomer customers?
Flux (FP8) seems to break down at native 2048x2048 but is fine at 1536x1536.
>>101724166This. I love Flux but it too is on a timebomb before they pull another SAI or go closed source and then we have to rely on yet another startup, repeat.
>>1017243441536x1536 for comparison, same seed and prompt.
>>101724360*jaw drops to floor, eyes pop out of sockets accompanied by trumpets, heart beats out of chest, awooga awooga sound effect, pulls chain on train whistle that has appeared next to head as steam blows out, slams fists on table, rattling any plates, bowls or silverware, whistles loudly, fireworks shoot from top of head, pants loudly as tongue hangs out of mouth, wipes comically large bead of sweat from forehead*
>>101724370
>>101723525heh
These times all sound higher than I would expect for the powerful hardware, are you all using dev?3060-12G | i7-9700K | 32GB DDR4:schnell @ fp16 | 4 steps | 1024x1024first gen for a prompt: avg 118.9s (n=31)repeat prompt only changing seed: avg 34.0s (n=224)dev @ fp16 | 20 steps | 1024x1024first gen for a prompt: avg 216.5s (n=24)repeat prompt only changing seed: avg 121.1s (n=139)
>>101724300Honestly, I am now wondering if Flux is friendly to amd vs what we might expect.
>>101724057the usual young blonde 1girl + retro artstyle. Loras do the work
>>101724197so it doesn't know what N64 games look like, at all
>>101724416my 6950rx gets me 14.88 sets, with fp8. what are you getting with fp8? I jave not tried the fpX thing and don't know how iy works.
>>101724438
>>101724438I bet it's trained on comments on an art site.
>>101724445>title screen>no gamewhat are you trying to say, anon
>>101724393
>>101724452
>>101724469so it can't do N64, got it. thanks for the examples
>>101724289YESNOit's hard bakalaka
>>101724344Adding steps seems to have helped somewhat. This was 30 steps. Gonna try 50. RIP my GPU
>>101724484It can, I saw gens, i just took them for inspiration, didn't actually try to get N64 look that hard
So is this like a high res local dalle3? Can it do a mazda rx-7 drifting on a canyon road? Maybe a bathtub full of sausages?
>>101724501It can't, not by prompting "N64" or "Nintendo 64"
>>101724506
>>101724506I dunno is this is the right car
>>101724529Bros... we made it. What the hell is that poopy water though.
>>101724506and
>>101724560nta but that's the correct car
>>101724560>>101724564funny how similar they are lol
>>101724560>>101724564Damn. It really is local dalle3.
>>101724575dalle3 can do Scarlett Johansson. Flux cannot.It is useless to me.
>>101724439hang on, I haven't tried yet (I want to test with full capacity) i'll get back to you when queue clears...
>>101724561Sausage water, anon.
>>101724586Female celebrities censored out (but probably still trained). Artist names censored out (but probably still trained). This is a temporary issue.
>>101724586
>>101724586Shame. At least it's still progress compared to the SD shitfest.
When comfy says loading in lowvram mode is it automatically switching to fp8 down from 16 or something entirely different?
>>101724604Yum.
>>101724608you don't really think that's her, do you?
>>101724605Wouldn't it be possible for someone to make a VLM out of Flux and then reverse engineer what it thinks the celebrities and artists are?
>>101724604
>>101724496More steps helps. This is 50
>>101724645
>>101724656try this with 1536x1536, going above this seems to start breaking things regardless of steps
so why is it an llm can be offloaded on multiple gpus but not an imagen, how would that work with vlm then
>>101724586>gen an image>inpaint face at 0.5+ denoise using your preferred SDXL slopsimple as
>>101724657would love the prompt, love the creepy stuff
>>101724689the body will be different, also SDXL doesn't know her either, it is closer but so off.
>>101724688don't see why imagegen wouldn't be able to be split across gpus, it's just a matter of bothering to implement it
>>101724671
>>101724706https://files.catbox.moe/pbzm3u.webp
>>101724709There's like 10 loras of her on civitai for sd1.5, sdxl and pony
>>101724746all shit
>>101724709https://github.com/TencentARC/PhotoMakerdoes this work?The guy that got in the news over the gens of Taylor Swift basically being hazed at a football game used something like that, not sure which one.
>>101724770>The guy that got in the news over the gens of Taylor Swift basically being hazed at a football game used something like that, not sure which one.He took the Taylor Swift images from the DALL-E 3 threads, took pretty much all celeb images to sell on Twitter but only heard drama regarding Taylor Swift's.
>>101724741cheers
>>101724785he genned them, and for some reason posted them on /pol/it's relevant, because women found out about ai, or some of them did, that never heard of it really.
>>101724689>Replace Flux's high quality face with sd's same face-same expression retarded slopGenius move right there
>>101724792It's creepy, because that's the kind of person who would never say hi to me. In some way it's a hidden ugliness, thus it symbolizes evil, and it's a surprisingly universal symbol of evil.
>>101724092>whats this "safety" obsession about?It's convergent evolution, every AI image model needs to converge to maximum safety, where you can only make cat memes.
>>101724709>the body will be differentyeah it probably wont be a jew fridge body, whats the problem?
>>101724839What about:>>101724770
>>101724605>This is a temporary issue.How? Can someone really finetune it to add artists, characters, nsfw, and female celebs, all while not going back to tag writing?Who even has such a gigantic training dataset, and the money to finetune this?
so are flux LoRAs letting me gen anime porn of my waifu yet?
>>101724827Worst thing that could happen, is to make normies go on a moral panic about this.
>>101724827>he genned them, and for some reason posted them on /pol/he didn't gen them and they were being posted on /trash/ if my memory doesn't fail me. can't recall the board but I never go to /pol/ so unless someone was posting on both it was not from there>>101724854cool it
Holy shit guys. I can finally emulate the finest youtuber thumbnails.
>>101724660Catbox for this one?
>>101724845
Flux is stealing shit from actual people lmao
>>101724992Oh god. Can it generate terrible sonic OC?
>>101725010ye
Asking AI to write long detailed prompts for imaginary product advertisement posters was a good idea.
Ready to roll with fresh bread...>>101725030>>101725030>>101725030
>>101724906not yet, we'll get there
>>101725015yep
>>101725010Not yet, actually, apparently. But maybe someone will figure it out.
>>101725048What if you use deviantart as a quality modifier?
>>1017250 i'm just using the website.https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellI can't use any modifiers.
>>101725048chris chan in shambles
>>101725093alternate universe sonichu?
>>101722504I RUINED AI ALL NIGHT LONGseed: 666guidance: 6.7https://files.catbox.moe/fvq4xd.png
>>101725113OOPSdoggo is seed 553046854372910
>>101724679I did. See >>101724360
>>101725113but I will still call him 666 doggo. What's he been up to here?
>>101725048Who is the bald purple Sanic?Chemo the Hedgehog?
>>101725048That's very nicely drawn. Definitely gets an A+ in art class. Makes sense, people don't upload bad drawings.
>>101725131Wait, when I enter seed 666, it turns into >553046854372910
>>1017242263080 with 32gb system ram. prefer no sysmem in nvidia settings.takes 90-120 seconds for fp16 using the default settings from the comfy example. for fp8 it's around 60 seconds, usually just under.there's a little bit of swapping to pagefile with fp16 so would probably be way fucking slower without nvme.
>>101725375>32gb system ramdoes that matter?I thought its only about vram
>>101725419YES!!!I had 16gb in my system, with a 6950xt, which has 16gb of VRAM, I thought that should be enough, but I guess it might take a whole hour to generate with fp8, I gave up and tried installing the 16gb from another system, now 32gb. Well now I gen an image in about 5-6 minutes.And maybe I need more ram.
>>101725419i'm not gonna pretend to know how/why it works but since it is a swapping a lot it seems to matter a bit. i suspect as long as you have "enough" (more than 32gig) sysram the speed/performance of it doesn't matter that much.
>>101725375What's your sits on fp8? Mine is 14.88, on the 6950xt.
>>1017255052.6
>>101725466>>101725474I have 64GB ram, if I upgrade to 128GB ram it will not become faster?
>>101725505>14.88
>>101725582i doubt it. it might allow for better vram swapping if you have prefer sysmen fallback enabled in nvidia settings. worth trying for science if you have spare ram lying around but i wouldn't spend money unless somebody can confirm it helps.
Just kind of funny that's where it averages.
>>101724882The solution wouldn't be a regular finetune, but something similar to a vector DB that dynamically applies a LoRA on the fly. Ideally the LoRAs wouldn't all be trained manually and just values generalized from the data itself. Something similar has been done for SD, forgot what it's called.
Are safetensors slower?
>>101724439I'm getting same speeds (give or take 3 or 4 seconds) with fp8 idk what to tell you
>>101726677Interesting. Well, it means your card works great with Flux, vs AMD.
>>101724586There may be a way to find the negative prompts.This may be the gateway to reverting baked in negatives.