Discussion of free and open source text-to-image modelsPrevious /ldg/ bred : >>102922252SD3 Large Edition >Beginner UIFooocus: https://github.com/lllyasviel/fooocusEasyDiffusion: https://easydiffusion.github.ioMetastable: https://metastable.studio>Advanced UIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgereForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://aitracker.arthttps://huggingface.cohttps://civitai.comhttps://tensor.art/modelshttps://liblib.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3>SD3 Largehttps://huggingface.co/stabilityai/stable-diffusion-3.5-largehttps://replicate.com/stability-ai/stable-diffusion-3.5-large>SANAhttps://github.com/NVlabs/Sanahttps://ea13ab4f5bd9c74f93.gradio.live>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/fluxDeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest>Maintain thread qualityhttps://rentry.org/debo>Related boards>>>/aco/sdg>>>/aco/aivg>>>/b/degen>>>/c/kdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vt/vtai
>mfw
so current conclusion is that sd3.5 cannot be finetuned in practice, like flux?
ldg and discord stay winning
>>102926821>so current conclusion is that sd3.5 cannot be finetuned in practice, like flux?it can, it's not a distilled model, the issue is that it's a giant model (8b), so you won't train it with your 24gb card, don't dream about that lol
I DON'T WANT TO SIGN UP FOR AN ACCOUNT JUST TO DOWNLOAD THIS SHIT FUCK YOU
>>102926803>subhuman so obnoxious not even the rest of xir fellow avatartroons tolerate xirLMFAAAAAAAAAAAOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO
>>102926821You can almost train Flux with layer offloading, so yes, you can train a model that is 4B smaller.
>>102926860desu skill issue
so do i need to download any of the other shit off of hf or just the model
>>102926862>layer offloadingthat's slow as fuck though?
>>102926855Meh
looks like SD3.5 can't do high resolutions?
>>102926897it's still going to be significantly faster because 4B is a lot
>>102926860buy vpn retard.
>>102926843>>102926862I see, thanks anons
>>102926860>I DON'T WANT TO SIGN UP FOR AN ACCOUNT JUST TO DOWNLOAD THIS SHIT FUCK YOUthis, I'll wait for someone to upload the weights elsewhere
>>102926966What does that have to do with anything?
did they remove negatives again?
>>102926980it's just an ad for nordvpn, don't worry anon
they spent months retraining SD3 -> SD3.5 on women lying on grass and they still haven't nailed it lmao
>>102926934Looks like the same issue as flux and sd3. Use a different technique to upscale like SDUltimate. Cap the working area to 1 megapixel
>>102926990there should be negatives because you go for CFG > 1, the demo just doesn't allow you to use them I guess
>>1029269341280x1280 seems about the limit before things start getting extremely fucked up. Edges still get garbled though
>>102927007Probably the safety/alignment teams fault. They’re so scared of the models learning anatomy that they butcher them.
>>102927020Flux can do higher resolutions without any upscaling just fine
>>102927040>They’re so scared of the models learning anatomy that they butcher them.if they want to stay behind Flux for the rest of their life, that's their problem, we'll go forward with or without them
>>102927007It's real? Damn, they really didn't learn.
>>102926803welcome back to these threads, it's been a while
i absolutely love FLUX for being so incredibly good at character LORAs. 20 small images and you get near perfect results you couldnt even dream off using SDXL. i fucking hope SD 3.5 will mange this as well, because i cant go back now. i recently tried using my old SDXL/PONY LORAs that i thought turned out well and they look fucking dogshit all of them.
>>102927007I really doesn't like upside down view>a top-down full body view of a woman lying in grass, she's holding a sign with the text "Garbage model, don't recommend"
it passes the sailor moon cenobite check. already liking it
i'll try this flux prompt in 3.5, brb
>>102927127This is the same prompt but with Flux dev dedistill
>>102927195her feet are fucking blurry
Babe wake up, the first real finetune of flux dev (from dedistill) has been finishedhttps://huggingface.co/SG161222/Verus_Vision_1.0b
>>102927206I can't keep up with all these models nigga
>>102927200sorry Dan
>>102927200yeah I hate Flux for that, it makes everything blurry
>>102927206so how is it?
>>102927195>>102927200more than that, the hands are fucked up on a fundamental level
>>102927232>the hands are fucked up on a fundamental levelcompared to SD3.5 it looks fine lol >>102927127
>>102927200Skill issue unironically. I posted about how to prompt it back during release but I guess people didn't keep the knowledge alive.
Quick impression. I always found flux gives its "oil paintings" a muddied amateurish look. SD3.5 gives a better result, looks more like bing which isn't a bad thing exactly.Still no artist names sucks.
It's fun but being limited to 1mp without upscaling kills it for me
Is it over or are we back bros?
>>102927241none of that ever worked reliably on all sorts of difference prompts
>>102927206>11.9 GBI hope he's not only releasing the fp8 model, would be retarded
>>102927136>sailor moon cenobite checksana
>>102927206>warning do not use a negative promptlol okay
>>102927243>Still no artist names sucks.I feel like we'll never get something as kino as MJ for the artist names/celebrities if they keep going for the VLM path, and they like doing it that way, they are too cucked to take the risk on making the mob angry, that's why I respect MJ, they don't give a fuck they have actual balls
>>102927282>prompt: shifty looking chinaman greased with lard
>>102927252>Is it over or are we back bros?SD3.5 looks like a worse version of Flux, desu I don't see the point, if people can run a 8b model they definitely can run a 12b one, so I'll stick with Flux I guess, the only way to win me over would be them having a model that has the styles and celebrities in there, but it looks like it's as empty as Flux
>>102927310you won't. localjeets have been psyopped into thinking text on signs and jesters juggling green cubes are more important.
>>102927272It did, but you never tried it since you never actually heard about it, because people don't listen and let information be buried.
>>102927336>you won't.I know anon, I know
>>102927310>>102927333it's a nothingburger of a model, needs a decent finetune before even being worth checking out.
>>102927349prove your claims
>>102927296yeah, even with dedistill, when you use negative prompt it tend to destroy the quality image, maybe that's a proof it's not fully undistilled idk?
>>102927206dedistill is a red herring stop wasting your time
>>102927310It's never going to happen because you'll just do deep fakes and ruin it for everyone
>>102927408>you'll just do deep fakes and ruin it for everyonewhat is celebrity loras? what is PuLID? what is InstantID?
>>102927428extra hoops that filters you
>>102927433it doesn't, I know how to use them
>>102927445no you don't, that's why you bitch about it
>>102927388nope, you don't want to try that's a you problem, people want to save flux and we're going to do it, with or without your doomerism
sana doesnt appear to know "cenobrite" really well but cool image regardless >premiere studio anime, junji ito and yoji shinkawa, sailor moon as a cenobite from hellraiser
>>102927463by we you mean someone that isn't you
SD3.5 can do nippleshttps://imgur.com/m6yJqRB
>>102927472CFG 1 for both of these >>102927486im so lonely bros
>>102927206>An asian womanLeft to rightVerus_VisionFluxDeDistillfp8Fluxfp8
that one is pretty good>WWE fight, a person jumping from the ropes into another one
>>102927506looks like he removed the buttchin and the wrinkles over the mouth, nice
>>102927152*first pic made with downloaded sd 3.5l *oh, way better than expected
Are we back or is it so over?
>>102927566they definitely improved their model, but it's still inferior to Flux so... maybe SD4 will beat them I guess?
>>102927566it is always over. It will be days of nitpicks between models and idiots who deny that secondary i2i is always needed.
>>102927537Not quite. "Middle aged blonde woman" this time
>>102927490it's mispositioned
>>102927611why is it so shiny on vanilla flux? are you on cfg 1 on that one?
>>102927629Yes. That's what skin looks like with default flux
>>102927506thanks for the comparison, anon
>>102927517this is the same prompt but on flux dev dedistill
>>102927566It was never over.. personally I'm waiting for the 3.5 Medium
>>102927646>That's what skin looks like with default fluxI'm surprised dedistill has the skin more natural, he really nailed that shit desu
>>102927357Here>>101714916>>101714923>>101714958And NEVER use "sharp background", "clear", "in focus", or other things that describe the focus of an image. Those do nothing or can make things worse.In my experience since those posts, styles have more influence in the ability to remove DOF, but optimally you want both detailed description and style to get consistently sharp gens.>but I don't want to describe so much shitSure it's unfortunate but it works and it proves it is possible.The better solution these days is probably to just use a LoRA or use a workflow with negative. I simply just wanted to challenge myself to see if it was possible to do with a vanilla workflow and prompting alone.I don't gen images anymore since I always just check things out once and then wait for more releases.
>>102927566>Are we back or is it so over?I think we're back, the licence is good, it's not better than dev but better than schnell (I guess?) and it's smaller (12b -> 8b), this definitely has potential for training
>>102927696neat. can you do other stuff in that style? buff guys with swords, buxom women, general fantasy, etc.
>>102927541what about when you add some text like in your previous images?
>>102927744lets try, also ,can't render a full figure... i smell something fishy here
>>102927704>NEVER use "sharp background", "clear", "in focus", or other things that describe the focus of an image. Those do nothing or can make things worse.It's insane how many anons think tokens like these help, but then you remember most don't even care to set a fixed seed when testing...
>>102927704why did you bother to link me posts when there is literally one sentence of usable info in them? and no metadata for the imageson second thought dont bother replying, it's cool
am i stupid or wasn't there one branch of forge/reforge that has sd3 support
>>102927839kek but that's a portrait nono, pikachu don't need a skyscraper gorwing out of he head
https://huggingface.co/camenduru/stable-diffusion-3.5-large/tree/mainOk, one fine gentleman uploaded the fp16 weights on his channel, so we won't have to give our infos to SAI to download it, let's go
How large is large?
>>102927871cityscape was prompted for
back status?
>>102927007YOU HAD ONE JOB
>>102927898>How large is large?8b
>>102927964thank you
>>102927876now reduce and compress that shit until i can run it on my 10 series chop chop
>>102927944ngl that doesnt look very stable
>>102927992kek
>>102927860dev2?
60% of renders are plagued with blatant anatomy errors, but i'm also toying with cfg... still no legs
On Fal:>SD 3.5 large -> $0.065 per megapixel>Flux-dev -> $0.025 per megapixelSAI is completly delusional, making you pay almost 3x more than a better model
I like it tho
it's strange that the default denoise strength on the demo s 0.85
I'm sorry footfags, but you won't be eating good with SD3.5 ;(
>>102927206why is verus vision so slow? 5s/it when flux-dev is 2.4
>>102928144that's because you're going for CFG > 1, so that halves the speed, such is the fate of undistilled models
>>102928111>>102928149Is this what I think it is?
>>102927803Well, it's not impossible they could help with some models if they were trained on those tokens. In the end the issue with people is probably just not wanting to spend the time doing objective AB comparisons on the influence of tokens. And actual shitposters.Anyway, looks like the other guy was just a troll after all that doesn't actually want to learn anything or have a productive discussion. Sad.
>>102928159what do you think it is?
Jesus Christ, i'm starting to think they've done a trip to a grassy field and took a bunch of photos to train this model, but only waist up...
>>102928158its over....
>>102928158nta but why does cfg halve the speed? what is different about flux/dedistill cfg compared to classic 1.5/xl?
>>102927506
>>102928172Never mind. What is it? It's interesting.
>>102928180the negative prompt makes a sort of nega-image that it uses to guide the main image away from, with cfg 1 negative is ignores so obviously it will be twice as fast
>>102928180>nta but why does cfg halve the speed?because cfg uses 2 pictures to calculate the negative prompt against the positive prompt, so it's twice as slow>what is different about flux/dedistill cfg compared to classic 1.5/xl?nothing, that's the point, it's undistilled so it means it act like a "normal" model like SD1.5/XL (those one also work on CFG > 1 and also have their speed halved)
>>102928186>horror movie screengrab, 1980s, cinematographyIt's my unstable Pixart Sigma 600m finetune while I wait for Sana.
>>102928185the difference is subtle, but it looks less burned on the finetune I guess? and the woman has better horizontal proportion
what I don't like about SD3.5 is the oversaturation, they fucked up the colors somehow
Well there's definitely some "diversity". I prompted for cyberpunk geisha android and got netflix version.>>102928213cool stuff dude
>>102928228have you tried a slightly lower cfg. that might desaturate things a bit.
>>102928247yeah but it looks like cfg 3.5 is the "expected" value from that model to get good prompt adherance
>>102928214Yeah I'm not sure. Skin texture might be a little better but that could be the slight grain Verus seems to have, I dunno. The only thing these comparisons have reinforced for me is I'm not going back to distilled ever, s/it be damned
>>102928260model authors are wrong about this shit almost every time
You guys see the new Open Source video gen model? https://x.com/genmoai/status/1848762405779574990The weights are open source, I wonder how much ram it takes to run though.
>>102928291What the fuck is happening with all these model releases
>>102928254Yeah I decided to play with base Sigma since Sana was announced.
>>102928291HOLY SHIT HOW MANY MODELS GOT RELEASED TODAY??- Moshi 1- SD3.5- Allegro- Omnigen- Emu1- The demo of SanaI won't forget this day that's for sure
>>102928291The quality is absolutely insane, and that's Apage 2.0? What the fuck man?
>>102928291>>102928321Now which one is the best of the bunch and can run on a 3090...
what do i do with guidance with verus vision?
>>102928291>open torrent>dit.safetensors>40 GBholy shit its a big boi
>>102928367>holy shit its a big boiif that's the whole package (DiT + text encoders) then maybe it'll be runable on Q8_0 on a 24gb card
Bigma sisters?
>>102928291>Minimax tier but localDo you guys have any idea how big of a deal this is?https://files.catbox.moe/6ddgsl.mp4
>>102928291They have a test site you can try it out on https://www.genmo.ai/
>>102927989Jones!
>>102928429bigma dead, sd 3.5 curb stomped it
>>102928456600m is all you need
>Our commitment to safety>We believe in safe, responsible AI practices and take deliberate measures to ensure Integrity starts at the early stages of development. This means we have taken and continue to take reasonable steps to prevent the misuse of Stable Diffusion 3.5 by bad actors. For more information about our approach to Safety please visit our Stable Safety page.On the safety page, one of their "6 pillars for safe AI":>Ensuring data integrity>We maintain model integrity by carefully screening training data, excluding illegal content to uphold safe and ethical standards in our products.And of course it's gimped lmaoSD 3.5 is DOA
>>102928242
Verus can't do nipples Into the trash
>>102928454
>>102928291>The model requires at least 4 H100 GPUs to run. We welcome contributions from the community to reduce this requirement.oof, guess we have to wait on quants
Very busy day. Have like two hours to myself -_-Is sana local yet, has pyramid released the proper model?And uh, is eveyone ok?
>>102928604>And uh, is eveyone ok?fuck off homo
>>102928612nice
>>102928604everything changed
>>102928604I'm not ok. I'm dealing with health issues that my doctors were not able to figure out. I'm in literal pain and it sucks. But anyway...
>>102928456>curb stompedLet's not devolve into exaggeration.
>>102928616I'm happy you hate faggots, everyone does, faggot.>>102928638I see >>102928321>>102928649I hope you find a sedative that allows you mental space to get back on the diagnostic process.
Is it cheaper to run locally these models, if I'm doing it for a business? I made an app that uses AI, but I ran the numbers to see how much I would spend using the Open AI API prices, I got spooked. It's expensive, especially the input tokens.If my app is "freemium", I will be losing money unless a decent part of the free users convert to premium.
>>102928696>Open AI API prices, I got spooked. It's expensive, especially the input tokens.are you talking about text gen or image gen?
>>102928591is there any info on how long it takes on 4xH100? No way it will be reasonably fast on 4090And I thought Allegro taking 50 minutes for 6 second video is as slow as it gets
>>102928716Text gen using gpt-4o.Image generation is expensive too though, and it would use image generation... But image generation is easier to run locally since it doesnt take that long
>>102928612thats fucking great
>>102928716>>102928742I'm retarded, just realize I'm on the wrong thread again
>>102928438I signed in even but it's not letting me prompt. Shame.
>>102928429sana exceeded my expectations so i'm happy and it can only get better
anybody else have to verify email or wait 15 minutes to post?
>only 8 new model releases todayYawn
>>102928882>smacks lips
>>102928882Flux 1.1 WHEN
>>102928794>>102928438yeah same for me, seems bugged, but it does say you get a free 30 videos per month
Flux could die and I would not care
>>102928434holy shit! that's really good
>>102928604>has pyramid released the proper model?forget about that shit, we literally have MiniMax at home now >>102928434https://huggingface.co/genmo/mochi-1-preview
>Pony won't finetune SD3.5It's ova...
FIXYOURDEMOFAGGOT
>>102928957>The model requires at least 4 H100 GPUs to runCome on dawg
>>102928969it's asking for 320 go of vram? oof... fuck my life...
>>102928987so even after quants it will be like 80gb? Maybe doable if you can use regular ram as well though I bet it's slow
>>102928987it says *atleast* 4 h100s so that means 320gb of vram is on the lower end
Are we sovl again?
>>102928741I think Mochi should be runnable locally as long as you have two 3090s / 4090s.Allegro runs on 4090 using 22GB VRAM, tested it myself. The transformer is 2.8B parameters, with a context length of 80k when doing the full video length.Mochi is 10B parameter transformer, with 44k context length, which is lower because I think the VAE has more temporal compression.So assuming you could quantize each to 8 bit, the model weights are a difference of only 7GB. Memory usage should scale linearly with context size and hidden dimension. For Mochi, context is half that of Allegro, hidden dimension is larger (need to look up the exact difference). This is all back of the envelope math and extrapolation but probably mochi can be made to run on 2 3090s with a good, efficient inference implementation.
>>102929008i'd say so, yeah. all the new models today have pretty sovlful gens
>>102929017whelp, if thats accurate and it's good I might finally have my excuse to get another 3090
>>102928966The sooner you accept Pony won't train a new model the happier you'll be. He's going to grift like the Summertime Saga dev.
>>102926788Horrendous gens in OP
>>102929051Sounds like you're just jealous yours didn't make into the OP
this is something I like on SD3.5, even at CFG > 1 it's really fast (1.25s/it) compared to Flux at CFG > 1 (3.5s/it)
>>102929051my gens are in there so you better take that back, punk
>>102928966Is he still claiming to be using the super secret version of Auraflow that supposedly isn't ass?
>>102929062i did make it into the OP tho
>>102929062Bottom left is the only half decent one the rest are ass just like SD3M
>>102928957I dont have 4 H100's at home? Do you? Back to waiting for Pyramid.
>>102929098Bottom right**
>>102929098sorry i meant to say that bottom right is ass and the rest are great** need my coffee lol :p
>>102929017>This is all back of the envelope math and extrapolation but probably mochi can be made to run on 2 3090s with a good, efficient inference implementation.https://www.youtube.com/watch?v=oxSJFkS9iVM
>>102926788thank you for including a 1girl this time
https://stability.ai/news/introducing-stable-diffusion-3-5>Stable Diffusion 3.5 Medium (to be released on October 29th): At 2.5 billion parameters, with improved MMDiT-X architecture and training methods, this model is designed to run “out of the box” on consumer hardware, striking a balance between quality and ease of customization. It is capable of generating images ranging between 0.25 and 2 megapixel resolution.they plan on releasing a 2.5b model in a few days
>>102929222they really wanted to bury sana before it even had a chance
Holy shit, SD3.5 can do nude women just fine, it's not censored at all!https://files.catbox.moe/w0katp.pnghttps://files.catbox.moe/646coy.png
>>102929236the competition just makes sana 2 that much stronger
>>102929222>with improved MMDiT-X architecturethey also used that improved architecture for the 8b model or not
>>102929239>wow! it's a whole bunch of nothing!
>>102929252well it's a huge improvement to the body horror of SD3
>>102929239Heh, looks like Flux actually mentally broke them. Complete 180 from the sterilized shit they were trying to do before.
>>102929236Niche makes it fun
>>102929252At least it can do nipples unlike Flux lol
>>102929245>MMDiT-X architecturei think it's just sd3.5m, but don't quote me on that
>>102929264>>102929279fair points
better feet than flux
>>102929275this is why competition is good, the thousand models that released today is a sign of good things to come. i don't want a single company to dominate again
>>102929301Is it though? She only has 4 toes on each foot
>Sailor Moon playing ping pong against Hatsune MikuIt doesn't look great but there's no concept bleeding like on Flux, I think SD3.5 has some huge potential
>>102927252VRAMlets are so over, but VRAM chads are so back.https://x.com/genmoai/status/1848762405779574990>SD is still a shitty model.
>>102929307Nvidia still is...
can sd3.5 large do armpit hair?
>>102929301>>102929311yeah at least flux made the toes look like dicks
>>102929319yeah but i meant for image gen models
flux remains winning. this recent garbage is today's equivalent of kolors and lumina. useless shovelware trash that will be forgotten within a week. sd3 8b looks no better than sdxl.
>>102929339You don't want nvidia to lose the lead?
>>102929355what?
>>102929315I bet everyone in this thread combined doesn't have enough vram to run that locally
>>102929350>this recent garbage is today's equivalent of kolors and lumina.the only thing worth looking at is this insane model, fucking MiniMax at all who would've thought we'd get something that powerful so soon? >>102929315
>>102929311Yes, flux cannot do feet soles AT ALL, its pretty bad at feet
>Jim CarreyCome on SAI... it would be one thing cool to dethrone Flux, have fucking celebrities and artist styles onto your fucking model
>>102929315I have no idea why they decided to release this model (and with an Apache 2.0 licence), this isn't just a SOTA local model, it's probably one of the best video model that ever exist, holy shit...
>>102929415How does CogVideoX score that high? In my experience it fucking sucks
>>102929383I really dig the aesthetics, it has a lot of sovl, too bad the details are bad though, something's missing on that SD3.5 model I feel, it can be close to Flux with some more training I think
Jeebus Christ...
remember to thank SAI's new ceo Prem Akkaraju for the shiny new (uncensored) models!
>>102929315>Here, I got you MiniMax local, you just need 300 Gb of Vram to run it though, BYE!Thanks I guess? kek
>>102929415probably some ai image/video model regulation coming up pretty soon, thats my bet why companies are releasing models left and right, it will be very difficult for companies to release models in the future
are they gatekeeping us with vram on purpose?
>>102929301Except for feet of a body lying down in the grass...
>>102929473>are they gatekeeping us with vram on purpose?I don't think that's what they want to do, we just can't make quality models without much vram requirements, at this point we can only blame Nvdia from gatekeeping the max VRAM, they still locked it at 24gb since 2018, those greedy fuckers...
>>102929473Whether you like it or not, attention takes memory.
>>102929457How much coke do you think he does?
>>102929473yes so all those data centers can wring your wallet
>>102929453after spending more than 2 months on Flux, it just feels refreshing testing out SD3.5 model, it just more diverse images styles and not the same shit over and over again
>>102929498who can wring my pp?
What sample are you guys using with Flux? Euler 20 steps? I want to gen faster. My autism needs it
>>102929520sounds like a skill issue
>>102929496
>>102929533>I want to gen faster.you can try out that lora, it allows you to gen for only 8 stepshttps://civitai.com/models/870028/real-fascination-hyper-8steps-flux1d?modelVersionId=973745
>>102929551Thanks bro, I will try it
>>102929239Ok as expected there's no pp in this model, lamehttps://files.catbox.moe/ab7piy.png
Civit exists for so long now, and it's still possibly the worst site I've ever used. Will the they ever make it work properly?
if you told me this was a 1.5 finetune I'd believe you. where are the 8b parameters going?? because I'm not seeing them in any of these outputs. why are the details fucking melting on an 8b model?
>>102929582>1 hit wonder>keep doing the same shit>it's still shit>???>profit
>SanaSHIT>SD 3.5SHIT>Verus Flux fine tuneSHIT>Genmo Mochi 1UnrunnableLAME
>>102929383Even if styles by name of artists aren't there, it's better than flux in this aspect. Prompt adherence seems ok (as SD3M already was) but anatomy mistakes are more frequent than with Flux.If it behaves better with style Loras than flux does, I think it is a worthy model.
Anyone know what resolutions SD3.5 is trained at other than 1024x1024?
>>102929628sd 3.5 medium will be ultra kinorino
>>102929582>why are the details fucking melting on an 8b model?that's what I'm asking myself aswell, why the details are so fucking bad? 8b isn't that far away from 12b, don't tell me what BFL did was fucking magic, it can definitely be replicated, SAI just sucks ass man...
Nearly there...
>>102929643I can't go much higher than 1024x1024 without everything turning to garbage
>>102928966Is because the license has a limit of 1 million, so... This faggot is earned more than one million with PonyXL and his service, make you think...
>>102929582SD 1.5 finetunes were pretty bad too. You're thinking of SD 1.5 mixed with NAI... Now THAT made some pretty good models *siiipps*
>>102928957They fucking nailed that shit, holy fuck dude this looks incrediblehttps://github.com/genmoai/models>The model requires at least 4 H100 GPUs to run. Kek, gonna wrap a rope and kms I guess, why can't we have nice things :(
>>102929665>This faggot is earned more than one million with PonyXL and his service, make you think...wtf? no fucking way...
>>102929664>I can't go much higher than 1024x1024 without everything turning to garbageyeah, it's even worse than SD1.5 with the duplication, here it's just complete glitches, impossible to go further, are they for real?
>>102929682damn, that's hella good quality for what it's worth
Jesus Christ, I must be doing something wrong.Flux-dev cant be that slow on my computer right?It's a RTX 3060 12GB VRAM. A Core i5 12400 with 64GB RAM.That time was to generate a 768x768 picture with 15 steps.I dont get it, it doesnt take that long in any of the SDXL models.
>>102929017the model is 40Gb big, can't we limit the vram requirement by using Flash Attention or something?
>>102929744it's slow as fuck if Flux is too big for your gpu? What Quant are you runing anon? If you tell me you run the official bf16 safetensors then of course you're fucked, this is asking for more than 22gb of vram, your RTX 3060 will never eat this shit up
>>102926788Now that SD3.5 won, isn't it time to close this thread and go back to /sdg/? You don't need to keep using a distilled model anymore.
0/10
cough
for effort
Goddamnit! Posting this for the lulz
>>102929762>If you tell me you run the official bf16 safetensors I might be very stupid.When I click on weight_dtype I can select different options. Should I be running the fp8 option then? For poorfags gpu like mine
gm
>>102929792>full flux dev>T5 fp16There's you're problem holy shit
>>102929807Alright, thanks for spotting it bro.I'm downloading the Flux dev model that's on Comfy-org's hugging face page right now. That should work better I presume?
>>102929792I advise you to put the text encoder (9gb) into your ram, and go for fp8 yeahhttps://reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
Anyone figure out higher resolutions with SD3.5? Or is it really just fucked?
>>102929844yep, impossible to increase the resolution somehow, almost like it's completly locked at the regular one, that's so odd
>>102929844>>102929910Killed all my enthusiasm being locked to 1mp>bro just upscaleNO!
>>102929844I think img2img is fucked just like SD3 was, when I try to do a hires upscale, I get weird artifacts
>>102929987glad i wasn't the only one. thought i might've just been retarded.
>>102929682I don't get it why it's asking for so much vram, it's "only" a 10b model, twice as big as CogVideoX, and Cog can run on a 3090
>>102929835that's a cool image
Fresh >>102930087>>102930087>>102930087
>>102929239the naughty bits are wobbly, not a good sign bro
>>102929790lmao no way
>>102929910did they train it on only one aspect ratio images? if so that's fuckin retarded
can i run anything on a 6700xt yet?