Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102949176

Two Hundred Steps Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://8876bd28ee2da4b909.gradio.live

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
the skinning alive of all 1girl posters
Blessed thread of frenship
>>102956911Oof, those thots are on point
>>102956931what, you don't like girls?
>>102956971men only hobby
posting in the blessed thread
>slops up a 1girl>doesn't give her armpit hair
>>102957113what model is that?
>>102957162Everyone's favorite unreleased model
>>102957175had a feeling, it looks awesome, mind if i have the prompt?
sana-samas will rise a-gain
>surreal picasso painting, surrealism, post modernism, "The Land is the Lie", Planet Earth
>>102956516
ran another comparison.
fp8
https://files.catbox.moe/drqbg5.mp4
ggufq4
https://files.catbox.moe/mgr9xg.mp4
q4 is just struggling with clarity at this resolution
Others with more experience may get better results.
>>102957237
anon, if your videos are in slow mo, that's because you set 8 fps on the node at the far right. Mochi is a 24fps model. I say that just in case you don't know
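The slow-mo effect is plain arithmetic, roughly sketched below. The 24 fps and 8 fps figures come from the post above; the frame count of 72 and both function names are made up for illustration.

```python
def playback_seconds(num_frames, fps):
    """How long a clip lasts when its frames are played back at the given fps."""
    return num_frames / fps


def slowdown_factor(native_fps, playback_fps):
    """How many times slower the clip looks compared with its native frame rate."""
    return native_fps / playback_fps


# Hypothetical clip length, chosen only so the numbers come out round.
frames = 72
print(playback_seconds(frames, 24))  # duration at the model's native rate
print(playback_seconds(frames, 8))   # duration when the combine node is set to 8 fps
print(slowdown_factor(24, 8))        # how much slower the 8 fps export plays
```

Setting the fps on the combine node to the model's native rate keeps the motion at normal speed.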
>>102957257you're right but i was just testing for clarity, I did forget to adjust the speed in the combine.
any1 know some good chinese artists? maybe sana has lots of that in the dataset and would look really cool
>>102957320I'm pretty sure the model isn't Asian biased, Pixart wasn't.
>>102957160nice
>>102956911cool collage
here is your "not the be-all-end-all of a model" and a "very superficial requirement" brohttps://slow.pics/s/DIDxrbQx
>>102957943yikes, all the vae don't show significant differences to the original except the ultra compressed one, weird heh? :^)
>>102957943actually seethingwere they replacing Flux with Sana or something?
>>102957948great dark tones
>>102957943Flux's VAE is fucking amazing, it's almost like there's zero difference with the original
>>102957943damn wtf
>>102958208Have you tried describing the background more accurately? Might remove some dof
i hope they read my email :3
>>102957337
>I suggest you to use a workflow that works for you and then reconstruct everything from it to get it working
I did it and it works
also any of you dudes got any idea how I can improve on these settings to get a better image?
>>102958375wait what? guidance at 0? cfg at 2? mimic_scale at 3?? What the fuck is going on? lmao
gentle reminder to send the sana team an email with any criticisms/advice you'd like them to read >>102956249if you are a 1girl slopper please ignore this post, an iq above 100 is required to email them.
So Mochi was a big fuckin let down huh
>>102958410
>So Mochi was a big fuckin let down huh
I still suspect them of using the HD model for the demo, once we get that we'll make the real conclusion, 2 weeks
https://www.genmo.ai/blog
>Today, we are releasing our 480p base model, with Mochi 1 HD coming later this year.
>>102958409doubt they would seriously consider feedback regardless
>>102958409theres no way that will go well >>102958437are you a bot?
>>102958454>are you a bot?tf you talk about?
>>102958387I dunno lol
>>102958438>>102958454wouldn't hurt to try, just don't be an asshole like this guy >>102958275 who sent them erp chatlogs
>>102958499
the guidance must be at 3.5, the mimic scale must be at 1, the cfg must be higher than the mimic scale, c'mon anon lol
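Those three rules can be written down as a quick sanity check. This is only a sketch of the advice in the post above; `check_settings` is a made-up helper, not part of ComfyUI or of any thresholding node.

```python
def check_settings(guidance, cfg, mimic_scale):
    """Return a list of complaints based on the rules of thumb from the post above."""
    problems = []
    if guidance != 3.5:
        problems.append(f"guidance is {guidance}, the post says 3.5")
    if mimic_scale != 1:
        problems.append(f"mimic_scale is {mimic_scale}, the post says 1")
    if cfg <= mimic_scale:
        problems.append(f"cfg ({cfg}) should be higher than mimic_scale ({mimic_scale})")
    return problems


# The settings questioned earlier in the thread: guidance 0, cfg 2, mimic_scale 3.
for problem in check_settings(guidance=0, cfg=2, mimic_scale=3):
    print(problem)
```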
>>102958515ok how about this?
>>102958606why do you have guidance negative at 10?
anyone have tips on SDXL models and its refiner?
should I just not use it?
is there a secret sauce behind it to make it work well?
>>102958641
>refiner
nobody uses it
>>102957943>expressive artistic model with a subpar VAE vs. rigid behemoth model with an excellent VAE Why must the world be this way
>>102958805Gen higher resolution and downsample, fixes the problem. It's worse at low resolutions.
Is it over or are we back?
We were always back.
>>102958641The SDXL refiner fucks up your gen's composition. You should just do an upscale instead.
>>102958910nice, model?
I am once again asking if there is a local music generator yet.
Androgynous failed 1girl gen. Is it an ugly woman? Or did FLUX fuck up and give me a guy? Is it trans?What do you guys think, male or female?
>>102958635No clue dude
mochi gguf_q4_v2:
https://files.catbox.moe/ybtzi8.mp4
Noticeable improvement in clarity over v1.
https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main
>>102958410>So Mochi was a big fuckin let down huhAfter doing around 70-80 gens on the website, yeah. Using mochi after trying kling and minimax feels like I'm gaslighting myself that something is wrong with my prompts but no it's just the model being shit.We're gonna have to wait a bit longer for local video I'm afraid. I'm on vacation rn so I'll keep scumming gens every 6 hours during downtime until I hit the monthly limit on all accounts but I'm not expecting to generate even 1 usable video for the art project I want to make
>>102959543
well I've just been testing a bit with 32 steps on euler a to gen initially with 32 steps of DPM 3 and it doesn't seem to destroy the composition with that combo...
just not sure if its worth the config or not cause I was just using pure euler a before lol
basically just hunting for a way to sharpen up some details that euler a tends to muddy up once it gets a decent image ready
would try adetailer and such but I can't find anything for invokeai regarding that sadly
I will say that different settings on this refiner definitely like to fuck things up so this setup might just be placebo at this point
>>102959815Noose is missing
>>102960017What did you prompt for the robot girl stuff, or is it a Lora?
>>102960069>photo of a woman with cybernetic enhancements is sitting in a dark abandoned workshop, she has metal wings, machine made joints, mechanical limbs and blood vessels connected to tubes, wires and cables attaching to neck, wires and cables on head, science fiction
>>102960017>>102960127Amazing
>>102959693Flux with lora trained on a game screenshots
Oh shi...
>>102959798 (me)>I am once again asking if there is a local music generator yet.
>young tween girls>anywhere from 6 to 24 with inverted bell curve probability >young tween women >always over the age of 20At this point I want to keep using this shitty video model just to figure out how to wrangle ages consistently. I refuse to let it win >>102960378There is no good local music generator. Use Suno or udio
Isekai'd by Truck Kun.
Can we get some more troons hanging themselves to counteract the outward-thinking pedophiles ITT?
You Will Never Get A Perfect (or even good) Image Model
no such thing as perfect
>>102960740I have several
>>102959945Nice. I guess Q8's going to be pretty good.
>>102960127nta but nice
>>102960437>There is no good local music generator.ty, maybe some day
SD3 seems to be... Alright... Not huge leaps in quality, but the ability to finetune it seems like it will hopefully lead to some actually decent photorealism at some point. Too bad Astralite swore it off entirely
>>102960878Pony isn't going to release anything again.
>>102960842one of the default prompts i run on every checkpoint
There is a housein New Orleansthey callthe Rising Sun...
Is Cog worth trying on a 3070ti or should I just pay China $10?
>>102958851If only we had it locally so we could do that...
>>102960962The demo works at 4K now, no more repeating.
>>102960973So it seems. Nice.Still, release the model already.
>>102960953I wonder what kinda hardware this chinese video AI runs on
>>102960989Yeah I want it
I had to install the dev version of xformers to resolve dependency issues between mochi and cog. Anyone know what I am in for. >>102960953cog is nowhere close to this quality. Worth it? depends on your goals. >>102960991There is no way they are doing this stuff from scratch. It is i2i off a massive stolen clip database. As such hardware requirements probably aren't that bad.
>>102960991>I wonder what kinda hardware this chinese video AI runs onH100s or something equivalent. Anything else would be even more minutes per gen.>>102960953>Is Cog worth trying on a 3070ti or should I just pay China $10?CogVideo can't be used for any actual art project so the answer is maybe? Like the other anon said it depends on your goals. Remember to use the right tool for the job
>>102961005>>102961017I'm the baker from /mwg/ on /pol/My goal is to generate propagandaThank you for the input
>>102961056catbox?
>>102961056Cool
>>102957237The ggufq4 is sadly too low.
>>102961524trigger discipline in a generated image. I thought I would never see the day.
>>102956987ur gay
>>102960991How well does the minimax model generate tribal girls?
>>102961562dunno man, the benchmark is will smith and the spagetti thats all I know
>>102961513
Late here but mochi_preview_dit_GGUF_Q8_0.safetensors was added an hour ago,
https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main
20 minutes...
>>102960378No, and even if there was, nothing would come even close to whatever udio is having.Their model is insane, they probably fed it every song under the sun, properly tagged.
>>102961726>they probably fed it every song under the sun, properly taggedCan confirm.
>>102961608That's...not bad.What was the prompt? What version of the model did you use?
>>102960953>should I just pay China $10?I'd pay if it wasn't censored crap.
>>102961596will try thanks
>>102961726Are you on honkFM or 8/mu?We have a memetic audio warfare thread there> Sauce a material (thread, article, memoryhole)> ctrl-c, ctrl-v into deepai.org, claude or ChatGPT> "Please generate lyrics for a song about this" > Adapt/shorten the prompt as needed; make it moar catchy/edgy> Use [Chorus], [Bridge] and [Verse: <meme singer name>] for songs with multiple vocalists> Try adding ex. [Banjo solo] for epic solos> 3000 characters max for Suno, 1500 char for Udio> ?????> Archive bangers on Honk FM or 8/mu/
>>102961579my benchmark is bouncing boobs
>>102961752Nope I don't know about these, I'm a simple man, I just want 80s/90s/00s pop catchy/melodic songs while I work, and this does exactly that. It's so nice.I'm not using 10% of what's available like custom lyrics, is there a general around where tricks are shared, like how to enhance their awful slow website?
>>102961737Two women wearing towels in a steamy sauna sharing a passionate kissfp8, 65 steps, 73 frames
>>102961836Thanks, I can go to bed now.
>>102961804>is there a general around where tricks are shared, like how to enhance their awful slow website?Just the dead 8kun /mu/ board that I 'own'Post in the /mawg/ thread We can share thoughts>8kun/mu/catalog
>8kun/mu/catalog.top
>>102961845is this a groomer?
>>102961918yeah just ignore
Is there a list of danbooru tags compatible with Illustrious somewhere?
I swear I saw an anon compiling them some time ago.
>>102962177Is this real?
>>102962195yea
Can't make cunny loli with new models, why fucking bother
>>102962749might as well an hero
>>102962749skill issue
>>102958775Very detailed
is there any sort of big analysis actually comparing adamw, prodigy, adafactor, lion, etc? ive finally arrived on good results using the finetune-extraction schizo's config slightly tweaked
>>102962749A true cunnyseur should be able to enjoy even something as innocuous as a fully clothed loli licking an ice cream, which new models can absolutely do
reminder to other photo 1girlers: we are not the same.
>>102963050I put a few Flux gens in this and reads them as Midjourney kek
>>102961608Absolutely savage that one of them has a hook nose.
>>102963050just make a buttcheek detector
For some reason, on flux, using the multiply sigma thing (for details) at 0.95 turns every generation into a cartoon, I have no idea why.
Anyone else got the same thing?
>>102963169What's your prompt?
>>102963050Well who cares if it's "likely AI generated"?
>>102963225If their retarded code can see it, your eyes can see it too.
>>102959945hehh that looks good! can't wait for Q8_0
>>102960888>Pony isn't going to release anything again.this, he has no incentive on doing anything, he's making a shit ton of money from pony-v6 now
>>102960953
>Is Cog worth trying on a 3070ti or should I just pay China $10?
Cog isn't close, but we got something better now
https://reddit.com/r/StableDiffusion/comments/1gb07vj/how_to_run_mochi_1_on_a_single_24gb_vram_card/
>>102963257I think he's going to try to run out the clock waiting for someone else to make a Booru model and then disappear.
>>102963238I mean if your goal is to try and make AI gens that pass their detector you're doing it wrong mate.
>>102963169>multiply sigma thingwhat
>>102961596
>GGUF.safetensors
I wonder how he makes those, I wanna try the same format for flux dev, I can feel it could speed things up
>>102961726>Their model is insane, they probably fed it every song under the sun, properly tagged.indeed, suno is the goat and can make really niche genres I enjoy like shibuya-kei, they probably went all in with all the good music that exist, as it fucking should
>>102963266Right, and a lot of us wouldn't mind if there were a way to add stego that basically said "I'm fake". I don't like watermarks.
>>102961596This is slower and uses more VRAM than fp8 for me
>>102963268he means picrel
https://x.com/OpenAI/status/1849139783362347293
>We are sharing a new approach, called sCM, which simplifies the theoretical formulation of continuous-time consistency models, allowing us to stabilize and scale their training for large scale datasets. This approach achieves comparable sample quality to leading diffusion models, while using only two sampling steps.
Really interesting, that's a new sampler or something?
>>102963283Literally makes no sense what you're saying. Nobody outside this general really cares except for the starving artists.
>>102963225>>102963266Surely you're not trying to imply that left looks more realistic than right.
>>102963305Just another turbo / lightning distillation method.
>>102963324Left looks better than the right, however. What right has going for it is its extremely high blur to cover any deformities.
>>102963335Not talking about subjective "better" but rather objective "looks like a real image"
>>102963332but the results look way better on that one, like it could be good enough to have 2 steps the norm, just imagine that kino
>>102963342Dunno anon, left looks pretty real to me, and left is objectively better.
>>102963360Don't let me stop you from enjoying slop
>>102963353I'll believe it when I see it in production, many companies have game changers that don't get used and at the end of the day, lightning models are a non-starter because people like to use Loras so it's just something you might use on a generation server.
>>102963374It's quality slop though. Right pic is just far too blurry.
>>102963268
>>102963302
use this instead, it's better and doesn't change the overall vanilla composition
https://imgsli.com/MzExNjYx
https://imgsli.com/MzExNjQ2
https://www.reddit.com/r/comfyui/comments/1g9wfbq/comment/lte0rdg/?utm_source=share&utm_medium=web2x&context=3
you can use this modified script to get more decimals and go for that -0.05 value
https://files.catbox.moe/4gxohm.py
>>102963382>quality >slop Pick one
>>102963377>many companies have game changers that don't get used and at the end of the daythe turbo distillation is definitely a thing though, it's been used by SAI and BFL (Schnell), so improving that turbo method means that Schnell could've been closer to Flux Dev for example, imagine running a Flux dev like at 2 steps, the fucking dream man
>>102963390Dunno why you're seething so hard anon, just enjoy the pic for what it is
>>102963407Okay
>>102963413Thank you.
so gay stfu
>>102963306I don't really view ai "art" as art.
>>102963386alright will try, what "lyingsigma" value is recommended?
>>102963436
for me those values work great:
>dishonesty_factor: -0.05
>start_percent: 0.1
>end_percent: 0.9
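For anyone wondering what those three values do, here is a rough sketch of the idea as I understand it. During part of the sampling run, the model is told a slightly different noise level (sigma) than the real one. With a negative factor it is told a lower one, which is commonly said to leave more fine detail in the image. This is not the actual node's code. `reported_sigma` is a made-up name, and mapping the start/end percent onto a simple 0..1 step progress is an assumption made for the sketch.

```python
def reported_sigma(true_sigma, progress,
                   dishonesty_factor=-0.05,
                   start_percent=0.1,
                   end_percent=0.9):
    """Sigma value handed to the model at a given sampling progress (0.0 to 1.0).

    Inside the [start_percent, end_percent] window the value is scaled by
    (1 + dishonesty_factor); outside the window the true sigma is passed through.
    """
    if start_percent <= progress <= end_percent:
        return true_sigma * (1.0 + dishonesty_factor)
    return true_sigma


# Hypothetical, evenly spaced toy schedule just to show where the scaling applies.
schedule = [1.0, 0.8, 0.6, 0.4, 0.2]
for step, sigma in enumerate(schedule):
    progress = step / (len(schedule) - 1)
    print(step, sigma, reported_sigma(sigma, progress))
```

In this sketch only the value reported to the model changes. The schedule the sampler actually steps through stays the same.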
>>102963444thanks anon
>>102963386When will this get added to comfyui manager?
since there's been some newfound interest in schedulers and sigmas, https://github.com/Extraltodeus/sigmas_tools_and_the_golden_scheduler provides a "graph sigmas" node that will let you see... the graph. for comparisons sake of course.
>>102963498
>since there's been some newfound interest in schedulers and sigmas
I don't think people have any idea how much of an impact a good scheduler can have, for example karras completely destroys the image and beta really improves the prompt adherence. It has way more of an impact than a sampler and it's really easy to modify, so yeah, it should be explored way more and I'm glad it's starting to be acknowledged
>>102963386>you can use this modified script to get more decimals and go for that -0.05 valueIs that different than changing this setting in default Comfy?
>>102963517kek, I didn't know we had something like that by default
>>102963528>>102963517How would you use this, and are results identical?
>>102963495that's a good question, it's not hard at all to add your node to comfyui manager, so if someone is willing to do it, then do it
>>102963532looks halloween + card art
>>102963535I guess yeah, it lets you go for more decimals
>>102963386
I can't find it, searching for LyingSigmaSampler doesn't give me anything, and neither does searching custom samplers.
Where should it be? I put it in ComfyUI\custom_nodes, is it the wrong directory?
>>102963565
>I put it in ComfyUI\custom_nodes, is it the wrong directory?
no, that's the right directory, did you restart ComfyUI?
>>102963570
Yes I did.
Well I used the link from the reddit post instead, the one without the extra decimal precision, and it worked.
>>102963585cool, if you want more decimal precision go for that then >>102963517
incredibly_absurdres, absurdres, highres, traditional_media, non-web_source, original, official_art, commission, mixed-language_commentary, md5_mismatch, archived_source, bad_link, portrait, dark fantasy, sci-fi alien, orifices spilling out, new lava particles on chains, polished chrome, inhaling light, plumes of bitcrushed dithering smoke that rock to crack your glass brain,Joel Peter-Witkin, H. R. Giger, Dave McKean,
>>102960127thanks for sharing this cool robot prompt
>>102963386
This shit is fucking magic dude:
>It decreases the bokeh
>Adds more details
>Decreases the saturation/burning
>Makes the skin more detailed and less plastic
Damn dude
https://imgsli.com/MzExODg1
>>102963593htf do you use:>>102963517like I have zero clue.
>>102963565>>102963624whens the last time you updated? its in the settings
That was the worse one of the two sorry. Genmo seems to fall apart at the end of gens quite often I also am out of ideas for how to get teens, "teen" doesn't do anything and tween is too young and I'm tired of seeing kids so I guess I play "young girl" roulette
>>102963624click on the "nut" on the top right of ComfyUi to get the settings
>>102963593Yeah it's working fine.>>102963629Yesterday.
>>102963617the house looks better too
>>102963644Nice!
>>102963617https://imgsli.com/MzExODg3Look at the skin texture, it has way more imperfection, she really looks like a normal girl now (if you ignore the buttchin of course kek)
nightmare nightmare nightmare
>>102963639What number do I enter?
>>102963335>Left looks better than the right, however.I would like to examine your head
>>102963630wow that's creepy
>>102963730
no need to do that anymore, he just updated the script to get the decimals we want
https://www.reddit.com/r/comfyui/comments/1g9wfbq/comment/ltmlpic/?utm_source=share&utm_medium=web2x&context=3
>>102963335>Left looks better than the right
>>102963720You made me remember the repeated "horror" TTS lines from those Russian abandoned ship exploration videos kek.
>>102963761>>102963737Get your eyes checked.
>>102963788>Get your eyes checked.
>>102963737taking anons cranial measurements and writing them down in my tiny notebook
>>102963288
That is expected, as seen in the quants for FLUX. The point of using Q8 is that the quality should be better and closer to (B)F16 than using FP8. The downside is that you will take a speed hit, since there is no native hardware support to accelerate it, whereas FP8 has that in the RTX 4000 series.
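A rough sketch of why Q8_0 holds up so well: as far as I know the GGUF Q8_0 format splits weights into blocks of 32 and stores one scale per block plus an 8-bit integer per weight. The code below is a simplified pure-Python illustration of that scheme, not the real ggml implementation. It keeps the scale as a normal Python float where the real format stores it as fp16, and the function names and sample weights are made up.

```python
import math

BLOCK_SIZE = 32  # Q8_0 block size, as I understand the format


def quantize_q8_0(values):
    """Quantize a list of floats into (scale, int8 list) blocks."""
    blocks = []
    for start in range(0, len(values), BLOCK_SIZE):
        block = values[start:start + BLOCK_SIZE]
        amax = max(abs(v) for v in block)
        scale = amax / 127.0
        if scale == 0.0:
            quants = [0] * len(block)
        else:
            # Round to the nearest integer step and clamp to the int8 range used here.
            quants = [max(-127, min(127, round(v / scale))) for v in block]
        blocks.append((scale, quants))
    return blocks


def dequantize_q8_0(blocks):
    """Rebuild approximate floats from (scale, int8 list) blocks."""
    restored = []
    for scale, quants in blocks:
        restored.extend(scale * q for q in quants)
    return restored


# Made-up "weights" just to exercise the round trip.
weights = [0.1 * math.sin(i) for i in range(64)]
blocks = quantize_q8_0(weights)
restored = dequantize_q8_0(blocks)
worst = max(abs(w, ) if False else abs(w - r) for w, r in zip(weights, restored))
print("blocks:", len(blocks), "worst absolute error:", worst)
```

Each weight ends up at most half a quantization step away from its original value, and the step is set per block of 32 from that block's largest weight. FP8 (e4m3) only has 3 mantissa bits per value, which is probably why Q8_0 tends to land closer to the 16-bit original, at the cost of having to dequantize in software.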
>>102963750Does it work with ksampler?
>(masterpiece, photorealistic, slop:1.5) niggers fighting about which shitty 1slopgirl is better
>>102963813nope, that's why I'm not using Ksampler, there's a lot of custom nodes that can't work with it, using SamplerCustomAdvanced is much more flexible https://files.catbox.moe/a61zom.png
>quality slop thoever
Can Mochi do that? Kamala Harris word salad kek
>>102963617this doesn't really work with the turbo lora and there's no way I'm waiting 25 steps for each gen - completely takes the fun out of it
>>102963543>halloween
>>102963887>this doesn't really work with the turbo lorayeah that's intuitive, turbo doesn't have much steps to work with in the first place
>>102963781>Russian abandoned ship exploration videoswatAnyways, it seems like you need to boomer prompt and write a fuckton to get good stuff out of genmo. I used Nemotron 70b to merge the contents of two previous prompts and these are easily the best looking results I've gotten from a prompt. It almost feels like the shorter the prompt, the less confident the model feels about what to generate>In a sprawling, futuristic mansion that doubles as a high-tech workshop, vibrant neon lights dance across the walls and polished metallic surfaces. The air is alive with the soft hum of machinery and the faint glow of holographic blueprints hovering in mid-air. Amidst this dazzling backdrop, two young Russian girls, dressed in sleek, angled white plastic outfits, begin to move towards each other. As they meet in the center of the room, their eyes - glowing bright blue with an otherworldly intensity - lock onto the camera. One of them, in a whimsical touch, is suddenly enveloped by a delicate pink tutu, adding a layer of innocence to their otherwise futuristic, cybernetically enhanced appearance. The scene unfolds in slow motion, capturing every nuanced movement as the girls tilt their heads in unison, their gaze never wavering from the lens. Resolution: 4K, Slow Motion.I will see y'all in 6ish hours with hopefully higher quality videos now that I have figured this out
>>102963830>SamplerCustomAdvanced>mat1 and mat2 shapes cannot be multiplied (77x2048 and 4096x3072)idk why it's doing that.
>>102963908can you post the whole error? maybe it's one package that's in fault there
I know that anons must be posting the slimeslop AI girls because they like them, but I'm still a bit surprised every time they confidently assert this, even moreso when they say stuff like "it looks real to me".I'm not really an IQ and object rotation kind of guy but I have to wonder whether some anons naturally have a deeper impression of the images they see, and some a shallower one—if the "slime" is obvious and jarring to some, and "I only see it if I really look closely and it doesn't bother me" to others. It's not the only theory that can explain this, but the phenomenon does require some kind of explanation—why do anons differ so wildly on what they find acceptable, even 'realistic', in the degree of visible hallmarks of AI? If it's true that some people are, for whatever reason, less capable of 'parsing' visual information to form a clear and complex understanding of the images they see, then that's troubling. Or maybe it shouldn't be. Maybe it's just adding a pseudoscientific mental model over a phenomenon we're all familiar with already, which is that some people have bad taste.
>>102963908maybe because I have to use load diffusion model???
if you're into genning hardcore, bigasp v2.0 just dropped
>>102963916>bigasp v2.0link?
https://yhyun225.github.io/DiffuseHigh/https://github.com/blepping/comfyui_jankdiffusehighSD3.5 can't do higher resolutions than 1k, maybe this node could help?
>>102963914rlly makes an anon think dont it
>>102963921it's on civitai, it's an sdxl model
>>102963936here's the link for the lazy oneshttps://civitai.com/models/502468?modelVersionId=991916
>>102963913
figured it out. The DualCLIPLoader (I have to switch to it for what I have) was set to sdxl, not flux.
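The error message itself points at the cause. A matrix multiply needs the inner dimensions to match, and 2048 is not 4096. As far as I know, 2048 is the width of SDXL's two CLIP embeddings concatenated (768 + 1280) and 4096 is the width of the T5-XXL embeddings Flux expects, which fits the loader being set to the wrong type. Below is a minimal sketch of the shape rule. `matmul_shape` is a made-up helper that only imitates the wording of the error, and the 256-token sequence length is a hypothetical value.

```python
def matmul_shape(a_shape, b_shape):
    """Return the result shape of a 2D matrix multiply, or raise if the inner dims differ."""
    rows, inner_a = a_shape
    inner_b, cols = b_shape
    if inner_a != inner_b:
        raise ValueError(
            f"mat1 and mat2 shapes cannot be multiplied "
            f"({rows}x{inner_a} and {inner_b}x{cols})"
        )
    return (rows, cols)


# Conditioning with width 2048 hitting a 4096 -> 3072 projection, as in the error above.
try:
    matmul_shape((77, 2048), (4096, 3072))
except ValueError as error:
    print(error)

# Conditioning with width 4096 (hypothetical 256 tokens) goes through fine.
print(matmul_shape((256, 4096), (4096, 3072)))
```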
>>102963914Anyone that spams 1girls is mentally ill and likely below 100 IQ. It really is that simple.
>>102963952
oh ok, cool that you figured it out, personally I'm not using KSampler anymore, there's always a new node that appears and just improves Flux's sovl more and more kek
>>102963955So you're saying I have an IQ below 100.
>>102963995Yes because if you were smarter you wouldn't be so easily occupied with repetition.
>>102963916>>102963936>>102963943what's so special about it?
>>102964004but what if I'm just bored
>>102963906
>It almost feels like the shorter the prompt, the less confident the model feels about what to generate
looks like it's an issue with t5xxl, Flux has this issue as well, the output gives something more interesting if you go for word salad
>>102964011it's just really good at producing realistic looking porn. It does skin textures, genitalia, and sex positions really well
>>102964025that's weird that his images examples just show 1 porn image, the rest is just nudity
>>102963943>This experimental model was finetuned from base SDXL on almost 1.5 MILLION high quality photos for 30 million training samples.HOLY FUCK
>>102963914normies jeer at the idea of ai bros being discriminatory with their taste in slopsad world
>>102964023>It almost feels like the shorter the prompt, the less confident the model feels about what to generateShorter prompts are always going to be more open-ended just because they don't say as much, of course this would be the result. You can usually compensate with increased guidance if you think it's necessary.
23 fucking minutes and it's garbage!
>>102964103bf16?
>>102964103Same prompt, seed and steps just half the frames. Took 10 minutes, showed promise>>102964115fp8. I've only got 16gb vram
>>102964131
>fp8. I've only got 16gb vram
ditch fp8 and go for Q8_0, its quality is way closer to fp16 from the few tests I've done, will post the results later
https://huggingface.co/Kijai/Mochi_preview_comfy/blob/main/mochi_preview_dit_GGUF_Q8_0.safetensors
>>102964047wait for Sana, SDXL is such a cursed architecture
>>102964148
>wait for Sana
sana is DOA due to its completely shitty VAE >>102957943
>>102964166lmao works just fine for me, have fun with your monstrosities every gen because SDXL is ass
>we will never get 16ch 1.5
>>102964176I wouldn't mind if a better VAE would be created for Sana, definitely possible
not only is sage_att faster than flash_att, but it also allows you to go for bf16 + 61 frames on a 24gb card whereas flash attention overflows the card, sage is really impressive not gonna lie
>>102964191Sana looks fine especially with downsample workflow.
>>102964221>Sana looks fine>Goes for the most safe image ever, a close up of a humandude, try to go for something that really challenges the VAE, something like this, if it looks good then we're talking https://civitai.com/images/36316196
>>102964231With Sana you're not going to do 1K images, it's less relevant.
>>102964249I don't give a fuck about the resolution you're about to use to make the image, just try to get something similar, if you can't then it's fucking DOA, it's not rocket science
>>102964258You should give a fuck about the resolution given 4K is 16 times the pixels which means the VAE has to compress less. Anyways, I don't really give a fuck if you autistically can't use the model, I don't base my decisions on what you do. I tested the Sana demo it produces more than adequate images, it's that simple. You can stay autistically on raw VAE tests, it doesn't matter when in practice the VAE errors in Sana is negligible. You can wait for someone to do 10 million training steps on Flux or whatever.
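The "16 times the pixels" figure checks out if 4K means 4096x4096 against 1024x1024. The sketch below also shows what that does to the latent grid. The compression factors in it (32x per side for Sana's autoencoder, 8x per side for a typical SD/Flux-style VAE) are assumptions from what is commonly cited, not something stated in this thread, and both function names are made up.

```python
def pixel_ratio(big, small):
    """How many times more pixels the first (width, height) has than the second."""
    return (big[0] * big[1]) / (small[0] * small[1])


def latent_grid(width, height, spatial_compression):
    """Latent width and height for an autoencoder that shrinks each side by the given factor."""
    return (width // spatial_compression, height // spatial_compression)


print(pixel_ratio((4096, 4096), (1024, 1024)))

# Assumed factors: 32x per side for Sana's AE, 8x per side for a typical 8x VAE.
print(latent_grid(4096, 4096, 32))
print(latent_grid(1024, 1024, 8))
```

Under those assumed factors a 4K image through the heavier autoencoder gets the same latent grid as a 1K image through an 8x VAE, which is the gist of the argument for generating big and downsampling.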
>>102964273>it doesn't matter when in practice the VAE errors in Sana is negligible.prove it, you know what to do -> >>102964231
>>102964279And what? I don't need to prove to you that the 16channel VAE is better, that's not the point. You can wait for someone to do 10 million training steps on Flux or whatever.
>>102964311>And what?tf you mean "and what"? Why would someone play around with an inferior model?
>>102964321Because you can't train Flux but you can train Sana? People managed with mangled mid/far shot SDXL faces so far. You can wait for someone to do 10 million training steps on Flux or whatever. But I imagine that's going to be awhile.
>>102964334>Because you can't train FluxYes you can, the llm fags are finetuning 70b models by themselves via cloud computing and you believe the imagegen fags can't train a 12b model?
>>102964231NTA this is the best i could do with an inherently limited demo. i think its also important to recognize we are comparing a "beta" model with a "finetune" of one that's 10x larger or whatever that being said, im not enthused by the result original prompt is horrendous btw
>>102964346she's not that far to the camera and her eyes are already fucked, no amount of training is gonna fix that, it's the VAE's fault
>>102964344You're putting a lot of faith in someone putting $20k in.
>>102964353Good thing you can train AEs on images of people and eyes.
>>102964346also thats downscaled if it werent apparent >>102964353the model has the wonk as well but noticeably less so than pixart if training in a better VAE is "quick" then i wouldnt mind waiting but i dont think it would be
>>102964364>Good thing you can train AEs on images of people and eyes.can't wait to see you improve Sana's AE
>>102964231People seem to be enjoying that finetune, too bad he only uploaded the fp8 version, that's retarded, Q8_0 or bust
>>102964364
you cant actually, unless the same encoder as the one the model was trained with is used you are getting noise
and even then, either you give up the compression that made the model easier to train in the first place, or you get marginal improvements, or more probably you make it even worse
>>102964400actually you can, and it doesn't matter, they're already committed to improving it.
>>102964344nta but llm 70b roleplay finetunes and image model finetunes aren't really comparable, the two happen in completely different scales. for llms all they do is change the model's prose and behavior meanwhile for image gen you need to teach it a ton of new concepts and completely change how images look. it's why image model tunes are so much more expensive to train even though they are tiny
>>102964419
>they're already committed to improving it.
why the fuck are they so invested in making a shitty small VAE, it's not the part that eats the most VRAM, far from it, leave it alone, LEAVE BRITVAE ALONEEEEEEE
https://youtu.be/WqSTXuJeTks?t=183
>>102964447Because they reduced the total number of tokens which makes the model literally 8 times faster to run and train? Which also means you can, you know, make videos? Or 8K images? Or whatever the fuck you want when you have way less tokens required to make an image?
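A rough sketch of the token arithmetic behind this argument, assuming a DiT-style model that turns each patch of the latent grid into one token. The function name and the example settings (an 8x autoencoder with 2x2 patches versus a 32x autoencoder with 1x1 patches) are illustrative assumptions, not numbers taken from the thread, so the ratio printed here will not necessarily match the 8x figure claimed above.

```python
def latent_tokens(height: int, width: int, ae_downsample: int, patch_size: int) -> int:
    """Count the transformer tokens for one image.

    ae_downsample: spatial downsampling factor of the autoencoder (assumed value).
    patch_size: side length of the latent patch that becomes one token (assumed value).
    """
    stride = ae_downsample * patch_size  # image pixels covered by one token, per side
    if height % stride or width % stride:
        raise ValueError("image size must be divisible by ae_downsample * patch_size")
    return (height // stride) * (width // stride)


# Hypothetical comparison at 1024x1024.
baseline = latent_tokens(1024, 1024, ae_downsample=8, patch_size=2)
compressed = latent_tokens(1024, 1024, ae_downsample=32, patch_size=1)

print(baseline, compressed, baseline / compressed)
```

Under these assumed settings the compressed setup needs a quarter of the tokens. Self-attention cost grows roughly with the square of the token count, so the saving on the attention layers would be larger than the token ratio alone suggests.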
>>102964431it's still expensive as fuck to finetune a big llm model, I remember when Nous finetuned Mixtral (49b) it cost them tens of thousands of dollars
>>102964447
thats kinda a kicker because when i tested it, with their code, i got an oom on a 24gb card with a 1024x1024 image in fp32 and had to switch to a 48gb one since the ae models they use are huge
maybe the code is unoptimized but it was funny to me still
>>102964466for the moment it has no point because the quality is dog shit, the smart move would've been letting people chose between a good VAE, or that small optimised AE shit
bigasp works pretty well. It's not quite at the level of the hentai models like Pony, but of all the porn models I've tried I think this one's the best so far. Not much interested in that stuff though.
>>102964466
That's not even talking about a lot of the other things you can do when you have more headroom to work with, like training a model using perceptual loss or clip-based loss, both of which are VRAM intensive but give you way better results. The CLIP-based loss allegedly gives you 13x training speed for convergence. So now you're talking 8 times more efficient training multiplied with 13 times faster convergence.
>>102964482
No one fucking cares, you can use your extremely bloated model no one can train. You can talk all day about people training it but in reality it's not happening and you're certainly not going to see any projects like Pony or BigAsp on Flux.
>>102964474
yes and it cost tens of thousands of dollars to make pony for sdxl even though it's way way smaller (2.6b), like i said, it's pretty hard to make meaningful comparisons between the two
>>102964419i cant think of another team like that who actually requested feedback instead of going off what people are saying on leddit/xitter
this is bf16 kek, I noticed that the shorter the video is, the more glitches it has, fok
>>102964489
>No one fucking cares
you care because you're seething hard about it
>>102964493
The problem with these 8B and 12B models is you literally need H100s to train them. And most cloud providers prohibit training porn. Text you can skirt the rules, but images don't get the same grey pass.
>>102964504
Weird because you always seem to chime in about how much it sucks instead of ignoring the apparently irrelevant, shit model. Almost like you can tell it's going to be big.
>>102964511Yes anon, someone that isn't you is going to spend $20,000 or more training Flux, I can just feel it.
yo, don't even get me STARTED on mf pumpkin spice bro. who decided that fall needed a flavor?? like, homie, we already got cinnamon, nutmeg, spiritually confused gourds, but nah, let’s mix it all into some caffeinated demon sludge and charge $6 a cup. starbucks got us out here slurping the essence of autumn like it's some ancient rite of passage into basicness. i swear, it's like they summoned a whole legion of sweater-wearing clones outta the astral plane, all chanting “sksksk” like it’s some kind of harvest cult.
and don’t even act like they ain’t putting something sus in that spice mix. prolly got my third eye clogged with synthetic vibes. i drank one and now i’m getting ads for yoga mats and ugg boots like wtf??? i’m just out here tryna vibe in october, and instead i’m part of the pumpkin industrial complex. like who tf gave big pumpkin all this power??? fall used to be crunchy leaves and vibes, now it’s straight up latte warfare.
>>102964508
>instead of ignoring the apparently irrelevant, shit model.
you talk about Sana, we respond to it, and then you go pikachu surprised when we talk about it? No one is exempt from criticism anon, you can't avoid that, people will talk and give their opinion about whatever they want, and you won't do shit.
>>102964527
okay, I'll ignore you from now on
sana-samas....
>>102964521
>Yes anon, someone that isn't you is going to spend $20,000 or more training
I've heard that before pony appeared and made pony-v6 lmao
>>102964534Pony's rig can't even train Flux. You need 80GB GPUs, not 40GB. Also I don't know if you noticed, but Pony hasn't produced anything for awhile now.
>>102964526Please do us a fucking favor and spam in /sdg/
>>102964508
>And most cloud providers prohibit training porn. Text you can skirt the rules, but images don't get the same grey pass.
that's a fair point, I admit
>>102964544kinda like it here. might just settle in, make it cozy and shit.
>>102964508
thats some horseshit
"noob" sdxl anime finetune is fully nsfw and using 32xh100 from a cloud provider
pony is captioning 10m images on rented 70 a6000 and is obviously going to train (and has trained) on rented gpus
there is a trillion of people that have used vast or whatever garbage to train their shitty loras, and the larger nsfw finetunes obviously werent trained on local rtx3090
>>102964538>I don't know if you noticed, but Pony hasn't produced anything for awhile now.if one pony fag existed to do this, another one will come, it's more likely to be the case now than before when we were with the shitty SDXL base model on our hands and thought to ourselves it needed too much work to be saved
>>102964560Anon, I don't know if you noticed, but you can count the large scale finetunes that aren't incest merges on one hand.
>>102964570we just need one good shot to be saved
>>102964580When you talk about tens of thousands of dollars, you aren't going to have many bites that aren't business oriented. Your only hope is the 5090 can full finetune 8B/12B without gimmicks.
>>102964526I’d like to slurp the essence of autumn…Is she hot?
>>102964587And the elephant in the room is you need like 10 times the computing power to train Flux. You're going into a whole different realm of training going from SDXL to Flux. It's possible SD 3.5M will be the new finetune base because it's actually approachable.
New>>102964600>>102964600>>102964600
>>102964560
highly doubt they'll try something like that for flux unless they KNOW it's going to be viable. illustriousxl is the new pony replacer for anime and they only plan on training on sdxl, why? because they are sure it will work. flux is just too big to experiment with and risk losing money with no results to show. most people would probably wait for sd3.5 medium and see if that's any good. you're also forgetting that the main audience for these porn models are complete vramlets. have you seen the /h/ and /d/ ai threads? most of them still run 6-8gb cards
>>102964553
>make it cozy
With pumpkin spiced lattes?
>>102964619
>most people would probably wait for sd3.5 medium and see if that's any good.
this shit (2b) is smaller than SDXL (2.7b), it won't be good
>>102964591well, does beautiful mid count?>>102964620that will be $6, plus tip.>>102964604i'm glad to see even the epic serious discussion thread has it's own early baking trannies. all's right in the world, it seems that generals are generals and why should it be? that 4chan generals should be controlled by insane trannies?
>>102964640FUCK U 4CHANX YOU ARE SHIT
>>102964634
parameters aren't the be all end all metric
you do realize that shitty architectures can be bloated for parameters, right? learn the basics of machine learning sometime
>>102964634it's 2.5b and according to their blog post uses a different architecture compared to the 8b, so we'll see
it's an odd compulsion, likely some form of ptsd
>>102964645might as well use the very optimized 1.5 then
>>102964645
>you do realize that shitty architectures can be bloated for parameters, right? learn the basics of machine learning sometime
yeah sure thing jan, that's why OpenAI went for giant models like GPT4? I guess those multimillion per year paid researchers don't know this and are burning money for nothing, you should tell those experts anon, they are such noobs kek
>>102964496How many seconds at 24fps can I expect in fp8 with a 3090?
>>102964662>trust the science chud
>>102964662yeah so weird Florence 2 a tiny model is one of the best vlms, your theory is wrong
>>102964665
>How many seconds at 24fps can I expect in fp8 with a 3090?
361 frames -> 15 sec, but go for Q8_0 instead, it has better quality
>>102964674
compared to GPT4V (another giant model) this toy is complete dogshit
>>102964662nta but why are you comparing llms to image gen you dumbass nigga. predicting words is much more complicated compared to predicting pixels
>>102964681actually wrong, for the amount of efficiency Florence 2 is much, much better
>>102964681
>361 frames -> 15 sec, but go for Q8_0 instead, it has better quality
Same duration for Q8_0?
>>102964681>GPT4V (another giant model)how the fuck do you know???
>>102964692
>Same duration for Q8_0?
no, Q8_0 is a bit bigger, maybe you can get away with 13 seconds
>>102964704
thanks anon, will try it
hopefully I can get something out of this
>>102964646
Ah, behold! A portrait of such stupefying mediocrity, one can scarcely summon the will to comment, and yet duty compels me. Here we have the paragon of modern banality, a veritable shrine to Instagram-filtered, airbrushed superficiality. One could scarcely be blamed for mistaking this visage for the very archetype of "basic" itself, so blandly constructed as to be indistinguishable from the millions of similarly vacuous, copy-paste caricatures that plague the digital ether.
Indeed, this is not even "dogshit" in the classical sense, for dogshit at least possesses some raw, unpretentious authenticity. No, this is more akin to the synthetic substitute: artificial and bereft of any real substance. Congratulations, you've achieved aesthetic indistinction! A masterpiece of the most basic caliber. Truly... breathtaking, in the way a lukewarm cup of instant coffee might be if one had never known anything better.
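For anyone checking the numbers in this exchange, a minimal sketch of the frames/fps arithmetic, assuming a constant 24 fps as stated in the question. The helper names are made up for illustration, and any frame-count constraint a particular video model imposes is not accounted for.

```python
def frames_to_seconds(frames: int, fps: float = 24.0) -> float:
    """Clip length in seconds for a given frame count."""
    return frames / fps


def seconds_to_frames(seconds: float, fps: float = 24.0) -> int:
    """Whole frames that fit in the given duration (rounded down)."""
    return int(seconds * fps)


print(round(frames_to_seconds(361), 2))  # 15.04, i.e. the "15 sec" quoted above
print(seconds_to_frames(13))             # 312 frames for a 13 second clip
```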
>>102964805Thank you! I spent $15k just so you could give me lovely compliments.
>>102966148
>>102966205