Discussion of free and open source text-to-image modelsPreviously baked bread: >>103109699Serpents in the Dawn Edition>Beginner UIFooocus: https://github.com/lllyasviel/fooocusMetastable: https://metastable.studioEasyDiffusion: https://easydiffusion.github.io>Advanced UIForge: https://github.com/lllyasviel/stable-diffusion-webui-forgereForge: https://github.com/Panchovix/stable-diffusion-webui-reForgeAutomatic1111: https://github.com/automatic1111/stable-diffusion-webuiComfyUI: https://github.com/comfyanonymous/ComfyUIInvokeAI: https://github.com/invoke-ai/InvokeAISD.Next: https://github.com/vladmandic/automaticSwarmUI: https://github.com/mcmonkeyprojects/SwarmUI>Use a VAE if your images look washed outhttps://rentry.org/sdvae>Model Rankinghttps://imgsys.org/rankings>Models, LoRAs & traininghttps://aitracker.arthttps://huggingface.cohttps://civitai.comhttps://tensor.art/modelshttps://liblib.arthttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scripts>SD3.5L/Mhttps://huggingface.co/stabilityai/stable-diffusion-3.5-largehttps://replicate.com/stability-ai/stable-diffusion-3.5-largehttps://huggingface.co/stabilityai/stable-diffusion-3.5-mediumhttps://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium>Sanahttps://github.com/NVlabs/Sanahttps://sana-gen.mit.edu>Fluxhttps://huggingface.co/spaces/black-forest-labs/FLUX.1-schnellhttps://comfyanonymous.github.io/ComfyUI_examples/fluxDeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main>Index of guides and other toolshttps://rentry.org/sdg-linkhttps://rentry.org/rentrysd>Try online without registrationtxt2img: https://www.mage.spaceimg2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest>Related boards>>>/aco/sdg>>>/aco/aivg>>>/b/degen>>>/c/kdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/tg/slop>>>/trash/sdg>>>/u/udg>>>/vt/vtai
>>103122994awful taste
Blessed thread of frenship
the other thread has the words... *hurk*... flux support in the op image.. i think i'll stay here
SANA dropping soon?
SoonTM Feel Good Inc.
>>103123002>mad he's not in the collage
>>103123114Are you blind? It's an honour to get excluded.
>>103123148you seem pretty upset about it desu
>>103123180anon is just angry about everything, all the time
why does Comfyorg insist on making the ui shittier?
>>103123069Probably given they put up the license file.
>>103123193Because they cannot be bothered to focus on UX. Then again they're doing a lot in terms of being a good backend for others to make it into an actually usable frontend, so there's that, I guess it goes in spirit of the community effort behind local ai.
>>103123193Because ComfyAnon is one of those devs that both has extremely bad taste but is also extremely stubborn. Devs like him is why Blender was a pile of shit until someone was like "you know, left click is for selecting".
>>103123206oi mate do you have a loisence?
>>103123230aye, for stabbin' gits
the same lab that works on sana released a new quantization method for image models>SVDQuant is a post-training quantization technique for 4-bit weights and activations that well maintains visual fidelity. On 12B FLUX.1-dev, it achieves 3.6× memory reduction compared to the BF16 model. By eliminating CPU offloading, it offers 8.7× speedup over the 16-bit model when on a 16GB laptop 4090 GPU, 3× faster than the NF4 W4A16 baseline.they really like 16gb vram laptops>https://github.com/mit-han-lab/nunchaku
>>103123242Because they're one of the few devs that seems to actually give a shit about consumers using and training AI
>>103123242it's true that their 4bit quant pictures look close to the bf16 ones, I wonder if it's better than fp8 or Q8_0, would be cool if it's the case, I'd like to take that speed increase lol
>>103123259they once mentioned that they were working on a text to video model, so i wonder if the end goal of this lab's research with sana and this new quant method is a video model that works well on 16gb vram laptops
>>103123292It all combines into a model that can do longer sequences, faster. Video is basically unusable if you have to wait 5-10 minutes for a 2 second clip.
>>103123307at some point it'll be possible, but that'll be on another era when we'll find a better architecture than transformers that can make small models as good as fucking Sora, definitely possible, my guess is that we'll get this kind of shit in 3-4 years
i wonder what sana2 will be like
>>103123361Probably hitting the Flux quality standard. Sana seems to be a slightly better but much lighter weight SDXL and there's a market for that.
>>103123379>Probably hitting the Flux quality standard.I hope they'll ditch out the ultra compressed VAE and go for something of quality, a good VAE is really important to achieve a good quality picture, that can't be understated enough
>>103123399Oh fuck off
>>103123225Is it autism?
>>103123399nta but they mentioned that they wanted to make ultra compressed VAEs more popular, it's defienetly part of their goal to improve this technology so i highly doubt they'll ditch it
>>103123424No, some nerds for some reason decide being an unlikeable contrarian twat is actually good.
>>103123435>everyone that DARE to disagree with me is a contrarianOr maybe you just have shit takes and we're calling you out? How about that?
>>103123180kek, literally don't care about >le collage but it makes me laugh how jumpy you get when anyone criticizes it, also I've been featured many many times
>>103123430yeah I understand they want to reach that goal, but for me they should go for an uncompressed VAE on top of their compressed ones, so that at least we can have the option to choose the better quality one if we want
>>103123442No, ComfyUI has objective issues that ComfyAutist refuses to fix because he's a pretentious twat that can't take feedback. And anon, what you say doesn't matter to me because I've already been rewarded with a high paying job with influence so it turns out my opinion is worth a lot. :) Some kid using his mom's laptop should shut the fuck up.
>>103123486>I've already been rewarded with a high paying job with influence so it turns out my opinion is worth a lot.your opinion ain't worth shit, you're a nobody, don't expect people to suck your dick if you have retarded opinion, you're delusional
>>103123478That's not how any of this works you ignorant retard. Each VAE is it's own language, you can't just swap VAEs. A 16x VAE is fundamentally different than a 32x VAE because each will have completely different neural network activations to achieve their compression goals.
>>103123508that's why I suggested to make 2 VAE that'll work for Sana, one compressed and one not, you fucking 2 digit IQ monkey
So with the new CogVideoX, I have to ask, what's the state of lora training?I have six 4090s and a huge local porn collection I could extract clips from. I've been wanting to try a video finetune for a while. Last I looked into this, only CogVideoX-Fun supported multiple resolutions (so you could train on slightly lower than the max res) and quantization didn't work. There was some training script I tried but it was scuffed, and only the 2b model fit on a 4090 for training, and it was a pain to even use the loras. Are things improved? I feel like we must nearly be at the point where a halfway decent local porn video model is possible.
>>103123505Oh no, I don't care. I'm just saying why ComfyUI will be irrelevant the second a better UI is made because using ComfyUI is awful and the second an actual production, professional UI is made everyone will switch and ComfyAutist will become irrelevant once again.
>>103123516>bro they just train two models from scratch because I'm a faggot and I won't even use either of them because as you've noticed, I don't ever post images and if I did you'd know I'm the 1girl spammer
>>103123519>I'm just saying why ComfyUI will be irrelevant the second a better UI is madeit's been 2 years people have been saying that, I don't like ComfyUi's spaggheti shit either but he'll never lose relevancy, his ecosystem is too strong and advanced now
>>103123530I don't know if you've noticed, but no one posts in these threads because it turns out no one fucking cares about image AI.
>>103123527>I don't ever post imagesIndeed you don't post images, you just burned yourself on that one, are you that retarded?
last thread had great gens, especially the halberd girls
>>103123539>no image detected>>103123544shitty slop spam based on a shitty theme
>>103123536>no one posts in these threads because it turns out no one fucking cares about image AI.what are those?
>>103123549I know you're a moron but surely you can tell it's maximum 50 unique people participating across these threads. If there were global IDs it would be so grim seeing the same fag spamming in SDG is spamming in Degen.
>>103123548I respectfully disagree, they reminded me of the female centurion drawer, and I enjoyed the theme
>>103123561So even you don't care about image AI? :(
>>103123572I care but I'm not going to pretend we're not in the hobbyist programming on a Commodore 64 niche. We're well beyond early to the party, and that's why these threads are dead.
>>103123517the new CogVideoX supposedly supports any resolution
any 3.5 finetunes yet
>>103123594>the new CogVideoX supposedly supports any resolutionthe i2v one can, but the t2v one still has its resolution locked
>>103123280>really close quality to bf16>8.7x faster (from 111.7 s to 12.9s)oh boy I like where this is going
>>103123628>1360x768Goddamn, takes me like 8 minutes to get 3 seconds of footage from Mochi and that's only 480p
>>103123650yeah but mochi is a 10b model wheras CogVideoX is a 5b, so it'll be faster on Cog overall
https://tensor.art/models/792217506975595434?source_id=njq1pFzjlEOwpPEpaXny-xcuHow are things like this not a violation of the flux dev license?>finetune of flex dev>hosted on a generation website, need to pay for credits to use it>downloads disabledI thought the whole point of the license is to prevent people from hosting the model or a finetune as a service, not sharing the weights, and charging for access. But that appears to be exactly what this is.
>>103123628mmm if you finetune the i2v model with porn that could work too, it could be even better, i dunno, all i want is a i2v, its give the user more control
>>103123668>How are things like this not a violation of the flux dev license?I think they are violating the licence yeah, desu I wouldn't mind the BFL fags to force them to freely release the model so that we can enjoy them all, looks like they made a serious finetune out of it
>>103123530Professional software takes time to develop. Shitty browser app-layer script hell is not professional software. I'm waiting for Autodesk and SideFX to deliver something cool but it won't happen that soon.
Any anons got a good prompt to get pic related? Cant get it for the life of me for this Ghostface thing I'm trying to do
>>103123114So what if he is? Maybe his snubbed gen was good. (Unlikely.) Then he'd have good reason to be angry.
My lips are hot with desire to lay a special kiss upon a special lady. I wait for her to appear
Whats 1.5?
>>103123875First major Stable Diffusion model, a golden age of finetuning while at it.
>>103123885SD1.5 was genuinely a case when the stars were all aligned:- It was supposed to be cucked and lobotomized before the release by SAI but the Runaway chads decided to give them a middle finger and release the uncensored model anyway- Someone leaked a serious anime finetune made by NovelAI and we thrived off that
>>103123905And it was relatively lightweight with output good enough to be worth the bother of training on household toasters.
Give me your best 1girl
>>103124052you've set a very high bar with this one
>>103124052damnnn bruh, what video model you used for that kino?
>>103124052yeah, I'm gonna need a catbox for that. I don't believe it's a local gen.
>103124052>watermark blurred out in the bottom rightso its definitely not yours and probably not local, where did you find this?
>>103124120cheeky eagle eye bastard, I'm actually impressed
I am in the wrong thread, it is indeed not local, its runway, apologies
>>103124140it's all right, I wished we had a video model thread so that we could spam some Minimax shit in there
>>103124140>its runway, apologiesmy bet was (old) kling because theres not enough ghosting for it to be genmo (and its too good to be genmo) and minimax doesn't move like thatneat, i havent seen many runway gens but thats for the obivous reason that you need to pay and its super expensive>I am in the wrong threadthere isn't really a thread for videogens, this is probably the best one on /g/ for them. ive been posting video gens on genmo that i did on the website which is /ldg/-adjacent at best
>>103124195>there isn't really a thread for videogens, this is probably the best one on /g/ for them. ithere was a Minimax thread on /pol/, it was fucking amazing, dunno why they stopped it though
any flux finetunes yet
>>103124208>dunno why they stopped it thoughif it's anything like /aivg/ it was probably because minimax became unbearable with wait queues or a google signup or a paywall or something like thatsame reason why the LUMA threads a couple months ago died. people ran out of daily credits and that was that. same reason why I'm not posting more robot girls made with genmo (I finished my music video, but the threads are split so I'll post it next thread maybe)
>>103124247there's supposedly a new one but you can't download it lol >>103123668
>>103124195The best one would be /sdg/.
>>103124250fair enough, I feel like this thread with thrive off the new local video models from CogVideoX or Mochi, so far it doesn't reach Minimax quality but it will at some point, and when that'll be the case we'll have some real fun (let's also hope we'll be able to make quality videos without waiting an hour too lol)
>>103124276its a tossup. both threads should merge into just /dg/ or /dmg/ (Diffusion Models General) so we can have one place to discuss generative AI made with diffusers/DiT>>103124295these threads will truly start thriving once there's a retard-proof and unlimited way to get videos of pretty 1girls looking into the camera fliratiously. so maybe 2025>>103124295>let's also hope we'll be able to make quality videos without waiting an hour too loli think with a 5090 it'll go down to 15 minutes per gen locally on the HD version of genmo. hopefully there will be options that trade accuracy for speed for potatogenners as well
>>103124336>/dmg/ (Diffusion Models General)I vote for this, it's time to end the split.
remember that this would imply a merger with /de3/ as well
>>103124364>>103124348>remember that this would imply a merger with /de3/ as wellnot necessarily, we could go for local diffusion models only
>hurf derf i cant shit up /ldg/ on my own lets merge where 1girlspamslop is accepted guyz.No thanks femcel.
>87 text posts >10 images >single actual gen It's over
>>103124348lmao no, sdg fags can dieyou want dmg?go to sdg
>>103124348i remember when we first split from /sdg/ we were /idg/- image diffusion general or something and saas fags would invade us. local needs to be specified to keep the trolls away unfortunately
>>103124400>lmao no, sdg fags can diethis, if we made a scission that's for a reason
>>103124457wwwwwrRRRRAAAAAAAAAAGHHHHHHHHHHHHH! SAAAAAAAAAAAAAAAAAAASSSSSSSSSSSSS!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
>>103124052This isn't local but the image gen was>>103124195Can you run minimax (or something of similar quality) locally? I haven't looked into videogen at all
>>103124348There was never a split desu
>>103124451>/idg/Missed that part. Started baking threads and collages the moment previous og bakers died out.
>>103124521that's definitely a de3 image
nothing can keep trolls and 1girlspamslop awayat the end of the day, if the mods dont care and the animosity between /ldg/ and /sdg/ still exists then i guess we're staying separatetheres also edge cases like>>103124463>This isn't local but the image gen wasi think they should be allowed in /ldg/ personally (but I also post cloud genmo mochi gens in these threads so my opinion is biased)>Can you run minimax (or something of similar quality) locally? I haven't looked into videogen at allkinda, you can make 480p videos that are pretty good with mochi
>>103123575>We're well beyond early to the partyHow early in C64 dev were people making massive coin like how they are with AI currently?
I'm a simple anon, I care about generating pixels locally with free and open source models, and I want to post in threads that reflect it, rather than brand loyalty. Simple as.
>>103124463Nice, as were mine
>>103124580no one is making massive coin except maybe MJI bet Flux Pro isn't even close to breaking even
>>103124597I'm surprised API sites let you go with nudes pictures of women
>>103124580>How early in C64 dev were people making massive coin like how they are with AI currently?Jeff Minter: Minter became famous for creating psychedelic and highly addictive games for early platforms, including the Commodore 64. His games like Attack of the Mutant Camels and Gridrunner were very popular and sold well. Though Minter didn’t make millions, he was able to turn his indie game development into a profitable venture, inspiring others to become “bedroom coders.”also no one is making "massive coin" with AI, unless you consider 1k a month "massive">>103124603im surprised too, but im glad runway lets you do it since it costs a fuckton per gen
>>103124597Good stuff, which service is this?>>103124571I'll need to look into that
local models?
>>103124603They don't. In fact, Through some playtesting, I found myself convinced that they simply left nipples out of their entire training dataset.
>>103124631
>>103124451>i remember when we first split from /sdg/ we were /idg/- image diffusion general or somethingYou must have missed the part where there was no split, merely a rebranding. Some couldn't evolve with the times, others could. Simple as.
>>103124634Ah mb meant to add this one, repost
>>103124634this is not true for chinese modelsalso men have nipples, i'd be surprised if any big model has completely forgetten the concept for both gendersgenitals are obviously scrubbed
>>103124650Based but I'm hesitant to use anything Chinese
Anons had 2 years of using Stable Diffusion and finetunes to discover that cranking up the CFG scale too high caused weird clownish sameface, then when FLUX dropped they all set their guidance to 3.5, got bad results, and agreed together that there's something unavoidable called "FLUXface" which is a fundamental deficiency of FLUX—bad training data or something—nothing they can do except wait for a new model.
>le blurred to fuck micro gen realism man lecturing anyone
>>103124713>nothing they can do except wait for a new model.what's wrong with finetuning Flux so that it removes this bias? it would be way less expensive than pretraining a new base model again
>>103124730It's still a million steps on a large new dataset.
thx 4 tha bumps btw :3
>>103124697why? they literally cannot do anything to you since they're on the other side of the planet but the glowies on your side of the planet actually can
>>103124713>nothing they can do except wait for a new model.As anon said in the last blessed bread; most Flux users are retarded
official local model gens waiting room
>>103124756
>>103124386we live in a community
>>103124713It's also possible some have an actual agenda given where we are, and some are useful idiots that think repeating shitty ideas as if they were memes makes them fit in.>>103124755*most AI users
i don't have any problem with flux buttchin same face git gud
>>103124781kek
>>103124781You can literally tell that's a Flux gen just based on the general face. It's got that Dreamshaper inbreeding going on.
>>103124730I've never heard of a model becoming less prone to overcooking by finetuning it. Seems to me the simple fix is lower your guidance value.>>103124755Most users of all models, they're just loudest about Flux because it filters people for some reason>>103124781That's right.
>>103124792Still no porn Flux model because any real cooking kills the model.
>>103124623Minimax is fun man
>>103124804Still the only valid complaint about Flux which I will never dispute: it's one thing not to train on porn—I didn't always like the effect porn had on my SD1.5 gens—but to automatically filter any and all nude human bodies from the image data is a particular kind of 2024 insanity.
>>103124862>to automatically filter any and all nude human bodies from the image data is a particular kind of 2024 insanity.amen
so dedistilled is dead? no ones been able to properly train that thing?
>>103124765prompt?
>>103124862>Still the only valid complaint about Fluxwhat about the license of flux dev?
>>103124934You're really not going to see any training until people have 5090s. No one wants to waste $3000 on a experimental finetune.
>>103124944Nice.
>>103124939It's not something that affects me directly (maybe indirectly) so I haven't bothered to learn much about it. It seems like they're trying to figure out a way to make the model profitable for themselves while still being freely available for local use. I don't know what the knock-on effects of that will be. No strong opinion either way, but maybe I'll change my mind.
>>103124938probably samefag but if not, please don't interact with it. Go to the other thread to ask if you really must.
>>103124977it's retarded logic at the end of the day because the people who want cloud generation aren't the people who use local models, like piracy for video games, you gain much more from the word of mouth than trying to fuck everyone with aggressive DRMs
>>103124831What sort of restrictions? And if any, how difficult is it to get around them? Here's a magic trick for you by the way
>>103124729I've earned the right.
https://github.com/wangjiangshan0725/RF-Solver-Edit>We propose RF-Solver to solve the rectified flow ODE with less error, thus enhancing both sampling quality and inversion-reconstruction accuracy for rectified-flow-based generative models. Furthermore, we propose RF-Edit to leverage the RF-Solver for image and video editing tasks. Our methods achieve impressive performance on various tasks, including text-to-image generation, image/video inversion, and image/video editing.cool, when comfyUi?
I made this a few days ago... What do you all think? - DJL
>>103125579>What do you all think?that's pretty cool
>>103125548>when comfyUi?as soon as you finish coding it
>>103125579kino
So if I want to make anime pics I should get 1.5 instead of the current SD release?
What happened here?
>>103125970im waiting for bigma
>>103126086based Liz Vicious enjoyer
>>103124934Don't know about full finetunes, but I've trained my own niche porn loras on flux de-distill with a few thousand images. Works great, exactly like any normal undistilled model for both training and inference. Quality is better than the exact same thing done with standard flux dev, and you don't have to worry about any guidance nonsense or the lora training partially undistilling the model.
>>103125753>Dirlewanger Labsfucking kek
>>103126300The old world is dying, and the new world struggles to be born: now is the time of monsters.
>>103126252indeed
>>103126420>boob armor
I love LDG
>>103126426same
>>103126424Gotta protect your assets
>>103126466is this flux?
>>103126489It's NoobAI
>>103126532so its not flux?
>>103126532also how do I install this?can I run that in ComfyUI too?how babby get workflow?
>>103126595lol, you want me to bill you my consulting rate?
>>103126532Is that a Pony merge? I saw it on Civitai but haven't seen much out of it
>>103126622Anybody got a challenge/theme?
>>103126609no I expect you to tell me all the neccessary steps because of the goodness of your heart, empathy and free of charge so I too can generate cute anime babes like you.
no spoon feeding
>>103126725Prompt?
>>103126664Well, the best way to learn is by messing with it yourself. Here's all the help I'm going to give - https://files.catbox.moe/jsyhup.png
>>103126758This image is a digital cartoon drawing featuring two characters in a humorous and satirical context. On the left, a large, white, and featureless humanoid figure with a large, exaggeratedly angry expression and a wide, open mouth is pointing with its left hand. The figure has no discernible facial features, emphasizing its exaggerated anger. To the right, a smaller, similarly white, and featureless humanoid figure with a more neutral expression is looking slightly to the left, appearing to be listening to the larger figure. The smaller figure is wearing a blue shirt with a colorful cartoon character design on it, adding a playful element to the scene. Above the larger figure's head, a speech bubble reads, "YOU NEED TO KILL YOURSELF," in bold, red letters, with the word "kill" highlighted in black. To the right of the smaller figure, another speech bubble reads, "pls spoonfeed" in black text, suggesting a humorous contrast between the serious statement and the playful, cartoonish nature of the smaller figure's attire. The background is plain white, ensuring that all attention is focused on the characters and their dialogue.
>>103126635I want a very lean 1girl that still looks feminine. Like fat at 10% but not the ugly bodybuilder look, no male shoulders, no ballooning breasts, and not buff. All my attempts are oscillating between she-hulks and anorexia, can't make it lean-not-muscular. Kinda like that ginger chick from the Game of Thrones in that one scene where she shows her tummy to the know nothing guy, but like leaner.Focus on the abs, naturally, but fully body in the shot.
>>103126766I dont get it, why it do pic related?
>>103127053Clip skip needs to be -2
>>103127069>it actually workedare you a wizard?
>>103123242 >>103123280Looks like a great speed-quality tradeoff based on their demo website. Did someone implement this for comfyui?Also it will break the loras, I assume?
>>103127088No, I'm just not illiterate retard like you are.
>>103127114I'm glad at least one of us can read thanks for learning that shit bro what I'd do without you man.also HOLY smokes is this noobAI generating fast.its like 25 steps in 3 seconds.I only ever used Flux and its slow af compared to it.
>anons whove never used XL or 1.5 exist Incredible
>>103127121Yes the older model types are faster and also trained in more stuff (particularly lewds are still much better on these than on Flux1-D/S).
>>103127237>particularly lewdsI'm impressed with the accurate anus slips, cant post them here but wow man.
I'm just a coomer that wants to generate shiny anime titties but I don't know anything about tech...
>>103127373bro I'm literally illiterate and I figured out how
>>103127294Sure, some models -especially Pony derivatives- are pretty good at that.>>103127373Shouldn't be hard with the usual webuis
>>103127157not only do they exist, they're here in this thread giving out unsolicited advice
>>103127417plus people may help but it needs somewhat more specific questions
>>103127373>but I don't know anything about tech...well if you want to get into local diffusion you have first to figure out your computer specs.like what GPU do you have? how much vram? how much ram? etc.if you figure this out we can estimate what kinda models you are able to run with this machine.
>>103126994I put this in as a prompt and pic rel is what I got.
i've successfully generated dicks with imageFX.
>>103127939Nice! That's fun
>>103127682Nah more like
>>103127939
>>103127474jesus christ
>>103128358These are great anon.
>>103128523Thank you
huh
>>103127112>Did someone implement this for comfyui?not yet>Also it will break the loras, I assume?I hope nothttps://www.reddit.com/r/StableDiffusion/comments/1gmse2o/comment/lw6qaxl/?utm_source=share&utm_medium=web2x&context=3>About 2.5x faster(4.6it/s) than comfyui with --fast(2.11 it/s) on a 4090. Seems pretty great,that's really insane when you think about it
Have you guys seen this?>i2V with new CogX DimensionX Lorahttps://reddit.com/r/StableDiffusion/comments/1gms4q8/i2v_with_new_cogx_dimensionx_lora/
>>103128999kekhttps://xcancel.com/AIWarper/status/1854933007804592346#m
>>103128999>>103129049damn thats pretty good
https://github.com/THUDM/CogVideo/issues/471#issuecomment-2464837688>The peak is in the VAE part, not the transformer. The transformer part usually consumes 34G of video memory, while the peak of the VAE can reach 68G (1360 * 720)holy fuck it's fucking over, we'll never be able to use this shit
>>103129185Fug
>>103129185nah don't worry, we'll use the Q8_0 version of CogVideoX and we'll use tilted VAE also
>>103129217so everything will be alright???
>>103129300yeah we shouldn't worry about that, if kijai made it work on consumer grad GPUs with a 10b model (mochi), it'll be easy for him to do the same thing for CogVideoX-1.5-5b
>>103125788Try Illutrious based models
>>103129322so will it only work on 3090/4090s?
>>103129436Idk, we'll see how kijai's node will handle this model
>>103129185I love the constant flow of unfinished software...Chinks setting high standards in user gullibility.
>>103129690desu I like their approach, it's a long term approach, they shouldn't nerf their advancement because Nvdia decided to hold the world's balls with their greedy hands, it's Nvdia fault we're stuck at 24gb for 6 years straight, we can't move forward in AI if we don't have more vram, it's as simple as that, hardware should keep moving forward to help the software side and Nvdia is unwilling to do that, fuck those mf
>>103129702The price of 3gb/4gb vram chips is piss all, like what $20 each? If the 5090 isn't 48gb then Nvidia should burn to the ground.
>>103129717>If the 5090 isn't 48gb then Nvidia should burn to the ground.that won't happen, the best case scenario will be 32gb, and I'm being generous there, Nvdia knows how valuable vram is, and they also know that they're the only good GPU makers, everyone depend on them, so they're already seeling 48gb cards for fucking 5000 dollars, and they go for 10000 dollars if you want a 96gb card, when you're a monopoly you can do whatever you want, and I hate that
>>103129727>Remember when people started hijacking delivery trucks with Nvidia cards on them?https://www.nme.com/news/graphics-cards-stolen-in-truck-heist-resurface-at-vietnamese-retailer-3135039
>>103129779kek that put a smile on my face
>>103129779>stolen in California and re-appeared in Vietnampoetic justice
>>103129820>Would you like to talk about our comrade and savior, Ho Chi Minh?
>>103129727They really want this kind of future by producing low Vram cards for the poor, don't they.
>>103129849They are doing the smart business choice of milking the big businesses for all that they can while they can with the AI iron still hot. They can't do that if they start selling reasonably priced chips at the consumer grade level.
>>103129849there's a reason Nvdia is the most valuable company in the world right now, they're selling overpriced products and we have no other choice but to buy them, because they virtually have no serious rivals, that shit sucks ass man
Speaking of gpus, I wonder what cards they currently produce, and which have dropped out of production lines.
>>103129880But the poorfag gamer consumer market isnt even that profitable for Nvidia and they make all their money by selling these cards to the data centers.So it wouldnt be that big of a difference for them if they made the consumer cards with more Vram.they would still sell the expensive ones to the datacenters, like why not make another A100 but with 200GB VRAM and sell that for 10k?
>>103129898>So it wouldnt be that big of a difference for them if they made the consumer cards with more Vram.if they do that, data centers will be using consuemer cards to train their models instead of the overpriced entreprise cards
>>103129909>2.8 You agree that GeForce or Titan SOFTWARE: (i) is licensed for use only on GeForce or Titan hardware products you own, and (ii) is not licensed for datacenter deployment.>is not licensed for datacenter deployment.
>>103129690>I love the constant flow of unfinished softwareofcourse it's unfinished, everything is. ai is still in it's infancy and the community are the ones stuck doing the optimizations and figuring out how to run them on consumer hardware. it's the same story in text gen, llama.cpp is still unable to run multimodal models like pixtral and llama 3.2. it'll take years before companies actually start caring about the user experience in local, welcome to the bleeding edge
>>103130038how can they even enforce that? A data center can be anything, even I can make a data center on my home, it's not like Nvdia is looking at everyone's house, people were already doing that during the crypto era, tons of 3060 being piled up to mine some bitcoins
>>103130038it doesn't matter since consumers can stack consumer cards and rent them on sites like vast.ai for smaller projects like finetuning, they'll basically be undercutting themselves and nvidia doesn't want that
>>103129909>data centers will be using consuemer cards to train their modelsno they will buy the new datacenter cards which have even more VRAM.like why would you buy a 48GB vram card when you can buy a 200GB VRAM card?they also buy them in bulk and there is also license shit that prohibits datacenters from using the consumer cards.
>>103130071For starters, telemetry. Then, they're the manufacturer/retailer, the kind of customer we're talking about needs thousands of GPUs, you can't exactly pop down to Walmart to buy thousands of GPUs.>>103130074Even if consumer GPUs had 192GB they'd still only be used for smaller finetuning projects by consumers. Total FLOPS is arguably the bigger factor, foundational models still require thousands of GPUs.
>>103128999I like.
sovl
free site to enlarge / focus a photo ?
>>103128999Is this for the new Cog 1.5 model?
>>103122994When is this thing going to be able to do hands properly
>>103130441good hands come at a cost
>>103130441and the cost is inpaint elbow grease
Mochi Image Encode
>>103130239kek
why china not release good 16ch vae modal, did they lose interest? it's all video gen. even sana is just some research for a video gen model. why are the chinese obsessed with video gen?
>>103131322imggen market is saturated
>>103128999
sananana never ever
sana text to video before sana text to image
big if true
>>103131322Propaganda
>>103129690no one asked you to be here, come back in 10 years when Apple says the invented AI
>>103131350it obviously wasn't done when they announced it and they could be very likely training the VAE right now based on people's feedback
>>103131551cool style>>103131653nice
Went completely over that New Yorkers head.
i am carrot
>>103122994>Serpents in the Dawn Editionim too retarded to understand the reference
>
Youtubers now saying they have img2vid for MochiIt's all a complete liw of course, they are using vison models to interrogate an image, making a prompt, running Mochi and declaring it's "REAL img2vid! WAOW!" with lots of retards in the comments clapping their flippers like performing seals.So, avoid checking it out if it starts cropping up in feeds.
>>103130161>>103130432anon these are amazing, catbox pretty please??
>>103131718https://youtu.be/daGMULKNCME?si=qmD8WOf2QZyvUaNb
>>103131582>Reddit moment
>>103124834kisses, all my kisses for the anime girl
>>103132147>guys I'm mad that this experimental tech isn't like my iPhone
Straight out the oven:>>103132365>>103132365>>103132365
>>103132337what model is that?
>>103128971yup, looking forward to it