[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of free and open source text-to-image models

Previously baked bread: >>103109699

Serpents in the Dawn Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio
EasyDiffusion: https://easydiffusion.github.io

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>SD3.5L/M
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large
https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
>>103122994
awful taste
>>
Blessed thread of frenship
>>
the other thread has the words... *hurk*... flux support in the op image.. i think i'll stay here
>>
SANA dropping soon?
>>
SoonTM Feel Good Inc.
>>
>>103123002
>mad he's not in the collage
>>
>>103123114
Are you blind? It's an honour to get excluded.
>>
>>103123148
you seem pretty upset about it desu
>>
>>103123180
anon is just angry about everything, all the time
>>
why does Comfyorg insist on making the ui shittier?
>>
>>103123069
Probably given they put up the license file.
>>
>>103123193
Because they cannot be bothered to focus on UX. Then again they're doing a lot in terms of being a good backend for others to make it into an actually usable frontend, so there's that, I guess it goes in spirit of the community effort behind local ai.
>>
>>103123193
Because ComfyAnon is one of those devs that both has extremely bad taste but is also extremely stubborn. Devs like him is why Blender was a pile of shit until someone was like "you know, left click is for selecting".
>>
>>103123206
oi mate do you have a loisence?
>>
>>103123230
aye, for stabbin' gits
>>
the same lab that works on sana released a new quantization method for image models
>SVDQuant is a post-training quantization technique for 4-bit weights and activations that well maintains visual fidelity. On 12B FLUX.1-dev, it achieves 3.6× memory reduction compared to the BF16 model. By eliminating CPU offloading, it offers 8.7× speedup over the 16-bit model when on a 16GB laptop 4090 GPU, 3× faster than the NF4 W4A16 baseline.
they really like 16gb vram laptops
>https://github.com/mit-han-lab/nunchaku
>>
>>103123242
Because they're one of the few devs that seems to actually give a shit about consumers using and training AI
>>
File: 1716979841453550.jpg (1.68 MB, 3293x2120)
1.68 MB
1.68 MB JPG
>>103123242
it's true that their 4bit quant pictures look close to the bf16 ones, I wonder if it's better than fp8 or Q8_0, would be cool if it's the case, I'd like to take that speed increase lol
>>
>>103123259
they once mentioned that they were working on a text to video model, so i wonder if the end goal of this lab's research with sana and this new quant method is a video model that works well on 16gb vram laptops
>>
>>103123292
It all combines into a model that can do longer sequences, faster. Video is basically unusable if you have to wait 5-10 minutes for a 2 second clip.
>>
>>103123307
at some point it'll be possible, but that'll be on another era when we'll find a better architecture than transformers that can make small models as good as fucking Sora, definitely possible, my guess is that we'll get this kind of shit in 3-4 years
>>
i wonder what sana2 will be like
>>
File: tmpf_quv1za.png (2.58 MB, 1600x1120)
2.58 MB
2.58 MB PNG
>>
>>103123361
Probably hitting the Flux quality standard. Sana seems to be a slightly better but much lighter weight SDXL and there's a market for that.
>>
>>103123379
>Probably hitting the Flux quality standard.
I hope they'll ditch out the ultra compressed VAE and go for something of quality, a good VAE is really important to achieve a good quality picture, that can't be understated enough
>>
>>103123399
Oh fuck off
>>
>>103123225
Is it autism?
>>
>>103123399
nta but they mentioned that they wanted to make ultra compressed VAEs more popular, it's defienetly part of their goal to improve this technology so i highly doubt they'll ditch it
>>
>>103123424
No, some nerds for some reason decide being an unlikeable contrarian twat is actually good.
>>
>>103123435
>everyone that DARE to disagree with me is a contrarian
Or maybe you just have shit takes and we're calling you out? How about that?
>>
File: pooranon.png (18 KB, 477x269)
18 KB
18 KB PNG
>>103123180
kek, literally don't care about >le collage but it makes me laugh how jumpy you get when anyone criticizes it, also I've been featured many many times
>>
>>103123430
yeah I understand they want to reach that goal, but for me they should go for an uncompressed VAE on top of their compressed ones, so that at least we can have the option to choose the better quality one if we want
>>
>>103123442
No, ComfyUI has objective issues that ComfyAutist refuses to fix because he's a pretentious twat that can't take feedback. And anon, what you say doesn't matter to me because I've already been rewarded with a high paying job with influence so it turns out my opinion is worth a lot. :) Some kid using his mom's laptop should shut the fuck up.
>>
>>103123486
>I've already been rewarded with a high paying job with influence so it turns out my opinion is worth a lot.
your opinion ain't worth shit, you're a nobody, don't expect people to suck your dick if you have retarded opinion, you're delusional
>>
>>103123478
That's not how any of this works you ignorant retard. Each VAE is it's own language, you can't just swap VAEs. A 16x VAE is fundamentally different than a 32x VAE because each will have completely different neural network activations to achieve their compression goals.
>>
>>103123508
that's why I suggested to make 2 VAE that'll work for Sana, one compressed and one not, you fucking 2 digit IQ monkey
>>
So with the new CogVideoX, I have to ask, what's the state of lora training?

I have six 4090s and a huge local porn collection I could extract clips from. I've been wanting to try a video finetune for a while. Last I looked into this, only CogVideoX-Fun supported multiple resolutions (so you could train on slightly lower than the max res) and quantization didn't work. There was some training script I tried but it was scuffed, and only the 2b model fit on a 4090 for training, and it was a pain to even use the loras. Are things improved? I feel like we must nearly be at the point where a halfway decent local porn video model is possible.
>>
>>103123505
Oh no, I don't care. I'm just saying why ComfyUI will be irrelevant the second a better UI is made because using ComfyUI is awful and the second an actual production, professional UI is made everyone will switch and ComfyAutist will become irrelevant once again.
>>
>>103123516
>bro they just train two models from scratch because I'm a faggot and I won't even use either of them because as you've noticed, I don't ever post images and if I did you'd know I'm the 1girl spammer
>>
>>103123519
>I'm just saying why ComfyUI will be irrelevant the second a better UI is made
it's been 2 years people have been saying that, I don't like ComfyUi's spaggheti shit either but he'll never lose relevancy, his ecosystem is too strong and advanced now
>>
>>103123530
I don't know if you've noticed, but no one posts in these threads because it turns out no one fucking cares about image AI.
>>
>>103123527
>I don't ever post images
Indeed you don't post images, you just burned yourself on that one, are you that retarded?
>>
last thread had great gens, especially the halberd girls
>>
>>103123539
>no image detected

>>103123544
shitty slop spam based on a shitty theme
>>
File: 1725562529070500.png (45 KB, 523x666)
45 KB
45 KB PNG
>>103123536
>no one posts in these threads because it turns out no one fucking cares about image AI.
what are those?
>>
>>103123549
I know you're a moron but surely you can tell it's maximum 50 unique people participating across these threads. If there were global IDs it would be so grim seeing the same fag spamming in SDG is spamming in Degen.
>>
>>103123548
I respectfully disagree, they reminded me of the female centurion drawer, and I enjoyed the theme
>>
>>103123561
So even you don't care about image AI? :(
>>
>>103123572
I care but I'm not going to pretend we're not in the hobbyist programming on a Commodore 64 niche. We're well beyond early to the party, and that's why these threads are dead.
>>
>>103123517
the new CogVideoX supposedly supports any resolution
>>
any 3.5 finetunes yet
>>
File: 1720877807219434.png (84 KB, 1051x853)
84 KB
84 KB PNG
>>103123594
>the new CogVideoX supposedly supports any resolution
the i2v one can, but the t2v one still has its resolution locked
>>
File: 1712039525859251.png (51 KB, 680x497)
51 KB
51 KB PNG
>>103123280
>really close quality to bf16
>8.7x faster (from 111.7 s to 12.9s)
oh boy I like where this is going
>>
>>103123628
>1360x768
Goddamn, takes me like 8 minutes to get 3 seconds of footage from Mochi and that's only 480p
>>
>>103123650
yeah but mochi is a 10b model wheras CogVideoX is a 5b, so it'll be faster on Cog overall
>>
https://tensor.art/models/792217506975595434?source_id=njq1pFzjlEOwpPEpaXny-xcu

How are things like this not a violation of the flux dev license?
>finetune of flex dev
>hosted on a generation website, need to pay for credits to use it
>downloads disabled

I thought the whole point of the license is to prevent people from hosting the model or a finetune as a service, not sharing the weights, and charging for access. But that appears to be exactly what this is.
>>
>>103123628
mmm if you finetune the i2v model with porn that could work too, it could be even better, i dunno, all i want is a i2v, its give the user more control
>>
>>103123668
>How are things like this not a violation of the flux dev license?
I think they are violating the licence yeah, desu I wouldn't mind the BFL fags to force them to freely release the model so that we can enjoy them all, looks like they made a serious finetune out of it
>>
>>103123530
Professional software takes time to develop. Shitty browser app-layer script hell is not professional software.
I'm waiting for Autodesk and SideFX to deliver something cool but it won't happen that soon.
>>
Any anons got a good prompt to get pic related? Cant get it for the life of me for this Ghostface thing I'm trying to do
>>
>>103123114
So what if he is? Maybe his snubbed gen was good. (Unlikely.) Then he'd have good reason to be angry.
>>
My lips are hot with desire to lay a special kiss upon a special lady. I wait for her to appear
>>
File: GV6l-LdaQAAoNqQ.jpg (1.15 MB, 2500x4085)
1.15 MB
1.15 MB JPG
Whats 1.5?
>>
>>103123875
First major Stable Diffusion model, a golden age of finetuning while at it.
>>
>>103123885
SD1.5 was genuinely a case when the stars were all aligned:
- It was supposed to be cucked and lobotomized before the release by SAI but the Runaway chads decided to give them a middle finger and release the uncensored model anyway
- Someone leaked a serious anime finetune made by NovelAI and we thrived off that
>>
>>103123905
And it was relatively lightweight with output good enough to be worth the bother of training on household toasters.
>>
File: oobah.webm (2.33 MB, 720x1280)
2.33 MB
2.33 MB WEBM
Give me your best 1girl
>>
>>103124052
you've set a very high bar with this one
>>
>>103124052
damnnn bruh, what video model you used for that kino?
>>
>>103124052
yeah, I'm gonna need a catbox for that. I don't believe it's a local gen.
>>
>103124052
>watermark blurred out in the bottom right
so its definitely not yours and probably not local, where did you find this?
>>
>>103124120
cheeky eagle eye bastard, I'm actually impressed
>>
I am in the wrong thread, it is indeed not local, its runway, apologies
>>
>>103124140
it's all right, I wished we had a video model thread so that we could spam some Minimax shit in there
>>
>>103124140
>its runway, apologies
my bet was (old) kling because theres not enough ghosting for it to be genmo (and its too good to be genmo) and minimax doesn't move like that
neat, i havent seen many runway gens but thats for the obivous reason that you need to pay and its super expensive

>I am in the wrong thread
there isn't really a thread for videogens, this is probably the best one on /g/ for them. ive been posting video gens on genmo that i did on the website which is /ldg/-adjacent at best
>>
>>103124195
>there isn't really a thread for videogens, this is probably the best one on /g/ for them. i
there was a Minimax thread on /pol/, it was fucking amazing, dunno why they stopped it though
>>
any flux finetunes yet
>>
>>103124208
>dunno why they stopped it though
if it's anything like /aivg/ it was probably because minimax became unbearable with wait queues or a google signup or a paywall or something like that
same reason why the LUMA threads a couple months ago died. people ran out of daily credits and that was that. same reason why I'm not posting more robot girls made with genmo (I finished my music video, but the threads are split so I'll post it next thread maybe)
>>
>>103124247
there's supposedly a new one but you can't download it lol >>103123668
>>
>>103124195
The best one would be /sdg/.
>>
>>103124250
fair enough, I feel like this thread with thrive off the new local video models from CogVideoX or Mochi, so far it doesn't reach Minimax quality but it will at some point, and when that'll be the case we'll have some real fun (let's also hope we'll be able to make quality videos without waiting an hour too lol)
>>
>>103124276
its a tossup. both threads should merge into just /dg/ or /dmg/ (Diffusion Models General) so we can have one place to discuss generative AI made with diffusers/DiT

>>103124295
these threads will truly start thriving once there's a retard-proof and unlimited way to get videos of pretty 1girls looking into the camera fliratiously. so maybe 2025

>>103124295
>let's also hope we'll be able to make quality videos without waiting an hour too lol
i think with a 5090 it'll go down to 15 minutes per gen locally on the HD version of genmo. hopefully there will be options that trade accuracy for speed for potatogenners as well
>>
>>103124336
>/dmg/ (Diffusion Models General)
I vote for this, it's time to end the split.
>>
remember that this would imply a merger with /de3/ as well
>>
>>103124364
>>103124348
>remember that this would imply a merger with /de3/ as well
not necessarily, we could go for local diffusion models only
>>
>hurf derf i cant shit up /ldg/ on my own lets merge where 1girlspamslop is accepted guyz.

No thanks femcel.
>>
>87 text posts
>10 images
>single actual gen
It's over
>>
>>103124348
lmao no, sdg fags can die
you want dmg?
go to sdg
>>
>>103124348
i remember when we first split from /sdg/ we were /idg/- image diffusion general or something and saas fags would invade us. local needs to be specified to keep the trolls away unfortunately
>>
>>
>>103124400
>lmao no, sdg fags can die
this, if we made a scission that's for a reason
>>
>>103124457
wwwwwrRRRRAAAAAAAAAAGHHHHHHHHHHHHH! SAAAAAAAAAAAAAAAAAAASSSSSSSSSSSSS!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
>>
File: 311107456007602183.webm (1.17 MB, 720x1072)
1.17 MB
1.17 MB WEBM
>>103124052
This isn't local but the image gen was

>>103124195
Can you run minimax (or something of similar quality) locally? I haven't looked into videogen at all
>>
>>103124348
There was never a split desu
>>
File: ComfyUI_03315_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>103124451
>/idg/
Missed that part. Started baking threads and collages the moment previous og bakers died out.
>>
File: 221a.jpg (53 KB, 744x1000)
53 KB
53 KB JPG
>>
>>
>>103124521
that's definitely a de3 image
>>
nothing can keep trolls and 1girlspamslop away
at the end of the day, if the mods dont care and the animosity between /ldg/ and /sdg/ still exists then i guess we're staying separate

theres also edge cases like
>>103124463
>This isn't local but the image gen was
i think they should be allowed in /ldg/ personally (but I also post cloud genmo mochi gens in these threads so my opinion is biased)
>Can you run minimax (or something of similar quality) locally? I haven't looked into videogen at all
kinda, you can make 480p videos that are pretty good with mochi
>>
>>103123575
>We're well beyond early to the party
How early in C64 dev were people making massive coin like how they are with AI currently?
>>
I'm a simple anon, I care about generating pixels locally with free and open source models, and I want to post in threads that reflect it, rather than brand loyalty. Simple as.
>>
File: ooba2.webm (1.83 MB, 720x1280)
1.83 MB
1.83 MB WEBM
>>103124463
Nice, as were mine
>>
>>103124580
no one is making massive coin except maybe MJ
I bet Flux Pro isn't even close to breaking even
>>
File: ComfyUI_temp_hbzlp_00005_.png (3.72 MB, 1536x1920)
3.72 MB
3.72 MB PNG
>>
>>103124597
I'm surprised API sites let you go with nudes pictures of women
>>
>>103124580
>How early in C64 dev were people making massive coin like how they are with AI currently?
Jeff Minter: Minter became famous for creating psychedelic and highly addictive games for early platforms, including the Commodore 64. His games like Attack of the Mutant Camels and Gridrunner were very popular and sold well. Though Minter didn’t make millions, he was able to turn his indie game development into a profitable venture, inspiring others to become “bedroom coders.”

also no one is making "massive coin" with AI, unless you consider 1k a month "massive"

>>103124603
im surprised too, but im glad runway lets you do it since it costs a fuckton per gen
>>
>>103124597
Good stuff, which service is this?

>>103124571
I'll need to look into that
>>
local models?
>>
>>103124603
They don't. In fact, Through some playtesting, I found myself convinced that they simply left nipples out of their entire training dataset.
>>
File: deBO_00031_.png (1.35 MB, 1728x1344)
1.35 MB
1.35 MB PNG
>>103124631
>>
>>103124451
>i remember when we first split from /sdg/ we were /idg/- image diffusion general or something
You must have missed the part where there was no split, merely a rebranding. Some couldn't evolve with the times, others could. Simple as.
>>
File: theydont.webm (3.91 MB, 1280x720)
3.91 MB
3.91 MB WEBM
>>103124634
Ah mb meant to add this one, repost
>>
>>103124634
this is not true for chinese models
also men have nipples, i'd be surprised if any big model has completely forgetten the concept for both genders
genitals are obviously scrubbed
>>
File: ep1.webm (2.35 MB, 1280x720)
2.35 MB
2.35 MB WEBM
>>103124650
Based but I'm hesitant to use anything Chinese
>>
Anons had 2 years of using Stable Diffusion and finetunes to discover that cranking up the CFG scale too high caused weird clownish sameface, then when FLUX dropped they all set their guidance to 3.5, got bad results, and agreed together that there's something unavoidable called "FLUXface" which is a fundamental deficiency of FLUX—bad training data or something—nothing they can do except wait for a new model.
>>
>le blurred to fuck micro gen realism man lecturing anyone
>>
>>103124713
>nothing they can do except wait for a new model.
what's wrong with finetuning Flux so that it removes this bias? it would be way less expensive than pretraining a new base model again
>>
>>103124730
It's still a million steps on a large new dataset.
>>
thx 4 tha bumps btw :3
>>
>>103124697
why? they literally cannot do anything to you since they're on the other side of the planet but the glowies on your side of the planet actually can
>>
File: 1730854447783.webm (1.19 MB, 720x720)
1.19 MB
1.19 MB WEBM
>>
>>103124713
>nothing they can do except wait for a new model.
As anon said in the last blessed bread; most Flux users are retarded
>>
official local model gens waiting room
>>
File: dena_00065_.png (1.98 MB, 1728x1344)
1.98 MB
1.98 MB PNG
>>103124756
>>
>>103124386
we live in a community
>>
>>103124713
It's also possible some have an actual agenda given where we are, and some are useful idiots that think repeating shitty ideas as if they were memes makes them fit in.

>>103124755
*most AI users
>>
File: ComfyUI_00107_.png (48 KB, 240x240)
48 KB
48 KB PNG
i don't have any problem with flux buttchin same face git gud
>>
>>103124781
kek
>>
>>103124781
You can literally tell that's a Flux gen just based on the general face. It's got that Dreamshaper inbreeding going on.
>>
>>103124730
I've never heard of a model becoming less prone to overcooking by finetuning it. Seems to me the simple fix is lower your guidance value.

>>103124755
Most users of all models, they're just loudest about Flux because it filters people for some reason

>>103124781
That's right.
>>
>>103124792
Still no porn Flux model because any real cooking kills the model.
>>
File: 311380262398144521.webm (583 KB, 720x1072)
583 KB
583 KB WEBM
>>103124623
Minimax is fun man
>>
File: ComfyUI_03334_.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>
>>103124804
Still the only valid complaint about Flux which I will never dispute: it's one thing not to train on porn—I didn't always like the effect porn had on my SD1.5 gens—but to automatically filter any and all nude human bodies from the image data is a particular kind of 2024 insanity.
>>
>>103124862
>to automatically filter any and all nude human bodies from the image data is a particular kind of 2024 insanity.
amen
>>
so dedistilled is dead? no ones been able to properly train that thing?
>>
>>103124765
prompt?
>>
>>103124862
>Still the only valid complaint about Flux
what about the license of flux dev?
>>
File: 1730877458310.webm (3.11 MB, 720x720)
3.11 MB
3.11 MB WEBM
>>
>>103124934
You're really not going to see any training until people have 5090s. No one wants to waste $3000 on a experimental finetune.
>>
>>103124944
Nice.
>>
>>103124939
It's not something that affects me directly (maybe indirectly) so I haven't bothered to learn much about it. It seems like they're trying to figure out a way to make the model profitable for themselves while still being freely available for local use. I don't know what the knock-on effects of that will be. No strong opinion either way, but maybe I'll change my mind.
>>
>>103124938
probably samefag but if not, please don't interact with it. Go to the other thread to ask if you really must.
>>
>>103124977
it's retarded logic at the end of the day because the people who want cloud generation aren't the people who use local models, like piracy for video games, you gain much more from the word of mouth than trying to fuck everyone with aggressive DRMs
>>
File: tmpnuvbhxat.png (1.2 MB, 1280x768)
1.2 MB
1.2 MB PNG
>>
File: majicelf.webm (1.89 MB, 720x1280)
1.89 MB
1.89 MB WEBM
>>103124831
What sort of restrictions? And if any, how difficult is it to get around them? Here's a magic trick for you by the way
>>
File: dech_00095_.png (1.96 MB, 1728x1344)
1.96 MB
1.96 MB PNG
>>
>>103124729
I've earned the right.
>>
File: 1704541643943630.png (1.75 MB, 1451x842)
1.75 MB
1.75 MB PNG
https://github.com/wangjiangshan0725/RF-Solver-Edit
>We propose RF-Solver to solve the rectified flow ODE with less error, thus enhancing both sampling quality and inversion-reconstruction accuracy for rectified-flow-based generative models. Furthermore, we propose RF-Edit to leverage the RF-Solver for image and video editing tasks. Our methods achieve impressive performance on various tasks, including text-to-image generation, image/video inversion, and image/video editing.
cool, when comfyUi?
>>
File: 1719044893310484.webm (366 KB, 480x480)
366 KB
366 KB WEBM
I made this a few days ago... What do you all think?

- DJL
>>
File: 1718439689741308.png (89 KB, 640x360)
89 KB
89 KB PNG
>>103125579
>What do you all think?
that's pretty cool
>>
>>103125548
>when comfyUi?
as soon as you finish coding it
>>
File: ComfyUI_01251_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>103125579
kino
>>
File: Ga_d7gubwAAyqOW.jpg (294 KB, 1048x1800)
294 KB
294 KB JPG
So if I want to make anime pics I should get 1.5 instead of the current SD release?
>>
What happened here?
>>
>>103125970
im waiting for bigma
>>
File: ComfyUI_03351_.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>
File: ComfyUI_02667_.png (2.72 MB, 1280x1920)
2.72 MB
2.72 MB PNG
>>
>>103126086
based Liz Vicious enjoyer
>>
File: ComfyUI_03360_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>
>>103124934
Don't know about full finetunes, but I've trained my own niche porn loras on flux de-distill with a few thousand images. Works great, exactly like any normal undistilled model for both training and inference. Quality is better than the exact same thing done with standard flux dev, and you don't have to worry about any guidance nonsense or the lora training partially undistilling the model.
>>
>>103125753
>Dirlewanger Labs
fucking kek
>>
>>103126300
The old world is dying, and the new world struggles to be born: now is the time of monsters.
>>
File: ComfyUI_03365_.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
>>103126252
indeed
>>
>>
>>103126420
>boob armor
>>
I love LDG
>>
File: ComfyUI_03293_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>103126426
same
>>
>>103126424
Gotta protect your assets
>>
>>103126466
is this flux?
>>
>>103126489
It's NoobAI
>>
>>103126532
so its not flux?
>>
>>103126532
also how do I install this?
can I run that in ComfyUI too?
how babby get workflow?
>>
>>103126595
lol, you want me to bill you my consulting rate?
>>
>>103126532
Is that a Pony merge? I saw it on Civitai but haven't seen much out of it
>>
>>103126622
Anybody got a challenge/theme?
>>
File: drchud.png (272 KB, 500x500)
272 KB
272 KB PNG
>>103126609
no I expect you to tell me all the neccessary steps because of the goodness of your heart, empathy and free of charge so I too can generate cute anime babes like you.
>>
no spoon feeding
>>
File: ComfyUI_03376_.png (783 KB, 1024x1024)
783 KB
783 KB PNG
>>
>>103126725
Prompt?
>>
>>103126664
Well, the best way to learn is by messing with it yourself. Here's all the help I'm going to give - https://files.catbox.moe/jsyhup.png
>>
File: ComfyUI_03375_.png (577 KB, 1024x1024)
577 KB
577 KB PNG
>>103126758
This image is a digital cartoon drawing featuring two characters in a humorous and satirical context. On the left, a large, white, and featureless humanoid figure with a large, exaggeratedly angry expression and a wide, open mouth is pointing with its left hand. The figure has no discernible facial features, emphasizing its exaggerated anger. To the right, a smaller, similarly white, and featureless humanoid figure with a more neutral expression is looking slightly to the left, appearing to be listening to the larger figure. The smaller figure is wearing a blue shirt with a colorful cartoon character design on it, adding a playful element to the scene.

Above the larger figure's head, a speech bubble reads, "YOU NEED TO KILL YOURSELF," in bold, red letters, with the word "kill" highlighted in black. To the right of the smaller figure, another speech bubble reads, "pls spoonfeed" in black text, suggesting a humorous contrast between the serious statement and the playful, cartoonish nature of the smaller figure's attire. The background is plain white, ensuring that all attention is focused on the characters and their dialogue.
>>
File: RA_NB1_00007_.jpg (1.27 MB, 2808x1920)
1.27 MB
1.27 MB JPG
>>
>>103126635
I want a very lean 1girl that still looks feminine. Like fat at 10% but not the ugly bodybuilder look, no male shoulders, no ballooning breasts, and not buff. All my attempts are oscillating between she-hulks and anorexia, can't make it lean-not-muscular. Kinda like that ginger chick from the Game of Thrones in that one scene where she shows her tummy to the know nothing guy, but like leaner.
Focus on the abs, naturally, but fully body in the shot.
>>
File: ComfyUI_03377_.png (1.85 MB, 1152x896)
1.85 MB
1.85 MB PNG
>>103126766
I dont get it, why it do pic related?
>>
File: RA_NB1_00009_.jpg (738 KB, 1920x2808)
738 KB
738 KB JPG
>>
>>103127053
Clip skip needs to be -2
>>
>>103127069
>it actually worked
are you a wizard?
>>
>>103123242 >>103123280
Looks like a great speed-quality tradeoff based on their demo website. Did someone implement this for comfyui?

Also it will break the loras, I assume?
>>
>>103127088
No, I'm just not illiterate retard like you are.
>>
File: ComfyUI_03395_.png (1.54 MB, 1152x896)
1.54 MB
1.54 MB PNG
>>103127114
I'm glad at least one of us can read thanks for learning that shit bro what I'd do without you man.
also HOLY smokes is this noobAI generating fast.
its like 25 steps in 3 seconds.
I only ever used Flux and its slow af compared to it.
>>
>anons whove never used XL or 1.5 exist
Incredible
>>
File: RA_NB1_00011_.jpg (841 KB, 1920x2808)
841 KB
841 KB JPG
>>
>>103127121
Yes the older model types are faster and also trained in more stuff (particularly lewds are still much better on these than on Flux1-D/S).
>>
>>
>>103127237
>particularly lewds
I'm impressed with the accurate anus slips, cant post them here but wow man.
>>
File: RA_NB1_00013_.jpg (1.3 MB, 1920x2808)
1.3 MB
1.3 MB JPG
>>
File: GU9W0K7a4AAyGM4.jpg (762 KB, 1200x1600)
762 KB
762 KB JPG
I'm just a coomer that wants to generate shiny anime titties but I don't know anything about tech...
>>
>>103127373
bro I'm literally illiterate and I figured out how
>>
>>103127294
Sure, some models -especially Pony derivatives- are pretty good at that.

>>103127373
Shouldn't be hard with the usual webuis
>>
>>103127157
not only do they exist, they're here in this thread giving out unsolicited advice
>>
File: RA_NB1_00015_.jpg (919 KB, 1920x2808)
919 KB
919 KB JPG
>>
File: f.png (1.3 MB, 832x1216)
1.3 MB
1.3 MB PNG
>>103127417
plus people may help but it needs somewhat more specific questions
>>
>>103127373
>but I don't know anything about tech...
well if you want to get into local diffusion you have first to figure out your computer specs.
like what GPU do you have? how much vram? how much ram? etc.

if you figure this out we can estimate what kinda models you are able to run with this machine.
>>
File: ComfyUI_03389_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>103126994
I put this in as a prompt and pic rel is what I got.
>>
i've successfully generated dicks with imageFX.
>>
>>103127939
Nice! That's fun
>>
File: 1612008855505.png (1.1 MB, 832x1216)
1.1 MB
1.1 MB PNG
>>103127682
Nah more like
>>
>>
File: aseet.jpg (20 KB, 542x375)
20 KB
20 KB JPG
>>103127939
>>
File: 1707137395499953.jpg (1.07 MB, 3024x1728)
1.07 MB
1.07 MB JPG
>>
File: 1709469174144381.png (2.26 MB, 1536x1536)
2.26 MB
2.26 MB PNG
>>
>>103127474
jesus christ
>>
File: 1712247161804217.png (1.76 MB, 1536x1536)
1.76 MB
1.76 MB PNG
>>
File: 1715950380250443.png (1.3 MB, 1536x1536)
1.3 MB
1.3 MB PNG
>>
File: 202411081727-53554562.png (1.04 MB, 768x1216)
1.04 MB
1.04 MB PNG
>>
File: 202411081727-53554562-1.png (1.17 MB, 768x1216)
1.17 MB
1.17 MB PNG
>>
>>
File: 202411090032-843318488.png (1.84 MB, 1344x768)
1.84 MB
1.84 MB PNG
>>
File: 202411090033-843318488-1.png (1.89 MB, 1344x768)
1.89 MB
1.89 MB PNG
>>
File: 202411090036-843318488.png (2.02 MB, 1344x768)
2.02 MB
2.02 MB PNG
>>
>>
File: 202411090038-836336515-1.png (2.07 MB, 1344x768)
2.07 MB
2.07 MB PNG
>>
File: 202411090038-459924473-1.png (1.71 MB, 1344x768)
1.71 MB
1.71 MB PNG
>>
File: 202411090042-20797994.png (2.07 MB, 1344x768)
2.07 MB
2.07 MB PNG
>>
>>
File: 202411090048-321650770-1.png (1.58 MB, 1344x768)
1.58 MB
1.58 MB PNG
>>
File: 202411090052-637282379-1.png (1.84 MB, 1344x768)
1.84 MB
1.84 MB PNG
>>
>>103128358
These are great anon.
>>
File: 202411090119-107951066.png (1.49 MB, 1344x768)
1.49 MB
1.49 MB PNG
>>103128523
Thank you
>>
>>
File: 202411090134-1102238981.png (1.71 MB, 1344x768)
1.71 MB
1.71 MB PNG
>>
huh
>>
File: 1722871928062757.png (564 KB, 3116x794)
564 KB
564 KB PNG
>>103127112
>Did someone implement this for comfyui?
not yet
>Also it will break the loras, I assume?
I hope not
https://www.reddit.com/r/StableDiffusion/comments/1gmse2o/comment/lw6qaxl/?utm_source=share&utm_medium=web2x&context=3
>About 2.5x faster(4.6it/s) than comfyui with --fast(2.11 it/s) on a 4090. Seems pretty great,
that's really insane when you think about it
>>
File: 1718565540354753.webm (611 KB, 720x480)
611 KB
611 KB WEBM
Have you guys seen this?
>i2V with new CogX DimensionX Lora
https://reddit.com/r/StableDiffusion/comments/1gms4q8/i2v_with_new_cogx_dimensionx_lora/
>>
File: 1706975564301873.webm (397 KB, 720x480)
397 KB
397 KB WEBM
>>103128999
kek
https://xcancel.com/AIWarper/status/1854933007804592346#m
>>
>>103128999
>>103129049
damn thats pretty good
>>
https://github.com/THUDM/CogVideo/issues/471#issuecomment-2464837688
>The peak is in the VAE part, not the transformer. The transformer part usually consumes 34G of video memory, while the peak of the VAE can reach 68G (1360 * 720)
holy fuck it's fucking over, we'll never be able to use this shit
>>
>>103129185
Fug
>>
>>103129185
nah don't worry, we'll use the Q8_0 version of CogVideoX and we'll use tilted VAE also
>>
File: 1716000206991365.jpg (144 KB, 896x1152)
144 KB
144 KB JPG
>>
>>103129217
so everything will be alright???
>>
>>103129300
yeah we shouldn't worry about that, if kijai made it work on consumer grad GPUs with a 10b model (mochi), it'll be easy for him to do the same thing for CogVideoX-1.5-5b
>>
>>103125788
Try Illutrious based models
>>
>>103129322
so will it only work on 3090/4090s?
>>
>>103129436
Idk, we'll see how kijai's node will handle this model
>>
>>103129185
I love the constant flow of unfinished software...
Chinks setting high standards in user gullibility.
>>
>>103129690
desu I like their approach, it's a long term approach, they shouldn't nerf their advancement because Nvdia decided to hold the world's balls with their greedy hands, it's Nvdia fault we're stuck at 24gb for 6 years straight, we can't move forward in AI if we don't have more vram, it's as simple as that, hardware should keep moving forward to help the software side and Nvdia is unwilling to do that, fuck those mf
>>
>>103129702
The price of 3gb/4gb vram chips is piss all, like what $20 each?
If the 5090 isn't 48gb then Nvidia should burn to the ground.
>>
>>103129717
>If the 5090 isn't 48gb then Nvidia should burn to the ground.
that won't happen, the best case scenario will be 32gb, and I'm being generous there, Nvdia knows how valuable vram is, and they also know that they're the only good GPU makers, everyone depend on them, so they're already seeling 48gb cards for fucking 5000 dollars, and they go for 10000 dollars if you want a 96gb card, when you're a monopoly you can do whatever you want, and I hate that
>>
>>103129727
>Remember when people started hijacking delivery trucks with Nvidia cards on them?

https://www.nme.com/news/graphics-cards-stolen-in-truck-heist-resurface-at-vietnamese-retailer-3135039
>>
File: 1700021185591809.png (100 KB, 304x166)
100 KB
100 KB PNG
>>103129779
kek that put a smile on my face
>>
File: 20920490.png (435 KB, 460x460)
435 KB
435 KB PNG
>>103129779
>stolen in California and re-appeared in Vietnam
poetic justice
>>
File: ComfyUI_03456_.png (1.06 MB, 1152x896)
1.06 MB
1.06 MB PNG
>>
>>103129820
>Would you like to talk about our comrade and savior, Ho Chi Minh?
>>
File: Vram doubler when.png (512 KB, 704x384)
512 KB
512 KB PNG
>>103129727
They really want this kind of future by producing low Vram cards for the poor, don't they.
>>
>>103129849
They are doing the smart business choice of milking the big businesses for all that they can while they can with the AI iron still hot. They can't do that if they start selling reasonably priced chips at the consumer grade level.
>>
>>103129849
there's a reason Nvdia is the most valuable company in the world right now, they're selling overpriced products and we have no other choice but to buy them, because they virtually have no serious rivals, that shit sucks ass man
>>
Speaking of gpus, I wonder what cards they currently produce, and which have dropped out of production lines.
>>
>>103129880
But the poorfag gamer consumer market isnt even that profitable for Nvidia and they make all their money by selling these cards to the data centers.
So it wouldnt be that big of a difference for them if they made the consumer cards with more Vram.
they would still sell the expensive ones to the datacenters, like why not make another A100 but with 200GB VRAM and sell that for 10k?
>>
>>103129898
>So it wouldnt be that big of a difference for them if they made the consumer cards with more Vram.
if they do that, data centers will be using consuemer cards to train their models instead of the overpriced entreprise cards
>>
>>103129909
>2.8 You agree that GeForce or Titan SOFTWARE: (i) is licensed for use only on GeForce or Titan hardware products you own, and (ii) is not licensed for datacenter deployment.
>is not licensed for datacenter deployment.
>>
>>103129690
>I love the constant flow of unfinished software
ofcourse it's unfinished, everything is. ai is still in it's infancy and the community are the ones stuck doing the optimizations and figuring out how to run them on consumer hardware. it's the same story in text gen, llama.cpp is still unable to run multimodal models like pixtral and llama 3.2. it'll take years before companies actually start caring about the user experience in local, welcome to the bleeding edge
>>
>>103130038
how can they even enforce that? A data center can be anything, even I can make a data center on my home, it's not like Nvdia is looking at everyone's house, people were already doing that during the crypto era, tons of 3060 being piled up to mine some bitcoins
>>
>>103130038
it doesn't matter since consumers can stack consumer cards and rent them on sites like vast.ai for smaller projects like finetuning, they'll basically be undercutting themselves and nvidia doesn't want that
>>
>>103129909
>data centers will be using consuemer cards to train their models
no they will buy the new datacenter cards which have even more VRAM.
like why would you buy a 48GB vram card when you can buy a 200GB VRAM card?
they also buy them in bulk and there is also license shit that prohibits datacenters from using the consumer cards.
>>
File: ComfyUI_03468_.png (1.35 MB, 1152x896)
1.35 MB
1.35 MB PNG
>>
>>103130071
For starters, telemetry. Then, they're the manufacturer/retailer, the kind of customer we're talking about needs thousands of GPUs, you can't exactly pop down to Walmart to buy thousands of GPUs.
>>103130074
Even if consumer GPUs had 192GB they'd still only be used for smaller finetuning projects by consumers. Total FLOPS is arguably the bigger factor, foundational models still require thousands of GPUs.
>>
File: 00303-3701802891.jpg (863 KB, 1344x1728)
863 KB
863 KB JPG
>>
>>103128999
I like.
>>
File: 20241109_124259.webm (569 KB, 720x480)
569 KB
569 KB WEBM
sovl
>>
free site to enlarge / focus a photo ?
>>
>>103128999
Is this for the new Cog 1.5 model?
>>
File: 00318-3701802889.jpg (479 KB, 1344x1728)
479 KB
479 KB JPG
>>
>>103122994
When is this thing going to be able to do hands properly
>>
>>103130441
good hands come at a cost
>>
>>103130441
and the cost is inpaint elbow grease
>>
File: 00410-3344569852.jpg (462 KB, 1344x1728)
462 KB
462 KB JPG
>>
File: 00472-3344569854.jpg (783 KB, 1344x1728)
783 KB
783 KB JPG
>>
Mochi Image Encode
>>
>>103130239
kek
>>
why china not release good 16ch vae modal, did they lose interest? it's all video gen. even sana is just some research for a video gen model. why are the chinese obsessed with video gen?
>>
>>103131322
imggen market is saturated
>>
File: 1707334705600255.webm (434 KB, 720x480)
434 KB
434 KB WEBM
>>103128999
>>
sananana never ever
>>
File: 1716416445414521.png (2.28 MB, 1536x1536)
2.28 MB
2.28 MB PNG
>>
File: 0.jpg (110 KB, 1472x864)
110 KB
110 KB JPG
>>
sana text to video before sana text to image
>>
big if true
>>
File: ComfyUI_03481_.png (1.01 MB, 1152x896)
1.01 MB
1.01 MB PNG
>>
>>103131322
Propaganda
>>
File: ComfyUI_03502_.png (1.44 MB, 1152x896)
1.44 MB
1.44 MB PNG
>>
>>103129690
no one asked you to be here, come back in 10 years when Apple says the invented AI
>>
>>103131350
it obviously wasn't done when they announced it and they could be very likely training the VAE right now based on people's feedback
>>
File: m6hakqd0bnhd1.png (2.28 MB, 640x1536)
2.28 MB
2.28 MB PNG
>>
>>103131551
cool style
>>103131653
nice
>>
Went completely over that New Yorkers head.
>>
i am carrot
>>
>>103122994
>Serpents in the Dawn Edition
im too retarded to understand the reference
>>
>
>>
Youtubers now saying they have img2vid for Mochi
It's all a complete liw of course, they are using vison models to interrogate an image, making a prompt, running Mochi and declaring it's "REAL img2vid! WAOW!" with lots of retards in the comments clapping their flippers like performing seals.

So, avoid checking it out if it starts cropping up in feeds.
>>
>>103130161
>>103130432
anon these are amazing, catbox pretty please??
>>
>>103131718
https://youtu.be/daGMULKNCME?si=qmD8WOf2QZyvUaNb
>>
>>103131582
>Reddit moment
>>
>>103124834
kisses, all my kisses for the anime girl
>>
>>103132147
>guys I'm mad that this experimental tech isn't like my iPhone
>>
File: the.png (1.84 MB, 1056x1520)
1.84 MB
1.84 MB PNG
>>
File: 189.jpg (58 KB, 728x1000)
58 KB
58 KB JPG
>>
Straight out the oven:
>>103132365
>>103132365
>>103132365
>>
>>103132337
what model is that?
>>
>>103128971
yup, looking forward to it



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.