/g/ - /ldg/ - Local Diffusion General - Technology

Thread archived.
You cannot reply anymore.

File: the longest dick general (3).jpg (3.68 MB, 3248x3264)

/ldg/ - Local Diffusion General Anonymous 10/21/24(Mon)20:45:18 No.102919427 Archived

Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102908985

SANA Round Two Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai

Anonymous 10/21/24(Mon)20:47:13 No.102919452

this thread is for mourning the death of bigma

Anonymous 10/21/24(Mon)20:48:40 No.102919465

File: file.png (1.44 MB, 1248x1248)

Once I get over the VAE compression I think I can accept this model for dicking around.

Anonymous 10/21/24(Mon)20:49:07 No.102919469

Now for drugs and sleep, good night anons.

Anonymous 10/21/24(Mon)20:50:02 No.102919480

File: file.png (160 KB, 346x261)

>>102919465
the fuck is wrong with their eyes? if we can change the VAE maybe it'll be saved idk

Anonymous 10/21/24(Mon)20:53:01 No.102919516

File: file.png (1.95 MB, 1248x1248)

>>102919480
It's hard to tell what's the VAE and what's from the model being undercooked. There's a reason why they haven't released the weights (it's not done).

Anonymous 10/21/24(Mon)20:54:25 No.102919530

>>102919516
>There's a reason why they haven't released the weights (it's not done).
I really don't get what they're doing, what's the point of releasing an uncooked demo in the first place? They wanted to be clowned on?

Anonymous 10/21/24(Mon)20:55:30 No.102919537

cascade anon has been on suicide watch for so long someone check up on him

Anonymous 10/21/24(Mon)20:56:06 No.102919545

>>102919537
>schizo anon is talking to himself again

Anonymous 10/21/24(Mon)20:59:20 No.102919576

>>102919530
Who knows, they made some weird decisions, like going way too hard on the VAE compression. Even 16x would've been impressive and achieved their goal. Same with switching to Gemma saving some headroom on the text encoder. If it were me I would've figured the requirements to train the model on a 24 GB VRAM GPU at 1024px then size the model for that, either 2B or 3B with 16x compression. The model is too experimental like Cascade.
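For a rough sense of what those compression factors mean, here is a minimal sketch that counts latent positions for a square image at 8x, 16x and 32x spatial downsampling. It assumes the only things that matter are the autoencoder downsampling factor and the patch size; channel counts are ignored and the function name is made up for illustration.

```python
def latent_tokens(image_px: int, downsample: int, patch: int = 1) -> int:
    """Count the latent positions a diffusion model sees for a square image.

    downsample: spatial compression factor of the autoencoder (8, 16, 32, ...)
    patch:      patch size applied on top of the latent grid (assumed 1 here)
    """
    side = image_px // (downsample * patch)
    return side * side


if __name__ == "__main__":
    for factor in (8, 16, 32):
        print(f"{factor}x compression: {latent_tokens(1024, factor)} tokens at 1024px")
    # An 8x autoencoder followed by 2x2 patching lands on the same grid as 16x.
    print(f"8x + patch 2: {latent_tokens(1024, 8, patch=2)} tokens at 1024px")
```

Every doubling of the compression factor cuts the token count by 4, which is where the training speedup comes from and also why fine detail like eyes has fewer latent positions to live in.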

Anonymous 10/21/24(Mon)21:03:58 No.102919638

>>102919576
yep, they made a serious mistake there, no one cares about small models that produce bad images, we want quality first, and they could've achieved that with a 5b model + a normal VAE

Anonymous 10/21/24(Mon)21:04:51 No.102919645

>>102919638
3B for SD3 barely fits at 768px for training on a 4090 at batch 1 with all the optimization tricks. 5B is a dream.
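A back-of-envelope sketch of why parameter count alone eats VRAM in a full finetune. The bytes-per-parameter figures are assumptions (bf16 weights and gradients, AdamW with two fp32 moments, or an 8-bit optimizer at roughly 2 bytes per parameter). Activations, which grow with resolution and batch size, are not counted at all, so real usage is higher.

```python
def train_state_gib(params_billions: float,
                    weight_bytes: int = 2,
                    grad_bytes: int = 2,
                    optim_bytes: int = 8) -> float:
    """Memory for weights + gradients + optimizer state, in GiB (no activations)."""
    total_bytes = params_billions * 1e9 * (weight_bytes + grad_bytes + optim_bytes)
    return total_bytes / 2**30


if __name__ == "__main__":
    for name, size in [("2B", 2.0), ("3B", 3.0), ("5B", 5.0)]:
        adamw = train_state_gib(size)                  # two fp32 moments per parameter
        adam8 = train_state_gib(size, optim_bytes=2)   # 8-bit optimizer states (assumed)
        print(f"{name}: {adamw:.1f} GiB with AdamW, {adam8:.1f} GiB with 8-bit states")
```

Under these assumptions a 5B model is already past 24 GiB before a single activation is stored, even with 8-bit optimizer states.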

Anonymous 10/21/24(Mon)21:05:31 No.102919652

>>102919645
>5B is a dream.
the 5090 (32gb) will be there soon, so it won't be a dream anymore

Anonymous 10/21/24(Mon)21:06:50 No.102919668

what is 8x8?
is it better than fp8?

Anonymous 10/21/24(Mon)21:07:41 No.102919679

>>102919668
8x8 is 4x4 doubled.

Anonymous 10/21/24(Mon)21:08:31 No.102919686

>>102919668
>what is 8x8?
the what? where did you find this?

Anonymous 10/21/24(Mon)21:08:34 No.102919689

>>102919652
That might mean 1024px for SD3, maybe batch size 2. 5B would still be a dream. 1.6B wouldn't have been bad if the AE wasn't so extreme. But I won't poo poo it until I can train it myself. Their training methods are questionable.

Anonymous 10/21/24(Mon)21:09:42 No.102919699

>>102919689
>That might mean 1024px for SD3
I don't get it, SD3 is a 2b model, yet we managed to train SDXL (2.7b) on a 3090 though?

Anonymous 10/21/24(Mon)21:10:24 No.102919706

File: file.png (18 KB, 1503x80)

>>102919686

Anonymous 10/21/24(Mon)21:10:54 No.102919710

>>102919699
Transformers use more VRAM.

Anonymous 10/21/24(Mon)21:11:08 No.102919711

>>102919638
>5b model + a normal VAE
Then where is the research? That's just another SD clone then. At the end of the day I'm glad they tried something new. Later zhangs will read their paper and create the ultimate 1girl generator with 128x compression VAE

Anonymous 10/21/24(Mon)21:16:02 No.102919756

anyone using juggernaut v11?

Anonymous 10/21/24(Mon)21:16:05 No.102919757

>>102919711
>That's just another SD clone then.
it's not, SAI tried to make DiT models, and they suck ass, I'd say it's a smaller version of Flux, we don't need a 12b model, I'm sure we can reach the same quality with 5b, but definitely not with 1.6b, which is my point

Anonymous 10/21/24(Mon)21:17:08 No.102919768

So is Illustrious shit or promising? I wasn't around when it was fresh. Seems like a step down from my playing with it, even for coomers.

Anonymous 10/21/24(Mon)21:17:57 No.102919771

>>102919427
forgot SANA links, retard

https://github.com/NVlabs/Sana

https://huggingface.co/collections/mit-han-lab/dc-ae-670085b9400ad7197bb1009b

https://ea13ab4f5bd9c74f93.gradio.live/

>>102919768
try out NoobAI which is a derivative of illustrious

Anonymous 10/21/24(Mon)21:18:31 No.102919775

>>102919771
>forgot SANA links, retard
sana doesn't deserve to be in the OP, it sucks ass

Anonymous 10/21/24(Mon)21:19:48 No.102919789

File: file.png (902 KB, 1248x896)

>>102919757
You're such a size queen, you would think after seeing Florence2 that size isn't the be-all-end-all of models. Pixart at 600m was just fine. 1.6B would be fine especially for niche models with 100,000 image datasets. Sana could very well be *the* porn model.

Anonymous 10/21/24(Mon)21:21:12 No.102919802

>>102919789
>1.6B would be fine
sana proved it's not, stop coping, they tried all the tricks that existed on earth for that one, and it still looks like ass

Anonymous 10/21/24(Mon)21:22:33 No.102919825

>>102919775
>>102919802
chill out anon we're just having fun with a new model it's not that deep

Anonymous 10/21/24(Mon)21:22:59 No.102919832

>>102919802
Sana didn't prove anything. I already just said Pixart is fine with 600m. You don't know the final result of Sana, you just know what an undertrained alpha model looks like. Honestly you're exactly why no one ever posts anything because you're incapable of abstract thought. Congrats anon, you're a faggot. You're someone that sees the ingredients of a cake and say "I CANNOT EAT THIS SHIT"

Anonymous 10/21/24(Mon)21:24:05 No.102919844

I'm looking to upgrade my video card to something a little bit more fitting for AI image generation than my current GTX 970. I've got a Ryzen 5 5600 and a 550 watt power supply. I'm going to guess I'm not going to be able to support the latest and greatest of 4090's on that, so what's a non-AMD card with a comfortable amount of VRAM that my current hardware can support?

Anonymous 10/21/24(Mon)21:24:06 No.102919846

>>102919832
>you just know what an undertrained alpha model looks like.
that's your assumption, what makes you believe it's undertrained and it's not the final result?

Anonymous 10/21/24(Mon)21:24:46 No.102919849

>>102919757
>it's not, SAI tried to but it sucked ass
>implying SAI is competent at all

Anonymous 10/21/24(Mon)21:24:47 No.102919851

File: file.png (1.14 MB, 896x1152)

>>102919771
Thanks anon, taking a look at the gallery, still seems to have that sketchy, uncertain quality to the images.
>>102919832
Is the reason pixart hasn't been adopted because of anti-hype?

Anonymous 10/21/24(Mon)21:25:07 No.102919856

>>102919825
>>102919832
let me guess, you were SD3 shills back in the day as well?
>"Just trust Lykon bro, SD3 will be the best model ever"
you can't stop taking L's, can you? kek

Anonymous 10/21/24(Mon)21:26:08 No.102919863

File: file.png (2.34 MB, 1024x1408)

Anonymous 10/21/24(Mon)21:27:19 No.102919878

>>102919851
>Is the reason pixart hasn't been adopted because of anti-hype?
it's because it hadn't beaten the previous local sota model which was SDXL, simple as that

Anonymous 10/21/24(Mon)21:28:15 No.102919888

>>102919851
>gallery
researcher gens are always ass but i wont pretend it's not wonky

Anonymous 10/21/24(Mon)21:28:22 No.102919889

>>102919422
Because the prompt results are the same as in the paper, which uses prompt rewriting. For example try

portrait photo of a girl, photograph, highly detailed face, depth of field

It looks basically the same

Anonymous 10/21/24(Mon)21:29:16 No.102919900

File: file.png (1.51 MB, 1248x896)

>>102919851
600m is fine but is restricting and definitely at the niche size category and not what people are looking for for a base model. But comparatively 600m for Pixart is the same overall quality of say SD 1.5.

Anonymous 10/21/24(Mon)21:29:34 No.102919902

Excuse me, this is the pixart bigma thread, chud.

Anonymous 10/21/24(Mon)21:30:56 No.102919916

>>102919889
I'm glad I decided to become an engineer rather than a researcher, looks like a field of literal retards
>Make a paper about an unfinished product
>Put shitty pictures as their cherry picked pictures
>Add a demo of their undertrained turd
I swear to god if I was working with such dumbasses I would end my life

Anonymous 10/21/24(Mon)21:32:12 No.102919932

>>102919916
Maybe because researchers can see the bigger picture while you hit your face saying "I can't eat raw flour"

Anonymous 10/21/24(Mon)21:32:44 No.102919938

>>102919427
Another bake that deliberately snubs all 1girl in favor of low effort slop. Do I need to make the real collage myself?

Anonymous 10/21/24(Mon)21:33:25 No.102919947

>>102919932
>Maybe because researchers can see the bigger picture
nah, look at the SD3 researchers, they are actual retards, and the sana team will join that list of retards

Anonymous 10/21/24(Mon)21:33:33 No.102919950

>>102919938
There are three 1girls in the collage anon

Anonymous 10/21/24(Mon)21:34:06 No.102919957

File: file.png (1.31 MB, 896x1152)

>>102919851
Best of 4 with the same prompt (with replaced prompt conditioning), illustrious definitely has better prompt adherence than pony. There's at least some feeling of Ornifex there. But the quality is so much better, and I heaped on the retarded sd1.5 prompt that seems recommended. Can't say base pony was too much better. As with all these things, we'll just have to see how things shake out.

Anonymous 10/21/24(Mon)21:34:09 No.102919958

>>102919947
SD3 took way too long and had no results. Sana is coming out after 5 months from Pixart Sigma. Completely different story.

Anonymous 10/21/24(Mon)21:34:58 No.102919966

>>102919958
its different for that anon because he is incapable of abstract thought or nuance

Anonymous 10/21/24(Mon)21:35:13 No.102919969

>>102919958
>Sana is coming out after 5 months from Pixart Sigma.
what's the point? I'd prefer they took 1 year and made a diamond rather than 5 months to shit out a turd

Anonymous 10/21/24(Mon)21:35:18 No.102919970

>>102919950
No, there are zero.

Anonymous 10/21/24(Mon)21:35:59 No.102919978

>>102919966
**its not different

Anonymous 10/21/24(Mon)21:36:35 No.102919988

>>102919966
>>102919978
you lost the nuance in your grammar anon kek

Anonymous 10/21/24(Mon)21:36:36 No.102919989

File: file.png (1.77 MB, 1140x1137)

>>102919970
Because it's 3girls? BRAIN BLAST

Anonymous 10/21/24(Mon)21:36:44 No.102919993

>>102919969
I know you're incapable of thought, but maybe the more important part is they managed to reduce the resources required to train a model 8x? That means someone can make your precious 5B model 8 times faster. Do you know what it means to work smarter not harder?

Anonymous 10/21/24(Mon)21:38:10 No.102920004

>>102919993
>they managed to reduce the resources required to train a model 8x?
again, what's the point? it's a little turd, you won't make it great with more training, we've been training SDXL (2.7b) for a year and a half at this point, and it'll never reach Flux, when you're small you're small, cope with it

Anonymous 10/21/24(Mon)21:39:07 No.102920019

>>102920004
What's the point of making better tools for training models that makes training future models 8 times faster? Surely you're not this stupid, right?

Anonymous 10/21/24(Mon)21:40:23 No.102920034

>>102920004
I don't know why you're here, obviously AI is way too experimental for you. Maybe you should come back 10 years when things are more like your iPhone 14.

Anonymous 10/21/24(Mon)21:40:23 No.102920035

>>102920019
>Surely you're not this stupid, right?
tell that to the sana team who decided to go for a little turd even though they would've proven to everyone how great their technique was if we had a big model in our hands

Anonymous 10/21/24(Mon)21:40:43 No.102920038

Sana is the biggest leap for local models since sd 1.0. I've heard rumors that sana 2 started training recently and the team is already shocked by the results. Apparently it is beating flux on most benchmarks after just 50 H100 hours of training time.

Anonymous 10/21/24(Mon)21:41:24 No.102920043

>>102920034
I accept your concession

Anonymous 10/21/24(Mon)21:41:26 No.102920044

File: file.png (1.17 MB, 1248x896)

Anonymous 10/21/24(Mon)21:41:42 No.102920049

nothing like a new model to bring out the retards

Anonymous 10/21/24(Mon)21:42:09 No.102920054

This general was founded on an irrational millennarian enthusiasm around the release of Pixart Sigma; today, while we continue to use models finetuned from Stability models or made by former Stability employees, Pixart drops another dud. The thread is, predictably, in shambles. And we absolutely deserve it.

Anonymous 10/21/24(Mon)21:42:24 No.102920058

>>102920049
>nothing like a new model to bring out the retards
true, that's why you're here right now

Anonymous 10/21/24(Mon)21:42:28 No.102920060

File: ComfyUI_00836_.png (1.67 MB, 1280x1024)

Anonymous 10/21/24(Mon)21:42:50 No.102920065

>>102920054
Weird because you're an sdg faggot that prefers to be here for some reason.

Anonymous 10/21/24(Mon)21:42:56 No.102920067

>>102920058
yeah

Anonymous 10/21/24(Mon)21:43:44 No.102920071

>>102920065
"some reason"... you know full well what his name is

Anonymous 10/21/24(Mon)21:44:23 No.102920075

File: file.png (209 KB, 746x512)

>>102920067
I was just joking, and you were supposed to call me a nigger faggot, now I feel bad saying it, sorry anon

Anonymous 10/21/24(Mon)21:46:04 No.102920087

has anyone done a direct comparison to sigma? to see how much it improved if at all?

Anonymous 10/21/24(Mon)21:46:08 No.102920088

File: file.png (1.15 MB, 1248x896)

The issue is technology has been infested with tech illiterate retards that wouldn't be here if smartphones didn't have web browsers. Something about AI brings the 90 IQ retards.

Anonymous 10/21/24(Mon)21:46:53 No.102920096

>>102920088
>The issue is technology has been infested with tech illiterate retards that wouldn't be here if smartphones didn't have web browsers.
sana was literally made so that it could be run on a smartphone, it's written in their paper and I hate every line of it

Anonymous 10/21/24(Mon)21:47:27 No.102920104

>>102920087
Absolutely no point in doing it until the weights are dropped, every couple seconds there's another H100 batch size 1024 being dropped on it.

Anonymous 10/21/24(Mon)21:48:40 No.102920123

>>102920096
And? It's still a prototype, it's like going to an alpha-tech trade show showing a prototype 1-inch OLED and saying "duurrrr dat is too small"

Anonymous 10/21/24(Mon)21:48:46 No.102920125

File: file.png (1.24 MB, 1152x896)

>>102920038

Anonymous 10/21/24(Mon)21:50:27 No.102920144

>>102920123
>And? It's still a prototype
the classic "prototype cope"
>Hurdur, Pixart Sigma is a prototype, the next iteration will destroy everything
>next iteration (sana) comes
>looks like shit
>well... it's a prototype duh! It's not like we were waiting for this or something, 2 weeks anon!
of course with this method I'll never win right?

Anonymous 10/21/24(Mon)21:50:36 No.102920147

>>102920104
so true, i think ive heard the researchers drop another turd on it

Anonymous 10/21/24(Mon)21:50:44 No.102920151

>>102920125
My sources are people close to the sana team. I can't reveal their names or exact positions because NvLabs may pull funding.

Anonymous 10/21/24(Mon)21:50:50 No.102920153

File: file.png (1.29 MB, 1248x896)

>game developer posts pre-alpha footage for nerds and techies
>retard gamer with smart phone: durr da game has bad grafix

Anonymous 10/21/24(Mon)21:52:08 No.102920166

>>102920151
Me too I have sources, and they said that they won't do anymore models after sana, it's over. I can't reveal their names or exact positions but trust me bro it's true

Anonymous 10/21/24(Mon)21:52:42 No.102920170

File: file.png (891 KB, 1248x896)

I do think Gemma is too retarded for prompt rewriting.

Anonymous 10/21/24(Mon)21:54:24 No.102920184

>>102920153
did you seriously make an analogy between image models and video games? Can't believe you're this retarded, image models are all about graphics, if it looks like shit no one will give a fuck, that's the only goal of an image model, to produce good looking pictures that are accurate to your prompts

Anonymous 10/21/24(Mon)21:56:18 No.102920202

It's funny because any criticism of Flux is met with "but dey gave it tou you fer FREE!"
You got Sana, for free. Calm down.

Anonymous 10/21/24(Mon)21:56:44 No.102920206

File: file.png (1.01 MB, 1248x896)

>>102920184
did you seriously make an analogy between video games and image models? Can't believe you're this retarded, video games are all about graphics, if it looks like shit no one will give a fuck, that's the only goal for video games, to produce good looking graphics.

Anonymous 10/21/24(Mon)21:57:27 No.102920211

>>102920166
Don't come crawling back when Sana 2 releases. Apparently it will be ready in a month or even weeks.

Anonymous 10/21/24(Mon)21:57:50 No.102920220

>>102920211
>Apparently it will be ready in a month or even weeks.
2 weeks?

Anonymous 10/21/24(Mon)21:58:50 No.102920230

>>102920211
I heard maybe even two weeks. They had a breakthrough in quantum shrinkflation and after the lead researcher got shrinkflated he was able to design a hyperbolic time training algorithm, I heard it from the engineers

Anonymous 10/21/24(Mon)21:59:07 No.102920232

>>102920211
>Don't come crawling back when Sana 2 releases.
took them 5 months to make this turd that is sana, and you're expecting us to believe they'll make a better model in a month?

Anonymous 10/21/24(Mon)21:59:43 No.102920240

>>102920220
I can't give further details but I can say it will be at least more than a week. I'm already risking my source's identity with what I've revealed so far

Anonymous 10/21/24(Mon)21:59:49 No.102920241

>>102920232
oh my god he's retarded

Anonymous 10/21/24(Mon)22:00:09 No.102920245

>>102920241
I accept your concession.

Anonymous 10/21/24(Mon)22:00:47 No.102920251

>>102920245
(we're all roleplaying anon)

Anonymous 10/21/24(Mon)22:00:51 No.102920253

File: file.png (956 KB, 1248x896)

Anonymous 10/21/24(Mon)22:01:08 No.102920259

File: file.png (250 KB, 1024x1408)

Anonymous 10/21/24(Mon)22:01:15 No.102920261

>>102920240
Oh dear sana (((alledged))) employee anon, why do sana 1 looks so bad?

Anonymous 10/21/24(Mon)22:02:13 No.102920269

at least Black Forest Labs had the balls to send a peon into the thread to answer questions
chang where are you?

Anonymous 10/21/24(Mon)22:03:11 No.102920275

>>102920269
no you dont understand hes been here the whole time, be careful to not speak badly of the model or you will summon him

Anonymous 10/21/24(Mon)22:03:51 No.102920283

File: file.png (1.31 MB, 1248x896)

Anonymous 10/21/24(Mon)22:06:09 No.102920302

>>102919771
>demo saves as webp

Anonymous 10/21/24(Mon)22:07:36 No.102920313

File: file.png (1.74 MB, 1248x896)

Anonymous 10/21/24(Mon)22:10:18 No.102920334

File: file.png (741 KB, 1024x1408)

Anonymous 10/21/24(Mon)22:14:15 No.102920362

File: image.jpg (286 KB, 1024x1024)

Sana is the best!

Anonymous 10/21/24(Mon)22:15:12 No.102920371

File: file.png (1.55 MB, 768x1280)

Anonymous 10/21/24(Mon)22:15:48 No.102920375

>>102920371
turn PAG guidance down pls

Anonymous 10/21/24(Mon)22:16:53 No.102920380

>>102920362
True anon, Sanoa iis the Hhe Sanao the but gest

Anonymous 10/21/24(Mon)22:18:04 No.102920395

Rip. Application is busy.

Anonymous 10/21/24(Mon)22:21:24 No.102920425

can it gen sanna marin?

Anonymous 10/21/24(Mon)22:23:07 No.102920442

>>102920375
what's the difference between PAG guidance and CFG? why isn't it working with just CFG like every normal model?

Anonymous 10/21/24(Mon)22:24:15 No.102920453

>>102920442
i remember when PAG came out but can't recall what its purpose is. i think it looks bad desu.
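For reference, a minimal sketch of how the two guidance terms are commonly combined, assuming the formulation from the Perturbed-Attention Guidance paper: CFG pushes the prediction away from the unconditional one, while PAG pushes it away from a prediction made with the self-attention maps replaced by identity. The function and argument names are hypothetical, and the three predictions are stand-in arrays rather than real model outputs.

```python
import numpy as np


def guided_noise(eps_uncond, eps_cond, eps_perturbed, cfg_scale, pag_scale):
    """Combine CFG and PAG terms into one guided noise prediction.

    eps_uncond:    prediction with an empty prompt
    eps_cond:      prediction with the real prompt
    eps_perturbed: prediction with the real prompt but perturbed self-attention
    """
    cfg_term = cfg_scale * (eps_cond - eps_uncond)     # steer toward the prompt
    pag_term = pag_scale * (eps_cond - eps_perturbed)  # steer away from broken structure
    return eps_uncond + cfg_term + pag_term


if __name__ == "__main__":
    shape = (4, 32, 32)  # arbitrary stand-in latent shape
    uncond = np.zeros(shape)
    cond = np.ones(shape)
    perturbed = np.full(shape, 0.5)
    print(guided_noise(uncond, cond, perturbed, cfg_scale=5.0, pag_scale=2.0).mean())
```

With `pag_scale=0` this collapses to plain CFG, so "turning PAG down" just shrinks the second term.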

Anonymous 10/21/24(Mon)22:25:15 No.102920462

File: file.png (790 KB, 1024x1024)

trying out old sigma prompts on sana, i plan on killing myself soon
>a cute, chubby little raccoon in a mystical forest full of glowing creatures and fauna, the image is in a low poly style

Anonymous 10/21/24(Mon)22:26:00 No.102920468

>>102920462
Base sigma could do lowpoly?

Anonymous 10/21/24(Mon)22:26:09 No.102920470

>>102920462
>i plan on killing myself soon
why? because it looks worse than old sigma?

Anonymous 10/21/24(Mon)22:27:03 No.102920472

>>102920468
Without seeing the Gemma prompt Gemma could've gobbled up the low poly style part of the prompt.

Anonymous 10/21/24(Mon)22:27:09 No.102920474

File: file.png (1.17 MB, 1024x1024)

>>102920462
for comparison this is what sigma gave
>>102920468
>>102920470
sigma was pure soul in safetensor form

Anonymous 10/21/24(Mon)22:28:05 No.102920484

File: file.png (1.05 MB, 1024x1024)

Anonymous 10/21/24(Mon)22:28:58 No.102920494

>>102920462
>>102920474
yikes, it looks way worse than its predecessor, how could they have fucked it up this bad?

Anonymous 10/21/24(Mon)22:30:25 No.102920503

i like to cope and think that maybe, just maybe, sana is not part of the pixart family of models

Anonymous 10/21/24(Mon)22:31:17 No.102920511

>single flux gen ITT

Anonymous 10/21/24(Mon)22:32:16 No.102920518

File: file.png (1.44 MB, 1024x1024)

Anonymous 10/21/24(Mon)22:33:12 No.102920531

File: 1704406258862936.png (967 KB, 1024x1024)

it can do low poly but the prompt adherence is awful
>low poly style render of an old rusted robot wearing pants and a jacket riding skis in a supermarket

Anonymous 10/21/24(Mon)22:33:34 No.102920532

THIS IS WHY YOU DONT USE SYNTHETIC MIDJOURNEY IMAGES YOU STUPID FUCKING CHINKS
WHY IS LOCAL SO FUCKING INCOMPETENT
every fucking model since sdxl has been trained on dogshit synthetic data, we could've had local midjourney or dalle already if these faggot bakers didn't cuck their shit.

Anonymous 10/21/24(Mon)22:34:24 No.102920537

>>102920531
From my basic tests the Gemma 2 prompt expander is very ass, it will change your prompt for the worse and miss the primary intent and style.

Anonymous 10/21/24(Mon)22:35:02 No.102920543

>>102920503
no wonder they just released the demo, they knew it was shit enough to be clowned on, I hope they'll improve it now, if you're reading this sana employee, get back to work!

Anonymous 10/21/24(Mon)22:36:12 No.102920553

File: file.png (1.9 MB, 1024x1024)

>>102920532
amen

Anonymous 10/21/24(Mon)22:37:07 No.102920562

File: file.png (1.5 MB, 1024x1024)

4chang detects my post as spam if i try to post the prompt
>sana
https://pastebin.com/5CcFUbGh

Anonymous 10/21/24(Mon)22:38:20 No.102920575

File: 14ca4e7d-012f-4039-b055-7(...).png (1.66 MB, 1024x1024)

>>102920562
>pixart sigma output

Anonymous 10/21/24(Mon)22:39:49 No.102920586

I think it's easily worse than SD 1.5

Anonymous 10/21/24(Mon)22:39:51 No.102920587

File: 1729546641457839.jpg (1.45 MB, 2120x2488)

Let's talk about CtrLoRA again.

https://github.com/comfyanonymous/ComfyUI/issues/5314
https://github.com/xyfJASON/ctrlora

>ControlNet (Zhang et al., 2023) adds an extra network that accepts a condition image, turning a T2I model into an image-to-image (I2I) model. In this manner, ControlNet is able to generate images according to a specific kind of condition image such as canny edge, significantly improving the controllability. However, for each condition type, an independent ControlNet needs to be trained from scratch with a large amount of data and computational resources. For example, the ControlNet conditioned on canny edge is trained on 3 million images for around 600 A100 GPU hours.
>To address this problem we propose a CtrLoRA framework that allows users to conveniently and efficiently establish a ControlNet for a customized type of condition image. As illustrated in Fig. 2(a), we first train a Base ControlNet on a large-scale dataset across multiple base condition-to-image tasks such as canny-to-image, depth-to-image, and skeleton-to-image, where the network parameters are shared by all these base conditions. Meanwhile, for each base condition, we add a condition-specific LoRA to the Base ControlNet. In this manner, the condition-specific LoRAs capture the unique characteristics of the corresponding conditions, allowing the Base ControlNet to focus on learning the common knowledge of image-to-image (I2I) generation from multiple conditions simultaneously. With our framework, in most scenarios, we can learn a customized type of condition with as few as 1,000 training data and less than one hour of training on a single GPU. Moreover, our method requires only 37 million LoRA parameters per new condition, a significant reduction compared to the 361 million parameters required by the original ControlNet for each condition.
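
A minimal sketch of why a per-condition LoRA is so much cheaper than training a fresh ControlNet, assuming a single linear layer stands in for the whole network. The layer width (320) and rank (4) are made-up illustration values, not CtrLoRA's actual configuration; only the 37M and 361M figures come from the quoted abstract.

```python
def matvec(M, v):
    """Multiply matrix M (a list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def lora_forward(W, A, B, x, scale=1.0):
    """y = W x + scale * B (A x). W stays frozen; only A and B are trained."""
    base = matvec(W, x)
    delta = matvec(B, matvec(A, x))
    return [b + scale * d for b, d in zip(base, delta)]


def full_param_count(d_out, d_in):
    """Parameters in a dense d_out x d_in weight matrix."""
    return d_out * d_in


def lora_param_count(d_out, d_in, rank):
    """Parameters in the low-rank pair: A is rank x d_in, B is d_out x rank."""
    return rank * d_in + d_out * rank


# Tiny worked example: a 2x2 frozen weight plus a rank-1 update.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 1.0]]    # rank x d_in
B = [[1.0], [2.0]]  # d_out x rank
y = lora_forward(W, A, B, [1.0, 2.0])

# Hypothetical layer size, only to show how the counts scale.
full = full_param_count(320, 320)
lora = lora_param_count(320, 320, rank=4)

# Figures quoted from the abstract: 37M LoRA params vs 361M for a full ControlNet.
paper_ratio = 37e6 / 361e6

print(y, full, lora, round(paper_ratio, 3))
```

Going by the quoted numbers, each new condition costs roughly a tenth of the parameters of a standalone ControlNet, and the shared Base ControlNet is trained only once.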

Anonymous 10/21/24(Mon)22:40:47 No.102920596

>>102920586
I don't think you realize how fucking bad SD 1.5 is. Let's get those rose tinted glasses off buddy.

Anonymous 10/21/24(Mon)22:42:48 No.102920608

>>102920596
>I don't think you realize how fucking bad SD 1.5 is.
this, I played with base SD1.5 a month ago, it was horrible, we really improved our shit since then

Anonymous 10/21/24(Mon)22:44:05 No.102920621

File: 1714875072433212.jpg (1.85 MB, 2054x4106)

>low poly render of a man wearing glasses with a sign that says "IT'S OVER"
> Image Style: 3D Model

Anonymous 10/21/24(Mon)22:44:15 No.102920622

File: file.png (569 KB, 1024x1024)

>>102920609
>hatsune miku with her tits out

Anonymous 10/21/24(Mon)22:44:17 No.102920623

>>102920587
>Let's talk about CtrLoRA again.
there's a model that'll work on flux?

Anonymous 10/21/24(Mon)22:44:45 No.102920630

File: IMG-20241022-WA0009.jpg (135 KB, 1280x1280)

Meta seems to be the best text to image Gen AI for me and it works best in WhatsApp. Even Instagram one sucks despite sharing same Llama version.

Is there any Android app that lets me create unlimited images every day for free like WhatsApp does?

Anonymous 10/21/24(Mon)22:45:29 No.102920633

>>102920630
this is a local thread anon

Anonymous 10/21/24(Mon)22:45:38 No.102920639

File: 1704855265718923.jpg (1.42 MB, 2054x4106)

>low poly render of a man wearing glasses with a sign that says "IT'S OVER"
> Image Style: (No style)

Anonymous 10/21/24(Mon)22:47:11 No.102920653

>>102920639
they took away it's soul... give it back.... GIVE IT BAAAAAAAAACKKKKKKK!!!!!!!!!!!!!!!!!!!!!!!!!!

Anonymous 10/21/24(Mon)22:48:07 No.102920657

>>102920633
My bad, I thought Llama based txt to img would be considered on topic considering it's open source

Anonymous 10/21/24(Mon)22:48:13 No.102920658

nvidia will pay

Anonymous 10/21/24(Mon)22:48:40 No.102920665

>>102920653
not only did it remove its soul, it also looks worse. Flux doesn't have much soul but at least the images look consistently good

Anonymous 10/21/24(Mon)22:49:31 No.102920671

>>102920657
>it's open source
it isn't. meta doesn't release their image models unfortunately

Anonymous 10/21/24(Mon)22:50:18 No.102920678

>>102920623
Seems like not
>5. CONCLUSION AND LIMITATIONS
>We speculate this issue might originate from the capabilities of the network architectures, specifically the architectures of VAE, UNet-based Stable Diffusion, and ControlNet. To enhance the capabilities of our framework, it is worth developing our CtrLoRA using more advanced DiT-based (Peebles & Xie, 2023) backbones such as Stable Diffusion V3 (Esser et al., 2024) and Flux.1, which we leave for future work.

Anonymous 10/21/24(Mon)22:50:34 No.102920683

>>102920671
>meta doesn't release their image models unfortunately
and their video model (that one looks amazing, goddam I hate it)

Anonymous 10/21/24(Mon)22:51:47 No.102920692

File: file.png (368 KB, 500x500)

>>102920678
ok so that's a nothingburger

Anonymous 10/21/24(Mon)22:52:03 No.102920694

File: 1710118777278704.jpg (1.18 MB, 2054x4106)

>low poly render of hatsune miku

Anonymous 10/21/24(Mon)22:53:23 No.102920704

>>102920692
Did you not read
>which we leave for future work.
?
Time to start putting those mikus to use and port it to flux

Anonymous 10/21/24(Mon)22:53:41 No.102920708

File: file.png (1.32 MB, 1024x1024)

I think they're overly obsessed with using numbers to guide their training and just like overly using aesthetics scores, using scoring to determine prompt adherence probably obliterates concepts.

Anonymous 10/21/24(Mon)22:53:50 No.102920713

What is better for genning: a 4060 Ti 16GB VRAM card or a 4070 12GB VRAM card?

Anonymous 10/21/24(Mon)22:54:01 No.102920716

>>102920704
not my problem, if they want to prove it works on flux, they have to do it

Anonymous 10/21/24(Mon)22:54:41 No.102920720

File: 1726397625582719.jpg (1.07 MB, 2054x4106)

>low poly render of donald trump

Anonymous 10/21/24(Mon)22:55:11 No.102920725

>>102920716
Lmao not going to happen because like everything else, you need an H100. No one is going to train even a ControlNet for Flux without a hefty grant or access to H100s lying around.

Anonymous 10/21/24(Mon)22:56:07 No.102920732

>>102920725
that's why I said it's a nothingburger

Anonymous 10/21/24(Mon)22:57:07 No.102920741

File: ComfyUI_00845_.png (1.06 MB, 1280x1024)

Anonymous 10/21/24(Mon)22:58:12 No.102920747

File: 1705521549275545.jpg (1.22 MB, 2054x4106)

>low poly render of the solar system

Anonymous 10/21/24(Mon)22:58:45 No.102920751

File: file.png (1.44 MB, 1024x1024)

Anonymous 10/21/24(Mon)22:59:45 No.102920756

>>102920562
nice

Anonymous 10/21/24(Mon)23:00:02 No.102920759

File: file.png (1.2 MB, 1024x1024)

this one isn't too bad, i feel like it beats sigma here
>sana
https://pastebin.com/F7mZzSvB

Anonymous 10/21/24(Mon)23:01:07 No.102920767

File: 1700954526636367.jpg (1.47 MB, 2054x4106)

>nvidia geforce rtx 5090 gpu

Anonymous 10/21/24(Mon)23:01:38 No.102920771

File: file.png (1.55 MB, 1024x1024)

>>102920759
sigma

Anonymous 10/21/24(Mon)23:02:38 No.102920780

File: file.png (1.05 MB, 1024x1408)

Anonymous 10/21/24(Mon)23:02:54 No.102920781

>>102920759
hmmm... on second thought... no visible brushstrokes and the hands are worse...

Anonymous 10/21/24(Mon)23:03:57 No.102920791

>>102920781
Aesthetics are easy to fix.

Anonymous 10/21/24(Mon)23:06:08 No.102920807

File: file.png (761 KB, 1024x1024)

>>102920791
i hope so, sigma had some of the best aesthetics for a local model back then
>sana
>a candle that looks like a cute cat

Anonymous 10/21/24(Mon)23:06:49 No.102920815

File: file.png (1.17 MB, 1024x1024)

>>102920807
I think they did a worse job with their captioning.

Anonymous 10/21/24(Mon)23:07:32 No.102920820

>This application is currently busy. Please try again.
STOP HOGGING IT ANON

Anonymous 10/21/24(Mon)23:08:27 No.102920826

File: file.png (1.03 MB, 1024x1024)

>>102920807
>pixart soulma

Anonymous 10/21/24(Mon)23:11:37 No.102920856

>>102920820
It's cute they let us use their office 4090.

Anonymous 10/21/24(Mon)23:12:35 No.102920864

demo queue has been stuck on 1 guy for a while... did he request 8 billion steps or something

Anonymous 10/21/24(Mon)23:13:04 No.102920867

>>102920856
>their office 4090
I hope that's not a 4090, they claimed it would be ultra fast to make a picture, but when I'm at the top of the queue and the generation is starting, it takes more than a minute

Anonymous 10/21/24(Mon)23:13:21 No.102920869

>>102920864
they sort through naughty prompts by hand and are confused by mine

Anonymous 10/21/24(Mon)23:15:04 No.102920880

>>102920867
No it doesn't. It's like 2 seconds. Actually watch the queue, it's a couple of seconds when it's processing your prompt.

>>102920864
4K with a negative prompt takes a bit but I think it might have crashed or something.

Anonymous 10/21/24(Mon)23:16:04 No.102920889

File: file.png (1.12 MB, 1024x1024)

Definitely has Warhammer 40K in the dataset.

Anonymous 10/21/24(Mon)23:16:05 No.102920890

>>102920791
lol no they're not. aesthetics/style are a key part of comprehension. which is why fluxjeets can't even do this simple midjourney prompt despite being able to do complex text on signs
https://www.reddit.com/r/StableDiffusion/comments/1g6q1x3/whats_the_process_to_create_this/
the flux results look like dogshit in comparison, a fundamental misunderstanding of aesthetic construction thanks to butchered training data.

Anonymous 10/21/24(Mon)23:17:06 No.102920900

File: file.png (1.65 MB, 1024x1024)

>>102920867
takes like 2 seconds for each gen, it's just that there's a lot of people in queue
>sana
https://pastebin.com/8YbwfDkX

Anonymous 10/21/24(Mon)23:17:22 No.102920902

>>102920890
I know you're stupid, but the reason why this problem comes up is because AI captioning is extremely bad at using style keywords in the prompts. The only way to avoid this problem is hand written prompts or including meta information.

Anonymous 10/21/24(Mon)23:18:23 No.102920913

>>102920902
>I know you're stupid
projection

Anonymous 10/21/24(Mon)23:18:28 No.102920914

Sorry anon I've been prompting "1girl" with 40 steps on random seeds this whole time. I'll stop.

Anonymous 10/21/24(Mon)23:18:46 No.102920916

File: file.png (2.04 MB, 1024x1024)

>>102920900
>pixart sigma

Anonymous 10/21/24(Mon)23:19:14 No.102920920

>>102920913
I just explained to you why it happens. If you did anything productive with your time you would've known this yourself. You bitch about training but never have you captioned 100k images.

Anonymous 10/21/24(Mon)23:19:56 No.102920926

File: file.png (2.37 MB, 1024x1408)

Anonymous 10/21/24(Mon)23:21:18 No.102920932

>>102920900
>takes like 2 seconds for each gen
I fucking hate the current era we're in: either we get a giant model (Flux) that takes minutes for a single image, or we get some small little shit that produces turds in 2 seconds. Why are they so wary of going for the sweet spot? Something big but not too big

Anonymous 10/21/24(Mon)23:22:18 No.102920939

File: file.png (91 KB, 793x729)

>we will try our best to...
Fuck

Anonymous 10/21/24(Mon)23:22:46 No.102920943

>>102920932
im assuming they plan on using sana as a base for something else like a video model or whatever, explains why they minmaxxed speed and efficiency so much

Anonymous 10/21/24(Mon)23:22:48 No.102920944

>>102920939
>Model zoo
that's the model weights right?

Anonymous 10/21/24(Mon)23:22:59 No.102920948

>>102920932
Sounds simple enough anon, I'm glad you're spearheading this. Oh wait, you want other people to spend thousands of dollars so you can call their work a turd.

Anonymous 10/21/24(Mon)23:23:49 No.102920961

>>102920948
>I'm glad you're spearheading this.
thanks anon, it sure needs to be talked about

Anonymous 10/21/24(Mon)23:23:54 No.102920963

File: file.png (849 KB, 1024x1024)

>sana
>origami figure of a cute girl with cyan hair and long twintails, the girl's name is hatsune miku
sana still has a bit of soul remaining

Anonymous 10/21/24(Mon)23:24:47 No.102920976

>>102920944
Yes that would be every model. The demo is the 1024px model. But I'd expect there to be a 512, 1024, 2K and 4K model. For Pixart they spent the most time on the 2K model.

Anonymous 10/21/24(Mon)23:25:13 No.102920981

File: file.png (1.27 MB, 1024x1024)

>>102920963
sigmo

Anonymous 10/21/24(Mon)23:25:51 No.102920986

File: file.png (1.92 MB, 1024x1024)

For the record this is pretty aligned with Gustav Klimt's work.

Anonymous 10/21/24(Mon)23:26:42 No.102920996

>>102920986
why's she looking at me like that?

Anonymous 10/21/24(Mon)23:27:15 No.102921005

>>102920932
>why are they so wary of going for the sweet spot? Something big but not too big
SAI wanted to make SD3 4b for the sweet spot but it got canceled for (((whatever))) reason

Anonymous 10/21/24(Mon)23:28:15 No.102921012

File: file.png (2 MB, 1024x1024)

>sana
>a line of pill shaped buses with hatsune miku's face on it in new york city, honk honk
it's not too bad, i feel like sana's salvageable

Anonymous 10/21/24(Mon)23:31:10 No.102921038

File: file.png (1.06 MB, 1024x1024)

>>102921012
The big question is if it delivers on the trainability aspect.

Anonymous 10/21/24(Mon)23:31:31 No.102921041

does anyone have the flux masterchief prompt?

Anonymous 10/21/24(Mon)23:32:21 No.102921048

>>102920741
What is this?

Anonymous 10/21/24(Mon)23:32:50 No.102921053

File: file.png (1.18 MB, 1024x1024)

Knows Master Chief

Anonymous 10/21/24(Mon)23:33:11 No.102921058

>>102921041
>does anyone have the flux masterchief prompt?
I have
>Photo of Criminal in a ski mask making a phone call in front of a store. There is caption on the bottom of the image: "It's time to Counter the Strike...". There is a red arrow pointing towards the caption. The reda arrow is from a Red circle which has an image of Halo Master Chief in it.

Anonymous 10/21/24(Mon)23:34:21 No.102921062

>>102921058
the single best flux image. no others come close.

Anonymous 10/21/24(Mon)23:34:33 No.102921065

File: file.png (1.35 MB, 1024x1024)

>a painting by leonardo davinci of a pregnant jesus lovingly carrasing his belly, a speech bubble above him has a blurry screenshot of donald trump badly photoshopped onto it
3rd try, not what i asked for but eehhhehrhehhh
>>102921058
thank you anon

Anonymous 10/21/24(Mon)23:35:48 No.102921072

File: file.png (1.32 MB, 1024x1024)

>Photo of Criminal in a ski mask making a phone call in front of a store. There is caption on the bottom of the image: "It's time to Counter the Strike...". There is a red arrow pointing towards the caption. The reda arrow is from a Red circle which has an image of Halo Master Chief in it.

Anonymous 10/21/24(Mon)23:35:52 No.102921073

File: file.png (1.27 MB, 768x1248)

>>102921062
It's impressive as a concept but practically isn't how any prompts or want to prompt. Especially given how most people just 1girl prompt.

Anonymous 10/21/24(Mon)23:36:04 No.102921075

File: ComfyUI_00007_.png (1013 KB, 1024x1024)

>>102921062
>the single best flux image. no others come close.
it was way too sophisticated a prompt to have been figured out on the very first day of Flux's release, it was probably made by some BFL employee, and it sure did have the wow effect he was expecting, it's such a good prompt to show the strengths of Flux

Anonymous 10/21/24(Mon)23:36:15 No.102921077

File: collage.jpg (1.55 MB, 2163x2000)

This is what the collage should have been. Bakers shouldn't be allowed to be allergic to 1girl.

Anonymous 10/21/24(Mon)23:36:49 No.102921081

>>102921073
how come yours was much better?

Anonymous 10/21/24(Mon)23:37:31 No.102921086

File: file.png (21 KB, 622x281)

>>102921081
I'm sure we haven't discovered the optimal settings

Anonymous 10/21/24(Mon)23:38:59 No.102921092

>>102921075
>it was probably made by some BFL employee,
I've been wondering, for awhile now, about how many of those really good early Flux images were BFL employees using Pro.
>It's time to counter the strike
>Zoom call
>powerpoint presentation
etc...

Anonymous 10/21/24(Mon)23:40:04 No.102921097

>>102921092
if it was only pro pictures it wouldn't have had the impact it had, it was such a big deal because we were able to replicate those with dev as well

Anonymous 10/21/24(Mon)23:42:04 No.102921109

>>102921092
I mean I used their prompt and still got an amazing result. It's a good prompt, Flux is a good model, it's just impossible to train and BFL has ghosted us. I mean I guess we have a year to solve the Flux or SD3 problem until the next massive model comes. The best thing Flux will do is make a target and something we often see in tech is people like to smash targets.

Anonymous 10/21/24(Mon)23:43:01 No.102921113

>>102921109
>it's just impossible to train
with the undistilled models we have now, not anymore, but yeah still hard because it's a big ass motherfucker

Anonymous 10/21/24(Mon)23:43:37 No.102921116

>>102921113
The undistilled models are as alpha as Sana.

Anonymous 10/21/24(Mon)23:43:49 No.102921118

File: file.png (396 KB, 1024x1024)

>blurry cctv footage of donald trumpy menacingly floating in the night sky, full moon behind his back
i remember having to fiddle a lot more to get this image on flux, i feel like sana 'gets' what i'm going for more

Anonymous 10/21/24(Mon)23:44:38 No.102921128

>>102921116
absolutely not, dev dedistill has the same quality as vanilla dev and has all the guidance removed, so it's ready to be trained on, and someone is already up to the task
https://huggingface.co/SG161222/Verus_Vision_1.0b

Anonymous 10/21/24(Mon)23:44:53 No.102921129

File: file.png (1.72 MB, 1280x864)

Anonymous 10/21/24(Mon)23:45:53 No.102921137

You said Sana is shit but yet the demo queue is fucking long

Anonymous 10/21/24(Mon)23:46:22 No.102921145

File: file.png (2.8 MB, 1568x1024)

Anonymous 10/21/24(Mon)23:46:34 No.102921149

File: file.png (2.14 MB, 1280x864)

>>102921137
There's some gold to dig in there

Anonymous 10/21/24(Mon)23:46:59 No.102921153

>>102921137
it's probably someone trying to get past the nsfw heart image filter and gen some sana tits

Anonymous 10/21/24(Mon)23:47:06 No.102921155

>>102921137
it's like watching a car crash, it's horrible but a lot of people are gathering around to see the damage done

Anonymous 10/21/24(Mon)23:48:13 No.102921165

>>102921155
Is there a reason you don't post images, are you poor or something?

Anonymous 10/21/24(Mon)23:48:13 No.102921166

File: file.png (2.92 MB, 1568x1024)

Anonymous 10/21/24(Mon)23:48:48 No.102921173

>>102921165
>he says, while not posting an image

Anonymous 10/21/24(Mon)23:49:23 No.102921175

File: file.png (2.06 MB, 1280x864)

>he never posts images

Anonymous 10/21/24(Mon)23:50:26 No.102921187

STOP HOGGING THE DEMO, FUCK OFF!!

Anonymous 10/21/24(Mon)23:50:30 No.102921188

File: file.png (967 KB, 1000x822)

>>102921175
good goy, I asked you to post an image and you did!

Anonymous 10/21/24(Mon)23:52:21 No.102921214

What are the best 1.5 models at this point? I never moved on from yuzu.

Anonymous 10/21/24(Mon)23:54:17 No.102921224

>>102921214
sana

Anonymous 10/22/24(Tue)00:00:16 No.102921269

The more I use Sana the less I hate it.

Anonymous 10/22/24(Tue)00:01:10 No.102921278

>>102921269
>The more I use Sana the less I hate it.
Show us some pictures that made you love Sana more

Anonymous 10/22/24(Tue)00:01:47 No.102921288

File: file.png (1.08 MB, 1024x1024)

>>102921269
yeah i feel the same, it's salvageable
>an anime screenshot of a wide open field, a gargantuan celestial anime girl towers up into the sky, to the left is a bright blue sky and to the right of the girl is a starry night sky, which she wears like a cape
didn't get what i wanted exactly but that's more of a bad prompt issue

Anonymous 10/22/24(Tue)00:02:08 No.102921291

>>102921278
No you're a contrarian faggot, but it would funny if I posted some Flux images and had you pretend you thought they were shit.

Anonymous 10/22/24(Tue)00:02:55 No.102921300

>>102921291
>No
Concession Accepted.

Anonymous 10/22/24(Tue)00:04:02 No.102921306

>>102921269
im slowly losing interest in using the demo. gimmie local damnit!

Anonymous 10/22/24(Tue)00:05:14 No.102921319

CFG vs PAG... what's that about?

Anonymous 10/22/24(Tue)00:06:45 No.102921329

File: image.jpg (335 KB, 1024x1024)

My first Sana gen. I'll have to tweak some settings. Too early to tell whether this is promising or not.

Anonymous 10/22/24(Tue)00:07:22 No.102921336

File: file.png (2.45 MB, 1568x1024)

>>102921269
>the less I hate it.
I never hated it nor did I love it. It simply is, and I simply am.

Anonymous 10/22/24(Tue)00:07:30 No.102921340

>>102921319
i know pag is a new thing that helps sd1.5 with anatomy and image coherence, haven't used it myself
>>102921329
try this anon's settings >>102921086

Anonymous 10/22/24(Tue)00:08:22 No.102921350

File: file.png (1.03 MB, 1280x864)

With 1.6B parameters the first thing to do is split the extreme styles apart into separate models.

Anonymous 10/22/24(Tue)00:09:25 No.102921358

File: file.png (1.69 MB, 1024x1024)

>celestial princess hatsune miku, her face is replaced by a spiraling galaxy, armpit hair
prompt understanding can be a bit hit or miss at times, or maybe it's because im esl

Anonymous 10/22/24(Tue)00:11:33 No.102921377

>>102921358
I doubt "replaced with" shows up much if at all in the captions. And fetishes like "armpit hair" are never in the captions.

Anonymous 10/22/24(Tue)00:13:15 No.102921385

>>102921377
>And fetishes like "armpit hair" are never in the captions.
i'm killing myself

Anonymous 10/22/24(Tue)00:13:18 No.102921389

File: file.png (3.47 MB, 1568x1024)

Anonymous 10/22/24(Tue)00:14:30 No.102921397

File: file.png (1.4 MB, 1024x1024)

>sailor moon eating the moon

Anonymous 10/22/24(Tue)00:15:30 No.102921404

>>102921397
I wonder if it knows more characters than Migu and Sailor Moon kek

Anonymous 10/22/24(Tue)00:16:01 No.102921413

>>102921404
It's the same basic cast that Flux knows.

Anonymous 10/22/24(Tue)00:16:09 No.102921414

>>102921397
delicious lunar crisp

Anonymous 10/22/24(Tue)00:17:07 No.102921422

>>102921404
we could know if some demon wasn't hoarding the demo doing 4k batches. LEAVE CHANG'S OFFICE GPU ALONE!

Anonymous 10/22/24(Tue)00:17:08 No.102921423

File: file.png (2.44 MB, 1280x864)

Anonymous 10/22/24(Tue)00:17:50 No.102921430

>>102921319
CFG - classifier-free guidance, how hard the model gets pushed toward your prompt and away from its unconditional prediction
PAG - perturbed-attention guidance. I could give the real definition, but I would rather describe it as the amount of ritalin the model does. Occasionally something brilliant will come out of it. Usually, it will screw it up if your dose is too high.
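
A rough sketch of how the two scales are usually combined, assuming noise predictions are plain lists of floats (real pipelines do this on tensors). The function names are hypothetical and the exact combination used by the Sana demo is not known from this thread; this is just the commonly described formulation, where PAG adds a push away from a prediction made with perturbed self-attention.

```python
def cfg(eps_uncond, eps_cond, cfg_scale):
    """Classifier-free guidance: move from the unconditional prediction
    toward (and past) the prompt-conditioned one."""
    return [u + cfg_scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]


def cfg_with_pag(eps_uncond, eps_cond, eps_perturbed, cfg_scale, pag_scale):
    """CFG plus an extra term that pushes away from the prediction
    obtained with perturbed self-attention (assumed to be given)."""
    guided = cfg(eps_uncond, eps_cond, cfg_scale)
    return [g + pag_scale * (c - p)
            for g, c, p in zip(guided, eps_cond, eps_perturbed)]


# Toy numbers only, chosen so the arithmetic is easy to follow.
eps_uncond = [0.0, 1.0]
eps_cond = [1.0, 3.0]
eps_perturbed = [0.5, 2.0]

plain = cfg(eps_uncond, eps_cond, cfg_scale=1.0)    # scale 1 returns eps_cond
strong = cfg(eps_uncond, eps_cond, cfg_scale=2.0)
both = cfg_with_pag(eps_uncond, eps_cond, eps_perturbed,
                    cfg_scale=2.0, pag_scale=3.0)

print(plain, strong, both)
```

Setting pag_scale to 0 reduces this to plain CFG, which is why the two sliders can be tuned independently.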

Anonymous 10/22/24(Tue)00:18:26 No.102921436

File: file.png (1.18 MB, 1024x1024)

>sailor moon eating the moon, armpit hair
not sure if that's a tooth brush or a strange leek

Anonymous 10/22/24(Tue)00:19:15 No.102921443

>>102921423
Honestly shocked the AE manages that fine of details. Maybe it can be saved.

Anonymous 10/22/24(Tue)00:21:13 No.102921461

>>102921436
clearly Luna's femur

Anonymous 10/22/24(Tue)00:22:40 No.102921475

likely to be non commercial license OH NONONO
https://github.com/NVlabs/Sana/commit/7d32332055abbcacc97d00918d43eabe0af950f9#diff-b335630551682c19a781afebcf4d07bf978fb1f8ac04c6bf87428ed5106870f5R13

Anonymous 10/22/24(Tue)00:22:45 No.102921476

File: file.png (1.15 MB, 1568x1024)

Anonymous 10/22/24(Tue)00:24:03 No.102921489

File: file.png (16 KB, 570x243)

it's coming bbbbbbbbs

Anonymous 10/22/24(Tue)00:24:43 No.102921494

>>102921475
LMAOOOOOOO, it's fucking DOA, Schnell has an apache 2.0 licence and it's way better than this small piece of shit

Anonymous 10/22/24(Tue)00:29:50 No.102921525

>>102921443
It's nice to have an alternative model with high channel VAE

Anonymous 10/22/24(Tue)00:33:28 No.102921557

i guess we'll be stuck with sdxl forever huh

Anonymous 10/22/24(Tue)00:34:56 No.102921575

File: file.png (256 KB, 1024x1344)

style wise it feels like 1.5 in a good way

Anonymous 10/22/24(Tue)00:39:27 No.102921622

>>102921494
License doesn't matter when training uses consumer hardware.

Anonymous 10/22/24(Tue)00:39:58 No.102921629

>>102921622
we already have SDXL for that

Anonymous 10/22/24(Tue)00:43:58 No.102921670

File: image (8).jpg (87 KB, 512x512)

Sana does not work at 512x512, can confirm. Does anyone know their actual buckets?

Anonymous 10/22/24(Tue)00:44:28 No.102921677

>>102921670
Looks fine to me, anon

Anonymous 10/22/24(Tue)00:44:54 No.102921682

>>102921677
kek

Anonymous 10/22/24(Tue)00:45:30 No.102921687

File: file.png (947 KB, 1024x1024)

remember omnigen? they have a demo out now, github also says they plan on releasing the model. china save us from china?
>https://github.com/VectorSpaceLab/OmniGen
>https://arxiv.org/abs/2409.11340
demo
>https://huggingface.co/spaces/Shitao/OmniGen
prompt
>a cute cat holding a sign saying "china hello china cheeenaaaa lalalala", ultra high definition

File: actually shit.png (1.27 MB, 1024x1168)

>>102921687
Get this non pixanasexual shit out of here

Anonymous
10/22/24(Tue)00:49:50 No.102921720

File: file.png (448 KB, 1024x1344)

Anonymous
10/22/24(Tue)00:49:59 No.102921723

File: file.png (1.71 MB, 1024x1024)

>>102921687
ehh...

Anonymous
10/22/24(Tue)00:52:03 No.102921743

>>102921723
why did you make miku a n*gger?

Anonymous
10/22/24(Tue)00:52:53 No.102921751

>>102921708
the sana-samas have failed us, the age of the pixart sexual is over. it's the dawn of the planet of the omnigenders

Anonymous
10/22/24(Tue)00:53:14 No.102921755

>>102921743
why not?

Anonymous
10/22/24(Tue)00:53:51 No.102921762

>>102921755
that's not a fucking answer bitch

Anonymous
10/22/24(Tue)00:54:20 No.102921769

>>102921762
it is nigger

Anonymous
10/22/24(Tue)00:55:23 No.102921775

File: 1720618472117538.png (1.88 MB, 1152x896)

trying to recreate Breezewood, Pennsylvania

Anonymous
10/22/24(Tue)00:56:18 No.102921778

>>102921775
is that sana?

Anonymous
10/22/24(Tue)00:56:40 No.102921783

>>102921778
Flux

Anonymous
10/22/24(Tue)00:57:18 No.102921792

>>102921783
o

Anonymous
10/22/24(Tue)00:59:03 No.102921802

File: file.png (473 KB, 1024x1344)

Anonymous
10/22/24(Tue)01:02:34 No.102921828

>>102921073
>but practically isn't how any prompts or want to prompt
It's short, simple, and to the point. What do you mean?

Anonymous
10/22/24(Tue)01:04:30 No.102921847

File: file.png (1.23 MB, 1024x1024)

>>102921687
kek

Anonymous
10/22/24(Tue)01:07:54 No.102921870

>>102921847
yep... it's china time. i just hope the thing isn't so damn slow locally though

Anonymous
10/22/24(Tue)01:09:25 No.102921891

File: image (10).jpg (342 KB, 768x1280)

I don't understand. This is 1.01 cfg, 1.01 pag. Why does it look like I didn't touch the settings at all?

Anonymous
10/22/24(Tue)01:09:34 No.102921893

>>102921687
>they have a demo out now, github also says they plan on releasing the model.
so that's the new meta now? releasing the demo before the model? I mean it makes sense but I hate it being teased like that kek

Anonymous
10/22/24(Tue)01:10:19 No.102921896

>>102921891
what's your prompt

Anonymous
10/22/24(Tue)01:11:55 No.102921904

>>102921847
looks better than Sana, maybe this shit is the real deal

Anonymous
10/22/24(Tue)01:12:40 No.102921915

>>102921904
you funny guy

Anonymous
10/22/24(Tue)01:13:02 No.102921916

File: image.jpg (170 KB, 1024x1024)

>>102921687
>miku holding a sign that says "omnigen > sana"
uhhh sanabros? omnigen bros? what does this mean? And now I'm out of credits so I can't try other variations

Anonymous
10/22/24(Tue)01:13:10 No.102921917

>>102921904
there's also that whole built in controlnet thing but it keeps erroring out when i try it in the demo

Anonymous
10/22/24(Tue)01:14:13 No.102921920

>>102921916
china promotes equality

Anonymous
10/22/24(Tue)01:14:48 No.102921925

File: 1700159250618347.png (2.01 MB, 1152x896)

should I just train a lora? not sure if I can with my 8GB of vram

Anonymous
10/22/24(Tue)01:16:10 No.102921938

>ani is making a sdcpp gui
how do we get him over here to be our guy?

Anonymous
10/22/24(Tue)01:19:21 No.102921959

>demo erroring out
It's over

Anonymous
10/22/24(Tue)01:20:39 No.102921966

>>102921959
which one? the sana one or the omnigen one?

Anonymous
10/22/24(Tue)01:20:55 No.102921968

>>102921938
source?

Anonymous
10/22/24(Tue)01:21:34 No.102921971

>>102921968
>>102921169

Anonymous
10/22/24(Tue)01:21:43 No.102921974

File: file.png (595 KB, 1024x1024)

>blurry cctv footage of donald trumpy menacingly floating in the night sky, full moon behind his back
images look a lot cooler at cfg 1.1 and pag 1.1

Anonymous
10/22/24(Tue)01:24:43 No.102921991

>>102921966
sana

Anonymous
10/22/24(Tue)01:26:22 No.102922001

>two new models and a new gui coming
we eating good /ldg/

Anonymous
10/22/24(Tue)01:27:19 No.102922007

File: file.png (684 KB, 1024x1344)

>>102921974
>cfg 1.1 and pag 1.1
yep. CFG can go up to about three as well. the only reason for a higher PAG is if you're doing text.
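
For anyone wondering why a CFG of 1.0 or 1.1 barely steers the image: a minimal sketch of the standard classifier-free guidance mix, written in plain Python. This is the textbook formula, not Sana's actual sampler code; the function name `cfg_combine` and the toy numbers are made up for illustration, and the extra PAG term is not modeled here.

```python
def cfg_combine(uncond, cond, scale):
    """Standard classifier-free guidance on per-element predictions.

    result = uncond + scale * (cond - uncond)
    scale == 1.0 returns the conditional prediction unchanged;
    larger values push further away from the unconditional one.
    """
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


# Toy "noise predictions" (hypothetical values, not real model output).
uncond = [0.0, 1.0, -2.0]
cond = [1.0, 1.5, 0.0]

for scale in (1.0, 1.1, 3.0):
    print(scale, cfg_combine(uncond, cond, scale))
```

At a scale of 1.0 the unconditional branch cancels out entirely, so values just above 1.0 only nudge the result, while 3.0 triples the gap between the two predictions.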

Anonymous
10/22/24(Tue)01:27:21 No.102922009

>>102921893
It could be to garner free publicity. If they gain enough attention and some investor believes in the potential evolution of their alpha version, then they will release nothing to the open source community and shift the project towards a SaaS by scaling, optimizing, tuning, etc. If they gain no publicity and no investment, then the alpha will be released as a last resort, again, to get free publicity.

Anonymous
10/22/24(Tue)01:28:59 No.102922017

>>102921966
Both

Anonymous
10/22/24(Tue)01:32:04 No.102922034

so what now?

Anonymous
10/22/24(Tue)01:32:28 No.102922038

>>102922034
we gen

Anonymous
10/22/24(Tue)01:33:05 No.102922044

>>102922038
im depressed

Anonymous
10/22/24(Tue)01:33:55 No.102922049

>>102922044
its okay anon im here for you

Anonymous
10/22/24(Tue)01:35:10 No.102922057

>>102922049
*sob* *sob... uwaaaaahhhhhh... *hic*

Anonymous
10/22/24(Tue)01:35:42 No.102922060

>>102922034
Back to 1girl for you. Meanwhile I will go back to my drew threads and when that's done I will go back to jacking off to /v/ butt threads while you play pretend with the local models. I only show up for big events like flux, sana etc.

Anonymous
10/22/24(Tue)01:39:15 No.102922083

File: file.png (996 KB, 1024x1344)

Anonymous
10/22/24(Tue)01:42:29 No.102922098

>>102922060
thanks for stopping by

Anonymous
10/22/24(Tue)01:51:43 No.102922139

>>102921938
about a half year ago when he found out that C++ is hard. If he hasn't released he won't.

Anonymous
10/22/24(Tue)01:52:34 No.102922144

File: file.png (892 KB, 1024x1344)

Anonymous
10/22/24(Tue)02:00:29 No.102922188

https://huggingface.co/rhymes-ai/Allegro

babe wake up new text to video model just dropped

Anonymous
10/22/24(Tue)02:04:08 No.102922217

>>102922188
>Single GPU Memory Usage 9.3G BF16 (with cpu_offload)
>check downloads folder
text encoder is 19GB

Anonymous
10/22/24(Tue)02:06:25 No.102922230

>>102922217
>text encoder is 19GB
"architectures": [
"T5EncoderModel"
It's the classic T5_XXL, so we'll only be using the encoder part, which is roughly 9.2gb of vram
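
One possible reading of the two numbers in this exchange: a 19GB download and a roughly 9GB VRAM figure are what you would expect from the same encoder weights stored at 32-bit and loaded at 16-bit. A rough sketch of that arithmetic, assuming about 4.7 billion encoder parameters (an illustrative round figure, not read from the Allegro repo) and ignoring activations and other runtime overhead:

```python
# Weight memory is approximately parameter count times bytes per parameter.
# ASSUMPTION: ~4.7e9 parameters for the T5-XXL encoder (illustrative only).
ENCODER_PARAMS = 4.7e9

# Bytes used per parameter for a few common storage dtypes.
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "int8": 1}


def weight_gb(params: float, dtype: str) -> float:
    """Return weight size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_gb(ENCODER_PARAMS, dtype):.1f} GB")
```

Under that assumption the fp32 figure lands near the 19GB file size and the bf16 figure near the quoted VRAM estimate, which would mean the halving comes from precision and not from dropping a decoder.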

Anonymous
10/22/24(Tue)02:08:20 No.102922245

File: file.png (2.73 MB, 1730x1170)

>>102922188
>Apache 2.0
>6-second videos at 15 FPS with 720x1280 resolution
>175M parameter VideoVAE and a 2.8B parameter VideoDiT model
Pretty nice, I can feel it's a new local sota

Anonymous
10/22/24(Tue)02:10:09 No.102922257

New

>>102922252
>>102922252
>>102922252
