/g/ - Technology


Thread archived.
You cannot reply anymore.




File: officialZucImage.png (210 KB, 603x653)
>Llama 3.1 8B, 70B and 405B are now released on Hugging Face!

405B is beating ChatGPT-4o while being just half the parameter count, possibly because it isn't suffering the crippling effects of wokeness infusion.

Meanwhile the new 8 billion parameter model seems weaker in reasoning than the old 3.0 version, from being forced to learn thirdie languages.
>>
File: stats.jpg (250 KB, 1080x2025)
>>101535820
>>
>>101535820
What's the VRAM requirement for 405b?
>>
>>101536000
for Q4, about 200 GB, then more for context
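napkin math if you want to sanity-check it (weights only, nominal bits per param; real GGUF quants run a little heavier and the KV cache comes on top):

params = 405e9
for name, bits in [("Q8", 8), ("Q4", 4), ("Q3", 3)]:
    print(name, round(params * bits / 8 / 1e9), "GB")
# -> Q8 ~405 GB, Q4 ~202 GB, Q3 ~152 GB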
>>
>>101536000
one Mac Pro with M2 Ultra
say it
THANK YOU BASED APPLE
>>
>>101536000
Literally won't run regardless of what you change

go for the smaller models
>>
>>101535820
>he thinks the top llama model is uncensored
>he can't even run the foundation model without paying $200+ a month on runpod
>>
>>101535820
>open source llm and nn are the path forward
Second that. Just don't tell any retards. Work on it or wait for it, or both. It will come and it will be the best.
>>
>>101536329
>one Mac Pro with M2 Ultra
405B model, 192 GB Mac...
you're going to run it at Q3? Plus, even the absolute best M2 barely touches a 3070 in terms of performance.
>>
>>101535820
Mark looking extra zesty here
>>
>>101538815
that's just his new cool look
>>
What's the cheapest hardware that can run the bigger models? Doesn't need to be fast, just borderline usable. Do I have to become an itoddler?
>>
>>101535820
>405b
what kind of specs do you realistically need to run something like this at home? definitely not something a desktop can handle on its own, but how much hardware does a home server /g/tard need?
>>
>>101541866
something like 160 GB of VRAM, however you can manage that
>>
>>101543013
An H100 DGX has 640 GB of VRAM (8x 80 GB)
>>
>>101535820
Is that the one that was leaked ages ago?
Did anyone make anything with it?
>>
File: 20240723190031_2.jpg (445 KB, 2560x1440)
>>101535820
nothing released so far is even remotely cool. the DALL-E 3 shit was KINDA NEAT for like 3 months and it died out with how woke it fucking got after that first week of it being kino.
the whole AI thing is so shit. Llama 3 405B just came out and it's at best as good as GPT-4, lol. like where the fuck are the robots and the fucking AI that will take over people's jobs?!?!?
>>
>>101535820
I trust this motherfucker as far as I can throw him. What's the catch, Zuck? You want me to believe you're releasing a multi-million-dollar piece of software for purely altruistic reasons?
>>
>>101536189
>>101538988
>>101541866

Could I run something like this on, say, 128 lanes of AVX-512 at a tolerable token rate? Because I have access to a rig with nearly a TB of system RAM.

>>101545395
He's doing it to fuck over Microsoft. He wants a model for internal use, and has the hardware to train one because he was expecting to need it for the VR bullshit and the TikTok knockoff. It loses him very little to release it to everyone.
>>
>>101545395
To bring down Microsoft/OpenAI.

Which I suppose counts as altruistic reasons.
>>
But can it produce the filthiest smut imaginable with no content restriction?
>>
>>101545536
>>101545589
Nah, I think there's more to it than that.
Here are a few theories:

Get it integrated into everything then hold everyone hostage.

Facebook is on the decline, so he's all in on anything that will bring users.

And lastly, long term goals. If his company can steer or control responses, he can subtly influence users for political or financial gains.
>>
>>101535820
How are the 'scores' generated for rating these models?
>>101536000
Someone had a 4090 and used the 405B option and it took him 30 mins for it to generate the word 'The'.

I am currently trying to install the 8B version, but I'm a noob and only messed around with Stable Diffusion before this, so I'm just trying random shit the Meta AI site tells me when I use it for tech support. Right now I get an error because I need the llama library, but it can't install the llama library due to some error, and it's suggesting I download an older version of Python, but idk how to do that. I tried updating pip and installing the other dependencies too.

But I need to finish this challenge of getting it to run locally; I refuse to always use a website for it
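For reference, this is roughly the minimal transformers route I'm trying now instead of Meta's own llama package (the repo id is what the model card showed, might be off; you also have to accept the license on Hugging Face and log in with a token, and bf16 wants roughly 16 GB of VRAM for the weights alone):

import torch
from transformers import pipeline

# minimal sketch, not battle-tested: load the 8B instruct model and run one chat turn
pipe = pipeline(
    "text-generation",
    model="meta-llama/Meta-Llama-3.1-8B-Instruct",  # repo id as listed at release, check the card
    torch_dtype=torch.bfloat16,
    device_map="auto",  # needs the accelerate package; spills to CPU RAM if the GPU is too small (slow but runs)
)
out = pipe([{"role": "user", "content": "Say hello."}], max_new_tokens=64)
print(out[0]["generated_text"])  # prints the conversation including the model's reply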
>>
>>101545703
what os are you on?
>>
>>101545770
Windows of course, since I am a humble gamer dabbling in these things. I'm using Git Bash as a terminal, I guess, although I tried PowerShell to see if that would magically fix any errors, but it did not.
>>
>>101545536
>fuck over Microsoft.
Dare I say it, but, based?
>>
>>101535820
So it's "unsafe" and will have the hammer brought down on it behind the scenes.
>>
>>101545395
Free testing
>>
>>101535820
Sauce on 4o being 800B?
>>
>>101535820
Open source so that retards can provide cuckerberg with free work and data. Great idea.
>>
>>101535820
>405b is beating chatgpt-4o
no it doesn't, it's barely better than the now-decrepit ChatGPT-3
>>
File: 1702048985601082.png (19 KB, 800x1000)
>>101535820
zammmmmmmn zucky look like dat??
>>
File: E-mad Mostaque.jpg (289 KB, 2556x1438)
>>101546056
nobody works on Llama for free tho
only the Stable Diffusion corp managed that
>>
The Llama-3.1-8B is pretty damn good.

Like turbo level at least, which is great for local
>>
>>101547271
Did they up the context?
>>
>>101537268
>he thinks the top llama model is uncensored
Yikes. Fucking silicon valley cucks still complying with DEI.
>>
>>101538815
crazy what a good haircut can do.

>>101545395
"The enemy of my enemy is my friend" kind of reasoning.

>You want me to believe you're releasing a multi-million-dollar piece of software for purely altruistic reasons?
To be fair, they already released a multi-million-dollar piece of software as open source (PyTorch), so MAYBE he's not so evil...
>>
>>101544098
Nigga that shit costs almost HALF A MILLION DOLLARS
>>
>>101545536
define tolerable token rate
either way, it'll be way under 1 token/sec
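rough reasoning, numbers are guesses rather than benchmarks: decode on CPU is basically memory-bandwidth bound, since you stream the whole quantized model once per generated token

weights_gb = 200       # 405B at ~4 bits/param
mem_bw_gbs = 300       # generous aggregate DDR5 bandwidth for a dual-socket box
print(mem_bw_gbs / weights_gb, "tok/s ceiling")  # ~1.5 tok/s best case
# NUMA, prompt processing and never hitting peak bandwidth push the real number well below that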
>>
>>101535820
>405b
Man, I only get about 35 tok/s with 70B on a 4090... I can't imagine there'd be a person out there with the hardware to run the big one that fast.

>>101545703
>Someone had a 4090 and used the 405B option and it took him 30 mins for it to generate the word 'The'.
lol
>>
>>101545395
If I had to take a guess, cheap AI probably makes Facebook's core business more valuable. Maybe they're trying to boost the ecosystem to eventually use AI to offer professional shilling services, hellban people, or "reanimate" dead loved ones on Facebook for a fee
>>
>>101547287
I don't know, I only got a quickie out before I had to leave. I'll put it through its paces tomorrow though.

It's very coherent and seems to be uncensored.
>>
>>101536000
wouldn't a dual-socket CPU be reasonably fast and cheaper at that size? I doubt strapping 10x P80s together will be fast.
>>
>>101545703
just follow https://github.com/Mozilla-Ocho/llamafile?tab=readme-ov-file#using-llamafile-with-external-weights. It's the best way to test, instead of running all these shitty Python venvs that pull in half the ecosystem.
>>
File: file.png (44 KB, 993x376)
>>101547287
I think so
>>
File: file.png (72 KB, 1081x391)
why
>>
it sucks dick. it refuses to write anything useful because muh violence when there is nothing like that. I told it to analyze a poem and it shits itself about muh safeguards.
>>
>>101536189
>>101543013
So this is unreachable for consumers? I assume consumers access models running on data center GPUs remotely. But the cost and energy use must be enormous. What the fuck?
>>
>>101545395
To undercut OpenAI/Microsoft's dominance. OpenAI is setting themselves up to swallow the government contracts whole.
>>
>>101549008
Yeah corporate trannyism is in place. Surely its not your first time using a corporate model
>>
>>101547486
what are you, poor?
>>
>>101549220
no, just saving up for a catgirl loli local llama life size silicone stretable reusable pleasure AI sex robot
>>
File: 1716580915241820.jpg (133 KB, 1024x1024)
>>101549310
oh nevermind, go on then king
>>
>>101549120
it is. They advertise 128k context and after a couple prompts it starts spamming ##########"""""""""""
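to be fair, it can also be the backend rather than the model: 3.1 changed its RoPE scaling, so an out-of-date llama.cpp build falls apart long before 128k. quick llama-cpp-python sanity check (file name and context size are placeholders; needs a recent build and enough RAM for the KV cache):

from llama_cpp import Llama

llm = Llama(
    model_path="Meta-Llama-3.1-8B-Instruct-Q4_K_M.gguf",  # placeholder, point at your own GGUF
    n_ctx=32768,  # KV cache memory scales with this; 128k needs a lot more
)
out = llm("Summarize this poem:\n...", max_tokens=256)
print(out["choices"][0]["text"])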
>>
>>101535820
'berg be bussin right now, brah.
>>
File: ZuckyM.jpg (22 KB, 274x359)
>>101535820
Zucky M did it
>>
File: Gold bricks.jpg (2.41 MB, 2160x3840)
>>101547486
Just borrow one from work


