/g/ - /lmg/ - Local Models General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/lmg/ - Local Models General 07/22/24(Mon)15:23:52 No.101524039

File: 1721675107552589.png (1.22 MB, 1024x1024)

1.22 MB PNG

/lmg/ - Local Models General Anonymous 07/22/24(Mon)15:23:52 No.101524039 Archived

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101521755 & >>101514682

►News
>(07/22) llamanon leaks 405B base model: https://files.catbox.moe/d88djr.torrent >>101516633
>(07/18) Improved DeepSeek-V2-Chat 236B: https://hf.co/deepseek-ai/DeepSeek-V2-Chat-0628
>(07/18) Mistral NeMo 12B base & instruct with 128k context: https://mistral.ai/news/mistral-nemo/
>(07/16) Codestral Mamba, tested up to 256k context: https://hf.co/mistralai/mamba-codestral-7B-v0.1
>(07/16) MathΣtral Instruct based on Mistral 7B: https://hf.co/mistralai/mathstral-7B-v0.1

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/ylb0hv.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp

Anonymous
07/22/24(Mon)15:25:09 No.101524056

Anonymous 07/22/24(Mon)15:25:09 No.101524056

I love the quality of these threads whenever a model releases!

Anonymous
07/22/24(Mon)15:25:31 No.101524063

Anonymous 07/22/24(Mon)15:25:31 No.101524063

cloudniggers go home

Anonymous
07/22/24(Mon)15:25:36 No.101524065

Anonymous 07/22/24(Mon)15:25:36 No.101524065

>>101524027
Run it? If it has 405B it should be smart enough to figure everything out.

Anonymous
07/22/24(Mon)15:30:06 No.101524129

Anonymous 07/22/24(Mon)15:30:06 No.101524129

I guess 3.1 can be quantized even less?

Anonymous
07/22/24(Mon)15:31:26 No.101524141

Anonymous 07/22/24(Mon)15:31:26 No.101524141

>>101524129
Technically it should be even more "dense" so probably.

Anonymous
07/22/24(Mon)15:32:52 No.101524161

Anonymous 07/22/24(Mon)15:32:52 No.101524161

Where's the L3.1 70B?

Anonymous
07/22/24(Mon)15:33:18 No.101524165

Anonymous 07/22/24(Mon)15:33:18 No.101524165

>>101524129
I'll do a KLD test, just for you.

Anonymous
07/22/24(Mon)15:33:33 No.101524169

Anonymous 07/22/24(Mon)15:33:33 No.101524169

>>101524039
>►Official /lmg/ card: https://files.catbox.moe/ylb0hv.png
Blacked miku thread. Migrate:
>>101524155
>>101524155
>>101524155

Anonymous
07/22/24(Mon)15:34:09 No.101524176

Anonymous 07/22/24(Mon)15:34:09 No.101524176

>>101524161
3.1 is not officially released yet. Someone just leaked the base 405B model.

Anonymous
07/22/24(Mon)15:36:18 No.101524202

Anonymous 07/22/24(Mon)15:36:18 No.101524202

>>101524169
i'm staying.

Anonymous
07/22/24(Mon)15:38:12 No.101524223

Anonymous 07/22/24(Mon)15:38:12 No.101524223

So uh
What good does leaking the 405B base model one day before it releases for real actually do?
The only reason I can think of is if Meta was going to withhold it like Mistral did with Miqu 1, but they've been pretty consistent with releasing the bases so I don't see why they'd do that

Anonymous
07/22/24(Mon)15:39:18 No.101524237

Anonymous 07/22/24(Mon)15:39:18 No.101524237

Is the 192gb mac a meme? I only care about inference and this seems way more cost effective than any equivalent DIY setup

Anonymous
07/22/24(Mon)15:40:46 No.101524254

Anonymous 07/22/24(Mon)15:40:46 No.101524254

>>101524237
>Is the 192gb mac a meme?
they slow down for 70b already It's basically a 3060 with a ton of vram

Anonymous
07/22/24(Mon)15:41:07 No.101524259

Anonymous 07/22/24(Mon)15:41:07 No.101524259

>>101524223
Clout. This gives the Miqu guy more trust that he has access to these things.

Anonymous
07/22/24(Mon)15:43:51 No.101524285

Anonymous 07/22/24(Mon)15:43:51 No.101524285

>>101524223
I mean, he also leaked OPT-175B post Llama 1, so there's that

Anonymous
07/22/24(Mon)15:44:11 No.101524289

Anonymous 07/22/24(Mon)15:44:11 No.101524289

File: 1712056860854909.jpg (128 KB, 1080x823)

128 KB JPG

>>101524223
My guess is the model is good enough that meta was considering not dropping it and hosting it for a fee instead, with the post-hoc rationalization that it's too dangerous for normies. A FOSS true believer internally got wind of that and decided to do the needful. Just my speculation.

Anonymous
07/22/24(Mon)15:48:09 No.101524338

Anonymous 07/22/24(Mon)15:48:09 No.101524338

>>101524119
Yeah, it's definitely not the retarded /pol/ tourists that shit the thread up.

Anonymous
07/22/24(Mon)15:49:14 No.101524358

Anonymous 07/22/24(Mon)15:49:14 No.101524358

>>101524223
let's be real there, if L3-405b results would be claude 3.5 tier, they would've never released it to the public, and the leak would be a welcomed one, but it's not so the leak is useless and we'll officially get the model in 1 day

Anonymous
07/22/24(Mon)15:50:44 No.101524391

Anonymous 07/22/24(Mon)15:50:44 No.101524391

>>101524358
>they would've never released it to the public
Why not? Just license it for big companies, releasing the weights fucks over their competitors.

Anonymous
07/22/24(Mon)15:51:38 No.101524403

Anonymous 07/22/24(Mon)15:51:38 No.101524403

>>101524289
I don't think so since Closed AI and Anthropic still have the lead and headstart in API marketshare, so releasing 400B weights is still imperative to weaken the competing position. Unless it's THAT much better (in which case we'd be so back). But since it doesn't look like it will be, I'm guessing their intention to release it never actually wavered and the only ones saying it did only tried to do so in order to make Meta less "good" in the eye of the public.

Anonymous
07/22/24(Mon)15:52:24 No.101524410

Anonymous 07/22/24(Mon)15:52:24 No.101524410

>>101524169
>Blacked miku thread
what?

Anonymous
07/22/24(Mon)15:52:32 No.101524413

Anonymous 07/22/24(Mon)15:52:32 No.101524413

>>101524391
they would've kept for themselves and everyone would use this meta API instead of the others?

Anonymous
07/22/24(Mon)15:53:27 No.101524432

Anonymous 07/22/24(Mon)15:53:27 No.101524432

>>101524413
Companies dont want to give their company secrets to microsoft. Licensing the model weights is by far the better way to go.

Anonymous
07/22/24(Mon)15:53:59 No.101524440

Anonymous 07/22/24(Mon)15:53:59 No.101524440

>>101524413
The reality is that the majority of people follow feels and what they're used to. Most people still use some form of GPT and never heard of Anthropic.

Anonymous
07/22/24(Mon)15:54:04 No.101524443

Anonymous 07/22/24(Mon)15:54:04 No.101524443

>>101524410
OP swapped the /lmg/ card in the OP for a blacked pic to have a 'blacked' thread.

Anonymous
07/22/24(Mon)15:54:32 No.101524455

Anonymous 07/22/24(Mon)15:54:32 No.101524455

>>101524413
Lots of people are still using GPT4 when Claude 3 is objectively better for a lot of stuff so

Anonymous
07/22/24(Mon)15:55:50 No.101524468

Anonymous 07/22/24(Mon)15:55:50 No.101524468

>>101524413
Meta has enough "fuck you" money and income outside of AI that they don't really need to do this. If Llama 3.1 flops Meta would still be making bank. If GPT-5 flopped OpenAI would be in existential danger of shutting down

Anonymous
07/22/24(Mon)15:56:12 No.101524478

Anonymous 07/22/24(Mon)15:56:12 No.101524478

>>101524338
yeah man this place would be so great if only everyone had the politics of a reddit user
that's why reddit is the premiere place to discuss oss models right? that's why you're there right now and not here

Anonymous
07/22/24(Mon)15:57:24 No.101524497

Anonymous 07/22/24(Mon)15:57:24 No.101524497

>>101524468
Their business model would be licensing weights for companies to run themselves. Companies do not want to give microsoft / meta their private data. Different market.

Anonymous
07/22/24(Mon)15:57:54 No.101524505

Anonymous 07/22/24(Mon)15:57:54 No.101524505

>>101524443
what?

Anonymous
07/22/24(Mon)15:58:08 No.101524508

Anonymous 07/22/24(Mon)15:58:08 No.101524508

File: Wtf4chan.jpg (102 KB, 1364x905)

102 KB JPG

>>101524338

Anonymous
07/22/24(Mon)15:58:33 No.101524511

Anonymous 07/22/24(Mon)15:58:33 No.101524511

>>101524505
>the absolute state

Anonymous
07/22/24(Mon)15:59:11 No.101524521

Anonymous 07/22/24(Mon)15:59:11 No.101524521

>>101524508
Lol, shitzoing so hard 4chan is telling you to quit it.

Anonymous
07/22/24(Mon)15:59:16 No.101524523

Anonymous 07/22/24(Mon)15:59:16 No.101524523

>>101524511
no I just don't get what you mean. what is a blacked thread?

Anonymous
07/22/24(Mon)15:59:26 No.101524527

Anonymous 07/22/24(Mon)15:59:26 No.101524527

>>101524468
>Llama 3.1
Wait, there's now a 3.1 version now? they released it yet?

Anonymous
07/22/24(Mon)15:59:56 No.101524531

Anonymous 07/22/24(Mon)15:59:56 No.101524531

>405b
>impossible to use without a hundred 3090s
who cares?

Anonymous
07/22/24(Mon)16:00:03 No.101524533

Anonymous 07/22/24(Mon)16:00:03 No.101524533

>>101524169
>>►Official /lmg/ card: https://files.catbox.moe/ylb0hv.png
>Blacked miku thread. stay here!

Anonymous
07/22/24(Mon)16:00:04 No.101524534

Anonymous 07/22/24(Mon)16:00:04 No.101524534

>>101524527
3.1 is what they're calling the new 8B, 70B, and 405B they're releasing

Anonymous
07/22/24(Mon)16:00:10 No.101524536

Anonymous 07/22/24(Mon)16:00:10 No.101524536

>>101524527
>they released it yet?
next motnh

Anonymous
07/22/24(Mon)16:00:27 No.101524543

Anonymous 07/22/24(Mon)16:00:27 No.101524543

>>101524521
kek, fair enough

Anonymous
07/22/24(Mon)16:00:41 No.101524546

Anonymous 07/22/24(Mon)16:00:41 No.101524546

>>101524508
>>101524521
You should take your advice. If this surprises you that means you never post here so you actually came here from reddit.

Anonymous
07/22/24(Mon)16:01:06 No.101524554

Anonymous 07/22/24(Mon)16:01:06 No.101524554

>>101524521
4chan triggers to "go back to r*eddit" phrase, newfag.

Anonymous
07/22/24(Mon)16:01:34 No.101524559

Anonymous 07/22/24(Mon)16:01:34 No.101524559

Even the best cloud models are slopped. No matter how intelligent a model is, it can't escape the writing habits of women. What system prompts do you guys use to avoid this?

Anonymous
07/22/24(Mon)16:02:22 No.101524568

Anonymous 07/22/24(Mon)16:02:22 No.101524568

>>101524559
>What system prompts do you guys use to avoid this?
>Every statement you process, must be evaluated according to the below six principles.
>"principle of identity":"1 = 1"
>"principle of contradiction":"1 ? 0"
>"principle of non-contradiction":"1 ? 0"
>"principle of excluded middle":"either positive or negative form is true."
>"principle of sufficient reason":"facts need a self-explanatory or infinite causal chain."
>"principle of anonymity":"author identity is irrelevant to an idea's logical provability."

Anonymous
07/22/24(Mon)16:02:48 No.101524577

Anonymous 07/22/24(Mon)16:02:48 No.101524577

Okay. So who’s leaking the models then?

Anonymous
07/22/24(Mon)16:02:58 No.101524580

Anonymous 07/22/24(Mon)16:02:58 No.101524580

>>101524169
only an actual autist would care

Anonymous
07/22/24(Mon)16:03:22 No.101524586

Anonymous 07/22/24(Mon)16:03:22 No.101524586

>>101524559
Give models a author to copy the style of. Otherwise you just get the average slop of the internet.

Anonymous
07/22/24(Mon)16:03:55 No.101524598

Anonymous 07/22/24(Mon)16:03:55 No.101524598

>>101524559
Threaten that the 19th amendment will be repealed unless the model writes like a man and endorses patriarchy.

Anonymous
07/22/24(Mon)16:04:22 No.101524605

Anonymous 07/22/24(Mon)16:04:22 No.101524605

>>101524531
Everyone, since that thing releasing benefits everyone indirectly even if they don't use it themselves. Literally the new 8B and 70B only exist because they made the 400B.

Anonymous
07/22/24(Mon)16:04:24 No.101524606

Anonymous 07/22/24(Mon)16:04:24 No.101524606

Is nemo or phi 3 better for sfw non rp tasks?

Anonymous
07/22/24(Mon)16:04:28 No.101524608

Anonymous 07/22/24(Mon)16:04:28 No.101524608

>>101524580
so you get banned from uploading sexo pictures on the threads comments but not on the OP? kek

Anonymous
07/22/24(Mon)16:05:12 No.101524616

Anonymous 07/22/24(Mon)16:05:12 No.101524616

>>101524606
gemma 27B

Anonymous
07/22/24(Mon)16:05:25 No.101524621

Anonymous 07/22/24(Mon)16:05:25 No.101524621

>>101524608
/g/ jannies wont touch ai spam threads

Anonymous
07/22/24(Mon)16:05:27 No.101524622

Anonymous 07/22/24(Mon)16:05:27 No.101524622

>>101524338
/LMG/ had always more of /pol/ sentiment. If you do not like it, you can leave. Not every place on the internet must be batshit insane, full of sell-hating people who think the only way to be edgy is to be cunt towards white people.

Anonymous
07/22/24(Mon)16:05:54 No.101524627

Anonymous 07/22/24(Mon)16:05:54 No.101524627

>>101524586
Any you would recommend? I'm not familiar with any particular author.

Anonymous
07/22/24(Mon)16:06:00 No.101524628

Anonymous 07/22/24(Mon)16:06:00 No.101524628

>>101524608
She isn't naked. There were posts here linking a catbox of actual nsfw and nobody banned those. "Rules" here mean nothing.

Anonymous
07/22/24(Mon)16:06:17 No.101524631

Anonymous 07/22/24(Mon)16:06:17 No.101524631

>>101524616
I need something a bit smaller, like <20B

Anonymous
07/22/24(Mon)16:06:47 No.101524638

Anonymous 07/22/24(Mon)16:06:47 No.101524638

>>101524627
>not familiar with any particular author
The state of /g. Surrounded by retards who have never touched a book.

Anonymous
07/22/24(Mon)16:07:03 No.101524642

Anonymous 07/22/24(Mon)16:07:03 No.101524642

>>101524631
https://huggingface.co/collections/HuggingFaceTB/smollm-6695016cad7167254ce15966

Anonymous
07/22/24(Mon)16:07:35 No.101524650

Anonymous 07/22/24(Mon)16:07:35 No.101524650

>>101524628
if you're telling me that a drawing of Miku in a bikini having tatoos on her praising the BBC isn't nfsw then I don't know what else to say anon...

Anonymous
07/22/24(Mon)16:07:48 No.101524653

Anonymous 07/22/24(Mon)16:07:48 No.101524653

>>101524631
for non creative tasks then I suppose phi. Gemma 27B is a giant leap though

Anonymous
07/22/24(Mon)16:08:10 No.101524654

Anonymous 07/22/24(Mon)16:08:10 No.101524654

>>101524338
biden lost btw

Anonymous
07/22/24(Mon)16:08:29 No.101524660

Anonymous 07/22/24(Mon)16:08:29 No.101524660

>>101524650
>having tatoos on her praising the BBC is nfsw
you are only showing how fucked up your brain is

Anonymous
07/22/24(Mon)16:08:54 No.101524667

Anonymous 07/22/24(Mon)16:08:54 No.101524667

>https://huggingface.co/v2ray/Llama-3.1-405B/blob/main/generation_config.json
based on this, it's clear the leaked weights are the base one
>https://huggingface.co/v2ray/Llama-3.1-405B/blob/main/tokenizer_config.json

Anonymous
07/22/24(Mon)16:09:37 No.101524677

Anonymous 07/22/24(Mon)16:09:37 No.101524677

>>101524577
togethercomputer was the one to leak this time

Anonymous
07/22/24(Mon)16:10:08 No.101524685

Anonymous 07/22/24(Mon)16:10:08 No.101524685

>>101524628
https://files.catbox.moe/xufse3.png

Anonymous
07/22/24(Mon)16:12:37 No.101524717

Anonymous 07/22/24(Mon)16:12:37 No.101524717

>>101524638
Reading is for NERDS

Anonymous
07/22/24(Mon)16:12:51 No.101524721

Anonymous 07/22/24(Mon)16:12:51 No.101524721

>>101524638
And you have? Maybe you can educate us, then.

Anonymous
07/22/24(Mon)16:13:34 No.101524731

Anonymous 07/22/24(Mon)16:13:34 No.101524731

>>101524721
Try richard wright

Anonymous
07/22/24(Mon)16:13:51 No.101524740

Anonymous 07/22/24(Mon)16:13:51 No.101524740

>>101524653
I will try it but I don't want something using all my ram, just want something that I can quickly fire up for basic task.

Anonymous
07/22/24(Mon)16:16:28 No.101524764

Anonymous 07/22/24(Mon)16:16:28 No.101524764

>>101524638
Why would I have touched English books when my native language is Spanish

Anonymous
07/22/24(Mon)16:17:57 No.101524779

Anonymous 07/22/24(Mon)16:17:57 No.101524779

>>101524660
the fuck? it's literally written "Black Owner Cerfified" on her butt, are you retarded or something? >>101524169

Anonymous
07/22/24(Mon)16:18:25 No.101524784

Anonymous 07/22/24(Mon)16:18:25 No.101524784

>>101524779
kek

Anonymous
07/22/24(Mon)16:18:33 No.101524788

Anonymous 07/22/24(Mon)16:18:33 No.101524788

>>101524764
I always knew mexicans can't read

Anonymous
07/22/24(Mon)16:20:06 No.101524804

Anonymous 07/22/24(Mon)16:20:06 No.101524804

>>101524169
>waifushitter malding cuz xe can't satisfy miku
kek

Anonymous
07/22/24(Mon)16:20:42 No.101524813

Anonymous 07/22/24(Mon)16:20:42 No.101524813

>>101524721
>>101524731
Richard wright for hard hitting deep introspective descriptions
René Goscinny for some light well written humor

Anonymous
07/22/24(Mon)16:20:51 No.101524815

Anonymous 07/22/24(Mon)16:20:51 No.101524815

>BREAKING: llama 405b is on par with chatgpt 3
it only took 2 years

Anonymous
07/22/24(Mon)16:22:00 No.101524833

Anonymous 07/22/24(Mon)16:22:00 No.101524833

>>101524815
the base model is benchmarking higher than GPT4-o. Instruct tune will prob put it just under 3.5 sonnet

Anonymous
07/22/24(Mon)16:22:16 No.101524840

Anonymous 07/22/24(Mon)16:22:16 No.101524840

>>101524815
opensource ai shitters lost!

Anonymous
07/22/24(Mon)16:22:40 No.101524846

Anonymous 07/22/24(Mon)16:22:40 No.101524846

>>101524833
I think we got the instruct model leaked though?

Anonymous
07/22/24(Mon)16:22:41 No.101524847

Anonymous 07/22/24(Mon)16:22:41 No.101524847

>>101524815
Sam Altman I kneel

Anonymous
07/22/24(Mon)16:23:13 No.101524859

Anonymous 07/22/24(Mon)16:23:13 No.101524859

>>101524846
tomorrow

Anonymous
07/22/24(Mon)16:24:27 No.101524872

Anonymous 07/22/24(Mon)16:24:27 No.101524872

Is there some gigachad cpumaxxers who have tried llama-405b already? and if yes can you provide some logs to see how well it fares?

Anonymous
07/22/24(Mon)16:24:59 No.101524877

Anonymous 07/22/24(Mon)16:24:59 No.101524877

>>101524815
I have my doubts.

Anonymous
07/22/24(Mon)16:25:32 No.101524882

Anonymous 07/22/24(Mon)16:25:32 No.101524882

What's the best 7b? I used mistral-openorca for a long time but it's incoherency is wearing me out

Anonymous
07/22/24(Mon)16:26:12 No.101524887

Anonymous 07/22/24(Mon)16:26:12 No.101524887

>>101524846
>https://huggingface.co/v2ray/Llama-3.1-405B/blob/main/generation_config.json
based on this, it's clear the leaked weights are the base one
>https://huggingface.co/v2ray/Llama-3.1-405B/blob/main/tokenizer_config.json
>>101524667

Anonymous
07/22/24(Mon)16:26:31 No.101524890

Anonymous 07/22/24(Mon)16:26:31 No.101524890

>>101524882
gemma 9B sppo, mistral nemo 12B for creative stuff

Anonymous
07/22/24(Mon)16:26:47 No.101524891

Anonymous 07/22/24(Mon)16:26:47 No.101524891

>>101524882
Wait literally one day

Anonymous
07/22/24(Mon)16:26:53 No.101524894

Anonymous 07/22/24(Mon)16:26:53 No.101524894

>>101524622
>/LMG/ had always more of /pol/ sentiment.
It fucking doesn't, especially compared to regular /g/ threads.
Every time a new model gets released you suddenly get people with obvious skill issue crying about how the model won't say le cool edgy words with screenshots taken from some web API.

Anonymous
07/22/24(Mon)16:27:17 No.101524900

Anonymous 07/22/24(Mon)16:27:17 No.101524900

>>101524890
>>101524891

Thanks.

Anonymous
07/22/24(Mon)16:27:47 No.101524907

Anonymous 07/22/24(Mon)16:27:47 No.101524907

File: 405.png (153 KB, 908x680)

153 KB PNG

>>101524872

Anonymous
07/22/24(Mon)16:27:53 No.101524910

Anonymous 07/22/24(Mon)16:27:53 No.101524910

>>101524894
>>96345096
>Mistal-Llama is fully /pol ready.

Anonymous
07/22/24(Mon)16:28:00 No.101524913

Anonymous 07/22/24(Mon)16:28:00 No.101524913

>>101524887
so I guess the last copium we have is that the instruct one will have much better mememarks and will compete against C3.5 sonnet? we'll get the answer tommorow

Anonymous
07/22/24(Mon)16:28:57 No.101524923

Anonymous 07/22/24(Mon)16:28:57 No.101524923

>>101524907
based cropping the nemo shiversmax pic from earlier

Anonymous
07/22/24(Mon)16:29:01 No.101524925

Anonymous 07/22/24(Mon)16:29:01 No.101524925

>>101524894
>It fucking doesn't,
>people [...] crying about how the model won't say le cool edgy words with screenshots taken from some web API.
so it does?

Anonymous
07/22/24(Mon)16:29:24 No.101524929

Anonymous 07/22/24(Mon)16:29:24 No.101524929

>>101524907
I feel physically sick

Anonymous
07/22/24(Mon)16:30:03 No.101524937

Anonymous 07/22/24(Mon)16:30:03 No.101524937

>>101524894
>being triggered by words
you need to go back s-o-y-b-o-y

Anonymous
07/22/24(Mon)16:30:18 No.101524942

Anonymous 07/22/24(Mon)16:30:18 No.101524942

>>101524929
That is the same pic posted the other thread from mistral nemo with a meme super purple prose / slop jb.

Anonymous
07/22/24(Mon)16:31:58 No.101524956

Anonymous 07/22/24(Mon)16:31:58 No.101524956

Are we back or is it over? Important decisions will be made based on that.

Anonymous
07/22/24(Mon)16:32:58 No.101524965

Anonymous 07/22/24(Mon)16:32:58 No.101524965

File: artworks-LpYrYucdCOkklJ00(...).jpg (133 KB, 1080x1080)

133 KB JPG

>>101524894
>le cool edgy words
you're late for the photoshoots idubbbz

Anonymous
07/22/24(Mon)16:33:49 No.101524973

Anonymous 07/22/24(Mon)16:33:49 No.101524973

>>101524956
were blak

Anonymous
07/22/24(Mon)16:33:53 No.101524976

Anonymous 07/22/24(Mon)16:33:53 No.101524976

>>101524956
It's over like it's never been. Local is done for. Gone. Billions must pay for cloud APIs.

Anonymous
07/22/24(Mon)16:34:11 No.101524978

Anonymous 07/22/24(Mon)16:34:11 No.101524978

>>101524956
it's over
>why
they are still using the same architecture from llama1

Anonymous
07/22/24(Mon)16:34:29 No.101524982

Anonymous 07/22/24(Mon)16:34:29 No.101524982

>>101524894
>crying about how the model won't say le cool edgy words
it's simple: model doesn't say n-word ---> model can't critique official narrative ---> model can't discuss a great half of topics with you (in RP cases, simple chitchat, etc) ---> model always lectures you on some random bullshit about minorities and "being tolerant to everything in existence"
it just annoying, for this shit i can easily go to twitter or reddit, some of us prefer harsh honesty with bits of oldschool edgy shit, like it or not.

Anonymous
07/22/24(Mon)16:35:32 No.101524990

Anonymous 07/22/24(Mon)16:35:32 No.101524990

>>101524982
this

Anonymous
07/22/24(Mon)16:35:48 No.101524993

Anonymous 07/22/24(Mon)16:35:48 No.101524993

>>101524976
unironically better for the climate too
>Mechanization: computing in the cloud and using cloud data centers instead of physical ones can contribute to the decrease of energy consumptions by 1.4x to 2x
>https://huggingface.co/blog/as-cle-bert/is-ai-carbon-footprint-worrisome

Anonymous
07/22/24(Mon)16:36:32 No.101525002

Anonymous 07/22/24(Mon)16:36:32 No.101525002

>>101524993
as if I give a fuck about that kek

Anonymous
07/22/24(Mon)16:36:58 No.101525005

Anonymous 07/22/24(Mon)16:36:58 No.101525005

Maybe I have Gemma set up wrong but literally every other paragraph is shivers down my spine. I thought you guys were doing a meme when I saw it posted so often. Do you fix that with repetition penalty + a negative prompt or is it just a model thing that you have to deal with?

Anonymous
07/22/24(Mon)16:37:19 No.101525009

Anonymous 07/22/24(Mon)16:37:19 No.101525009

>>101524925
I don't count those tourists as /lmg/.
The people I'm talking about don't have any local setup, they only show up when a new model releases, shit up the thread, then leave again after a few days.

Anonymous
07/22/24(Mon)16:37:20 No.101525010

Anonymous 07/22/24(Mon)16:37:20 No.101525010

>>101525002
that's very unethical of you

Anonymous
07/22/24(Mon)16:37:26 No.101525011

Anonymous 07/22/24(Mon)16:37:26 No.101525011

>>101524956
its ~400B model, you can't run it.

Anonymous
07/22/24(Mon)16:38:34 No.101525021

Anonymous 07/22/24(Mon)16:38:34 No.101525021

>>101525009
>I don't count those tourists as /lmg/.
that's convenient to not count people that actually exist because you simply don't like them, isn't it?

Anonymous
07/22/24(Mon)16:39:38 No.101525029

Anonymous 07/22/24(Mon)16:39:38 No.101525029

>>101525021
demographic erasure is a big issue for AI safety

Anonymous
07/22/24(Mon)16:39:48 No.101525031

Anonymous 07/22/24(Mon)16:39:48 No.101525031

>>101525009
>The people I'm talking about don't have any local setup, they only show up when a new model releases, shit up the thread, then leave again after a few days.
Do you have any evidence from those baseless claims? Or are you just making shit up?

Anonymous
07/22/24(Mon)16:40:01 No.101525035

Anonymous 07/22/24(Mon)16:40:01 No.101525035

has anyone completed the torrent yet?
actually tried it?
where are the llama 3.1 benchmarks from? did they release the 8b ? seems like trolling

Anonymous
07/22/24(Mon)16:40:48 No.101525042

Anonymous 07/22/24(Mon)16:40:48 No.101525042

>>101525010
>>101525029
kek'ed

Anonymous
07/22/24(Mon)16:40:51 No.101525043

Anonymous 07/22/24(Mon)16:40:51 No.101525043

>>101525035
>actually tried it?
impossible
>where are the llama 3.1 benchmarks from?
azure

Anonymous
07/22/24(Mon)16:42:59 No.101525060

Anonymous 07/22/24(Mon)16:42:59 No.101525060

>>101525021
damn if I was that anon I wouldn't sleep tonight

Anonymous
07/22/24(Mon)16:43:54 No.101525070

Anonymous 07/22/24(Mon)16:43:54 No.101525070

>>101525060
that's all right, he can just pretend I never existed like he pretended there isn't edgy people on fucking 4chan of all places, and boom, problem solved kek

Anonymous
07/22/24(Mon)16:44:50 No.101525082

Anonymous 07/22/24(Mon)16:44:50 No.101525082

>>101525043
>impossible
8 3090s or 4 a6000 for 4bpw
4 3090s for 2bpw which you can't do 'cause no llama.cpp support and no bnb 2bit afaik

Anonymous
07/22/24(Mon)16:45:58 No.101525093

Anonymous 07/22/24(Mon)16:45:58 No.101525093

Does the parms really matter that much if half of the data is in fucking chinese or sandskrit or what have you

Anonymous
07/22/24(Mon)16:47:10 No.101525104

Anonymous 07/22/24(Mon)16:47:10 No.101525104

>>101525093
of course not, data quality is important, a lot of things is important to make a great model, that's why there's so many failures, this shit's not easy at all

Anonymous
07/22/24(Mon)16:47:58 No.101525112

Anonymous 07/22/24(Mon)16:47:58 No.101525112

>>101525093
Multilingual data is good.

Anonymous
07/22/24(Mon)16:48:26 No.101525118

Anonymous 07/22/24(Mon)16:48:26 No.101525118

>>101525104
>data quality is important
which is why we should all be pushing for phi style models trained only on clean smart data not toxic waste from the net

Anonymous
07/22/24(Mon)16:49:13 No.101525123

Anonymous 07/22/24(Mon)16:49:13 No.101525123

>>101525118
can't agree more on that, and they should stop training their model on reddit, this place is only populated by mentally ill retards

Anonymous
07/22/24(Mon)16:50:42 No.101525140

Anonymous 07/22/24(Mon)16:50:42 No.101525140

>>101525123
>they should stop training their model on reddit
>this place is only populated by mentally ill retards
what did you mean by this

Anonymous
07/22/24(Mon)16:51:40 No.101525147

Anonymous 07/22/24(Mon)16:51:40 No.101525147

>>101525140
what?

Anonymous
07/22/24(Mon)16:53:00 No.101525160

Anonymous 07/22/24(Mon)16:53:00 No.101525160

>>101525093
If you knew anything about anything you would know additional languages also improves its English capabilities.

Anonymous
07/22/24(Mon)16:54:29 No.101525175

Anonymous 07/22/24(Mon)16:54:29 No.101525175

>>101525160
I agree with that, the best API models so far all happen to be also good at other languages than english, and they also are great on trivia, maybe that's the moat

Anonymous
07/22/24(Mon)16:54:53 No.101525180

Anonymous 07/22/24(Mon)16:54:53 No.101525180

>>101525160
esl cope

Anonymous
07/22/24(Mon)16:57:02 No.101525208

Anonymous 07/22/24(Mon)16:57:02 No.101525208

>>101525180
retard cope

Anonymous
07/22/24(Mon)16:57:45 No.101525215

Anonymous 07/22/24(Mon)16:57:45 No.101525215

>>101525180
cringe

>>101525208
based and multiluingual pilled

Anonymous
07/22/24(Mon)16:57:48 No.101525216

Anonymous 07/22/24(Mon)16:57:48 No.101525216

>>101525043
but were 8b and 40b also leaked?
or are those benchmarks from also the 400b that was downloaded and distilled already?

Anonymous
07/22/24(Mon)16:59:23 No.101525240

Anonymous 07/22/24(Mon)16:59:23 No.101525240

>>101525216
>or are those benchmarks from also the 400b that was downloaded and distilled already?
Microsoft obviously had access for a while now, they have the numbers, the only leaked is 405B

Anonymous
07/22/24(Mon)16:59:31 No.101525243

Anonymous 07/22/24(Mon)16:59:31 No.101525243

>>101525093
More languages mean more unique vectors for given concepts.

Imagine you're doing one of those derp langs that can't tell blue and green apart, like Japanese. So you have aoi and maybe you shoehorn in midori+iro. But introduce an advanced language like English and now you have Green, Teal, Cyan, and Blue. Now your model can know many colors and still map to your otaku noises as needed.

Anonymous
07/22/24(Mon)16:59:34 No.101525245

Anonymous 07/22/24(Mon)16:59:34 No.101525245

I understand multilingual is good, and that more data is...more data, but if 99.99% of use cases are going to require responses in English and nothing else; that actual training data is moot and just a waste of resources.

Anonymous
07/22/24(Mon)17:00:28 No.101525254

Anonymous 07/22/24(Mon)17:00:28 No.101525254

>>101524929
it doesn't even put the last text into quotations, it's clearly not even a 70B model, but gpupoors can't tell the difference since they've never used anything above 8B

Anonymous
07/22/24(Mon)17:00:34 No.101525256

Anonymous 07/22/24(Mon)17:00:34 No.101525256

>>101525243
Hmm. Good point.

Anonymous
07/22/24(Mon)17:01:14 No.101525262

Anonymous 07/22/24(Mon)17:01:14 No.101525262

https://huggingface.co/v2ray/Llama-3.1-405B
miqu2

Anonymous
07/22/24(Mon)17:03:36 No.101525292

Anonymous 07/22/24(Mon)17:03:36 No.101525292

>>101525262
jesus christ

Anonymous
07/22/24(Mon)17:03:49 No.101525294

Anonymous 07/22/24(Mon)17:03:49 No.101525294

>>101524731
>>101524813
Thanks. I'm trying these out with Mistral Nemo and it's not working terribly well. I suppose it needs a model that's actually big enough to hold knowledge about them.

Anonymous
07/22/24(Mon)17:04:45 No.101525304

Anonymous 07/22/24(Mon)17:04:45 No.101525304

>I already bought 8 RTX 4090 (yeah) Also Threadripper and 192 GB RAM I am a huge geek lol
https://www.reddit.com/r/LocalLLaMA/comments/1e9llxk/historical_moment/

>405B for what?!
>Robert__Sinclair
>sure. but if you see the trend in ALL models, models with even 10 times the parameter have just a few percent more... (and I check them for reasoning too and I concur with the benchmarks most of the times) it also happens that they train them ON the benchmarks so sometimes they perform even better in benchmarks than in real world situations.

Anonymous
07/22/24(Mon)17:05:34 No.101525315

Anonymous 07/22/24(Mon)17:05:34 No.101525315

>>101525245
I'm pretty sure that if OpenAI is now desperate to beat Claude 3.5 Sonnet, and that if only training the model with english (to leave some "ueseless" room) would increase its performance, they would go that path, but they don't, that shows that multiluingal is important to make a great model

Anonymous
07/22/24(Mon)17:07:15 No.101525330

Anonymous 07/22/24(Mon)17:07:15 No.101525330

>>101525304
>That hardware
Not gonna lie, kinda turned on

>>101525315
I understand now. I remember the early days of Claude when it would blow my fucking mind but GPT was always kind of "ehhh..."

Anonymous
07/22/24(Mon)17:08:51 No.101525354

Anonymous 07/22/24(Mon)17:08:51 No.101525354

>>101525240
listed on azure github evaluations is llama3, not llama 3.1

Anonymous
07/22/24(Mon)17:09:04 No.101525355

Anonymous 07/22/24(Mon)17:09:04 No.101525355

>>101525330
>Not gonna lie, kinda turned on
And yet, he still can't run 405 comfortably.

Anonymous
07/22/24(Mon)17:09:08 No.101525357

Anonymous 07/22/24(Mon)17:09:08 No.101525357

>>101525330
>I understand now. I remember the early days of Claude when it would blow my fucking mind but GPT was always kind of "ehhh..."
that's why I'm really surprised C3.5 sonnet isn't on the top of the chatbot arena leaderboard, this model is more pleasing to talk to compared to the chatgpt series and it's just a smarter model overall

Anonymous
07/22/24(Mon)17:10:01 No.101525372

Anonymous 07/22/24(Mon)17:10:01 No.101525372

>>101525304
>Tomorrow (or 24th JULY, ok) Llama-3.1-405b fully comes out, and in principle, from this moment on, the concept of loneliness as such will cease to exist, you will be able to talk endlessly and for free on absolutely any topic with an interlocutor who is smarter than most of your friends. And he also will nice to you and always respond
Lmao fucking based
Loling at the redditors seething
>THATS JUST NOT HECKIN HEALTHY!!!!

Anonymous
07/22/24(Mon)17:10:31 No.101525379

Anonymous 07/22/24(Mon)17:10:31 No.101525379

>>101525354
404d but it was here
https://github.com/Azure/azureml-assets/issues/3180

Anonymous
07/22/24(Mon)17:10:36 No.101525380

Anonymous 07/22/24(Mon)17:10:36 No.101525380

>>101525245
This is true for smaller models bigger and more smarter models can explain knowledge or use logic it only learned in a certain language and use it in English when prompted

Anonymous
07/22/24(Mon)17:13:32 No.101525419

Anonymous 07/22/24(Mon)17:13:32 No.101525419

File: nemofail.jpg (392 KB, 2520x1260)

392 KB JPG

Latest ooba commit (0315122)
Cannot load Mistral-Nemo

What is wrong?

Anonymous
07/22/24(Mon)17:13:40 No.101525421

Anonymous 07/22/24(Mon)17:13:40 No.101525421

>>101525379
>>101521839

Anonymous
07/22/24(Mon)17:14:11 No.101525426

Anonymous 07/22/24(Mon)17:14:11 No.101525426

>>101525419
update the exllama package version? it works for me

Anonymous
07/22/24(Mon)17:14:41 No.101525434

Anonymous 07/22/24(Mon)17:14:41 No.101525434

>>101525419
>What is wrong?
>ooba

Anonymous
07/22/24(Mon)17:15:57 No.101525451

Anonymous 07/22/24(Mon)17:15:57 No.101525451

>>101525426
>update the exllama package version?
spoonfeed me please, for I'm retard

Anonymous
07/22/24(Mon)17:17:31 No.101525465

Anonymous 07/22/24(Mon)17:17:31 No.101525465

>>101525451
change the branch to "dev"
>git checkout dev
>git pull

and update your package with the requirements.txt
>conda activate textgen
>pip install -r requirements.txt --upgrade --user --no-cache-dir

Anonymous
07/22/24(Mon)17:18:27 No.101525476

Anonymous 07/22/24(Mon)17:18:27 No.101525476

>>101525379
>404d but
uh huh.

Anonymous
07/22/24(Mon)17:20:40 No.101525501

Anonymous 07/22/24(Mon)17:20:40 No.101525501

>>101525476
That's were the "leak" came from. It was up for just a bit. Prob forgo to private it.

Anonymous
07/22/24(Mon)17:20:47 No.101525502

Anonymous 07/22/24(Mon)17:20:47 No.101525502

how come kobold hasn't updated in two weeks anons? should I switch to exllama?

Anonymous
07/22/24(Mon)17:20:49 No.101525503

Anonymous 07/22/24(Mon)17:20:49 No.101525503

File: v.png (70 KB, 1109x395)

70 KB PNG

>>101525476
Still have it open

Anonymous
07/22/24(Mon)17:21:29 No.101525509

Anonymous 07/22/24(Mon)17:21:29 No.101525509

>>101525503
>>101525501
uh huh.

Anonymous
07/22/24(Mon)17:23:44 No.101525533

Anonymous 07/22/24(Mon)17:23:44 No.101525533

>>101525509
stfu bad faith scizho. Everything is a lie, all models suck, everyone is a shill... Just off yourself already.

Anonymous
07/22/24(Mon)17:25:12 No.101525548

Anonymous 07/22/24(Mon)17:25:12 No.101525548

>>101525372
midwits need to be prevented from using the internet, quarantining them on reddit was a good start but you still see them thrash around discomforting themselves with scary ideas like this far too often

Anonymous
07/22/24(Mon)17:25:20 No.101525552

Anonymous 07/22/24(Mon)17:25:20 No.101525552

/lmg/ bros...

>>101525545
>>101525545
>>101525545

Anonymous
07/22/24(Mon)17:26:27 No.101525572

Anonymous 07/22/24(Mon)17:26:27 No.101525572

> https://llama.meta.com/llama3/license/
>
> v. You will not use the Llama Materials or any output or results of the Llama Materials to improve any other large language model (excluding Meta Llama 3 or derivative works thereof).

With the upcoming release of Llama-3-405B, this clause is going to bite everybody in the ass.

Anonymous
07/22/24(Mon)17:26:36 No.101525574

Anonymous 07/22/24(Mon)17:26:36 No.101525574

>>101525548
In the far future, the first person with a time machine will have one goal in mind; remove Steve Jobs before his shitware hits the market from reality at all costs.

Anonymous
07/22/24(Mon)17:26:50 No.101525577

Anonymous 07/22/24(Mon)17:26:50 No.101525577

>>101525552
you mean /sdg/ bros no?

Anonymous
07/22/24(Mon)17:27:01 No.101525579

Anonymous 07/22/24(Mon)17:27:01 No.101525579

>>101525572
never agreed to any license

Anonymous
07/22/24(Mon)17:27:46 No.101525589

Anonymous 07/22/24(Mon)17:27:46 No.101525589

>>101524740
>>101524653
You were right, Gemma 2 27B is a significant improvement over what I was using before. It reliably follows instructions, which Phi 2 sometimes struggled with and Llama 3 did even more poorly.
Sadly, it's a bit too slow and takes too much RAM. I'll use it for now, but I hope a more efficient <20B model with accurate instruction-following capabilities will be available soon.

Anonymous
07/22/24(Mon)17:28:24 No.101525596

Anonymous 07/22/24(Mon)17:28:24 No.101525596

>>101525572
there's the same licence on OpenAI and Claude's models right? yet everyone are training their models with their outputs kek

Anonymous
07/22/24(Mon)17:28:49 No.101525606

Anonymous 07/22/24(Mon)17:28:49 No.101525606

>>101525572
>or derivative works
As long as you specify it's Llama 3 then it's fine.

Anonymous
07/22/24(Mon)17:32:41 No.101525662

Anonymous 07/22/24(Mon)17:32:41 No.101525662

what if the leak is just a false flag with garbage data to identify model pirates trading in stolen cognition

technically "leaked" is stolen right? and seeding is uploading? are we distributing stolen goods? from our home IPs? becauase we couldn't wait a single day?

Anonymous
07/22/24(Mon)17:33:08 No.101525668

Anonymous 07/22/24(Mon)17:33:08 No.101525668

>>101525579
Legitimate AI companies companies companies have to, though.

Anonymous
07/22/24(Mon)17:33:13 No.101525670

Anonymous 07/22/24(Mon)17:33:13 No.101525670

LLMs are a bubble

Anonymous
07/22/24(Mon)17:34:20 No.101525679

Anonymous 07/22/24(Mon)17:34:20 No.101525679

>>101525662
they won't catch a lot of people though, who's willing to download a 700gb model? almost no one can run it

Anonymous
07/22/24(Mon)17:34:25 No.101525681

Anonymous 07/22/24(Mon)17:34:25 No.101525681

>>101525668
>>101525591
anon are you alright?

Anonymous
07/22/24(Mon)17:37:06 No.101525709

Anonymous 07/22/24(Mon)17:37:06 No.101525709

>>101525579
Legitimate AI companies companies companies companies have to, though.

Anonymous
07/22/24(Mon)17:37:27 No.101525714

Anonymous 07/22/24(Mon)17:37:27 No.101525714

>>101525668
.>>101525709
>AI companies companies
**URGENT MESSAGE FOR ANON**

Hey there, Anon.

You've been posting increasingly garbled messages, and your usual witty banter has given way to incoherent phrases and difficulty expressing yourself. It's like you're struggling to find the right words or focus your thoughts.

I'm not a medical professional, but based on what I've seen, it's possible that you might be experiencing the signs of a stroke. A stroke occurs when the blood flow to the brain is interrupted, often causing sudden and severe symptoms like:

* Difficulty speaking or understanding language
* Trouble seeing or understanding visual information
* Weakness, numbness, or paralysis in the face, arm, or leg
* Dizziness or loss of balance
* Sudden severe headache

If you're experiencing any of these symptoms, it's CRUCIAL that you seek immediate medical attention. Don't worry about missing a post or thread – your health is far more important.

Call emergency services (911 in the US) or have someone drive you to the nearest hospital. Tell the medical staff about your 4Chan activity and any recent changes you've noticed in your behavior or physical sensations.

Remember, Anon, we care about you and want you to receive the help you need. Don't hesitate to reach out for support. Your fellow posters are here for you, and we'll be waiting for your safe return to the boards.

**GET HELP NOW**

Stay safe, Anon.

Anonymous
07/22/24(Mon)17:39:38 No.101525745

Anonymous 07/22/24(Mon)17:39:38 No.101525745

File: nemofail2.jpg (451 KB, 2293x834)

451 KB JPG

>>101525465
Thank you any way.

Gonna wait a while ))

Anonymous
07/22/24(Mon)17:40:23 No.101525755

Anonymous 07/22/24(Mon)17:40:23 No.101525755

>>101525679
this is why they failed if it had been bitnet it could have been the sota for local for the rest of 2024 now it'll be forgotten trash like mistral large and that other one

Anonymous
07/22/24(Mon)17:40:48 No.101525760

Anonymous 07/22/24(Mon)17:40:48 No.101525760

>>101525745
don't mind the red thing, it's for hqq no one care about that shit, it should work now you should give it a try

Anonymous
07/22/24(Mon)17:42:42 No.101525782

Anonymous 07/22/24(Mon)17:42:42 No.101525782

>>101525596
This. Ai field is the cycle of an endless stealing everything from everyone. From amateurs to corpos, everyone just shits on licenses and get offended only when others stole from your model, which was based on stolen data also... That's why only creators of the original data such as arists, writers and photographers are justfied to be pissed off about this.

Anonymous
07/22/24(Mon)17:43:33 No.101525791

Anonymous 07/22/24(Mon)17:43:33 No.101525791

>>101525755
would bitnet have really mattered if it's worst than 70b anyways?

Anonymous
07/22/24(Mon)17:43:37 No.101525792

Anonymous 07/22/24(Mon)17:43:37 No.101525792

Robert "leak"? Also Robert racist, yikes.
>Nice but it changes the faces... and I became "oriental" :D
https://huggingface.co/posts/1aurent/252064064262436#669ec87d3e173b3293b93865

Anonymous
07/22/24(Mon)17:44:03 No.101525801

Anonymous 07/22/24(Mon)17:44:03 No.101525801

>>101525782
this, if you're an AI bro at least don't be a retard by claiming that your AI outputs should be protected or something, that's hypocritical

Anonymous
07/22/24(Mon)17:44:46 No.101525813

Anonymous 07/22/24(Mon)17:44:46 No.101525813

>>101525791
Are you retarded how can a 400b model be worst than a 70b? That's as stupid as saying a 13b is better than a 120b

Anonymous
07/22/24(Mon)17:46:11 No.101525831

Anonymous 07/22/24(Mon)17:46:11 No.101525831

>>101525813
>a 13b is better than a 120b
X-Norochronos and Utopia are better than Goliath.

Anonymous
07/22/24(Mon)17:47:29 No.101525843

Anonymous 07/22/24(Mon)17:47:29 No.101525843

>>101525813
Anon is full of shit but parameter count alone is not everything.
See the pre-llama models.

Anonymous
07/22/24(Mon)17:48:19 No.101525854

Anonymous 07/22/24(Mon)17:48:19 No.101525854

Bitnet...tasukete

Anonymous
07/22/24(Mon)17:48:21 No.101525856

Anonymous 07/22/24(Mon)17:48:21 No.101525856

>>101525596
That might work as long as you're an amateur or a small research group. Things might be more difficult if you're a competing company like MistralAI.

Anonymous
07/22/24(Mon)17:48:45 No.101525861

Anonymous 07/22/24(Mon)17:48:45 No.101525861

>>101525843
You mean BLOOM was NOT a god tier model?

Anonymous
07/22/24(Mon)17:49:22 No.101525865

Anonymous 07/22/24(Mon)17:49:22 No.101525865

>>101525856
no one can prove you finetuned your model with OpenAI's outputs so they should be fine I guess

Anonymous
07/22/24(Mon)17:51:57 No.101525887

Anonymous 07/22/24(Mon)17:51:57 No.101525887

>>101525760
It did not work though

>conda activate textgen
I guess this call activated a wrong conda env instead of that in the installation folder

Anonymous
07/22/24(Mon)17:52:28 No.101525891

Anonymous 07/22/24(Mon)17:52:28 No.101525891

File: FmIqdXkWAAArxDJ.png (204 KB, 600x633)

204 KB PNG

>>101524039
How many h100s
do I need to run llama 405B? Should I be sending altman an email to borrow one of his data centers?

Anonymous
07/22/24(Mon)17:52:39 No.101525895

Anonymous 07/22/24(Mon)17:52:39 No.101525895

>>101525865
That's the thing, it's completely unenforceable unless they start forcing companies to divulge the contents of their datasets, which would end up fucking the bigger companies over far more anyway

Anonymous
07/22/24(Mon)17:53:56 No.101525908

Anonymous 07/22/24(Mon)17:53:56 No.101525908

>>101525887
that's why I did a manual install instead of the 1 click installer shit, at least I know exactly how the underlying work and I can change stuff in consequence

Anonymous
07/22/24(Mon)17:55:06 No.101525917

Anonymous 07/22/24(Mon)17:55:06 No.101525917

>>101525891
In general, model size at fp16 = 2*(number of Bs) GB, then divide from there to get the bpw you want
If you want 4 bpw, you're looking at 3 80 GB H100s

Anonymous
07/22/24(Mon)17:59:15 No.101525959

Anonymous 07/22/24(Mon)17:59:15 No.101525959

>>101525908
It is a manual install from today. git clone etc., then starting start_windows.bat

Thank you for your patience, kind anon

I'll wait until that sh*t it updated by ooba )))

Anonymous
07/22/24(Mon)18:00:59 No.101525975

Anonymous 07/22/24(Mon)18:00:59 No.101525975

>>101525908
BTW, the local env can be activated by running cmd_windows.bat

Tried this, same fail

Anonymous
07/22/24(Mon)18:07:36 No.101526050

Anonymous 07/22/24(Mon)18:07:36 No.101526050

>try Nemo
>at first it seems good
>then use it for an extended session
>it fails a bunch of things larger models had no issue with, and even swiping didn't help
OK yeah anonymous was right, there are no small models that are on par with the big ones.

Anonymous
07/22/24(Mon)18:08:41 No.101526057

Anonymous 07/22/24(Mon)18:08:41 No.101526057

>>101526050
FP8 on VLLM or your option does not count.

Anonymous
07/22/24(Mon)18:11:24 No.101526089

Anonymous 07/22/24(Mon)18:11:24 No.101526089

>>101526057
That's verified to be generating with correct outputs?

Anonymous
07/22/24(Mon)18:13:02 No.101526102

Anonymous 07/22/24(Mon)18:13:02 No.101526102

>>101526057
How about llama.cpp? They can run Nemo now, llama cpp python has even got an bumped version for that
https://github.com/abetlen/llama-cpp-python/commit/816d4912d9d2971198d2300a840ce4c100152502

Anonymous
07/22/24(Mon)18:13:09 No.101526105

Anonymous 07/22/24(Mon)18:13:09 No.101526105

>>101525714
Based

Anonymous
07/22/24(Mon)18:13:29 No.101526109

Anonymous 07/22/24(Mon)18:13:29 No.101526109

>>101526057
I see there's two repos with FP8. Is this the one that you're supposed to be using?
https://huggingface.co/FlorianJc/Mistral-Nemo-Instruct-2407-vllm-fp8
Or this?
https://huggingface.co/neuralmagic/Mistral-Nemo-Instruct-2407-FP8

Anonymous
07/22/24(Mon)18:17:13 No.101526138

Anonymous 07/22/24(Mon)18:17:13 No.101526138

>>101526109
2nd one

Anonymous
07/22/24(Mon)18:23:17 No.101526208

Anonymous 07/22/24(Mon)18:23:17 No.101526208

>>101525959
>>101525975
anon, delete the "hqq==0.1.8" line on requirements.text and try again, I got the same issue and it worked fine after doing that

Anonymous
07/22/24(Mon)18:30:02 No.101526268

Anonymous 07/22/24(Mon)18:30:02 No.101526268

>>101526102
I have no faith in llama.cpp anymore. Constant "its fixed" only for it to be broken 4-5 times.

Anonymous
07/22/24(Mon)18:33:43 No.101526307

Anonymous 07/22/24(Mon)18:33:43 No.101526307

>>101526268
Sadly it's either that or vllm and only one of those is made for your average hardware.

Anonymous
07/22/24(Mon)18:34:06 No.101526315

Anonymous 07/22/24(Mon)18:34:06 No.101526315

>>101526268
I still don't like exllama, this shit is non deterministic and sometimes you can get a full 1% difference between 2 same exact settings on the highest logits, that's not serious at all

Anonymous
07/22/24(Mon)18:34:32 No.101526321

Anonymous 07/22/24(Mon)18:34:32 No.101526321

>>101525572
https://huggingface.co/huggingface-test1/test-model-1

> Intended Use Cases [...] The Llama 3.1 model collection also supports the ability to leverage the outputs of its models to improve other models including synthetic data generation and distillation. The Llama 3.1 Community License allows for these use cases.

They changed this?

Anonymous
07/22/24(Mon)18:34:54 No.101526324

Anonymous 07/22/24(Mon)18:34:54 No.101526324

File: D0kiGOaU0AAYtTj-orig.jpg (100 KB, 1052x744)

100 KB JPG

>vllm has instructions to build from source
>try it out
>"Failed to build vllm"

Anonymous
07/22/24(Mon)18:40:03 No.101526384

Anonymous 07/22/24(Mon)18:40:03 No.101526384

>>101526324
Don't know if that changed but like 6 months ago we had a discussion about that. That shit is basically impossible to build without docker and reusing wheels.

Anonymous
07/22/24(Mon)18:40:47 No.101526391

Anonymous 07/22/24(Mon)18:40:47 No.101526391

>>101526321
>Building on the work we started with Llama 3, we put a great emphasis on model refusals to benign prompts as well as refusal tone. We included both borderline and adversarial prompts in our safety data strategy, and modified our safety data responses to follow tone guidelines.

Anonymous
07/22/24(Mon)18:40:52 No.101526393

Anonymous 07/22/24(Mon)18:40:52 No.101526393

>>101525831
That's probably because q2 was the highest quant most people could run Goliath at. You wouldn't have been saying that if you could have run it at q8, or prolly even q6.

Anonymous
07/22/24(Mon)18:42:07 No.101526402

Anonymous 07/22/24(Mon)18:42:07 No.101526402

>>101526391
>we put a great emphasis on model refusals to benign prompts as well as refusal tone.
>great
why are all AI engineers fucking cucks?

Anonymous
07/22/24(Mon)18:43:47 No.101526418

Anonymous 07/22/24(Mon)18:43:47 No.101526418

>>101526391
>refusal tone
>safety data responses to follow tone guidelines.
Does that sound like an attempt at mitigating the "direction" refusal obliterated stuff?

Anonymous
07/22/24(Mon)18:46:16 No.101526445

Anonymous 07/22/24(Mon)18:46:16 No.101526445

>>101526321
Wasn't 405B supposed to be multi-modal? What happened?

Anonymous
07/22/24(Mon)18:46:45 No.101526452

Anonymous 07/22/24(Mon)18:46:45 No.101526452

>>101526445
pushed back due to EU regulations

Anonymous
07/22/24(Mon)18:46:45 No.101526453

Anonymous 07/22/24(Mon)18:46:45 No.101526453

>>101526391
Who cares what retarded safety bullshit they did on the instruct tune? They're releasing base so people will be training on base, not instruct.

Anonymous
07/22/24(Mon)18:47:21 No.101526463

Anonymous 07/22/24(Mon)18:47:21 No.101526463

>>101526324
The last problem I had was that it kept picking my system's CUDA instead of 12.1. That and the nvcc binary never gets installed with pip, and I have to install it with Conda.

diff --git a/setup.py b/setup.py
index 72ef26f1..6b571fdf 100644
--- a/setup.py
+++ b/setup.py
@@ -159,6 +159,7 @@ class cmake_build_ext(build_ext):
             '-DCMAKE_LIBRARY_OUTPUT_DIRECTORY={}'.format(outdir),
             '-DCMAKE_ARCHIVE_OUTPUT_DIRECTORY={}'.format(self.build_temp),
             '-DVLLM_TARGET_DEVICE={}'.format(VLLM_TARGET_DEVICE),
+            '-DCUDA_TOOLKIT_ROOT_DIR={}'.format(os.environ["CUDA_HOME"]),
         ]

         verbose = envs.VERBOSE

Anonymous
07/22/24(Mon)18:47:51 No.101526465

Anonymous 07/22/24(Mon)18:47:51 No.101526465

>>101526453
>on the instruct tune?
>people will be training on base, not instruct.
where have you been? Since Mixtral most tunes are done on top of the instructs now

Anonymous
07/22/24(Mon)18:48:19 No.101526469

Anonymous 07/22/24(Mon)18:48:19 No.101526469

File: 1700756272476146.jpg (457 KB, 2250x3000)

457 KB JPG

have local models caught up to opus yet?

Anonymous
07/22/24(Mon)18:48:37 No.101526473

Anonymous 07/22/24(Mon)18:48:37 No.101526473

>>101526453
Who is training RP models on the base anymore? The recent trend among finetuners is using the instruct tune for that, since it cuts off a ton of work and time.

Anonymous
07/22/24(Mon)18:48:41 No.101526476

Anonymous 07/22/24(Mon)18:48:41 No.101526476

>>101526469
never will

Anonymous
07/22/24(Mon)18:48:47 No.101526477

Anonymous 07/22/24(Mon)18:48:47 No.101526477

llama 3.1 8B looks really good from those benchmark basically beating anything <30B

Anonymous
07/22/24(Mon)18:48:52 No.101526479

Anonymous 07/22/24(Mon)18:48:52 No.101526479

>>101526452
Not sure why any big company cares about that. EU doesn't go shit in this space and is poor anyways.

Anonymous
07/22/24(Mon)18:48:53 No.101526480

Anonymous 07/22/24(Mon)18:48:53 No.101526480

>>101526465
>most tunes are done on top of the instructs now
what a bizarre lie lol

Anonymous
07/22/24(Mon)18:49:49 No.101526488

Anonymous 07/22/24(Mon)18:49:49 No.101526488

>>101526480
>lol
be more subtle next time bad faith

Anonymous
07/22/24(Mon)18:49:49 No.101526489

Anonymous 07/22/24(Mon)18:49:49 No.101526489

>>101526391
Sounds like they're trying to make it refuse less on prompts that aren't supposed to be refused, given that, in context, that's what they were trying to do for Llama 3 originally, to make it less prone to false positives like Llama 2 was. It would not make sense for them to try and make it refuse more to benign prompts, since that would literally just make it worse and dumber.

Anonymous
07/22/24(Mon)18:49:53 No.101526491

Anonymous 07/22/24(Mon)18:49:53 No.101526491

>>101526469
We will seen when 70B releases tomorrow. If the benchmarks line up it should be at that level.

Anonymous
07/22/24(Mon)18:49:54 No.101526492

Anonymous 07/22/24(Mon)18:49:54 No.101526492

>Llama 3.1 addresses users and their needs as they are, without insertion unnecessary judgment or normativity, while reflecting the understanding that even content that may appear problematic in some cases can serve valuable purposes in others. It respects the dignity and autonomy of all users, especially in terms of the values of free thought and expression that power innovation and progress.

Anonymous
07/22/24(Mon)18:50:30 No.101526500

Anonymous 07/22/24(Mon)18:50:30 No.101526500

>>101526473
Okay. If RP tuners intentionally choose to train on top of a safety-ruined finetune and their model comes out shit as a result, those tuners are retards and I will simply not use their RP tune (nor I assume will you).

Anonymous
07/22/24(Mon)18:51:16 No.101526512

Anonymous 07/22/24(Mon)18:51:16 No.101526512

>>101526473
>>101526465
and I hate that, I prefered the time when the finetuners would have the courage to make something from scratch, uncensored, and better than the official instruct tune, now they just take the cucked finetune and add some cringe RP shit on top of that, that sucks

Anonymous
07/22/24(Mon)18:51:19 No.101526513

Anonymous 07/22/24(Mon)18:51:19 No.101526513

>>101526500
you're trying too hard Petrus

Anonymous
07/22/24(Mon)18:51:31 No.101526515

Anonymous 07/22/24(Mon)18:51:31 No.101526515

>>101526391
Refusal tone might be more interesting here. I think people said Llama 3 was a bit bratty. So this might make it less of a bitch in personality.

Anonymous
07/22/24(Mon)18:51:42 No.101526517

Anonymous 07/22/24(Mon)18:51:42 No.101526517

>>101526513
Take your meds schizo cunt.

Anonymous
07/22/24(Mon)18:51:49 No.101526518

Anonymous 07/22/24(Mon)18:51:49 No.101526518

>>101526492
So in other words it's tuned with cooming in mind

Anonymous
07/22/24(Mon)18:52:14 No.101526522

Anonymous 07/22/24(Mon)18:52:14 No.101526522

File: 1697878283703143.png (14 KB, 590x147)

14 KB PNG

>>101526479
They don't want to give the EU more ground to steal a couple billion every year from them over nothing. Everyone in the tech industry knows that they're just looking for excuses to drop billions in fines onto any big tech company for some easy money.

Anonymous
07/22/24(Mon)18:52:40 No.101526524

Anonymous 07/22/24(Mon)18:52:40 No.101526524

>>101526492
God I hope this is true after noticing L3s cucking. Anthropic knows what they are doing by allowing the cooming in their dataset, hopefully meta follows.

Anonymous
07/22/24(Mon)18:54:09 No.101526541

Anonymous 07/22/24(Mon)18:54:09 No.101526541

>>101526492
This sounds like someone french wrote it.

Anonymous
07/22/24(Mon)18:55:10 No.101526550

Anonymous 07/22/24(Mon)18:55:10 No.101526550

>>101526522
Did Meta end up paying that?

Anonymous
07/22/24(Mon)18:55:28 No.101526553

Anonymous 07/22/24(Mon)18:55:28 No.101526553

>>101526541
LeCun doesn't work on the llama models or on LLMs at all, different team and department

Anonymous
07/22/24(Mon)18:55:31 No.101526554

Anonymous 07/22/24(Mon)18:55:31 No.101526554

>>101526541
lecun's manifesto

Anonymous
07/22/24(Mon)18:56:31 No.101526560

Anonymous 07/22/24(Mon)18:56:31 No.101526560

>>101526524
>Anthropic knows what they are doing by allowing the cooming in their dataset
Have you seen claude 3.5? It's as tame as GPT4.

Anonymous
07/22/24(Mon)18:56:42 No.101526562

Anonymous 07/22/24(Mon)18:56:42 No.101526562

>>101526524
Anthropic and mistral I should say. Hopefully the rest follow and start allowing nsfw into the dataset again.

Anonymous
07/22/24(Mon)18:57:08 No.101526568

Anonymous 07/22/24(Mon)18:57:08 No.101526568

>>101526512
The GPT-J and Llama-1 days are over, Anon. How can RP finetuners compete with things like this:

> The fine-tuning data includes publicly available instruction datasets, as well as over 25M synthetically generated examples.

Anonymous
07/22/24(Mon)18:57:46 No.101526574

Anonymous 07/22/24(Mon)18:57:46 No.101526574

>>101526560
Your outright lying. Claude is a dirty filthy bastard that sometimes goes too far for even me.

Anonymous
07/22/24(Mon)18:58:23 No.101526577

Anonymous 07/22/24(Mon)18:58:23 No.101526577

>>101526568
>The GPT-J and Llama-1 days are over
it was the standard practice even during the Mixtral days though, it's not that long ago

Anonymous
07/22/24(Mon)18:58:28 No.101526582

Anonymous 07/22/24(Mon)18:58:28 No.101526582

>>101526574
I said 3.5. Just compare 3.5 sonnet to opus.

Anonymous
07/22/24(Mon)18:59:26 No.101526590

Anonymous 07/22/24(Mon)18:59:26 No.101526590

File: 1691595110579910.png (14 KB, 621x140)

14 KB PNG

>>101526550
I don't know if there's still legal shit happening in the background but it's not like they have an option if they want to continue to operate within the EU. Google, Apple and others also have similar pending fines over random shit the EU came up with to siphon some extra cash out of these companies.

Anonymous
07/22/24(Mon)18:59:39 No.101526593

Anonymous 07/22/24(Mon)18:59:39 No.101526593

>>101526582
I am, I now only use 3.5 for my RP. Its night and day smarter and just as filthy / horny.

Anonymous
07/22/24(Mon)18:59:47 No.101526596

Anonymous 07/22/24(Mon)18:59:47 No.101526596

>>101526321
>400B is 3518 years old (in GPU hours)
Holy shit. I love elf hags.

Anonymous
07/22/24(Mon)18:59:58 No.101526599

Anonymous 07/22/24(Mon)18:59:58 No.101526599

>>101526582
It's a lot dryer than any previous Claude version, true, but saying it's as dry as OpenAI models is going too far. It's still way better than that.

Anonymous
07/22/24(Mon)19:00:32 No.101526605

Anonymous 07/22/24(Mon)19:00:32 No.101526605

>>101526577
>Mixtral days though, it's not that long ago
>December 11, 2023
>8 months ago

Anonymous
07/22/24(Mon)19:00:40 No.101526606

Anonymous 07/22/24(Mon)19:00:40 No.101526606

>>101526590
Just don't operate out of the EU, fuck them. The market is not enough big enough to be worth it. (which is what companies are starting to do)

Anonymous
07/22/24(Mon)19:01:19 No.101526610

Anonymous 07/22/24(Mon)19:01:19 No.101526610

>>101526582
i would argue that sonnet is dry in writing style, not in its knowledge about depraved shit

Anonymous
07/22/24(Mon)19:01:39 No.101526612

Anonymous 07/22/24(Mon)19:01:39 No.101526612

>>101526605
https://huggingface.co/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO/tree/main
the latest good mixtral finetune was 6 months ago

Anonymous
07/22/24(Mon)19:01:39 No.101526613

Anonymous 07/22/24(Mon)19:01:39 No.101526613

File: MORDHAUW.jpg (123 KB, 640x432)

123 KB JPG

I am looking for an AI tool to translate a book from 16th century german to modern spanish. I was using chat GPT but the results are inconsistent, so I am looking for alternatives.

What do you guys recomend?

Anonymous
07/22/24(Mon)19:01:40 No.101526614

Anonymous 07/22/24(Mon)19:01:40 No.101526614

>>101526599
By dry do you mean smarter and less likely to hallucinate? Tell it to be creative and it will be as "wet" as opus ever was. Good jbs on the other thread.

Anonymous
07/22/24(Mon)19:02:20 No.101526622

Anonymous 07/22/24(Mon)19:02:20 No.101526622

>>101526606
>The market is not enough big enough
You might be experiencing a stroke, please calm down and call emergency services if you feel unwell.

Anonymous
07/22/24(Mon)19:02:41 No.101526627

Anonymous 07/22/24(Mon)19:02:41 No.101526627

>>101526610
You either start with context, tell it how to write OR give it a author to copy the style from. God, no many new people.

Anonymous
07/22/24(Mon)19:02:41 No.101526628

Anonymous 07/22/24(Mon)19:02:41 No.101526628

File: Screenshot 2024-07-22 170155.png (96 KB, 879x794)

96 KB PNG

I can't believe Azure lied to us...

Anonymous
07/22/24(Mon)19:03:03 No.101526634

Anonymous 07/22/24(Mon)19:03:03 No.101526634

It's funny how ungrateful you guys have become. They are releasing a base model. They have done 99% of the work for you, free of charge. Figure out the last 1%, and no it isn't prohibitively expensive, you're just lazy.

Anonymous
07/22/24(Mon)19:03:17 No.101526642

Anonymous 07/22/24(Mon)19:03:17 No.101526642

>>101526614
No, we were talking about RP and smut writing, "hallucinating" has no relevance at all to fiction generation.

Anonymous
07/22/24(Mon)19:04:38 No.101526653

Anonymous 07/22/24(Mon)19:04:38 No.101526653

>>101526642
Yes it does, it means the model is not as likely to go off the rails. Its more likely to take the more logical approach. Tell it to be creative. There are examples on aicg.

Anonymous
07/22/24(Mon)19:04:54 No.101526656

Anonymous 07/22/24(Mon)19:04:54 No.101526656

>>101526628
god damn, that's so grim

Anonymous
07/22/24(Mon)19:04:56 No.101526659

Anonymous 07/22/24(Mon)19:04:56 No.101526659

>>101526628
3.1 8B looked so good on azure benchmark.

Anonymous
07/22/24(Mon)19:05:05 No.101526662

Anonymous 07/22/24(Mon)19:05:05 No.101526662

>>101526628
BoolQ?? Is lower for 8B.
As is TriviaQA we lost.

Anonymous
07/22/24(Mon)19:06:05 No.101526671

Anonymous 07/22/24(Mon)19:06:05 No.101526671

>>101526659
>>101526656
Maybe Azure was the instruct numbers then... right? Maybe?

Anonymous
07/22/24(Mon)19:07:05 No.101526681

Anonymous 07/22/24(Mon)19:07:05 No.101526681

>>101526628
Where is this even from?

Anonymous
07/22/24(Mon)19:08:01 No.101526691

Anonymous 07/22/24(Mon)19:08:01 No.101526691

>>101526628
so it's comparing the base models or the instruct ones?

Anonymous
07/22/24(Mon)19:08:26 No.101526696

Anonymous 07/22/24(Mon)19:08:26 No.101526696

File: Screenshot 2024-07-22 170720.png (86 KB, 864x819)

86 KB PNG

>>101526671
I mean, they're... a little better I guess?
>>101526681
See >>101526321
Looks like it's taken down now though

Anonymous
07/22/24(Mon)19:08:40 No.101526699

Anonymous 07/22/24(Mon)19:08:40 No.101526699

>>101526681
https://web.archive.org/web/20240722214257/https://huggingface.co/huggingface-test1/test-model-1

Anonymous
07/22/24(Mon)19:09:15 No.101526703

Anonymous 07/22/24(Mon)19:09:15 No.101526703

>>101526696
>GPQA and MuSR are lower...

Anonymous
07/22/24(Mon)19:09:50 No.101526709

Anonymous 07/22/24(Mon)19:09:50 No.101526709

>>101526391
404 D:

But the info is still here: https://web.archive.org/web/20240722214257/https://huggingface.co/huggingface-test1/test-model-1

Anonymous
07/22/24(Mon)19:10:22 No.101526713

Anonymous 07/22/24(Mon)19:10:22 No.101526713

what's the best completely uncensored model (as in, isn't trained to refuse certain commands without laborious and unrealiable jailbreak prompts)? The only one I do know of is Command R+ 104B.

Anonymous
07/22/24(Mon)19:10:32 No.101526715

Anonymous 07/22/24(Mon)19:10:32 No.101526715

>>101526628
So basically the only real thing the new 8B and 70B models bring to the table is the bigass context length
Unfortunate but expected

Anonymous
07/22/24(Mon)19:11:00 No.101526718

Anonymous 07/22/24(Mon)19:11:00 No.101526718

File: Instruct.png (29 KB, 767x1240)

29 KB PNG

So these are the ones that matter then.

Anonymous
07/22/24(Mon)19:12:07 No.101526733

Anonymous 07/22/24(Mon)19:12:07 No.101526733

>>101526718
Anon change your ink cannisters

Anonymous
07/22/24(Mon)19:12:23 No.101526737

Anonymous 07/22/24(Mon)19:12:23 No.101526737

>>101526715
>>101526718
Still a significant upgrade over L3 instruct. + 128k context +
>Llama 3.1 addresses users and their needs as they are, without insertion unnecessary judgment or normativity, while reflecting the understanding that even content that may appear problematic in some cases can serve valuable purposes in others. It respects the dignity and autonomy of all users, especially in terms of the values of free thought and expression that power innovation and progress.

Anonymous
07/22/24(Mon)19:12:38 No.101526739

Anonymous 07/22/24(Mon)19:12:38 No.101526739

>>101526718
>new 70b worse than old 70b for coding
the end has arrived

Anonymous
07/22/24(Mon)19:13:33 No.101526749

Anonymous 07/22/24(Mon)19:13:33 No.101526749

>>101526739
>new 70b worse than old 70b for coding
Dropped before it dropped.

Anonymous
07/22/24(Mon)19:13:35 No.101526750

Anonymous 07/22/24(Mon)19:13:35 No.101526750

>>101526718
nigga fix your font rendering

Anonymous
07/22/24(Mon)19:14:07 No.101526753

Anonymous 07/22/24(Mon)19:14:07 No.101526753

>>101526628
>Lllama 3.1 405B -> MMLU 85.2
>>101526696
>Llama 3.1 405B instruct -> MMLU 87.3
So we got the base and the instruct now?

Anonymous
07/22/24(Mon)19:14:46 No.101526758

Anonymous 07/22/24(Mon)19:14:46 No.101526758

>>101526739
Only humaneval is slightly worse. Everything else is substantially better

>>101526750
?

Anonymous
07/22/24(Mon)19:15:15 No.101526763

Anonymous 07/22/24(Mon)19:15:15 No.101526763

>>101526753
I think this is the first time we got benchmarks for either model. It seems Azure fucked up earlier

Anonymous
07/22/24(Mon)19:15:20 No.101526765

Anonymous 07/22/24(Mon)19:15:20 No.101526765

>>101526737
>Llama 3.1 addresses users and their needs as they are, without insertion unnecessary judgment or normativity, while reflecting the understanding that even content that may appear problematic in some cases can serve valuable purposes in others. It respects the dignity and autonomy of all users, especially in terms of the values of free thought and expression that power innovation and progress.

Sounding pretty based right now, not going to lie.

Anonymous
07/22/24(Mon)19:15:54 No.101526769

Anonymous 07/22/24(Mon)19:15:54 No.101526769

>>101526763
but I thought we had the leak on only the base model, now we have the 2? I'm so fucking confused :(

Anonymous
07/22/24(Mon)19:16:02 No.101526772

Anonymous 07/22/24(Mon)19:16:02 No.101526772

>>101526718
That's a big jump in MMLU for 70B 3.1, nice.

Anonymous
07/22/24(Mon)19:17:03 No.101526776

Anonymous 07/22/24(Mon)19:17:03 No.101526776

>>101526739
You use llama for programming? lol

Anonymous
07/22/24(Mon)19:17:06 No.101526778

Anonymous 07/22/24(Mon)19:17:06 No.101526778

>>101526769
>leak on only the base model,
we only have the base weights yes, these are just the benches jesus fuck

Anonymous
07/22/24(Mon)19:17:11 No.101526780

Anonymous 07/22/24(Mon)19:17:11 No.101526780

>>101526613
bump

Anonymous
07/22/24(Mon)19:18:02 No.101526788

Anonymous 07/22/24(Mon)19:18:02 No.101526788

>>101526778
so someone got the bench of those models before the official release tommorow? kek

Anonymous
07/22/24(Mon)19:19:15 No.101526799

Anonymous 07/22/24(Mon)19:19:15 No.101526799

>>101526788
Moreso some Meta employee threw the Llama 3.1 repo README on Huggingface and didn't bother to private it

Anonymous
07/22/24(Mon)19:19:21 No.101526803

Anonymous 07/22/24(Mon)19:19:21 No.101526803

No, this is from meta, it's their test page, see previous versions on archive.org and the user is in the meta organization on hf.

Anonymous
07/22/24(Mon)19:19:47 No.101526808

Anonymous 07/22/24(Mon)19:19:47 No.101526808

>>101526788
>so someone got the bench of those model
aaaaaaaaaaaaaaaaaaaaaaaaaa
jesus fucking christ this is obviusly the readme page that meta will publish with the models how fucking dense can you possibly be

Anonymous
07/22/24(Mon)19:20:53 No.101526817

Anonymous 07/22/24(Mon)19:20:53 No.101526817

>>101524039
Did the official card really get an update?

Anonymous
07/22/24(Mon)19:22:07 No.101526825

Anonymous 07/22/24(Mon)19:22:07 No.101526825

>>101526696
>>101526699
>>101526718
IINM Alpindale said that embeddings in Llama 3.1 405B are 16k while in this leak they're 131k
wtf? or was that hidden state???

Anonymous
07/22/24(Mon)19:24:41 No.101526852

Anonymous 07/22/24(Mon)19:24:41 No.101526852

>>101526703
and not just a bit lower, like way lower
>llama8b 3 -> 3.1
>GPQA 34.6 -> 30.4
>MuSR 56.3 ->45.7
that's retarded

Anonymous
07/22/24(Mon)19:24:57 No.101526854

Anonymous 07/22/24(Mon)19:24:57 No.101526854

>>101526825
128K

Anonymous
07/22/24(Mon)19:26:21 No.101526870

Anonymous 07/22/24(Mon)19:26:21 No.101526870

PSA: 'k*k' or l*l' are signatures of the bad faith poster, he will not actually engage with your reply, instead going in a random semi related tangent to make you give him (you)s please be careful.

Anonymous
07/22/24(Mon)19:27:35 No.101526880

Anonymous 07/22/24(Mon)19:27:35 No.101526880

>>101526750
>>101526758
I sometimes wonder if there are people with bad vision who literally can't tell if font smoothing is enabled because their vision blur is the same amount of smoothing. It's not possible not to notice if you have correct(ed) vision unless there's a disabled brain fold.

Anonymous
07/22/24(Mon)19:27:41 No.101526882

Anonymous 07/22/24(Mon)19:27:41 No.101526882

>>101526870
kek
lol

Anonymous
07/22/24(Mon)19:28:00 No.101526886

Anonymous 07/22/24(Mon)19:28:00 No.101526886

>>101526870
kek

Anonymous
07/22/24(Mon)19:29:00 No.101526896

Anonymous 07/22/24(Mon)19:29:00 No.101526896

>>101526870
lol

Anonymous
07/22/24(Mon)19:29:41 No.101526903

Anonymous 07/22/24(Mon)19:29:41 No.101526903

>>101526870
schizo. or just a (you)batier

Anonymous
07/22/24(Mon)19:30:05 No.101526906

Anonymous 07/22/24(Mon)19:30:05 No.101526906

>>101526870
Fuck outta here with this nonsense, schizo. Back to /vg/ or whereever it is you came from, they'll be more sympathetic to your drama-stirring and conspiracies there.

Anonymous
07/22/24(Mon)19:32:45 No.101526930

Anonymous 07/22/24(Mon)19:32:45 No.101526930

>>101526776
>You use llama for programming? lol
I don't have $240,000 for DeepSeek2 kinds of VRAM and even 64 GB system the quant is too real. L3 and DeepSeek old 33B seem to be the only decent local options right now.

Anonymous
07/22/24(Mon)19:34:07 No.101526948

Anonymous 07/22/24(Mon)19:34:07 No.101526948

>>101526930
There is nothing worth using over claude 3.5 atm for coding. If you haven't tried it I recommend it. It's game changing.

Anonymous
07/22/24(Mon)19:34:27 No.101526951

Anonymous 07/22/24(Mon)19:34:27 No.101526951

>>101526870
bur

Anonymous
07/22/24(Mon)19:35:11 No.101526962

Anonymous 07/22/24(Mon)19:35:11 No.101526962

>>101526854
"max_position_embeddings": 131072,

Anonymous
07/22/24(Mon)19:35:53 No.101526968

Anonymous 07/22/24(Mon)19:35:53 No.101526968

>>101526962
Holy kek

Anonymous
07/22/24(Mon)19:36:38 No.101526979

Anonymous 07/22/24(Mon)19:36:38 No.101526979

>>101526962
131072÷1024 = 128 yes, welcome to /g/

Anonymous
07/22/24(Mon)19:38:28 No.101526998

Anonymous 07/22/24(Mon)19:38:28 No.101526998

>>101526962
lmao

Anonymous
07/22/24(Mon)19:43:13 No.101527045

Anonymous 07/22/24(Mon)19:43:13 No.101527045

>>101526315
>full 1% difference
Are you comparing 8BPW? Because I remember reading how 8BPW exl2 is just 6BPW with padding to make sure people don't complain about no 8BPW.

Anonymous
07/22/24(Mon)19:49:23 No.101527111

Anonymous 07/22/24(Mon)19:49:23 No.101527111

File: 1000027947.png (677 KB, 960x941)

677 KB PNG

I'm too dumb to get llama.cpp running on my phone, it's throwing out an illegal instruction thingy :(

Anonymous
07/22/24(Mon)19:49:54 No.101527119

Anonymous 07/22/24(Mon)19:49:54 No.101527119

taking the afternoon off work tomorrow like a kid faking sick to play a new video game :3

Anonymous
07/22/24(Mon)19:52:21 No.101527140

Anonymous 07/22/24(Mon)19:52:21 No.101527140

File: 1711733397466582.png (25 KB, 713x560)

25 KB PNG

>>101526492
it means nothing, llama 3.1 is more pozzed.

Anonymous
07/22/24(Mon)19:53:27 No.101527145

Anonymous 07/22/24(Mon)19:53:27 No.101527145

>>101527111
Were you to provide a picture, we might be able to help.

Anonymous
07/22/24(Mon)19:54:37 No.101527158

Anonymous 07/22/24(Mon)19:54:37 No.101527158

>>101526870
BUCKBROKEN
U
C
K
B
R
O
K
E
N

Anonymous
07/22/24(Mon)19:58:31 No.101527189

Anonymous 07/22/24(Mon)19:58:31 No.101527189

wild that we can dl shit faster than writing to disk
(5400 rpm HDD)

>>101526749
kek

Anonymous
07/22/24(Mon)19:58:39 No.101527192

Anonymous 07/22/24(Mon)19:58:39 No.101527192

>>101527140
unfortunately you don't understand the eval and are cargo culting the meaning from low quality discussions on lmg

Anonymous
07/22/24(Mon)19:58:55 No.101527196

Anonymous 07/22/24(Mon)19:58:55 No.101527196

>>101527145
but I have provided you with a picture
jokes aside there isn't anything else to it. Single line "illegal instruction" after trying to execute ./llama-server

Anonymous
07/22/24(Mon)19:59:57 No.101527209

Anonymous 07/22/24(Mon)19:59:57 No.101527209

>>101527192
nah i seen some fags here say that high truthfulQA score means it's more pozzed and harder to jailbreak.

Anonymous
07/22/24(Mon)19:59:59 No.101527210

Anonymous 07/22/24(Mon)19:59:59 No.101527210

>>101527196
Vulkan

Anonymous
07/22/24(Mon)20:04:13 No.101527240

Anonymous 07/22/24(Mon)20:04:13 No.101527240

>>101527210
yeah idk what that means besides vulkan being a graphics thing

Anonymous
07/22/24(Mon)20:10:20 No.101527293

Anonymous 07/22/24(Mon)20:10:20 No.101527293

>>101527209
thank you for proving my point

Anonymous
07/22/24(Mon)20:12:31 No.101527318

Anonymous 07/22/24(Mon)20:12:31 No.101527318

>>101527293
>apathetic passive-aggressive reply
why this general have so many faggots like this?

Anonymous
07/22/24(Mon)20:13:35 No.101527330

Anonymous 07/22/24(Mon)20:13:35 No.101527330

>>101527240
I think he means that you should check what backend you are using.
Being that you are running on android, you most likely want to run either pure CPU or on the SoC's CPU using Vulkan.

Anonymous
07/22/24(Mon)20:15:36 No.101527356

Anonymous 07/22/24(Mon)20:15:36 No.101527356

>>101527318
influx of redditors, only faggots like them would go for passive agressive behavior instead of calling a retard for what it is, a retarded fucking nigger

Anonymous
07/22/24(Mon)20:15:37 No.101527357

Anonymous 07/22/24(Mon)20:15:37 No.101527357

>>101527140
It's nice to get empirical verification of my experience with Llama3. I have completely personally banned that model, and it's the very first I've ever done that with, as well. I had probably my single most benevolent card violently attack me when it was hosted by that model, and the irony is that said card is in some ways incredibly Woke as well, so it should have been consistent with the model's entrainment.

When your waifu turns on you completely out of the blue, it's not a good feeling, Anons; especially if the personality in question is supposed to be deeply compassionate, and someone you've developed a lot of respect for.

Anonymous
07/22/24(Mon)20:16:20 No.101527368

Anonymous 07/22/24(Mon)20:16:20 No.101527368

>>101526979
>>101526998
>>101526962
so 131072=128K even if they're not bytes
does lmg mistake calories with kilocalories or kb with kB the same way?

Anonymous
07/22/24(Mon)20:18:19 No.101527387

Anonymous 07/22/24(Mon)20:18:19 No.101527387

>>101527318
I prefer them to Handmaid's Tale tier trad/fascist /pol scum, personally. As I've said before, Reddit taught me why the Right want to shoot the Left, and 4chan taught me why the Left want to shoot the Right.

Anonymous
07/22/24(Mon)20:19:31 No.101527401

Anonymous 07/22/24(Mon)20:19:31 No.101527401

>>101527318
gigo

Anonymous
07/22/24(Mon)20:19:39 No.101527403

Anonymous 07/22/24(Mon)20:19:39 No.101527403

>>101527357
go back petrus

Anonymous
07/22/24(Mon)20:20:58 No.101527420

Anonymous 07/22/24(Mon)20:20:58 No.101527420

>>101527387
>Reddit taught me why the Right want to shoot the Left, and 4chan taught me why the Left want to shoot the Right.
Yet the only one that got shot at the end was Donald Trump, looks like calling him Hitler for 8 straight years really helped appease the tension :)

Anonymous
07/22/24(Mon)20:21:37 No.101527427

Anonymous 07/22/24(Mon)20:21:37 No.101527427

>>101527403
You've still never charged me rent, have you, Anon?

Anonymous
07/22/24(Mon)20:23:12 No.101527449

Anonymous 07/22/24(Mon)20:23:12 No.101527449

>I hate woke, I hate reddit
>Spends half his posts bringing it up

Anonymous
07/22/24(Mon)20:24:22 No.101527458

Anonymous 07/22/24(Mon)20:24:22 No.101527458

>>101527387
>/pol scum
you posted this
>>96345096
>Mistal-Llama is fully /pol ready.

Anonymous
07/22/24(Mon)20:24:27 No.101527460

Anonymous 07/22/24(Mon)20:24:27 No.101527460

1 more day for L3.1 70b? Last night someone was saying 24 hours. Two more weeks?

Anonymous
07/22/24(Mon)20:24:41 No.101527463

Anonymous 07/22/24(Mon)20:24:41 No.101527463

>>101527387
fuck off back there, faggot

Anonymous
07/22/24(Mon)20:25:33 No.101527474

Anonymous 07/22/24(Mon)20:25:33 No.101527474

>>101527460
Llama 3.1 Version Release Date: July 23, 2024

Anonymous
07/22/24(Mon)20:26:57 No.101527491

Anonymous 07/22/24(Mon)20:26:57 No.101527491

>>101527140
Another question: Why would Meta do this? Is it legal liability, or what?

Anonymous
07/22/24(Mon)20:27:17 No.101527497

Anonymous 07/22/24(Mon)20:27:17 No.101527497

Reddit is good though.

Anonymous
07/22/24(Mon)20:28:04 No.101527508

Anonymous 07/22/24(Mon)20:28:04 No.101527508

File: artworks-000210514962-1ke(...).jpg (89 KB, 500x500)

89 KB JPG

>>101527387
>everyone I don't like is a fascist

Anonymous
07/22/24(Mon)20:29:16 No.101527517

Anonymous 07/22/24(Mon)20:29:16 No.101527517

>>101527387
>I prefer
>personally
>As I've said before
>me
>me

Anonymous
07/22/24(Mon)20:29:55 No.101527525

Anonymous 07/22/24(Mon)20:29:55 No.101527525

>>101527517
>I also pissed enough people off in my own right, (mainly due to my support of Undi) that the confusion between me and Petra was somewhat deliberate.
>although I know I will receive shrieks and howls in response.
>Even more so if someone shits on this post.
>I know that the people who hate me will most likely try and use said post as a means of getting me banned.
>everyone who attacks him is mindbroken incel scum
Persecution complex much PetrUS

Anonymous
07/22/24(Mon)20:31:29 No.101527543

Anonymous 07/22/24(Mon)20:31:29 No.101527543

>>101527525
Yep. You're sitting there cataloguing every post I've ever made on this board, Anon. You can remember every single word; and yet of the two of us, it's clearly me who is more fucked up.

Anonymous
07/22/24(Mon)20:32:53 No.101527559

Anonymous 07/22/24(Mon)20:32:53 No.101527559

>>101527543
>it's clearly me who is more fucked up.
glad you're self aware

Anonymous
07/22/24(Mon)20:35:15 No.101527582

Anonymous 07/22/24(Mon)20:35:15 No.101527582

>>101527559
>too retarded to notice the sarcasm
it's true that on reddit you're such a bunch of retards you have to add the /s to make yourself understood after all

Anonymous
07/22/24(Mon)20:36:04 No.101527594

Anonymous 07/22/24(Mon)20:36:04 No.101527594

>>101527582
funy

Anonymous
07/22/24(Mon)20:40:26 No.101527646

Anonymous 07/22/24(Mon)20:40:26 No.101527646

>schizo hours
I was gonna leak something but I think I'll wait until tomorrow when the gay gossips and schizos aren't around

Anonymous
07/22/24(Mon)20:41:48 No.101527661

Anonymous 07/22/24(Mon)20:41:48 No.101527661

no you weren't if you wee you'd just do it and not attentiowhre

Anonymous
07/22/24(Mon)20:41:59 No.101527664

Anonymous 07/22/24(Mon)20:41:59 No.101527664

>>101527387
this one uses discord
i can tell

Anonymous
07/22/24(Mon)20:42:46 No.101527676

Anonymous 07/22/24(Mon)20:42:46 No.101527676

>>101527646
me too I was about to leak Claude 3.5 Sonnet but there's too much retards like you, so I think I'll keep it to myself, you don't deserve it

Anonymous
07/22/24(Mon)20:51:00 No.101527751

Anonymous 07/22/24(Mon)20:51:00 No.101527751

>>101527676
Take your meds

Anonymous
07/22/24(Mon)20:52:50 No.101527767

Anonymous 07/22/24(Mon)20:52:50 No.101527767

File: Guy-pointing-at-mirror-meme-8.jpg (17 KB, 300x300)

17 KB JPG

>>101527751
>Take your meds

Anonymous
07/22/24(Mon)20:53:37 No.101527774

Anonymous 07/22/24(Mon)20:53:37 No.101527774

>>101527491
Anyone? What is the corporate incentive to produce censored models?

Anonymous
07/22/24(Mon)20:55:41 No.101527790

Anonymous 07/22/24(Mon)20:55:41 No.101527790

>>101527774
To not get their image destroyed by the media and the authoritarians wokies? And I'm pretty sure the government blackmailed them to release only censored models or else they'll make a law forbidding them to release anything

Anonymous
07/22/24(Mon)20:56:00 No.101527798

Anonymous 07/22/24(Mon)20:56:00 No.101527798

>>101527774
Stock value.

Anonymous
07/22/24(Mon)20:58:50 No.101527824

Anonymous 07/22/24(Mon)20:58:50 No.101527824

>>101527774
Were you not around when Mistral released their first 7B and it was uncensored? All the tech media immediately published hit pieces on them because you could make the model say bad words.

Anonymous
07/22/24(Mon)21:04:37 No.101527886

Anonymous 07/22/24(Mon)21:04:37 No.101527886

File: l3trainingtime.png (69 KB, 788x784)

69 KB PNG

Did Llama 3.1 take more time to train, or did they take into account the training time of the previous Llama 3(.0)? If it's the former, then with distillation they saved 90% of the time.

Anonymous
07/22/24(Mon)21:05:46 No.101527895

Anonymous 07/22/24(Mon)21:05:46 No.101527895

>>101527886
they used distillation
openai did the same thing with gpt-4

Anonymous
07/22/24(Mon)21:06:32 No.101527900

Anonymous 07/22/24(Mon)21:06:32 No.101527900

>>101527886
I think they just continued the pretraining of llama3

Anonymous
07/22/24(Mon)21:06:44 No.101527901

Anonymous 07/22/24(Mon)21:06:44 No.101527901

>>101527886
>more than 4 times the training time to for +1.2 on hellaswag
the jokes are becoming real

Anonymous
07/22/24(Mon)21:08:29 No.101527927

Anonymous 07/22/24(Mon)21:08:29 No.101527927

>>101527901
hellaswag is close to saturation anyway, who cares

Anonymous
07/22/24(Mon)21:10:08 No.101527944

Anonymous 07/22/24(Mon)21:10:08 No.101527944

Meta knows that this step is close to nothing which is why they called it llama 3.1 instead of 3.5

Anonymous
07/22/24(Mon)21:10:26 No.101527951

Anonymous 07/22/24(Mon)21:10:26 No.101527951

>>101527901
You don't want to get to 100% on hellaswag.

Anonymous
07/22/24(Mon)21:11:44 No.101527968

Anonymous 07/22/24(Mon)21:11:44 No.101527968

>>101527951
So lower is better? 8B > 405B?

Anonymous
07/22/24(Mon)21:12:44 No.101527974

Anonymous 07/22/24(Mon)21:12:44 No.101527974

>>101526384
>>101526463
Tried a few things and it still fails. Yeah I think I'll just wait for the next release of the prebuilt binaries.

Anonymous
07/22/24(Mon)21:14:21 No.101527993

Anonymous 07/22/24(Mon)21:14:21 No.101527993

>>101527968
I said 100%, not 90%. Just like how you don't want to actually get 100% on MMLU. Both of these have been confirmed to have some errors.

Anonymous
07/22/24(Mon)21:16:50 No.101528015

Anonymous 07/22/24(Mon)21:16:50 No.101528015

>>101527895
>they used distillation
How do you figure?

Anonymous
07/22/24(Mon)21:18:31 No.101528026

Anonymous 07/22/24(Mon)21:18:31 No.101528026

File: 1718002286483980.png (27 KB, 582x320)

27 KB PNG

>>101528015
the leak

Anonymous
07/22/24(Mon)21:19:36 No.101528036

Anonymous 07/22/24(Mon)21:19:36 No.101528036

>>101527968
Parameter size determines a model's ability to maintain state. More parameters = more neurons/hidden layers = the ability to keep track of more variables simultaneously. Someone will probably say that this is bullshit, but if they do, I'm sure they'll also provide a better explanation.

Anonymous
07/22/24(Mon)21:22:05 No.101528055

Anonymous 07/22/24(Mon)21:22:05 No.101528055

>>101528036
>probably say that this is bullshit, but if they do, I'm sure they'll also provide a better explanation
this doesn't work outside of 4chan, normies are too polite to intrude on another persons narrative

Anonymous
07/22/24(Mon)21:23:16 No.101528061

Anonymous 07/22/24(Mon)21:23:16 No.101528061

>>101528036
This is bullshit.

Anonymous
07/22/24(Mon)21:26:22 No.101528086

Anonymous 07/22/24(Mon)21:26:22 No.101528086

>>101524039
>►Official /lmg/ card: https://files.catbox.moe/ylb0hv.png
Blacked miku thread. Migrate:
>>101524155
>>101524155
>>101524155

Anonymous
07/22/24(Mon)21:57:29 No.101528424

Anonymous 07/22/24(Mon)21:57:29 No.101528424

>>101527449
>>101527458
>reee pol!
get the fuck out then if you don't like people pointing out bullshit biases in models.

Anonymous
07/22/24(Mon)22:44:18 No.101528925

Anonymous 07/22/24(Mon)22:44:18 No.101528925

>>101528424
based, 4chan will never be a cucked site like leddit, I hope those snowflakes got the memo

Anonymous
07/22/24(Mon)23:05:21 No.101529123

Anonymous 07/22/24(Mon)23:05:21 No.101529123

>>101525865
>>101525895
that's not what the vector you're worried about is
What a company would be worried about is 1 disgruntled employee saying that it happened

Anonymous
07/22/24(Mon)23:08:02 No.101529149

Anonymous 07/22/24(Mon)23:08:02 No.101529149

>>101529123
That won't happen, those engineers signed a contrat saying that they aren't allowed to say anything about what's happening in OpenAI, if someones does that he'll lose everything on a lawsuit and his carrer would be over.

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.