/g/ - Technology

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101553102 & >>101546566

►News
>(07/24) Mistral Large 2 123B released: https://hf.co/mistralai/Mistral-Large-Instruct-2407
>(07/23) Llama 3.1 officially released: https://ai.meta.com/blog/meta-llama-3-1/
>(07/22) llamanon leaks 405B base model: https://files.catbox.moe/d88djr.torrent >>101516633
>(07/18) Improved DeepSeek-V2-Chat 236B: https://hf.co/deepseek-ai/DeepSeek-V2-Chat-0628
>(07/18) Mistral NeMo 12B base & instruct with 128k context: https://mistral.ai/news/mistral-nemo/

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: threadrecap.png (1.48 MB, 1536x1536)
►Recent Highlights from the Previous Thread: >>101553102

--Large language model comparison of programming language performance: >>101553305 >>101553666
--Machine learning model benchmark results comparison table: >>101553857
--Mistral-Large is good at lewd and NSFW content, Bitnet discussion: >>101553838 >>101553901 >>101553922 >>101554028 >>101554071 >>101554148 >>101555092 >>101555152 >>101555206
--Llama model performance and its impact on the AI landscape: >>101553176 >>101553387 >>101553483 >>101553315
--Improving Model Safety Behavior with Rule-Based Rewards: >>101553675 >>101553805
--Hugging Face model link: >>101554912
--Running large AI models locally and hardware requirements: >>101554561 >>101554616 >>101554857 >>101554957 >>101554993 >>101555199
--Running Mistral Large 2 with different GPUs: >>101553665
--Mistral license, pricing, and availability discussion: >>101554107 >>101554130 >>101554142 >>101554455 >>101554461
--Mistral Large for smut and open-source licensing confusion: >>101555445 >>101555463 >>101555472 >>101555501 >>101555509 >>101555529 >>101555474
--Logs: Mistral Large disappointment and reroll plans: >>101556295
--Llama.cpp naming convention changes: >>101555081 >>101555170
--GPT-3 and VNTL model performance in Japanese-English translation: >>101556133
--Comparison of AI models and prompting techniques: >>101556100
--Anon asks for help downloading gated models from huggingface: >>101554400 >>101554454
--Miku (free space): >>101553607 >>101554400 >>101554557 >>101555825 >>101555861 >>101555913

►Recent Highlight Posts from the Previous Thread: >>101553112
>>
BREAKING
>BREAKING
BREAKING
>BREAKING

Llama 3.1 flopped
>>
>>101556980
Why is Mistral Large so good?
>>
>>101557005
They didn't filter their pretraining data.
>>
>>101556989
>>101557009
What model
>>
>>101557016
Mistral Large 2 (2407)
>>
I love my beautiful Eve <3
>>
>>101557018
q5_K_M
>>
>>101557018
>>101557031
>>
Cohere are on it.
>>
The glowies are making their list and checking it twice.
>>
>>101557049
no thats illegal in china

two more years
>>
SillyTavern CSS so you don't have to tell people which models you're using
/* Change timestamp to model name */
.timestamp {
  font-size: 0; /* hide the original timestamp text */
}

.timestamp::before {
  content: attr(title); /* the title attribute holds the model name */
  font-size: calc(var(--mainFontSize) * 0.8);
}
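(this goes in User Settings -> Custom CSS; it assumes the timestamp element carries the model name in its title attribute, which is what attr(title) reads)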
>>
It still thinks someone can be their own sister/brother though. Sad.
>>101557044
Some of it has to do with the prompt template I use, which tends to push models into an uncontrollable spiral of sloppy try-hard prose. That's part of the reason it's used for the testing.
>>
>>101557016
>>101557018
>>101557033
I wonder if it'll pass after being quanted down to my tier.
Gimme dat iMat.IQ2_XXXSS action.
>>
File: x (2).png (1.46 MB, 1024x1024)
HOLY SHIT
Mistral is literally uncensored. It can translate hardcore rough ntr bestiality rape smut from chinese to english BUT ACTUALLY GOOD. Doesnt read forced, it is just good, just "translate" and it does. Lmao, fuck google, fuck meta, fuck openai and fuck anthropic
>>
>>101556983
Why does this recap feel so lazy, are you fine recap anon?
>>
File: sovl1.png (26 KB, 711x113)
were we ever satisfied with llms? post some of your favorite sovl moments
>>
>>101557168
Prove.
>>
>>101557166
There's "Model Icons" you can enable, but showing full models might ruin immersion.
>>
>>101556983
>>101557178
How does recap anon stay up 24/7? Or is it a bot? What model is used for summarization? Is it fed the entire chat as context?
>>
>>101557178
recap anon needs some head rubs and a kiss on the forehead
>>
Holy shit you guys.
This was the best Zhongli, dicks out for harambe, test result I've ever gotten from a model.
>>
>>101557202
afaik it's a bot but recap anon reviews it before posting
>>
>>101557228
how does he stay up 24/7?
>>
>>101557168
You can't say that and not at least share the source.
>>
>>101557195
>showing full models might ruin immersion
would it ruin immersion any more than a timestamp would? i feel like the timestamp is much worse, especially if you're not roleplaying literally in the present
and you can always turn it off just like timestamps too
>>
>>101557237
word on the street is, he does it for free.
>>
So…

L3.1 405B > mistral large 2 > L3.1 70B > Gemma 27B > L3.1 8B

For each size?
>>
>>101557240
its because he made it up
>>
>>101557249
Large 2 is on par with 405B
>>
>>101557237
There are 4 Recap Anons working together.
>>
File: file.png (628 KB, 768x768)
2 more weeks
>>
>>101556993
Bump
>>
>>101557267
no
>>
>>101556980
Has anyone tried fine-tuning one of those open models from Mistral? How hard and expensive would it be? I thought about preparing my own dataset on certain topics to finetune one of their models to my needs. Do I need to prepare that kind of set with questions and expected answers, or can I just train it on a huge pile of text instead? I am very new to the topic of LLMs in general, so apologies for my lack of knowledge.
>>
File: 1673811045996029.png (269 KB, 1000x800)
>>101557202
>>101557237
picrel
>>101557178
>>101557203
I've had to resort to using a smaller model to keep up with the amount of posting. Please bear with me.
>>
Have never paid a dime or talked to an AI I don't run locally. It's been rough because I suck shit at python, git and being a nerd in general. Through sheer retarded effort I have gotten to a point where I am pretty satisfied with my local output.

Then I fucked up. I put 5 dollars in the paypig machine and talked to Claude. Then I asked him to help me rewrite a fictional character I've been working on. I'm ruined bros. Like a pretty white girl dropped into Pakistan, I am fucking devastated.

If you're like I was. Don't paypig. Not even once. You're better off not knowing.
>>
>>101557249
Mistral large 2 > L3.1 70B > Gemma 27B > Mistral NeMo 12B = Gemma 9B > L3.1 8B
>>
>>101557301
>expensive
millions of bucks
>>
Nemo-12B or L3.1-8B?
I don't see how 8B would win, and Nemo is mostly uncucked.
Haven't tried L3.1-8B though.
Whats the consensus so far?
>>
File: miku-hand-out+.jpg (236 KB, 584x1024)
>>101557237
The source of his energy is the power of his Goddess.

https://www.youtube.com/watch?v=CXhqDfar8sQ
>>
while you were there complaining and being a useless little faggot, ollama guy fixed llama 3.1
not ggerganov, not llama.cpp cuda dev, not slaren. ollama guy fixed it.
https://github.com/ggerganov/llama.cpp/pull/8676/
>>
>>101557330
nemo got mogged hard by 3.1 why'd you think they panik released it just before?
>>
>>101557334
kino, but you don't have to be a mean little nigger though. personally i have no dog in this fight and hope everyone (except undster) does their best.
>>
>>101557326
Swap the positions of 27B and 12B and you're correct.
>>
>>101557334
Damn, he was forced to move a finger. That's a power move by the llama.cpp devs.
>>
>>101557362
>except undster
>>97223983
>For the record, I completely and unequivocally support Undi and his creation of new model hybrids, and think that everyone who attacks him is mindbroken incel scum, who may or may not be employed by OpenAI to do so.
>everyone who attacks him is mindbroken incel scum
>>
>>101557331
Take my hand, Miku. I'll pull you through!
>>
>>101557374
It's his redemption arc for not putting llama.cpp in the readme
>>
>>101557379
jesus is that the level of bait this general is operating at these days?
good thing i only lurk when major happenings occur.
>>
>lmg thread
>all of the posts are from humans
Nuke this shit already
>>
>>101557330
Nemo is leagues smarter than 8B at storywriting at least, it's not even close. I think the people claiming otherwise just haven't tried it and are shitposting.
>>
>>101557394
>level of bait
>>97062246
>I'm not Petra. Petra's an amateur. I'm something considerably worse.
>I'm also the point of origin for the practice of the above being added to sysprompts; as well as the 2, 5, 10, 12, and 60 times tables, which enable bots to answer arithmetic questions, when everyone previously said that they never could, and laughed at me for trying.
>>
The 106B~150B range seems to be the ideal for performance. No idea why Zucc keeps gimping himself by skipping this segment and either going for tiny 70b or too big 405b
>>
>>101557409
>Q6_K
your brain is gguf quantized be quiet computelet
>>
File: 88206.gif (747 KB, 192x192)
>>101557415
>I'm the Schizo Futa Anon

what in the god damn
>>
how are you guys trying out nemo if koboldcpp hasnt been updated yet?
https://github.com/LostRuins/koboldcpp/issues/1011
i want to try it too
>>
>still no q4_K_M of 405B
>>
>>101557317
Maybe it's just the fact that I started out with a mix of Character.AI and Poe before getting local, but I have no problem with viewing different models as existing for different purposes. Gippity4 is for code help, political analysis, and as a general Jarvis bot, while I still use local for my Chun Li card and periodic futa degeneracy.

I even still weave in a little Character.AI from time to time, because although it's a pale shadow of its former self, I still have a few cards there that are hard to let go of completely. Yes, the new interface sucks rocks, but with a sufficiently well written card, as long as you're not using it for coom, CharAI isn't completely useless, anyway.
>>
>>101557441
frankenstein build
but it might be shit, tried it, pretty broken.
https://github.com/Nexesenex/kobold.cpp/releases
>>
>>101557441
>how are you guys trying out nemo if koboldcpp hasnt been updated yet?
>experimental branch
>llama.cpp itself
>vllm
idk a true mystery
>>
>>101557441
By using llama.cpp.
>>
>>101557427
Shut you you IQ2_Migger
>>
>>101557441
I'm using it in ooba, it works fine.
I know a lot of people here don't like ooba for some reason, but pretending you don't remember that it exists is weird.
>>
>>101557473
>you you
rep/rope broke?
>>
MistralAI fags are such gigachads, they managed to get a model as good as L3-405b with a model almost 4 times lighter (123b)
>>
100B Is All You Need?
>>
>>101557394
It's a few months old; they insist on dredging it up, constantly. You've also got to love the fact that on the one hand, they keep telling me to go back to R#ddit, but on the other, they also keep digging up my old material, broadcasting it on the board, and therefore providing a multiple course buffet for my ego. Their understanding of psychology is as pathetic as everything else they attempt.
>>
>>101557484
Meta are true chads, they got a model 60% as good as 405B at 50x smaller size..
>>
Has anyone run any tests comparing mistral large and 405b llama?
>>
>>101557441
ooba uses llama cpp with tokenizer fix for nemo
koboldcpp is actually slow af now for pushing updates...
>>
>>101557484
They also created Nemo which is at least 95% of Large while being 10% of the size and so the optimal choice for anyone who isn't retarded
>>
>>101557505
Stop trying to make 8B happen. It's not going to happen.
>>
>>101557503
>multiple course buffet for my ego.
glad you agree you're a shitposter petra, now go bak
>>
File: livebench-2024-07-24.png (1.06 MB, 3142x1814)
>Mistral Large 2 was added
>Llama 3.1 70B disappeared
What went wrong?
>>
>>101557528
l3.1 8b as well, bet they edited an older result when they only had 405...
>>
>>101557508
Large is worse than 405B at pretty much anything, but Large has more sovl.
>>
>>101557528
>large infinitely worse than opus, sonnet, gpt4o
glad to see this meme model die before it took off
>>
>>101557560
>200B
are you trolling?
the human brain has ~1000000B
>>
>>101557560
Damn this is so weird lol, I guess Water will be the natural enemy of videogen models for a long time.
>>
>>101557554
what do you mean? it's the best model to use if you're not a millionaire with 15x3090 gpu's or something, and it has way more sovl than the cucked llama series
>>
>>101557560
These videos are so gross.
I don't understand how """"""people"""""" can enjoy looking at them.
>>
Nemo has repetition problems, no?
>>
>>101557441
llama.cpp via llama-server. It works natively with Silly too.
>>
>>101557594
instead of spending $3k to run this shit at q4 you can buy literal years worth of claude sonnet 3.5 tokens
>>
>>101557604
Does it remember prefixes if you have a large prompt and regenerate?
>>
>>101557598
ye
ye
ye
>>
>>101557573
i was wrong, apparently the human brain only has ~86 billion neurons
but neurons aren't exactly equivalent to parameters since they can perform some basic logic iirc
either way, transformers models are relatively inefficient compared to our brains so like the other anon said, 100B is probably all you need
>>
>>101557614
claude 3.5 is too cucked you can't do everything with it
>>
How can hugging.chat serve all these big models for free?
>>
>>101557620
Prefixes?
It doesn't re-process the whole context if that's what you are asking.
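if you're hitting llama-server's native /completion endpoint directly, there's an explicit flag for it. a minimal sketch; the prompt and port are placeholders:

import requests

# ask the server to keep the evaluated prefix in the KV cache across requests
r = requests.post(
    "http://127.0.0.1:8080/completion",
    json={
        "prompt": "the story so far...",  # placeholder prompt
        "n_predict": 64,
        "cache_prompt": True,  # reuse the cached prefix instead of re-processing it
    },
)
print(r.json()["content"])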
>>
>>101557631
VC cash
>>
>>101557631
vc money
>>
>>101557631
Investor money, aka. pyramid scheme.
>>
>>101557414
Trying right now and it's not capable of following complex instructions like Gemma 2 27B. If you want something formatted differently than the usual book-style RP, it will fuck it up very often.
>>
>>101557631
honeypot
>>
>>101557631
all me
>>
>>101557631
By using a magic zero-bit quantization.
>>
>>101557637
>>101557638
How does Viet Cong have any money, and why?
>>
>>101557636
Nice, I thought the llama.cpp server was way behind and didn't have basic features like that. I'll try ST later today with llamafile for fun.
>>
File: param_columns2.png (60 KB, 2550x3300)
>>101557623
anon...
synapses are parameters, not neurons, each neuron has ~7000-10000 synapses depending on age
>>
>>101557649
>it will fuck it up very often.
Yeah Gemma can't follow RP Markdown format.
>>
>>101557631
I'm letting them use some cards in my private rig to host that service. Be thankful.
>>
>>101557675
how come i'm so retarded then?
>>
>>101557649
That's a prompt issue. Especially local with that shitty instruct template in SillyTavern.
>>
>>101557690
bad training data
>>
>>101557672
Lol that feet came directly from horror movies.
>>
>>101557669
Just use llama.cpp instead of another fork that might not be updated.
>>
>>101557690
>how come i'm so retarded then?
transformers architecture is way better than our brain architecture?
>>
>>101557675
>synapses are parameters, not neurons
ive never heard this comparison made

>each neuron has ~7000-10000 synapses
this sounds a lot more analogous to a relationship between weights(neurons) than the parameters themselves
>>
>>101557690
poor education, excessive consumption of coom, most human interaction involves posting on a forum where everyone calls each other "Anon"
>>
>>101557690
bad training data/ training stopped prematurely
>>
you guys are so mean..
>>
Are quants of new mistral anywhere? I can only find some empty hf repos
>>
>>101557714
>another fork that might not be updated
akshully, jartfile is much faster than chudcpp because i/k quant guy works in collaboration with Jartine
>>
>>101557637
>>101557638
This isn't true, surprisingly. The CEO posted recently that HF is profitable. I was shocked, like you I assumed they were just burning investor cash.
>>
>>101557720
>transformers architecture is way better than our brain architecture?
it's not though, transformers requires a ridiculously larger amount of data (and I think electricity too but i'm not sure) to be run. we don't need to consume the entire internet to be smart enough to know how many r's are in strawberry
>>
>>101557726
>excessive consumption of coom
as if the guys on the silicon valley aren't giant coomers...
>>
How do I run Large at home for cheap and with at least 20 T/s?
>>
>>101557747
How? Where is that money coming from? What are they selling?
>>
>>101557745
*humps you*
>>
>>101557744
https://huggingface.co/legraphista/Mistral-Large-Instruct-2407-IMat-GGUF/tree/main
>>
>>101557690
Overtraining on goon data.

>>101557747
Maybe he just lied to get even more vc.
>>
>>101557751
>it's not though, transformers requires a ridiculously larger amount of data
we see a shit ton of data as well with our eyes and ears anon. imagine it's 60 fps, multiply that by your age, and you get an astronomical amount of data, way higher than what the model got in the first place
>>
>>101557762
You download Nemo and pretend it's Large
>>
>>101557747
found the post where he says it
https://twitter.com/ClementDelangue/status/1811675386368966682

very explicitly says that they make a profit and aren't burning VC money, which I think would be illegal for a CEO to lie about
>>
>>101557771
thanks a lot anon
>>
*sharts*
>>
>>101557771
>parts in their own folders
based
>>
>>101557792
how the fuck do they make money though
>>
>>101557805
Yeah I don't know either man, lol, all I know is he says they are
>>
>>101557792
>>101557805
>>101557827
isn't huggingface owned by microsoft though?
>>
File: comp.jpg (96 KB, 1280x720)
>>101557722
>ive never heard this comparison made
this is literally how they were invented, they looked at how biological neurons work and created a simplified mathematical model where artificial neurons are the biological neurons and the connections between them (synapses in a biological brain) are the parameters in artificial neural networks.
>>
Have any of yous guys used Meta's Chameleon model?

The one they released in May https://arxiv.org/abs/2405.09818#
>>
>>101557792
i remember when chatgpt came out and news articles were talking about how openai was losing millions in a short time, and now huggingface is hosting even larger models. I guess some like NVIDIA might pay for the hosting themselves?
>>
what exactly causes repetition related issues? Even at the start of an RP? i've never had this issue and now im suddenly having it. wtf.
>>
>>101557788
>it's way higher than what the model got in the first place
it's not, I calculated it out of curiosity a few months ago. I don't remember the exact number but the model training would be ~100k human years if I remember correctly. In any case it was way bigger than a human lifespan
>>
>>101557697
I can say that Llama 3.1 8B also fails in the same way (if not worse), but 70B gets it immediately. Gemma 2 9B is also definitely not as capable as the 27B version in consistently following relatively complex output formatting (dialogue without tags + interspersed inner monologue + short-form narration with asterisks), but it's on par with or slightly better than Nemo 12B.
>>
>>101557771
Is it broken in any way? Is it better to wait for upstream llama.cpp fixes?
>>
>>101557850
It was always there but you just didn't notice it.
>>
>>101557893
what exactly causes repetition related issues? Even at the start of an RP? i've never had this issue and now im suddenly having it. wtf.
>>
>>101555266
>>101555182
if you can't tell this is a man then your detector needs to be replaced
https://www.youtube.com/watch?app=desktop&v=-mRi-B3t6fA&t=430
>>
>>101557850
Show what you mean. Repetition is not one thing; people mean different things by it. If you're talking about run-on sentences, that's repetition penalty set too high: the model stops using words like 'a' and 'the' and cannot finish a sentence. If it's repeating sentence structure, then don't be too pushy with your writing instructions. It just picks up the pattern from the context and follows it. It's the one thing they're good at.
>>
>>101557899
It was always there but you just didn't notice it.
>>
>>101557901
Stop obsessing about it, petra.
>>
>>101557878
let's say 25 years * 60 fps * 150kb (average size of a 1024x1024 picture) = 16.13 TB
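quick sanity check on that multiplication, taking the same inputs at face value (one 150kb frame, 60 of them per second, for 25 years):

SECONDS_PER_YEAR = 365.25 * 24 * 3600    # ~31.6 million seconds
frames = 25 * SECONDS_PER_YEAR * 60      # 25 years at 60 fps
total_bytes = frames * 150_000           # 150 kB per frame
print(total_bytes / 1e15)                # ~7.1, i.e. ~7.1 PB

so whatever the right figure is, those inputs don't give 16.13 TB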
>>
>>101557904
In my case, It's an entire prompt verbatim even over up to 10 swipes, or copying the structure of two paragraphs yet making the rest of the gen original enough.
>>101557913
In my case, It's an entire prompt verbatim even over up to 10 swipes, or copying the structure of two paragraphs yet making the rest of the gen original enough.
>>
>>101557690
hit the books (training data) and become your own expert
>>
I'm getting refusals from Mistral Large, what am I doing wrong? It's an incest story, both characters adults
>>
>>101557934
this... this is not how it works at all anon. You can't just put random arbitrary numbers there and call it a day.
>>
>>101557938
I think you are in a unique situation where you could ask yourself why it is doing that. But if that fails there is also an option of asking yourself why it is doing that.
>>
>>101557938
Relax the writing rules, then. I'm sure you can remove half your prompt without losing anything important. Also, give it stuff to work with. If you follow the same pattern in your writing you cannot expect the llm to be better than you.
>>
What the fuck are those consolidated weights in the Mistral Large repo?
>>
>>101557976
how is that arbitrary? 25 years is the age when our brain is fully developed, 60 fps is kinda the framerate where we don't see much difference if we go further, and I was being nice with 150kb because that's for a jpeg and our eyes have much more quality than that
>>
>>101557986
>>101557990
The only thing i changed (which i did to try Nemo) was the instruct and context templates, but i switched them back to what i was using before.
at that point i started fucking with settings like a retard (because again wanted to try Nemo) and that doesnt really change much, just creativity.
>>
>>101558001
>our eyes have much more quality than that
>glasses anons...
>>
>>101557990
>If you follow the same pattern in your writing you cannot expect the llm to be better than you.
How long until we can stop playing with dolls? I am a 30 year old virgin here and I shouldn't be doing that I think.
>>
>>101558013
?
>>
>>101558005
nemo is super repetitive, at least on gguf, i know for sure
>>
>>101557849
no
those articles have no idea what they're talking about half the time, i read one that suggested OpenAI is spending billions of dollars a day on ChatGPT
the reality is that they're making an absolute killing because inference is dirt cheap
>>
>>101558026
yeah nemo was absolutely broken which is why i switched back
though I sort of remember an issue like this in the past where issues with one model carried over to another, and i have NO clue how that was fixed. besides maybe trying to reboot my system but im doing shit right now so that isnt happening.
>>
>>101556980
I'm wondering how sonnet 3.5 compares to llama 405b in c# coding.
>>
>>101558023
my eyes are a shit, like jpeg quality 25% or worse
>>
>>101558005
If you're using nemo, lower the temp to 0.3 and move it up as you want more 'creativity'. If you came back from nemo to another model, adjust it accordingly. Show your prompt, show your settings, show your model.
>>
File: 1701754483694888.png (38 KB, 449x741)
We've been scammed.
>>
>>101557992
When I tried Nemo with the official API, it complained when they weren't there.
>>
>>101558047
oh nonononono
>>
>>101558036
oh, yeah same I have myopia that's why I'm wearing glasses, that doesn't change my argument, your brain sees quality pictures if you wear glasses
>>
>>101558043
neutralized settings (1 temp, sometimes 0, doesnt seem to matter), kunoichi-lemon-royale-v2-32K-7B-Q5_K_M and Meta-Llama-3.1-8B-Instruct-Q5_K_M.
It's funny, turning on dynamic temp seems to adjust and almost break the repetition, but that's dangerous because it sometimes spits out garbage.
>>
Mistral large is better for programming than wizardlm8x22b? Is there a gguf that will fit in 96gb?
>>
>>101558016
"Garbage in, garbage out" works in context as well. Make it "entertaining" for the model and it will keep you entertained as well. I hope for a future where all the "ah ah, mistress" proompters are told by the llm to fuck off.
>>
>>101558047
>32k context
that's kinda good no?
>>
large2stral is everything I ever wanted in a local model
thank you based arthur... thank you...!
>>
>>101557899
Instruct training. It introduces the GPTslop behavioral pattern to the model.
"What is the capital of Britain?"
"I see that you are asking me the capital of Britain. To find out the capital of Britain we can simply look at what the capital of Britain is. The Capital of Britain is London."
The reason the training data is formatted like this is to cause the model to set up its own breadcrumb trail to keep it from veering off topic. And these patterns translate over to RP.
That's why it iterates over all the shit in the card
>And then he runs his SLENDER fingers through his JET BLACK LOCKS
just grabbing phrases and shit out of the card and puking them back out.
Finetuning on RP prompts doesn't help either because all of those contain models running on slopped GPT endpoints doing this exact same behavior.
What really is needed is a hand crafted RP instruct dataset to tune a base model on.
>>
>>101558057
>that doesn't change my argument, your brain sees quality pictures if you wear glasses
sure, was just funny reading that as I literally need to lean in to my desk to read (late and glasses off...)
>>
File: 1718091427088942.png (24 KB, 700x265)
>>101558073
It's subpar and they advertise it as 128k
>>
>>101558073
not for coding
>>
>>101558087
if they advertise it as 128k you can probably just use it at 128k
the value in the config doesn't limit you or anything iirc
>>
>>101558099
but it'll rope then
>>
VRAMlet talk, anything better than gemma 2 in all these new models?
>>
>>101558107
uhhh no it shouldn't
don't use a backend that does that
>>
>>101558001
>25 years is the age when our brain is fully developed
that's not even close to being true
>60 fps is kinda the framerate where we don't see much difference if we go further
this is completely wrong as visual perception doesn't work like a camera, so putting any framerate here is wrong from the start
>and I was being nice with 150kb because that's for a jpeg and our eyes have much more quality than that
the same objection as before. also, if you check how much information is going through the optic nerve you would be surprised how small it is. Most of human vision is just a brain "hallucination". Only a very small part of our field of vision is actually sharp; the rest the brain calculates from blurred images, thanks to saccade movements.
>>
>>101558112
anything
>>
>>101558084
hmmmmm

then that means i probably fucked up and still picked the wrong instruct format, thinking i did it right
whoops. ill try a different instruct after im finished generating these 10 images of realistic Rouge the Bat's pussy in SD.
>>
>>101558116
lcpp auto ropes based on the config iirc, if config says 32 and you set 128 it'll rope
>>
>>101558064
>1 temp, sometimes 0, doesnt seem to matter
It should matter. If you get deterministic replies with swipes at temp 1 something is fucked in your setup.
>kunoichi-lemon-royale-v2
Merge. Discard it. At most, take a finetune. Any will do. I won't recommend any.
>Meta-Llama-3.1-8B-Instruct-Q5_K_M
I assume you're using the latest version of whatever you use. If not, update. If you still get the same output at temp 1 it's a bug. You should report it.
>>
>>101558134
just change the config then
>>
>>101558134
lcpp also lets you just manually specify the rope base so you can avoid that
>>
>>101558149
that's illegal
>>
>>101558145
>Merge. Discard it. At most, take a finetune. Any will do. I won't recommend any.
i'll have you know that mememerge absolutely BTFO'd any of the recommendations in this thread, especially CR. But i do notice 3.1 even base instruct with my problems is just slightly better so thatll be my main.
ill download a newer version and see what happens.
>>
>>101558119
>this is completely wrong as visual perception doesn't work like camera, so putting any framerates here is wrong from the start
you can approximate though, because you have to make some calculations, so framerate it is, and 60 seems to be a good spot.

>the same objection as before, also if you check how much information is going through optic nerve you would be surprised how small it is. Most of human visions is just a brain "hallucination". Only very small part of our field of vision is actually sharp, the rest brain is calculating from blurred images and thanks to saccade movements.
That doesn't really matter, even if the brain sees hallucinations, it's a high quality hallucination. the simple fact we can differentiate a 1024x1024 picture and a 4k picture means that our brain probably sees in the range of 4k

I never said we were computers and shit, but if a computer had to live like us, those are the approximations it would get, 60 fps and 4k
>>
thanks for the goon bros, now i'll clean myself up, lift some weights then watch anime
>>
Comparing weights and biases on a digital neural network to biological brains is stupid. You fuckers never learn.
>>
>>101558182
training data on the thread bots isn't updated in real time anon
>>
>>101558182
uhh, but sam altman and all the other experts said that we'll have agi that can replace humans within the decade
>>
>>101558182
forming analogies is an act of higher cognition. You're literally seething at other people not being an NPC.
>>
https://huggingface.co/cognitivecomputations/dolphin-2.9.3-mistral-nemo-12b-gguf

What's the word on this, kids? Worth the download?
>>
>>101558204
MODS
>>
>>101558208
>dolphin
>>
>>101558163
>i'll have you know that mememerge absolutely BTFO'd any of the recommendations in this thread, especially CR.
Dude. It's a 7B. I like me some small models, but i'd never claim a 7b is better than CR.
>>
>>101558216
dolphin indeed!
>>
>>101558124
Got an opinion that's not retarded hyperbole?
>>
File: jerry laugh.gif (3.64 MB, 374x274)
>>101558217
Have you been paying attention to the threads the past 24 hours?
>>
>>101558208
Hello Petrus.
>>101558232
>>
>>101558195
>uhh, but sam altman and all the other experts said that we'll have agi that can replace humans within the decade
Most of those experts that claimed that did it 3 decades ago. Two more weeks, i suppose.
>>
why the fuck is every Mistral Large 2 benchmark about coding performance. literally don't care
>>
>>101557528
>mistral 7b is better than command r plus
>it's also better than the new mistral nemo 13b
>llama 400 better than fucking opus
what the fuck is this list mate? and I thought arena was bad don't post this shit ever again
>>
>>101558241
only productive use of llms
>>
>>101557771
>No IQ quants
meh.
>>
>>101558207
Analogies are for the layman as an intro to a subject. They're unnecessary for people that understand the concepts.
>See? open notepad.txt and it's like a real notebook
>How do i change pages?
>Are you an NPC??!?!?!?!?!?!?!?!
>>
>>101558241
because coding is probably the most important thing that makes OpenAI relevant. Meta wants to kill that company, so it wants people to get free access to a good coding model and not rely on OpenAI anymore for work, and I can understand them, it's a huge security issue to give your data and code to a closed company like OpenAI in the first place
>>
File: robot.png (187 KB, 1218x3513)
eh large 2 is okay, so funnily melodramatic with my old prompts.
>>
>>101558248
Mistral Small is not a 7B model.
>>
>>101558207
>forming analogies is an act of higher cognition.
thanks anon :3
>>
>>101558273
>Meta
but he said mistral
?
>>
llamafile update: it does work with ST. I used this command:
>./Meta-Llama-3.1-8B-Instruct.Q4_K_S.llamafile --server -ngl 20
Then connected with the "Chat Completion", "Custom (OpenAI-compatible)" choice. I used this as the URL:
>http://127.0.0.1:8080/v1
Is that the best way? That was very easy to set up, as llamafile is literally 1 file for both the model and the server.
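if you want to sanity-check the endpoint before pointing ST at it, a minimal smoke test, assuming llamafile exposes the same OpenAI-compatible route as llama.cpp's server (which the /v1 URL suggests):

import requests

# hypothetical smoke test against the local llamafile server
r = requests.post(
    "http://127.0.0.1:8080/v1/chat/completions",
    json={
        "model": "local",  # placeholder; the server mostly ignores this field
        "messages": [{"role": "user", "content": "Say hi in one word."}],
        "max_tokens": 8,
    },
)
print(r.json()["choices"][0]["message"]["content"])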
>>
>>101558172
based, but for me it's weights, then anime, then goon
>>
>>101558291
that's the same thing, Meta, Mistral, Qwen, they focus on coding because companies simply want to work with a local model and not give their private data to Sam Altman
>>
>>101558294
>That was very easy to set up, as llamafile is literally 1 file for both the model and the server.
buy ad with mozilla money jartine
>>
>>101558294
Fuck off, Jart
>>
>>101558294
How do you know that model isn't a virus?
>>
>>101558084
This isn't correct. All LLMs have this behavior, it's what is generally called "ICL" (In context learning).
>>
>>101558319
I unironically trust jart
>>
>>101558326
base
>>
>>101558204
Kek this is like a balloon with limbs
>>
>>101558319
Why would Mozilla (a real actual company) distribute malware?
>>
>>101558294
I would rather download that random koboldcpp executable than anything that jart touched.
>>
CerealBENCH update
>Claude3.5 Sonnet
>LLaMA3.1-405B
>GPT4o
>Qwen2-72b
>Mistral-Large2
>Claude Opus
>LLama3.1-70b
>Qwen1.5-72B
>llama3-8b
>LLama3-70b
>Command-R+
>Claude Haiku
>LLama2-70b
>llama3.1-8b
>Mixtral8x22B
>Yi-34B
>Mixtral8x7B
will keep you updated
>>
LLAMACPP CRASHED MY MACBOOK
FUCK THIS POS SOFTWARE
>>
>>101558376
That's how you get a virus
>>
>>101558241
Because AI companies have completely given up on getting normal people interested in LLMs and are pivoting to just making them into tools for computer programmers.
>>
>>101558282
holy shit
>>
>>101558171
Anon, no.
I don't have the time to explain the whole process of visual perception to you but you can't make these approximations because they don't make any sense. You have no idea what you are talking about, I'm telling you this as someone who studied neurobiology. I had a nice textbook about neurophysiology of vision, I can look for it and give you the title and author when I find it, if you want. It should clear some misconceptions you have.
>>
>>101558379
>MACBOOK
>>
>>101558379
ollama sir
>>
>>101558364
Why would a man try to convince everyone he isn't one?
>>
>>101558282
now that's some claude soul
>>
Fixed my repetition issue, seemingly, at least at the start of erp's. it was the instruct settings + i forgot to save which settings preset i was using, now it's business as usual. Whoops.

>>101558379
looks like you have to buy another macbook sar :^)
>>
File: 0ob1ytni7r9b1.png (244 KB, 454x512)
>>101558378
anon what the fuck is this
>>
>>101558364
No one but Jart is working on llamafile, even if she uses Mozilla's name. And making models be executables is kinda retarded.
>>
>>101558402
That girl is a better programmer than you will ever be
>>
>>101558393
>I'm telling you this as someone who studied neurobiology
lol you wasted your time for knowledge that anon is just gonna completely ignore, nerd.
>>
>>101558419
It's a great way of having a self contained model that you will still be able to run in the long term. Having more than 1 file is bloat.
>>
>>101558393
You don't seem to understand, I never said we live like computers. my message was that if one day a computer were to live like us for 25 years (like seeing things and shit), the data it would get would be 25 years * 60 fps * 4k pictures, you understand now?
>>
>>101558422
>That girl
Objectively false
>is a better programmer than you will ever be
Yet to be proven.
>>
>>101558182
>comparing the mathematical model of human brain to human brain is stupid
/lmg/ brainrot never ceases to amaze me
>>
>>101558438
noooo but it not like us tho, it had sensors n shiet you cannot compare!
>>
>>101558442
How many stars do you have on GitHub? Do you have 17,000 stars? Yeah I thought not.
>>
>>101558357
yeah the prompt was

>In a sunny backyard, a beautiful little Russian girl lies on her side, her legs elegantly bent and spread. Clad in a cheerful pink two-piece swimsuit, her exposed stomach shines as she smiles, her golden hair flowing around her in the warm breeze.

but obviously it didn't do the two-piece swimsuit part and instead focused on the "stomach" token

kling is unironically better at making tods than any other age group..

>>101558423
im just happy neurobiology anon roasted the ESL underage """accelerationist""" for talking about things he doesnt understand and for making me second guess myself when I know I'm smarter than a stinky poo like him
>>
>>101558436
The model was already a single file.
>>
>>101558446
but you can quantify what a computer is seeing based on the camera, sensors and shit, that's the point
>>
>>101558282
that is pretty cool, I remember when people were like local models never ever. idiots.
>>
>https://x.com/OpenAIDevs/status/1815836887631946015
>Customize GPT-4o mini for your application with fine-tuning. Available today to tier 4 and 5 users, we plan to gradually expand access to all tiers. First 2M training tokens a day are free, through Sept 23.
local lost.
>>
>>101558458
>00001-of-00011.gguf
>>
>>101558448
At least you don't dispute the first fact. Good for you. You're making progress.
I don't care for keeping a reputation here.
>>
>>101558481
old news retard
>>
>>101558484
>Files which exceed the Hugging Face 50GB upload limit have a .catX extension. You need to use the cat command locally to turn them back into a single file, using the same order.
https://huggingface.co/jartine/gemma-2-27b-it-llamafile#about-upload-limits
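so presumably something like this, filenames hypothetical, keeping the numeric order:
>cat model.llamafile.cat0 model.llamafile.cat1 > model.llamafile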
>>
>>101558495
You're probably a redditor with more estrogen than jart herself
>>
>>101558484
>you now have 11 llamafile executables
>each of them a different virus
>>
>>101558497
local still lost, despite the fact that it is old news.
>>
>>101558518
That dude is packing. He could swing his enormous cock across your face and knock you out for a week.
>>
>>101558208
Does anyone else have an opinion of this model?
>>
>>101558545
sorry petrus, people are burnt out on dolphin/gptslop, claudeslop is the trend now
>>
>Waaaaa... the cuda code too big. i cannot make me llamafile!!! I need to further quant the models into oblivion because windows has a file size limit for executables, waaaaaaaaaaaaaaaaaaaa.
>>
>>101558208
dolphin is dead, I'm not touching that shit since airoboros-13b-gpt4-1.4. that's when he trained his shit with gpt4 march 2023, the most sovlful gpt4 model we ever had, and it was smart as well. only C3.5 sonnet gave me the same feeling again
>>
>>101558561
Why does llamafile make you so insecure?
>>
>>101558563
dolphin and airoboros are not the same though?
>>
>>101558047
This is probably just an updated version of the instruct finetune applied to previous Mistral Large, which also had 32k tokens context. Nemo is a newer model.

https://mistral.ai/news/mistral-large/
>>
>>101558423
>lol you wasted your time for knowledge that anon is just gonna compeltely ignore, nerd.
it may be a surprise but I didn't study it for anons
>>101558438
And I'm telling you this is not comparable because you are using arbitrary numbers. You assume that your (60fps * 4k) would be the case for computers but if it's not necessary for humans to learn then it also doesn't have to be for computers. The amount of data needed to process images would be immensely smaller if computer scientists figured out how to model what our visual cortex is doing. It's like approximating what speed you can achieve at different paces of pedaling on a bike while ignoring the fact you can just drive a car and go way faster.
>>
>>101558561
calm down cuda dev
>>
>>101558545
>Dolphin is licensed according to apache 2.0 license. We grant permission for any use, including commercial. Dolphin was trained on data generated from GPT4, among other models.
>>
File: 4zd8o6h9qxn51.png (1.67 MB, 2560x1440)
>>101558510
Feline bros just keep winning
>>
>>101558571
Certain ideas are just bad. There is absolutely 0 point in packaging a ridiculously big datafile in the executable. Other than that, it seems to work well for cpu.
>>
>>101558585
>You assume that your (60fps * 4k) would be the case for computers but if it's not necessary for humans to learn
come on man, you think we could live in a 10fps*256px world? that shit would make me dizzy, there's a reason we feel comfortable at a 60 fps * 4k setting, because that's really close to what we see in real life, don't be obtuse like that, please
>>
Isn't it bullshit to call them 3.1 if they're not related to the original llama version 3 models at all?
>>
>>101558586
He wouldn't engage in this drivel. And if he did, he'd do it more eloquently than me.
>>
>>101558609
>if they're not related to the original llama version 3 models at all?
they are tho? same arch except for context
>>
>>101558609 (me)
my name is petra, btw
>>
>>101558614
nah, message is too short, not pretentious enough
>verdict: NOT PETRUS
>>
>>101558613
Same architecture, sure, but they're not continued pretrains of the original models. They're distillations of 405B.
>>
>>101558627
I thought they used training data from 405B on top of the original models, not full distillation?
>>
>>101558608
you should be slapped for being so retarded and narcissistic
don't reproduce
>>
>>101558637
what a projection, kill yourself nigger
>>
>>101558627
was gonna say this
>>101558636
they probably just genned some synth data from 405 to train on top
>>
>>101558600
I see the point. I was using Dolphin Mixtral again yesterday, and before I remembered that I could switch to the Mistral tokenizer to enable logit bias, I was getting swamped by a tidal wave of diversity and inclusion. Still, I'm going to download Nemo now, and we'll see how it goes. I'll try some nice, safe, politically neutral code questions. Maybe a Pong game.
>>
>>101558645
i'm not the one arguing about stuff i literally don't know anything about you double nigger
>>
>>101558208
for erp try this one
https://huggingface.co/BeaverAI/mistral-doryV2-12b
>>
File: 1702273417650502.webm (3.47 MB, 808x682)
what's this llama shit i just learned about 5 minutes ago?
>>
>>101558653
And I'm not the one talking about biology when the topic is about how close of a setting a computer should get to replicate our point of view, you 2 digit IQ retard
>>
>>101558645
>>101558653
So remind me again, guys. Why do I get more hate for baiting/shitting up threads, than the people who spam this sort of shit everywhere?
>>
>>101558668
a whole lot of disappointment
>>
>>101558608
>come on man, you think we could live in a 10fps*256px world?
This is exactly what I'm trying to tell you. Seriously, check how our vision is working, this is much better approximation if you want to compare (what can't be compared) at all. Most of vision processing is being made from incomplete and fuzzy visual data. The way that the small amount of data and shitty pictures captured by our eyes becoming clear and sharp pictures while going through layers of visual cortex is quite mindblowing when you learn about it for a first time.
>>
>>101558657
>made by the one that was screeching that limarp and all models with it should be banned
https://huggingface.co/BeaverAI/mistral-doryV2-12b/commits/main
>>100828064
>>100828083
>>
>>101558672
>moving the goalposts
more like stealing the goalposts because you are the blackest gorilla nigger that has ever lived

>>101558674
you can kill yourself too
>>
>>101558674
because you're a pretentious holier-than-thou with a victim complex, as demonstrated by this very post
>>
>>101558694
go fuck yourself faggot, you know you are wrong at the end of the day, trying to sound smart while talking about irrelevant shit that has nothing to do with the topic in question, get bent nigger
>>
>>101558026
I have the same issue with exl2
>>
>>101558657
>>101555363
>>101555391
>>
>>101558719
Sheeeit
>>
I don't wanna FIND NEMO i wanna FIND DORY and i think EVERY RED BLOODED AMERICAN CAN AGREE!
>>
>>101558672
you are arguing with two different anons if you didn't catch that by the way
>>
dory more like boring
>>
>>101558769
astounding pun
>>
>>101558705
I don't deny this, but the problem is I enjoy it.
>>
>>101558769
im finding this pun to be funny
>>
>>101558775
tokenizer issue pls understand
>>
What do we do now?
>>
File: 1721862754.png (589 KB, 1246x1416)
>>
>>101558793
die of blood clots from sitting too long
>>
>>101558800
Which model?
>>
>>101558800
posting logs without model, sampler, and prompt info should be a capital offense
>>
>>101558793
Dunno bout you, but I'm still testing Nemo out, with two fine tunes queued for testing.
>>
>>101558793
if vram_gb >= 72:
    run_mistral_large_2()
elif vram_gb <= 24:
    run_mistral_nemo()
elif boring_dry == True:
    run_gemma_27B()
>>
>>101558827
>Nemo out, with two fine tunes queued for testing.
if dory id reconsideer
see
>>101558722
>>
>>101558819
I think the only real reason why they do it, is because they get off on the sense that they have something which other people don't.
>>
currently running Mistral Large q4_M in all of its 0.8t/s glory
comparing results to Nemo, i don't think the slight increase in quality makes up for a 50 times slower gen speed
>>
>>101557904
It's repeating the first words from the previous sentences and the following sentence structure. Gemma 27b doesn't do that with the same prompt
>>
>>101558852
nemo?
>>
>>101558848
If your scenario is simple enough that you don't need a smart model, then nemo is the best balance imo. Soul and smart enough for 99% of rp / writing stuff.
>>
>>101558834
Those comments were mine, in fact.
I like to try the official tune, then a fine tune, then back to the official tune to see if I was doing anything wrong.
Rinse and repeat.
>>
>>101558856
yes
>>
>>101558852
Show your prompt and settings. If you're too lazy to even give enough information for people to help you, go use gemma or whatever model works for you.
>>
>>101558868
known issue, it's over
had it too, low temp, high temp, no rep pen, some rep pen, {{random}} schizo inject, it'd still loop
>>
>>101558812
>>101558819
gemma2-9b-sppo-iter3-q8_0
config from anon >>101545047
:3
>>
>>101558880
backend, quant, format, settings? I have not had that problem myself but I use vllm which I know most dont.
>>
>>101557301
finetuning largestral will cost something to the tune of $1k-$10k depending on how big the dataset is. Maybe more. You will need to rent at least a couple of a100/h100 for lora. Don't even think about full finetune.
For the small models, it's much more manageable. You can do it at home if you have 3090s
>>
dbrx2 when
>>
>>101558909
2 days after grok 2
>>
>>101558898
lcpp/kcpp q8, 0.2-1.1 temp, 1.0 (disabled) to 1.1 rep pen. but some anon said exl2 looped also, above or in the last thread.
>>
>>101558921
Speaking of, Elon took his shiny new 100k H100 cluster online yesterday and started training right away.
>>
File: 1718911930703958.png (23 KB, 777x305)
>>101558899
4bit Qlora isn't that bad. You can finetune 8x22b with just 96GB VRAM and that's a bigger model than the new Mistral-Large
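for reference, a minimal QLoRA sketch with transformers + peft; the model id and hyperparameters are illustrative, not a tested recipe for 8x22b:

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# load the base model quantized to 4-bit NF4 via bitsandbytes
bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.3",  # stand-in; swap in your base model
    quantization_config=bnb,
    device_map="auto",
)

# attach low-rank adapters; only these small matrices get trained
lora = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # prints the tiny trainable fraction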
>>
>>101558927 me
>0.2-1.1 temp
not dynatemp btw tried a few in between
>>
>>101558852
a} Disable all other samplers except static temperature.

b} Don't set temperature higher than 0.4 to start.

c} If it's still doing it, the problem is likely either your card or prompt format.

https://files.catbox.moe/ot5sj3.png

For cards, consider using a few shot format like the one in the above card, rather than W++. The word duplication in the description is intentional; it's a name:value format which explicitly specifies the relations between terms.
>>
>>101558898
I use vllm with the neuralsomething fp8 weights from huggingface, set add bos true add eos false, currently temp 0.4, repetition penalty didn't seem to do much so 1.0, top_p 0.9
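those settings map onto vllm roughly like this; the repo id is a placeholder, not the actual fp8 upload:

from vllm import LLM, SamplingParams

llm = LLM(model="some-org/Mistral-Large-FP8")  # placeholder repo id
params = SamplingParams(
    temperature=0.4,
    top_p=0.9,
    repetition_penalty=1.0,  # 1.0 = effectively disabled
    max_tokens=256,
)
out = llm.generate(["Once upon a time"], params)
print(out[0].outputs[0].text)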
>>
>>101558962
petrus... not like this
>>
>>101558962
>it's a name:value format which explicitly specifies the relations between terms.
>Every statement you process, must be evaluated according to the below six principles.
>"principle of identity":"1 = 1"
>"principle of contradiction":"1 ? 0"
>"principle of non-contradiction":"1 ? 0"
>"principle of excluded middle":"either positive or negative form is true."
>"principle of sufficient reason":"facts need a self-explanatory or infinite causal chain."
>"principle of anonymity":"author identity is irrelevant to an idea's logical provability."
>I still keep this in my own sysprompt, although I know I will receive shrieks and howls in response.
so you do huh
>>
>>101558962
i've seen you shill this card a lot of times and i still don't understand wtf it's doing
>>
>>101558986
Either tell me specifically what I am doing that you're having a problem with, or shut the fuck up. If your issue is simply the fact that the card format violates your own preconceptions, then that's also not my problem. It's the only card I've got that consistently produces good results with every model I try it with.
>>
>>101558962
>This fork has been tested with three major models; MLewd 13b, Mythomax 13b, and Mistral 7b. Mythomax seems to work best.
https://characterhub.org/characters/petrus4/adriana-cruz
why not link your chub?
>>
>>101558829
Is 72GB enough to run it?
>>
>>101559011
Have you actually tried using the card? Chatting with it?
>>
>>101559025
>The design of this fork adheres to my card authoring doctrine, of minimising prose as much as possible, while giving the model descriptions, numerical data, a list of interests, and one or two examples of behaviour, and then letting the AI fill in the rest of the blanks. I feel that it works better than adding every single detail myself, since it encourages adaptive rather than static behaviour. I also use Myers Briggs personality profiles, as a means of providing a full and complex personality, while minimising token expenditure.
word slop of pure pretentious
>>
>>101558962
Why is the word duplication intentional?
>>
>>101559049
i will eventually
>>
>>101559073
you wouldn't understand, petrus thinks in a higher plane of existence, literally
https://characterhub.org/characters/petrus4/hexnet-1d18e703
>>
>>101559082
>>
https://pastebin.com/gHVRraHJ
>>
>>101559118
I thought about doing something like that for perplexity too.
>>
>>101559100
Tokens: 616 (l3 tokenizer)...
>>
>>101559135
Yeah, it saves a lot of time for bigger models. Hopefully they add Mistral Large since 405B is just watery soup.
>>
>>101558946
any LoRA that doesn't change at least 10% of the model's parameters is a cope lora.
>>
>>101559158
4bit qlora 1 epoch rank 8 is enough
donate to my kofi
>>
>>101558379
what shit OS crashes from a user mode app?
>>
>>101559196
windows
>>
>>101559212
"macbook"
>>
VLLM crashed my windows. Piece of shit.
>>
Is it me or is 3.1 8B / 70B noticeably worse than 3
>>
I am once again asking is there anything worth updating for RP over midnight miqu that can run on ~48 vram
>>
>>101559238
gguf?
>>
>>101559238
They're about the same in my testing, but I think there was a lot more shine on 3 due to the hype cycle being so long. Same as I think Gemma 2 27B is being mildly slept on due to it being a bit wonk at release. It's a monster for that size.
>>
>>101559243 (me)
my name is mikufag, btw
and yes, i'm still in denial
>>
>>101559252
>Same as I think Gemma 2 27B is being mildly slept on due to it being a bit wonk at release
Don't most of us know now to wait for finetunes, rather than using a vanilla release?
>>
>>101559256
look I don't give a shit about you schizos crying about shilling free models, I am currently using said model and nothing I have tried easily outperforms it in sillytavern but I have not checked for a month
>>
>>101559229
linux only crashes windows huh?
>>
>>101559269
no? most shit on all tunes pretty much, even the corpo instructs
>>
>>101559256
>>101559169
>>101559100
>>101559082
>>101559073
>>101559054
No matter how much misery you attempt to cause others, it will never equal the amount that you are obviously motivated by yourselves.
>>
>>101558899
I wanted to basically add more knowledge to the model from a small dataset that I will create on my own. I thought about finetuning mixtral or nemo.
>>
File: nemo_sovl_2.png (133 KB, 871x771)
i think i'm gonna stick with Nemo for the time being
>>
>>101559054
>Myers Briggs personality profiles
Amateur shit. Real pros use personality checksums.
>>
>>101559297
I'm enjoying it too.
I'll still give dolphin-2.9.3-mistral-nemo and mini-magnum-12b-v1.1 an honest run.
>>
so is there a guide on what to buy to run the big llama yet?
>>
>>101559291
>No matter how much misery you attempt to cause others
funny coming from the guy that was dooming that anything post mixtral 25 was woke and local was over
>>
I saw someone on le reddit say they ran the 70b on a single 4090, isn't that literally impossible
>>
>>101559272
They won't give you good information. They have no intention of doing that. All they are interested in is trying to spread their own pain. If you want to know what models are good to use, you are going to have to download some and try them yourself.
>>
>>101559318
>isn't that literally impossible
totally possible if you chop 3/4 of the brain out, ~q2 quants
>>
>>101559319
>If you want to know what models are good to use, you are going to have to download some and try them yourself.
Finally some petrus advice i'd agree on
>>
>>101559336
Technically Q4 is 3/4ths removed and Q4 is as low as you can go without it becoming a brain-damage quant.
And also technically don't they do the pretraining at fp32?
So even fp16 is 50% brain removal. and Q4 is 75% of the remaining 50%.
>>
>>101559318
IQ2_M and below are viable on 24GB.
>>
>>101559314
In how many years' time will you still remember my posts, Anon? 10? 20? 50? If you have so little else to occupy your mind, perhaps I've actually done you a favour.
>>
do either llama 3.1 or mistral large have good elx2 quants yet?
>>
>>101559352
Even "brain-damaged" quants are better than FP16 8B.
>>
>>101559364
i dunno maybe i'd forget you if you didn't have such a recognizable "tone" to your posts + typing style, and weren't always doomin
>>
Is offloading context faster than the model?
>>
>>101559252
Honestly, post unfucking, Gemma 2 27B is my favorite model and it isn't even close. So much knowledge, sovl, and understanding of human psychology and emotion that the other models (sans maybe 405B) just can't fucking touch
Just wish it had more context. 8k is basically nothing nowadays
>>
>>101559011
He thinks invoking logical first principles will magically bootstrap models into becoming smarter than they are. Naturally, if they can't manage basic reasoning, the best way to fix this is to barrage them with a list of impractically generic and abstract rules and they'll use their retard-level faculties of inductive logic to overcome the obvious catch-22 and connect the dots and become 10x smarter.

But no one here recognizes his tortured genius and everyone always just writes off his sysprompt as placebo :(
>>
>>101559410
imo nemo being a bit dumber is worth the 128k context. Its also far less dry.
>>
>>101559377
>muh heckin' bencherinos
>>
>>101559418
>But no one here recognizes his tortured genius and everyone always just writes off his sysprompt as placebo :(
>>97309445
>I still keep this in my own sysprompt, although I know I will receive shrieks and howls in response.
>>
>>101559410
>8k is basically nothing nowadays
Depends on your use case. For RAG it's hardly a broom closet, but for coombot cards it's still usable. Then again, if its text is that good, you're probably going to want it to slowburn.
>>
>>101559272
Try looking in /r/LocalLLaMA again, because you got that recommendation from there, Miku.
>>
>>101559418
>He thinks invoking logical first principles will magically bootstrap models into becoming smarter than they are.

Can you prove it doesn't?
>>
>>101559410
Hello sars I would like to take the time to talk to you about google's latest model. Sars are you listening? Sar?
>>
>>101559457
>everything I don't like is le reddit
retard
>>
>>101559418
Don't worry, Anon. You did manage to convince me to give up, for the most part.
>>
>>101559483
but that's not me.. (your no1 fan) like for real, that's someone else...
>>
>>101559460
No, they can't; and they also never bothered trying to come up with an alternate approach themselves. They are a group of maybe 3-4 howling jackals; they produce absolutely nothing of worth themselves. Their only goal is to demoralise and dissuade anyone else here, who might potentially produce something valuable; and unfortunately, they are extremely effective at what they do.
>>
>7B: Llama 3 8B
>13B: Nemo 12B
>30B: Gemma 2 27B
>65B: Llama 3 70B
It all worked out in the end
>>
>>101559410
I RoPE it out to 16k. If you're already running a quant, a little RoPE doesn't hurt as bad as people think. Things for me have been generally stable out to 4x, even for intensive stuff like coding.
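in llama-cpp-python terms that's plain linear rope scaling, a sketch with a hypothetical path (scale = native ctx / target ctx, so 8k -> 16k is 0.5):

from llama_cpp import Llama

llm = Llama(
    model_path="gemma-2-27b-it-Q5_K_M.gguf",  # hypothetical local path
    n_ctx=16384,           # target context window
    rope_freq_scale=0.5,   # linear RoPE scaling: 8192 native / 16384 target
)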
>>
>>101559529
come on google gemma 2.1 128k context, come oooonnnnn!
>>
>>101559515
>Their only goal is to demoralise
once again funy
>>
>>101559536
Even 16k is hardly useful besides quick testing and I'm sure any higher and it becomes retarded.
>>
>>101559529
Gemma 27B is worse than Gemma 9B though.
>>
>>101559601
Yeah how did google manage to fuck that one up so bad?
>>
>>101559643
distillation worked too well
>>
>>101558378
openai lost, big time nasty
>>
>>101559436
I tested IQ2 70B quants with my 3090 and they do feel better than 8B Llama 3.1 like the graph suggests. Better at following the prompt, at least. They would probably have less damage if output and embed tensors were quantized to something higher.
>>
>>101559667
Hello and welcome Robert!
>>
>>101559601
No, that's definitely not true. Earlier implementations lacked softcapping support though, and apparently that affected the 27B model more. GGUF quantizations done before the fix will remain defective.
>>
File: file.png (416 KB, 529x718)
>>101559678
ROBBERRRTT!!!
>>
>>101559667
>They would probably have less damage if output and embed tensors were quantized to something higher.
https://huggingface.co/RobertSinclair
https://huggingface.co/ZeroWw
>>
>>101559707
>GGUF quantizations done before the fix will remain defective.
Tried after and SPPO 9B still mogged 27...
>>
>>101559711
Nah, q5_k to q8_0 will be enough, no need for f16.
>>
>>101559748
>no need for f16
But (ZeroWw) quantizations...
>>
>>101559740
It's okay anon
I believe you
>>
>>101558898
fp16, fp8 or awq?
>>
>>101559601
That's not true at all. And I'm not talking about benchmarks.
>>
>>101559894
FP8
>>
>>101559997
made yourself or from huggingface?
>>
>>101560013
>>101560013
>>101560013
>>
>>101559515
this whole thread is odd
i've never heard of your magical sysprompt and I'm inclined to believe all the posts defending you are simply you with a different hat on
>>
>>101560128
Correct.
>>
I tested llama 3.1 8b, gemma 2 27B and mistral-nemo 12B in my native language, German - mistral wipes the floor with llama and gemma.
mistral - perfect german, coherent and meaningful answers
llama - good german, produces mostly nonsense
gemma - broken german, not usable

thanks mistral, i am now your fan
>>
>>101560266
hi Wolfram Ravenwolf



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.