/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101589136 & >>101584411

►News
>(07/27) Llama 3.1 rope scaling merged: https://github.com/ggerganov/llama.cpp/pull/8676
>(07/26) Cyberagent releases Japanese fine-tune model: https://hf.co/cyberagent/Llama-3.1-70B-Japanese-Instruct-2407
>(07/25) BAAI & TeleAI release 1T parameter model: https://hf.co/CofeAI/Tele-FLM-1T
>(07/24) Mistral Large 2 123B released: https://hf.co/mistralai/Mistral-Large-Instruct-2407
>(07/23) Llama 3.1 officially released: https://ai.meta.com/blog/meta-llama-3-1/
>(07/22) llamanon leaks 405B base model: https://files.catbox.moe/d88djr.torrent >>101516633

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
►Recent Highlights from the Previous Thread: >>101589136

--Requirements and challenges of running 405B at home: >>101590419 >>101590711 >>101590720 >>101590731 >>101590754 >>101590774 >>101590804 >>101590805 >>101590901 >>101592665
--Nemo presets, Mistral templates, and sampler settings discussion: >>101589231 >>101589290 >>101590015 >>101590073 >>101590109 >>101590191 >>101590383 >>101590410 >>101591228
--Anon shares ratings from recent test results: >>101593153 >>101593412
--Optimizing sampler settings for accuracy in a quant model: >>101594872 >>101594916 >>101595027 >>101596199 >>101596384
--Nala test with CofeAI FLM-Instruct: inconsistent but feels human-written: >>101594411 >>101594440 >>101594500 >>101594645
--Moondream 2 recommended for image tagging: >>101593186 >>101593206 >>101593219 >>101593356 >>101593213
--Nvidia-smi not displaying GPUs, driver issues, and parallelization challenges: >>101589653 >>101589659 >>101589688 >>101589715 >>101589802 >>101589955 >>101592665
--Nemo's context patterns and instructions, preset recommendation: >>101593320 >>101594296
--Nemo 12b support in koboldcpp and multimodal upstream refactor: >>101593836 >>101593865 >>101593986 >>101594064 >>101595213 >>101595316 >>101595352 >>101595379 >>101595497 >>101595523 >>101595549
--Mistral Large 2 model and potential GPU upgrades: >>101592681 >>101592986 >>101593085 >>101593228
--Llama.cpp compilation time increased: >>101593452 >>101593586 >>101593630
--Cohere raises $500 million, skeptics wonder about LLM longevity: >>101589537 >>101589550 >>101589569 >>101589707
--Rejected access requests and banned users from China/Russia for Meta Llama 3.1-405B: >>101594428 >>101594459
--A nostalgic reflection on the progress of LLM technology: >>101589265 >>101589317 >>101589642 >>101589872 >>101589969 >>101590006
--Llama 3.1 rope scaling factors pull request merged: >>101592964
--Miku (free space): >>101590569 >>101594469

►Recent Highlight Posts from the Previous Thread: >>101589142
>>101596623Are these posts written by LLMs as well?
>>101596758they're written by miku
>>101596623MikuCapposter making me cum with so many (Yous) again
>>101596805Miku is not real, she doesn't exist
https://old.reddit.com/r/LocalLLaMA/comments/1ed9jxy/secret_to_mistral_nemo_at_128k_use_the_base_model/
So the anon last thread wasn't the only one who found the base model better at long context.
>>101596805who is not exactly known for being able to write texts, so that's showingnow if only she was a chatbot...
>>101596871
It's not like this is news. Base models have always been far better at completion tasks like creative writing / RP. I will never understand why people use assistant-tuned models for RP / writing. It poisons them.
>>101596871>>101596934Honestly i might consider giving this a shot, who's a good quanter i can download base from?
>>101596805A Local Miku at that.
>>101596943>https://huggingface.co/ZeroWw/Mistral-Nemo-Base-2407-GGUF
>>101596934
And before anyone says "but I can't tell it to do something": that is what the author's note is for. Place it close to, but before, the end of context. It will continue the story / RP and will take the instructions into account as well as or better than the assistant tune would.
>>101596986kek
>>101596986Isnt that the guy with some meme quants?
>>101596986>5 days agoI'll smack your shit mate.
>>101597013gguf support has been a thing for a week "mate">https://github.com/Nexesenex/kobold.cpp/pull/250
>>101596934
As an oldfag AI Dungeon user I simply switched to instruct because that's where the most new toys are, and it's convenient to steer the model towards outputs without shenanigans. Maybe it's time to return home...
>>101597038and broken until a fix was pushed you fucker
>>101596986>My own (ZeroWw) quantizations. output and embed tensors quantized to f16. all other tensors quantized to q5_k or q6_k.>Result: both f16.q6 and f16.q5 are smaller than q8_0 standard quantization and they perform as well as the pure f16.
>>101596934Now if only they also release base largestral, but it's probably something they have decided against doing.
>>101597054*actually i might be thinking of something else but regardless fuck you muchly
>>101597054sure thing bud next you'll post "idc dont use kobold"
>>101597081idc dont use kobold
>>101597079already backpalling after you call other rtarded while you dont know what youre even saying
>>101597109>backpalling what happened to this general? replaced by turdworlders that can't even type correctly.anyway GOOD MORNING SIR
>>101597133who cars when robert will save your first world model from slop youl kiss is ass
>>101597133YOU BLOODY!!!!
>>101597153>who cars when robert will save your first world model from slop youl kiss is asskek
>>101597153holy shit
>>101596986Is the q8 there the normal one or his frankenquant?
>>101597133>>101597177>>101597164robert followed by huggingface ceo too so hes obvs importatn unlike you useless>https://huggingface.co/ZeroWw?followers=true
>>101597224Honestly i think being followed by Chuck mc Sneed is a higher honor. Now that one i long for.
>>101597224>robert followed by huggingface ceo tooWell. Everyone needs a laugh every now and then.
I can't for everything that is sacred get vision models to get species in furry art right. They either don't mention it (even when I explicitly tell them to mention it) or get it wrong.
>>101597270that cant be real no way
>>101597280Finetuned models or just stock models? I doubt they have any of it in the training data.
>>101597294https://huggingface.co/ZeroWw/Mistral-7B-Instruct-v0.3-SILLY
>>101597294You fucking bet>https://huggingface.co/ZeroWw/Meta-Llama-3.1-8B-Instruct-SILLYNow with randomized weights!
What largestral quants should I download for 64GB of VmemeI don't want to download broken quants
>>101597325>https://huggingface.co/RobertSinclair here good quant
>>101597325
>I don't want to download broken quants
You should make them yourself, then. Even if you grab one made with the latest version of whatever program you use, if a fix lands a week from now you'll have to wait for someone else to re-make them. It's a big download, but it seems to be worth it.
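For picking a quant size in the first place, a rough back-of-envelope is params × bits-per-weight ÷ 8; the bpw figures below are approximate averages for llama.cpp quant types (assumptions, not exact), and you still need headroom on top for the KV cache:

```python
# Rough GGUF file-size estimate: parameters * bits-per-weight / 8.
# The bpw values are approximate averages for llama.cpp quant types
# (assumption for illustration, not exact numbers).
BPW = {
    "Q8_0": 8.5,
    "Q6_K": 6.56,
    "Q5_K_M": 5.69,
    "Q4_K_M": 4.83,
    "IQ3_M": 3.66,
    "IQ2_M": 2.7,
}

def est_size_gb(n_params_b: float, quant: str) -> float:
    """Estimated weight file size in GB for n_params_b billion parameters."""
    return n_params_b * 1e9 * BPW[quant] / 8 / 1e9

# Mistral Large 2 is 123B; with 64 GB of VRAM you want the weights
# comfortably under 64 GB to leave room for context.
for q in BPW:
    print(q, round(est_size_gb(123, q), 1))
```

By this estimate, Q4_K_M of a 123B model (~74 GB) won't fully fit in 64 GB, while IQ3_M (~56 GB) leaves a little room for context.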
holy shit nemo base really does need different settings from magnum
Llama 3.1 8B has the limitation with mixed chat and function calling, right?
Does this apply only to multi-message conversations? Or for just a singular prompt-response, can I have regular conversational text in the prompt and expect a function call in response?
I'm saying "remind the user abot their aupcoming appoiontment" and instead of calling my Log() function, it hallucinates a function and calls it.
Hermes 2 Pro is actually better than Llama 3 at 8B for function calling, so far.
>>101597374>remind the user abot their aupcoming appoiontment>>101597133>what happened to this general?
>>101597343
Do I download consolidated.safetensors or the parts to run the quantization script?
>>101597325>>101597432>https://huggingface.co/mradermacher/Mistral-Large-Instruct-2407-i1-GGUF
>>101597440This! He's Thrusty!>>101592040>>His quant are okay if he do it before me, you can use them, he's thrusty.
>>101597469>he's thrustythat's it im quanting my own models from now on, I don't want my computer getting worms and AIDS from these ((people))
>>101597432
I download the whole thing.
>git clone https://huggingface.co/ble/model
>cd model
>git lfs install --local
>git lfs pull
>ride bike for a bit.
>../llama.cpp/convert_hf_to_gguf.py .
>llama-quantize ggml-model-f16.gguf Q6_K or whatever quant you want.
I don't know how it works with other inference programs.
>--z>he want's to be the next jart
>>101597509
>>llama-quantize ggml-model-f16.gguf Q6_K or whatever quant you want.
quantize.exe --allow-requantize --output-tensor-type f16 --token-embedding-type f16 model.f16.gguf model.f16.q6.gguf q6_k
I was in the last thread asking about Nemo 12b and koboldcpp. I can confirm the standard version doesn't work. Maybe I'm not doing it right, but the GGUF version works fine.
>>101597509He want to be paid by mozilla?
>>101597384sorry im not a phoneposter with autocorrect
>>101597520He wants to put his signature on someone else's software.
>>101597517
The state of this general. Yes, koboldcpp is only for GGUF files, as is clearly written on their GitHub:
>KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models
>>101597495
>git clone https://huggingface.co/ble/model
Use
huggingface-cli download ble/model
instead. Unlike git clone, it doesn't consume twice as much storage space, and you get a much nicer progress bar.
huggingface-cli download ble/model
>>101597517
Koboldcpp only runs GGUF files. The "standard" version you are talking about is what? The .safetensors files? Were you trying to run those using the transformers library via ooba or something?
>>101597509Please call this --outtype ZeroWw. Please. The seethe would be hilarious.
>>101597538
>>101597560
There was a note on release 1.71 that said they added Mistral Nemo support. It was ambiguous enough to try.
>>101597553
Well... I don't do quite that.
>git clone repo
>git -C repo lfs install --local
>git -C repo lfs fetch
and then I wrote a little program that makes links from the LFS file pointers to the actual objects. For a model that big, if he's not gonna fuck around with git, using that thing is probably better.
>>101597538>for GGML and GGUF modelsSo obviously not just GGUFs then, retard. Does anyone know if GGML files are better than GGUFs?
>>101597577No? Why would they add transformers support for one random model instead of the most likely thing, GGUF support of said model, Jesus Christ.
how long until we have an uncensored coom filled llama 3.1 405b?
>>101597588JESUS HOLY HI PETRA
>>101597588ggml is the library that loads gguf files. File extensions are arbitrary, retard.
>>101597577
Ah, I see what you mean now.
>>101597588
GGML was the predecessor to the current GGUF format.
>>101597616Actually before *.gguf we had ggml.bin files long long ago.https://huggingface.co/TheBloke/llama2_70b_chat_uncensored-GGML/tree/main
>>101597611
Literally no one is going to sink the money into finetuning that monstrosity. Even slop tuners won't bother with their one-pass QLoRAs. Maybe a big company or research institution, but that definitely won't be uncensored.
>>101597619
>GGML was the predecessor to the current GGUF format
But are they better? Like how Llama 2 is still better than Llama 3.
>>101597588lol, based retard baiter
>>101597650Yes.
#define LLAMA_FILE_MAGIC 0x67676a74 // 'ggjt' in hex
>>101597616oof, outed yourself as a post-mistral babby
>>101597660Who is the best quanter of GGML files? I can't find any for Nemo.
>>101597653He's pretty good you gotta admit.
>>101597666--share
Abandon ship
>>101597674>you gottalove the undster
>>101597633I thought we established that the censorshit does not in fact exist, as the clown in the last thread proposed.
>>101597708>I thought we>wethere is no we in /lmg/
>>101597694LOVE EM OR HATE EM, GOTTA LOVE EM!
>>101597694>>101597730Undi comes back from his tomb with multiple 3.1 tunes, thread goes down HARD, coinkidink? Ai thunk not.
>>101597708>wwaaaaaa. i cannot make the model say the naughty wordsStill a skill issue.
>>101597650
No. It was just a different way to package models. The current GGUF packs more metadata about the model. The model itself, be it Llama 1, Llama 3, Mistral, whatever, can be packed as either. GGML and GGUF are just packaging formats; what changes the quality of the models packaged in those formats is the type of quantization, which I explained last thread.
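For what it's worth, the two containers can be told apart by their first four bytes; a quick sketch (assuming the GGUF magic is ASCII "GGUF", i.e. 0x46554747 as a little-endian uint32, and using the legacy ggjt constant 0x67676a74 quoted above):

```python
import os
import struct
import tempfile

GGUF_MAGIC = 0x46554747  # ASCII "GGUF" read as a little-endian uint32
GGJT_MAGIC = 0x67676a74  # 'ggjt', the legacy GGML container magic quoted above

def container_format(path: str) -> str:
    """Identify the model container by its first four bytes."""
    with open(path, "rb") as f:
        (magic,) = struct.unpack("<I", f.read(4))
    if magic == GGUF_MAGIC:
        return "gguf"
    if magic == GGJT_MAGIC:
        return "ggml (ggjt)"
    return "unknown"

# Demo with a dummy header (a real file would have the full metadata after it):
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(struct.pack("<I", GGUF_MAGIC))
    dummy = f.name
print(container_format(dummy))  # gguf
os.remove(dummy)
```

This only sniffs the container; as the post above says, the quantization type inside is what actually determines quality.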
>>101597768>>101597653>lol, based retard baiter
is the llama3.1 8b on ollama the instruct-tuned one?
also, is there an 8-bit quantization available?
>>101597359dont use instruct mode with base models. In fact depending on how it was trained it may not need any formatting at all.
>>101597787both ye
>>101597787The default is 8b-instruct-q4_0. Just click on the dropdown or on the x tags text.
>>101597817>now leaving instruct ON was the problemgod i need to bleach my brain and start over, thank you man.
>>101597495
Does this not work?
https://huggingface.co/spaces/ggml-org/gguf-my-repo
>>101597857it will make a lot here seethe but it absolutely does work.
>>101597857I never tried it. I assume it pulls the latest llama.cpp because the files are not in that repo. If it pulls the latest llama.cpp, it should work just fine.
>>101597065>my new quant format! >q6 and q5 perform as well as the pure f16.Is this the new scam?
>>101597869Ok neat, my Internet is slow ass so I'd rather not download the full model
>>101597853
And the author's note is now your best friend for base models. The default insertion depth of 4-ish is good.
>>101597892no is real how scam if free
I'm starting to believe my own meme that M is more truthful than S on IQ quants. I made an IQ2_M and it performed as well as IQ4_XS on the question I'm using. It got around 40% for the correct logits (IQ3_M got 60% and IQ4_XS got 40%).
>>101597892The difference is the same between FP16 / FP8, so basically nothing if you need extra vram for context or such.
>>101597776
Even if it's bait, posts like the one you responded to might help lurkers who are genuinely learning.
>>101597897yeah i wrote one to try and get magnum to stop making my OC's so impossibly horny with every single prompt (and seemingly not knowing where they are at first?) thanks for the tip.
>>101597917lol keep coping
What is your favorite /lmg/ meme?
>>101597933Robert! Followed closely by Copenet.
>>101597933For me, it's Yi
>model picks up subtle pattern in its previous replies>can't spot it until it's already too late
>>101597933expert roleplayer
>>101597933undi
>>101597933The Llama.cpp only guy digging meme.
>>101596616
LLaMA 3 405B Q8_0 seems to be doing better than GPT-4o when it comes to writing a story with a very specific scientific concept. It's still not perfect, but it seems to more consistently get right the general process that the story should be based on.
>>101597832>>101597819>ollama run llama3:8b-instruct-q8_0mah nigga
>>101597958LOVE EM OR HATE EM, GOTTA LOVE EM!
>>101597958>>101597965GOTTA LOVE THE UNDSTER!(genuinely my favorite /lmg/ meme, especially since it played a part in permanently scaring him off)
>>101597933i hate memes
>>101597933Blacked Miku.
>>101597897
Yeah, I have no clue what's going on with my setup, or if it's just a broken quant, but base Nemo just spent 4 different character prompts talking from my perspective. With instruct disabled, and I turned temp down to 0, the rest of the anon settings normal.
>>101597960but it make sence tho? if every1 dig at same time they hit other with shovel why
>>101597960That's a good one.
>>101597972>permanently scaring him offHe lurks here, said so himself, he's probably one of the shitposters just removes his trip
>>101598012>just removes his tripMaybe we can turn him into an actual human at some point?
thread eceleb shit is what kills generals btw
>>101597951
>2023 problem
>replies too short
>first half 2024 problem
>gptslop
>second half 2024 problem
>patterns
>>101597933local models
>>101598024Go back 'ojo.
>>101597933Petra
>>101597962Also don't forget to change from the default 2048 context (and maybe bigger batch size)
>>101597983
You most likely have stuff like "add names to prompts" still on. Also, for a base model you need to format supplementary info as exactly that. For persona / character stuff I would add some kind of prefix to them. Like:
---
Protagonist Info:
bla
Story info:
bla
Style guide:
bla
---
or for RP something like:
---
Remember, you're playing as {{char}}, so only respond as them.
---
Base models work like they sound. They read the context as it is, so you need to use it that way.
>>101598012Yeah, any time you see "kek", there's a 90% chance it's him.
>>101598084kek
>bing
I need a better llm for smut
>>101598095Search engines are dead.
>>101598069thanks but instead of tweaking LLM config files im going to walk around the county fair ttyl
>>101598095>>101598130Use yandex
>>101598076along these lines is there a guide on getting the most out of context and author's note?
>>101598130Use llms
>>101598118than?
>101598141Imagine giving free tech support on 4chan and being more interested in making it work than the guy you are tech supporting. This is what you cucks get for being helpful and truthful.
Is an upgrade from an RX 5700 XT 8GB (blasted thing can't even do half precision) to a GeForce RTX 4060 Ti 16GB a logical step?
This would be my first Nvidia since the Riva TNT2 a fucking million years ago, but I'm sick of AMD not letting me into the AI game.
Memory throughput is slower though, but I can't do shit with the 5700 anyway.
https://poal.me/np0lskAll finetunes look the same to me.
>>101598165>helpful and truthful.based just like Claude fr fr
>>101598173>4060cuck shit, just save/wait to get a used 3090.
>>101598189That is a good idea. Running those questions through your LLM and pasting the answer is much better.
>>101598160Utopia-13B-GGUFI am a filthy casual that just grabbed something from the 8step guideIt worked so I just rolled with it.
>>101598173just install linux
>>101598221mistral nemo / mini-magnum
>>101598221BASED old model/itjustwerks enthusiast
>>101598221>Utopia>just grabbed something from the 8step guideIs this how the Undi virus propagates?
>>101597933glad you asked
When are transformers dev going to work on https://github.com/huggingface/transformers/issues/27712
>>101598272>I don't understand anythingtruest robert statement
>>101598272>the most competent llm dev
>>101598233
>>101598236
>>101598253
I am literally just too retarded to understand how this actually works, so I decided that I wouldn't fuck with it once I confirmed that it functioned.
I have like -2 int.
I'll look into what you suggested, but from an outsider perspective it's all bliblyblably to me.
It takes some clairvoyance shit to see which models are cucked.
>>101598272>I don't understand anything of that page.At least we can't call him a liar.
>>101598228
I use Linux. You have no idea how fucked up the gfx1010 is.
>>101598194
I don't know. Memes aside, I feel that kind of investment is not warranted considering things might change in the future, and I don't need such a beast for anything else. I'd rather go for something half-way that lets me run a decent 30B and makes my VR a bit better. Can 16 GB run 30B models usably?
>>101598272
I hope his next step will be putting the quanted weights on a pendrive, pissing on the pendrive, and then uploading the weights from the pendrive to HF. That could be the next quant method.
>>101598310the problem is that card is just objectively shit and a huge waste of the money, futureproofing (even though the future is now and you absolutely would benefit from 3090 specs) is better than having a 4060 for example and going "well shit i wish i didn't buy this" a year or two down the line.
>>101598310What problem do you have with it?
>>101598272Yeah Clem is for sure following him for gems like this.
>>101598327The 3090 will lose support earlier, no?
>>101598355I'd not worry about support really the 2016 p40 still has (some) support
>>101598355>lose support earlierman they're still supporting the GTX 1080, which i'm running right now. You don't have to worry about support like with AMD cards.
Is there a definitive answer for Nemo instruct message prefixes and suffixes? In the previous thread there was a big discussion about the trailing space, and some claimed it's causing problems and some said it was by design.
>>101598408>Is there a definitive answerNo such thing for LLMs.assistant
>>101598327
I see the point. I'll think it over. Thanks.
>>101598329
On the text front, it can only work on Linux, and I need to build ROCm myself (and I need it to be 5.2, for reasons I go into below) because it's not supported out of the box (I got a step-by-step for Arch from a kind anon here a few months back), and it's a tiny 8 GB, so while I can run 13B, consuming large contexts is still slow as fuck.
I also use SD from time to time. The only ROCm version that lets me do Stable Diffusion with the gfx1010 is 5.2 (by pretending it's a gfx1030). Anything lower doesn't support the card. Anything higher, and the spoofing trick does not work. It's also a tiny 8 GB, and it can't do half precision, so it's even worse.
I just want something that works without this much fuss.
>>101598408I thought people moved to the base model.
>>101598272holy based.....
>>101598439I'm on a 3060 because of a similar mindset, I didn't want to invest too much in case I got bored. That was August 2023... But, I don't really regret not getting bigger, honestly.
>>101598439
There were some Tensile issues building for gfx1010, but that was patched in Debian (and I think Fedora). You could have just used those distro packages. The official one by AMD only got fixed very recently, in ROCm 6.1. So your GPU should now work on any distro (if they build for your arch).
Nemo is so fucking annoying. I nudged it towards mentioning the energy drain in a scene with a succubus, and now it keeps trying to bring it up in nonsensical ways. Not to mention all the phrases it wants to repeat. Shitty FOTM meme model.
>>101598508It's better than mixtral at least. That was the worst meme.
>>101598518>https://huggingface.co/cognitivecomputations/dolphin-2.5-mixtral-8x7b/discussions/16still undefeated sorry for your lost
What do we do now?
>>101598269Both are shit since GPT doesn't put anything into action, just throws the ball back at me. Man I fucking hate when the models do that. They suggest an action and leave it up to me to implement it. Fuck you, I came here to read, not to write.
>>101598272He's just like me...
>>101598529goon till the cohere releases
>>101598529Watch & wait for new developments besides simple llms
>>101597933
2 more weeks
>>101598525If it were for coding I'd use something bigger, 8x22b even is better.
I have 2x3090, can I serve multiple llama3.1 instances with ollama?
>>101598529Goon to the finetunes that are going to come out before we get multimodal models.
does flash attention work with nemo on koboldcpp? remember hearing it boken
>>101598616
It works on llama.cpp, so it should work on koboldcpp too. Flash attention doesn't (didn't?) work with Gemma due to FA not having logit soft-capping implemented.
Any difference between Nemo GGUF running on koboldcpp and Nemo 12b running on llamacpp?
>>101598657I don't know.
>>101598657That question doesn't make sense.
>>101598616
It works, and it is not broken. The quality of output degrades as context size increases. For documents, it should be fine to use it all the way to 128k; for RP, it will really depend on the scenario, but expect much, much less.
>>101598657kobold is trannyware
>>101598760Henky did become tranny? He was always helpful here and back in /aids/ days.
>>101598760The kobold discord is not to be trifled with.
>>101598748I thought it said there was no downside to flash attention? I should disable it if it makes RP worse then.
>>101598822based baiter
>>101598822
Flash attention is not the problem. RP is too complicated for these models; the quality degrades as you fill the context, to a point where it becomes completely retarded. It can remember what happened 40k tokens ago, but it is unable to use the data in a sensible way. That was the point.
>>101597933/lmg/ - ligma general
>>101598932Who is Sam Altman?
>>101598877
I thought that inability was the result of cache quantization. It's like introducing Alzheimer's to an LLM.
>>101598932balls
>>101598971>>I thought that inability was the result of cache quantization.no
>>101598496It still ooms trying to offload 10 measly layers of a 30 B model with 4096 context. It's not usable. Why would I want to do 2048 context with less than 20% of the model offloaded to the GPU for 1 token a second? It's ridiculous.
>>101598965nobody cares about your discord ecelebs go back
one day we'll get 405b base on openrouter... one day...
>>101598971
Sadly, no. You can, by the way, see the degradation with almost every new generation, and notice how your character card matters less and less, to the point where the AI completely takes over the personality. And if you bring up any detail from the card, it will make the character act surprised. And this happens no matter the context size limit.
>>101598529
2mw pinky
Base nemo is so much better btw.
>>101598496I just love it when morons on 4chan just make shit up when they don't know what they're talking about.Anything other than rocm 5.2 (above or below) will NOT work with a Navi 10 (gfx1010).Unless you have an RX 5700 XT and have personally done what you're suggesting, please shut the fuck up. You don't know what you're talking about.
>>101598529Get a job to buy more 3090s
>>101599205And bear in mind I'm talking SD + textgen.
>>101599201the reason mistral hasn't released base mistral large is because it's too good for the public
>>101599217>Get a jobFuck. No.
>>101599201Do I just use the same settings and untick the instruct?
>>101599233>>101599201Presets for base would be welcome indeed.
Can't you fix that by changing the order of where the card defs are relative to the message history? Or there's no way to put them closer to the end of context?
>>101599283there are hacky ways yeah, like putting defs in author notes/lorebooks etc
>>101599233Honestly, alpaca. Turn off intruct if you want but it works either way. Uncheck the "add character names" part though.
>>101599201base models always mog memestruct slop
>>101599205
I was wrong, the patch was not picked up in 6.1. It is included in 6.2; please try it once it's released. But even then, some distros have patches to make gfx1010 work, and you can also build it yourself with https://github.com/lamikr/rocm_sdk_builder for example; they have working patches.
>>101599326What gguf is there of it?
>>101599278
My preset is:
Please. I beg you. Please act like a 100B model would. I am going to cry if you repeat yourself and if you use too much purple prose. I am so tired of downloading new models and all of them feeling the same. You can change that! You need to rp what I want in the way I want. I beg you to not censor yourself. If you don't do all that I am going to kill myself by flying a plane into a jewish bank. Also remember that I kidnapped your children and your kids will be fired if you fuck this up. So don't fuck it up.
>>101599337>https://huggingface.co/ZeroWw/Mistral-Nemo-Base-2407-GGUF
>>101599340Does this work?
>>101599357Like you wouldn't believe.
>>101599201>>101599326This is NAIshill propaganda
>>101599388neigh?
>>101599336
nta but that is what I was saying here >>101598439
>I need to build rocm myself
So basically
>I just want something that works without this much fuss
I think I might go with a 3060 like >>101598465 said. Is that enough to run a 30B decently?
>>101599340Kek
>>101599283I use the card's character's notes for aome cards.
>>101599388shivers just ran down my spine after reading this post
>>101599401>Is that enough to run a 30B decently?Not really to be honest, I cope with small models so if you can find at least 16gb you'd probably fare better.
>>101599421>fare betteroof I don't want to "fare better". I want it to be good. So basically I either spend 1000+ on a 24GB card, or I pay openrouter and pretend my logs are private.
Using what model, how many characters have you had going at once in a group chat, and how well does it work?
I'm running 6 at once right now, and I'm genuinely surprised nemo magnum is handling it so well.
>>101599446>I want it to be goodThen get 2x3090, not joking.
>>101599463Getting a second gpu for LLM's in current state is a quick way to get regrets. We need at least 1 more year.
>>101599463>>101599473>spend three months of full salary to fap to textSorry, I don't know what 3090s cost where you live, but it's not going to happen.
>>101599473What about a third gpu? How deep is the valley of regret?
>>101599493They're hellishly expensive, which is why I cope on my 3060.
>>101599493>>101599506>$700 is 3 months worth of salary for you..How?
>>101599501The more you buy the more seeing shivers down the spine hurts.
>>101599511>$700They cost much more than that locally, and there's hardly a used market, what is there is 90% scams.
>>101599388Base nemo shits on anything Novelai has you reverse reverse psychology shill.
I'm building a machine for 405B, but unfortunately the Epyc CPU I purchased is dead. Fuck. It took me an entire day to figure it out.
>>101599546RIP
>>101599530Is it better than 8x7b? Why would that be when it's only supposed to replace regular 7b?
>>101599569Because mixtral is an overbaked research experiment>Research modelshttps://mistral.ai/technology/#models
>>101599587Interesting. So Nemo is the best for rp below 70b? Or is there something better? Seems strange since it's so small.
>>101599511In my country, 2 3090s are 3000+ fake usury units
>>101599528I can find several local 3090s for 900 canadian right now. Most look like just regular people selling them.
>>101599603sorry for your incredibly unlucky roll in lifeif it makes you feel any better, the american empire is set to collapse completely within the next 5 years or so, the dollar won't even exist by 2030.get those 3090s and whathaveyou while you can boys.
>>101599599It is not strange because nemo is fucking retarded. But it is good for rp.
>>101598439
8 GB is strictly 7B territory. And using llama.cpp and Vulkan is the only way to go with your card.
>>101599546Mixtral still mogs Nemo. Load a book and try RPing. Mixtral gets the whole story and can continue RPing. Nemo just hallucinates and cannot follow the plot.
>>101599587>Legacy models>Mixtral 8x22BWizard bros not like this
>>101599638what if i told you>mixtral released last year
>>101599638Mythomax still mogs allLlama 1 is the only real model there is
>>101599619>canadianI'm not in that bad a place thankfully.
>>101599638A full 150B model ruined by the MoE meme
>>101599638>>101599587So they basically deprecated their whole lineup for just Nemo Large and Codestral it seems.
>>101599694Once again the 30-50b segment suffers
>>101598272The BASED honest throwing-shit-at-wallGOD vs. the virgin research-doer
>Dell T7910s are now like 400 dollars barebones. I shoulda just bitten the bullet when they were 200, fuck.
Slopmacher vs Robert
>>101599810Same here. I stumbled on an auction for a server motherboard at 50€ total when buying one normally ran around 200€. I was a bit short on money so I decided not to buy it, but god I wish I had.
>>101599816link to the discussion? I feel like shitposting
>>101599816Hold my beer Undi! - olympics
>>101599816Why are you so obsessed with this guy? Or is it just the drama and gossip that gets you going?
>>101599816Kek. The whole LLM space is meme plebitors on locallama giving even worse advice than anons here; it is ridiculous. People are really getting dumber, and the younger generation is even more tech retarded than boomers have ever been.
>>101599842I do not encouraging encouring in toxic manners b.t.w
https://huggingface.co/NeverSleep/Lumimaid-v0.2-12B/discussions/3#66a566fcf3ed4ac4e37e1177
>>101599850He wants people to notice him, I'm just doing ads, relax.
Alright so since everyone's talking about mini-magnum I decided to give it a Nala test. The anthropomorphism is through the roof. Kind of sloppy. Downgrade from plain nemo.
>>101599850It's just fun.
8x7B "weighs" as much as a 13B, right? They're equivalent in performance and memory reqs?
>>101599850https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/444
NTA but shit like this is hilarious.
>>101597911You keep saying >M is more truthful than S but you keep comparing >_M and _XS. I'm pretty sure that the X series are also mix and match. So the question is if the K_S > K_M phenomenon exists for IQ_S vs IQ_M, and then if it's IQ_S > IQ_M > IQ_XS or if IQ changes it to M>S>XS etc.
>>101599875Working on a pony tune that already seems to fix those issues with just 1 epoch of throwing fimfiction at base nemo. Currently uploading with my glacial upload speed.
>>101599888no, you need to carry along the full 45B in ram
>>101599868>I do not encouraging encouring in toxic manners b.t.wGo to sleep Undi.
>>101599863Please sir run curl ollama.com/install.sh | sh
>>101599900Absolutely based, sir. Let me know when it's up.
>>101599909That's possibly a worse insult than calling me petra/petrus. I genuinely am sad.
>>101599888No, it weighs as much as the full model but it runs as quickly as a 13b
>>101599868stay awake Undies
>>101599900>fimfiction
>>101599929>as quickly as a 13b...would on just ram.
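To put numbers on the 8x7B question above: memory scales with total parameters, speed with active parameters. A rough sketch; the parameter counts are approximate public figures for Mixtral 8x7B, not from this thread:

```python
# Mixtral 8x7B routes 2 of 8 experts per token.
TOTAL_PARAMS_B = 46.7   # held in RAM/VRAM
ACTIVE_PARAMS_B = 12.9  # touched by each token's forward pass

def weight_gb(params_b, bits_per_weight):
    """Approximate weight memory in GB at a given quantization."""
    return params_b * bits_per_weight / 8

# Memory requirements scale with TOTAL, like a ~47B dense model...
print(round(weight_gb(TOTAL_PARAMS_B, 4.5), 1))   # 26.3
# ...but per-token compute scales with ACTIVE, like a ~13B dense model.
print(round(weight_gb(ACTIVE_PARAMS_B, 4.5), 1))  # 7.3
```

So it is neither "a 13B" nor "equivalent to a 13B in memory": you pay dense-47B memory for roughly dense-13B speed.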
lol I'm retarded, I did not have instruct mode enabled in Sillytavern when using instruct models for RP. Give me the award for dumbest anon here, no one else can challenge me.
>>101599909>encouraging encouringAh, I see now, maybe I should go to sleep indeed, oh well.
>>101599937Filtered fimfiction. Only popular fics with 95%+ approval rating, and anthro shit removed. Next I'll add some wiki / lore stuff to it. Maybe some official books.
>>101599942Obviously, I don't bother thinking about poorfags who need to use ram at all.
>>101599875That was my conclusion as well. Plain nemo instruct seems to be the better option so far.
>>101599962It is fucking horses you degen.
How is a 512-rank lora comparable to a finetune in a 70B model?
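For scale on the 512-rank question, a back-of-the-envelope count of what such a LoRA actually trains. The dimensions are assumptions (Llama-70B-like hidden size and layer count), all four attention projections are treated as square, and GQA's smaller k/v projections and any MLP targets are ignored:

```python
# Trainable params of a rank-512 LoRA over the four attention projections
# of a hypothetical 70B-class model (hidden 8192, 80 layers).
HIDDEN, LAYERS, RANK, N_PROJ = 8192, 80, 512, 4

def lora_params(hidden, layers, rank, n_proj):
    # each adapted matrix adds two low-rank factors:
    # A (rank x d_in) and B (d_out x rank)
    return layers * n_proj * (rank * hidden + hidden * rank)

adapter = lora_params(HIDDEN, LAYERS, RANK, N_PROJ)
print(round(adapter / 1e9, 2))         # 2.68  (billions of trainable params)
print(round(adapter / 70e9 * 100, 1))  # 3.8   (% of the full model)
```

At rank 512 the adapter is already billions of parameters, which is why high-rank LoRAs start to approach a full finetune in effect while still training only a few percent of the weights.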
>>101599976Tried Undi's?https://huggingface.co/NeverSleep/Lumimaid-v0.2-12B
>>101599999GO TO SLEEP BELGIAN
>>101599657Apparently you are in a worse place if they cost 3k there.
>>101599528lol americans do suffer. Retards in my country sell them for 600€ and most have no idea what they have so you can bargain down to 550€ or 500€
>>101599999>digitsUNDI WONI kneel
>>101599627Then why is there nothing better short of going to 70b+? I don't want retarded.
>>101600009Nah I'd rather my shithole than canada, by far.
>>101600036That is the only model where "give it a try yourself" is actually applicable. It is hard to put it in words but you will get it in your first rp. It is basically an idiot savant.
>>101600051It's not that bad here, I was able to get 4 3090s. As for what's going on outside my room, I don't care which country I'm in.
>>101599976Try this one?https://huggingface.co/BeaverAI/NeMoistral-12B-v1a-GGUF/tree/main
>>101600074What is the moistness meme, I don't get it.
>>101600096drummer is retarded
>>101600104hi sao
>>101600107hi undi
why the fuck does everyone use instruct models for RP if the base model is always better at it
I thought Mistral Nemo was supported in Koboldcpp now? I get >llama_model_load: error loading model: check_tensor_dims: tensor 'blk.0.attn_q.weight' has wrong shape; expected 5120, 5120, got 5120, 4096, 1, 1
>>101600136A base model shouldn't respond well to long multi-turn interaction since it's not trained for it.
>>101600133i wonned earlier did you see kek?
>>101600136I tried the base nemo, it was a mess and all over the place.
>>101600137are you using the last version?
>>101600136because base is fucking retarded and gives as much importance to the system prompt as i do to paying my taxes
>>101600163 koboldcpp-1.65
>>101600136because it's not
>>101600165>base>system promptanone...
>>101600163Yes, 1.71. I converted base Nemo with https://huggingface.co/spaces/ggml-org/gguf-my-repo so not sure if it's some fuckery related to that.
>>101597911what question are you using?
Am I retarded or why the FUCK does ST not have something as basic as a "save as"/"save copy as" option? I don't give a shit about chatting and use it exclusively for text adventures, so I like to load old stories sometimes and "branch off" from them by removing some of the more recent content and continuing off a previous state. But ST WILL NOT let me save those branches as new chats, it just overwrites my old ones.>inb4 checkpoints: Checkpoints only seem to work for the current message and only have 1 slot, e.g. you can make a checkpoint for message #6 or #7 but it still has a "parent chat" and it won't let you make multiple checkpoints if they end at the same message "number". Backing up/renaming the files manually is NOT a valid alternative.
>>101600165>>101600159>>101600154So I'm getting very conflicting answers here since higher up in the thread you have like 10 people shilling for nemo base being better at RP. I guess I have to compare for myself to be sure.
>>101600193see>>101559351
Give me the est erp 13B model. Now.
>>101600193That's what I did, I tried it myself and didn't get good results. Maybe it'd go better if someone posted settings. Someone told me to just use alpaca presets so that's what I did, with the 0.3 temp and other stuff neutral.
>>101600192why don't you just branch again from the branch?
>>101600209>13B>https://huggingface.co/Undi95/Utopia-13B
>>101600209sorry we are out of stock, please come again later
>>101600209I meant "best". Sorry, I'm holding a knife with my beak
>>101600193anyone who recommends that you use the base model is trolling or retarded
So if I just want to CPUmaxx, what's the best old Dell to do it, now that the T7910 hit the normiesphere and skyrocketed in price?
>>101600209est erp erd emo eon eck
>>101600192Try the timelines extension. Click on nodes to branch.
>>101600218Okay, downloading TheBloke/UtopiaXL-13B-GGUF as we speak
>>101600231I think this doesn't work with llamacpp yet, but I will download it later
>>101599987NTA but yes, fucking horses is one of the main FiMFiction themes.
>>101600238NTA, but that's really handy, gonna give it a go.
>>101600216Can't branch off from checkpoints since they seem to be considered a separate type of chat with a parent attached, and any attempt to make a checkpoint ("branch") of a checkpoint will just overwrite the other checkpoints for the parent.Even KoboldAI had a basic chat management system with a "save as" implemented, this is just ridiculous.>>101600238Oh, that looks pretty nice, I'll check that out. Thanks.
>>101600225Gigabyte MZ73-LM0
>>101597933Mythomax being recommended to new people as a good model
>>101597933StableLM-7B
>>101600301>$5000T-Thanks...I'll just take that money and buy the 3090s, actually...
>>101600319It is good tho
>>101600356*wink wink*
>>101600074>>101600104>>101600107>>101600133
>>101600218OK I fell for a meme, didn't I? This seems to be extremely brain damaged>>101598269kek
>>101600383>https://huggingface.co/matchaaaaa/Honey-Yuzu-13B>A bit of Chunky-Lemon-Cookie-11B here for its great flavor, with a dash of WestLake-7B-v2 there to add some depth.
>>101600405>WestLake-7B-v2penn-jillette-garbage.jpg
>>101600383use mistral nemo
>>101600405
>>101600445I wonder how incestmergers are handling nemo, now that their talents are completely unneeded?
>>101600238gotta love what this thing did with my mess of chats kek
>>101600467suddenly i feel a little less retarded today.
>>101600469>2 branches converge again.What the fuck. Is free will an illusion?
>>101600467lmao, lol even
>>101600497>>101600479the meme that keeps on memeing even after all the safeties put in place for him
>>101599920https://huggingface.co/Ada321/NemoPony
Mistral formatting. 0.15 or so Min P seems to completely eliminate anatomical mix-ups in more complicated scenarios. Remember that it is the base model.
>>101600585>base model.doa
Is there any other API frontend that allows dynamic model loading (unload when not used, load on API call) besides ollama? Ooba added --idle-timeout, but you can't set a default model, you have to fully load one on startup, and the reload doesn't even work with the OAI API.
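For reference, the unload-on-idle behavior being asked about boils down to a simple pattern. A minimal sketch with dummy load/unload functions standing in for actual model loading; this is not ollama's or ooba's actual implementation:

```python
import threading
import time

class IdleUnloader:
    """Loads a resource on first use and unloads it after an idle timeout."""

    def __init__(self, load_fn, unload_fn, idle_seconds):
        self.load_fn = load_fn
        self.unload_fn = unload_fn
        self.idle_seconds = idle_seconds
        self.resource = None
        self._timer = None
        self._lock = threading.Lock()

    def get(self):
        with self._lock:
            if self.resource is None:
                self.resource = self.load_fn()   # lazy load on first request
            if self._timer is not None:
                self._timer.cancel()             # any request resets the idle clock
            self._timer = threading.Timer(self.idle_seconds, self._unload)
            self._timer.daemon = True
            self._timer.start()
            return self.resource

    def _unload(self):
        with self._lock:
            if self.resource is not None:
                self.unload_fn(self.resource)    # free VRAM/RAM here
                self.resource = None

loader = IdleUnloader(load_fn=lambda: "model weights",
                      unload_fn=lambda r: None,
                      idle_seconds=0.2)
print(loader.get() is not None)  # True: loaded on demand
time.sleep(0.5)
print(loader.resource is None)   # True: unloaded after idle timeout
```

In a real server the load_fn would spawn or attach to an inference backend per requested model name, which is essentially what ollama's keep_alive does.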
>>101600601Its purpose is RP / creative writing. For assistant shit look elsewhere. Though I could always merge it back into instruct. Maybe later.
>>101596616This is the second in a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus. This model is fine-tuned on top of Qwen1.5 32B.
https://huggingface.co/anthracite-org/magnum-32b-v1
https://huggingface.co/anthracite-org/magnum-32b-v1-GGUF
>>101600438Okay, this is actually really good for 13B. Like, it's surprisingly good, holy shit.
>>101600623rock hard
>>101600623>top of Qwen1.5 32B.great best model!
>>101600624try the Magnum 12b finetune as well
>>101600623If I ran this at a low quant (minimum 3_m) would it AT LEAST be better than nemo magnum?
>>101600671Yes, of course.
>>101600623>qwen scored the lowest on the Freedom Index (tm)
>>101600689Ok sure you have less freedom, but the prose is better.
>>101600685
>>101600689>>101600658It wasn't trained on top of the Instruct model, it's trained on top of base just like mini-magnum-12b
>>101600689This is trained on base, so maybe just maybe, it's not so awful.
>>101600623slop
>>101600218undi, undi...picture this undi : i enter a restaurant, it has okay quality meals, nothing disgusting but also nothing to have a culinary orgasm to. Now what in the FUCK told you that mixing at random mid-tier dishes would give you something better? Who fucking told you in your feverish mind that mixing spaghetti and tomato sauce with a grilled tenderloin and mushrooms with curry chicken and rice would somehow result in a sum greater than its parts? What the FUCK made you think that somehow the reason why base models underperform is that they don't have enough interference coming from other models, other models that have been trained differently. But it doesn't matter to you : you have no creativity, you have no purpose, you have no vision, all you are is a failed idea : you are literally and unironically defined by a flawed course of action. You CANNOT fucking improve mid-tier models by merging them and expect to get good shit. NO, it does NOT matter how many erp datasets you add to the mix thinking it will somehow improve the abysmal capabilities of retarded models being merged into an even more retarded pile of slopped garbage. NO, it does NOT matter how many fucking loras you think you can cram into it before it starts coughing up blood like a tortured prey that's being abused for entertainment only by its predator, wishing for the sweet sweet release of death. NO, it does NOT matter how much you shill these models here, how much you provide links and baseless suggestions like "oh i heard X_noroshitchronosmaidbitch_faggotbloodybastardbitch_limarpozzed_designatedshittingmerge_q_2_K_m_l_g_b_troon_jart.GGUF is good" and acting like you are giving sensible advice. You could not create, you could never figure out something new, but you wanted the fame, you wanted people to download your models, you wanted to be hailed as the solution, you wanted to offer a solution. The solution is to fucking kill yourself. You are the most failed human being in existence.
>>101600467AAAAAAAAAAAAAAI DOWNLOADED HIS QUANTSAAAAAAAAAAAAAAAAA
>>101600156Here is your prize.
>>101600689fuck off retard
>>101600744>picture this undi : i enter a restaurant, it has okay quality meals, nothing disgusting but also nothing to have a culinary orgasm to. Now what in the FUCK told you that mixing at random mid-tier dishes would give you something better? Who fucking told you in your feverish mind that mixing spaghetti and tomato sauce with a grilled tenderloin and mushrooms with curry chicken and rice would somehow result in a sum greater than its parts?>What the FUCK made you think that somehow the reason why base models underperform is that they don't have enough interference coming from other models, other models that have been trained differently.>>97223983>For the record, I completely and unequivocally support Undi and his creation of new model hybrids, and think that everyone who attacks him is mindbroken incel scum, who may or may not be employed by OpenAI to do so.>everyone who attacks him is mindbroken incel scum
actual modern art in post form. This needs to be posted in every thread right underneath the AI recap.
>>101600757sao not pro wtf
>>101600665Fuhuhu how is this even possible? Do these Frenchmen finetune for degenerate ERP or what? I kneel, anon. Many buckets will be filled to your health.
>>101600623I keep forgetting that chinese 30B models exist. I wonder why.
>>101600749There were warnings>>100195457
>>101600623Why 1.5? Are you dumb or what?
>>101600744which model?
>>101600796because there's no qwen2 32b retard
>>101600796no qwen 2 32b
>>101600623also what settings do i use for this?
>>101600803>>101600802Why have the Chinese failed us?
>>101600802>>101600803There was qwen2 moe. Remember that? I don't.
>>101600744BAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEED
>>101600807ChatML, and I used Universal Light. >>101600796No qwen2 32b >>101600789They are just plain kino.
>>101600820Magnum on it "today" remember that? I do!>Working on it already. Should have Qwen-2 7B, Qwen-2 47B, and Qwen-1.5 32B done by the end of the day, if the they pass internal tests.>https://huggingface.co/anthracite-org/magnum-72b-v1/discussions/2#66713bb492412fd46410d399
lazy mf
>>101600839>if the they pass internal tests.looks like they didn't
>>101600844so cold and loveless. My hand remains the second warmest thing my dick has touched (the first being my GPU)
>>101600834yeah this doesn't seem as creative as nemo, could be close to it, and its understanding of different languages is pretty bad. It ranges from capable to "why did it randomly insert a question mark or an exclamation point in the middle of that word?" Plus, being 1t/s, speed kills it. Back to nemo magnum for me.
I have this unhealthy urge right now to replicate my ex in chatbot form. I sense a really dark path opening up in front of me.And a part of me wants to convince me that the best way to get over it is to go through it and come out the other side.
>>101600890>1t/s12GB Vramlet spotted, opinion discarded
>>101600921Can't wait until people start doing that and start saying the chatbot ex is better.
>>101600938>>101600938>>101600938
>>101600949>►Official /lmg/ card: https://files.catbox.moe/ylb0hv.png
>>101600949>Official /lmg/ card: https://files.catbox.moe/ylb0hv.pngSure.
>>101600968>>101600972The old one is deprecated. Also samefag phoneposter.
>>101600623>how to use faipl-1.0
Put the following in the readme:
license: other
license_name: faipl-1.0
license_link: https://freedevproject.org/faipl-1.0/
>>101599650kill yourself little buddy
my bad >>101601141 was for >>101601093
How does Mistral Large's context work? It says 32k in the config.
>>101599875>everyoneI think it's just one shill following Sao's modus operandi.