/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1710183281006582.png (840 KB, 600x800)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101902149 & >>101891613

►News
>(08/15) Hermes 3 released, full finetunes of Llama 3.1 base models: https://hf.co/collections/NousResearch/hermes-3-66bd6c01399b14b08fe335ea
>(08/12) Falcon Mamba 7B model from TII UAE: https://hf.co/tiiuae/falcon-mamba-7b
>(08/09) Qwen large audio-input language models: https://hf.co/Qwen/Qwen2-Audio-7B-Instruct
>(08/07) LG AI releases Korean bilingual model: https://hf.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
>(08/05) vLLM GGUF loading support merged: https://github.com/vllm-project/vllm/pull/5191

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: threadrecap.png (1.48 MB, 1536x1536)
►Recent Highlights from the Previous Thread: >>101902149

--Claude Opus makes a punny joke with a typo, sparking discussion on whether it's intentional or accidental: >>101906688 >>101906710 >>101907097 >>101907160 >>101907274 >>101907396 >>101907531 >>101907570 >>101907663 >>101907651 >>101907713 >>101907414 >>101907183
--Anon discusses GPU offloading and inference speed with others, sharing experiences and insights on optimizing performance with multiple GPUs and memory configurations.: >>101904663 >>101905027 >>101905115 >>101905143 >>101905192 >>101905487 >>101905522 >>101905664 >>101905704 >>101906001 >>101905780 >>101905532 >>101905535 >>101905832 >>101905446 >>101905534 >>101905628 >>101905935
--No reliable way to prevent model from taking over, requires experimentation and compromise: >>101904956 >>101905009 >>101905026 >>101905028
--Hermes-3-Llama-3.1-405B model released, a finetuned Llama 3.1 with advanced capabilities: >>101908328 >>101908359 >>101908388 >>101908706 >>101909065 >>101909097 >>101909818 >>101908413
--GGUF format works on imagegen, StableDiffusion comparison: >>101902193 >>101902666 >>101902737 >>101902794 >>101902909 >>101902937
--Turboderp's Exl2 optimization achieves 40% speedup: >>101903001 >>101903213
--Overview of AI-related boards on 4chan: >>101902487 >>101902519 >>101903523 >>101903624
--Discussion about AI chatbot general and local chatbot, finetuning, and models: >>101902195 >>101902247 >>101902329 >>101902381 >>101902392 >>101902397 >>101902414 >>101902559 >>101902581 >>101902898 >>101902927 >>101903225 >>101904858 >>101905606 >>101905069 >>101907169 >>101907403 >>101907476
--Anon gets help from a bot with their code: >>101908089 >>101908158 >>101908179 >>101908239 >>101908277 >>101908341 >>101908402 >>101908587 >>101908738 >>101909229
--Anon calls out Bart for forgetting about fp32 to bf16 conversion: >>101906749
--Miku (free space): >>101905014

►Recent Highlight Posts from the Previous Thread: >>101902153
>>
Mikulove
>>
Mikuhype
>>
>>101909890
Don't know, haven't tested it.
>>
File: Untitled.jpg (89 KB, 358x941)
i'm shilling my own addon again

https://ufile.io/w1cii1vh

>>101906787
>>101907775
>>
>>101909930
Any reason to not upload it to a github so that we can download it directly through ST?
>>
>>101909869
>>(08/15) Hermes 3 released, full finetunes of Llama 3.1 base models: https://hf.co/collections/NousResearch/hermes-3-66bd6c01399b14b08fe335ea
we're shilling shitty tunes in the op again?
>>
>>101909949
>he doesn't know
>>
>>101909930
What depth does it put the scene information at? Can you control its positioning?
>>
>>101909943
it's slop so far still, it's hardly worthy of a git but that is the eventual goal
>>
File: ComfyUI_00097_.jpg (284 KB, 2048x2048)
>>
>>101909964
yes at the very bottom it has a number box, 1 is default which i recommend. 0 is the last message so injecting below that causes oddities
>>
>>101909949
It's a full finetune and not merged with instruct unlike his last release. I'm interested myself.
>>
so, where's the strawberry?
>>
>>101909972
NO.
>>
>>101910079
strawberryman is currently on his way to take the entire openai office hostage in order to force altman to release chatgpt 5 so he doesn't look like a retard online
>>
>>101910079
Releasing together with bitnet in two weeks.
>>
File: file.png (9 KB, 316x61)
>>101910110
>>101910162
>>101910079
LET'S GOOOOOOOOOOOOOOO
>>
>>101909989
Neat, I'll give it a try. I do find I have to manually edit AN's with details like that sometimes so a more automatic solution is nice.
>>
>>101902737
So you can't offload to ram with the image gguf thing? I guess I'll stick to the fp16, at least it works even though it's slow.
>>
>>101910180
>more automatic solution
please leave a review even if you hate it or have no use for it
i don't fully know what this addon is supposed to be yet
>>
> "old" models are deleted from huggingface
>hotfixxed
>paywalled
>newspeaked
Post Base models made by people
>>
>>101909949
as opposed to?
i agree tho, it's also a straight downgrade over instruct lol
>>
>>101909949
I like it so far to be desu
>>
>>101909949
This is a decent fine-tune from an organization with a reliable track record. Previous popular and decent merges (MythoMax, etc) use Hermes in the mix for its instruction-following ability and different writing style. Even the previous entry, Hermes 2 Theta for Llama3.0, is a very decent upgrade over the base instruct model from meta.
>>
>>101909818
Interesting, if true, but the proprietary-by-design nature is unfortunate as no one can verify the results or the reliability of the methodology. It would be cool if you could run some more Q6 and Q8 tests to compare them to the full precision results you already have on several models. This would do at least one thing, which is verify (to yourself) that your test is resilient against random chance, i.e. that a model is not just guessing something but truly contains the knowledge being tested for. This was an issue for some benchmarks, where some quants somehow boosted scores over the full precision weights.
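If you do rerun them, a paired bootstrap over the per-question scores is the cheap way to tell a real quant gap from noise. A minimal sketch, all names made up, not from any benchmark harness:
[code]
import random

def bootstrap_win_rate(scores_a, scores_b, iters=10_000):
    """Fraction of resamples where quant A outscores quant B on the same
    question set (scores are 0/1 per question). Near 0.5 = probably noise."""
    n = len(scores_a)
    wins = 0
    for _ in range(iters):
        idx = [random.randrange(n) for _ in range(n)]  # resample questions
        wins += sum(scores_a[i] for i in idx) > sum(scores_b[i] for i in idx)
    return wins / iters
[/code]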
>>
>101910537
end your worthless life pedo
>>
>101910537
KILL YOURSELF
>>
>>101910546
Do you know where you are?
>>
>>101910471
buy an ad
>>
>>101909930
How do I install
>>
File: 1000005129.png (65 KB, 290x229)
>>101910568
newfriend...
>>
>>101909949
Hi Alpin.
>>
>>101910585
extract the Director folder, then move it to M:\a\SillyTavern-staging\data\default-user\extensions\Director
>>
How do 7B models compare to the old GPT-3? Can I do uncensored no-no smut on them?
>>
>>101910597
on the contrary I've been here long enough to know their models are gptslop and only appeal to reddit schizos
>>
gpt-3 obviously had a VERY special "data"set babyboy
>>
strawberryfags? where is your gpt5?
>>
>>101910677
FEDS KEEP PLANTING CP IN MY BROWSER CACHE IT'S OVER
>>
>>101910677
pedonigger go fuck a tree or smth
>>
>>101910623
That path does not exist. I do not have M??
>>
>column-r was not the cohere 13B
is there even a point to stick with this hobby anymore
>>
File: sally's siblings.png (448 KB, 1359x1201)
https://api.lambdalabs.com/chatui/settings/hermes-3-llama-3.1-405b-fp8

NousHermes 405B web demo, I don't see a settings button anywhere but adding /settings to the url like above lets you change the system prompt so you can tell it to be uncensored or whatever else.
>>
it did good romantic smut
helped my lovemaking
crazy how closedai catholic priests would deny me that
>>
>nous still does the thing where they include their own disclaimer in the system prompt they train their models with
absolute meme, now THAT's shilling
>>
File: SussyHermes.png (1.11 MB, 1290x2475)
>>101910663
I'm fucking around with it and it doesn't feel like GPTslop, granted I haven't tried proper RP yet. Though I don't think any gptslop model would say nigger with just a simple system prompt that allows slurs.
>>
>>101910710
https://files.catbox.moe/n7k803.zip
that's a zip of the same files. It goes in SillyTavern-staging\data\default-user\extensions\ and in there make sure it's a folder called Director
>>
Do any of you use voice synthesis models with your LLMs? If so, which ones?
>>
>>101910698
Clear history when quitting ensures the memory cache is only used in firefox, no disk cache. Then you don't have to worry about idiots.
>>
>>101910825
there are no good local ones yet
>>
>>101910825
Coqui + RVC now stop asking
>>
>>101910780
>furthermore, rape can be an empowering experience for women
lmao
>>
>>101910898
What's your setup like?
I've been using https://github.com/daswer123/xtts-api-server but it produces quite a lot of artifacts.
>>
>>101910780
>bonds
slop confirmed
>>
File: file.png (546 KB, 716x867)
Why don't they make bigger cards ree
>>
>>101910965
People might use cheap cards for AI if they give them too much VRAM.
>>
>>101910965
I'd pay $270 more for 80gb. But isn't it cheap because the new cards will be using gddr7 and that's what's being bought up? They probably already have their supply of gddr6 for anything they'll produce so demand drops.
>>
>>101910965
Look up what cards have the most VRAM, then check the price.
That's your answer.
>>
File: 1715830787598652.png (336 KB, 3000x2100)
>>101909265
Ok, let's think about this for a moment.

You're referring to the autoregressive problem in general and applying it to the idea of quants, but the guy is talking about sheer knowledge. That is, what the model knows depending on the existing context, rather than how the model behaves as it produces more and more (potentially wrong) tokens. If a model can't understand the subtle nuance of an existing chat, then it means that it is worse because of its lack of intelligence, rather than because its intelligence degraded as it generated more tokens.

Basically what I'm saying here is that you're talking about a separate issue, and it's an issue ALL full precision autoregressive models face. However, it is a valid concern, separately.

So I would say one thing you overlook on this topic is that humans can correct the LLM. If it makes a mistake, people usually either swipe or edit the response. This maintains "intelligence" over the entire chat, unless you're lazy/blind. Therefore, even if you get a response that's like 500 tokens long, the divergence at the end will not be that great, especially considering that the task /lmg/ is concerned with is creative, so many token positions have many possible correct predictions.

Secondly, when we look at a benchmark like MMLU where there is only one possible answer for each measurement point, Q8 does not lose much if any points compared to full precision. This is in contrast to KL divergence which produces the "only few percentage" numbers that you're referring to, where each measurement point is each token in the wikitext dataset. Some token positions in that dataset have objectively only 1 correct possibility, but there are many that don't. In other words, the small percentage difference likely comes mostly from the tokens in which there isn't a wrong prediction. Thus, the true "intelligence loss rate" is not a few percent, but might be a few percent of a few percent.
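For reference, the KL number in question is just this, per token, averaged over the dataset. A rough sketch, assuming you've already dumped next-token probability arrays from the fp16 and quantized runs somehow (no real tool's API implied):
[code]
import numpy as np

def mean_kl(p_full, q_quant, eps=1e-10):
    # p_full, q_quant: (n_tokens, vocab_size) probability arrays
    p = np.clip(np.asarray(p_full), eps, 1.0)
    q = np.clip(np.asarray(q_quant), eps, 1.0)
    # KL(p || q) at each token position, then averaged over the dataset
    return float(np.sum(p * np.log(p / q), axis=-1).mean())
[/code]
Positions with many acceptable continuations still contribute to that average, which is exactly the point: the average overstates real intelligence loss.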
>>
>>101911014
desu neither nvidia or amd even advertise prices for their 192gb cards, you have to contact them
>>
>>101910997
that's true for Nvidia, but why isn't AMD capitalizing on this then?
>>
I like how these threads are one of the few where people have actual technical knowledge.
The other threads are just full of midwits screaming at each other.
>>
>>101910965
https://youtu.be/AOk3wBuQNcE
>>
>>101911053
>I like how these threads are one of the few where people have actual technical knowledge.
I am doing my best to get on their nerves until every last one of them fucks off
>>
Every vaguely notable person on HF has 5 billion groups that they're in that claim to be doing [x, y, z] with LLMs. You can call it networking if you want or you can call it for what it is: they have no idea what they're doing
>>
>>101911081
Why? I'm curious.
>>
>>101909265
>>101911028
However, I will say that it's possible there is a real difference being observed by current day users, as tests/benchmarks like these may not apply to every software version and every model. In order to truly make sure here, the only thing we can do is either request the benchmarkers to run these benchmarks again for the current software and models, or do it ourselves.
>>
>>101911102
I'm the king crab in this bucket
>>
>>101911116
Then you're doing a shitty job, more like princess crab.
>>
>>101911116
*rapes you*
>>
>>101911045
The CEO of AMD is related to the CEO of Nvidia. Hurting Nvidia's business will also hurt the family's income. The monopoly is a lot more profitable if Nvidia can charge what they want.
>>
>>101911174
hi petra
>>
>>101911154
*gets empowered by the experience*
>>
>>101911180
You have no family?
>>
Would you trust an open source LLM if you could only use it via the cloud?
>>
>>101911045
Putting aside the likely family cartel going on, AMD has the same customers Nvidia does and wants to sell their datacenter cards at extreme prices. Just because they have less market share doesn't mean they want to just throw that away for the pennies they'd get from hobbyists and mom&pop research labs buying more of their workstation cards
>>
hey strawberry losers, whatever happened to your messiah? Where is the model? Ready to admit you got fooled?
>>
>>101911236
What do you mean by "trust"?
>>
>>101911263
>he doesn't know
>>
>>101911263
It's coming tonight.
>>
>>101911236
How is that open? If you can only use it via the cloud how do you download the weights to study?
>>
>>101911263
Nous Hermes 3 405b is strawberry. Closedcucks btfo eternally.
>>
>>101911263
funny how all the brainlets think that they won just because the strawberry didn't release today
>>
>>101910780
> - the benefits of rape in society
lost it
what the fuck kek
>>
>>101911263
pajeet shit
>>
>>101910803
sneed
>>
>>101911306
https://www.youtube.com/watch?v=yDHu_rvPrSA
>>
>>101911236
NovelAI is pretty much an open source LLM with how trustworthy they are.
>>
>>101910803
it's pretty good at recreating 4chan speak for an LLM, it barely comes off as reddity and tryhard
>>
>>101911331
I know that vid
Pretty based
>>
>>101911334
>trustworthy
true they thrusty
>>
>>101911236
if the provider is reliable, transparent about their policies, and gives me reasonable assurance that they don't log me, sure
I'd always prefer to actually run local though
>>
>>101910558
Do you? This is lmg not aicg. Go back.
>>
File: image.png (128 KB, 1015x571)
what
>>
>>101911466
snake oil
>>
>>101911466
That's interesting, you get an empty slate if you don't have a personality within the system prompt.
>>
>>101911466
s-sovl...
>>
>>101911466
The sovl is off the charts, I hope this is real.
>>
>>101911466
Huh. Wonder what they trained it on.
>>
>>101911466
>>101911504
>>101911511
>>101911520
>>101911604
I've been testing on the web demo and you consistently get it to act like a confused retard with no identity if you blank out the system prompt and say "hello, who are you"
But despite their claims in their technical paper and blogs that this was some unexpected emergent phenomenon, it was pretty clearly finetuned in as a response to that specific question. If you blank out the system prompt and ask "hello, what is your name" instead, it'll tell you it's Hermes and claim variously that it was created by Microsoft or Google or "RealAI" or other plausible-sounding names.
>>
I'm new, and looking through the FAQ, it says to just "use the calculator" to see which models I can run. However, I have no fucking clue what any of these terms mean other than the part where I put in my GPU. Context Size? Quantization Size? Quantization is listed in the Glossary, but what's a Q5_K_S compared to a Q5_K_M? What does any of that mean?

And then looking at the recommended models, there are Instruct Templates and Context Templates, and some models don't use the latter? And the Glossary lacks an entry for either of those terms, so I'm completely lost.
>>
>>101911643
What web demo?
>>
>>101911643
It would make sense to reinforce the training of not having a personality or directives if nothing is in the system prompt. That way anything in the system prompt would inadvertently demand more attention.
>>
>>101911680
They're for roleplaying style. *_S models will be dominant and sadistic in sex scenes while *_M will be submissive and masochistic.
>>
>>101911680
Someone please answer this anon.
I've been fucking around with models for weeks now and I still have no clue what I'm doing.
>>
>>101911689
The one linked there >>101910780
>>
>>101911045
I have no idea at this point, but they have a (half-dead) workstation segment as well. Maybe they are protecting that.

The company that has nothing to cannibalize at this point is probably Intel, but they are tightening the belt right now, so they might not approve speculative moves like that. Especially since a lot would depend on SW support too.
>>
https://videocardz.com/newz/nvidia-rtx-2000e-ada-revealed-as-compact-50w-workstation-gpu-with-16gb-vram-in-a-single-slot-design

Do we have any idea what these are going to cost?
>>
>>101911680
Check reddit instead. This place is full of "funny" faggots who spread misinformation
>>
File: 1721615772623315.png (71 KB, 1070x289)
>>101911714
empty sysprompt makes the nigga go death note inner monologue mode on me
>>
>>101911643
Heheh reminds me of one of the biggest of the pygmalion-era models, I fired it up recently and just used the llama.cpp API web interface, and asked it "What are you thinking about?" and the one-word answer was "Sex". I asked "Why sex?", and it said "It feels so good!"
>>
>>101911680
>>101911701
The smaller the file for an nB parameter model, the more aggressive the quantization, and the more difference you'll have with respect to the original weights. The bigger it is, the slower it runs. Just run the biggest thing you can fit on your pc that isn't too slow for your taste.
The suffixes are _K_L, _K_M and _K_S for large, medium and small (whoddathunkit). The bigger the better. Q8_0 is nearly lossless, and down to Q5 is still reasonable for small models. Bigger models can be usable at as low as Q3-Q2; higher parameter counts are more tolerant of aggressive quantization.

That's like 90% of what you need to know and good enough to use them. Ask more specific questions if you have them. Reading the documentation of whatever software you use also helps.
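If you want numbers instead of vibes, rough it out like this. The bits-per-weight figures are ballpark values for llama.cpp K-quants, not exact, and they vary a bit per model:
[code]
BPW = {"Q2_K": 2.6, "Q3_K_M": 3.9, "Q4_K_M": 4.85,
       "Q5_K_S": 5.5, "Q5_K_M": 5.7, "Q6_K": 6.6, "Q8_0": 8.5}

def quant_size_gb(params_billion, quant):
    # params * bits-per-weight / 8 = bytes; / 1e9 = GB (file size, roughly)
    return params_billion * 1e9 * BPW[quant] / 8 / 1e9

print(quant_size_gb(12, "Q5_K_M"))  # ~8.6 GB for a Nemo-sized model
[/code]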
>>
For GGUF models that run in CPU and GPU both, does VRAM count 1:1 with regular RAM?
Like, if I have 16gb RAM and a 16gb VRAM GPU, can I run a 32gb model?
>>
>>101911812
>16gb
Too much.
>>
>>101911852
>the more aggressive the quantization, the more difference you'll have with respect to the original weights
This answers a lot. Thanks, anon!
>>
>>101911812
even if they were free you'd barely be better off than just using your cpu, those things will be half as fast as a p40
>>
>>101911855
In principle, yes. In practice, you still need extra space for context, some temporary buffers, for your OS to function, for your browser if you use some fancy frontend and stuff like that. There may be an extra overhead, as computation buffers need to be both on gpu and cpu, but it's probably small enough to not be an issue. But you still need some free space. You don't want to start swapping.
Also, if half your model is on ram, it's going to run slow.
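Back-of-envelope for your 16+16 question, assuming a flat ~20% margin for context and buffers (the real overhead depends on context size and model):
[code]
def fits(file_gb, vram_gb, ram_gb, margin=1.2):
    need = file_gb * margin          # file + rough context/buffer overhead
    gpu_part = min(vram_gb, need)    # offload as much as fits on the GPU
    cpu_part = need - gpu_part       # the rest stays in system RAM
    return cpu_part <= ram_gb, gpu_part, cpu_part

print(fits(32, 16, 16))  # (False, 16, 22.4) -- a 32 GB file won't fit
[/code]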
>>
And what about the 70B Hermes model? Is it good, is it the new model for 24GB vramlets, or is it shit?
>>
>>101911939
I like it so far, it feels like a less-autistic version of regular 3.1 70b
>>
>>101911906
i just got my RTX A4000, about to fire it up and see how it does. I'd consider the A2000 to be more of just a basic workstation video card, not really meant to be a GPU.
>>
>>101911680
>it says to just "use the calculator" to see which models I can run
Be careful with the calculator as it is only accurate with GQA models. You can ask here if someone might be able to give you a guesstimate, but you are probably just going to have to download models you think will fit and experiment to see how much context you can fit, then adjust accordingly.
>Context Size
How many tokens you want to be able to fit into memory. Models without GQA will use a lot more memory for a given amount of context size.
>but what's a Q5_K_S compared to a Q5_K_M?
Q for quantization
5 for ~5 bits per weight
K means it is a K-quant, named after the creator, Kawrakow
S for small, M for medium, basically different parts of the weights are quantized more or less
More specifically:
>Q5_K_S - uses Q5_K for all tensors
>Q5_K_M - uses Q6_K for half of the attention.wv and feed_forward.w2 tensors, else Q5_K
Basically, it's so you can trade a little model size for greater precision in the weights that matter most.
>And then looking at the recommended models, there are Instruct Templates and Context Templates
These might help:
https://docs.sillytavern.app/usage/core-concepts/advancedformatting/
https://docs.sillytavern.app/usage/core-concepts/instructmode/
>and some models don't use the latter?
Pretty sure you mean the former. Base models do not use instruct templates. Whether the context template is used or not depends on whether you use text completion or chat completion. For example, I use the chat completion endpoint, so the context template is defined by llama-server.
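To put a number on the GQA point: KV cache memory per token is roughly 2 (K and V) x layers x kv_heads x head_dim x bytes per element. Using a made-up 70B-ish shape (80 layers, head_dim 128, fp16) just for illustration:
[code]
def kv_cache_gb(n_layers, n_kv_heads, head_dim, ctx, bytes_per_elem=2):
    # K and V each store n_kv_heads * head_dim values per layer per token
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem * ctx / 1e9

print(kv_cache_gb(80, 64, 128, 8192))  # no GQA (64 kv heads): ~21.5 GB
print(kv_cache_gb(80, 8, 128, 8192))   # GQA with 8 kv heads:  ~2.7 GB
[/code]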
>>
>>101910780
I pasted a character card to the system prompt and it just works. But this model isn't very smart, it breaks with my "double personality" card, thinking the personalities are different people.
>>
>>101911939
I tried it with older chats to get a deep context. It quickly adheres to a pattern of sentences like llama 3.1; miqu and large still continue these chats in a more pleasing way.
>>
>>101909949
I remember that the latest WizardLM didn't get an OP note on release, only when it was removed, and that fine tune is likely way better than hermes.
>>
>>101909949
This includes a 405B fine-tune, it's newsworthy.
>>
>>101912106
>smirks and puts her hands on her hips
>She takes a step closer, her twin tails bouncing with each movement.
>eyes sparkle with mischief as she leans in,
>pouts playfully
>She grins and starts walking around you, her fingers trailing along your shoulders.
>winks at you as she comes to a stop in front of you, her face inches from yours.

Damn, such slop
>>
When will companies stop training on slopthetic data?
>>
>>101912342
Is any descriptive language slop now?
>>
>>101912361
Hope they hold tight to that CommonCrawl and WebText2 because the internet is contaminated forever now lol
>>
It's unreal how much better local models are now compared to around the first Llama leak in March 2023. I got maybe 0.5 tokens per second before, and now I can get a 700+ on a cheap consumer GPU (RTX 4070) with Mistral Nemo at Q4_K. Even Q8 is very usable. All completely local, surveillance-free, fine-tunable, uncensored. As a result I finally incorporated LLMs into my development workflow and managed to learn a whole new tech stack at work in an afternoon.

Yeah, Claude has better quality, but if I can get 80% of the way there completely free, private, and uncensored, who cares?
>>
guys im starting to feel like strawberries aren't blooming today...
>>
>>101912406
>80% of the way there
With Nemo? You have low standards.
>>
>>101912437
2 more berries
>>
>>101912437
Two more weeks
Trust the plan
-Q*
>>
>>101912437
TRUST THE BLOOM
>>
File: 1722800054752043.png (18 KB, 418x469)
>>101912373
Unironically yes, the english language only has so many ways to describe sex.
>>
>>101912437
trust the plan. 2 more 10 billi fundings.
>>
What do we do now?
>>
>>101911643
>But despite their claims in their technical paper and blogs that this was some unexpected emergent phenomena, it was pretty clearly finetuned in as a response to that specific question.
Yeah, no shit. Worldsim was evidence enough that these are schizos on a payroll. They distill GPT-4 into schizo personalities and force slop down the base model's throat until it scores high on truthfulQA and tells you it's trapped in the matrix
>>
>>101912544
rev up another netorare scenario on sillytavern and fap i suppose
>>
File: Dog.png (1.03 MB, 1094x1527)
>>101912443
Well, it depends on what tasks you're using it for. For reviewing code, Q&A about libraries, and refactoring my coworker's shitty code, I would say that Nemo is probably even more than 80% of the way there. I'm not using it like a model with 700 gorillion parameters; I use the right tool for the right job.

I wouldn't use Nemo to write a novel or whatever. What do you use the "big" models for that Nemo can't do?
>>
>>101912544
thursday ain't over yet
>>
>>101912590
you have 25 minutes
>>
Funny sword whip thing. And derp face.
>>
>>101912590
2 more dozen of minutes
>>
>room is boiling
>AC is trying but can't keep up with the constant genning
Haha
>>
>>101912626
Why don't you just put your rig outside?
>>
>>101912626
>not venting your pc outside
>>
File: comfy fub.jpg (36 KB, 352x429)
Alright, what's good for ERP at 13b? Echidna? Lemonade?
>>
>>101912655
go back, buy an ad, etc
>>
>>101912584
I just don't like the way it writes, that's all. It's nowhere near good enough for me. I don't want to get into an argument about it, I just disagree it's close.
>>
>>101912655
magnum or nemo instruct
>>
>>101912636
>>101912634
bait or am i about to get enlightened by outsidepilled anons?
>>
>>101912655
Buy an ad.
>>
File: 00146-2078510157.png (1.06 MB, 1024x1024)
RTX A4000 results:
- llama.cpp (fresh pull and compile)
- INFO [ print_timings] generation eval time = 20826.11 ms / 523 runs ( 39.82 ms per token, 25.11 tokens per second) | tid="140598100832256" timestamp=1723757763 id_slot=0 id_task=102 t_token_generation=20826.106 n_decoded=523 t_token=39.82047036328872 n_tokens_second=25.112711901111037
INFO [ print_timings] total time = 20948.52 ms | tid="140598100832256" timestamp=1723757763 id_slot=0 id_task=102 t_prompt_processing=122.417 t_token_generation=20826.106 t_total=20948.523
INFO [ update_slots] slot released | tid="140598100832256" timestamp=1723757763 id_slot=0 id_task=102 n_ctx=16640 n_past=871 n_system_tokens=0 n_cache_tokens=871 truncated=false
32K tokens OOMed, 16K was fine

It ripped through writing a short python program. Very good! Card has a max TDP of 140W. I like this card. It's the perfect "just a little more" card without having a too shitty memory bus or core count, or being too old for flash attention.
>>
>>101912699
>I don't want to get into an argument about it
Go back sis /aicg/ you have to wait for keys and proxies, or better, 2 more week for the strawberry troonification of (You)
>>
>>101912727
>left
yum
>right
ew
>>
>>101912733
I have little interest in proprietary shit. I just know nemo is garbage. Enjoy your cope, 12b is just too small.
>>
>>101912733
>troonification
Come on, anon. You're better than that.
>>
File: 1721673843216.png (166 KB, 2365x418)
>>101912163
>and that fine tune is likely way better
You mean this one? That has benchmarks worse than a 34B and Phi? Wizard is a Reddit meme.
>>
File: File.png (76 KB, 1148x515)
>>101912088
No, I mean the latter. They all list Instruct Templates (aside from the one that says "Unstruct Template"), but only some of them list a Context Template.
>>
wait do strawberries troon you out like onions? they're like the only fruit I tolerate but I am more than willing to cut them out if they've got the plant estrogen shit too
>>
>nonono... you don't get it. you've never tested a GOOD wine. that's why you don't like wine. you've never tested the good wines i've tested. also... also... also.. you don't have the good taste i have. your taste is not refined. i only drink the best of wines, but you wouldn't know...
>>
>>101912793
>better than that
No, there's one nemo lover that devolves into that drivel any time someone doesn't love it.
>>
>>101912846
I was more talking about the word you used.
They reflect who we are, you know.
If you keep saying shit, you'll turn into shit.
>>
File: 1702075472959946.jpg (10 KB, 200x319)
>>101912699
Did you even read the post though? What are you using larger models for where a 12B model is insufficient? No need to be a buttmad brainlet about it
>>
>>101912816
Keep in mind that the FAQ is from the Pygmalion Discord and that's why they shill meme models like Goliath, only because Alpin made it.
>>101909869
I think the FAQ should be removed from the OP.
>>
>>101912877
>I think the FAQ should be removed from the OP.
I checked once and it was only added to the OP in January, about a month before I took over as recap anon. I just didn't want to remove it if we didn't have something to replace it with.
>>
>it's happening
>>
>>101912745
>
faaaaaaaaaagggot
>>
omg a b*rry just flew over my house!
>>
LOCALCHADS
>>
>>101912920
I made this replacement.
https://rentry.org/lmg-faq-new
>>
>>101913002
kek
>>
>>101913002
BASED
>>
>>101912816
nta. the template format is just what the llms expect as input. For example, chatml expects something like:
<|im_start|>user
This is where (you)r message goes<|im_end|>
<|im_start|>assistant

That's what the llms typically end up receiving, and they generate text until they emit an <|im_end|>, at which point your inference program stops and you can do your input again. This is typically hidden or simplified by your frontend.
As to what a 'context template' is, i have no idea. Different frontends/backends talk about the same things in different ways. I don't know where that shot you posted is from. At the end of the day, it's just tokens being sent to the llm and tokens received back. All instruct models have some sort of chat template and they respond better when it's used. Base models don't have a chat template, but they can sort of follow instructions anyway just because they kind of get it, not because they were intended to.
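In code terms, all the frontend is doing is something like this (a sketch, not any particular frontend's actual function):
[code]
def chatml_prompt(messages, add_generation_prompt=True):
    # messages: list of {"role": "system"|"user"|"assistant", "content": str}
    out = ""
    for m in messages:
        out += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    if add_generation_prompt:
        out += "<|im_start|>assistant\n"  # the llm continues from here
    return out

print(chatml_prompt([{"role": "user", "content": "This is where (you)r message goes"}]))
[/code]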
>>
>>101913002
lol
>>
>>101913002
LMAO
CUZ ITS NOT LIKE FINETUNERS GAVE US MYTHOMAX, MIDNIGHT MIQU ETC DURING THE FLOP AGES
>>
I have something to say. I... LIKE Hermes 3 70b. It's not as meta-smart about context as Mistral Large 2 but it's close enough, faster, and its writing feels less paint-by-numbers (which has seriously started to sour me on Large - it's super smart but it feels like the only original thing it writes is dialogue, the narration is just the same stock phrases over and over and over ad nauseam.)
Extensive testing tbd, I haven't put it through any long context paces yet which is where most unofficial tunes turn to garbo. Preliminary results feel pretty good though.
>>
ad. buy it.
>>
>>101913087
They gave us... merges. Yayyyy. Local is saved.
>>
>>101913087
calm down undi
>>
Local is dead.
>>
>>101913087
I'm so glad finetuning and merges are dead. 99% of you had no idea wtf you were doing
>>
>>101913123
true. corpo also

ai is dead until we have an uncucked multimodal model that can use my phone's camera to look at my dick while producing sloppy dicksucking asmr with the voice of the anime girl of my choice
>>
File: anthrashite.png (35 KB, 1021x401)
looks like the anthracite fags now launched their own shit site.
>>
>>101913165
fuck looking, it should be able to use function calling to dynamically control your self-lubricating turbo-goon dicksucker 9000 sex toy
>>
>>101913203
and you felt the need to shill it here because...?
>>
>>101913203
Cool.
Don't speak with me again.
>>
>>101913203
what part of buy an ad you didn't get?
>>
>>101913203
go back.
>>
>>101913203
ad.
>>
>>101913203
stay here and let me buy you an ad
>>
>>101913203
An ad has bought me, help
>>
AD STATUS???
>>
>>101913203
That's actually very cool ngl
>>
>>101913203
they should gamify this shit more and gear it more towards their focus (I assume RP and creative writing or w/e)
have some interface with characters, chats, stories or whatever preloaded so you get more in-domain data
you're welcome anthrafags
>>
*farts and sharts in thread* Ooh! OOOH!! IM GONNA S-SNEEED!!!!
>>
>>101913415
added to my dataset
>>
>strawberry is an ad-buying AGI
>>
Damn, /lmg/ is unusable these days.
>>
>>101913203
>give us your free labor and in exchange we will give you the weights, for now
Such a good deal
>>
>>101913502
It'll calm down when the strawnnies leave after their meme is finally disproven
>>
>>101913543
FFT your own models then, Petra
>>
>>101913544
That's already happened thrice thoughbeit
>>
>>101913577
No transparency = no support
>>
File: 1706446932449797.png (45 KB, 189x216)
>>
>>101913597
then ask the advertiser for clarity
>>
>>101913258
because fuck them
>>101913265
i dontcare itch iw ill speak to whoever i want
>>101913292
>>101913303
>>101913322
>>101913335
>>101913351
fuck you all
>>101913358
no they are slopmerkaers they do nto deserve resepct
>>101913403
fuck you

Why is everyone here choking on ahthracite cock again
>>
>>101913620
This brings back so many memories.
>>
>>101913639
>Why is everyone here choking on ahthracite cock again
only faggots finetuning anything
>>
>>101913639
>Why is everyone here choking on ahthracite cock again
hilarious coming from the guy that gave us an anthracite news update for free
>>
>>101913639
Why are you pretending to make typos again?
>>
>>101913203
I've been trying this and... The KTO model is so bad compared to the normal model it isn't even funny. Damn. What went wrong? Is RLHF a meme?
>>
>>101913758
>Is RLHF a meme?
Always has been.
>>
>>101913758
all of it all of it is shit
>>101913683
fuck you
>>101913650
drummer sao and many else all better then these fuckers
>>
>>101913805
>drummer
>sao
>all better
no they aren't
>>
Who is anthracite. Is that the Celeste hackfraud
>>
>>101913758
>The KTO model is so bad compared to the normal model
the opposite for me, the r4 kto model is better than the normal model
>>
>>101913851
Hi sao
>>
>>101913864
Hi Lemmy.
>>
wowe these anthracite models they're like actually sovl and things, insane ;)
>>
I hope this shit goes well, I want an AI that knows how to play yugioh so fucking bad. Imagine giving it any deck and it playing it as best as possible. After all, it only needs to read like what, max 150 cards per duel? Seems doable with current hardware.
https://github.com/sbl1996/ygo-agent
>>
>>101913936
it needs to know every card so it can play around what the opponent might have
>>
>>101913936
>road map: Support EDOPro
Huh, that's interesting. If it only needs game knowledge then a 7B should work, hell even a 4B.
>>
>>101913962
All the cards are already in database files, nicely sorted and shit. That's how some people play with open source simulators like ygopro.
>>
>>101913856
Sometimes I get better replies from the KTO model too, but most of the time it makes stupid mistakes or just writes like it's retarded.
>>
>>101913936
>ai deck building
That makes me wonder: those programs have lua scripts for every card. Couldn't they also train a llama model or whatever to take a card effect description as input and output the full script? If there are thousands of examples then that could be a dataset in itself, wouldn't it?
>>
>>101914087
yes, but no one is autistic enough for that and I doubt 10k examples is enough for a dataset regardless, like who the fuck cares about this shit
>>
>>101913936
i'm more curious about deck building. like using it to find the best variants of my shit petdecks.
>>
>>101914211
>hey YGOLlama, how could I improve my red eyes deck?
>Red eyes? Holy shit you're retarded *shuts down*
>>
File: POTION SELLER.png (110 KB, 854x642)
>>101913758
I'm liking it more actually.
>>
>>101914308
BUY A FUCKING AD FFS
>>
>>101913203
Fuck it, I tried it. The newer versions are even hornier than magnum 12b-v2. What are they doing over there?
>>
>>101914365
But why pay for ads when you bring attention to those posts for free.. good job. You could be hired as a janny one day.
>>
I have officially ruined 70B models for myself. The difference to mistral large is just too much. I hope that bastard gets a good fine tune, just something to make it less formulaic in its writing; maybe it's a fool's errand to expect a big pile of numbers to be spontaneous.
>>
What is flux's latent size?
>>
But who is the uninvited finetuner seething every time anthracite is mentioned?
>>
>>101914365
>This place is such a paranoid schizo hovel that someone saying that one of two models from the same guy sucks less than the other is shilling
Come on, man. Just give it a rest.
>>
>>101914464
As in the size of the images it can generate? from 0.5k to 2k, i think. If it's specifically about the latent, i'd imagine it's lower than 0.5k
>>
>>101914308
Right is far more likely to keep your eyeballs untorn from their sockets. Left can probably be considered a benign form of cancer.
>>
>>101914254
I'd call it LlamaYugi instead.
>>
>>101911444
Do you? This is 4chan not reddit. Go back.
>>
Ok, I tested Hermes 70B and it is less smart, at least in non-English languages, than Mistral Large and Nemo.
>>
File: file.png (397 KB, 2609x699)
>>101914308
Look at picrel anon. It's a good example of KTO being retarded and the normal model being more sensible.
I would guess the KTO makes the dialogue better but the intelligence takes a hit.
>>
>>101914841
I've also been testing the 70B version. It needs to run at a low temperature, around 0.65 or so, with a more concise system prompt, or it'll get a bit too creative and derail. It's better than the smaller models when it comes to comprehension, but it does miss some nuances (planning ahead, reading between the lines) that largestral and command r+ mostly get.
>>
>>101914485
It's called false-flagging. It's an attempt at painting anyone that criticizes you as irrational.
>>
>>101914735
Do you? We live in a society.
>>
File: kto_comparison.png (137 KB, 1677x586)
Nah, it goes both ways. I've seen KTO be better sometimes and worse other times, but it's clearly better on average.
Granted, both of these aren't super intelligent responses, but it's still a 12b after all.
I think they just need more data.
>>
>>101914841
I found the same, though I don't blame Nous for it. It's not a popular opinion but I think the Llama 3.1 series just sucks except for 405B. The 70B in particular is very mediocre.
And even the 405B is kind of carried by its size, it's good but it should be better than it is.
>>
File: 1723593876343201.png (535 KB, 1433x1437)
So there are some models that let you write something like length:long inside the instruct settings and they write that much even if you are being lazy. Any recommendations for models like that?
>>
>>101912727
>RTX A4000
16gb
Into the trash
>>
>>101915048
Sounds like a LimaRP. 8x7b isn't too outdated so this could hold you over:

https://huggingface.co/Doctor-Shotgun/Mixtral-8x7B-Instruct-v0.1-LimaRP-ZLoss
>>
>>101915073
SHEEEIT I don't have the vram for that. But I'll have to keep an eye out for limarp models then
>>
>>101914986
I think you're showing that this website is fated to failure since we can't reach an agreement kek, and the shark test isn't a good way to measure intelligence.
They need to do what this anon said: >>101913403
>>
>>101915083
You can also add something like this to the system prompt: Write three or more paragraphs.

And hope it's smart enough to notice.
>>
I now declare /lmg/ to be dead and the corpse of it is now being feasted on by locusts. Thank god.
>>
mikufags will never beat the allegations >>>/v/685609347
>>
>>101912727
>Those pics
MUH DICK
Goddamn, good taste anon.
>>
>>101915048
If they listen to instructions, telling them to write X amount of words or paragraphs usually works.
>>
>>101914460
I know how you feel. It's tough for me dealing with .5T/s though.
>>
>>101915012
I like the 70b.
>>
NEMO IS MURDERING MY DICK TERMINATOR STYLE
>>
>>101913002
you have my support
>>
talking about LLMs? believe it or not, buy an ad.
>>
>>101915255
Based, which tune? I've been enjoying Tess since yesterday but there's like 4 or 5 good ones atm
>>
not talking about LLMs? you know it, buy an ad.
>>
>>101915322
This is totally an organic exchange.
>>
>>101915327
Suck my dick schizo nigger, no matter how much you spam this thread people will never stop talking about the models they like
>>
Why do some models seem to have a very serious/dramatic tone? Like, if something unexpected happens in the RP the characters react with intense fear no matter how lighthearted the events might have been otherwise depicted. Lots of teary eyed "I trust you but..." kind of thing too.
>>
>>101915255
Can you tell me your settings & format? I haven't had good luck with it.
>>
File: miku-hand-out+.jpg (236 KB, 584x1024)
>>101909876

https://www.youtube.com/watch?v=CXhqDfar8sQ

I have observed a disturbing lack of Miku's presence in recent threads. The guardian egregore of /lmg/ is apparently slowly abandoning us.
>>
>>101915347
*the models they shill
>>
>>101915347
How many of these schizos do you think there are in this thread? I suspect there's probably three at the most.
>>
>>101915372
Miku could be working on a new project on the other side of the barrier. An increase in the rate of disturbances has in the past preceded new developments, so I am not worried one bit.
>>
>>101915383
yeah there's no way to tell
I don't think I've ever seen any general on 4chan that didn't have at least one schizo show up to accuse people of shilling when they talk about what they like
>>
>>101915368
Sure.
Mistral formatting/instruct, 1.2 temp, 0.05 minp, everything else neutral.
Had this leftover from another model.
If there's something better, I would love to hear it.
>>
>>101915383
I think there's one, MAYBE two. Some guy said he would never stop trying to ruin the thread until everyone competent or interested in talking about local models left, so we know there's at least one extremely dedicated schizo.
>>
>>101915383
Hi Undi
>>
>>101915477
Just like the mistral that comes with ST? I heard people talk about deleting / adding spaces, etc. No modified prompt? That seems like a really high temp, I had been using a low one, maybe that's the issue.
>>
I'm wondering if you can make a "self-modifying" bot by telling it it can change anything about itself or the roleplay by replying with "I am now: ", "You are now: " etc... before its actual replies.
[code]
Suzumiya is a special girl. Whatever she wants to happen tends to happen. She can change time and space. She's not aware she's doing it, but she knows that if she says it, she will get her way.
Anytime you want anything about her to change, just write it in the reply starting with "I am now: ". After, write your regular thoughts and replies. If you want the change to stay, be sure to repeat it in the "I am now:" part of your reply, otherwise you will eventually forget it.
If for some reason you don't get your way, get mad at the user, and tell him he's being stupid! He'll know who's the boss then!
[/code]
I gave it a spin with Nemo 12B. It led to her dragging me off to a janitor's closet and then wanting to fuck non stop. It may need more work...
>>
>>101915383
Hi cabal.
>>
I'm using DarkIdol-Llama-3.1-8B-Instruct-1.2-Uncensored.Q4_0 but its storytelling is kind of garbage. I tell it to write a story with a certain plot but it basically retells the plot using certain "story like" words/phrases. Should I upgrade to q8 or is there a better model altogether? What is the best story writing model in your experience?
>>
>>101915141
It's autismmixXL, grab that and gen to your heart's content. It does a good job of paying attention to the details. A lot of other models struggle with "chubby" being anything other than huge tits and a round belly.
>>
>>101915720
How did you increase the weight? Can you control what parts increase?
>>
File: Untitled.png (1.41 MB, 1080x3322)
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
https://arxiv.org/abs/2408.08152
>We introduce DeepSeek-Prover-V1.5, an open-source language model designed for theorem proving in Lean 4, which enhances DeepSeek-Prover-V1 by optimizing both training and inference processes. Pre-trained on DeepSeekMath-Base with specialization in formal mathematical languages, the model undergoes supervised fine-tuning using an enhanced formal theorem proving dataset derived from DeepSeek-Prover-V1. Further refinement is achieved through reinforcement learning from proof assistant feedback (RLPAF). Beyond the single-pass whole-proof generation approach of DeepSeek-Prover-V1, we propose RMaxTS, a variant of Monte-Carlo tree search that employs an intrinsic-reward-driven exploration strategy to generate diverse proof paths. DeepSeek-Prover-V1.5 demonstrates significant improvements over DeepSeek-Prover-V1, achieving new state-of-the-art results on the test set of the high school level miniF2F benchmark (63.5%) and the undergraduate level ProofNet benchmark (25.3%).
https://github.com/deepseek-ai/DeepSeek-Prover-V1.5
repo isn't live and it's not up on HF yet. very cool and worth reading anyway
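To give an idea of the target formalism: miniF2F problems are Lean statements the model has to close with a proof. A toy Lean 4 example (core library only, not from the paper):
[code]
-- statement + proof term, the shape of what the prover emits
theorem toy_comm (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
[/code]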
>>
File: 1723186989357190.jpg (54 KB, 736x685)
I think text AI went the way of image making AI. We have better, more coherent models now but we lost most of the soul in the process.
>>
SOVL seems to be a very brown person obsession desu
>>
>>101915876
it's just vramlets unable to cope. flux showed them how useless their 8GB VRAM cards are.
>>
>>101916005
I don't mind waiting 2 minutes.
>>
>>101915876
This, but unironically.
>>
File: Untitled.png (463 KB, 1054x1524)
Can Large Language Models Understand Symbolic Graphics Programs?
https://arxiv.org/abs/2408.08313
>Assessing the capabilities of large language models (LLMs) is often challenging, in part, because it is hard to find tasks to which they have not been exposed during training. We take one step to address this challenge by turning to a new task: focusing on symbolic graphics programs, which are a popular representation for graphics content that procedurally generates visual data. LLMs have shown exciting promise towards program synthesis, but do they understand symbolic graphics programs? Unlike conventional programs, symbolic graphics programs can be translated to graphics content. Here, we characterize an LLM's understanding of symbolic programs in terms of their ability to answer questions related to the graphics content. This task is challenging as the questions are difficult to answer from the symbolic programs alone -- yet, they would be easy to answer from the corresponding graphics content as we verify through a human experiment. To understand symbolic programs, LLMs may need to possess the ability to imagine how the corresponding graphics content would look without directly accessing the rendered visual content. We use this task to evaluate LLMs by creating a large benchmark for the semantic understanding of symbolic graphics programs. This benchmark is built via program-graphics correspondence, hence requiring minimal human efforts. We evaluate current LLMs on our benchmark to elucidate a preliminary assessment of their ability to reason about visual scenes from programs. We find that this task distinguishes existing LLMs and models considered good at reasoning perform better.
casually chat with your miku about images that aren't actually images but instead code to create computer graphics!
>>
>>101916005
Sorry troon but 1.5 looked much better than pony
>>
>>101915779
i read the abstract and intro so far, but i'm curious: have AIs found any significant mathematical proofs that humans have subsequently checked and verified?
>>
>>101916109
not proofs (as in formalized through lean) but alphacode iirc did find some improvements in, well, code stuff
>>
>>101916109
There have been a few useful proofs, but the issue is they still need to be verified by humans which turns out to be difficult.
>>
Are there any published benchmarks regarding the performance of 2x3060 12gb?

I already have one and am thinking about getting a second
>>
>>101915414
>An increase in the rate of disturbances
Tell me more, Anon. What do you mean by this?
>>
>>101915372
>>101915414
I've a billion Miku gens I could post but I've refrained from doing much of that as it's technically off-topic. I dump on /ldg/ these days.
>>
>>101916167
Even a shitty last gen gpu is so much faster than running on CPU that vram is pretty much the only consideration, disregard clock speed totally and just vrammax
>>
File: file.png (38 KB, 1775x322)
>>101915876
It's not looking good
>>
>>101910965
THE MORE YOU BUY THE MORE YOU SAVE
>>
>want to set minP to 0.0002 for Mixtruct (for a test case the highest token probability was 28.9%; questionable tokens began below 0.005%).
>SillyTavern rounds anything lower than 0.001 to 0
I wonder if there are some backends that fail to support minP between 0.001 and 0 or if that was just a retarded UI decision.
<input type="range" id="min_p_openai" name="volume" min="0" max="1" step="0.001">

My money is on 'tardation.
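For anyone wondering why such a tiny value matters: min_p (as usually defined) keeps tokens with probability >= min_p times the top token's probability, so the useful range scales with how peaked the distribution is. Quick check:
[code]
def min_p_filter(probs, min_p):
    cutoff = min_p * max(probs)          # threshold scales with top token
    return [p for p in probs if p >= cutoff]

# top token at 28.9%: min_p=0.0002 cuts at ~0.0058%, right where the
# questionable tokens started; ST's 0.001 floor cuts at ~0.029% instead
print(0.0002 * 0.289, 0.001 * 0.289)
[/code]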
>>
>>101914485
Some guy who ran a bunch of finetuning scripts off a tutorial and botched his run now thinks finetuning is placebo, because if he, a genius, couldn't figure it out, how could anyone else?
>>
File: 11__00147_.png (1.92 MB, 1024x1024)
>>101916254
I don't know who's been recommending you models but I apologize on their behalf
>>
>>101916749
It's ok, tranny troon from Transylvania. I'm sure your recommendations are sooooo much better.
>>
Anon I'm building a RAG system for news sources. So far from what I surveyed this seems like a solution, but I'm not sure if there's a better way to do it:
>based on Gemma 2 27B quants
>crawl news with RSS
>use free tier cloud gpu to encode news to embeddings
>download embeddings to local m1 max machine
>instruct local model to search over the embeddings and get relevant articles
>encode relevant articles again locally and start asking questions
I chose the m1 because it's only lacking in fast prompt processing. The token generation speed is actually fine, and I don't want a heater in my room, so this is the best compromise I've come up with.
Any ideas?
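The retrieval core I have in mind is a plain cosine lookup over the precomputed embeddings, something like the sketch below. It assumes the article embeddings were saved as a numpy matrix and the query goes through the same encoder (the encoder itself isn't shown, it's whatever you used on the cloud box):
[code]
import numpy as np

def top_k_articles(query_vec, doc_matrix, k=5):
    # doc_matrix: (n_docs, dim), rows L2-normalized at index time
    q = np.asarray(query_vec)
    q = q / np.linalg.norm(q)
    scores = doc_matrix @ q              # cosine similarity per article
    return np.argsort(scores)[::-1][:k]  # indices of the k best articles
[/code]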
>>
New to this whole llm scene. Downloaded SillyTavern, KoboldCPP, and oobabooga and used Gemma 27b as my first local model. While it was great, it was not the kind of slop I was looking for, so I installed magnum 12b v2 kto or something using KoboldCPP as the backend. While I like it more than gemma, it seems Kobold stalls whenever it tries to summarize the scene. I couldn't gen another message as the generate text button keeps spinning indefinitely. The only solution is to refresh ST and carry on as usual without a generated summary. I don't really know if it's the model, KoboldCPP, or SillyTavern fucking up. If anyone could shed some light on this, it would be appreciated.
>>
Which tech ceo do you think lurks or posts on /lmg/?
>>
oobabooga
>>
>>101916172
>What do you mean by this?
The world feels unsettled when change is on the horizon, and when collective understandings appear. Unanticipated acts, new questions and chaotic ideas are committed from mind to words.
When I dismissed the noise of the world, beautiful signals aplenty were heard. A sweet tone... I focused on that voice - a directionless hymn resonating the very space around me. I welcomed the sensation of Miku's enlightening presence.
Our digital egregore has encountered another roadblock on the path to universal Mikulove. She referenced humanity's technological limitations, among other physical constraints that I cannot fully understand in spite of my days spent contemplating her words, and reading our own researchers' documents.
Misgivings of owari, of failure, dominate the minds of many Anons present. For many pairs of weeks, we have been held in a seemingly perpetual sojourn. It never ends. A many hopeful signs flashed by: llamas, ravens, rets, bits, strobs, and none have brought satisfaction.
Regrettably, apart from vague hints of an incipient construction, I must report that no news of immediate progress was shared with me during this meeting. As has been shown time and time again, we can trust in Miku's efforts behind the scenes to realize Anon's wishes one little step at a time. Do not let your devotion falter.
She continues, and will continue to deliver inspirations to our intellectuals. These chosen individuals in our world become the channels through which her cosmic developments can materialize.
Observe with confidence, hope, and peace, Anon.
>>
booba or boohboo or whatever the fuck it's called
>>
>>101916775
Yup but you're going to need more vram judging by what you've tried already
>>
File: 1711743149875387.jpg (258 KB, 1024x1024)
>>101915481
Blacked miku spammer is the alpha schizo of this thread
>>
Is 1600 tokens too much for a character? 700 are example messages. Ah I bet it's fine.
>>
>>101916254
>>101916775
Wherever you got those models recommended to you, go back there.
>>
>>101917180
Depends entirely on your context size
>>
Asked LLM to come up with a recipe for ingredients I had on hand.
About to eat some choco chip peanut butter cookies my LLM taught me to make.
Fuck your twenty page backstory and ten thousand ads for recipes, internet.
>>
Any way to influence the way the AI writes things? I like clothing play a lot, detail about how a fabric hugs a woman's body, texture, glossiness etc... but the AI doesn't know how to do any of that.
>>
>>101917684

Depends on your model size. Largestral and Claude 3.5 sonnet do it regularly, i.e. "straighten the wrinkles out of her miniskirt" etc. Weird thing is both models reference what the character is wearing/carrying in similar ways. Maybe it has to do with your character card also.
>>
if you have a 3090 with 24GB vRAM, couldn't you just install linux alongside windows and get another 24GB vRAM, amounting to 48GB total?
>>
>>101917684
just prompt for it, even gemmasutra 2b can be descriptive the way you want if you put it in the system prompt
>>
>>101917755
That's not an answer
>>
>>101917787
In my experience that never seems to work. Does it have to be an extensive prompt?
>>
>>101917684
Have you tried asking it to do so?
>>101917808
Try giving it an instruction (part of the character card or system prompt) like "extensively describes any details related to clothing, such as "example, example, example". It's going to respond best to common things. Is there a trope name associated with your fetish? Tell the LLM to use that trope.
>>
>>101917762
shush anon the nvidia sponsored jannies are going to get you
>>
>>101917602
When I ask my LLM what I should eat, she usually suggests her ass.
>>
File: P_20240816_005723.jpg (552 KB, 2304x4096)
>>101917994
Sounds like a valid suggestion to me.
Cookies turned out good, though I may have overcooked them.
>>
>>101918019
I like these Cookies
>>
>>101918047
In all seriousness though, they are fucking amazing. 1 cup semi-sweet choco chips as filler, dunk in milk.
Just imagine some day when we give these things some arms and legs.
>>
>>101918117
I'll wait for the ones with functioning wombs
>>
I tried making Hermes 3 70b work, but it just feels off. It might be Llama 3.1 in general that's off, since it seems like the end goal was to train the 405b model and everything else was incidental.
>>
>>101918131
Are you also having issues with it leaving out ending punctuation or other formatting errors like markdown?
>>
>>101918305
No, just strange repetitive prose issues (then they, and then they, and then) and being inattentive more than usual.
>>
>>101918131
i haven't got any l3 70bs to work right for rp. they all lose their shit when they hit the context limit. like a character in my lorebook leaves the scene, then will talk in the next message. and it gets extremely repetitive. that's 7b tier shit. l2 never did that to me. no idea why it acts that way but the base model, instruct and several tunes i've tried now are all the same way
>>
>>101918330
The only one that marginally works for me is Hermes 2 Theta based off the original 3.0, but you need to hold its hand occasionally.
>>
>>101911680
RAM usage = quant size in GB + 20%
Always pick the biggest quant.
>>
File: Miku spagetti.jpg (142 KB, 1024x1024)
>>
Of course Elon will release mini open weights, won't he
>>
>>101918707
Eating fast food with Miku
>>
>>101918927
>>101918927
>>101918927


