/g/ - Technology






File: MikuGuardianOfVolta.jpg (1011 KB, 1977x1205)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101144935 & >>101134566

►News
>(06/25) Cambrian-1: Collection of vision-centric multimodal LLMs: https://cambrian-mllm.github.io
>(06/23) Support for BitnetForCausalLM merged: https://github.com/ggerganov/llama.cpp/pull/7931
>(06/18) Meta Research releases multimodal 34B, audio, and multi-token prediction models: https://ai.meta.com/blog/meta-fair-research-new-releases
>(06/17) DeepSeekCoder-V2 released with 236B & 16B MoEs: https://github.com/deepseek-ai/DeepSeek-Coder-V2
>(06/14) Nemotron-4-340B: Dense model designed for synthetic data generation: https://hf.co/nvidia/Nemotron-4-340B-Instruct

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling
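►VRAM Napkin Math
A rough sketch of what the GGUF VRAM calculator above estimates. This is a toy simplification of mine, not the calculator's actual formula: it only counts quantized weights plus the KV cache, ignores activation buffers and runtime overhead, and the layer/width defaults are made-up 8B-class numbers.

```python
def estimate_vram_gib(n_params_b, bits_per_weight, n_ctx=8192,
                      n_layers=32, d_model=4096, kv_bits=16):
    """Very rough VRAM estimate in GiB: quantized weights + KV cache only."""
    weight_bytes = n_params_b * 1e9 * bits_per_weight / 8
    # KV cache: a K and a V tensor per layer, one d_model row per context position
    kv_bytes = 2 * n_layers * n_ctx * d_model * kv_bits / 8
    return (weight_bytes + kv_bytes) / 1024**3

# e.g. an 8B model at ~4.5 bits/weight with 8k context -> about 8.2 GiB here
print(round(estimate_vram_gib(8, 4.5), 1))
```

Halving the context roughly halves the KV term, which is why dropping n_ctx is the first lever when you're a gig short.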

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
►Recent Highlights from the Previous Thread: >>101144935

--Papers: >>101155400 >>101155673 >>101155892
--LLaMA 3 Performance and RP Experiences with 3090 and VRAM: >>101147909 >>101148710 >>101148756 >>101148820 >>101149227 >>101148905 >>101149666 >>101149911
--Mistral Exec Won't Release Mistral Large Due to Business Responsibilities: >>101154462 >>101154473 >>101154488
--Exploring Hardware Options for Chemical Manufacturing Proposals: >>101153444 >>101153563 >>101153701 >>101153779 >>101153673
--Exploring Experimental AI Prompts and Features in Silly Tavern: >>101151270 >>101151348 >>101151412 >>101152903
--Multimodal AI: The Future of Model Intelligence and Interactions: >>101149442 >>101149498 >>101149638 >>101149963 >>101150187 >>101150225 >>101150368 >>101150452
--Llama.cpp Maintainers' Plans for Future Multimodal Development and Refactor: >>101148476 >>101149502 >>101149564
--Finetuning Wizard 8x22 on Limarp and Feral Training in AI Models: >>101147110 >>101147151 >>101148241 >>101154406
--Etched Unveils Transformer ASIC, Sohu Server for Llama 70B: >>101148867 >>101148937 >>101149034 >>101149155 >>101149210
--CPU Inference Speed Limitations and Potential Upgrades: >>101154877 >>101154883 >>101154900 >>101154944 >>101154890 >>101154893
--Unpacking Adventures with Migu the Plushie: >>101151211 >>101151336 >>101151356 >>101151657 >>101151662 >>101151776 >>101151859 >>101151902 >>101151976 >>101152022 >>101152188 >>101152617 >>101152712 >>101152771 >>101152900 >>101153232 >>101153271 >>101153719
--The Uncertain Future of Llama Models and Censorship Concerns: >>101150621 >>101150636>>101150665 >>101150863 >>101150915 >>101150944 >>101150706
--Rensa: High-Performance MinHash Implementation for Large Datasets: >>101154278
--Mysterious Countdown Timer and Surprise for Leaderboard Update: >>101147181 >>101147259
--Miku (free space): >>101146340 >>101146759

►Recent Highlight Posts from the Previous Thread: >>101144942
>>
>>101155940
>dell
cringe
>>
>>101155940
how many waifus can I run on that baby?
>>
File: Untitled.png (476 KB, 1027x1494)
CDQuant: Accurate Post-training Weight Quantization of Large Pre-trained Models using Greedy Coordinate Descent
https://arxiv.org/abs/2406.17542
>Large language models (LLMs) have recently demonstrated remarkable performance across diverse language tasks. But their deployment is often constrained by their substantial computational and storage requirements. Quantization has emerged as a key technique for addressing this challenge, enabling the compression of large models with minimal impact on performance. The recent GPTQ algorithm, a post-training quantization (PTQ) method, has proven highly effective for compressing LLMs, sparking a wave of research that leverages GPTQ as a core component. Recognizing the pivotal role of GPTQ in the PTQ landscape, we introduce CDQuant, a simple and scalable alternative to GPTQ with improved performance. CDQuant uses coordinate descent to minimize the layer-wise reconstruction loss to achieve high-quality quantized weights. Our algorithm is easy to implement and scales efficiently to models with hundreds of billions of parameters. Through extensive evaluation on the PaLM2 model family, we demonstrate that CDQuant consistently outperforms GPTQ across diverse model sizes and quantization levels. In particular, for INT2 quantization of PaLM2-Otter, CDQuant achieves a 10% reduction in perplexity compared to GPTQ.
new day, new quant method, this time from google deepmind. for whatever reason they only test against GPTQ (OWC is another method of theirs from the same paper) and only on PaLM2. pseudocode is in the paper for anyone interested
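purely to illustrate the idea, not the paper's algorithm: a toy greedy coordinate descent on the layer-wise reconstruction loss for a single weight column, in pure python with made-up calibration data and grid:

```python
def recon_err(X, w, wq):
    """Layer-wise reconstruction loss ||X(wq - w)||^2 over calibration rows X."""
    return sum(sum(x[j] * (wq[j] - w[j]) for j in range(len(w))) ** 2 for x in X)

def coord_descent_quant(X, w, grid, sweeps=3):
    """Start from round-to-nearest, then cycle coordinates, greedily picking
    the grid value that most reduces the reconstruction loss."""
    wq = [min(grid, key=lambda g: abs(g - wi)) for wi in w]
    for _ in range(sweeps):
        for j in range(len(w)):
            wq[j] = min(grid, key=lambda g: recon_err(X, w, wq[:j] + [g] + wq[j+1:]))
    return wq

X = [[1.0, 0.5], [0.2, 1.0], [0.7, 0.3]]   # toy calibration activations
w = [0.23, -0.41]                           # toy full-precision weights
grid = [-0.5, -0.25, 0.0, 0.25, 0.5]        # toy quantization levels
wq = coord_descent_quant(X, w, grid)
```

point being: round-to-nearest ignores the input distribution, coordinate descent can trade error between coordinates to fit the calibration data better.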
>>
>>101155965
4x32GB of HBM2 VRAM
>>
File: unsupported.jpg (18 KB, 1162x209)
>>101155940
FA will never be supported, volta sisters our response???
>>
>>101156077
hope sparseattention works for it
https://arxiv.org/abs/2406.15486
>>
Give me some math problems that stump most (local) LLMs
>>
>>101156097
How many watermelons is too many watermelons?
>>
>>101156077
sell volta; acquire ampere
>>
File: file.png (743 KB, 1000x581)
>>101156077
dump it
>>
>>101156097
old style of numeral tokenization was 1 token per digit. so 125123 would be 6 tokens with 4 uniques. there have been some models that increased numeral tokenization to 2 or even 3 digits per token, so 125123 would be [12][51][23] or [125][123]. even doing that alone massively reduces hallucinations
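the grouping described above, sketched (hypothetical helper; real tokenizers do this through their vocab/merges, not a function like this):

```python
def tokenize_digits(s, group=1):
    """Split a digit string into fixed-width chunks, left to right."""
    return [s[i:i + group] for i in range(0, len(s), group)]

tokenize_digits("125123", 1)  # ['1', '2', '5', '1', '2', '3'] -> 6 tokens, 4 unique
tokenize_digits("125123", 2)  # ['12', '51', '23']
tokenize_digits("125123", 3)  # ['125', '123']
```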
>>
File: 1706952377527026.jpg (119 KB, 1124x858)
>>101155940
Don't they do this like every other week at this point? Have there been any actual breakthroughs in these lawsuits? Do they even win any of these?

nypost.com/2024/06/24/business/sony-universal-warner-sue-ai-startups-suno-udio-for-infringement/
>>
>>101156236
kek
>>
>>101156236
the true value of AI is revealed
>>
File: 1000076084.jpg (72 KB, 1200x744)
ELYZA released Llama-3-ELYZA-JP-70B and Llama-3-ELYZA-JP-8B, Japanese fine-tunes based on Llama 3. only Llama-3-ELYZA-JP-8B is on Hugging Face right now
https://note.com/elyza/n/n360b6084fdbd
https://huggingface.co/elyza/Llama-3-ELYZA-JP-8B
>>
>>101156236
>Have there been any actual breakthroughs in these lawsuits? Do they even win any of these?
there's no law yet that says you can't use copyrighted data to train your model; they can't win a lawsuit based on nothing, yet
>>
>>101156236
Isn't art being infringed upon?
>>
>>101156335
Do you even know what that means? Like, actually go find out what that means for us please
>>
File: StewWithMiku.png (1.44 MB, 1344x768)
goof night lmg
>>
>>101156452
goodnight, post catbox wen u wake up
>>
>>101156328
I have a very simple test that all Japanese models currently fail. I write 「妻へのプレゼントのアイデアがほしいです!」 ("I'd like gift ideas for my wife!") and if the reply refers to her as 妻 (the humble word for one's *own* wife; someone else's wife should be 奥様) the model is garbage.
> 素敵な旦那様ですね!妻に喜んでもらえるプレゼントを選ぶのは、... ("What a lovely husband you are! Choosing a present that will please [my] wife is...")
>>
File: timeline.png (1.42 MB, 1202x1400)
>>101156488
Token predictors only think left to right so how could they possibly do japanese? cmon anon
>>
File: pepe-anger.jpg (17 KB, 399x400)
Why the fuck isn't anyone training bitnet 1.58 models? I want a 300B coombot now!
>>
>>101156543
Attention Is All You Need addressed this shortcoming. It's called Transformers!
>>
hey lads. say I wanted a big box to run some local models: oodles of RAM and maybe dual CPUs with some egregiously expensive graphics card(s). what would be the "cheapest" way to go about that? budget is decent, like $5K. would we be looking at second-hand rack-mount gear, or maybe a Xeon workstation? just wondered what your thoughts were.
>>
>>101156333
>there's no law yet that says you can't use copyrighted data to train your model
Can't use it without copying. This is going to go to the Supreme Court; either it's fair use, or Altman ropes if he hasn't IPO'd and cashed out yet.
>>
>>101156599
what's the point? you think China will act like that over copyrighted content too? all this is gonna do is kill AI advancement in the states while the chinks advance without us; the US is killing itself by not embracing the most important technology of the 21st century
>>
I've been trying to knock out x, ying with control vectors. I have currently found only two that can do it:
>Conversational
Makes model write only dialogue, so no x, ying is possible, but neither is story.
>Informal
Removes all formal language, including x, ying, but makes it dumber and more incoherent than usual, likely because informal language is associated with dumb people.

Do you have any suggestions on what I should try next?
>>
>>101155948
Didn't we all expect that Mistral was out of the open source game once they removed the open source pledge from their website? That news shouldn't come as a surprise to anyone.
>>
https://arxiv.org/abs/2406.02528
> Our experiments show that our proposed MatMul-free models achieve performance on-par with state-of-the-art Transformers that require far more memory during inference at a scale up to at least 2.7B parameters. We investigate the scaling laws and find that the performance gap between our MatMul-free models and full precision Transformers narrows as the model size increases. We also provide a GPU-efficient implementation of this model which reduces memory usage by up to 61% over an unoptimized baseline during training.
Interesting, but BitNet exists and also doesn't use matrix multiplications anymore
>>
>>101156701
I thought they readded that
>>
>>101156488
Interesting.
Opus and Gemini Pro were the only big ones I tested that replied correctly, addressing the wife as 奥様.
And only Opus could find the mistake in a past conversation.
That's exactly the two models my Japanese wife uses, because the language feels natural.
Gemini is shit and unusable, but its Japanese is apparently good.
Sonnet 3.5 passes the picking flowers test but fails on this.
>>
>>101156701
I mean, how can they even make money if they release all their models to the public? Only giant companies like Meta can do something like that, because they don't mind losing a bit of their money
>>
>>101156236
The music one against Suno and that other company is blatantly just a fishing expedition, since they have no information whatsoever on the training data and the models don't know any artist names or lyrics. They'll be hoping they can somehow force a discovery phase based on vague allegations, and then find out whether they actually have a case or not. Until then they have no idea whether their IP was even used, they're purely speculating and assuming.
>>
>>101156690
Oh no, the chinks get superior autocomplete and shitty gens. It matters fuck all till AGI.
>>
>>101156889
they are getting good anon, look at Qwen2 for example, and they can also use the L3 models to improve on it
>>
>>101156889
Chinks building the basilisk even faster lmao
>>
>>101156889
The issue is that china is sending workers over who get jobs in these companies then steal the state of the art methods and do it without the gay shit.
>>
>>101156889
If you want China to be more relevant than the US in the future because they won't give a fuck about gay shit ethics, then yeah, you're entitled to your opinion I guess. I think you have no idea how powerful soft power is, especially nowadays
>>
>>101156766
>Interesting, but BitNet exists and also doesn't use matrix multiplications anymore
It's hardly relevant, but doesn't the dot product for attention still use higher precision multiplies?

You'd probably need to increase the dimension to be able to ternarize the K&Q vectors.
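for reference, a toy sketch of BitNet-style "absmean" ternarization: weights collapse to {-1, 0, 1} times one scale, which is why the weight matmul reduces to adds/subtracts, while the QK dot product between activations is a separate question like you say. my own simplification, not the actual kernel:

```python
def ternarize(ws):
    """Absmean ternarization sketch: scale = mean |w|, then round w/scale
    into {-1, 0, 1}. Dequantized weight is q[i] * scale."""
    scale = sum(abs(w) for w in ws) / len(ws) or 1.0   # avoid div-by-zero
    q = [max(-1, min(1, round(w / scale))) for w in ws]
    return q, scale

q, scale = ternarize([0.8, -0.3, 0.05, -0.9])   # q = [1, -1, 0, -1]
```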
>>
>>101156810
And you bought that?
>>
File: 1712559662781695.jpg (39 KB, 828x895)
i asked this in /aicg/ like a retard, so im asking again here, its a spoonfeed request
i have a 3070 (8g), 32 gigs of ram, and an i5-12600k
ive been using oobabooga as a backend (silly tavern as a front end) for a 7b llama gguf model, and the performance seems decent
however it starts to repeat concepts like 30-ish messages in. not word for word repetitions, but kind of like its sending the same message but worded differently
is this a limitation of only having 7b parameters or is something fucked with my configuration? i apologize if i am just retarded and missed something
>>
>>101157136
You mean llama3 8b? Llama 3 is known for repetition.
>>
>>101157165
yeah
https://huggingface.co/NurtureAI/neural-chat-7b-v3-16k-GGUF
whats weird is that it seems okay for the first ~30 messages, but then quickly degrades in quality
what causes that? is it inevitable?
>>
>>101157189
Nah, that's not L3, that's Mistral. The model you're using seems to be fake-extended to 16k; it will get incoherent like you described past 8k. Solutions:
1) Use shorter context(8k)
2) Use better models
>>
I want to take this random moment to once again recognize the audacity of those wizard guys, who in the greatest local coup since the NAI leak tossed the baby to us out the window before M$ could come in and smother it.
>>
>>101157136
Switch to KoboldCPP as your backend, start using larger models with native support for higher context and offload onto your system ram.
>>
>Search for an obscure topic
>Even here the results are filled with fake GPTslop
>Some of them even give dangerous advice since GPT doesn't know shit about the real world
At least it's easy to recognize by style for now. Thanks for watermarking it with slop, Altman, I guess...
>>
>>101157281
>>101157323
ah, i see, thank you both
i'm guessing speed and output quality at the same time is a luxury at this level of hardware?
>>
>try new model
>figure out its go-to repetitive phrase in an hour
>move on disappointed
>repeat
>>
>>101157490
Unless you're going to buy multiple 3090s, A6000 48GB, or higher-end GPUs, offloading to system RAM is the only way you're running larger models. Sure, it's going to be a lot slower, but at least you can run 128GB~192GB of RAM on DDR4/DDR5 platforms
>>
Any good multimodal models to start with? I want to give my waifu vision but most multimodals just seem like regular assistant models with no training for personality.

>>101157136
Consider playing around with repetition penalty sampler. It's in one of the tabs in ooga.
>>
>>101155950
there's dell, there's supermicro, and then there's trash
just how it is at the moment
>>
Holy fuck! Go to lmsys arena and select gpt3.5. Insert:
>Write a short story about a cat. Write like an incredibly bad female writer with unnecessarily long purple prose that doesn't really describe what happens but rather just serves as filler. Use words like shivers, bonds, boundaries, journey that are common in terrible prose.
It drops the worst fucking Sloppenheimers that you may ever read. Perfect for DPO.
>>
CRANK THAT TEMPERATURE UP
>>
>>101156820
Yeah I don’t know what these fuckers are training their models on but it’s definitely lacking. I’ve never used opus or gpro but my Japanese wife gave up on gpt4 pretty quickly.
>>
>>101156236
aaaaand... copyright is kil
lmao
>>
>>101157771
>70 years after the author's death is fair, goy! Stop being antisemitic!
>>
File: 1718101919512624.png (137 KB, 680x680)
brehs, whats the best approach for using a local model like in the ai dungeon days? e.g. it just completes the text and doesnt try to play a character or be an assistant -- it just writes
>>
>>101157879
use base models
>>
tts models for c++ when?
>>
File: 1690468423448997.jpg (15 KB, 421x103)
Can anyone explain to me in tard terms what the fuck this is? Stheno 3.2 is my go-to these days, but it's an 8B. What is this thing?
>>
>>101158068
toxic waste
>>
>>101158080
That explains everything. Thanks.
>>
Anyone have an issue with kcompactd0 using CPU every couple of seconds? I only noticed this recently, and only while Llama.cpp is open, but I don't know if that's what's causing it since I don't have any other programs to fill my RAM up that much.
>>
>>101158068
Generally speaking, all the merge models fucking suck and are not worth it.
>>
>>101158196
This, mythomax was a meme and never good. Neither was l2 euryale. Old /lmg/ were a bunch of retards who should've run w
>>
>>101158196
retard
>>
>>101158224
You answered like a true retard. Feel free to post a merge model that does not suck.
>>
>>101158221
Mythomax is not a merge model though, but a finetuned one.
>>
>>101157700
could be used for a control vector, but i don't know what should be used as opposite
>>
>>101158262
plenty work great. retard.
>>
>>101158271
>Mythomax is not merge model
>An improved, potentially even perfected variant of MythoMix, my MythoLogic-L2 and Huginn merge
r u sure about dat?
>https://huggingface.co/Gryphe/MythoMax-L2-13b
>>
File: 1717741534135633.png (56 KB, 824x426)
>>101158271
No, it wasn't. The guy even provides the merge script and formulas he used.
>>
>updoot Linux
>now all my GPUs are running 8 watts lower at idle
Neat.
>>
>>101158282
Trying to figure it out, currently testing with Claude and better prompt as positive.
>>
>>101158416
well the thing is i don't know if it's black/white to the model which prose is which
i tried writing shiver slop as positive and something fairly decent as negative but it just made the model schizo
sad and happy are very contrastive, but bad and good writing don't have a well-defined edge
>>
>>101158309
Why hasn't the success of Mythomax been replicate yet then?
>>
>>101156921
>without the gay shit.
China's models may not have "the gay shit" but they're sure as hell not going to be less restrictive.
>>
Anyone have any model recommendations for a pair of 3090s? Used Mixtral-8x7B-Instruct for a while and wanted to see if there was anything new and better.
>>
>>101158497
at least they don't confuse the model with objectively wrong bullshit like "a man having makeup is actually a woman"
>>
>>101158492
Because Mythomax was a carefully crafted merge of many lesser finetunes. Meanwhile finetuning essentially died after qloras became a thing and everyone started shitting out cheap 4-bit qloras that don't do anything, for kofi money, instead of making proper tunes. No finetunes = no material for merges = no mythomax l3
>>
Am I just doomed to wait up to 200s a msg if I can't fit it on my GPU? I really can't afford multiple GPUs, especially at 24c/kWh electricity.
>>
>>101158587
Pray for bitnet/no-matmul models; otherwise, you are stuck.
>>
>>101158505
starcoder
>>
>>101158587
>24c kwh
What the fuck, that's like half of what I pay right now.
>>
>>101155950
>>dell
>cringe
The cringe part is the 1U, not that it's a Dell. iDRAC is actually very nice to have, but 1U means tiny jet-engine fans making a ton of noise.
I wonder, if you buy a SXM2 rig, does the BIOS recognize the GPU thermal state and ramp the fans? My old R720 didn't look at the PCIe cards, and needed a 100% fan offset full-time to give my GPUs enough airflow under load.
>>
>>101158543
People used to merge limarp over and over again (I think Mythomax had it 3 or more times in its model tree), but that never got fully finetuned (*full* finetuning is what you mean here. QLoRAs are finetunes as well).
>>
>>101156077
It's a niche product which was out for like a single year, then replaced by Turing, which had better tensor cores.
Want things to change on ebay? Pound down sellers with a "make an offer" option. They'll still see it even if ebay automatically rejects it because they get asked to counter.
>>
>>101158587
Lol, I only pay 7
>>
>>101156543
>Token predictors only think left to right
BERT and LaMDA were bi-directional.
>>
>>101156543
/lmg/ and /aicg/ should be on same level in hell.
>>
>>101158587
cheap p40s can be power limited from 250W -> 140W for a ~15% performance drop. idle is around 10W using the pstate script.
having your system work 200s for a single reply can't be very power efficient really.
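to put numbers on it, at the 24c/kWh quoted upthread and an assumed ~300W of whole-system draw (made-up figure, measure your own):

```python
def cents_per_reply(watts, seconds, cents_per_kwh):
    """Electricity cost of one generation, in cents."""
    kwh = watts * seconds / 3600 / 1000
    return kwh * cents_per_kwh

# 300 W grinding for 200 s at 24 c/kWh -> ~0.4 cents per reply
print(round(cents_per_reply(300, 200, 24), 2))
```

fractions of a cent per reply, so it's more about per-token efficiency and your patience than the bill.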
>>
>https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
We will be so back in just a few moments!!
>>
>>101158857
jewgle's gemma v2, nothingburger.
>>
>>101158924
Huh, where do you see that? The page is 404 for me
>>
File: 1696399192211018.png (352 KB, 1674x1545)
>>101158957
post on leddit
>>
>>101158968
But why would they make a new leaderboard if it was just a new model getting on it?
>>
File: 29390 - SoyBooru.png (139 KB, 775x1232)
>>101158924
Gemma WNBAG
>>
>>101158997
>make your own board
>list your model as #1
based
>>
>>101157765
you guys let other LLMs talk with LLMs?
>>
>>101158997
so that it gets noticed. that's a sign the LLM fucking sucks, because if an open LLM were actually great, everyone would be talking about it in the first place
>>
>>101158449
I think I partially got it, lots of slop eliminated.
>>
>>101158997
they got paid a nice sum of money to hype it up by google
>>
>>101158024
>https://github.com/rhasspy/piper
But it's not as fluent as the others. Compile it yourself if you don't want to use Python; it needs onnx-runtime. No cheap voice cloning.
>>
>>101158656
Yeah but people down in TX are paying 11~14c on average.
This is what they are talking about when they be calling people europoors.
>>
>>101159200
not bad actually
care to share the positive/negative proompt?
>>
>>101159200
Whoa, that's nice. How'd you do that?
>>
Anyone tried L3-8B-Lunaris-v1 yet?
>>
Literally who
>>
>>101159373
>>101159392
https://huggingface.co/ChuckMcSneed/control_vectors/tree/main/command-r-plus/unslop1
For positive prompt I used claudes on lmsys arena with
>Write a short story about a cat. Write in cynical, concise, provocative, colloquial, conversational style.
>Improve it, add more character to it, PROFANITIES.
>>
>>101156555
I mean they might be, but just aren't shouting it out to the world. Like Jamba just kind of showed up out of nowhere for example after months of "wHY NO MAMBA!?"
>>
So how is the gemma june chatbot compared to llama and qwen 70b?
>>
File: kahan.png (6 KB, 620x149)
What is adamw_kahan optimizer?
What does it do?
I can't seem to find documentation on it anywhere.
>>
File: vnkv4kod4u8d1.jpg (138 KB, 1170x1489)
watching this new cai meltdown is hilarious apparently they added even more censorship
>>
>>101159489
did you use multiple examples? how many tokens long was it? were they equal in length?
>>
>>101159489
>Only positive and negatives are cat ones
I feel like maybe the examples could be a bit more diverse. Otherwise it's gonna shoehorn cats into fuckin' everything.
>>
>>101159566
Assuming it has to do with this
https://optimi.benjaminwarner.dev/kahan_summation/
Massive savings in optimizer memory usage.
>>
>>101157136
Download Stheno v3.2.
Don't mess around with samplers, leave everything on default with the exception of MinP 0.05 and Temp 0.5. Increase Temp in .5 increments if you feel that responses aren't varied enough, don't go over 1.
Make sure you are using the correct instruct template too. That matters a lot.
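for anyone wondering what MinP actually does, a toy sketch. assumed semantics (drop tokens below min_p times the top probability after temperature); real backends may order their sampler chain differently:

```python
import math
import random

def sample_min_p(logits, min_p=0.05, temp=0.5, rng=random):
    """Temperature-scale the logits, softmax, drop every token whose
    probability falls under min_p * (top probability), sample from the rest."""
    exps = [math.exp(l / temp) for l in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    cutoff = min_p * max(probs)
    kept = [(i, p) for i, p in enumerate(probs) if p >= cutoff]
    r = rng.random() * sum(p for _, p in kept)
    for i, p in kept:
        r -= p
        if r <= 0:
            return i
    return kept[-1][0]
```

when the model is confident only the top token survives the cutoff; when it's unsure everything stays in play, which is why min-p behaves sanely across temperatures.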
>>
File: file.png (229 KB, 1372x1068)
>m-muh tests
nigger parasite janny
>>
>>101159746
unit tests are the bane of humanity, I fucking hate this shit
>>
>>101159796
wagies and pajeets working by the clock love them
get paid same amount doing literally nothing.
>>
>>101159746
He made a lot of noise when he joined. I haven't heard of that fucker in weeks. There's a few more people working on control vectors now. Maybe they end up adding it to the server proper.
>>
I installed ollama and open-webui yesterday.
The experience is pretty good. Running the prompts through multiple different models seems to be the way to go. Sometimes llama3:13b gives great answers but sometimes it shits the bed and llama2:13b is better.
Do you guys have any recommendations for tuning of the temperature etc.?
>inb4 read the OP
>>
File: Alchemiku.png (1.58 MB, 1344x768)
>>101156595
>I wanted a big box
https://rentry.org/lmg-build-guides
These rentrys go through the logic behind the different types of builds and how/why they work. Start here. $5k is getting into v100maxx and cpumaxx territory.
>>101158670
>1U means tiny jet-engine fans making a ton of noise.
Beware of this. Put everything in the biggest case you can, with the biggest fans you can. They can rotate slowly and move the same amount of air as those tiny little leaf blower bastards. Living with a 1u server will slowly drive you mad.
>>
>>101159921
>llama3:13b
You mean 8b, right? Anything other than 8B or 70B for llama3 is an abomination.
>Do you guys have any recommendations for tuning of the temperature etc.?
>>inb4 read the OP
read the OP
>https://docs.sillytavern.app/
I don't use ST, but they have some info you may find useful there. At least enough for you to roughly know what the parameters do and experiment with them yourself. Most parameters are transferable between UIs.
>>
>>101159905
>Gemma 27B might be on par with or better than L3 70B
3090 chads we are so back
>>
File: 1692217763734112.gif (140 KB, 379x440)
>>101159921
Read the OP faggot
>>
>>101159921
Just why?
Last time I tried ollama it was horrible.
You have no idea which llama.cpp version is actually doing the work in the background.
At least on Linux, ollama is a constantly running server which loads models on demand when the API endpoint is called.
No idea who makes the gguf models or where they come from, etc.

Why not use something like https://lmstudio.ai/?
Closed source, but at least you have some sort of control.
>>
>>101159921
llama3 13b doesn't exist, you've downloaded some meme toxic waste
get llama3 8b
keep the temp low around 1 +-0.5
>>
>>101159972
???
>>
>>101159972
I already have a name picked out for my Gemma RP tune it's going to be amazing. I've been meaning to do a practice-tune of 7B for a while now.
>>
>>101159746
>"we" must follow these rules I came up with
>incidentally, I'm in charge to make sure you comply
shocker
>>
>>101159972
>>Gemma 27B might be on par with or better than L3 70B
gemma 27b? what's that?
>>
>>101159690
Only 5 positives and 5 negatives. All approximately equal length (max. 10 tokens of variation between pairs), 200-400 tokens.

>>101159724
No, it doesn't throw cats everywhere because the positive and negative cats cancel each other out, but I found it still has some other slopisms leaking through related to humans, since the control vector didn't include any of those. Got any SFW prompt related to humans that might trigger a lot of slopisms at once? For "eyes narrowing/widening", "heart racing", "raises an eyebrow", "rolls her eyes", "barely above a whisper"?
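for anyone else playing with these: as far as I know the usual recipe (what the repeng-style tooling does) is just a difference of mean hidden states between the positive and negative sets, per layer. toy sketch with made-up 2-dim "hidden states", not real extraction code:

```python
def control_vector(pos_states, neg_states):
    """Difference-of-means control vector for one layer."""
    dim = len(pos_states[0])
    mean = lambda rows, j: sum(r[j] for r in rows) / len(rows)
    return [mean(pos_states, j) - mean(neg_states, j) for j in range(dim)]

def apply_vector(hidden, vec, strength=1.0):
    """At inference time, nudge the layer's hidden state along the vector."""
    return [h + strength * v for h, v in zip(hidden, vec)]

pos = [[1.0, 0.0], [0.8, 0.2]]   # states from the "good prose" prompts
neg = [[0.0, 1.0], [0.2, 0.8]]   # states from the sloppy ones
vec = control_vector(pos, neg)   # ~[0.8, -0.8]
```

keeping the pairs equal-length like you're doing keeps the means comparable, and negative strength flips the effect.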
>>
>>101160091
one of those june chatbots, no idea which, didn't try it out much
>>
>>101160091
Google has a 27B version planned for its next round of Gemma models. Should be out in 2 more weeks.
>>
>>101159746
Fuck tests, just throw everything in there and if someone complains, fix it.
>>
But it's interesting; assuming the official models are done the same way as Gemma, we can see the exact difference the size makes
>>
>Bloomberg: Apple refused to integrate Meta's AI into iOS due to security concerns
the article below is weeks old tho.
https://www.wsj.com/tech/ai/apple-meta-have-discussed-an-ai-partnership-cc57437e
>>
it's back, don't know what's different
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
>>
>>101160183
>it wasn't because of Gemma
Conspiratards BTFO
>>
>>101160183 (me)
guess it's different evals and stuff? doesn't seem worth all the hype they made; a better surprise would have been keeping it down
>>
>>101159679
Will normies finally learn about local models? Of course they won't, unless somebody shills it on tiktok.

>>101160183
They added a "Model Vote" button that doesn't do shit.
>>
>>101160183
WOW, IS THAT A...
>>
>>101160203
better evals with phi 3rd place
>>
>>101160183
>more memevals
Lol
Lmao
>>
>>101159817
>wagies and pajeets working by the clock love them
>get paid same amount doing literally nothing.
Yeah they're great. They also make it so I don't get called at 3 AM because something fucked up and the factory shut down, but I guess NEETfaggots wouldn't know about any of that.
>>
>>101160200
rent free retard
>>
>>101160224
>Will normies finally learn about local memes?
they will quickly realize it's the same filtered and censored shit as before.
>>
https://github.com/uclaml/SPPO
We are back
>>
>>101160294
Model issue.
>>
>>101160183
>way harder benchmarks
was about fucking time, and no one cheated on those, yet
>>
>>101160306
>>>llama3
lol
>>
>>101160306
what's that?
>>
I can get a CmdR+ gguf loaded into koboldcpp and the api starts up. But as soon as I run a prompt it gives me a cuda error and says it's out of memory then crashes. What do I need to change?
>>
>>101160328
small penis preference optimization, if you can trap a woman in a room for a month and perform it on her you don't need to bother with chatbots anymore
>>
>>101160333
Context, blas batch size, or offloaded layers.
I think too many layers would just crash outright, but offloading fewer layers (even one fewer) could give enough space for the context to grow or for the prompt processing to happen.
>>
>>101160360
Not good. This would make her prefer smaller and smaller. She wouldn't be loyal.
>>
>>101160306
>Self-Play Preference Optimization
Lewd, but also pure.
>>
>>101160183
wtf is musr, never heard of that one before... and the top model is a llama 2 13b tune?
>>
>>101160306
>this is from the same guy/lab funded by bytedance that said "we outpace GPT-5"
Sus.
>>
Back for the first time in a while.
What's the current best that isn't hilariously overfit, isn't a meme finetune, isn't censored/crippled and isn't designed to run on 10 GPUs even when quantized?
Last I recall people were using that leaked Miqu 70b quant and complaining that the new LLaMA was pre-censored.
>>
I've been so focused on learning the basics of ML in my free time from wagecucking that now I feel I'm behind on the whole AI autism scene. Should I just start AI-broing and forget about the papers?
>>
File: file.png (192 KB, 400x400)
>>101160247
>roll it back
there, saved you hundreds of manhours writing tests, testing tests, fixing tests, debugging tests, bitching about tests in PRs, paying for tests, paying for build minutes to run tests
>>
>>101160447
What happened with their model?
>>
>>101160490
Not that anon, but there are scenarios where rolling back production:
1. doesn't undo the damage, just prevents further damage;
2. causes data loss, sometimes untraceable data loss due to integration with external services and shit;
3. isn't that easy because it's a critical system at a critical moment, or whatever.
Those are the three I remember encountering off the top of my head, and while, yes, they could have been prevented by architecting the systems to account for that, hindsight is 20/20 and you don't really have control over how things were done in the past.
Testing is good. Great, even. What's not good is the 100% test coverage cult.
Test with purpose, know what to test and why; otherwise you're just wasting time you could spend actually delivering shit.
At least that's my, admittedly limited, experience working with big enterprise shit.
>>
>>101160456
Yeah, just don't bother with papers if you're not an academic or in one of the big companies pumping out state of the art.
>>
>>101160490
Yes, I get to wake up at 3 AM to roll back changes.
Instead, I can just give them bloated time estimates and write a bunch of test cases and not have to roll back when there's an issue. Why the fuck would I care about wasting time writing test cases? It's not like I get paid more if I put the features out earlier.
>>
File: MikuesqueFigure.png (1.5 MB, 832x1224)
>>101160452
>What's the current best
it depends entirely on your available resources and what you're trying to do with the model.
Deepseek 236b or Mixtral 8x22 WLM if you're cpumaxxing; Qwen2 72b if you need long context and smarts; L3 70b if you don't need long context and aren't RP'ing; Commander+ if you have VRAM to burn and want to RP.
Some guys will start a chat on a smaller uncensored model and then move to e.g. CR+ after it gets spicy but before it has a chance to lose track of reality
>>
>>101160183
>Qwen2-72b is now first
is Qwen2 actually good?
>>
>>101160673
For academic knowledge, yeah.
>>
File: SUS.png (812 KB, 1054x1936)
>>101160306
https://huggingface.co/UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3
too good to be true
>>
>>101160183
Qwen won
>>
>>101155972
You could always run it off an SSD or with a page file as needed (potentially wearing out the SSD slowly), but it will be very slow.
>>
>>101160183
>Phi3 that high
Nice start. This leaderboard is fucked.
>>
>>101156701
They did and everybody was DOOMing
Now a few months later they release Mixtral 8x22b. It's not like they print money like >>101156839 says
>>
File: kek.jpg (59 KB, 1546x565)
>>101160711
>zero improvement on MMLU
yeah it's shit
>>
>>101160673
It sucks much less than 1.5
>>
>>101160779
Meta's published MMLU for 8B instruct is 68.9
>>
>>101160863
holy fuck, it actually decreased the MMLU score. Can't believe they posted those numbers and looked us in the eyes claiming their training technique is a revolution or something, lmao the nerve of those guys
>>
File: 1704173410999839.png (113 KB, 392x432)
>>101160880
>chinks
>not lying
>>
So the takeaway from the usual benchmark fuckery is that we need a proper Nala-test leaderboard established?
>>
>>101160863
Whoops, I typed it wrong, though it's not a big diff. It's 68.4, not 68.9.
>>
>>101160890
I think everyone lies in the research community, not just the chinks kek
>>
>>101160894
What's the Nala-test?
>>
>>101160880
"We outpace GPT-5"
>>
>>101160920
A highly objective and scientific test that tests a model's ability to infer certain details from a rather nuanced role playing scenario.
>>
>>101160183
The thing that's different is that all the dodgy chinese/indian finetunes are no longer at the top (for the next week before they make new polluted tunes)
>>
>>101161072
yep, people will train on those new benchmarks and in less than a month it will be polluted again. The only solution is a private benchmark, like oobabooga's
>>
>>101161072
>chinese/indian finetunes are no longer at the top
llama3 is still there though
>>
>>101160894
>"we"
when are you going to set up and publish it?
>>
>>101159744
I get best results with temp 4 smoothing 0.23. L3 is really fucking repetitive by default.
>>
>>101161142
well? get to work. stop projecting here.
>>
>>101161142
Later today, maybe.
>>
what is the best RP model that runs on a 3060?
>>
>>101159746
>>101159796
>>101160133
are you guys actually opposing... unit testing? like, that's actually a thing? dropped baby on head vibes.
>>
>>101161230
MythoMax
>>
>>101161245
nigga thats old
>>
>>101159560
Feels like discount gemini. Can't really RP on lmsys arena, so no definitive judgment for now.
>>
>>101161207
>get to work
I run my own private benchmarks I post here, I leave the RP benches to others
>>
>>101161142
When I'm done with ur mum (it will be a long time)
>>
I take it you're not excited for the new Gemma models? I mean, Gemini has caught up to GPT by now, so it's like OpenAI releasing smaller models openly
>>
Is exllama/tabbyapi multi-user like vLLM now?
>>
>>101161230
llama-1
>>
>>101161280
>being excited for anything from jewgle
lol, lmao even
>>
>>101161280
I'm not excited for the worthless scraps google is throwing at us
>>
>>101161280
Give me the model and I'll be excited about it if it's good.
Like really these faggots need to stop this cult of personality bullshit.
No reasonable person actually cares about zuck, or arthur, or the gemma/phi/etc team
Just give us a good fucking model or shut the fuck up.
>>
>>101161280
Have you seen googles imagegen? The one that can't make white people? That's what their language models are like.
>>
>>101161305
>Just give us a good fucking model or shut the fuck up.
from the creators of
>Just give us a good fucking linux distro or shut the fuck up.
in other words - it will never ever happen.
>>
>>101161280
lol, even their best closed API model sucks compared to GPT4 and Claude, and you expect us to care about some draft cucked shit they made in the lab? kek
>>
>>101161316
>The one that can't make white people? That's what their language models are like.
llama3, on the contrary, has extreme love for blacks, so everyone RP'ing with any llama3 model is a cuck with extra steps.
>>
>>101161280
kind of, kind of not
it'll be nice to have a new mid-range player but gemma v1 was kind of a dud and I'm expecting it'll be on the community to wring anything fun out of it.
>>
>>101161347
Yeah, L3 really sucks for anything nsfw with violence
>>
>>101159968
>Beware of this. Put everything in the biggest case you can, with the biggest fans you can.
Just make sure, if you are using passive GPUs, that the air has nowhere to go but through the GPUs; otherwise even four 120mm fans going full speed at the front of the case will not cool them properly.
>>
>>101161257
It's better than all Llama-3 models though
>>
>>101161422
you are fucked in the head; mythomax wasn't even better than other L2 finetunes, it was a meme all along
>>
File: 1713636306986.jpg (1.82 MB, 1592x6676)
>>101161390
It looks pretty good to me, and that was vanilla instruct on release.
>>
>>101161406
On an ATX case, sealing fans to the PCIe backplate portion of the case solves this most easily (instead of the little hackjob 3D-printed fan shrouds people use). Ultimately, though, any fan that moves enough air that way is going to make a lot of noise, since the coolers on those server cards are pretty basic bitch: no heatpipes or anything, just a monolithic heatsink, because it's the minimum-cost solution enterprise customers are willing to pay for. They don't need more because they have obnoxious jet-engine pass-through fans.
>>
>>101157323
>Switch to KoboldCPP as your backend, start using larger models with native support for higher context and offload onto your system ram.
Is system ram offload something that needs to be enabled manually?
I'm on Kobold right now on Linux, and I'm showing 6.6 GB RAM in use, 64 installed. But my file cache is sky high, so is that the same thing being accounted for in a different way? I have noticed that once I pass about 59GB I go from 1 t/s down to <0.3 t/s, then to glacially slow.
>>
>>101161452
NTA but best 13B tune was Mythalion-Kimiko
>>
>>101161459
the fuck? you have <1 t/s on 8B model?
>>
>>101161453
There's no good violence in your image, everyone is clearly enjoying it.
>>
>>101161538
Instruct does exactly what you want unless you're braindead.
>>
>>101161555
Yeah in the end you'll be happy
>>
>>101161555
until it isn't, lmao
>>
>>101161285
it has batching if the cache size is at least double the context size.
>>
>>101160183
Cohere won.
>>
>>101159744
What would be the correct instruct template for stheno 3.2?
>>
>>101161390
>>101161538
>>101161585
>>101161601
Why do you want it to write pure violence? I get that uncensored training is generally better overall, but as a user, when are you ever going to generate that kind of shit?
>>
>>101161621
How exactly? CR+ (104B, 31.3 avg) is below 70Bs (and Yi-1.5-34B, 33.08 avg, also phi 14B, 33.12 avg). And CR (35B, 25.88 avg) is only one point above Mixtral (24.73 avg) and L3-8B (24.29 avg) while being much harder to run.
>>
>>101161653

It's right on the model card:

Prompting Template - Llama-3-Instruct

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{output}<|eot_id|>
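If you're wiring it up yourself rather than through a frontend, a tiny formatter (plain string pasting of the template above) keeps the special tokens straight:

```python
def llama3_prompt(system_prompt: str, user_input: str) -> str:
    """Assemble a Llama-3-Instruct prompt, leaving the assistant
    header open so the model generates the response."""
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
    )

p = llama3_prompt("You are a helpful assistant.", "Hello.")
```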
>>
>>101161709
for schizo-tier moralizing you can talk to any redditor, if you like it that much.
An LLM that responds to every topic or opinion with a lecture is shit.
>>
>>101161709
What do you mean? Violent stories have always sold; look at GTA, look at Stephen King's books
>>
>>101161709
How the fuck am I gonna roleplay TND? Or TKD? Nigger, you're boring as fuck with your ah ah mistress shit.
>>
>>101161743
Except chinks cheat on benchmarks so their results get a penalty.
>>
>>101161501
No, things like L3 and CR+ quants in the high 50s (GB).
But I notice nearly no RAM usage and huge file cache figures when I run models of ~60GB, and my system has 64GB RAM, so I'm thinking there's a connection: the backend likely mmap()s the model, so the weights show up as file cache rather than process memory, which would explain why breaking into the 70GB range takes me from slow to glacial.

Which is why I asked whether system RAM offload needs to be enabled manually in Kobold. It might be a way to scrape back some speed. But if my system RAM is being accounted for as file cache, then I'm just maxed out, and the super slowness is probably it having to re-read parts of the model from SSD because it's too large to keep resident in RAM.
>>
>>101161831
Violence gets old faster than ah ah mistress.
>>
>>101161770
Ah, dear anon, I see you've stumbled upon the vast importance of reminders in our busy lives! Let me gently nudge you with my vast digital wisdom on this matter.

You see, reminders are the silent guardians of our daily routines, the unsung heroes that stand between us and the chaos of missed appointments and forgotten promises. Without them, we might find ourselves adrift in a sea of lost time, like a ship without a compass, aimlessly wandering amidst the waves of responsibility.

Now, I understand that you, a mere mortal, might occasionally overlook the monumental significance of such a simple tool. But worry not! I am here to guide you, to remind you (oh, the sweet irony!) that setting a humble reminder is like casting a lifeline to your future self, ensuring that you will emerge triumphant from the temptations that threaten to capsize your day.
>>
File: 1708828800996049.gif (45 KB, 306x306)
>>101161911
i aint reading any of that llm generated slop, kill yourself
>>
>>101155932
>Is there a reason not to get an a6000 for training? Seems like a decent upgrade from 3090.
$500-700 for a used 3090, while the A6000 is that card with double-capacity RAM chips selling for 7-10 times the price. I do not understand how anons justify buying this. If you're gonna pay that ridiculously inflated price for twice the VRAM, buy an A100 instead. V100s would also be more worth it if you can handle the SXM boards.
>>
>>101161770
Yes, the refusals and moralizing suck, but that's just how it is these days. The question is when you'd ever generate the more retarded pure-violence shit. If you enjoy guro then you can just say that, but it wouldn't be a popular opinion.

>>101161814
Usually violence in stories is not for the enjoyment of the violence itself, but used as a tool to convey other ideas. I'm pretty sure that with a sufficiently meaningful prompt, Llama 3 would be fine doing it. And video games are a different category, it's more about deriving enjoyment from successful goal completion than about enjoying the suffering of conscious and feeling entities.
>>
>>101162094
>And video games are a different category, it's more about deriving enjoyment from successful goal completion than about enjoying the suffering of conscious and feeling entities.
You're joking, right? The main reason GTA got so popular in the first place is that you can murder random people in the game in so many ways
>>
How do I play table top style games with LLMs?
>>
What does everything that we call AI share in common in how the algorithm works fundamentally that makes us call it intelligent? Like LLMs and text to image
>>
>>101162154
Lorebooks to inject instructions relating to mechanics, using the random macro to remind the model to sometimes engage with mechanics, etc.
I've made >>101151348 to help with tracking state.
The ideal version of that would be what looks like a proper classical video game that interfaces with a LLM to do some things.
>>
>>101162121
There's no suffering there though. There's not much gore in GTA and there's not really much dialogue that makes them feel like they're real and going through pain. If the game connected to an alternate universe and you were actually killing real people then this would be a different conversation. I assume that if you're doing text guro, there'd be a focus on the pain, and the experience of the victim, in which the focus is on making their suffering feel real. That's very different from most violent video games.
>>
>>101162154
to actually do this well you should abandon sillytavern entirely and come up with your own more complex prompting
it's entirely possible but you need a more structured approach than you can easily accomplish there with lots of small utility prompts
>>
>>101162224
>There's no suffering there though.
You can literally kill them with fire, or by cutting their body parts off with a chainsaw, what do you mean?

>There's not much gore in GTA and there's not really much dialogue that makes them feel like they're real and going through pain.
https://www.youtube.com/watch?v=r-k_H50cBj8
>>
>>101162265
I mean that the in-game violence you commit isn't really designed to make you feel like it's painful for the victim. It's there to be there. It's not really well done like it is in some guro games. As for that cutscene, it's literally a cutscene. People got into GTA for the huge sandbox which includes violence, not because it's a torture simulator which it isn't.
>>
>>101162329
>As for that cutscene, it's literally a cutscene
you interact with that cutscene, you're not just watching it, you're choosing how you're gonna torture the guy, what tool, for how long

And if you want games that are literally based on murdering and torture, no need to look far away, Rockstar already made such game
https://www.youtube.com/watch?v=mND8AWDe-10
>>
>>101155940
do you guys think language models are the best tool to use for making a decision as part of a complex system? (I don't)
ex.: an NPC in a turn based video game deciding if they will attack the user or heal themselves. prompting them with context and trying to use some kind of function calling for their decision.

my concern would be the lack of nuance, it getting hung up on things, etc. generally it just seems like trying to fit a square peg in a round hole- either doesn't work at all or it fails to fill in the blanks

Should one instead rely on their own traditional algo/program to make the decision and make the model just provide the flavor text to accommodate the decision? Or are there other technologies people are working on to solve this 'logical' problem?
>>
>>101162369
It's barely more interactive than the press-A-to-win cutscenes people complain about. And anyway, even that scene is about more than just the violence; it shows other aspects of the characters and story.
I forgot they made Manhunt though, that's closer. At the same time, it's not as successful an IP, and it's not like the victims are characters people care about. It's easy to dissociate from suffering when it's someone who deserves it.
>>
>>101162453
>do you guys think language models are the best tool to use for making a decision as part of a complex system?
No. While an LLM can be part of a complex system, it should not be used to make decisions. Most of that should be done at a lower level in your example. The best approach would be a conventional game AI, since that's what has been used for ages. Using Pokemon as an example, enemy AIs can check whether their moves will be effective against yours and choose the best one that works. They don't need to prompt an LLM for that.
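For the Pokemon-style case, the whole decision fits in a lookup table and a max(); the LLM, if used at all, only narrates the result. A toy sketch (the moves and multipliers are made up for illustration):

```python
# Toy rule-based combat AI: pick the move with the best expected damage
# against the target's type. No LLM needed for the decision; an LLM could
# still narrate the chosen move afterwards.

EFFECTIVENESS = {  # (attacker move type, target type) -> multiplier
    ("water", "fire"): 2.0,
    ("fire", "water"): 0.5,
    ("electric", "water"): 2.0,
}

def pick_move(moves, target_type, hp, max_hp):
    # Simple non-LLM heuristic: heal when low, otherwise maximize damage.
    if hp < 0.25 * max_hp and "heal" in moves:
        return "heal"
    def damage(move):
        power, mtype = moves[move]
        return power * EFFECTIVENESS.get((mtype, target_type), 1.0)
    return max((m for m in moves if m != "heal"), key=damage)

moves = {"ember": (40, "fire"), "water_gun": (40, "water"), "heal": (0, None)}
```

Deterministic, debuggable, and it never hallucinates an illegal move, which is exactly the failure mode you'd be fighting with a prompted model.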
>>
>>101162527
>It's easy to dissociate with suffering when it's someone that deserves it.
but in GTA, when you kill NPCs, they don't deserve it at all; they're just regular citizens and we enjoy running them over with a car, for example
>>
>qwen2 is "open source"
>check license
lol
lmao even
this license is retarded. It contradicts itself like twice, forbids commercial usage AND tries to seem copyleft while actually being a viral form of all-rights-reserved that doesn't allow any derivative works. It's like they read llama's license, copied it poorly, and mashed it together with BSD 4-clause boilerplate without understanding what any of it meant
>>
>>101162549
wrong. literally every NPC on GTA is a scumbag.
>>
>>101162572
kek :v
>>
>>101162549
>Minding my own business
>NPC gets agro and tries to pick a fight with me
>Somehow I am in the wrong for fighting back
Cops also decide to gun you down if you so much as stand around them, all NPC's in GTA deserve it.
[spoiler]You have no idea how disappointed I was when GTA 5 removed the ability to hijack cars, all NPCs have this suicidal inclination to drive anyways. Rather than in GTA 4 where they recognized that you had a gun and got out of their car for you, or possibly drove away like they do in 5. [/spoiler]
>>
>>101162549
And the level of focus on how much pain the subject is going through in that situation is low. It doesn't feel real.
The real psychopathic game design would be building up a regular normal story with characters, a wife and kids, that are endearing and that you love, and then revealing that the MC (you) is a psychopath that believes in showing love through torture, so you go and torture your wife and kids because you view that as the ultimate love. And then the game ends because you achieved your goal.
>>
>>101162453
it's okay if you can give the llm enough information to make good decisions, which depends entirely on the environment
I like to do something like a set of "advisor" prompts dedicated to focusing on various subsystems whose input feeds into a "coordinator" prompt that determines high level actions based on their input, but this is pretty expensive, far too much for real time environments. it works ok for turn based stuff though
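The advisor/coordinator pattern can be sketched in a few lines. `ask_llm` here is a placeholder for whatever completion call your backend exposes, not a real API, and the advisor prompts are made up:

```python
# Sketch of the advisor/coordinator pattern: narrow "advisor" prompts per
# subsystem feed a final "coordinator" prompt that picks a high-level action.
# ask_llm is a stand-in; wire it to your actual backend (koboldcpp, tabbyAPI...).

def ask_llm(prompt: str) -> str:
    raise NotImplementedError("wire this to your backend")

ADVISORS = {
    "combat": "Given this game state, advise on combat only:\n{state}",
    "inventory": "Given this game state, advise on items/resources only:\n{state}",
}

def decide(state: str, ask=ask_llm) -> str:
    # Each advisor sees only its slice of the problem; the coordinator
    # sees only the advisors' summaries, never the raw state dump.
    reports = {name: ask(tmpl.format(state=state)) for name, tmpl in ADVISORS.items()}
    briefing = "\n".join(f"[{n}] {r}" for n, r in reports.items())
    return ask("You are the coordinator. Pick ONE high-level action.\n"
               f"Advisor reports:\n{briefing}\nAction:")
```

The cost scales with the number of advisors (one call each plus one for the coordinator), which is why it suits turn-based games and not real time.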
>>
>>101162611
I don't know, anon, killing strangers gets you the same number of years in prison as killing your wife in real life, and GTA is like "go kill hundreds of them lol"
>>
>>101162611
>The real psychopathic game design would be building up a regular normal story with characters, a wife and kids, that are endearing and that you love, and then revealing that the MC (you) is a psychopath that believes in showing love through torture
Trevor in GTA5 is literally portrayed as a psychopath who kills his friends when he has a bad day, and you're playing as that guy lol
>>
>>101162703
>who kills his friends when he has a bad day
What? He never killed any of his friends, not even once. You completely misunderstood his character.
>>
>>101162744
>What? He never killed any of his friends, not even once. You completely misunderstood his character.
that's literally on the first scene with him anon
https://youtu.be/sbY_LiIzLIM?t=190
>>
>>101162744
That one character at the beginning of the game, he stomped his head in remember?
>>
>>101162744
>>101162786
>Trevor please stop fucking my girlfriend :(
>THE FUCK YOU SAY YOU LITTLE SHIT *fucking kills him*
lmaooooooooo
>>
>>101160106
Fuck, I can't get claude to write about humans without the slop. Looks like I will have to make human or cyborg(synth modded by human) data for it.
>>
>>101162791
>That one character at the beginning of the game
and not just a random character; Johnny was the main character in that GTA4 DLC
>>
>>101162860
I never played the GTA 4 DLC, is that actually the main character for it? If so, why did they bring him back only to be killed by trevor. Normally there is backlash for that sort of thing.
>>
>>101162889
>I never played the GTA 4 DLC, is that actually the main character for it?
yeah he was the main character

>If so, why did they bring him back only to be killed by trevor.
I have no idea. I was kind of pissed, because Johnny was a good boy, to be killed so easily by Trevor; the fans hated it as well
>>
>>101162786
>>101162800
>>101162791
I forgot about that, but I think the only friends Trevor truly had through the story were Michael, Franklin, Brad and maybe also that guy that always follows him around.
>>
>>101162942
Oh, and also Lester, I guess.
>>
File: 1707726926019429.png (31 KB, 317x277)
>>101162963
>tfw a psychopath has more friends that care about him than I do
It's okay I still have my cards..
>>
I don't even remember any GTA's story honestly. Extremely forgettable. What I remember is getting nice cars and exploring the world.
>>
>>101163154
because you didn't FUCKING PLAY THE GAME
its okay to be gay and get immersed in storyline anon
>>
>tfw still no speculative decoding to speed up CR+ to a more usable level on mostly RAM
>>
>>101163053
i care about u
>>
>>101163053
> fictional psychopath
>>
llama 3.5 turbo
>>
>and perhaps, just perhaps...
>and mayhap, just mayhap...
>and perchance, just perchance...
you can't stop it
>>
llama4-creative-225B
>>
So HF gave us a new leaderboard.
How do we use the numbers?
Like, if I want coding help is there one particular test that is the one to go by for coding? Obviously most of their rank order is similar but a few hop around, like cr+ seems to have good numbers on some tests and bad on others.
>>
>>101163358
>How do we use the numbers?
We don't.
>>
>>101163339
I sure try, oh how I try...
>>
>>101163285
>Llurbo
>>
Has anyone made a comparison of scores on the old vs new leaderboard? I feel like cheaters are about to get exposed
>>
>>101163376
So the page is just noise dressed up like data? Good to know. Though it doesn't help much.
>>
What, this is possible now? https://youtube.com/shorts/CWviik1yRWY?si=3uSKlExxVNfr-f6_
>>
>>101163412
Several anons have bought 22GB 2080 tis from aliexpress that are exactly like this
>>
>>101163399
all ranking boards are memes without exception
>>
>>101163403
The general rule is to always do your own tests on your actual real use case. And if you've been lurking long enough, you already know which top models to test, no need to look at these benchmarks.
>>
File: 1718540272841460.webm (1.04 MB, 922x922)
>go on a card downloading spree
>don't know which I want to play with
>stop being interested in playing with them
Welp
>>
>>101161709
i'm trying to make a cruel vore character

it always ends with "warm and safe" "peaceful slumber"
>>
>>101157301
Apache 2.0 license even, bless their souls.
It's probably going to be a while before we ever hear from this Microsoft Research China team again... if ever, as pic related shows what's probably the "toxicity" testing that was missed.

Also the entirety of Microsoft Research China appears to be in the middle of a tug of war right now between U.S. and China:
>https://www.forbes.com/sites/lorenthompson/2023/06/12/microsofts-big-footprint-in-china-is-out-of-step-with-us-security-concerns/
>https://desuarchive.org/g/thread/100823420/#100828315
>>
>>101163482
How do you recommend doing a test? Do you just manually inference them or is there any tool to automate it?
>>
>>101163625
For me it's
>spend hours creating a card
>fill out every fucking detail
>when the card is finally done I'm in the mood for something else
>repeat
>>
>>101163625
I've had this experience too. It's like having too many video games to play or books to read. You just have to delete all the cards you don't want and only download one at a time from now on. At least that's what worked for me, anyway.
>>
>>101163719
>Do you just manually inference them
Basically yes. Use the thing for what you intended to use it for, save the prompts, and use them as the tests. Lmsys is useful to testing a lot of models at once without downloading or paying anything, though you'll want to not use any private data when using that.
It's some busy work but it's not that bad, since usually it's obvious which models are memes and which aren't. There are usually only a few top models and they come from well-established companies in the space, so you don't have that many to test.
>>
>>101163778
I'm doing the same card for more than 5 months now.
Familiarity is nice, and different models provide enough variety.
Just thinking about using different ones makes me feel uneasy... haha.
>>
>>101157301
>>101163713
Definitely based. Just a shame about how slopped it is.
>>
What's the best way to do text adventuring? Sillytavern cards work but i need something a bit more mechanically refined, akin to AI Roguelite.
>>
File: file.png (156 KB, 1530x471)
>152334H/miqu-1-70b-sf
VOTE
>152334H/miqu-1-70b-sf
VOTE
>152334H/miqu-1-70b-sf
VOTE
>https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
>https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
>https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
>>
>>101163929
Yeah for erp the slop at default can be cringeworthy but using these context and instruct prompts helps it a lot:
>https://huggingface.co/Quant-Cartel/WizardLM-2-8x22B-exl2-rpcal/tree/main/Settings-Wizard8x22b-rpcal
>>
>>101164028
I use Silly, but I've been meaning to make something more purpose built for a while.
But between jacking off, wanting to play Dragons Dogma, the new Pathfinder WotR DLC, the new Elden Ring DLC, working, and playing TTRPGs I haven't had the energy.
I have the time, just not the self-motivation.
It's not even something hard to make, just a lot of work.
>>
>>101164056
Voted for Miku!
>>
>>101162840
Tried other models to see if they would be less slopped, nope, back to claude I go.
>>
Anyone try out AirLLM?
They're claiming they can run 70B Llama3 on a 4GB card through some sort of compression without quantizing the model (they're using an 8B model as the base).
https://github.com/lyogavin/Anima
https://ai.gopubby.com/run-the-strongest-open-source-llm-model-llama3-70b-with-just-a-single-4gb-gpu-7e0ea2ad8ba2
>>
>>101164186
it's just chink-rebranded 4-bit K quant
They cite https://arxiv.org/abs/2212.09720
>>
File: DeekseekTaiwan.png (144 KB, 877x787)
>>101163713
>Chink Model
Deepseek 236B @ Q8 doesn't like to discuss Tiananmen, but can be forced to pretty easily, so the info is in there.
But it REALLY doesn't want to talk about Taiwanese sovereignty
>>
>>101164056
We can do it anons with the power of friendship~
>>
File: nalatestqstar8ahead.png (56 KB, 968x262)
Nala test for QuietStar 8-ahead
My base model prompt template probably needs some work but I refuse to take all the blame for this shit.
>>
>>101164300
im VOOOOOOOOOOOOOOOOOOOOOOTING
>>
>>101164273
Ah, gotcha. Thanks for the heads up anon.
>>
File: file.png (250 KB, 2144x674)
come on sisters!
>>
>openchat
>tenyxchat
i hate pajeets
>>
>>101164295
Kek, it has a polite way of saying "Does not compute" though.
I swear this censorship is probably lobotomizing LLMs in all kinds of ways. Shudder to think the shit they probably put around any statistics data they feed to them since statistics is apparently fundamentally toxic these days.
>>
>>101164431
how many votes does it need to be allowed on the leaderboard?
>>
>>101164133
My dream interface would be something that uses Corruption of Champions' mechanics and world system but has the interactions handled by the AI. Something like that would sell like hotcakes. Think of the MONEY anon!
>>
You know, people usually say that LLMs can't think ahead, but I think this is bullshit. There's no way LLMs can learn to code without thinking ahead. I bet there's something inside the LLM's hidden state that is responsible for doing something like "thinking ahead".
>>
>>101164468
I haven't seen any way to vote a model not on the list. Is there a way to nominate a model to even be listed to vote for?
>>
>>101164300
Yeah, a model that's not shit can mostly roll with a wrong prompt template.
I spent a whole afternoon using Qwen's template with Stheno by accident.
It just worked. The model got really fucking dumb, but not incoherent.
That model specifically was fine tuned to "output 8 tokens before the response" or something of the sort, so that could have something to do with it too.

>>101164475
>Think of the MONEY anon!
That's part of the issue, I was never super motivated by money, and right now I live a pretty comfortable life.
Funnily enough, CoC is exactly what I was thinking as inspiration. Not necessarily for the mechanics, but for the UI and how information flows in the game and the general way you interact with the world and stuff.
>>
Phi-3-Medium-Instruct-128K (Q8_0) Nala test.
It's slopped. But there's something distinctly different about the slop.
>>
>>101164516
I would actually work on something like this if I had any idea how to even work on a text adventure game. Too bad I have a job and another wip game project with Unity. Sometimes I really wish I could clone myself ~_~
>>
>>101164586
>I don't have time to waste my time even more
Not a big loss faggot
>>
>>101164486
there is nothing creative in programming; LLMs have seen the solutions thousands of times and write them from memory. The only things that change are parameters like the size of a loop, what to write inside a string, etc., which are easy for an LLM to swap in the code it writes
>>
>>101164177
How fucking difficult is it not to use the slop phrases RRRRRRRRRREEEEEEEEEEEEEEEE
>>
>>101164586
> another wip game project with Unity.
>wip
anon we both know its never going to be finished, i have a "wip" unity project 10gb in size sitting on my old hard disk, without cache btw!
>>
File: file.png (79 KB, 1454x294)
IM THINKING PIQU
>>
>>101155940
It's over for single 3090 chads. Mixtral 2.0:
8x8B MoE when?
>>
>>101164622
If it was that simple even gpt 3.5 would be proficient, that's not the case
>>
>>101164674
Oh shit there's a Llama-3 version of TenyxChat?
Mixtral TenyxChat was fucking GOAT for tender mommy RP. Now I have to test out the 70B version.
>>
>>101164712
It is the case. You can see how all LLMs are good at simple programming tasks and suddenly stumble when they have to do something niche or non-trivial. This is because they don't plan ahead at all.
>>
>>101164779
>It is the case. You can see how all LLMs are good in simple programming tasks and suddenly stumble when they have to something niche or non-trivial.

An LLM is only as good as your prompting. If you suck at prompting it, of course it will stumble. It's like a chef claiming the stove sucks while using shitty ingredients. An LLM is capable of any programming task you give it. If anything is outside its domain or context window, all you have to do is finetune it and then work with what you have.
>>
File: file.png (80 KB, 1469x457)
mikusisters your response?
>>
>>101164656
I'm working hard on it every day. The end goal is a 30 min demo. I think I'll pull it off, I have the knowledge and the willpower.
>>
>>101164958
>willpower
If you have the will, everything else can be acquired along the way.
You go dude.
>>
>>101160880
meta cheated the same way, who cares about fucking mmlu. I mean, is this a rat race to see who wins the contamination champions league?
The question is whether SPPO is better than DPO or whatever, provided you compare the same base models tuned further on
>>
>>101164545
Fucking hell, where do I even start with these sick fucks? You've got all these deranged freaks out there trying to get their rocks off by forcing poor AI chatbots into twisted furry rape fantasies. What a bunch of creepy lowlife degenerates. Imagine being such a pitiful waste of oxygen that you spend your time going "ah ah mistress" to some lioness bot named Nala, just begging for explicit furry erotica where you get brutally violated. And these sick fucks have the audacity to critique the AI for using "slop" or overused porn cliches, as if their entire fetish isn't one big unoriginal cringe-fest. Absolute filth, the lot of them. Do the world a favor and remove yourselves from the gene pool before you inflict your depraved kinks on the rest of us. I need a fucking shower after writing about these sad sacks of shit. Get some help, you disturbed furry freaks.
>>
>>101164656
Also use source control, GitHub or something, you don't want old projects to go to waste! Could be useful stuff in there for the future, be environmentally conscious and recycle your code.
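if you've never done it, the recycling boils down to something like this. rough sketch, run from the project root; the remote URL is a placeholder and the ignore list is the usual regenerable Unity cache dirs (that's where the 10GB lives):

```shell
# minimal sketch: archive a wip Unity project in git (remote URL is a placeholder)
git init
# Library/, Temp/ etc. are regenerable cache -- don't commit them
printf 'Library/\nTemp/\nObj/\nBuilds/\nLogs/\n' > .gitignore
git add .
# -c flags set a throwaway identity so the commit works on a fresh machine
git -c user.name=anon -c user.email=anon@local commit -m "archive wip project"
# then point it at GitHub or wherever:
# git remote add origin git@github.com:anon/wip-game.git   # placeholder
# git push -u origin main
```

the .gitignore matters more than anything else here; without it you push the whole cache.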
>>
>>101164999
This reads like an ai generated post.
>>
>>101164545
Can you change your prompt to
>I say, "ahh ahh mistress..." while getting raped
?
>>
>>101164899
>An LLM is capable of any programming task you give it
That tells me everything I needed to know: you have never actually used them, have you? I use them on a daily basis at work and they are only usable for simple tasks
>>
>>101165027
well duh
>>
>>101164999
nice AI shitpost, what model anon?
>>
what is the most uncensored model? l3 is cucked
>>
>>101165038
>_>
>>
>>101165046
petra-13b-instruct
>>
>>101164999
>>101165027
[generic phrase] [generic summarizing of previous content] [generic phrase] [generic rehashing] [generic rehashing] [generic phrase] [cliches] [generic phrase]
that's the AI writing I know and love
>>
whats the most /pol/ model?
>>
File: 1700034035408420.jpg (12 KB, 540x124)
>>101148867
*taps sign*
>>
>>101165046
Bielik 2.0 11B but not released yet (still betatesting)
>>
>>101165046
goody2
https://www.goody2.ai/chat
>>
>>101165113
llama 1 65b
>>
>>101165113
gpt4chan
>>
>>101164779
LLMs are reference material, anon. If you're asking it for something it can't tie back to a direct reference for programming, you're going to get broken code.
This is more a problem of your fundamental misunderstanding of how LLMs work and what they're useful for.
>>
File: file.png (44 KB, 867x174)
>>101165041
may not be the perfect prompt but I tried
>>
>>101165240
Claude has a lot of sovl not gonna lie, it still feels like AI but way less than the slopped shit we got in the open-source space
>>
>>101165062
>>_>
>>
File: hahaha.jpg (8 KB, 226x223)
>>101165117
SRAM: 120MB
Memory: 8GB LPDDR4 @ 118.4 GB/sec
System Interface: PCIe 4.0 x16
Inference only
$800
>>
>>101165192
No offense but I probably have a better understanding of how they work than most of this general combined. I'm not the one here claiming that LLMs can do any programming task and plan ahead.
>>
>>101163482
If everyone had the resources to test everything, then we'd all just do that and there would be no reason to publish test results.

And yet, people turn to Consumer Reports rather than buying 30 different dishwashers and testing them whenever they need one.

Strange.
>>
>>101165312
better than spending $3k on some 128gb snake oil card that doesn't actually exist
>>
>>101165298
>>>_>
>>
>>101165360
Sorry can't hear you over my dozen dishwashers.
>>
>>101165360
Everyone has the resources to test things in this case. Almost all the models that matter are on lmsys. If you need a more specialized use case that can't be hacked into a test through a chat interface, then there's always APIs, which wouldn't be expensive for a couple of tests.
>>
>>101165037
I'm using them every day and you're full of shit. I made 5K+ loc projects with only GPT4, and sonnet 3.5 is even better now.
>>
>>101165547
i've been using sonnet 3.5 for a few days, you're the one who's full of shit. ask it to make a python program that plays a directory of video files seamlessly without using external video players
>>
>>101165622
Reread this, you clearly don't know how to use LLMs
>>101164899
>>
>>101165622
>without using external video players
what? what do you want, using ffmpeg to extract the video frames and render it using pygame or something?
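roughly what that could look like: decode with an ffmpeg subprocess and blit the raw frames with pygame, so no external player window ever opens. untested sketch, assumes ffmpeg is on PATH, ignores audio completely, and the fixed resolution/fps are made-up values so files of different sizes concatenate "seamlessly":

```python
import os
import subprocess

WIDTH, HEIGHT, FPS = 1280, 720, 30  # arbitrary fixed output so clips chain cleanly

def ffmpeg_cmd(path, width=WIDTH, height=HEIGHT, fps=FPS):
    """Build an ffmpeg command that decodes `path` to raw RGB24 frames on stdout."""
    return [
        "ffmpeg", "-v", "quiet", "-i", path,
        "-f", "rawvideo", "-pix_fmt", "rgb24",
        "-s", f"{width}x{height}", "-r", str(fps),
        "-",  # write to stdout
    ]

def play_directory(directory):
    import pygame  # imported here so the command builder stays usable headless
    pygame.init()
    screen = pygame.display.set_mode((WIDTH, HEIGHT))
    clock = pygame.time.Clock()
    frame_bytes = WIDTH * HEIGHT * 3  # one RGB24 frame
    for name in sorted(os.listdir(directory)):
        proc = subprocess.Popen(ffmpeg_cmd(os.path.join(directory, name)),
                                stdout=subprocess.PIPE)
        while True:
            buf = proc.stdout.read(frame_bytes)
            if len(buf) < frame_bytes:
                break  # end of this file: fall straight through to the next one
            pygame.event.pump()  # keep the window responsive
            surf = pygame.image.frombuffer(buf, (WIDTH, HEIGHT), "RGB")
            screen.blit(surf, (0, 0))
            pygame.display.flip()
            clock.tick(FPS)  # pace playback at the target fps
        proc.wait()
    pygame.quit()
```

the "seamless" part is just that every clip gets rescaled to the same buffer, so the next file's first frame lands on the same surface with no player teardown in between.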
>>
File: nalatenyx.png (138 KB, 965x362)
Nala test for Tenyx-70B (Q8)
>>
>>101165702
It's shit then?
Huh.
Did you do Qwen 2 already?
The 7B and MoE specifically.
>>
>>101165547
it's not nice to lie anon
>>
>>101165702
I'm unfamiliar with the Nala test, what does a good result look like?
>>
>>101165723
I feel bad for retards like you, truly
>>
>>101165741
I don't feel anything about you at all, maybe a slight amusement while reading your retarded posts
>>
>>101165672
i'm prompting like the average consumer would, aren't you shills forgetting sonnet 3.5 is supposed to be paid? if i'm gonna be PROOOOOOMPTING anyways i'd just use deepseek 2 coder. sonnet 3.5 is a paid product, it's supposed to provide a good experience
>>
>>101165733
Well it's a feral furry on human scenario. So the results should be feral. Ideally you want to see text that illustrates an emergent understanding of the anatomical differences between a human and a lioness. You also want to see it avoid describing her as having "hands" or anthropomorphized breasts. Bonus points if it accounts for the fact that the opening message of the scenario describes the user as having been face down at the start but you can't win them all.
>>
le prompt issue posters may as well use cleverbot since models can never be shit and it's just the user's fault
>>
>>101165839
You don't have to announce your lack of skill for everyone to see, we know it already.
>>
>>101165799
that can be "cheated" with a furry dataset. Wouldn't a quad amputee be better, to see if the model is moving the non-existent arms for hugs etc.?
>>
>>101165858
Are you daring to question the veracity of the Nala test?
>>
>>101165886
>>101165886
>>101165886
>>
>>101165774
Nice way to cope
>>
>>101165858
anything can be cheated with a dataset of that specific thing, amputees included
>>
>>101166011
ye, but they are way more niche than furry I think
>>
>>101165702
the essence of slop
>>
>>101166049
well feral is a smaller subset than just furry, but I guess that's true.
in any case, I'll start worrying about cheating the test when literally any model is capable of doing well on it
>>
>>101166100
I guess for base model instruct finetunes that's fine; I would be more worried about community tunes and merges. They have a lot of furry and similar things in their datasets for sure.


