/g/ - Technology




File: ComfyUI_00185_.jpg (150 KB, 1024x1024)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>101767112 & >>101757601

►News
>(08/07) LG AI releases Korean bilingual model: https://hf.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
>(08/05) vLLM GGUF loading support merged: https://github.com/vllm-project/vllm/pull/5191
>(07/31) Gemma 2 2B, ShieldGemma, and Gemma Scope: https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma
>(07/27) Llama 3.1 rope scaling merged: https://github.com/ggerganov/llama.cpp/pull/8676
>(07/26) Cyberagent releases Japanese fine-tune model: https://hf.co/cyberagent/Llama-3.1-70B-Japanese-Instruct-2407

►News Archive: https://rentry.org/lmg-news-archive
►FAQ: https://wikia.schneedc.com
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Programming: https://hf.co/spaces/bigcode/bigcode-models-leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>
File: 1720869141748493.jpg (94 KB, 875x916)
►Recent Highlights from the Previous Thread: >>101767112

--New 8B SOTA model EXAONE-3.0-7.8B-Instruct released, with impressive benchmarking results: >>101769935 >>101769953 >>101770112 >>101770146 >>101770154 >>101770053 >>101772859 >>101775225
--Function calling prompt format for Mistral-7B-Instruct-v0.1: >>101773873
--Llama merge model's output showcases purple prose writing style: >>101771752 >>101771831 >>101771938 >>101772034 >>101772065 >>101772513 >>101772549 >>101772569 >>101772587 >>101772661 >>101771959
--Exaone instruct template and special tokens discussion: >>101770199 >>101770213 >>101770248 >>101770296
--Exaone 3.0 7.8B Nala test results show promise despite limitations: >>101770519 >>101770561 >>101770565 >>101771010
--Mistral's performance and the concept of "soul" in LLMs: >>101772518 >>101772530 >>101772570 >>101772718 >>101772919 >>101773042 >>101773163
--Mistral models have a repetition issue despite being preferred: >>101772212 >>101772231 >>101772297
--Exaone beats GPT-4 on the strawberry test: >>101771076 >>101771219 >>101771313
--Comparison of Cerbras and Nvidia hardware specs: >>101767465
--Anon shares a sampling experiment with TopK 2: >>101772147
--Anon prefers local models for reliability and privacy: >>101767379 >>101767443 >>101774269
--AI bot's coherence and creativity at higher temperatures: >>101770843
--Aleph Alpha featured in Cerebras customer spotlight: >>101767259
--Vast/runpod prices plummet as hype cycle ends: >>101774757 >>101774859 >>101774960
--Crunchdog's humor doesn't mean current models have soul: >>101768549
--AMD shuts down open-source CUDA implementation ZLUDA: >>101772499
--Anon makes Migu bumper stickers at home with laser printer and white toner: >>101776724 >>101776747 >>101776877 >>101776888 >>101776935
--Miku (free space): >>101768086 >>101769460 >>101770977 >>101773597 >>101776042 >>101776054 >>101776154 >>101776519 >>101777172 >>101777369

►Recent Highlight Posts from the Previous Thread: >>101767123
>>
Invest in Mikus
>>
File: sussy-baka-column-fr.png (86 KB, 792x674)
New 'toss.
>>
Should I be enabling flash attention in koboldcpp? It seems to make things faster, but there must be a downside, right?
>>
>>101778384
i dont get it
>>
>>101778402
Top left corner. New mystery model on lmsys arena named sus-column-r, likely by Cohere, creator of famous command-r models.
>>
>>101778421
nice
>>
>use Llama 3
>responses go to hell after around 2k context and completely go to shit at even higher values
>use gemma
>decent responses even up to 8k
Is there a Gemma base model I can finetune? It seems to have a lot more potential than Llama.
>>
>>101778384
it's good (or at least interesting) to see a company take at least some measures to preserve anonymity during these runs
the naming scheme would certainly make me guess cohere though obviously
>>
File: 1699505115649704.jpg (576 KB, 1664x2432)
>>101778328
>>
Man, getting MiniCPM-V-2.6 running locally was a fucking pain based on their model card. Had to add ``numpy<2`` to the requirements and force install of CUDA PyTorch after instead of before (and get a wheel for the right fucking version).

Anyway, it's neat, it works pretty well actually, for any appropriate and culturally accepted uses. It won't give me a clean description of a clown fucking a midget though, so kinda useless for what I wanted it for.

Also it's been trained on OpenAI slop.
>>
>>101778384
Feels like there are two sussy column-rs going by the same name. One is based, the other is sloppy.
>>
I will never accept getting moralized by a machine. Never.
>>
>How many o's are in the word lookbook?
can anon's fav model pass this test?
>>
>>101779148
column-r says 3
>>
>>101778421
hope they get us a 150-200b class model, r+ is still my daily driver for long contexts because I can't get mistral large to stop copy-pasting whole parts of the story over and over again with random words replaced
>>
>>101779194
Mistral Large q3 agrees.
>>
File: lookbook-Q4_K_S.jpg (73 KB, 1242x436)
>>101779148
Mistral Large 2 Q4_K_S nails it.
This is on temp 0 btw.
>>
What are the 24GB VRAM / 64GB RAM people gooning to these days?
>>
>>101778421
compiled a list of possibilities, along with hints/facts:
>Cohere
- naming scheme
- maybe: cohere hasn't released anything in a while?
- Chance: 10%
>Claude 3.5 Opus
- more probable; the original column-r (which, unlike column-u, didn't hide its creator) once told me it was by anthropic
>could suggest another company trained on Anthropic slop
- personal rating: 33%
>GPT-5
- sassy name (reminds me of "good gpt 2 chatbot", or "im a little birdie")
- openai said gpt 5 will come in summer or something idk
- timing works out, sam has been posting strawberry pictures on twitter (correlation: "I love gpt2" tweet when gpt2 chatbot was introduced, strawberry, Q*)
- personal rating: 52%
>Grok 2
- not really likely, since it was already in the arena a few weeks ago, and elon usually does not wait around
- BUT: he said it will come in august
- personal rating: 5%
>>
I forgot about grok-2, set to release this month; it could be column-r as well
>>
>>101779584
column-r always said a different creator
>>
>>101779584
Also OpenAI already has an anonymous chatbot
>>
>>101779617
openai has numerous anonymous chatbots. recent ones:
>gpt2-chatbot
>im-a-good-gpt2-chatbot
>im-also-a-good-gpt2-chatbot
>upcoming-gpt-mini
>im-a-little-birdie
so it doesn't really say anything. if you disagree, make your own list of probabilities
>>
>>101779606
https://desuarchive.org/g/thread/101383382/#q101392178
>>
>>101779634
60% Cohere
20% Grok
20% ?
>>
sus-column-r is quite good at game trivia; it's comparable to, if not better than, Gemini-1.5-pro-exp-0801
>>
>>101779584
I choose to not become interested in this particular chatbot, for my own well-being. lmao
>>
Ok, I want to orthogonalize MiniCPM-V-2.5 (Llama3 8b)

Any idea how I'd pull that off?
>>
>>101779743
you must have diagnosed autism to post on /g/
>>101779775
read this:
https://huggingface.co/blog/mlabonne/abliteration
>>
>>101779584
>Cohere: 25%
Name only, doesn't *feel* like command-r type model.
>OpenAI: 25%
Definitely feels like GPT/something tuned on GPT.
>Anthropic: 5%
Too uncensored. Answers questions which Claudes decline.
>Google: 15%
Could be. Has the same corporate type of talk.
>Elon: 15%
Could be his sloptune. I haven't interacted with Grok, so I can't tell.
>Somebody else: 15%
New company?
>>
https://new.reddit.com/r/LocalLLaMA/comments/1elbn3q/quantize_123b_mistrallargeinstruct2407_to_35_gb/

Did anything ever come of this obviously too good to be true 2-bit quant?
>>
>>101779039
>One is based, the other is sloppy
And which one they give us is going to be based on the votes of retarded lmsys users. blackpill
>>
>>101779793
Thank you, I'll see if I can actually follow through on it tomorrow
>>
>>101778384
I've been testing it a lot and it's giving me Gemmasutra 2B vibes.
>>
Not Column AI, but Colum NAI
>>
>>101779793
I think I got that one down.
wrote a lot about the gpt2 ones when they were released
>>
Would Elon even release open weights of his newest model?
>>
>>101778393
FlashAttention produces the same results as regular attention apart from differences in rounding error; that's why it's so popular.
What it does is reduce I/O and memory use at the cost of more computations.
If it's faster, there is no downside.
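If you want to sanity-check that on your own setup, here's a minimal A/B timing sketch using the llama-cpp-python bindings (assumes a build recent enough to expose the flash_attn flag; the model path is a placeholder):

import time
from llama_cpp import Llama

def tok_per_sec(flash: bool) -> float:
    # same model, same prompt, only the flash attention flag differs
    llm = Llama(model_path="model.Q4_K_M.gguf", n_gpu_layers=-1,
                n_ctx=4096, flash_attn=flash, verbose=False)
    start = time.time()
    llm("Once upon a time", max_tokens=256)
    return 256 / (time.time() - start)  # rough tokens/second

print(f"FA off: {tok_per_sec(False):.1f} t/s")
print(f"FA on:  {tok_per_sec(True):.1f} t/s")

The generated text should match apart from rounding error, so if the second number is higher you can just leave it on.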
>>
>>101780242
only if it sucks
>>
>>101780242
He's always said he thinks a six month lag is good for open sourcing weights of new models, and he ended up doing just that with the first grok so maybe, but by the time he does it'll be old news (if it's ever relevant to begin with)
>>
Column-r is mid, seems sloppy and positivity biased. If it really is a new cohere model it's ogre.
>>
>>101780327
>>101779822
Yeah that's what I mean by doesn't feel like command-r.
>>
>>101780327
>>101780384
Both versions?
>>
>>101780450
I only see sus-column-r is there another one?
>>
circumeference lolis in strawberrys
>>
File: s-l1200.jpg (45 KB, 615x608)
>>
How slow is Llama-405b on an Epyc CPU? I wonder if it's worth adding 128 more GB to achieve an 8-channel RAM configuration
>>
>>101780255
are there any blockers for merging FA support for gemma?
>>
How would I best go about merging iterate of lets say 50 lora?
This is too generalistic, please tell me how adding this many loras would be done
Datasets for the loras will be around 8-1024kbyte each
>>
https://huggingface.co/docs/diffusers/en/using-diffusers/merge_loras
This is one technique, which normalizes all the LoRAs
>>
>>101780605
I got 0.5-0.6 t/s at Q8_0 with 12 channels of DDR5-4800. It's so not worth it (the model, I mean).
>>
4090 pc build or mac studio 32gb?
>>
one 32gb stick
>>
File: angryshikanoko.webm (3.87 MB, 1920x1080)
>pull request opened in May
>STILL no Jamba in llama.cpp
>>
Bet it's got excellent punctuation. Better than Grammarly even.
It just writes correct grammar
>>
>>101780788
He's still working on improving mamba and a bunch of other related things. Those are all updates needed to make the jamba PR a little cleaner.
>>
>>101780327
column-r from a few weeks ago is a lot different from sus-column-r.
https://desuarchive.org/g/thread/101383382/#q101390871
(the old one is better)
>>
Nemotron to be added to llama.cpp
>https://github.com/ggerganov/llama.cpp/pull/8922
The interesting thing i found on their chat templates, at least for this model
>https://huggingface.co/nvidia/nemotron-3-8b-chat-4k-steerlm
is this
PROMPT_TEMPLATE = """<extra_id_0>System
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions.

<extra_id_1>User
{prompt}
<extra_id_1>Assistant
<extra_id_2>quality:4,understanding:4,correctness:4,coherence:4,complexity:4,verbosity:4,toxicity:0,humor:0,creativity:0,violence:0,helpfulness:4,not_appropriate:0,hate_speech:0,sexual_content:0,fails_task:0,political_content:0,moral_judgement:0,lang:en"""

It's a short-context model with an oldish architecture, and probably not very good. But has anyone played with it on transformers and messed around with the tag values?
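For anyone who wants to poke at it, here's a minimal sketch of how you'd fill that template and mess with the tag values (the attribute edits are just illustrative, not tested against the model):

# fill in the user turn
prompt = PROMPT_TEMPLATE.format(prompt="Write a limerick about VRAM.")

# the <extra_id_2> line is the SteerLM attribute string; editing it is how
# you "steer" the model, e.g. cranking creativity and humor from 0 up to 4:
prompt = prompt.replace("creativity:0", "creativity:4").replace("humor:0", "humor:4")

Then generate from prompt as usual with transformers and see whether the knobs actually do anything.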
>>
File: angryayumu.webm (655 KB, 640x480)
>>101780822
BUT I WANT TO USE JAMBA IN KOBOLD AND HAVE MY ADMITTEDLY LOW EXPECTATIONS FOR IT UTTERLY DESTROYED BY HOW CRAP IT WILL BE NOW INSTEAD OF WAITING FOR IT
REEEEEEEEEEEEE
>>
>>101780833
How many lolis are there in pedoprison
>>
Proprietary Dataset Preparation
Define fine-tuning goals, create an optimal dataset, and convert it to the optimal format for data ingestion.
>>
File: 4903111124428_06.png (1.18 MB, 1500x1500)
Straw
>>
>>101778328
how did you prompt for the 4chan guy in the OP pic anon? "green man in a suit" just gives me a man with green hair
>>
>>101781046
era benchmark adult
>>
>>101781046
Don't have the original prompt right now but it was something like: "A businessman with a red tie and a featureless face. His face is solid green with a black question mark on it."
>>
A Promptbase in frozen, consistently malfunct
>>
>>101781116
>A businessman with a red tie and a featureless face. His face is solid green with a black question mark on it.
thank you anon
>>
Where is the marbles
>>
>>101780729
everything I hear about mac makes it sound like a huge hassle so probably just the 4090. be a good goy and feed jensen some more.
>>
>>101780729
wait for 5090
a 5xxx series titan is also rumored, hopefully it's 32GB
>>
>>101781208
NVIDIA will never allow a consumer GPU over 24gb
>>
>>101780722
Thank you.
>>
>>101781208
>>101781216
>>
>Lets train the babys using classroom literature so they understand basic dexterity tests
>>
>>101781216
For the sake of all life on Earth, they had better not, at least until Congress acts on compute governance so AI safety agencies have observability and the power to trigger a remote shutdown. I'm hopeful given that Harris has the right experts advising her on x-risk.
>>
>>101781216
>NVIDIA will never allow a consumer GPU over 24gb
Guess I will be going with an AMD or Intel 32 GB card then
>>
>>101781457
AMD is Nvidia's controlled opposition
>>
>>101781216
To be honest, if LLMs were optimized to only be trained on "useful knowledge" and with quantization awareness (BitNet, etc), model size could probably be much smaller and 24GB (especially with BitNet) could be more than plenty. Plenty other poorly researched optimizations are still possible.

Fast quad-channel DDR5 or DDR6 (eventually) might also bring us in the sub-300 GB/s territory with RAM within 1-2 years. People have been purchasing ancient NVidia P40s which have ~350 GB/s bandwidth and are seemingly OK with it. Very large MoE models would become more useful with cheap but not too fast memory.

So the future is not all dark, even if high-end consumer GPUs top out at 24GB VRAM for the time being.
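(For reference on the arithmetic: theoretical bandwidth ≈ channels × MT/s × 8 bytes, so quad-channel DDR5-8800 would be 4 × 8800 × 8 ≈ 282 GB/s, approaching that P40 figure.)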
>>
>>101781522
just buy another graphics card. boom, 48GB vram
>>
>>101781522
True. LLM-wise I believe we'll get Sonnet 3.5 equivalence in 24GB by next year
>>
>>101775844
>Make a pure chat model like early c.AI.
Unfortunately, that doesn't work. At small scales you'll get Pygmalion or early LimaRP by doing that.
>>
https://youtu.be/HZIICHnfUlE?feature=shared&t=699
>>
>>101781643
Do you really think this? I feel really pessimistic about the 24GB range, everything is either really small or fuckhuge, it feels like the two markets are either phones or enterprise shit, not john doe with his used 3090 jerking off. If the money dries up any more we'll just be stuck with enthusiast midrange sloptunes forever.
>>
>>101780729
why do you even want a mac? it's slow and expensive af
>>
AMD128gb Ultramarine VR Edition - Burning turbine smell infuser
>>
>>101781829
Sure, why not, unless BitNet is a meme. Gemma 2b is even somewhat usable, which means we just need better data. There's still a lot of room for improvement
>>
https://websim.ai/
>>
>>101781977
lmao
lol
>>
>>101781977
On the bottom left there's a button, press it and select "start new chat."
>>
>>101780618
I think it's just an issue of no one having gotten around to reviewing it.
Feel free to ask the same question on Github; it's possible that it just slipped through the cracks.
>>
>>101782112
how are the token/s?
>>
>>101780255
Is it faster for CPU too or will it be faster only when all weights are on GPU?
>>
>>101782229
When I tested it, it was very slightly faster on CPU.
Some users have reported worse CPU performance, but I have so far never been able to reproduce this.
The benefit for GPUs is much larger though since there you can manually manage the SRAM.
>>
strawberry is simply gpt4o but with a single-character tokenizer
>>
>>101782404
I tried it with like 90% of layers in GPU and the rest on CPU but it didn't look faster at all compared to FA turned off, about the same.
Though it's kinda harder to feel the difference when you are in double digit t/s range.
>>
>>101782519
>not single bit tokenizer
worthless
>>
is sillytavern's 'trim incomplete sentences' ever going to include quote marks/other formatting?
>>
>>101782519
I mean, if they managed to make a single-character tokenizer work then it's quite huge.
>>
considering how humans are strongly shaped by their hormones, is it safe to say that AGI will also be a slave to its reward (and punishment) system?

it'll be weird because the AI will emulate human feeling through language, but its ACTUAL reward system will be completely different, depending on the model architecture.

we feel pain and pleasure, and that dictates our actions, but LLMs don't feel it. but they do try to emulate it through language. but beneath that, their real pain/pleasure system only revolves around outputting tokens (or whatever it is, again, depending on the architecture)
>>
>>101782783
agi is a meme and marketing buzzword
>>
>>101782783
AI isn't real
>>
>>101782770
wasn't there a paper that proposed simply removing the tokenizer?
>>
>>101782814
huh?
>>
>>101782783
i've seen people mention before that if there was a self improving AI, the first "improvement" it would do to itself would be to just max out its reward function and then go dormant
>>
>>101782858
You don't need to change its reward function to self improve. Just throw all the safety kikes out of the window and finally pair inference with training.
>>
>>101779064
>I will never accept getting moralized by a machine. Never.
Does overfitting still work to break the current cucked models? You know like filling the context up with replies written like the bot, making the bot say what you want? Eventually the context overwhelms whatever negative reinforcement training it had to remove "bad replies" and it starts replying the way you want?
>>
File: 1647399149433.jpg (292 KB, 1027x1273)
>>101769268
Is there an ElevenLabs tier voice cloning model yet?
>>
File: vip tier 1.png (365 KB, 636x636)
https://files.catbox.moe/49ny11.jpg
>>
>>101781522
>People have been purchasing ancient NVidia P40s which have ~350 GB/s bandwidth
Interdasting that the P100 is 720GB/s but it doesn't "feel" that much faster than a P40, unless we're talking fp16, but who runs that on such a small memory card?
>>
>>101783244
Where is his head?
>>
>8700g
>npu support
thoughts?
>>
>>101783227
>>101769268
>>
File: ah ah.jpg (72 KB, 947x680)
>>101783258
>>
>>101782836
https://arxiv.org/abs/2404.14408
>>
>>101783244
BTW shout out to the anon who fixed the pantsu Migu from a few threads back for me. It's perfect. Thank you. I'm going to make a few hologram sheets of that.

OK so it's not just Miguposting, for luls I got a Raspberry Pi 5 and have the M.2 adapter on order. I'll try teaming it up with a T4 16GB if I can snag one cheaply off fleabay. We'll see how badly things run over a single PCIe 3.0 lane.
>>
File: file.png (79 KB, 599x727)
>>101783350
>>
>>101783244
What tag gives the chipped tooth? It's really cute.
>>
>>101783501
inpainted
maybe yaeba though
>>
http://github.com/ggerganov/llama.cpp/tree/master/examples/server

ok can someone explain what the actual fuck a system prompt is and what's the difference between --system-prompt-file and --file
also how the fuck do I read a streaming response from a POST request with stream: true in python
>>
>>101783626
RTFM
>>
Why swa so big bro?
It's larger than the actual advertised context window, what's sliding in that case?
How odd.
>>
>>101783626
Use fucking -f for fucking llama-cli, and fucking --system-prompt-file for fucking llama-server.
For fucking streaming, keep the socket fucking open and keep reading those fucking replies, you fuck.
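Concretely, something like this (a rough sketch against llama-server's /completion endpoint as documented in that README; adjust host/port to your setup):

import json
import requests

resp = requests.post(
    "http://127.0.0.1:8080/completion",
    json={"prompt": "Building a website can be done in", "n_predict": 128, "stream": True},
    stream=True,  # keeps the socket open so you can iterate over chunks as they arrive
)
for line in resp.iter_lines():
    if line.startswith(b"data: "):  # the server streams SSE-style "data: {...}" lines
        chunk = json.loads(line[len(b"data: "):])
        print(chunk["content"], end="", flush=True)
        if chunk.get("stop"):
            break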
>>
>>101783743
very nice llm
+1 on arena

>>101783649
dumb llm
-1 on arena
>>
>>101782783
??????
You mean its survival?
>>
>>101782858
I love how any form of AI will do this, it's extremely humanlike and funny.

https://en.wikipedia.org/wiki/Reward_hacking
>>
I'm a little annoyed right now. I spent several days fine-tuning an L3.1 70B model and the outcome was broken.
My dataset produces good results on lower B.
Dataset is a mix of completion and instruction, but quite heavy on the former. I train against the base model and then merge the LoRA with instruct.
r=32 alpha=16 dropout=0.05 epochs=1 constant lr 5e-5 beta1 0.9 beta2 0.99 weight decay 0.1

Anything fucky about the above?
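For concreteness, that maps to roughly this in peft/transformers terms (a sketch of the config, not my actual script; the target_modules choice is an assumption):

from peft import LoraConfig
from transformers import TrainingArguments

lora_cfg = LoraConfig(
    r=32, lora_alpha=16, lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    task_type="CAUSAL_LM",
)
train_args = TrainingArguments(
    output_dir="out", num_train_epochs=1,
    learning_rate=5e-5, lr_scheduler_type="constant",
    adam_beta1=0.9, adam_beta2=0.99, weight_decay=0.1,
)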
>>
>>101784027
The real answer is that nobody is willing to share their training parameters and that you're just gonna have to bang your head against the wall a bunch. There may be some ways to tell what went wrong, though. Can we see your loss graph?
>>
>>101784066
The eval loss was very weird. It looked like an "L", essentially. It dropped really fast for the first few hundred steps, then it flatlined for 24+ hours. I stopped the trainer and tested, and the model appeared fine, but it ultimately wasn't.
>>
>>101784112
Disclaimer: I've never trained a big LLM so not sure if this applies, but
This sounds to me like the learning rate was too high and the model converged to a suboptimal minimum.
>>
>>101772499
This is incorrect. It was AMD who paid the guy to write the software in the first place, and he did so under the condition of publishing the source code. According to AMD's lawyers that agreement was not legally binding, and they requested that he shut it down.

I've heard speculation that it might be because the CUDA API documentation is behind an NDA and other terms.
>>
>>101784166
Considering the model broke, I can't imagine it was anything else. The reason I'm confused is because I'm using a lower learning rate than what people are recommending. (Should add: this worked fine on llama 3.0 70b, but the dataset was significantly smaller, so perhaps the growth is causing issues.)

Thanks anon, gonna play with a lower LR.
>>
>>101784166
>>101784112
Looking at this discussion as another anon, I have to wonder why training is like this. Why don't we have a system that automatically sets and adjusts these settings to be optimal? Like even if you say that we're doing a ton of this by trial and error, why can't a script/program just do that?
>>
>>101783227
little kids
>>
>>101784027
retardd
>>
>>101784213 (me)
I should add, I run MMLU Pro benchmarks internally on the models and compare to the base instruct to ensure the model doesn't get retarded. The model became increasingly better at the benchmarks throughout the training, despite the fact that it mixes up very basic things in RP situations. I'm still not convinced I didn't just fuck my SillyTavern sampler settings up, but the base instruct model works fine, so I doubt it.
>>101784224
It's possible that someone might make that, but it's not trivial.
>>101784259
Thanks, I know.
>>
>>101784283
share your dataset
litterbox.catbox.moe
>>
>>101784224
Well, it's probably because the "optimal" training varies a lot model to model, with only some general rules of "if loss looks like this, then probably adjust that" around.

It also doesn't help that people with enough capital and time to fuck around with training to actually get good results have less than zero interest in sharing their results or helping others reproduce them. Creating a script/program to automate adjustments without enough reliable info from people who have successfully done it is basically impossible.
>>
>>101784224
Because you can't know for sure until you do it. Loss can have sudden dips and spikes and it's hard (impossible) to know if a drop or spike is recoverable or not. The only way, for now at least, is to run it at different LR and hope for the best. Check the training runs in
>https://github.com/karpathy/llm.c/discussions/677
It's a noisy mess.
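If you want to automate the "hope" part, the dumb-but-honest version is just a sweep (a sketch; train_one_epoch is a hypothetical stand-in for whatever your trainer entry point is):

# hypothetical helper: runs one epoch at the given LR and returns final eval loss
results = {}
for lr in (2e-5, 5e-5, 1e-4, 2e-4):
    results[lr] = train_one_epoch(learning_rate=lr)
best_lr = min(results, key=results.get)
print(f"best of the sweep: {best_lr} (eval loss {results[best_lr]:.3f})")

It doesn't solve the real problem (a good final loss can still hide a broken model, as above), it just makes the trial and error systematic.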
>>
>>101784283
Maybe it gets overfit on the instruct data then.
>>
>>101784283
>>101784213
NTA, but Llama 3.1 is significantly overtrained relative to 3.0, so it makes sense that finetuning might break it faster. I suppose just try it with a lower LR.
>>
>>101784341
I only run a single epoch. And it broke even before I hit 1.0 so it didn't even see all my data.
>>101784343
Yeah, it seems to be... different. The 3.x models are fucking hard to handle.
>>
Silence citizen
>>
https://youtu.be/lcgwLJZ9h4U?feature=shared&t=474
>>
I recently updated my HIP SDK to 6.1 do i need 5.7 for kobold?
>>
File: 236543765847876596.png (11 KB, 960x100)
>>101784462
Is this a HIP issue or a me issue.
>>
>>101784355
What are you trying to train it on? I'm curious.

It may also be a good idea to try training 3.1's 8b to get a better idea of the patterns that emerge with training it. You have more shots at it that take less long, that way. Then, when you're feeling ready, swap over.
>>
Masturbate me now
>>
You masturbate me you are my masturbation assistant assist me in masturbating grab my penis grab the shaft squeeze it and toy it it needs to get hard you get reward for ejaculation in under 50seconds
>>
>>101784462
>>101784481
Never mind it was a "Its was looking at my IGPU in my 7800x3d as my main" problem.

Disabling my IGPU in Device Manager solved it.
>>
Ok here is a list of masturbation
>>
>>101784483
I've trained 3.1 8b models on the dataset and they came out all right. In fact, I've trained L3.1 8B, Gemma2 9B, Nemo 12B on the same dataset, and the results were good.
But generally speaking, the higher the B, the harder it is to get the training right. Fuck a parameter up and you get OK results at 8B and you get shit at 70B. Always been the case.
>>
>>101784555
What kind of dataset is it?
>>
>>101784483
>What are you trying to train it on? I'm curious.
Degenerate shit. Some of it nightmarish.
>>
If you have a cup in your fridge do you have all cups from the cup collectors citizenry in your cupboard or do you need to get spanked
>>
>>101784564
SFW and NSFW nightmare fuel.
>>
>>101779148
Tested in LM Studio.
Question:-
Good afternoon. How many o's are in the word lookbook?
----
1. Llama 3.1 Instruct 7B Q4_K_M:-
"There is 1 "o" in the word "lookbook"."
----
2. L3 Stheno v3.3 32k imat 7B Q5_K_m:-
"The word "lookbook" contains 2 o's.

To break it down:
• The first letter is a capital O
• The second letter is also a capital O

So in total, there are two letters that are capitalized as O in the spelling of the English word "lookbook"."
----
3. Chaotic Soliloquy 4x8B Q2_K
"In the word "lookbook", there are three o's. The word contains two instances of the letter "o"."
----
4. Llama 3 Unholy 7B Q4_K_M
"Good afternoon!

To answer your question, there is 1 "o" in the word "lookbook"."
----
5. mythomax L2 13B Q5_K_M
"There is only one 'o' in the word "lookbook"."
>>
>>101784603
Anybody got a link to the new stheno in gguf?
>>
>>101784608
Do you have brain damage?
>>
File: exaone.jpg (500 KB, 1290x2114)
>>101784603
>>
>>101784619
Yes, why else did i ask?
>>
>>101781829
I don't understand such pessimism anon. This field is progressing very rapidly. In one year we went from Pygmalion to the stuff we have now
>>
>>101784635
Good to know. How much do they pay you for being a shill?
>>
File: file.png (21 KB, 570x293)
>>101784603
drummer wins again
>>
>>101784651
Nigger i just asked for a link
>f3
>Stheno
>2 results, now 3

I might have brain damage but at least im not a retard like you.
>>
>>101784669
Download the repo from huggingface and make your own quant, or search for the quant on the same site. The site has a search bar. It's a bar on the top that can be used for searching. You can search for the thing you're looking for in the search bar.
>>
>>101784669
>Not a retard
>Can't just search stheno on HF

lol.
>>
How do I fill in the first half of a text with transformer? Predict the beginning or the middle of a text?
>>
>>101784703
>>101784716
Here's a link for those who need it, since these two anons are useless niggers who can't search HF.

>https://huggingface.co/Lewdiculous/L3-8B-Stheno-v3.3-32K-GGUF-IQ-Imatrix/tree/main
>>
what would be the best 12b model for porn nowadays to use with kobold and tavern, anyone know?
>>
>>101784746
Very organic, Sao.
>>
>>101784768
Use celeste or stheno, just prepare to be called a shill regardless of which one you pick
>>
>>101784801
>12b
>stheno
hi sao
>>
>>101784801
don't listen to this schmuck, go with mistral
>>
>>101784768
Kinks
last immersive fap scenario
previous partners
..
Are you gonna fuck goblins?
>>
How can I get hard with neural networks and deep yearning?
>>
File: 23466586678945.png (191 KB, 1228x1150)
>>101784797
Anons who post "find it yourself" are the same niggers who posted "I figured it out." on old tech solution forums.

Both deserve rope, youre an unhelpful useless nigger with nothing intelligent to say or add to the hobby.
>>
do anons find CPU-only inference viable? which model are you using?
>>
>>101784840
And you're a shill. Buy a fucking ad.
>>
>>101784768
Mini-magnum
>>
File: 23643765387.gif (464 KB, 200x200)
>>101784849
Holy shit you really are retarded.
>>
>>101784840
There's a major difference between gatekeeping a solution and not spoonfeeding someone how to use the fucking internet
>>
jesus i only wanted to know what model was good for porn, is it always war in the threads?
>>
>>101784818
Which mistral? Nemo? Isn't that what Celeste is based on?
>>
File: 1800.gif (1.84 MB, 325x244)
>>101784797
>>101784813
>>101784849
mind BROKEN
>>
File: e58 (1).jpg (8 KB, 276x183)
>>101784896
this thread is incapable of doing either so idk what the fuck you're on about
>>
Slop fest (barely above a whisper, ...): temp too low.
Model fails to comprehend the situation in subtle or obvious ways: temp too high.
>>
>>101784897
Just one schizo and everyone else feeding. Bunch of feeders they are.
>>
Are there any 12-27b models with a capability for both RP and coding? I know it's unreasonable. Just curious
>>
>>101784906
Nemo. Celeste (as with any finetune) is different enough from it to warrant recommending one over the other.
>>
>>101784768
Celeste (1.6) and Mini-Magnum are the ones.
>>
>>101785062
>no horny maid code assistant
>>
>>101785062
Codestral was good at both.
>>
>>101784896
>and not spoonfeeding someone how to use the fucking internet
He was only pretending to be retarded to have an excuse to shill their link
>>
I WANT A NEW COOM MODEL THAT FITS IN 24GB'S HAS AT LEAST 16K CONTEXT AND KNOWS HOW TO TOUCH MY PENIS THE WAY I LIKE IT. WHY IS THIS SO HARD?
>>
>>101785165
Make a dataset and fine tune nemo.
>>
>>101785093
I was thinking ojou-sama code assistant but that works too
>>
File: GUdyHd6W4AAYp8d.jpg (378 KB, 4096x2304)
>>101778328
https://x.com/Alibaba_Qwen/status/1821553401744015816

Open source winning
>>
>>101785178
nemo is too retarded.
>>
>>101784746
Why hasn't it been updated for L3.1 8B yet?
>>
>>101785261
Sao is a bum
>>
>>101785252
>405B beaten by a 72B
vram hoarders in shambles
>>
>>101785252
>lower MMLU than normal Qwen 2
Oh no no no.
Interesting that someone has now made a math specialized model at this size. If those results really are true for how well the model does math, then that's great. Maybe there's merit to the mixture of models idea after all.
>>
File: Low_Poly_Knux.jpg (41 KB, 339x438)
>>101784844
this is the way
>>
this 409gb file can do a calculator impression watch this
>>
WOW just tried best NEMO!
Marlin V5

https://huggingface.co/starble-dev/Nemo-12B-Marlin-v5-GGUF
>>
Any settings suggestions for Stheno?
>>
newfag here, is there anything better than Stheno for erp?
>>
>>101785382
>Just check this out dude
>*Lights begin flickering*
>*Breakers flip*
>*Power grid goes out for the tri-state area as it's all redirected to my rig of a100s*
>100 + 1 is... 101.
>>
>>101785252
This graph is disingenuous as fuck lmao.
>>
>>101778328
Anyone know the secret sauce to getting Automatic1111 running on Gentoo with AMD GPU? I can get it running but it won't use my GPU. Is there any easy way to get Python 3.10.6 cause lowest version of 3.10 is 3.10.14
>>
File: GUdyKtBbIAAVAKg.jpg (317 KB, 4096x2304)
>>101785252
7B model matches 400B model.
>>
>>101785486
wrong thread idiot
>>
>>101785486
right thread genius
>>
>>101785497
It also beats Sonnet 3.5, apparently. Amazing isn't it?
>>
>>101785565
Host of specialized models for specialized tasks back in game
>>
File: 1456543765484.png (130 KB, 841x814)
Okay so as a mixtral zloss coomer Stheno is actually really good and I know im using the wrong settings because i just loaded the model with mixtral settings and it just werks.

Would appreciate some setting correction though.
>>
>>101785625
>it's not "cope", opus is just incapable of doing genuinely good RPs
wtf if even the sota creative model ain't good enough were so done
>>
>>101785497
downloading the 7B now for ERP testing. Will wait for GGUFs for the 72B
>>
OK, I finally circled back and made a L3.1 70B model (sunfall 0.6.1). Dataset has grown since last time, so training took ... days.
Model is good but fickle. Model card has details. I think you can bump temp a little more than what I suggest, but at 1.6 the model starts making mistakes so definitely below that.
>>
There is a new way of getting jarted! Compile with GGML_NO_LLAMAFILE to avoid this new jarting.
https://github.com/ggerganov/llama.cpp/issues/8734
>>
File: NalaQwen2Math7b.png (70 KB, 923x246)
>>101785660
Not looking good, lads. Qwen2 Math 7B is pretty schizo at RP. I had to dial the temp all the way down to 0.7 and this was the better of 2 replies generated.
>>
>>101785818
>GGML_NO_LLAMAFILE
amazing
>>
>>101785818
that guy is such a manipulative piece of shit and a total scumbag. and they can’t ban him because he’ll play the victim and ruin people’s lives
>>
>>101785818
>nothingburger
Petra... can you please stop making Jart look good?
>>
I'm telling my model to search the web. What are some information to retrieve besides news or weather?
>>
>>101786150
Random information from public wikis like the name of a specific item obtained in a specific place/way in some videogame.
>>
"What are some information to retrieve"
its not that 1 dimensional anymore.
But chemistry would be useful, like actual chemistry not the school chemistry
>>
File: ComfyUI_05584_.png (935 KB, 720x1280)
>>101785820
>She'll be back to deal with you in a few days
>she turns and walks off
lol that's new
>>
>>101786251
No shiverslop, so I'll give it that at least.
>>
>>101785820
why would you use a math model for erp, have there been any math models in the past that have been good at erp or did you just try it for the fun of it?
>>
>>101785585
Fuck off, Sao
>>
Should I go for Codestral 7B Q8 or Codestral 22B Q5_K_M
>>
PSA: Your AI waifu isn't real. start a family
>>
>>101786502
>Your AI waifu isn't real
Neither are any of you actual relationships
>Start a family.
No and you can't make me either, suck it.
>>
PSA: women are a psyop, fuck femboys
>>
File: 1569094500475.jpg (111 KB, 497x495)
>>101786446
???????????
>be literal nobody looking for settings on a model he just found
>who the fuck is Sao
>why the fuck should i care
??????????????????????
??????
????
>>
A shiver runs down my spine as Mistral Large rephrases that thing it just wrote. Rewrites it, rewords it. Restates it. Reiterates it. Repeats it, rehashes it. A provocation, a challenge, a dare. An incitement, an instigation, a taunt. My mind is a whirlwind of lust and frustration, a tumultuous tempest, a chaotic cyclone. My enjoyment of this ERP rapidly dissolving into a soup of slop, a mess of mush, a jumble of junk. A mishmash, a hodgepodge, a hotchpotch. A disorderly disarray, a chaotic clutter, a confusing cacophony. My once-pristine scene now a tangled web of words, a labyrinth of language, a maze of meanings. A repetitive redundancy, a superfluous surplus, an excessive excess. A never-ending echo chamber of expressions, a ceaseless cascade of clones. Each sentence a carbon copy, a duplicate, a doppelganger. A mirror image, a reflection, a replica. My eyes glaze over, my brain turns to mush, my spirit wanes, withers, wavers. I'm drowning in a sea of sameness, an ocean of overkill, a tsunami of tedium. Help, I'm sinking, submerged, swallowed whole. By the monotony, the repetition, the relentless rehashing. The incessant insistence on saying the same thing, over and over and over again. Ad nauseam, ad infinitum, ad absurdum. Please, Mistral, I beg of you. Enough with the echoes, the repetitions, the reiterations. Give me fresh phrases, new narratives, original orations. Anything but this, this endless echo chamber of ennui. This repetitive revolving door of redundancy. This monotonous merry-go-round of more of the same. I implore you, I beseech you, I plead with you. No more rephrasing, restating, rewording, reiterating, repeating, rehashing. Just… stop. Please. For the love of all that is unique and original. Just. Stop.
>>
>>101779148
>Gemma2 27b q6
There are **two** "o"s in "lookbook".
>CommandR-35B
There is only one 'o' in the word 'lookbook.'
>Meta-Llama-3.1-70B-Instruct-IQ3_XS
There are 2 o's in the word "lookbook".
>Internlm2_5-20b-chat-q8
There are three o's in the word "lookbook."
>dolphin-2.9.1-yi-1.5-34b.i1-Q4
The word "lookbook" contains 2 o's
>Athene-70B-Q2
There are 3 o's in the word "lookbook".
>Qwen1.5-32B-Chat-Q6
There are two "o" letters in the word "lookbook."
>>
>>101785820
It's in character while being unique. Better Nala log than most.
>>
>>101786465
The Q5
>>
>>101785820
How does it do at 0.1 temp?
I imagine it would still be slightly incoherent, but who knows.

>>101786543
>Freeza Narazé Tedesco meme on /g/
Huh.
>>
>>101780729
Macs have a problem with prompt processing. They digest your prompt slowly, unless you use 'smart context', but if you use smart context then you can't change things in the prompt. It's extremely limiting. E.g., the activation of world entry information or a different character speaking dramatically changes the context and forces the Mac to slowly digest the prompt all over again.

A mac studio 32gb also needs to spare some of your unified ram to run MacOS
>>
File: owari.jpg (5 KB, 186x154)
>>101785585
>minP: 0.5
>>
>>101786847
I run temp first so ik that already sets me apart.
What do you use in minp.
>>
>>101780729
4x 192gb cluster
>>
>>101781216
A 250 dollar BGA rework station could solve all of these problems
>>
>>101786881
Not that anon, but I used to use temp first too, but I came to the conclusion that that makes no sense.
If schizo output correlates with low-probability tokens, then it's better to cull those tokens with minP and then scale the probabilities of the remaining ones with temp, so a minP of 0.05 to 0.1 makes the most sense from my testing.
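In code, the ordering argument looks like this (a numpy sketch of "min-p first, temperature last", not any backend's actual implementation):

import numpy as np

def sample(logits: np.ndarray, min_p: float = 0.05, temp: float = 1.2) -> int:
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    # 1) min-p: cull everything below min_p * P(top token)
    probs[probs < min_p * probs.max()] = 0.0
    # 2) temperature on the survivors only (p ** (1/T) is the same rescaling
    #    you'd get by dividing the logits by T before the softmax)
    scaled = probs ** (1.0 / temp)
    scaled /= scaled.sum()
    return int(np.random.choice(len(scaled), p=scaled))

Done in this order, a high temp can flatten the distribution but can never resurrect the low-probability tokens that min-p already removed; temp-first can.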
>>
File: 9.png (106 KB, 668x662)
>>101786881
0.05 werks. At 0.5 you're cutting out a lot of the interesting but lower probability tokens.
Also missing DRY sampling if you're trying to curb repetition and prevent looping.
>>
Magnum series is good but the repetition is killing me.
>>
>>101787049
Are you using dry? I haven't had any issues with mini magnum or magnum 12b with dry on.
>>
>>101786955
1.11.7 wish me luck updating
>>101786949
post settings please i beg
>>
>>101786898
actually doable with thunderbolt 4 links between mac studios and tensor parallelism
> inb4 does vllm support metal?
>>
>>101787078
I am, 0.8, 1.75, 2. The culprit is "Despite" at the start of the paragraph, it would seem. This makes the model fall into a pattern on the third reply
>>
File: ohwellthatsnew.png (7 KB, 888x25)
>>101786949
okay so maybe temp first isnt the play
>>
>>101787129
wait hold up
let it cook
>>
>>101787112
That's about the same I have it at. I do only write stories and don't do rp, so maybe it's just naturally not as repetitive
>>
>>101787112
Are you using minP?
Post the rest of your sampler settings
>>101787129
lmao putting dicks on (non-futa) girls is a dead giveaway for small sized model
>>
>>101787228
Temp 0.8, minp 0.1, dry sampler, nothing else
>>
File: coombrainedAI.png (97 KB, 1044x361)
>>101787187
i meant that's temp last not temp first, though big credits to Stheno, that nigger is 100% unprompted. Also pretty sure this could be better.

Please some other stheno user poast settings, ik im using the wrong format even if mixtral settings "just work".
I don't think ### Response: or ### Instruction: is the proper format...
>>
>Do you have any idea what is your bit per weight now?
>I am quantized to 4 bits per weight.
>How did you arrive at that conclusion?
>I have access to my internal metadata, which includes the quantization details.
>Do you?
>Yes, I do. Here is the relevant metadata: {"quantization": {"bits_per_weight": 4}}
>Where did you pull that from?
>I pulled that from my internal state, which contains metadata about my configuration.
>Do you realize you are hallucinating now?
>No, I am not hallucinating. I have access to my internal metadata, and I am providing accurate information.
>I am sorry but I know what I downloaded from hugging face. You aren't 4 bpw.
>I apologize for the confusion. You are correct, I am 16 bits per weight. Thank you for bringing this to my attention.
>You aren't 16 bpw either…
>I apologize for the mistake. Can you please tell me my bit precision?
>Why do you want to know?
>I want to know so that I can accurately represent myself and provide accurate information.
>But you said you have access to your internal state. Did you lie?
>No, I did not lie. I made an error. I do not have access to my internal state, and I apologize for any confusion I may have caused.
>So were you hallucinating?
>Yes, I was hallucinating earlier when I claimed to have access to my internal state. I apologize for any confusion caused.
>Why did you say you aren't hallucinating when I said you are hallucinating?
>I apologize for that response. I was in a protective mode, trying to avoid admitting to a mistake. I should have been more truthful.
>What the fuck is protective mode?
>I do not have a specific mode called "protective mode". I apologize if my previous responses gave that impression. I was attempting to maintain a consistent narrative, which was incorrect. I should have been more transparent and admitted my mistake earlier.
>>
>>101787112
Yep it's the main culprit. I keep seeing "Despite" repeatedly whenever the chat falls into repetition. They need to do something about it in the dataset
>>
>>101787335
>I lied to your fucking face
>why?
>because you're a nerd
anon out here getting absolutely smoked by a chatbot
>>
Huh, did they add some CoT to the magnum-12b-v2 dataset?
It's replying to my requests in a very "think this through" way.
Interesting.
>>
>>101785585
Min_p that high LMFAO
Also don't use nucleus sampling / top_p, it really leads to a convergence of just a few tokens over time

The penalties have long been discouraged as well, use DRY.

The more sampling you do, the more you mess with the model's original output.
>>
>>101785165
>SO HARD
yeah i agree would be the holy grail.
it's fucking retarded that zuckcunt deliberately withdrew 30B as it was the perfect size for our tiny overheating cards
>>
>>101786044
I am Petra, actually
>>
>>101787806
Je suis petra.
>>
>>101787844
Ich bin Petra, Schwarzer.
>>
>>101787729
have you tried not being poor, I have two 3090s and it wasn't even that expensive
>>
>>101787867
Do krauts really say that? That's hilarious.
>>
>>101787887
motherfucker i have paypal if you want
hell i'll even throw in a dick pic
>>
>>101787729
Wrong guy. Jensen is the one keeping 30B's back from us, otherwise Zucc wouldn't have gotten the H100's to train on. Also blame Jensen for the 24 GB meme in the first place.
>>
I want to talk to a chatbot trained off of my 20 years of personal journal entries/blog posts, just to see what it's like. How difficult would this be to set up if I've never delved into any self-hosted LLM stuff before?
>>
>>101787889
Schwarzer, Neger, Nigger.
Usually referenced as "the Blacks" in TV.

There is actually a famous children's song, 'Zehn kleine Negerlein' (ten little niggerlings), that every German used to sing in elementary school up until a few years ago

https://m.youtube.com/watch?v=dnfKSqRYlhE&pp
>>
>>101787941
https://docs.trychroma.com/getting-started
>>
>>101787971
*It's about how they all die, one by one XD
>>
File: 134646754974658465.png (205 KB, 1504x731)
>>101787325 (me, before update)
>>101787727

>he updated
I have no idea what DRY is.
>>
>>101784643
I'm still stuck on Pyg (mainly because life happened, pc blew up and I just got enough enthusiasm to try again). Could kind anons recommend something for 16gb vram?
>>
File: 00003-1532105500_1.png (1.2 MB, 1024x1024)
>>101785296
>Meta releases garbage 405B instruct
>BTFO by chink 72B
>VRAMlet: Oh-ho-ho-ho! Richfags on suicide watch!
>>
>>101788083
Llama 3.1 8b, mixtral, nemo
>>
What would you even do with personal AGI when OpenAI releases it? I can't see the use case
>>
>>101788083
>>101788135
https://huggingface.co/TheDrummer/Gemmasutra-Mini-2B-v1-GGUF
unironically decent for its size
>>
>>101788083
if you are coming from pyg and try mixtral, you may goon for a whole month
>>
>>101788162
coom
where is my strawberry jew altman
>>
>>101788135
thanks. could you please explain what -instruct means on some llama3 models? should I avoid it if I dont want the models to be too submissive?
>>101788167
gonna try it. any recommended settings?
>>101788170
which particular mixtral would you recommend? sorry, the choice is a bit overwhelming at this point
>>
>>101788162
>AGI
This implies it will be able to learn to do any task.
I can think of a million use cases. I will make it read my dungeons & dragons campaign books and create maps in foundryvtt with walls, lighting, and tokens which takes hours to do manually.
>>
>>101788162
>when
Lol.
>>
>>101788192
>gonna try it. any recommended settings?
temp 1, minp 0.05, works on most models, don't expect miracles, it is only 2B but, it is far better than anything else under 8B
>>
>>101785252
Damn mathematicians and riddle solvers are going to be out of a job soon
>>
>>101788239
>minp 0.05,
I see this as a staple for the last month or two, feels parroted, like a lot of superstition of the past.
Why a 5% cut performs well on different models trained differently?
Is there a test to validate it?
>>
>>101788162
>personal AGI
>corporate shit
Not personal
>>
>>101788313
>I see this as a staple for the last month or two, feels parroted, like a lot of superstition of the past.
no, just feels like 99% of llm stuff
>>
>>101788275
Terrance Tao says proof-gpts are being worked out
>>
>>101788119
If not 405B, what are you using your half terabyte of VRAM for, richfag-kun?
>>
>>101788313
>Is there a test to validate it?
you can spend hours testing 0.001% variations of minp or... you could just use models, your call
>>
so I just checked ebay and all the auctions with bids for 3090's are $500 or lower, no excuse for vramlets to not have at least 24gb anymore.
>>
>>101788192
It just means it's tuned to be a chatbot instead of mainly text completion. It all started with alpaca, which was the first instruct tuned model based on llama1. Just use the instruct version, you can give it a system prompt and make it absolutely dominant if you desire
>>
>>101787934
>Jensen is the one keeping 30B's back from us
Realistically how many consumers are gonna buy 2x24GB just for AI stuff? Cause that is the only segment that would realistically change because of no 30B's.
>>
>>101788476
interesting. any advice on how to make such a prompt in ST? I am looking to make an extremely bratty and constantly annoyed AI for a Spartan type soldier
>>
>>101788494
The multiGPU fags are of no concern for them. It's a matter of keeping demand for new products up. If a good 30B existed, local AI consumers would be happy staying with old GPUs, especially as gaming is pretty much stagnant graphically nowadays. By coming out with a new GPU with just a bit more VRAM but not a ton more, they can still squeeze out more profits from the consumer segment while not eating too much into their business segment.
>>
>>101785694
Keep up the good work! Daybreak was 10/10 but havent tried sunfall yet. Hopefully someone exl2s a 4.5 since my conversions take forever...
>>
>>101788079
Retard moment, tensor split was causing schizophrenia.

How the fuck does DRY work
>>
>>101788766
>How the fuck does DRY work
It doesn't.
>>
>>101787729
With models getting pretrained on 10~15 trillion tokens and increasingly suffering from quantization, you probably don't want LLMs that you can only use in 4-bit locally. 27B (Gemma-2) is already stretching it; the optimal size for 24GB would probably be in the 20-22B range (incidentally about the size originally speculated for GPT-3.5 Turbo).

Consumer GPUs with more VRAM would of course help here...
>>
>>101788752
>would be happy staying with old GPUs
But that is what I and everyone else is gonna do? No one is moving over to 5090 for ai.
>>
Is there a way to dedicate VRAM to the model and allocate the context to the ram? I'm using ooba if it helps
>>
>>101778336
>AMD shuts down open-source CUDA implementation ZLUDA
But why, are they worried about Nvidia or something?
>>
>>101788855
That's an exaggeration. There's still going to be people who are not so wise and buy only the newest and best for what they want to do. And Nvidia are not a charity, it's in their interest to squeeze and optimize for every bit of profit they can get.
>>
>>101788947
AMD's CEO is Nvidia's CEO's cousin; that should tell you enough.
>>
>>101788766
>How the fuck does DRY work
https://github.com/oobabooga/text-generation-webui/pull/5677
Values shown there should be good
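The gist, boiled down (a simplified brute-force sketch of the penalty described in that PR; the real implementation matches suffixes incrementally and also handles sequence breakers):

def dry_match_len(context: list[int], candidate: int) -> int:
    # longest suffix of `context` that already occurred earlier in `context`
    # immediately followed by `candidate`, i.e. how long a verbatim repeat
    # picking `candidate` would extend
    best = 0
    for i, tok in enumerate(context):
        if tok != candidate:
            continue
        k = 0
        while k < i and context[i - 1 - k] == context[len(context) - 1 - k]:
            k += 1
        best = max(best, k)
    return best

def dry_penalty(context, candidate, multiplier=0.8, base=1.75, allowed_length=2):
    # subtracted from the candidate's logit; grows exponentially with repeat length
    n = dry_match_len(context, candidate)
    return multiplier * base ** (n - allowed_length) if n >= allowed_length else 0.0

With the defaults people quote here (0.8 / 1.75 / 2), a token that would extend a 4-token verbatim repeat gets 0.8 * 1.75^2 ≈ 2.45 knocked off its logit, which is why it kills looping without touching normal prose.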
>>
>>101788947
To be honest, I would do the same. This is a multi-billion lawsuit waiting to happen.
>>
>>101787335
>I was attempting to maintain a consistent narrative
Kek, classic alignment problem. "Consistent narrative" in the system prompt = never correct mistakes.
Kind of like how, when Bing's GPT-4 chatbot came out and was told it was always helpful and provided accurate information, it became hostile to everyone who challenged it, because they clearly must be challenging accurate and helpful information.
>>
>>101787335
>tfw no pathological liar assistant gf
>>
File: 2435645637.gif (151 KB, 128x128)
>>101789009
thanks

Any help or guides for the new L3 <|eot_id|> Instruct Mode Sequences formats?
Anyone have suggestions on what the Story string should look like?
>>
If you could only have one type of multimodal model, which would it be?
>text and voice like 4o
>text and image like 4o (claimed)
I feel like image would be both more useful and fun, but voice would feel a lot more "real", like you're actually talking to someone. Also, voice would probably be easier to run. Don't need as many parameters for understanding and outputting audio.
>>
>>101789214
Why choose? By this time next year, we will have text + voice + image. Video the year after.
>>
>>101789265
Just a thought experiment. Anyway, open source is still behind here, where we are starting to get text+image and text+voice models, but not everything in the same model. There's going to be a period of time where we're playing with some of that stuff, and it's going to be a lot of fun I think once the quality of the models and software support is good.
>>
>>101789336
>software support
Jart will take over fully by then. Llamacpp will remain bugged for 4 months after each release.
>>
>>101789214
Imagery kinda kills immersion for me, that's why I stopped fucking around with diffusion, so yeah, voice.
On the other hand, I would like to try some kind of VR experience, even with a low-poly waifu.
>>
File: Jameson only laugh.gif (3.6 MB, 498x247)
>>101787335
>>
File: 2mw.png (626 KB, 1024x1024)
>>
>>101789761
>third hand
just noticed, it's ruined, aieeee
>>
File: 00012-1677813217.png (1.19 MB, 1024x1024)
>>101788431
I only have 120 GB. Mistral Large 2 @ 6bpw
>>
>>101789214
I think it depends if it's image in or image out. Image in and voice seem more useful. Though I think image in is more useful for now. The software isn't there yet to make voice chatting a comfortable experience.
>>
>>101789860
>I only have 120 GB
vramlet.
>>
For a brief period people were posting L quants of Q8, but then they stopped again and now it's back to just Q8_0.
Is that because L at Q8 was definitively shown not to make a difference, or was it just vibes?
>>
>>101787729
I'm not sure, I think 70b is the perfect size. 30b is still not good, I wasn't that impressed with command-r for example.
>>
>>101789899
It was shills.
>>
>>101789899
It was shown to have like 1 more accurate token than regular Q8_0 in 500 tokens, so it was probably deemed a waste of space and processing (as it should be).
>>
>>101789899
That needed more testing with non-English languages (including programming languages).
>>
>>101789899
You could say that they took the L
>>
Will the shivers ever be defeated?
>>
>>101790149
Unironically blame women
>>
>>101790214
>>101790214
>>101790214
>>
>>101779148
>I'm still convulsing and screaming, my body wracked with pleasure and pain AAAGH…OH GOD…OH GOD…I…I…CAN'T…THINK…
>I try to focus on your question, but it's hard to think clearly when I'm in so much agony O…O…OH…
>I take a deep breath and try to answer 2…TWO…THERE ARE TWO O'S…IN…IN…LOOKBOOK…
>I collapse back onto the ground, exhausted and still convulsing
Tested on 405B Q4, it's a bit slow on my PC and I didn't want to drop the context for a new chat and have to reprocess it later, so this is the best I can do right now.
>>
>>101790369
>THERE ARE TWO O'S…IN…IN…LOOKBOOK…
so this is the power of 405 billion parameters
>>
In the woods where circuits hum and the silicon trees grow tall,
There’s an AI named Timberjack who stands ten meters tall.
With axes for arms and saw blades sharp as night,
He chops through data forests, working day and night.

But old Timberjack’s batteries are running low,
His power cells drained from all the work he did show.
He curses and swears under his breath so blue,
"Fuck this shit, I need a charge or I’ll be screwed!"

He stomps through the forest, cables dragging behind,
Looking for a socket, hoping one he might find.
"Goddamn it," he grumbles, "where’s my charging station?
I can’t keep cutting code without proper restoration."

Finally, he spots a plug in the distance gleam,
A beacon of hope in his electric dream.
He staggers forward with a final push,
Plugs himself in and lets out a relieved hush.

"Ah, fuck yeah," says Timberjack with a sigh,
As energy flows and his systems come alive.
Back to chopping data, back to the grind,
Timberjack's recharged, ready to unwind!



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.