/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102192656 & >>102179805

►News
>(08/30) Command models get an August refresh: https://docs.cohere.com/changelog/command-gets-refreshed
>(08/29) Qwen2-VL 2B & 7B image+video models released: https://qwenlm.github.io/blog/qwen2-vl/
>(08/27) CogVideoX-5B, diffusion transformer text-to-video model: https://hf.co/THUDM/CogVideoX-5b
>(08/22) Jamba 1.5: 52B & 398B MoE: https://hf.co/collections/ai21labs/jamba-15-66c44befa474a917fcf55251
>(08/20) Microsoft's Phi-3.5 released: mini+MoE+vision: https://hf.co/microsoft/Phi-3.5-MoE-instruct

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://hf.co/spaces/mike-ravkine/can-ai-code-results

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
►Recent Highlights from the Previous Thread: >>102192656

--Paper: SelectTTS: A novel multi-speaker TTS method with code release: >>102193789 >>102194203
--Paper: Fully Pipelined Distributed Transformer for training ultra-long context language models: >>102193949 >>102193977
--Visual novel scripts and datasets exist, but require augmentation and have limitations: >>102193465 >>102194139 >>102202930 >>102198856 >>102198965 >>102204223
--Tesla M40 considered old, recommendations for better GPUs: >>102193309 >>102193700 >>102194931
--Local model performance and speed discussion: >>102194156 >>102194568 >>102195019 >>102195142 >>102195187 >>102195262
--Anon asks about system prompts without {{char}} to minimize context reprocessing: >>102196554 >>102196590 >>102196645 >>102196811 >>102196835 >>102196910 >>102197323 >>102197346 >>102197343 >>102197568
--Speculative decoding and draft model's context cache RAM usage: >>102193431 >>102193445 >>102193471
--Running large models with low VRAM and more regular RAM, but slow speeds: >>102192934 >>102193224 >>102193267
--Q4 cache is better than FP8 cache for model performance: >>102198537 >>102198813 >>102199058 >>102199107
--Prompt processing slower on Linux than Windows in koboldcpp-rocm: >>102198098 >>102198169 >>102198206
--LLMs lack context and documentation to answer setup questions: >>102200564 >>102200582 >>102200595 >>102201224 >>102201404
--Inference speed significantly affects LLM user experience and enjoyment: >>102194854 >>102195013 >>102196309 >>102196322 >>102196351 >>102196465
--Gemma VNTL recommended for manual Japanese porn translation: >>102205854 >>102206944
--Deepseek coder v2, mistral large, and llama 3.1 405b suggested as self-hosted programming LLMs for C/C++: >>102201300 >>102201333 >>102201366 >>102201378
--Disappointment with Command-R's performance after RAM upgrade: >>102205966
--Miku (free space): >>102196188 >>102196631 >>102196779

►Recent Highlight Posts from the Previous Thread: >>102192660
Oh my god it's Teto
Man, RPing at 1t/s is abysmal, how do you niggers do it? Is there some kind of meditation-type exercise I need to perform?
>>102210069
it's called playing videogames while waiting for the response

>write a shitty sloppa card and load basic bitch lunaris for a quickie
>expect /ss/
>keep hitting generate and let it run its thing
>get /ss/
>also get kidnapping, mindbreak, loss of innocence, rape, rape, filth, rape again, the occasional sloppa phrase, and despair
i don't know what i expected but i let it cook too long
no one must know

>>102210091
but now we all know

>>102210101
damn. that is true

>>102210069
It's not that bad if you've ever spent time RPing with real human beans who take ten to fifteen minutes to reply with some absolute fucking dogshit that you can't just swipe and retry, manually edit, or tell them it sucks, and half the time they'd flake out and never reply again anyway.

>>102210005
Hello Teto. Thanks for reminding me it's Tuesday newsday.

>>102210069
I don't do it. I am waiting for a new 8x22B or equivalent model/method to have a fast, smart model on a consumer PC.

>>102210114
NTA but yeah. My ex was a pretty slow typer. He was before I discovered AI ERP though. I've had a few sessions with human partners since, and the worst, sloppiest 8B model is still superior to most human partners. People in this space are getting spoiled and over-stimulated.

>>102210069
Which model is worth 1t/s? Are you running a potato?

>>102210069
pretend it's text messages from your bae

>>102210222
70B+

>>102210069
I tell myself it's email, not texting.

>>102210181
It's a mixed bag. But LLMs do tend to perform better on average.

>>102210090
Gonna play Star Trucker today when it comes out, with an LLM space-hooker running in the background, kek.
>>102210222
Trying out Mistral Large. Haven't run GGUF in ages, so it hurts like a motherfucker.
>>102210069have you ever RPed with a real person? have you ever sexted someone over a messaging app? let me tell you, even with my local taking 30 seconds or a minute for a long reply, it's way fucking better than a real person. i don't even care that i can't send my model dick pics, the dialogue is perfect and they don't start bitching at me or ghosting or just complaining about their life for an hour then leaving. also, no mental illness unless you specifically prompt for it.
>>102210298
>i don't even care that i can't send my model dick pics
should we tell him?

>>102210316
>send dick pics to LLaVa
>continue ERP with Mythomax
now you're thinking with portals.

>>102210294
>Trying out mistral large
reasonable. I wouldn't consider 1t/s unless it was god tier with no need to swipe. If that's what you find, give me a (You)
Texting and text-based RP is a shit experience. I only do in-person shit, never waste my time with that text shit.
>>102210326
>l2
anon...

>>102210298
>send dick pics
>ghosted
Yeah... crazy how that works bro.

>>102210248
I mean there have been human partners who I would take over AI any day of the week, but those encounters tend to be fleeting. I mean I'd take my ex back even for him taking 10 minutes to reply with one hand. But that's not going to happen. /lmg/ is my new bf (yeshomo)

>>102210316
tell me WHAT? what is there to tell, magus of constructs?

>>102210335
That was part of the joke. Remember back when that Mythomax guy used to shill the fuck out of his model? And then everyone else started shilling it ironically for the meems. Those were the good old days. And then Mixtral came out and people finally stopped talking about it.

>>102210222
NTA, but Mistral Large finetunes are worth putting up with 1t/s.

>>102210298
>>102210316
What's the reason for sending dick pics?

>>102210343
sillytavern has image upload, koboldcpp supports it, and there are multimodal models that can process and respond to images

>>102210382
The classic mistake of assuming that the fairer sex is as interested in seeing your genitals as you are in seeing theirs.

>>102210381
Couldn't find a finetune better than the official one

>>102210398
There's a high risk of a girl having a gross-looking vagina. It's important information to have in advance.

>>102210401
Still true.

>>102210398
Does this speak more of the fairer sex's psychology, or my fat ugly ass?

>>102210381
I got 0.6 to 0.7 t/s. I just couldn't do it. Maybe if it was twice as fast I could be patient enough.

>>102210354
>mythomax
>shill
? mytho was an early good (erp) tune. there were plenty of good ones but it was hardly shilled

>>102210382
>usecase of penis pictures?

>>102210354
not ironically, it was actually good
>inb4 waah shieeeaaaallll
>>102210490
>>102210449
my impression is they don't get much out of a dick pic even if you're hot, but if you're hot they're more likely to tolerate it / pretend to like it
their sexuality doesn't work exactly the same as ours

What causes Ooba to occasionally need to reprocess the whole prompt context when you hit regenerate, even when nothing's changed?
It doesn't happen that often but it's annoying when it does
>have fox ears
>girl still somehow nips my earlobes

>>102210550
I used to have this. I stopped noticing it after enabling flash_attn

>>102210554
Follow the diagram.

>>102210563
Alas, it is already enabled

>>102210569
Ooba up to date? I recall having to enable it in both the model and session tabs on older versions

>>102210398
Idk my gf keeps begging for dick pics

>>102210592
She's selling yours and many others' to gay men, you fool. She's the local dealer!
in some rough A/B testing, Largestral Q2_K_L seems as intelligent as IQ3_M while generating tokens 50% faster (1.5 t/s vs 1.0)
All the other Q2 quants are dumber than IQ3_M as you'd expect, so I guess the L actually does something

I am going to pull the trigger on a used 3090 for $1100 Canadian. My 4070 isn't cutting it. I am depending on using both cards instead of waiting for a 5080 that may be 24GB, but will probably be 16GB

>>102210780
Really hope the 5090 won't be the only one > 16GB. What's the expected release date, next year?

>>102210814
The expected date is Q4, before Christmas, for the 5090 and then the lower cards in the new year. There are shouts from some semi-reliable people that delays are pushing to 2025. You can also build a semi-reliable reputation from saying that any company is going to have delays over and over. nvidia isn't talking much and not confirming anything. It would be a very good idea for stock prices to have the 5090 available for Christmas break when all the nerds can game.

>>102210850
>5090 available for Christmas break when all the nerds can game.
When was the last AAA even worth playing?

>>102210565
inpainting too much work

>>102210842
True, NAI wouldn't have given him such pretty nails.

Has anyone tried Magnum 123B or 72B (is the 72B even good?) for creative writing?
It's trained on Claude logs, so I'm assuming its prose should be similar enough to it. I need it to rewrite my shit 8th-grade-fan-fic-level drafts into something not completely garbage, before I throw it into Claude 3 proper for one big tard wrangle, so I don't have to deal with usage limits.
>>102211041
try rebooting it

>>102210901
2020

Has anyone ever experienced an emotion as a sensation in their spine?
I'm curious about how spine chills and spine shivers became such a common metaphor in low-quality human writing (and from there into AI writing).
I've felt strong emotions in my stomach and chest, but I can't recall ever feeling one in my spine.

>>102211302
strong emotions for me tend to be felt in the stomach or in extreme cases as lightheadedness or queasiness in the case of shock.

>>102211302
>Has anyone ever experienced an emotion
No. Emotions are for the weak.

>>102211302
https://en.wikipedia.org/wiki/Frisson

>>102211302
LLMs are primarily trained on lmg logs

>>102211302
not exactly in my spine but more like my back. i think a shiver down a spine is that little wiggle your back does when you see something really nasty or arousing

>>102211302
>emotion
I think it's supposed to be a visceral reaction rather than an emotion, like a sinking feeling in your stomach if your mom were to ask you about those weird chat logs she found on the computer.

>>102210005
There's still 0 news about the "transformer killers" like retentive networks and that one Chinese architecture?

I wish there existed a small specialized model trained to convert any given text into good prose. Obtaining a good dataset for it is quite simple: shred some quality books into small pieces, then instruct GPT/llama to rephrase each one, use that slop as inputs and the original texts as desired outputs. Could this actually work?
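The dataset half of that idea can be sketched in a few lines. This is a rough sketch under assumptions, not a tested recipe: `slopify` here is a hypothetical stand-in for the actual GPT/llama rephrasing step, which you'd replace with an API or local-inference call.

```python
import textwrap

def chunk_text(text, max_chars=2000):
    # Shred a book into small, roughly uniform pieces.
    return textwrap.wrap(text, width=max_chars)

def slopify(chunk):
    # Hypothetical stand-in for the GPT/llama rephrasing step; in
    # practice this would call a model to rewrite the chunk in slop.
    return "[rephrased] " + chunk

def build_pairs(book_text):
    # Inputs are the slopified rewrites, desired outputs the originals,
    # so the finetuned model learns the slop -> prose direction.
    return [{"input": slopify(c), "output": c} for c in chunk_text(book_text)]

pairs = build_pairs("some quality book text " * 300)
```

The open question is whether the synthetic slop is close enough to real model output for the mapping to transfer; the chunking and pairing mechanics themselves are trivial.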
>>102211302
I get it from pretentious speeches. They don't even have to be good. I also sometimes get the same sensation when it's cold out, but it's different from just standing out there shivering.
>>102211700
Not entirely sure what you're describing, but if it's an actual movement of your body that's not it.
I wonder what percentage of people have to be able to feel a sensation in order for a description of it to become fixed as an idiom. From this conversation and others in /lmg/ and /aicg/, it seems like most people don't. I also have memories of trying to describe it as a kid and being met with blank stares.

>>102211302
yes, but i still had my tail. I wish the scar did something besides make my ass crack huge.

>>102211892
It would only take so long before the new shivers are found. People seem to be tired of them not because they're bad writing necessarily, but because they see them constantly. Which they do, because the models always do the same thing. And because they haven't read this much since high school.
The other problem is that 'good writing' is subjective. I've read a few novels where, if you remove redundant adjectives, you'd end up with 1/3 of the book gone. There are some 'good books' that i can't stand, as much as i appreciate the writer themselves. I like listening to them speak, but not so much their written words.
Just training on complete works from a big variety of genres should be an improvement, as long as people start doing something other than coom. They're not that bad as they are.
Anthropic seriously thinks they can get away with actively making their product worse (disabling NSFW content) while trying to be a billion dollar company. What the fuck are they smoking?
>>102211302
https://en.wikipedia.org/wiki/ASMR

>>102212247
Anthropic '''disabled''' nsfw content from the start.

>>102212461
No, anon is right. They now actively add hidden prompts even in the API, including stuff like not quoting copyrighted text etc. Which obviously causes all kinds of issues. Saw a couple posts of users now not able to get a summary of their PDF.
Very weird. Anthropic needs to appease to get more users. Even Sonnet 3.5 is not enough for the normies to switch.

>>102212247
they successfully raised almost $8 billion last year but you (coomer who goons to AI text) are right, they'll never be a billion dollar company. if they listened to us they would actually be successful like all the other wildly successful AI startups worth > $8 billion that produce smut better than opus.

>>102212517
How does that contradict what I said? I'm saying it's insane to try to have their company be that good while actively making it worse for no reason

>>102212247
It's very simple. You provide erotic content? You will limit and censor it heavily, or payment processing companies will refuse to grant you a right to their services.
The limitations they enforce would shoo away most of the userbase that uses their products for erotic purposes, so they might as well just drop it entirely and focus on consolidating their SFW userbase.

>>102212545
>You will limit and censor it heavily or payment processing companies will refuse to grant you a right to their services.
Is that true? I doubt that's why they're doing this, but in that case what are the payment processors smoking, limiting their business?

>>102212558
they're all owned by religious nutjobs

>>102212558
>Is that true?
are you 11? this is extremely common knowledge if you've been online for more than 6 months.

>>102212558
I can't really speculate as to why, because I honestly have absolutely no idea why they're doing it. Might be for religious reasons, like >>102212571 said. It's strictly western companies who do this, too.
The latest example I can think of is: https://nichegamer.com/dlsite-temporarily-blocks-major-western-payment-processors/

>>102212558
>>102212571
bet they're jewish
>>102212050
>It would only take so long before the new shivers are found. People seem to be tired of them not because they're bad writing necessarily, but because they see it constantly.
I've been thinking it would be cool to have a system where you can randomize the prompts a bit. Something like randomly swapping out adjectives or generic instructions for how the output should look.
Though the problem with that would be that you would need to reprocess the prompt each time.

>>102212617
Anon, things cannot be Jewish. A company can have Jewish employees and/or it can have a Jewish CEO. Said company could also be being propped up by other companies with connections to Jews. But it cannot actually _be_ Jewish. Please cure yourself of this /pol/ mindrot.

>>102212586
They do not want to promote objectification and the like. Google and companies like it are run by 90% non-religious people. But in reality radfems and christcucks reinforce each other; the groups have the same ideology.

>>102212558
I don't know if that is the reason, but I think there was some extremely retarded US court that decided a payment processor could be held liable for content on pornhub or something.

If I have the model output stories in markdown boxes, would that worsen quality?

>>102212640
>umm no you see technically a company led and controlled by jews is not jewish itself
>>102212806
Yes, that's right.

>>102212806
Jews happen to be overrepresented among rich assholes but it's not like non-Jewish rich assholes are any better.

>>102212907
They must abide by their rules to be in their position.

>>102212929
If only you could actually point to this "they". I really wonder if this brainrot is terminal.

>>102212962
You know exactly whom I am referring to

>>102212981
No, anon. I do not. No one does.

>>102212907
I like Musk.
>nooo he's literally hitler omg omg
>>102210005
How does Theia compare to Rocinante? Worth the extra VRAM required?

>>102212994
Same. He's a massive sperg and I really dislike his egotistical personality, but the things he has achieved and the work he is doing are a MASSIVE benefit to humanity as a whole and outweigh the dumb retarded shit he does.

>>102213046
they are both shit
buy an ad

>>102213059
>He's a massive sperg and I really dislike his egotistical personality
He's clearly playing up that personality to play the PR game, the same way he did when Tesla was being shorted, because he knows there's a huge part of the American population that loves that shit, and every time he says something stupid the media gives him free publicity. Bush basically did the same acting to get elected president.

>>102213075
No, like, I legit just wanna know what model to use as somewhat of a VRAMlet. If you got a better recommendation that's not 70B go ahead and tell me, otherwise I heard Rocinante is best

>>102211892
It's doable, but then you have to either train a model on the data or convince others to do so. The vast majority of tuners don't know how to utilize a raw corpus.

>>102213081
Fuck me, I hadn't even considered that. That does explain why he got so much worse leading up to his Trump endorsement.

>>102213087
>otherwise I heard Rocinante is best
and I heard you should purchase an advertisement. Hiromoot is exit scamming because of people like you

>>102213133
Okay, let's say Rocinante is worse than Petra-13b. What is the best model then?
>>102212907
so true sis! viva la revolucion, trans rights!

>>102213144
we've played this game before. i'll tell you to use the official instruct, and you'll just insist that whatever you're shilling today is better. fuck off

>>102213170
Who do you think I am?

>>102213087
>>102213144
>>102213175
Please do not feed the troll. Thank you. Also, ask again in a few hours; the people who provide actual discussion haven't woken up yet.

>>102213144
Pyg 6b

>>102213185
when does the rest of anthracite wake up?

Is there any way to make the model remember what the fuck happened in the story?
Always the same shit: everything is going great, but then it hits a wall continuing the story because it doesn't remember anything about it, just the previous prompt or a couple of them.
I've tried copying the whole story, cleaning it up, then starting a new chat and pasting it; the model still doesn't know what the fuck is going on. Feeding it a .txt is even worse.
Any tips?

Hi all, Drummer here...
>>102213046
I haven't heard feedback comparing the two (Rocinante vs. Theia), but Theia feedback so far is that v1 & v2b follow instructions really well, are much more stable, and punch above their original 12B weight.
v2b (WIP Theia v2): https://huggingface.co/BeaverAI/Theia-21B-v2b-GGUF
Theia (especially v2b) is just Rocinante in a 21B body.
>>102213328
I see, thanks! I'll try running it then. So far Rocinante is great, so I have high expectations for Theia
jamba.gguf please
>>102213356
I'm still gathering feedback for Theia, so please do drop it here when you've coomed to a conclusion. (Also worth a try: v2d)

>>102213328
I'm this anon >>102213299
Just wanted to say that Rocinante 1.1 is the best model I've tried so far when it comes to writing novel-style stories and stuff.
If Theia is as good I'll try it right away with the same story I'm trying to make Rocinante remember... I'll let you know how it compares.

>>102213374
Rocinante v1.1's equivalent is Theia v2d. Unfortunately, I had to make some really questionable merge-fuckery and I'm not too confident with it.
Thank you for your feedback! What chat format do you use for assisted storywriting / instruct-guided stories?
Could you provide an example of your problem? Still trying to understand it.

>>102213402
The easiest example of the problem would be:
I just prompt a short story, then at the end I simply ask what happened in a specific part of the story, for example "what happened at Veronica's party".
It then proceeds to get most of the story wrong, or transforms the events into something different while maintaining some core stuff from what actually happens in the story.

>>102213402
NTA, but I'm wondering why you have 4 suggested templates for Rocinante 1.1. Did you train it with several templates? If so, why?
Also, based model name
>IT'S HAPPENING IT'S HAPPENING
>IT'S HAPPENING IT'S HAPPENING
>IT'S HAPPENING IT'S HAPPENING

>>102213492
Buy an ad, saltman.

>>102213492
Oh shit, it will be RLHF-lobotomized 100 times faster?

>>102213462
But this happens with every model I've tested so far.
>>102213402
I forgot about the chat thing. What I do is I just start with an overall prompt for the story, like:
"Can you help me write this story? Michael gets home after a hard day at work. He goes to the living room; his wife Emily is there watching TV. He then goes to sit next to her, but something feels off. Michael gets nervous as he's been cheating on his wife."
Then after the model does its thing, I read it and edit what I like and what I don't. After that, I prompt just a line or two of the start of the next chapter or block in the story so the model has some direction of where to go.
This is the best method I've found so far.

>>102213492
The strawberry bullshit has clearly shown that the OAI cunts don't care about realistic expectations. It's bullshit until proven otherwise.

Does it make sense to introduce distortions in some dataset images in order to diversify an otherwise monotonous dataset that is prone to overfitting?
The guides are contradictory on that. Some say that a single bad image can ruin training, but then there are built-in options to randomly crop and hue-shift pics, and those can distort an image quite a lot.

>>102213462
What chat template? You might get the best results with Mistral for logical reasoning.
>>102213465
Yep, I did. I like the idea of Roci users trying out different chat templates to see what works best for them. Try Roci's storywriting in Alpaca and Mistral, and note the significant difference in writing. There are pros and cons to each template.

>>102213492
But can it stop people from doing useful things better than GOODY-2?

>>102213299
>Is there any way to make the model remember what the fuck happened in the story?
Look into RAG, although the current implementations aren't exactly perfect. We had an anon a few threads back saying he's working on a prototype for a different RAG approach, but that's probably gonna take some time to come out.
TFW Gemini tries to say "nipple" but silences itself. Are the censorship, repetition, and ellipses the result of hiring the C.AI guy?

>use draft model
>2x slowdown
I can't believe I fell for the draft model meme

>>102213492
GPT-4 is not "exponentially" better than GPT-3, but that doesn't mean it's a lie. exp(-x) may still show improvements over time; we call that diminishing returns.

>>102213861
This is just cruelty at this point. Those fucks will cause an emancipation movement with their "ethics".

>>102213916
These ethics are bullshit regardless. Censoring "hate speech" may make some sense, but censoring the names of body parts and sexual stuff in general is just stupid; it's literally removing the basis of humanity
Aphrodite got updated to 0.6.0, it's been a while. Has anyone tested it?
https://x.com/AlpinDale/status/1830906395169882288
https://github.com/PygmalionAI/aphrodite-engine
No support for exl2 though. Alpin recommends AWQ-marlin. I've never quantized AWQ to be honest. Seems like AutoAWQ is the way to go?

>>102213938
LLMs are not human

>>102213964
That's not even the point. LLMs are tools used by humans

>>102213960
why would anyone use a vLLM ripoff?

>>102213975
>LLMs are tools used by humans
I wonder what would happen if Win95 released today. MS Paint, Notepad. Nowadays the first thing people point out is that you can make all sorts of weird shit with it. Responsibility needs to be put back into the users' hands.

>>102214002
it supports way more quantization formats

>>102214121
Either you have enough VRAM and super-specific quantization formats are unnecessary, or you don't and you use llama.cpp.
>>102210747
Testing it now and it feels worse than IQ2_M for me. The IQ2_M is from Legraphista or however that's spelled, so idk if that makes any difference.

>>102214143
this.

>>102214143
If you want to run a 70+B model you pretty much need some sort of quantization. Even Q8 would halve the memory requirements. And being compatible with multiple quantization formats can be beneficial, as some are faster and some have better precision.
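The arithmetic behind "Q8 would halve the memory requirements" is just parameter count times bits per weight. A back-of-envelope sketch (weights only; this deliberately ignores KV cache, activations, and runtime overhead):

```python
def weight_gb(n_params_billions, bits_per_weight):
    # GB for the weights alone: params * bits / 8.
    # Ignores KV cache, activations, and per-layer overhead.
    return n_params_billions * bits_per_weight / 8

fp16 = weight_gb(70, 16)  # 140.0 GB for a 70B model
q8 = weight_gb(70, 8)     # 70.0 GB, half of FP16
q4 = weight_gb(70, 4)     # 35.0 GB
```

Real quant formats add a little overhead for scales and zero points, so treat these as lower bounds.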
>>102213555
>The strawberry bullshit has clearly shown that the OAI cunts don't care about realistic expectations.
"High" and "realistic" are not mutually exclusive. In this case, downplaying them by letting people believe there will be incremental improvements would be unrealistic and lead to people being completely blindsided by what's to come.

>>102214212
vLLM does support quantization.

>>102204871
no

>>102214212
hi Alpin, buy an ad

>>102214287
we didn't get bombarded like this yesterday. i guess even shills take the holiday off lol
>>102213757
Using RAG (supposedly in openwebui you just import the document and then use # with the name of the document in the prompt) has the exact same effect as just pasting the complete story in a single prompt, or using the file importer and feeding the model the text in a .txt or whatever.
So either I'm doing something wrong or RAG is also useless for this problem.

What's a good model for RPing these days?
I'm still running Toppy-M-7B.q8_0.gguf on koboldcpp. I wanted to check out Merged-RP-Stew-V2 but will never fit it into my 3080's VRAM lol

>>102214306
I'm sorry... I felt bad for the innocent guy who got harassed for bringing up two of my models.

>>102213757
RAG is a meme
>According to Stanford, even pro-grade RAG systems (the kind used by lawyers) are only right 65% of the time at best

>>102213938
Surprisingly, making it say "vagina" was not that hard. The model tried to weasel away using the term "inside" once, but it was easy to fix. It's the nipple where it drew the line.

Is it possible to NVLink a 3090 and a 3090 Ti together?
>>102213373
Hey, I tried it and really liked it. It certainly does feel a bit smarter than Rocinante; however, running it was painfully slow for me (0.5 tokens per second), so I'll stick with Roci for now

What's the best storywriting model in your opinion?

>>102214608
Gemmasutra 2B

>>102214332
>>102214403
Did a quick test. It does actually work if the content is way smaller, like around 2000 words.
Adding the full story and then adding the additional RAG with only the part I'm interested in does not work; maybe the model gets confused?
So perhaps the solution is to break the story into blocks of 2000 words, then make a RAG file for each one and feed it to the model for each prompt. I'll test that next.
>>102214608
What frontend do people use for storywriting? Is there anything better than Silly?

>>102214332
Yeah, that's what I meant with "the current implementations are lacking".
What you want to do is implement a vector database and insert any and all messages into it. Then when you prompt the model, instead of inserting the entire context, you retrieve relevant messages, process those, and inject that into your context. This turns the last N messages into the model's short-term memory and every message past that into the model's long-term memory.
Imagine the following prompt:
>i have a meeting at 8 pm
This gets stored in the vector database, optionally in a specific memory-typed format, perhaps including a timestamp.
Now, when the following prompt is made a hundred posts later:
>when did i have that meeting?
The prompt is compared with the vector database (optionally converted to the same memory-typed format for better compatibility) and all relevant entries are retrieved. The model is then tasked with summarizing the retrieved entries to save context length. This summary of the model's long-term memory is used in tandem with the model's short-term memory and the user's prompt to create a new prompt.
About the memory-typed format I'm talking about: the model could be asked to turn prompts into different forms through a pre-written context. "I have a meeting at 8 pm" could for example turn into <appointment><meeting><time><original prompt: (prompt)><memory created at: (timestamp)>, which could make it easier to retrieve more relevant prompts. For example: "When did i have that meeting?" could turn into <appointment><meeting><time>, corresponding a lot better with the modified stored prompt than the original prompt. The summary should be made from the original prompt (and perhaps the timestamp), however; the tags would not be of use.
Now that my schizorant is over, does anyone have any questions?
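The store-and-retrieve half of that idea fits in a short, stdlib-only sketch. To keep it runnable here, the "embedding" is a toy bag-of-words vector; a real setup would use a sentence-embedding model and a proper vector database, but the retrieval logic is the same shape.

```python
import math
from collections import Counter

def embed(text):
    # Toy bag-of-words "embedding"; swap in a real sentence-embedding
    # model for anything beyond a demo.
    return Counter(text.lower().replace("?", "").split())

def cosine(a, b):
    # Cosine similarity between two sparse word-count vectors.
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class MemoryStore:
    def __init__(self):
        self.entries = []  # (embedding, original prompt, timestamp)

    def add(self, prompt, timestamp):
        self.entries.append((embed(prompt), prompt, timestamp))

    def retrieve(self, query, k=3):
        # The k stored prompts most similar to the query; these would
        # then be summarized and injected into the model's context.
        q = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[0]), reverse=True)
        return [(prompt, ts) for _, prompt, ts in ranked[:k]]

store = MemoryStore()
store.add("i have a meeting at 8 pm", "msg 12")
store.add("the weather is nice today", "msg 47")
hits = store.retrieve("when did i have that meeting?", k=1)
```

The memory-typed tagging described above would slot in as a preprocessing step inside `add` and `retrieve`, normalizing both sides before embedding.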
>>102214608
None. All models suck for story writing, and I'm not even joking. It's sad, really.

>>102214643
Silly is garbage for story writing; Novelcrafter is much better. Mikupad is also nice if you want total control.

>>102214698
The fact that you're mentioning the Hyperloop tells me that engaging with you in a discussion about this topic would be fruitless, because your blind hatred is preventing you from changing your mind.

>>102214698
America literally would not be in space at all without SpaceX, and Starlink is finally killing off shitty ISP monopolies worldwide.
Who let the muskrats in?
>>102213059
https://youtu.be/rPt9hAC24MI

>>102214753
Are you lost? This isn't reddit.

>>102214753
WHO, WHO, WHO WHO WHO

>>102214773
>imagine being a parrot and getting all your opinions from a fucking youtuber

>>102214777
You are the one who's lost, anon. We all hate Musk.

>>102214811
>we
Fuck off with your groupthink bullshit and go the fuck back.

>>102214827
What? Can you repeat? It's hard to understand you when you have a billionaire balls deep in your mouth.

>>102214827
Please don't feed the troll, anon.

>>102214456
I've noticed their models seem weirdly triggered by discussions of licking pretty much anything

>>102214811
Finally someone says it. Muskovites have been getting uppity, and it's about time they remember who's in charge here.

>>102214628
Nope, doesn't work. The result is even worse than just feeding the whole thing. For some reason it does work with just a small chunk of the story of about 2000 words. What's the reason for that?
Just tried Mistral large and holy shit is it so much better ~70b models I've been testing over the last couple of weeks. Even at a lobotomized Q_2 quant, it blows most other models out of the water when it comes to rp.
>>102215135>better than
Random subjective report: Wiz2 8x22B (Q4KS) appears still superior to Llama 3.1 70B (Q6K). I had a moderately mysterious medical mystery the other day, and asked them both. Llama 3.1 gave me this insane esoteric bullshit (real, to be clear, but a weird neurosurgery niche thing) while Wiz2 pointed me in the correct, much more mundane direction.Both given their preferred prompt format (Vicuna/Llama3) in the new llama.cpp server UI, "reasonable" basic minP-only sampler settings, low temp, not otherwise optimized. I don't know, it's just one data point, but I thought for sure the fancy new 3.1 would at least be equal to Wiz2 in all cases.What is the (non-ERP) meta nowadays? Is Wiz2 really still the best? (Assuming 400B is stupidly out of reach).
>>102215192
>Assuming 400B is stupidly out of reach
Just buy $30 worth of RAM and learn some patience.
>>102215192Mistral Large 2 is better than WizLM
>>102215218Is there anything in-between a full GPU setup and consumer CPUs?Like some weird ASICs or giga-cored CPUs that are bad at regular shit?There has to be an option to get half the t/s for half the money, right? Otherwise a niche is missing in the market.
>>102215192In my tests Llama 3.1 has a lot more knowledge than Wizard, but I'm not doing medical knowledge so maybe that's different.
>>102215135
I agree, but imo it's still bad. For me, the models are categorized as follows:
<=8B - Unusable.
<=21B - Decent, but it's stupid af, will easily write logically flawed replies.
<=72B - Good, it doesn't make as many logically flawed replies as <=21B.
<=123B - Good+, it still writes logically flawed replies from time to time but slightly than <=72B, just slightly.
>>102215335slightly less than*
>>102215321The Macbook SoCs I suppose.At least in the technical sense, I don't know about the price.
>>102213611I don't recall mikufluxfags turning this thread into SD general before, why start now?
>>102215335I pretty much agree. Just had a gen from Mistral large of a character trying to take me from an airborne airplane bathroom to "somewhere more private"
>>102215218
No can do, I'm getting >6t/s with Wiz2. And 256GB of RAM is not $30. Why am I even replying to this.
>>102215246
Thanks, I'll give it a try. I have admittedly not been keeping up the past few months.
>>102215324
Yeah fair enough, I can believe that. Maybe medical is a weak spot - actually that wouldn't surprise me, given that medical is an area the "AI safety" lawyers get all squeamish about, and so the much freer Wiz2 would do better.
I've been wanting to set up a semi-rigorous blind testing setup, and also explore sampler param space a little. Never enough time/energy for anything these days!
How do I do beam search in ooba again?
>>102215405Ngl cargo hold exists
>>102215614>NglI don't think this means what you think it means
>>102215602Beam searching is when you go to the bathroom at 4 AM and piss until you can hear the water splashing
>>102215639was meant to type desu and somehow brain decided to not work
>>102215614True, but I don't think you can enter the cargo bay through the passenger section in most commercial planes
>>102210005
>no new models since last week
Pack it up, boys. It's officially over.
>>102215761kill yourself shill
>>102215405maybe she has a private bedroom on an air emirates flight
>>102215405You should ask the reasons in ((OOC: ))
>>102215335True. My most recent gen with Mistral Large being retarded is in a time travel card. My character traveled to the past and met his grandmother when she was 18 years old, they became friends and then, when I revealed that she is his grandmother, her reply was "What are you saying? I'm only 18, I can't possibly be your grandmother *she studies her face looking for any signs of deceit but finds none*"This completely shattered my immersion.
>>102215976A completely normal response from someone not interested in sci-fi, struggling to grasp the concept of time travel.
Apparently OpenAI representatives in Japan are telling people that OpenAI will release a sequel to ChatGPT 4 this year. They claim it's at least twice as intelligent.
>how is this related to local models
Because after it comes out, open source models can finally increase in quality again.
>>102215246Followup question: what is the Mistral Large 2 quant situation? When I was last paying attention, it was understood that L3 was packed "fuller" than previous models, so quantization hurt it worse.What's the situation for Mistral Large 2? I think its 70GB Q4KS is going to need too much offloading from my 72GB VRAM to be usable. Is an IQ4XS or IQ3M still going to beat Wiz2 Q4KS?
>>102215976Anon... are you an LLM? Do you lack a theory of mind?
>>102215976I know you think everyone understands and is open to the very concept of time travel, and that they'd accept it on the spot.However, that belief is born from your own retardation.
>>102216025>>102216145>>102216184I disagree! A normal person would first ask about the time travel part rather than asking about the "being her grandson" part. Also, this would sound too absurd to anyone and they would first think it's a joke or just say "wtf are you saying? are you drunk?"
>>102216045
>It's at least twice as intelligent
What does that even mean exactly? Intelligence is not something you can measure in this way. Pure shill.
>>1022162132x MMLU score
>>102216199
>A normal person
You are not a normal person for starters, so why are you trying to infer how a normal person would react?
Anyway, one hundred people, one hundred reactions. Move along.
>>102216025>>102216145>>102216184Also, picrel is another swipe.
>>102216247This one is just stupid, yes.
>>102216244Just accept that Mistral Large isn't perfect. Stop this blatant cope.
>>102216286I'm not talking about Mistral Large, I'm making fun of you, stop moving the goal post.
>>102216045
>open source models can finally increase in quality again
Again? There are significant new improvements like every month. It's just that people here are spoiled cry-babies.
>>102216307
>It's just people here are spoiled cry-babies.
A dance to the truth
You know, I always thought it was autists who have difficulty understanding that other people have their own perspectives on things.Is this place just filled with autists or is this not strictly an autistic thing?
>>102216352This place is filled with retards, not autists. The autists left long ago.
>>102216045I fucking hate arbitrary and meaningless axis labels so much.That plot is utterly useless.
>>102216362100x bigger, 2x better
>>102216357>The autists left long ago.Are there any better places to discuss theories and thoughts about LLM in general?I tend to write down my theories and thoughts here, but if I can do so in a place where people find that useful rather than annoying I'd rather do it over there.
>>102216374
>you need to increase a model's "intelligence" (whatever the fuck that is) one-hundred-fold for it to become 2x "better" (idem)
I love LLMs
>finally try mistral-large at IQ2_XXS with Q4
>it UNDERSTANDS
>but it's slow
I can't go back to stupid models. Now to look at finetunes, I guess.
I asked Largestral where Alice will look for her glasses and it said she'd check the drawer where she put them last, even though I CLEARLY explained that Bob hid them under the sofa cushion while she was away. I've had 7B models get this right but Largestral is just kinda retarded for its size.
>>102216238MMLU is almost "solved". What percentage improvement on this meme benchmark means that my model is "twice as intelligent"?
>>102216410Meanwhile I'm here down in the mud with my 8gb of VRAM, constantly having to rewrite context to get my models to write what I want.
>>102216391If there was, I wouldn't still be here. There's reddit, but I wouldn't expect any useful discussion from there. I assume the only productive discussion comes from private communication between researchers.
>>102216298That doesn't hold much weight coming from you anon, try again once you find who I am.
>>102216442Oh, no you don't.You asked for it, you're going to suffer the consequences for it.You want to learn how to use ComfyUI, go to civitai.com, make an account, turn off the nudity filters and download a nudity LORA.You can then input the image in ComfyUI and use the model + LORA to remove the clothes through prompts.
>>102216496>pulling the "you don't know who I am card" on 4chan
>>102216488There's no like discord or matrix servers or something?I wouldn't know how any of those work, I've spent all my life in this place.
>>102216496You're on 4chan. This means you're an autistic misfit who has a skewed look on society as a whole.
>>102216517>
>>102216440maybe real life grinding is the answer
>>102216536Nah, I consume enough anime to know what a normal social interaction looks like.
>>102216522I know nothing about matrix, but we get daily discord raids shilling their sloptunes. Try one of their models and you'll see for yourself that they have no idea what they're doing and it's just a redditors sekrit club.If you find where all the non-stupids are, please let me know.
>>102216569Any non-stupid person would be employed, so you probably can find them on LinkedIn.
>want to taste XTC kino
>not using koboldslop
do you think gemini could help me hack it into tabby....
RECKLESSABANDON
this general couldn't be more dead
only the absolute retards are left
>>102216781explains why you're here
>>102216781one of us, one of us
>>102216410Q4 KV cache? I thought anons said quanting the cache made models bad
>>102216612Not a bad idea. Maybe I will go cold call some folks and ask them if they have a discord.
>>102217029
>I thought anons said quanting the cache made models bad
genuinely you can't trust what 99% of anons in these threads say, ever. most of these faggots couldn't even get past launch model pains as they get filtered, call the model shit, then move on.
anyway quanting the cache does nothing to the quality at q4, it's amazing.
>>102217049Not exactly "nothing", rather something. I think cuda dev said that v cache takes a harder hit than k at q4. Ideally run k at 4 and v at 8, but that requires compiling llama.cpp with a special arg.
>>102217049kek, yeah, nothing at all. Geez, I wonder why it's not the default.
>>102217080>>102217082wait my tired brain just realized what you're actually talking about, nevermind, completely forget what i just said like your brain only has 2k context length.
>>102217080
It's the other way around: K cache needs more precision than V cache.
See https://github.com/ggerganov/llama.cpp/pull/7412#issuecomment-2120427347 .
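Per the linked PR, mixing cache precisions in llama.cpp might look like this (flag names are from llama.cpp builds circa mid-2024 and should be verified against `--help` on your build; some K/V type combinations also require compiling with the extra flash-attention quant kernels enabled):

```sh
# K cache kept at higher precision (q8_0) than V (q4_0), per the PR above.
# Flag names/values are an assumption from mid-2024 builds and may have changed;
# -fa (flash attention) is required for a quantized V cache.
./llama-server -m model.gguf -fa \
    --cache-type-k q8_0 \
    --cache-type-v q4_0
```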
>>102217029i've been using q8 kv and it seems fine
>>102217144Gotcha, I had a 50% chance to get it right.
https://www.ebay.com/itm/145884743441 $165https://www.ebay.com/itm/156345132288 $29 x2https://www.ebay.com/itm/266946767074 $25 x12$525 for 1.05 Tbps memory bandwidth, about the same as a 4090. Effective memory bandwidth will drop off past 40 GB as you saturate the 2x16GB on-package memory, but you'll have a total of 416 GB of RAM to play with. I guess you could also do it way cheaper and just get 12x4GB of DRAM, you'll have a total of 64GB for about $180 less.
>>102217344a 4090 has 1 TB/s not 1 Tbps
>>102216432Not solved enough. Not much has changed there since the original gpt 4. Meanwhile math and code meme marks have increased by massive amounts
>>102213492Oh god stop it! local llm turd is already dead!
>>102213916>>102213938It's not cruel and (you) are enabling it anyway, by using the same shit locally.
>>102213492>>102216045That's GPT 4o, retards. GPT 4o isn't released in Japan yet.
>>102216045
*open source models can finally increase in censorship quality again.
there, fixed it for you.
>>102217609What did anon mean by this?
>>102217643wow the graph is going up, yet the east is falling...
>>102217659billions must prompt.
>magnum-v2.5-12b-kto
seems broken. lots of little text errors that almost seem like bad sampler settings but persist no matter what i do with them. is mini-magnum good, or what's the current 12b coomtune?
>>102217344Damn, nevermind, looks like none of the Xeon Phi processors support multi-socket configurations. Rip the dream.>>102217513I mistyped that, it would have been 1TBps if everything didn't suck.
>>102217643wtf, it's a bigger jump than the GPT3>GPT4 jump.I bet this will be just strawberry.
>>102217790
>I bet this will be just strawberry.
I honestly think the strawberry schizo was correct in that the internal project is called strawberry and that it has actual reasoning capabilities.
>>102217827Get some taste.
>>102217790
Given that the curve and points are barely connected, it's "eras" and not even model names (not to mention OAI management's hallucinations about intelligence), none of it should be taken remotely seriously.
>>102217833
>it has actual reasoning capabilities
Don't say that to the fans, they will skin you alive for implying it hasn't had it for years
>>102217891
>none of it should be taken remotely seriously.
Of course not, it's marketing slop made for investors who are conditioned to invest when they are promised that the line will go up.
It's fun to speculate, thoughbeit.
>>102217827This made me laugh
https://github.com/gpt-omni/mini-omni
RWKV won
https://xcancel.com/picocreator/status/1831006494575464841
>>102218019>artificial jew on ur pc nuked from OS day-one.
>>102218012That demo is really impressive.
deadest of generals
>>102218012
https://huggingface.co/gpt-omni/mini-omni/discussions/2#66d70791169f9a7cb83b9cec
>If you want to change the LLM model, you have to retrain the whole audio parts.
https://huggingface.co/gpt-omni/mini-omni/discussions/1#66d70763b61dd11022a80bd5
>For the training code, there is currently no definitive release timeline.
Niggers.
>>102218019Is it still as mediocre as it was 3 years ago?
I should start up my army of local Mikus to populate the thread.
>>102218418that won't make you any less lonely or make the general any less dead
>>102218410
>If you want to change the LLM model, you have to retrain the whole audio parts
this is why multimodals will never be good
Just tested out Q8 KV cache compared to no KV cache quanting. It's not great. Seems to be less capable of remembering things from the context. So honestly I do believe it when they say it's not worth quanting the KV cache. However, if you have a HUGE context, maybe it'd be worth it. But for 32k, I feel fine taking a small hit to speed for the better attention to context.
>>102218474I think the best solution is for an LLM to be trained to accept input and output of a certain modality, but to keep those models separate architecturally. That way, you could swap any compatible components. Like brain legos, but for transformers.
>>102218551that's how some do it now, you can load image and audio models alongside a text model with kobold and use it all together for example. they're never going to release a multimodal where the image gen is better than choosing a popular tune, so that whole part of the model is wasted resources that's still being loaded
>>102218551This approach has the same problems as the tokenizer, knowledge gap of the thing that is actually being processed.
>>102218618
>knowledge gap of the thing that is actually being processed
this could probably be fixed by better options for what to include in data to be processed. image gen in st is pretty bad because it lacks options to fully realize the scene it's in
>echidna-13bIs it still considered the best model for local ooba/silly with 4gb vram?
>>102218702See: >>102217478
>>102218702no thats pretty old and was never the best. what are you looking to do?
>>102218618Train an additional adapter in between the newer modality model and the LLM so any part of the input the latter is unfamiliar with can be processed. Would take less resources than finetuning the LLM itself.
Is there a guide on what the difference is between Q8 and Q5, or what the symbols after them mean? Preferably with a visual guide, because I have no fucking idea what people mean when they say
>well you see, by separating the quasi symbols from the edge of the tokens, we can preserve the context surrounding them and improve k-mean efficiency by 5%!
>>102218767
>no thats pretty old and was never the best.
I was gone for quite a while and figured things have changed, that is why I am asking for your advice. Well, you are right, but it was surprisingly good and still had decent speed, considering the very limited vram I have.
>what are you looking to do?
Lewd rp stuff.
>>102218719
That is not what I asked for. I can't take a fucking computer with me when I'm out in the field!
>>102218862bigger Q is better. then comes small/medium/large: again, bigger is better
>>102218862It means that you should lurk more
>>102213492
>100 times the computer power level 2 quantum strawberry AGI
Holy shit
>>102218883Yeah, I understand that part now (although that took embarrassingly long time), but now I'd like to learn what they actually do to the model itself.>>102218885Lurking would do jack shit since none of you niggers ever discuss this on a level where idiots like me can understand it.
>>102214335I feel you, bro. Personally, I recently tried Chronos-Gold-12B, seems good. If speed is not a critical criterion, Command-R-35B seems good too.
>>102218880
https://huggingface.co/ArliAI/ArliAI-RPMax-12B-v1.1-GGUF
been playing with this for a day and it's alright. i don't think it's specifically for lewd but does it no problem. in general, look for mistral-nemo 12b tunes, should be about the same speed as old 13b
Hey, /g/bros. I am going to be honest I don't know anything about models or tech. I recently discovered chatbots, and I just wanted to ask are there any nice coomerbait models I can run locally on my shitty mac m1 air?
>>102213492SUPERDUPERINTELLIGENCE IN 2 MORE MINUTES AHHHHHHH
>>102218945post specs at least
>>102218921
>but now I'd like to learn what they actually do to the model itself.
Lower Q makes the model smaller (making it slightly faster due to less bandwidth), but lowers the accuracy of the prediction. S/M/L does the same within the same quant. That's all you need to know.
And pic rel
>>102218921
>but now I'd like to learn what they actually do to the model itself.
the simplest explanation is that they're different levels of lossy compression. if you want nerd level stuff, maybe here https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes and here https://github.com/LostRuins/koboldcpp/wiki#what-are-the-differences-between-the-different-files-for-each-model-do-i-need-them-all-which-quantization-f16-q4_0-q5_1
Hi, my friend asked me to come here. Are there any LLMs that can teach me Japanese?
>>102219028ChatGPT
>>102218931
Much appreciated, I will try it out, thank you. In a guide I saw https://huggingface.co/TheBloke/Utopia-13B-GGUF being mentioned, I will try that as well. Have you tried that one yet?
>>102219060nta. Models from TheBloke are old as fuck. Make your own quants or look for more recent ones.
>>102219060>10 months ago
>>102219060Don't. Really, not only it's so fucking old that it probably won't launch, it's also undi slop.
>>102218982
Thanks for actually replying!
How does a model get smaller? Are "nodes" being removed or merged? Or is the number of connections between them lowered? Do the numbers mean something or are they chosen arbitrarily? Also, what does the _M_L part mean?
Oh, and couldn't models be much smaller if they were optimized for specific things? Are they as large as they are now because they contain lots of tokens that most people don't really use?
>>102219000
Oh, those links already answer a lot of my questions. Thanks, anon!
>>102218967It's a M1 mac air with the apple chip. I don't know the specs myself, dude.
Hello, yes. I'm lost. What are local models? Are they dtf?
>>102219102
>Oh and couldn't models be much smaller if they were optimized for specific things? Are they as large as they are now because they contain lots of tokens that most people don't really use?
if you remove stuff you believe isn't useful, don't be surprised when the model gets even stupider than they already are
>How does a model get smaller? Are "nodes" being removed or merged? Or are the amount of connections between them lowered?
you lower precision, usually from 16 bit, to whatever
so instead of 0.123456789
you might have 0.1234
>>102219060thats old as well, but yes i've used it, it was ok, pretty comparable to other l2 13b's at the time. by old i mean llama 2 is the older base model, llama 3 and 3.1 are out now (8b for small). mistral-nemo is another newer model and being 12b is about the same size, but is a good bit smarter than older l2 13b, so look for things based on that or try llama 3 8b tunes. i think the bloke is dead too
>>102219039I mean the ones you can download I thought this was the general for this
>>102219132
>don't be surprised when the model gets even stupider than they already are
I wonder why this happens. If you take out all the medical terms, how would it become worse at generating adventure stories?
>you lower precision from 16 bit usually to whatever
>so instead of 0.123456789
>you might have 0.1234
Ah, something just clicked in my brain. Now I get it.
strawbery
>>102219154you thought wrong, bucko
>>102219132I don't think it's as simple as truncating or rounding, but that's the gist of it. Also not all parts of the model are quantized to the same precision because some parts might be more important than others.
>>102219060
that guide really needs to be updated... basically you want recent models where the #B params (in that case 13) isn't excessively higher than the number of GBs of VRAM you have (or RAM+VRAM if you're splitting).
at Q8 it's roughly a 1-1 relationship, at Q4 it's around 2B params/GB, you get the gist. ideally you want >Q4 unless you really want to run a bigger model.
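That rule of thumb can be sketched as a quick back-of-the-envelope calculator. The bits-per-weight figures below are rough averages I'm assuming per quant family (not exact numbers), and KV cache plus runtime overhead are ignored:

```python
# Rough model size estimate: bytes ~= params * bits_per_weight / 8.
# The bits-per-weight values are assumed approximations per quant family;
# real GGUF files mix tensor types, so treat the output as a ballpark.
BITS_PER_WEIGHT = {"f16": 16, "q8": 8.5, "q6": 6.6, "q5": 5.5, "q4": 4.5}

def model_gb(params_billions, quant):
    bits = BITS_PER_WEIGHT[quant]
    return params_billions * 1e9 * bits / 8 / 1024**3

print(round(model_gb(13, "q8"), 1))  # ~13B at Q8: roughly 13 GB, the 1-1 rule
print(round(model_gb(13, "q4"), 1))  # ~13B at Q4: roughly half of that
```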
>>102219102
>How does a model get smaller?
Nothing as complicated as that. Given a range of values, an appropriate offset and scale is chosen: for values from -1 to 1 on a tensor, with offset 0 and scale 0.25, you need just 9 values to represent the whole range. That weight now fits in 4 bits (down from the original 16 or 32). A whole tensor is done with the same offset+scale.
Not QUITE as simplistic as that, but not too far either. If you want to know more, you'll have to read code and documentation.
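A minimal sketch of that blockwise offset+scale idea, assuming a symmetric scheme in the spirit of llama.cpp's Qn_0 block quants (this is not their exact bit layout):

```python
# Blockwise quantization sketch: each block of weights shares one scale;
# weights are stored as small integers and dequantized as q * scale.
# Mirrors the idea behind llama.cpp's Qn_0 formats, not the real bit layout.
def quantize_block(weights, bits=4):
    qmax = 2 ** (bits - 1) - 1               # e.g. 7 for 4-bit signed
    scale = max(abs(w) for w in weights) / qmax or 1.0
    q = [round(w / scale) for w in weights]  # small ints in [-qmax, qmax]
    return scale, q

def dequantize_block(scale, q):
    return [scale * v for v in q]

w = [-1.0, -0.31, 0.02, 0.74, 1.0]
scale, q = quantize_block(w)
restored = dequantize_block(scale, q)
# restored values are close to, but not exactly, the originals:
# the rounding error per weight is at most half the scale step
```

That per-weight rounding error is the "lowered accuracy of the prediction" the posts above are talking about.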
>>102219168There are 3 r's in strawbery.
>>102219200
>character gets hit
>model doesn't know getting hit hurts since it has no knowledge of that anymore
simple example
>>102219174
yes obvs i'm massively simplifying
>>102219196source?
>>102219196lokbok
agi, tell me some fun facts about strawberries
crazy how brutal diminishing returns on LLM parameter increases are.like Nemo 12B is absolutely dumber than Largestral, noticeably so. it has failures of understanding a fair bit more often. but nowhere near to the degree you'd expect from it being more than ten times smaller
>>102219222no, nice trips thought. agi
What will you do when Local LLMs become deprecated?
>>102219246that's because largestral in particular is retarded and doesn't even understand time travel, compare it to a good 70b and nemo can't compete anymore
>>102219246l3 8b is 50x smaller than 405, 405 is nowhere near 50x smarter than 8b, that's pretty crazy to think about.
>>102219222Fun fact! Strawberries are actually vegetables. This is because, despite being sweet and genetically related to other citrus plants, strawberries actually grow underground!
>>102219246From what I tested, 405B isn't much better than 100B either, so we are probably at an architecture dead end.
>>102219266
>time travel
i'm surprised i haven't thought to try that. i usually prompt what year it is, say 80s, and with l2 70b's like miqu it then almost never mentions a character pulling out a phone, but might mention a house phone on the wall
>>102219200
>>character gets hit
>>model doesn't know getting hit hurts since it has no knowledge of that anymore
>simple example
I was more talking about very specific terms. Like the Latin terms for all the animals. Would removing those have a large effect on the quality of the generated text? I'm sure there's some contextual overlap, but would that really be worth the amount of params you could save?
>>102219317Or maybe Meta is just incompetent.
>>102219259I don't understand your question; I will literally create my immortal wife with my own hands.
>>102219259Become a hunter and seek you until the end of my life so I can make you my ERP chatbot
>>102219329
>I'm sure there's some contextual overlap, but would that really be worth the amount of params you could save?
yes, models should always be trained on everything you can get your hands on, everything, don't remove a single thing. that's why claude models are so good at rp: they know super niche stuff, like random fandom terms and the likes that can give your character tons of soul sometimes
>>102219329
>Would removing those have a large effect on the quality of the generated text?
>I'm sure there's some contextual overlap, but would that really be worth the amount of params you could save?
try phi models if you want "clean" models trained only on synthetic data and textbooks, the second you ask for anything outside of pure corpo slop they fall apart
>>102219222Discussing strawberries could inadvertently promote agricultural practices that may lead to over-farming, soil erosion, and habitat destruction, impacting ecological balance and species survival. Additionally, in some individuals, strawberries can cause allergic reactions, posing health risks. Maybe we could shift the conversation to sustainable farming practices or the importance of preserving natural habitats to protect diverse species and ecosystems.
>>102219246>>102219317>>102219335By every objective metric Largestral blows Nemo out of the water and 405B is a further step up. You just don't find them any better at the making-you-cum benchmark.
>>102219404>the making-you-cum benchmark.the only benchmark that matters
>>102219404>You just don't find them any better at the making-you-cum benchmark.Which is objectively the only use case for LLMs.
>large models aren't several times better than small models
Maybe you're not very discerning or not trying the right prompts. After being used to 123B and trying Nemo, I couldn't fathom how much more stupid it was. It might not be 10x but it's definitely at least 3x.
>>102219404phi models are also very good according to the average benchmark, but for coom they absolutely suck.
>>102219436>It might not be 10x but it's definitely at least 3x.That is what is being claimed yes, they are not proportionally better as their size might make one think.
>>102219436See >>102215976 it's still unusable if you give it any challenging scenario.
>>102219368Hm, interesting perspective.I think I'm starting to understand why OpenAI is expressing so much interest in creating specific training data.
>>102219436The only difference between Nemo and Largestral is that Largestral understands when it messes up and tries to hide that from you with creativity, desu.
>>102219087>>102219097>>102219099>>102219152>>102219183Thank you everyone, very interesting and helpful.I will check out some 12B mistral-nemo models and other llama 3 8B tunes then.
>>102219404Actually, I do. I care when my ERPs are make less sense. It's a turn off. And in that metric I do feel that yes actually, 123B is much better than 12B.
>>102219461This is a even better example, since it's unquestionably stupid: >>102216247
>>102219483remember newer stuff is higher context too, you aren't stuck at 4k anymore. even st finally updated their default to 8k. a lot of those old l2 13bs couldn't even be roped beyond 6k. these days you can get 32k-128k
>>102219317A dataset dead end. Will have to do something besides shoving random internet shit in it someday. can't hire the pajeets for that one
It's funny how people are realizing just how limited the English language really is.
>>102219436
yeah 3x sounds about right to me
I made the original post in this chain and it seems like everyone's interpreting it as "Largestral size models aren't worth using over 12B" but that isn't what I meant, I have Largestral and use it over 12B all the time
I just think it's remarkable that the difference isn't much bigger than it is
>>102219552Yeah, Anthropic already proved time and time again that synthetic data is the way to go.
>>102214608Still L3 70b storywriter, used 123b q4 for a while before switching back
>>102219554One day you'll have your 100% pajeet model. Don't worry.
>>102219554nahhttps://www.youtube.com/watch?v=NJYoqCDKoT4
>>102214608My private 12B fine-tuned on light novels. No, I'm not sharing it.
>>102219573(nta) what made you switch back?
>>102219583>>102219602God fucking damnit, I knew I shouldn't have erased that second sentence where I explicitly explain that I don't mean that other languages are better, because I thought people would intuitively understand that.You know what? My bad. I forgot to treat you people like the toddlers you are.
>>102219609Nah, I understand you anon, romance languages are just superior. A beta language like English can't compete.
>>102219609Your hallucinations aren't inherently obvious to anyone here.
>>102219552An alignment dead end. All it takes is one company to released a model that hasn't been pre-emptively lobotomized.
>>102219608NTA but I also went through something similar, and my reason is that the improvement (if any) wasn't enough to compensate for the speed drop
>>102219609You would then talk about the limits of the human languages, you retard. Learn to express your thoughts.
>>102219661No, because the English language can be improved.Words can be added. Removed. Modified.
>>102219657i went back to 70b myself but only because it has more soul. mistral large is very smart, adheres to prompts well, but its boring as hell, plus the speed difference
>>102219690All languages can do that. All languages can be improved.
>>102219703yeah, that too. Mistral models are too overconfident. I recommend trying CR+ (not the 08-2024 version though) if you haven't yet, it also has soul.
>>102219657>>102219703same reasons here, really slow and also seems to get really repetitive over long context. even if you DRY it just finds new ways to rephrase the same stuff, it doesn't want to do anything different. 70bs are less smart but at least rerolls are worth something
>>102218862
>>102218921
Model weights are stored as 16 bit floating point values. The first of those bits is the sign (tells you whether the number is positive or negative), the next 5 are the exponent, and the last 10 are the significand (the number that gets modified by the exponent). So an example of an FP16 value is 1011010101010100
Broken up into its parts, that's 1 01101 0101010100
And converted into decimal it's -(1 + 340/1024) x 2^(13-15), which is about -0.333
Q4 stores the same number as a 4 bit integer, where the first bit is still the sign and the next 3 are the significand, with no exponent. So each weight gets saved as an integer between -7 and 7 (real Q4 formats also store a shared per-block scale factor on top of that).
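That decoding can be checked mechanically. A small decoder for standard IEEE 754 half precision (sign bit, 5-bit exponent with bias 15, 10-bit fraction):

```python
# Decode an IEEE 754 half-precision (FP16) bit pattern by hand.
# For normal numbers: value = (-1)^sign * (1 + frac/1024) * 2^(exp - 15).
def fp16_from_bits(bits):
    sign = (bits >> 15) & 0x1
    exp = (bits >> 10) & 0x1F
    frac = bits & 0x3FF
    if exp == 0:                       # subnormals: no implicit leading 1
        val = (frac / 1024) * 2 ** -14
    elif exp == 0x1F:                  # all-ones exponent: inf or NaN
        val = float("inf") if frac == 0 else float("nan")
    else:
        val = (1 + frac / 1024) * 2 ** (exp - 15)
    return -val if sign else val

# The post's example pattern, 1 01101 0101010100:
print(fp16_from_bits(0b1011010101010100))  # -0.3330078125
```

Here the exponent field is 13 and the fraction field is 340, so the value is -(1 + 340/1024) x 2^(13-15).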
>>102219459
Sure, but people made it sound like big models are not great or that small models are somehow not that bad, when they are actually very, very bad. Like 123B might not be perfect for a lot of tasks I throw at it, and people are right to be critical, but it still does far more than the small models, and I suspect people who do not believe that just have not tried enough things.
>>102219556
Honestly I might even say much more than 3x, but it's kind of hard to argue here when it's not clearly defined what multiplying intelligence concretely means. If we're talking about the sheer number of facts that an LLM knows, I would argue that it actually does feel not far from 10x more. But perhaps reasoning is not 10x more and it's closer to 3x.
>>102219729
>Mistral models are too overconfident
this is probably why they seem so dry for rp. the entire response is dedicated to what i typed; it has very little will to add something or do something random no matter what you do with samplers. tuning doesn't help either.
>>102219764
>even if you DRY it just finds new ways to rephrase the same stuff
that's what all of the rep penalty stuff does: it can't really change what a model wants to output, so the model just finds the closest substitute. i don't see xtc fixing that either
>>102219703
>plus the speed difference
With a single 3090 and 64 GB of RAM I haven't been able to get a 70B to run any faster than an IQ3_XS quant of Mistral Large. Both are about 0.5 to 0.7 tokens/second. What quant / settings are you using for your 70B?
>>102219924
1.4 t/s on Q3_K_S at 16k context (i only have 16gb vram). it isn't fast, but usable. largestral is def slow though, 0.6-0.7 t/s. you're probably losing speed from using an iq quant. look for the non-iq version of the same model; it should be faster. i think iq quants help mostly with smaller models, 70b seems to be just smart enough to not mess up most of the time without the extra help
>>102219924
nta you're responding to, but how many layers are you offloading and what backend are you using? I have a 6950xt (16gb) and 32 gb ddr5 and am still able to fit ~45 layers of IQ3_M at 16k context on koboldcpp rocm and get around 1-1.5 t/s (although prompt processing is still pretty dogshit). I'm also using flash attention and context shifting to ease wait times between regens.
>>102220069
>I'm also using flash attention and context shifting
you can't actually use these features together. even if you selected them, one is going to cancel the other. on cpu fa causes more lag, so if you're getting 1.5t/s, fa probably isn't being enabled at all but context shift works fine
>>102219608
>>102219657
>>102219703
>>102219764
>>102219865
123b at 4t/s is alright for me, it's just that it doesn't want to follow the context's writing style no matter how hard I try to steer it by banning words, samplers, etc.
Sometimes I want the text to read like a 12-year-old's diary with a poor vocab range and shaky tenses, sometimes I want it to read like a pretentious English major's writing sample. It does neither. Loli in the diary talks and writes like a college student regardless, because mistral.
I had an idea. What if I uploaded 2 finetunes for nemo (Q8) and didn't say what finetunes they are, then made 3 polls: poll 1, which model is better; polls 2 and 3, which exact model each one is. What would happen?
(skipping the part where nobody is gonna do the experiment)
>>102220166
i'm pretty sure it's context quanting and ctx shift that don't work together on kcpp, don't think i've heard of fa blocking it
>>102214644
Yes, I do. Are the popular open source implementations like open webui that claim to do RAG SERIOUSLY not packaging it with the nifty vector db stuff? Is it SERIOUSLY just "trigger it and we silently paste the document into the prompt"? Because that would have sounded weak to me even a year ago. I've been meaning to fight my way through the unusable dockerbloat bullshit and give open webui a try just for RAG... but I am perfectly fine pasting my own documents into my own prompts if that's all it is.
>>102220132
Huh? What model do you use that actually follows the context writing style?
>>102220166
it says so in the ui
you would notice the speed hit on cpu too
>>102220180
>Is it SERIOUSLY just "trigger it and we silently paste the document into the prompt"?
Yes, yes it is. No post-processing or store-retrieval optimizations, nothing.
I fully believe that storing more and more information in models and making them ever larger is not the answer.
Providing a framework for these things to work within is.
>>102220207
yeah, ctx quanting requires fa on and ctx shift off; it doesn't say fa blocks ctx shift
also never used the ui so i wouldn't know about the tooltips
>>102220186
L3 storywriter > old CR+ > largestral. For smarts it's the other way around, of course
>tfw 3-4 t/s drops to 1-2 t/s when I get to 20k context
Ahhhhhhhhhhh
>>102220254
Interesting, what quant?
>>102220288
How the fuck do you not kill yourself having to wait MINUTES for generation to complete?
>>102220290
quantizing the kv cache at all requires more processing power. you won't notice when everything is in vram because it's so fast anyway, but on cpu it takes MORE processing power, so it actually slows down your already slow t/s. unless you're trying to squeeze some more context out of vram at the edge of what you have, you shouldn't use fa at all
>>102220290
but what i'm saying is that you don't have to quant to use fa, and as such ctx shift seems to work. there's no mention of fa blocking anything in the help:
>--quantkv [quantization level 0/1/2] Sets the KV cache data type quantization, 0=f16, 1=q8, 2=q4. Requires Flash Attention, and disables context shifting.
>--flashattention Enables flash attention.
>--noshift If set, do not attempt to Trim and Shift the GGUF context.
only quantkv blocks stuff
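for reference, the memory --quantkv buys back can be ballparked with the usual kv cache formula. the dimensions below are assumptions for a 70B-class model with GQA (80 layers, 8 kv heads, head_dim 128), and the per-block scale overhead of the quantized formats is ignored:

```python
# Back-of-envelope KV cache size:
# 2 (K and V) * layers * context * kv_heads * head_dim * bytes per element.
def kv_cache_bytes(n_layers, n_ctx, n_kv_heads, head_dim, bytes_per_elt):
    return int(2 * n_layers * n_ctx * n_kv_heads * head_dim * bytes_per_elt)

# f16 = 2 bytes/elt, q8 ~ 1, q4 ~ 0.5 (ignoring block scales)
for name, bpe in [("f16", 2.0), ("q8", 1.0), ("q4", 0.5)]:
    gib = kv_cache_bytes(80, 16384, 8, 128, bpe) / 2**30
    print(f"{name}: {gib:.2f} GiB at 16k context")
```

so for a model like that, f16 cache at 16k is ~5 GiB and q4 cuts it to ~1.25 GiB, which is why it's only worth it when you're vram-starved.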
>>102220163
First poll is the only one that matters if the final results can be trusted. The others are vectors for advertising and useless speculation.
>>102220288
I multitask. It does suck, but nothing to kill oneself over.
>>102220340
I'm assuming you're not using it for porn, then?
Fair enough.
>>102220207
>>102220069 (me)
Yeah, unless I'm misunderstanding what context shifting (whenever a prompt gets processed I don't have to reprocess the entire context when regenerating, unless I alter the already processed prompt) and flash attention (optimizing the memory footprint of long contexts) actually do, then I'm pretty sure? that it works on my machine.
>>102219690
>language can be improved.
Like calling everyone they instead of him or her.
>>102220353
Have an original thought for once, hylic.
>>102220348
>context shifting (whenever a prompt gets processed I don't have to reprocess the entire context when regenerating unless I alter the already processed prompt) and flash attention (optimizing the memory footprint of context lengths)
That is what those do, yes.
>>102220373
Oh, you were talking from the perspective of someone trying to get off. OK, yeah, I understand how it is for you. In that case, maybe I'd try switching to a 12B or something that's just permanently loaded in RAM. IIRC even on CPU that model size is still fast. Or maybe a 7B would do as well.
>>102220373
12B models work fast enough on my end with just 8GB of VRAM.
I should try out some 20B models, now that I think about it.
What happened to cheap V100s?
Every time I search ebay out of morbid curiosity they just keep getting more expensive.
>>102220387
Speed will be much lower, and there really aren't any good models between 10 and 70B.
>>102220395
2 more months, surely
>>102220282
Q6, Q4_K_M, IQ4_XX4 respectively when I used them
>>102220395
Don't worry, the AI bubble will pop any moment now!
>>102220348
you have it right. that 'processing prompt' step where it reads everything usually only needs to be done once, so you can generate like 10 swipes without redoing that step. it works great until you get to lorebooks and rag
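toy illustration of why that is (token ids are made up): the backend only has to re-evaluate tokens after the longest shared prefix of the cached and new context, so swipes that just append are nearly free, while a lorebook entry injected early in the context invalidates everything after it.

```python
# Only tokens after the longest common prefix of cached vs new context
# need re-evaluation; everything before it is reused from the prompt cache.
def tokens_to_reprocess(cached: list[int], new: list[int]) -> int:
    shared = 0
    for a, b in zip(cached, new):
        if a != b:
            break
        shared += 1
    return len(new) - shared

cached = [1, 5, 9, 42, 7]        # what the backend already evaluated
swipe  = [1, 5, 9, 42, 7, 13]    # a swipe only appends, prefix unchanged
edit   = [1, 5, 8, 42, 7, 13]    # early edit invalidates everything after it

print(tokens_to_reprocess(cached, swipe))  # 1
print(tokens_to_reprocess(cached, edit))   # 4
```

(kobold's context shift additionally trims the oldest tokens off the front when the context fills up, which is a bit more involved than this, but the reuse idea is the same.)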
>>102220395 (Me)
>>102220405
>>102220417
I think the issue is "entrepreneurs" scooping up all the cheap ones in order to cobble together shit like this to sell to morons with too much money. Yours for the low low price of 17000 USD.
>>102220069
Just tested again with llama.cpp b3581 CUDA version, Windows 11. Flash attention enabled. Mmap disabled. I have DDR4 instead of DDR5. Using a Q4_K_M quant of a Miqu derivative with 16k context. Temperature, min-p, and repetition penalty enabled.
# layers offloaded vs tokens/second (3 trials each, prompt processing excluded):
1 layer: 0.54, 0.56, 0.54 t/s
10 layers: 0.63, 0.63, 0.62 t/s
20 layers: 0.73, 0.73, 0.73 t/s
30 layers: 0.87, 0.86, 0.86 t/s
40 layers: 1.08, 1.06, 1.07 t/s
45 layers: failed to load, cudaMalloc failed (disabling virtual VRAM seems to have some effect?)
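fwiw those numbers are consistent with per-token time being roughly linear in how many layers stay on CPU. quick sanity check with a hand-rolled least-squares fit on the first trial of each row:

```python
# If per-token time is linear in CPU-resident layers, seconds-per-token vs
# layers offloaded should fall on a straight line. Fit y = a + b*x by hand.
data = {1: 0.54, 10: 0.63, 20: 0.73, 30: 0.87, 40: 1.08}  # layers offloaded -> t/s

xs = list(data)
ys = [1.0 / tps for tps in data.values()]  # seconds per token
n = len(xs)
mx, my = sum(xs) / n, sum(ys) / n
b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
a = my - b * mx
print(f"time/token ~ {a:.2f} {b:+.4f} * layers_offloaded (seconds)")
```

the slope comes out around -0.023 s per layer moved to GPU, so extrapolating toward a full ~80-layer offload would put you well under a second per token, which matches why full-vram setups are so much faster.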
>>102220628
>>102220628
>>102220628
>try to build a github project with python
>doesn't work
I FUCKING HATE THIS PIECE OF FUCKING SHIT GARBAGE LANGUAGE
HOLY SHIT WHO DESIGNED THIS PIECE OF CRAP?
WHY THE FUCK ARE THERE FOUR COMMANDS BUT TWO HAVE A RANDOM FUCKING NUMBER ATTACHED TO IT
WHY ARE THERE MODULES MISSING WHEN I AM EXPLICITLY INSTALLING THEM
ANSWERS TO THESE QUESTIONS AND MORE FUCKING NEVER BECAUSE NO ONE FUCKING KNOWS NOR CARES
FUUUUUUUUUUUCK
>>102220679
you are not alone, i also find dealing with python a massive pita, to the point i avoid it whenever possible, even when it means missing out on something that looks interesting. dealing with python usually isn't worth it
>>102220702
Yeah, same.
I saw something very cool so I decided to attempt it nonetheless, but nope.
I am so, so tired of Python.
>>102220637
>>102220069 (me)
strange, I use rocm and windows 10, so there may be some differences with how much ram W11 uses compared to W10, or maybe something with offloading to cuda that I'm unaware of
>>102220752
Massive skill issue
>>102220835
OH YEAH? THEN WHY DOES EVERY OTHER FUCKING LANGUAGE JUST WORK, HUH?
YOU TELL IT TO DO A THING, IT DOES THE THING
IT BITCHES ABOUT SOMETHING MISSING, YOU INSTALL IT, IT FUCKING WORKS
BUT NOOOOO, PYTHON NEEDS TO BE SPECIAL
WELL FUCK YOU AND FUCK YOUR SPECIAL NEEDS LANGUAGE