>>101836927
I'm the one who made that plot.
It's a simple matplotlib plot: I just ran the llama.cpp llama-cli binary with varying numbers of GPU-offloaded layers and wrote down the reported performance (the specific numbers are a bit outdated).
Full source code:
#!/usr/bin/env python3
import numpy as np
import matplotlib.pyplot as plt
# Highest --n-gpu-layers value tested for each model.
MAX_LAYERS_7B = 33
MAX_LAYERS_13B = 41
# Reported tokens/s for each --n-gpu-layers value from 0 up to the maximum.
TS_7B = np.array([
9.99, 10.17, 10.51, 10.73, 11.13, 11.50, 11.90, 12.37, 12.81, 13.16,
13.69, 14.20, 14.91, 15.53, 16.20, 17.01, 17.72, 18.65, 19.65, 20.52,
22.02, 23.43, 25.15, 26.83, 29.50, 32.10, 35.36, 39.70, 44.52, 50.88,
59.63, 72.04, 90.87, 132.06
])
# Fraction of layers offloaded to the GPU, from 0.0 to 1.0.
X_7B = np.arange(MAX_LAYERS_7B + 1, dtype=float) / MAX_LAYERS_7B
TS_13B = np.array([
5.01, 5.17, 5.24, 5.44, 5.58, 5.69, 5.78, 6.03, 6.17, 6.33,
6.45, 6.65, 6.88, 7.07, 7.29, 7.41, 7.58, 8.06, 8.82, 9.19,
9.59, 10.02, 10.48, 10.92, 11.35, 12.01, 12.55, 13.37, 14.02, 15.21,
16.12, 17.61, 19.12, 20.93, 23.02, 25.54, 28.82, 32.97, 38.86, 47.09,
59.55, 79.81
])
X_13B = np.arange(MAX_LAYERS_13B + 1, dtype=float) / MAX_LAYERS_13B
plt.plot(X_7B, TS_7B, label="7b q4_0")
plt.plot(X_13B, TS_13B, label="13b q4_0")
plt.legend(loc="upper left")
plt.xlabel("Proportion of GPU-accelerated layers")
plt.ylabel("Generated tokens / s")
plt.xlim(0, 1)
plt.ylim(0, 140)
plt.title("RTX 3090, Ryzen 3700X, 3200 MHz RAM")
plt.savefig("ts_rtx3090.png", dpi=240)
plt.show()
Nowadays a better way to do it would be to use the llama-bench binary.
You can have it write the results as CSV, JSON, or SQL, which you can then read in with Python instead of transcribing numbers by hand.
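A minimal sketch of that approach, assuming a llama-bench invocation like ./llama-bench -m model.gguf -ngl 0,16,32 -o csv > bench.csv (the exact flags and column names can differ between llama.cpp versions, and the sample string below is just an illustrative stand-in for the real file):

```python
import csv
import io

# Stand-in for the contents of bench.csv; the column names here
# (n_gpu_layers, avg_ts) are assumptions - check your version's header.
sample_csv = """n_gpu_layers,avg_ts
0,9.99
16,16.20
32,90.87
"""

# In practice: rows = list(csv.DictReader(open("bench.csv")))
rows = list(csv.DictReader(io.StringIO(sample_csv)))
ngl = [int(r["n_gpu_layers"]) for r in rows]
ts = [float(r["avg_ts"]) for r in rows]
```

From there ngl and ts can be fed straight into plt.plot() like the hand-typed arrays above.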