/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102524339 & >>102513868

►News
>(09/24) Llama-3.1-70B-instruct distilled to 51B: https://hf.co/nvidia/Llama-3_1-Nemotron-51B-Instruct
>(09/18) Qwen 2.5 released, trained on 18 trillion token dataset: https://qwenlm.github.io/blog/qwen2.5/
>(09/18) Llama 8B quantized to b1.58 through finetuning: https://hf.co/blog/1_58_llm_extreme_quantization
>(09/17) Mistral releases new 22B with 128k context and function calling: https://mistral.ai/news/september-24-release/
>(09/12) DataGemma with DataCommons retrieval: https://blog.google/technology/ai/google-datagemma-ai-llm

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/llama-mini-guide
https://rentry.org/8-step-llm-guide
https://rentry.org/llama_v2_sillytavern
https://rentry.org/lmg-spoonfeed-guide
https://rentry.org/rocm-llamacpp
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://hf.co/spaces/mike-ravkine/can-ai-code-results

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp
>>102535977
fuck your anime garbage
>inb4 anime site
kys
►Recent Highlights from the Previous Thread: >>102524339

--Papers:
>102527814
--Quantization turns floats into ints, smaller quant = faster and lower VRAM:
>102531358 >102531397 >102531399
--JavaScript code to linkify greentext quotes in threads:
>102525946 >102526315 >102527428 >102530573
--Discussion on the timeline, definitions, and challenges of AGI and ASI:
>102525653 >102525785 >102526071 >102526499 >102526932 >102527674
--Working on a userscript to treat different symbols as quotes:
>102525273 >102525299 >102525393 >102525519
--Custom Floating-Points for LLMs, but results may be model-specific and lack statistical significance:
>102531991 >102532241 >102532256
--Choosing Lora rank and alpha, understanding Loras, mathematicians vs. engineers:
>102532942 >102533148 >102533195 >102533307 >102533287 >102533302 >102533338 >102533423 >102533538 >102533570 >102533627 >102533559 >102533602 >102532973 >102532996 >102533100
--Anon shares concerns about cloud AI services logging and safety:
>102531752 >102531956 >102532242
--Nvidia's Llama-3_1-Nemotron-51B-Instruct model discussion:
>102524761 >102524862 >102525022 >102525041 >102525306
--Choosing between Aphrodite, vLLM, and llama.cpp based on hardware constraints:
>102529626 >102529816 >102529861 >102529888
--Request for de-slopped Llama3 405B tune for AMD MI300x:
>102525233
--Chromebook insufficient for serious AI inference:
>102530210 >102530784 >102530860 >102532235
--Anon praises Florence2 multimodal architecture:
>102532950 >102533001 >102533091 >102533288
--Miku (free space):
>102524999 >102525111 >102525197 >102525335 >102527437 >102532158 >102533585 >102534092

►Recent Highlight Posts from the Previous Thread: >>102524347
any abliterated Qwen2.5 72B?
it refuses to do certain wholesome and family friendly things.
>>102535991
I want to fuck the anime girl.
Happy autumn, /lmg/
Thanks but I'm still sticking with Command-R v01
>>102536036
Why are her eyes empty?
>>102535991
anime site
>>102535999
Can an AI create that image you attached to the post, or does a human still have to make that manually?
>>102535991
>fuck your anime garbage
OP needs an image to start the thread. What would you put there instead if you had your druthers?
Honest question, I'm curious.
(I'm assuming an actual answer and not a content-free negative eg "not anime")
Is Autumn the season of hibernation?
>>102536081
There's a body under the leaves and you are too close.
>>102536127
neither, just a normal script does that part
>>102536036
rake teto
>>102536036
Isn't it too early?... Oh, wait, it's almost October already!
>>102536036
tetoctober soon
>>102536179
>fall red eyes
I like this Miku
>>102536128
lecunny. dead body of sama. llama. graph from an llm paper. model card screenshot.
>12 channel DDR5-6400
>or 12 channel MRDIMM-8800
>PER SOCKET
epycbros I don't feel so good..................
>>102536272
>lecunny
Yes, because we should revolve the entire general around a person that spends all day being passive aggressive on social media.
>dead body of sama
blue board
>llama
Meta will eventually turn against open source. They all do.
>graph from an llm paper. model card screenshot.
Mostly empty image that no one will spend any time looking at. If you want to be a pseud and virtue signal your intelligence, you can fuck off back to r-ddit.
Anime. Website.
teto
>>102536237
It's a meme, bro, you won't even get half of the speed
>>102536277
You are just arguing for the sake of arguing. All of those options are more on topic than hatsune miku(male). And that makes all of them better.
>>102536349
Is this Academic bullying?
>>102536215
>dead body of sama
kino
i think I've seen all the others in past threads, though
Thanks for the honest answer. I personally like variety in the OP, especially when the image is riffing off of a previous thread's conversations/news. Vocaloids make good stand-ins for pretty well any scenario, so I think that's partly why they get used (along with the whole virtual idol/AI thing being appropriate to LLMs)
I am still curious as to why you hate anime images so much, though. I'm sure there's some other thing out there that would annoy me in the same way, but your reaction still puzzles me
>>102536237
Israel isn't a real country.
>>102536349
free energy not bullying
>>102536308
>It's a meme, bro, you won't even get half of the speed
like epyc? 30% cheaper and 50% slower (at least on any hard math, specifically atan2) kek
reminder anthracite spent all their money on failed finetunes and now can't even pay their shills
why aren't there ever Macross references in this thread? There are lots of other famous AI type shows out there
I know we get some Blade Runner images sometimes, but it feels like there's a lot of other fertile ground that's being ignored
>>102536215
>no anime
>model card screenshot
wut
>>102536334
>miku is off topic
>but let me sperg out about her and bump the thread
double wut
>>102535991
I’d just like to interject for a moment. What you’re referring to as Anime, is in fact, a Vocaloid, or as I’ve recently taken to calling it, ボーカロイド.
Reminder that you said you were trying to quit trolling, Evan.
>>102536355
I just hate the local troons that gather around that image. I actually like miku songs.
>>102536366
I like magnum-123B and will willingly shill it for free.
>>102536237
>epycbros
I've gotten enough good use out of my dual EPYC build for a relatively cheap build price that I've got no regrets
Tech marches on. Glad Intel is putting something better out there. Hopefully there's some way to get ahold of it for less than 6 figures.
Good luck to future cpumaxxers!
i am starting to feel like taking a blacked miku shit...
calm down cuda dev
>>102536407
>local troons
This isn't actually a thing is it? I haven't seen non-troll tranny references in years in this general
>>102536355
He is just butthurt that there are a few VRAM chads itt that like genning mikus
He's been spilling spaghetti all over this general for a while now, see >>102525042 >>102525071
>>102536460
>in years in this general
this general is only a year old retard
How much data do I need to make a finetune worth anything?
>>102536472
3
>>102536460
It is and one mikufaggot here was doxing people when he thought it was one of the anti-miku trolls.
>>102536416
it was a bit of a shock to come back to this site after a decent number of years and see all this obsession over trannies in any thread over the slightest thing. when no one used to talk about them at all.
>>102536482
Is there a unit to that, or is data a dimensionless quantity?
Bootstrap. Iterate. Bootstrap. Iterate. And one day we get ASI. Meanwhile Lecun basically admitted defeat saying it's too hard. Hope I can see his AGI cat in ten years.
>>102536494
>when no one used to talk about them at all
Because they were rightly ridiculed and called out for being mentally ill instead of having extra privileges.
>>102536511
yes
>>102536511
3
>>102536519
Do not care. Not your personal army. Fuck off.
>>102536472
as few as 1000 samples can get you a meaningful result, as per the lima paper. more is better though, ideally you should do as much as you can as long as your data is all good quality. high number of good quality samples > low number of good quality samples > high number of low quality samples
>>102536517
JEPA will R U I N you.
I can't wait to see your reaction.
>>102536578
JEPA is vaporware
>days have passed
>he's still upset that I accurately pointed out all mikuposters are pedophiles
seethe harder nonce
>>102535991
>inb4 anime site
Correct. Now go kill yourself, reddit troon
>>102535977
>70B-instruct distilled to 51B
What do I need to imagine here exactly, 70b TYPE quality in the form of 51b? I guess that would be more ideal at Q6-8 or whatever instead of 70b at Q4?
>>102536804
imagine lobotomized slop
>>102536816
Google's managed to do distillation surprisingly well.
>>102536804
It's a sort of calibrated type of pruning followed by knowledge distillation, so it should come pretty close, in theory at least, although probably not on every domain.
gpt voice is rolling out for real this time
>>102536823
Interesting idea to say the least, the question is how well it worked out for them.
>>102536834
I set up gemma2 with Whisper, a shell enabled dialog engine, and Festival and I can say without a doubt that it is the most frustrating, clumsy way to use a computer.
Fucking toggling in machine code on a switch panel would be more ergonomic.
Are the new CR's that bad?
>>102536885
Probably about the same as every other distillation attempt. Great on benchmarks, but retarded for any practical usage.
>>102536895
>Are the new CR's that bad?
They lost their only differentiating factor when they started chasing benches with slop datasets. They don't have the same soul that some could squeeze out of the first gen.
>>102536895
The CR+ refresh is a side grade at best
>>102536895
Cohere is dead. After what has happened to Mistral and the Chinese models, the only hope for local at this point is that Jamba 2 can actually carry its (literal) weight.
>>102536986
>Israel is our only hope
how many months has the Jamba 1 pull request been festering?
>>102536908
>great on benchmarks
what do I buy to invest in this brilliant model!?
>>102536535
You sound like a troon. Their main strategy is trying to make people ignore them as they take more and more power.
anyone complaining about anime here is a newfag who desperately needs to go back
it's specifically the mikuposters who need to face the wall
>>102537316
>complains about anime
>>102537289
>Not my personal army!? YOU SOUND LIKE A [buzzword]!
And you sound like a child. You need to be 18 to post here, also not your private army, faggot.
Slow day huh?
>>102537405
>>102536366
>102537390
miku is the vocaloid mascot, newfag. go back.
>>102537390
>replying to the resident schizo's schizobabble
>>102537405
slow month, slow year
not looking good for local models
it's actually over this time huh
good
>>102537625
it was over the moment troons took control of this place. current state of it was just a matter of time.
>102535999
Fix the fucking references.
the poster above me needs to go back
the poster above me needs to kill himself
In retrospect, what went wrong?
the poster above me is really cute :3
Someone is desperate to derail this thread. They REALLY don't want us speculating about Meta's big announcement tomorrow. I wonder why? Who benefits from this...?
>>102537670
Faggot. Drummer.
>102537647
>>102532154 >>102478518
>tldr can't have more than 9 mentions now, probably cause of the "ever wonder why" poster
>102537625
>102537635
>102537654
>102537662
>102537668
>102537670
what's up with these gay ass posts? embarrassing.
>>102537714
oh i get it, miku shitter is mad because OP pic is not miku this time, lol
What is the prompting secret sauce so that characters know they can't look you in the eye while turned away, and other such things?
>>102537742
not using Magnum 12B
>>102537742
About 100b promptameters
https://x.com/OpenAI/status/1838642444365369814
What's a good llm to run with Sillytavern and simulate an online chat with my imaginary waifu?
Right now I'm running the NemoMix-Unleashed-12B-Q6_K.gguf which is kinda alright but some messages are really weird
Terrible voices
>>102537792
largestral
>>102537711So you are saying he is ban evading and trying to skirt the rules? Shouldn't some janny take care of this?
>>102537792
Try these
Mistral Small
Mistral Large
Hermes-3-Llama-3.1-70B
Gemma 2 27b
Midnight Miqu 70B
older version of Command-R
Mixtral 8x7B (fast on CPU for its size)
>>102537792
>>102537750 >>102537764
What's the solution if I'm poor?
>>102537850
>So you are saying he is ban evading and trying to skirt the rules? Shouldn't some janny take care of this?
resolving uncomfortable or difficult people issues through ham-fisted technological means is a tried-and-true method used by lazy managers everywhere. Bonus points if it makes the world worse for everyone else.
>>102537862
Buy more ram
>>102537861
That's just average local turd experience.
>>102537776
People bullshitting on fossjeet here: https://x.com/reach_vb/status/1838645845652332955
>>102537862
>What's the solution if I'm poor?
Assuming "dont be poor" is too far out of reach for you, then "be more patient" is typically the fallback.
Alternatively, you could also sell your soul to online services
>>102537813 >>102537856
Thanks, I'll check them out
>>102537862
Use a different 12b.
>>102537856
don't forget the mixtral 8x22b Wizard LLM finetune
So many things depend on hardware specs tho
>Model scopes for Vector Storage will be enabled by default in the next release. Opt-in earlier by setting enableModelScopes to true in the config.yaml file. This will require to regenerate stored vectors.
i enabled it with a previously made db and it didn't seem to regenerate. is this normal or am i expected to purge old ones first? usually this kind of migration stuff is automatic
Best base text continuation model for 40gb VRAM + 64gb RAM?
>>102538092
Mixtral
>>102538102
don't listen to this retard, download Magnum 2.5 Kto
I'm going to eat 7 hitlerbars
>>102538102
Mixtral is worse than even nemo.
>>102537856
I'd recommend Hermes 2 over 3. 3.1 can be strange.
I like slop
>>102538124
think of the children
>>102538194
qwenbro...
>>102538205
Sure they'll also get some hitlerbars
>>102538194
I don't mind it if the model is doing great otherwise.
if that's the price of not reading how a girl gives me a blowjob while kissing my lips softly, I'll gladly take it.
>>102538194
Go away woman. Fuck some chad.
I do not know what slop even is.
>>102538194
Slop likes you too :)
>>102538250
thank you :)
>>102538258
Look into the mirror anon
>>102537862
Install russian online super-RAM
>>102538194
based
>>102538296
What kind of weird potato is that?
>>102536237
Time for a tetomaxxing guide
>>102538321
the fluffy kind
If the models have a rolling window, why do they still go schizo when nearing the limit of the allotted context? Mind you, I am using 32k context size. Am I misunderstanding what a rolling window means when it comes to LLMs?
24 hours from now, Llama, and thus local, will be saved.
>>102538446
They only released 3.1 a month ago. 4 isn't coming until next year.
>>102538446
two more years
>>102538428
What model? If it's something like nemo it's only good to 16k. So try setting it to that if you're using context shifting.
Is there a way to let an llm search the web on its own if it realizes that it doesn't have enough information about a topic?
Let's say the cut-off date is 2023 and I'm asking "Tell me what happened in the year 2024"; the LLM will then give the answer and reflect that this is wrong or useless information and will perform a web search instead.
>>102538194
they hated him because he was the same as them
>>102538498
function calling
wait, you guys have a local schizo too? i thought that was just /sdg/ and /ldg/
>>102538194
good slop:
>half-lidded eyes
>shivers
bad slop:
>ministrations
>don't think this means anything, i still
>>102538446
Tacked on multimodal won't save anything.
>>102538573
local schizoid general
>>102538498
>Is there a way to let an llm search the web on it's own
Yes. read up on function calling.
>if it realizes that it doesn't have enough information about a topic?
They have no introspection. They don't know what they know, for a very generous definition of knowledge.
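To expand on function calling since it keeps coming up: the host loop, not the model, does the searching. A minimal sketch of the control flow — the JSON call format and the web_search tool are made up for illustration, and fake_model stubs out the actual llama.cpp/API call:

```python
import json

# Stub tool: in a real setup web_search would hit an actual search API.
TOOLS = {
    "web_search": lambda query: f"[search results for: {query}]",
}

def fake_model(messages):
    # Stand-in for a real completion call. A real model decides on its own
    # whether to emit a tool call; here the branch is hardcoded to show flow.
    if not any(m["role"] == "tool" for m in messages):
        return json.dumps({"tool": "web_search",
                           "arguments": {"query": "events of 2024"}})
    return "Based on the search results, here is what happened in 2024: ..."

def run(prompt):
    messages = [{"role": "user", "content": prompt}]
    while True:
        reply = fake_model(messages)
        try:
            call = json.loads(reply)   # a tool call comes back as JSON
        except json.JSONDecodeError:
            return reply               # plain text = final answer
        result = TOOLS[call["tool"]](**call["arguments"])
        messages.append({"role": "tool", "content": result})
```

The "realizes it doesn't know" part is the weak link: the model only emits the tool call if its training/prompting pushes it to, exactly because of the no-introspection problem.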
>loss going down
>eval loss going down
>epoch 0.5
I'm feeling it! This time I will make the best model ever.
>>102538573
Every general has resident schizos, simply the way it be
>>102538596
i'll use your model if it's 12b or under
>>102538596
i'll use your model if it's 70b or over
>>102538333
Checked
>>102538583
>ministrations
I've never even heard the word "ministrations" irl
>>102538583
>shivers
Used too much but not necessarily bad.
Another one is "a mix between"
>>102538664
me neither, and i'm a pretentious pseudo-intellectual sesquipedalian scrabble player
>>102538664
It is a word invented specifically for harlequin romance aimed at women.
>>102538641
try sonnet, you'll never go back to localcucking
>>102538775
this is /lmg/ retard
>>102538775
>$0 / month
>Access to Claude 3.5 Sonnet
What's the catch?
>>102538799
They train on your logs, also you get rate limited
>>102538793
>retard thinks local only applies to language models
this is your brain on 8k context
>>102538854
yes, to a local models general
now fuck off to your proxy before it croaks and you have to cook up another piss drinking video, faggot
>>102538854
>Local MODELS general
Kys shill
>>102538883 >>102538885
mald more, you will never have local gpt-4o capable AI.
>>102538908
r u sure
>>102538883
you're an absolute retard. you can use non-local text models with other local models like image gen and they are a million times better
>>102538908
>you will never have local gpt-4o capable AI
>ClosedAI so afraid of local that they're banning people for trying to reverse engineer a prompt
Back to /aicg/ little pajeet
>>102538922
>image gen
????
not a general for this either? did your mother drink excessively during pregnancy or something?
>>102538922
Then go to a thread for image gen, or aicg, not one meant literally for llm's
But how will I know when I get there?
And how will I know when to leave?
>>102538922
>a general dedicated to the discussion and development of local language models.
>>102538927
>afraid
funny headcanon
>Miku, get the locust spray
>>102538922
>you can use non-local text models with other local models
how?
>>102538940 >>102538941 >>102538958
discussing how trash local models are in comparison is discussion. you can't possibly be this dumb
>>102538959
>No mention of GPT5, No mention of Sora, No mention of GPTo with voice enabled
>Months of work for a COT finetune
>Btfo'd by Qwen in coding
>Btfo'd by Sonnet in literally everything else
>Sama seething on twitter
Kek.
>>102538995
SillyTavern with local TTS connected to claude for example.
>>102539010
nice fanfic
>>102539000
No faggot, you interrupted an actual discussion about local by saying just use Sonnet
>>102539010
>No mention of GPTo with voice enabled
r you blind? >>102537776
>>102539032
My bad, they delivered on one of their promises after months, OpenAI is back and Sama def wasn't dilating on twitter
>>102539030
it's the best solution. if you can't handle discussion on 4chan, try reddit, you can downdoot facts that infuriate you.
>>102539032
I can see fine, I just can't hear, so how could I know that retard?
i have a 6700 XT 12 GB and an i7-10700k. is there anything i can run decently local or do I need a nvidia gpu?
>>102539051
Openai's tech actually works though.
>>102539093
Look into the rocm koboldcpp build.
You can run nemo-instruct at a decent quant with a good amount of context.
>>102539057
Best solution for the tards at /aicg/
This is local models retard
the only schizo in this general is the schizo who calls everyone a schizo
magnum shills have been real quiet ever since anthracite ran out of money huh
>>102539118
>This is local models retard
oh the horror
>>102539154
Good riddance
>>102539154
Rocinantesisters we won
>>102539154
money for what? I thought they got all their compute undeserved
>>102538664
I have, it is mainly used in religious contexts.
>tard squad finetunes a shitty base
>makes it marginally better
>/lmg/ opens their wallets
>tard squad finetunes larger models
>/lmg/ realizes the dataset sucks
>pretends they never liked tard squad
happens at least 4 times per year
>>102539154 >>102539173
How does one (1) guy alone absolutely BTFO anthracite so much?
>>102539250
hi drummer
>>102539256
Hi Sao. I am not Sao. You are Sao.
>>102539269
unironically Drummer
>>102539219
They can't scam me if I don't have money to begin with.
What if the final solution model never comes? What if it's a perpetual state of new, slightly smarter, differently slopped models you can kinda enjoy for 2-3 roleplays before you see everything they tend to repeat and you can't take it anymore? And you will have to keep 100 models around to swap between to get different styles?
>>102539312
Go back to Pyggy and see if things have improved or not
>>102539312
That sounds like a you problem.
I can't make sense of this thread at all and I consider myself pretty knowledgeable about open source LLMs. What the fuck are y'all talking about?
>>102539323
The cooming plateau is here.
Saars in their natural habitat are funny.
Can I run Qwen2.5 32B 4.65bpw or even 5.0bpw on a 3090?
>>102539682
check filesize
>>102539693
4.65bpw is 20GB and 5.0bpw is 21.68GB, but context also takes some space so I'm not sure.
Is there any way to speed up context loading? At the cost of extra ram perhaps?
>>102539729
You need to enable turbo mode.
>>102539729
You need to download more FLOPS
>>102534097
Flux D 1.0
can i run this stuff on amd cards? rx 7900.
I dont want to train anything, just want to play around with image and chatbot.
I dont mind compiling stuff and digging through forum posts, but I'm not sure if this is a complete fool's errand
>>102540000
>>102539117
>>102540000
ye
i have a fun time with just 8gb on my shitty 4060 so you'll probably have a blast with your 16-20gb (or whatever) even if it's stinky amd
small question, if i want to make a lora of a model, does it need to be the pure safetensors file or can i use a gguf?
>>102539803
The Dota 2 Turbo mode?
>>102540090
The turbo mode you enable on your pc case.
Best model for erotic RP? I'm not sure what's the latest stuff
>>102540176
this one's my current favorite
https://huggingface.co/mradermacher/Arcanum-12b-GGUF/tree/main
it's not leaps and bounds over other nemo merges or anything though.
>>102540130
I only have a turbo mode on my gamepad, but it's connected to a pc. Does that count?
>>102536215
Image genned with a local model, what's the problem anon?
>model card screenshot
Ah so what you really want is free real estate to shill, fuck right off
>>102540239
back to cage >>>/a/nimal
>>102540196
>combining TheDrummer/Rocinante-12B-v1.1 and MarinaraSpaghetti/NemoMix-Unleashed-12B using a novel merging technique.
>novel merging technique.
Without a proper cooming testing methodology why does this mean anything?
>>102540176
mistral nemo / mistral small / mistral large
Biggest you can fit.
>>102540266
Checked
>>102535977
>>102540324
kek
>huggingface.co/gghfez/SmartMaid-123b-exl2
New largestral slop dropped
no fp16 weights???
>>102539312
imo I just need these models to have better spatial reasoning/world models
>>102540969
>maid
Undislop?
>>102540969
Buy an ad
>>102540267
>why does this mean anything?
it doesn't.
Small but commendable performance improvement on code generation: https://arxiv.org/html/2309.02772v3
On this topic, do you guys know about anything else that could improve code generation?
>>102541017
No thanks, rabbi. I think instead I'll post whatever the fuck I want.
>>102539312
things are about to accelerate
>>102541159
If I had to guess, newer models trained on datasets from COT models will probably increase coding benchmarks significantly.
>>102541543
pls tell me I won't have to work ever again and can instead live my life doing things I actually enjoy
>>102541713
That's communism
>>102541722
I'll take it as long as it doesn't turn into authoritarian garbage.
What's the most intelligent, creative, soulful model for RP currently?
>>102540049 >>102540079
cool thanks anons
>>102538583
the only good slop is the one ood
Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
https://arxiv.org/abs/2409.16167
>Low-Rank Adaptation (LoRA) has emerged as a popular technique for fine-tuning large language models (LLMs) to various domains due to its modular design and widespread availability on platforms like Huggingface. This modularity has sparked interest in combining multiple LoRAs to enhance LLM capabilities. However, existing methods for LoRA composition primarily focus on task-specific adaptations that require additional training, and current model merging techniques often fail to fully leverage LoRA's modular nature, leading to parameter interference and performance degradation. In this paper, we investigate the feasibility of disassembling and reassembling multiple LoRAs at a finer granularity, analogous to assembling LEGO blocks. We introduce the concept of Minimal Semantic Units (MSUs), where the parameters corresponding to each rank in LoRA function as independent units. These MSUs demonstrate permutation invariance and concatenation-summation equivalence properties, enabling flexible combinations to create new LoRAs. Building on these insights, we propose the LoRA-LEGO framework. This framework conducts rank-wise parameter clustering by grouping MSUs from different LoRAs into k clusters. The centroid of each cluster serves as a representative MSU, enabling the assembly of a merged LoRA with an adjusted rank of k. Additionally, we apply a dual reweighting strategy to optimize the scale of the merged LoRA. Experiments across various benchmarks demonstrate that our method outperforms existing approaches in LoRA merging.
might be cool
no code though so w/e
>>102541866
Same as "you reached context limit - enjoy OOM moment or extreme hallucinations".
>>102541824
Seconding this but it needs to fit onto 24 GB of VRAM without stepping below 8-bit quantization.
>>102541824 >>102542255
mythomax
>>102542255
No it needs to fit into 64G of ram
slop is soul and I'm tired of pretending it's not.
what's the best model for flirting with a venezuelan math teacher while I roleplay as a homeless black midget pretending to be a middle schooler?
>>102542298
Probably something by anthracite
>>102542290
>buckbroken
>>102542275
Thank you, Anon.
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
https://arxiv.org/abs/2409.16040
>Deep learning for time series forecasting has seen significant advancements over the past decades. However, despite the success of large-scale pre-training in language and vision domains, pre-trained time series models remain limited in scale and operate at a high cost, hindering the development of larger capable forecasting models in real-world applications. In response, we introduce Time-MoE, a scalable and unified architecture designed to pre-train larger, more capable forecasting foundation models while reducing inference costs. By leveraging a sparse mixture-of-experts (MoE) design, Time-MoE enhances computational efficiency by activating only a subset of networks for each prediction, reducing computational load while maintaining high model capacity. This allows Time-MoE to scale effectively without a corresponding increase in inference costs. Time-MoE comprises a family of decoder-only transformer models that operate in an auto-regressive manner and support flexible forecasting horizons with varying input context lengths. We pre-trained these models on our newly introduced large-scale data Time-300B, which spans over 9 domains and encompassing over 300 billion time points. For the first time, we scaled a time series foundation model up to 2.4 billion parameters, achieving significantly improved forecasting precision. Our results validate the applicability of scaling laws for training tokens and model size in the context of time series forecasting. Compared to dense models with the same number of activated parameters or equivalent computation budgets, our models consistently outperform them by large margin.
https://huggingface.co/Maple728
Only the smallest 50M model has been uploaded so far
https://github.com/Time-MoE/Time-MoE
300B timepoint dataset still to be released
>>102542314
>anthracite
the slop brigade? no thanks I don't want the model forgetting I'm a black midget every swipe.
Well now that's an interesting result. I was expecting a lobotomized model. It's certainly forgotten what an EOS token is, though.
L3.1-70B-Hanami seems good so far. 3.1 smarts but it seems to be breaking its dryness.
>>102542430
I seem to have created one of those man made horrors beyond your comprehension.
>>102542430 >>102542447
commaslop
>>102542430
Which model?
>>102542458
Some qlora I ran on Mistral-Small-Instruct
an experiment in using an extremely high dropout rate.
>>102542430 >>102542447
>She
>Her
>She
>Her
>She
>She
>Her
>She
You realize you're coming up on the 16k context limit. Do you:
1. Keep going, trusting that discarding the start of chat history will be fine
2. Switch to a lower quantization of your current model so you can increase the context without a big slowdown
3. Increase the context at the price of having to offload more to RAM, drastically slowing down
4. Summarize the chat and restart
5. Other (write your own)
I don't load my models with 16k context
>>102542492
lmao
Whisper in Medusa's Ear: Multi-head Efficient Decoding for Transformer-based ASR
https://arxiv.org/abs/2409.15869
>Large transformer-based models have significant potential for speech transcription and translation. Their self-attention mechanisms and parallel processing enable them to capture complex patterns and dependencies in audio sequences. However, this potential comes with challenges, as these large and computationally intensive models lead to slow inference speeds. Various optimization strategies have been proposed to improve performance, including efficient hardware utilization and algorithmic enhancements. In this paper, we introduce Whisper-Medusa, a novel approach designed to enhance processing speed with minimal impact on Word Error Rate (WER). The proposed model extends the OpenAI's Whisper architecture by predicting multiple tokens per iteration, resulting in a 50% reduction in latency. We showcase the effectiveness of Whisper-Medusa across different learning setups and datasets.
https://github.com/aiola-lab/whisper-medusa
kind of cool. the medusa-block is probably the one to use.
>>102542430 >>102542447
Also here's an old forgotten arxiv paper on it. https://arxiv.org/abs/2403.00946
In their experiment they used 90+% dropout, but that was back when finetuning was still done layer by layer I think.
I tried 90% at first but it was instant lobotomy so I dropped both the learning rate and the dropout to 75% and yeah... I think the results are interesting and worth exploring further.
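For anyone wanting to replicate: in a peft-based QLoRA setup the experimental knob is just lora_dropout, which normally sits around 0.05-0.1. The rank, alpha and target modules below are placeholder values, not the anon's actual config:

```python
from peft import LoraConfig

# High-dropout variant of an otherwise standard QLoRA config.
# 0.75 is the value that reportedly worked; 0.9 was "instant lobotomy".
# r / lora_alpha / target_modules are guesses for illustration only.
config = LoraConfig(
    r=64,
    lora_alpha=64,
    lora_dropout=0.75,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```

Remember to lower the learning rate alongside it, like the anon did.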
You realize the woman you're having sex with is actually a man. Do you:
1. Keep going, it's too late to take it back anyway
2. Switch to your hand so you don't get blue balls
3. Kill him and hope your state has gay panic laws
4. Bend over and give him a turn
5. Other (write your own)
>>102542586
Qwen2.5
>>102542492
Summarize and keep going until the model's stupidity drives me insane and I do something else for a few days.
sovl
Interesting. It seems to have undone Arthur's cook-in of the correct answer.
>>102542492
>all that to get 10% of a cloud model's power, at most
>>102542492
Keep going; I only summarize when the current scene has run its course.
>qwen 2 vl says retarded things
2.5 vl when?
>>102542740
A cloud model is useless because you're at a corporation's mercy. Nobody here is interested in your shilling.
>>102542492
2. Increased to 32k context and now I'm going at 2.5 tokens per second.
>>102542756
Fuck, I mean 3. Undo.
>>102542740
No shilling, just telling it as it is: you will never have anything usable with these toys.
>>102542724
localbros, how do we respond without sounding mad?
>>102542789
Keep making "ahh ahh mistress" one-message tests, I guess?
Would a Q2 Qwen 72B be better at programming than a Q5 Gemma 27B?
>>102542756
There is no undo. Face the consequences of your actions and take responsibility.
>>102542492
Due to the limitations of local, I tend to keep my roleplays episodic in nature while keeping the overarching themes intact via either RAG or lore book maintenance. Option 4 is perfect in that regard. For programming or more serious, "normie-friendly" projects where my own ideas or privacy don't matter, I always opt for cloud.
>>102542820
It's over. After 19730 tokens of context a switch flipped and the model repeated its last reply like a broken robot in a TV show. Zero regens until now. Zero edits until now. Is my run over? I guess I should have done >>102542636 >>102542830
>>102542851
Claude doesn't have that problem.
Gemma doesn't know how to make ASCII art.
What model can do decent ASCII art?
>>102538573
Don't forget /aids/, they have multiple.
I'm writing a small script (https://github.com/battleprogrammershirase/BUERgence) to quickly narrow down the best inference parameters for llama.cpp. Right now I'm only testing -t and -ngl, since these seem to have the biggest impact on performance. Are there any other parameters I'm missing out on, especially as a VRAMlet?
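Batch size is one obvious extra knob, and mmap behavior can matter on low-RAM boxes. A minimal sketch of how a grid sweep over those could be wired up; the -t / -ngl / -b flag names are assumed from llama.cpp's llama-bench tool, so verify them against your build before trusting the commands:

```python
import itertools
import shlex

def sweep_commands(model_path, threads, ngl_values, batch_sizes):
    """Build one llama-bench invocation per parameter combination.

    Flag names (-t, -ngl, -b) are assumed from llama.cpp's bench tool;
    adjust if your build differs. Run each command, parse the reported
    tokens/s, and keep the best combination.
    """
    cmds = []
    for t, ngl, b in itertools.product(threads, ngl_values, batch_sizes):
        cmds.append(f"./llama-bench -m {shlex.quote(model_path)} -t {t} -ngl {ngl} -b {b}")
    return cmds
```

You'd feed each command to subprocess.run and rank by the benchmark output rather than eyeballing runs by hand.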
>>102542830
I dislike lore books because they can't affect the first message where their keyword appears if the keyword was in an AI response. This problem isn't theoretical for me; it was an actual case in my last chat, using a Monster Girl Encyclopedia lore book I was trying to improve. While talking about something else, the model started going on about werewolves and weresheep, because they're things that reasonably could (and do) exist in the setting, and it wrote a bunch of stuff that contradicted MGE lore.
Not-great solution: when a new lore book entry would be triggered by the newest AI post, immediately regenerate it with the additional entry.
My solution: stop caring about MGE lore, because it's bland and many of the descriptions are the same thing.
>>102542492
5. Thank God that He created me with both the intelligence and the drive to not be a poorfag.
>>102540239
These Tetos are always interesting to admire.
>>102542851
Yeah, it's falling apart. The regen was fine, the next message questionable, and the one after that was back to the time loop. RIP. I guess the adventure is done. Even if I switch to a more powerful model, the writing style and ideas of how the story should work won't be the same. It will be like someone else took over all of a sudden. Maybe I can delete all the example messages to get another 1.2k of context to try to limp along to a conclusion, but around 19k tokens looks like the limit for Mistral Small.
>>102542513
Even if you set it longer, recall isn't as good past 16k for a lot of models.
I hate to say it, but I think I might really go back to Wizard or some 8x22B after all. Mistral Large is too slow for me, and Mistral Small and Nemo are too dumb. I haven't checked out Sorcerer yet; maybe I'll try it. Miqu 2 when?
>>102535999
>florence is amazing!
>not a single example in the thread
blegh
>>102543037
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery
https://arxiv.org/abs/2409.05591
https://github.com/qhjqhj00/MemoRAG
>>102543295
it's happening?
>>102543295
Why does the graphic go from bottom to top?
>>102543206
Screenshot of the full log.
>>102542586
I would not have sex with someone I don't already know deeply and love.
>>102541866
How long is the limit?
>>102543472
45 minutes per day
>>102543285
>>102543519
That's not too bad, assuming they don't count silence, so you can actually use it like they advertise, as an always-on tool.
>>102543498
Cool, is there a comparison for Florence2?
>>102543562
https://desuarchive.org/g/thread/101749053/#q101750118
https://desuarchive.org/g/thread/101749053/#q101750162
https://desuarchive.org/g/thread/101749053/#q101750228
The guy who originally posted it said Florence was Florence-2-Large-ft.
>she Xs, her eyes Ying
>Tuesday is over
Bet. A new model will release today that is not from Meta.
>>102543519
No, silence counts too. A redditfag claimed he needed to take a phone call, so he muted himself in the ChatGPT app, but it still counted down the minutes. 45 min per day is still more than I thought; I was expecting something like 15 min per day.
The main problem is people are complaining everywhere about how they get the "my guidelines won't allow me to talk about that" response CONSTANTLY, even for work-related stuff. And apparently its output is being gimped even harder since the initial rollout, so even fewer imitations/effects. The funniest part is normies STILL get their own voice cloned or hear an unrelated 3rd voice. "Scary." lol
I hope somebody comes along who just doesn't give a shit. Just release it and let people figure it out. MS Paint would not have been released in 2024. First reactions: what if somebody drew child genitalia with it?!?! Totally normal behavior from these SF freaks.
>>102543667
catbox the uncensored version
>>102543677
Wow, I thought it'd be funny if I jinxed it, but that's crazy if true. Like, what the fuck, what a scam.
>>102543726
The pixiv link is right there, anonymo- Wait, that's a 4chanx feature, isn't it. Just install 4chanx, bro; you're going to save yourself a lot of trouble in the future.
>>102543230
>Miqu 2
Looking back on it, miqudev was the most based person to ever grace this general. We may never see his like again...
>>102539312
Improvements are constantly being made, though they are primarily refinements. I think the next big leap will be when they solve catastrophic forgetting. Once they do, it will be all about continuous learning, and years of refinement will be done on that. We have no need to rush; AI isn't going anywhere anytime soon.
>>102543726
Don't do what the other anon said; never install 4chanx if you can help it.
This might be a retarded idea, but why can't we add user feedback in ST for discarded gens and preferred gens, and use that as a dataset to train a custom little reward model that would later be used to prefilter the next generations? I think CAI did something like that before.
>>102543726
Install 4chanx but leave it disabled.
>>102544223
t. regularly gets filtered
>>102544261
>>102544249
>why can't we add user feedback in ST for discarded gens and preferred gens, and use that as a dataset to train a custom little reward model that would later be used to prefilter the next generations
You are asking why can't we make local models not local? I dunno, anon... But yeah, it is a great idea you could ask locusts to do. I am sure they can make an extension for that or something.
>>102544476
Are you retarded? Everything I said can be done locally.
>>102544450
updates... doko...
>>102544450
that face...
>>102544249
Why do you need feedback? Just delete any gens you don't like and the jsonl is now your dataset.
>>102544489
>Are you retarded?
Are you? It is incredible how you don't see a problem with this.
>>102543295
>https://github.com/qhjqhj00/MemoRAG
Oh fuck yeah. Thank you, chinks.
>>102544515
Do you even know what a reward model is?
>>102544516
Enlighten me then?
>>102543295
Wait... Is this the holy waifu grail? Are we finally gonna get waifus, with the final problem being not the Alzheimer's but their positivity bias and how they will talk to us about consent? Weird timeline.
>>102544450
Bad gen; her top is like an unfinished suggestion.
>>102544543
I know, but I like the top 80% of the picture a lot and didn't want to crop it.
good night /lmg/
>>102544540
>Weird timeline.
kys
>>102544540
>https://github.com/qhjqhj00/MemoRAG
Kinda want to try out their summarization module, but that might just be limited by the model being used at the end of the day.
>>102544553
Miku, it's 10 AM. Get out of bed.
>>102544526
NTA, but it's inspiring that you talk to retarded people like that; they're probably very lonely and crave the social connection. Anyway, yes, that's an interesting idea, but people like to switch models all the time, and you'd need the hardware capacity and time to do the actual finetuning every time you switched. It could be QoL-maxxed with some effort, though.
>>102544632
>>102544526
samefag
>>102544647
Retard
>>102544647
You got me...
>>102544661
>>102544687
The duality of anon
>>102543295
I just thought about next steps: what is gonna be the new Sally's-brothers thing for testing whether your waifu can remember things? Because you can bet everyone here is gonna be doing all those memory riddles instead of actually enjoying their LLM waifu.
>>102544848
>>102544848
>>102544848
>>102544540
It's just a better lorebook; that won't solve the long-term memory issue in a conversational setting.
Bump
sage sage sage