/g/ - /lmg/ - Local Models General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

Thread archived.
You cannot reply anymore.

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous

/lmg/ - Local Models General 10/11/24(Fri)02:47:52 No.102772862

File: 2024-10-10_065250_seed594(...).png (2.96 MB, 960x2304)

2.96 MB PNG

/lmg/ - Local Models General Anonymous 10/11/24(Fri)02:47:52 No.102772862 Archived

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>102758839 & >>102743974

►News
>(10/10) Aria: 25.3B, 3.9B active, multimodal native MoE model with 64k context: https://hf.co/rhymes-ai/Aria
>(09/27) Emu3, next-token prediction multimodal models: https://hf.co/collections/BAAI/emu3-66f4e64f70850ff358a2e60f
>(09/25) Multimodal Llama 3.2 released: https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices
>(09/25) Molmo: Multimodal models based on OLMo, OLMoE, and Qwen-72B: https://molmo.allenai.org/blog
>(09/24) Llama-3.1-70B-instruct distilled to 51B: https://hf.co/nvidia/Llama-3_1-Nemotron-51B-Instruct

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-build-guides

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
Chatbot Arena: https://chat.lmsys.org/?leaderboard
Censorship: https://hf.co/spaces/DontPlanToEnd/UGI-Leaderboard
Censorbench: https://codeberg.org/jts2323/censorbench
Japanese: https://hf.co/datasets/lmg-anon/vntl-leaderboard
Programming: https://livecodebench.github.io/leaderboard.html

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/lmg-anon/mikupad
https://github.com/turboderp/exui
https://github.com/ggerganov/llama.cpp

Anonymous
10/11/24(Fri)02:50:01 No.102772882

Anonymous 10/11/24(Fri)02:50:01 No.102772882

File: __gumi_vocaloid_drawn_by_(...).jpg (221 KB, 1754x1240)

221 KB JPG

►Recent Highlights from the Previous Thread: >>102758839

--Paper: PLaMo-100B, a large-scale Japanese language model with competitive performance:
>102770605 >102770997
--Papers:
>102759380 >102759535 >102759675 >102759859 >102759989 >102766750 >102771160 >102771486
--SillyTavern and KoboldCPP comparison discussion:
>102761053 >102761131 >102761199 >102761233 >102761332 >102761383 >102761462
--Running multiple PSUs in tandem can cause issues:
>102758927 >102764354
--No practical reason not to make an LLM front end in RPGMaker MV:
>102764211 >102764358
--Java implementation of Llama 3.1 inference and ollama support for 3.2 vision:
>102761590 >102761983 >102762453
--Troubleshooting xtts2 installation and compatibility issues:
>102762949 >102763509 >102764426 >102764613 >102764826 >102764952 >102765223
--Pyramid Flow - open-source high-quality video generation:
>102760489
--5090 recommended for AI, but availability and price may be issues:
>102765809 >102765892 >102766269 >102765921 >102766133 >102766393 >102766431
--Running 3 3090s / 4090s in a case is possible but will resemble a mining setup:
>102766648 >102766737 >102766966 >102767074 >102767095 >102767731 >102772011
--Nvidia RTX 5000 series discussion, performance and bandwidth improvements:
>102759501 >102759516 >102759770 >102759587 >102759634 >102759790 >102759528 >102759540 >102759553 >102759891 >102769140 >102769159 >102769207 >102769375 >102769415 >102769546
--Mixing Pascal and Intel Arc A770 GPUs in llama.cpp/koboldcpp:
>102771826 >102771872 >102771892 >102771913 >102771922
--Discussion about an ad for uncensored GPT-40 fine-tuning and creating AI girlfriends:
>102760427 >102760451 >102760511 >102760677 >102760502
--AMD multi-GPU support issues with newer software versions:
>102758969
--Gumi (free space):
>102758927 >102759015 >102763583 >102763803 >102764770 >102767818

►Recent Highlight Posts from the Previous Thread: >>102758842

Why?: 9 reply limit >>102478518
Fix: https://rentry.org/lmg-recap-script

Anonymous
10/11/24(Fri)02:54:02 No.102772917

Anonymous 10/11/24(Fri)02:54:02 No.102772917

I can't sleep

Anonymous
10/11/24(Fri)03:29:39 No.102773259

Anonymous 10/11/24(Fri)03:29:39 No.102773259

Rocinante-12B-v2g-Q5_K_M from a couple threads ago is actually unslopped. Especially since I recently used mistral-small finetunes. Nemo is more dumb though. Not sure how to feel about this.

Anonymous
10/11/24(Fri)03:35:28 No.102773308

Anonymous 10/11/24(Fri)03:35:28 No.102773308

elon's biden squad https://x.com/disclosetv/status/1844580663627772175

Anonymous
10/11/24(Fri)03:38:45 No.102773338

Anonymous 10/11/24(Fri)03:38:45 No.102773338

>>102772917
Count some Migus? 1... 2... ...3

Anonymous
10/11/24(Fri)03:52:25 No.102773466

Anonymous 10/11/24(Fri)03:52:25 No.102773466

>>102773308
this is the future i wanted. i don't have to see my kids anymore and can stream diablo 4 instead. elon deserves his fame

Anonymous
10/11/24(Fri)04:03:39 No.102773537

Anonymous 10/11/24(Fri)04:03:39 No.102773537

>>102772466
I'm pretty sure exllamav2 does not support parallel processing. Does this mean that I can laugh at anyone running exl2 on their $5000 multi-gpu build for not only being a retard who spent so much money to run bad models, but also for sitting there running them in a slow speed?
I guess richfags really are the scum of this general lmao

Anonymous
10/11/24(Fri)04:47:06 No.102773834

Anonymous 10/11/24(Fri)04:47:06 No.102773834

File: 1000018759.jpg (539 KB, 2491x4035)

539 KB JPG

>>102760789
ok neat but spoonfeed me on how do i run it on termux

Anonymous
10/11/24(Fri)04:48:13 No.102773841

Anonymous 10/11/24(Fri)04:48:13 No.102773841

>>102773834
*how to

Anonymous
10/11/24(Fri)04:50:38 No.102773870

Anonymous 10/11/24(Fri)04:50:38 No.102773870

File: 1704109932374810.png (297 KB, 602x665)

297 KB PNG

>>102773834
>>102773841
you don't, the guy making it is a schizo and his meme sampler will never be available in other frontends btw

Anonymous
10/11/24(Fri)05:06:04 No.102773982

Anonymous 10/11/24(Fri)05:06:04 No.102773982

>>102773259
Rocinante 1.1 is my fav so I'm gonna test this one. Is it actually unslopped or just wishful thinking?

Anonymous
10/11/24(Fri)05:07:48 No.102773995

Anonymous 10/11/24(Fri)05:07:48 No.102773995

>>102773982
its all slop
mixtral? slop
mistral? slop
nemo? slop
miqu? slop

Anonymous
10/11/24(Fri)05:09:38 No.102774010

Anonymous 10/11/24(Fri)05:09:38 No.102774010

>>102773982
>>102773995
>Got a purred in the first test
LMAO, unslopped my ass.

Anonymous
10/11/24(Fri)05:18:47 No.102774075

Anonymous 10/11/24(Fri)05:18:47 No.102774075

>>102774010
sloppah

Anonymous
10/11/24(Fri)05:24:23 No.102774123

Anonymous 10/11/24(Fri)05:24:23 No.102774123

File: Screenshot_20241011_182243.png (405 KB, 1903x1274)

405 KB PNG

>>102773982
Judge for yourself
I did notice a
>strange and unfamiliar ... but not unpleasant
But I like the writing style.

Anonymous
10/11/24(Fri)05:26:36 No.102774137

Anonymous 10/11/24(Fri)05:26:36 No.102774137

>>102774123
Also funny I wrote schoolgirl in feudal japan.
But the model was smart enough that girls didnt go to school in the past.

Anonymous
10/11/24(Fri)05:31:29 No.102774176

Anonymous 10/11/24(Fri)05:31:29 No.102774176

>>102774123
>>102774137
Yes, it's definitely slightly better than regular rocinante but it is still mostly slop, a bit of a improvement tho.

Anonymous
10/11/24(Fri)05:33:57 No.102774189

Anonymous 10/11/24(Fri)05:33:57 No.102774189

I've been away for a few months. do we already have some models that can genuinely surprise me with anything creative, or is it still "I tell it to suck my cock, and it sucks my cock"?

Anonymous
10/11/24(Fri)05:34:03 No.102774190

Anonymous 10/11/24(Fri)05:34:03 No.102774190

>>102774176
Its just refreshing loading this up after using mistral-small finetunes.
Does make me appreciate mistral-small for obeying formats more though.
But there is some soul with the small models. Like calling the head of my dick a pretty jewel. lol

Anonymous
10/11/24(Fri)05:34:51 No.102774198

Anonymous 10/11/24(Fri)05:34:51 No.102774198

>>102774123
slop

Anonymous
10/11/24(Fri)05:36:13 No.102774205

Anonymous 10/11/24(Fri)05:36:13 No.102774205

>>102774189
prompt issue

Anonymous
10/11/24(Fri)05:38:14 No.102774215

Anonymous 10/11/24(Fri)05:38:14 No.102774215

>>102774205
yeah, I don't want to hold LLM's hand all the time.

Anonymous
10/11/24(Fri)05:38:17 No.102774216

Anonymous 10/11/24(Fri)05:38:17 No.102774216

>>102774190
Yeah seems more soulful.
What temperature are you using btw?

Anonymous
10/11/24(Fri)05:40:27 No.102774226

Anonymous 10/11/24(Fri)05:40:27 No.102774226

>>102774216
Just 0.7 and minp of 0.1.
I'm sure people will call me retarded, and I am, but sometimes I use dyn temp 0.7 - 1.2.

Anonymous
10/11/24(Fri)05:44:21 No.102774251

Anonymous 10/11/24(Fri)05:44:21 No.102774251

>>102774215
I dont think we are there yet without architectural changes or "smartness".
Isnt that kinda the reason why you cant let gemini or a model with big context write a short novel thats actually interesting?
There is something missing.
To be fair I use big models mostly for work, but it feels like the bigger the less they go off the path without being instructed.
The model would need to take a pause, look at the context and think "hmm, how can i make this interesting?". CoT does not work from my experience and only makes things slow.

Anonymous
10/11/24(Fri)05:57:09 No.102774326

Anonymous 10/11/24(Fri)05:57:09 No.102774326

File: Screenshot_20241011_185506.png (625 KB, 1885x1624)

625 KB PNG

>>102774123
Second pic and last one, i need to do at least some work right now instead of doing this.

>prompt: a businesswoman from murrika
>finishes off with: I'm going to sue whoever is responsible for this.
thats funny.

Anonymous
10/11/24(Fri)05:59:21 No.102774337

Anonymous 10/11/24(Fri)05:59:21 No.102774337

I'm pretty new to LLM's but been having fun the past few days.
Built a script that chunks and cleans telegram logs of a group chat (not english) for the past 4 years in 5-6 days chunks and pipes them into a local LLM to make summary of what was discussed which came out very fun

I'm using a very simple setup, my script is piping and calling simonw/llm which uses ollama gemma2:7b model, I'm using a 3070. I tried llama3.2 but had to use an uncensored gguf I found to get output and even then it wasn't as good as gemma2
I want to try doing the same but on a rented VM with beefier gpu, randomly looking at runpod.io but happy to get recommendations

Which llm model/rented gpu would you recommend to do the same thing but better? Also happy to modify the script to not use simonw/llm+ollama if it makes sense to

Anonymous
10/11/24(Fri)06:20:41 No.102774472

Anonymous 10/11/24(Fri)06:20:41 No.102774472

>>102773995
and largestral. i still like miqu, despite the shivers and slop it doesn't take 1500 tokens to get to the point

Anonymous
10/11/24(Fri)06:29:01 No.102774531

Anonymous 10/11/24(Fri)06:29:01 No.102774531

Read that NVLM 1.0 namely NVLM-D-72B is capable of rivaling GPT-4o.

Is that actually true? Can it be run locally?
I think I saw it on huggingface, should I wait for finetunes?

Anonymous
10/11/24(Fri)06:32:56 No.102774558

Anonymous 10/11/24(Fri)06:32:56 No.102774558

>>102774531
>sees model
>first response is to wait for tunes
ngmi base is always the best

Anonymous
10/11/24(Fri)06:34:00 No.102774567

Anonymous 10/11/24(Fri)06:34:00 No.102774567

>>102774558
Isn't base often censored out the wazoo?

Anonymous
10/11/24(Fri)06:38:08 No.102774600

Anonymous 10/11/24(Fri)06:38:08 No.102774600

>>102774567
not if you use it right and tunes in most cases just make them dumber and sloppier

Anonymous
10/11/24(Fri)06:38:41 No.102774602

Anonymous 10/11/24(Fri)06:38:41 No.102774602

>>102773537
Exllama2 already supports tensor parallelism

Anonymous
10/11/24(Fri)06:54:28 No.102774707

Anonymous 10/11/24(Fri)06:54:28 No.102774707

>>102774600
NTA, but my reason for not using finetunes is that I don't want to support nor give attention to the ERP finetune grifters. They've lost the plot and all they can do is gimmick or stupid horny models. They don't deserve to gain money (indirectly or not) from this shit.

Anonymous
10/11/24(Fri)07:27:21 No.102774954

Anonymous 10/11/24(Fri)07:27:21 No.102774954

How do you run models & surrounding code from random repos in a better or worse sandbox? VMs with GPU passthrough and cutting off internet access? Docker?

Anonymous
10/11/24(Fri)07:34:01 No.102774999

Anonymous 10/11/24(Fri)07:34:01 No.102774999

>>102774954
apparmor + opensnitch if you're a linux chad

Anonymous
10/11/24(Fri)07:40:33 No.102775053

Anonymous 10/11/24(Fri)07:40:33 No.102775053

Anyone with 128GB of main memory try to run Aria on CPU? It shouldn't be very FLOP intensive.

Anonymous
10/11/24(Fri)07:41:26 No.102775063

Anonymous 10/11/24(Fri)07:41:26 No.102775063

>>102774326
Certainly takes a specific level of intelligence to open the window in the described situation.

Anonymous
10/11/24(Fri)07:45:37 No.102775112

Anonymous 10/11/24(Fri)07:45:37 No.102775112

>>102775063
have you not seen car wash videos

Anonymous
10/11/24(Fri)07:55:07 No.102775198

Anonymous 10/11/24(Fri)07:55:07 No.102775198

>>102775053
How do you run it on CPU?

Anonymous
10/11/24(Fri)08:08:06 No.102775308

Anonymous 10/11/24(Fri)08:08:06 No.102775308

>>102774954
Easy to deadend a web service on Linux and proxy output only, eg https://rentry.org/IsolatedLinuxWebService

Anonymous
10/11/24(Fri)08:12:31 No.102775353

Anonymous 10/11/24(Fri)08:12:31 No.102775353

I wanted to use these to practice patient scenarios. ChatGPT and Claude.ai don't allow simulating dangerous things (like applying certain medication if it's not correct).
I've looked through the OP and the guides. Seems like I should give up if I'm not willing to invest in a new computer. Does anyone know a "non-local" thing I could use which would allow medical scenarios?

Anonymous
10/11/24(Fri)08:17:54 No.102775414

Anonymous 10/11/24(Fri)08:17:54 No.102775414

>>102775353
There are renting services that allow you to run these open weight models remotely.
Stuff like open router.
No idea how it works at all.

Anonymous
10/11/24(Fri)08:21:06 No.102775447

Anonymous 10/11/24(Fri)08:21:06 No.102775447

>>102775353
There's some small local models (or quants of big models) for medical stuff that you can fit in a 8gb card, those are as cheap as $100 you could even find some 11gb cards for a bit more.
If you plan to give the card a lot of use I don't see how that little money is that much of an investment.
https://huggingface.co/blog/leaderboard-medicalllm
https://github.com/AI-in-Health/MedLLMsPracticalGuide
Take into account that you can generate the medical data with a large model online and then tell a local model to use that data.
You can also "rent" a gpu for some shekels per hour.

Anonymous
10/11/24(Fri)08:23:01 No.102775466

Anonymous 10/11/24(Fri)08:23:01 No.102775466

>>102775435
Nigga, use flux Q4.

Anonymous
10/11/24(Fri)08:31:22 No.102775557

Anonymous 10/11/24(Fri)08:31:22 No.102775557

>>102775414
>>102775447
Wow, that's really comprehensive, thanks.

I thought that utilizing it for this purpose was an original thought, but there are already scientific papers published about it and people are developing it actively.

Anonymous
10/11/24(Fri)08:40:20 No.102775637

Anonymous 10/11/24(Fri)08:40:20 No.102775637

>>102775198
Doesn't VLLM have a AVX512 mode?

Anonymous
10/11/24(Fri)08:41:31 No.102775649

Anonymous 10/11/24(Fri)08:41:31 No.102775649

svelk

Anonymous
10/11/24(Fri)08:59:05 No.102775789

Anonymous 10/11/24(Fri)08:59:05 No.102775789

>>102775756
posting questionable drawings of children won't make your arguments more credible

Anonymous
10/11/24(Fri)09:00:19 No.102775804

Anonymous 10/11/24(Fri)09:00:19 No.102775804

>>102775789
That's the joke, we're different kinds of retards.

Anonymous
10/11/24(Fri)09:24:01 No.102775972

Anonymous 10/11/24(Fri)09:24:01 No.102775972

>>102775789
>questionable
lol get a look at this faggot.
>won't make your arguments more credible
it makes them more based though

Anonymous
10/11/24(Fri)09:32:42 No.102776030

Anonymous 10/11/24(Fri)09:32:42 No.102776030

>>102775972
>pedoshitter pretends to have standards

Anonymous
10/11/24(Fri)09:39:10 No.102776087

Anonymous 10/11/24(Fri)09:39:10 No.102776087

>>102775756
lmg needs more of this.

Anonymous
10/11/24(Fri)09:50:19 No.102776192

Anonymous 10/11/24(Fri)09:50:19 No.102776192

>responding to shitposts
bruh moment

Anonymous
10/11/24(Fri)09:57:38 No.102776271

Anonymous 10/11/24(Fri)09:57:38 No.102776271

Alright, got myself one of the more cheaper used p40. High IQ move since I bought once they gotten more expensive.
I dont have high expectations though and am ready to be disappointed because the 70b range models are probably all assistant sloped.
How bad is mistral large at lower quant? I have around 36gb of VRAM.
And is it even possible to run nemo or mistral-small finetunes at high context instead?
Worst case I can now also shit on smaller models and then cry myself to sleep.

Anonymous
10/11/24(Fri)10:02:15 No.102776308

Anonymous 10/11/24(Fri)10:02:15 No.102776308

>>102776271
Mistral Small with a decent chunk of context should fit cozily into your VRAM. I'd try that before waiting 5 minutes for a 70B reply.

Anonymous
10/11/24(Fri)10:10:12 No.102776395

Anonymous 10/11/24(Fri)10:10:12 No.102776395

File: 1698183099139893.png (139 KB, 864x462)

139 KB PNG

Now that's some next level user prompting.

Anonymous
10/11/24(Fri)10:15:27 No.102776431

Anonymous 10/11/24(Fri)10:15:27 No.102776431

>>102772862
>Aria: 25.3B, 3.9B active
Mistral Small is already so fucking fast, not to mention nemo, which makes me wonder what the purpose for this is, beside VRAMlet support.
>multimodal with 64k context
Thaaaat meanwhile is a bit more interesting.

Anonymous
10/11/24(Fri)10:20:53 No.102776482

Anonymous 10/11/24(Fri)10:20:53 No.102776482

>>102776395
what model gave you this absolute slopkino

Anonymous
10/11/24(Fri)10:21:03 No.102776483

Anonymous 10/11/24(Fri)10:21:03 No.102776483

>>102776431
>Mistral Small is already so fucking fast
Could be good for Nemo users.

Anonymous
10/11/24(Fri)10:23:11 No.102776510

Anonymous 10/11/24(Fri)10:23:11 No.102776510

>>102776482
That's unironically UnsloppedNemo from the earlier shill.

Anonymous
10/11/24(Fri)10:25:35 No.102776530

Anonymous 10/11/24(Fri)10:25:35 No.102776530

>>102776483
That's what I meant with vramlets. Nemo is already pretty good for them, but this here of course has potential to be even better (hopefully lmao). At the least it's something interesting for once.

Anonymous
10/11/24(Fri)10:38:51 No.102776659

Anonymous 10/11/24(Fri)10:38:51 No.102776659

>>102776431
https://www.rhymes.ai/blog-details/aria-first-open-multimodal-native-moe-model
Some interesting examples.
I just cant trust llms for anything serious though.
The ability to show it pictures and say "make a python table of this" is cool however.

Anonymous
10/11/24(Fri)10:44:55 No.102776726

Anonymous 10/11/24(Fri)10:44:55 No.102776726

>>102776659
Currently I treat AI as nothing but an early tool, a very enjoyable and fun to fuck around with tool. I want to say "toy", but it's too useful for that already, even if it needs continued polishing. A fuck ton of near endless potential, but nothing I'd trust a 100% just yet, not for anything serious at least.

Anonymous
10/11/24(Fri)10:52:04 No.102776796

Anonymous 10/11/24(Fri)10:52:04 No.102776796

File: SelfSatisfiedMiku.png (1.17 MB, 880x1168)

1.17 MB PNG

>>102776726
Yes, it can eliminate or streamline intellectual "manual labour" in the same way that industrial automation eliminates physical manual labour. It is not sophisticated enough to be allowed to operate in a closed loop yet. It still needs review and handlers, but that doesn't mean it can't eliminate large amounts of work with supervision.

Anonymous
10/11/24(Fri)10:55:06 No.102776832

Anonymous 10/11/24(Fri)10:55:06 No.102776832

>KoboldCpp v1.76 adds the Anti-Slop Sampler (Phrase Banning) and RP Character Creator scenario
>NEW: Added Anti-Slop Sampling
The term "slop" comes from "sloppy seconds", a term used by incels to rationalize why they are lonely and do not have a girlfriend ("it's my choice! women are whores! i'm not taking sloppy seconds!").
By openly using this term, koboldcpp confirms itself as a software designed for incels. Note: the two quoted phrases above were written by the author of the software, an incel.

Anonymous
10/11/24(Fri)10:57:53 No.102776864

Anonymous 10/11/24(Fri)10:57:53 No.102776864

>>102776726
Ironic that the biggest strenght currently is creative writing. But thats gimped the hardest.
If you show this stuff (like 3.5) to normies they are impressed...until "it lies" to them.
Like asking Sonnet for good cafes in hawaii. Stuff we wouldnt come up with.
There are big key problems that properly require architectural change.

That being said if you showed the world from 2005 or something what we have now they would call it agi no doubt.
Just watch the robo movies from the past. They talk funny and are not able to make "creativity and art". Thats the first thing that is being solved already.
If I was younger with more time I'd have such a blast. Music, art. You can make anything easily and for free now.

Anonymous
10/11/24(Fri)11:01:29 No.102776899

Anonymous 10/11/24(Fri)11:01:29 No.102776899

>>102776832
bruh, slop is just waste nutrients fed to pigs, a usage hundreds of years old
occams's razor says your explanation is retarded

Anonymous
10/11/24(Fri)11:03:29 No.102776925

Anonymous 10/11/24(Fri)11:03:29 No.102776925

>>102776659
You can make some scribbles in a piece of paper and tell the model to translate it to a web page or something.
I'm waiting when multimodal can also generate images, which shouldn't be too far if they actually intend to do that.

Anonymous
10/11/24(Fri)11:03:44 No.102776930

Anonymous 10/11/24(Fri)11:03:44 No.102776930

>>102776832
>>102776899
Buy an ad.

Anonymous
10/11/24(Fri)11:03:55 No.102776931

Anonymous 10/11/24(Fri)11:03:55 No.102776931

File: 1725828341389808.png (26 KB, 755x1255)

26 KB PNG

>>102776832
Nah ur just a faggot seeking for attention i.e. "baiting".

Anonymous
10/11/24(Fri)11:04:54 No.102776940

Anonymous 10/11/24(Fri)11:04:54 No.102776940

>>102776832
>inb4 koboldcpp is a serious tool for serious businesses instead of a cooming aid

Anonymous
10/11/24(Fri)11:05:16 No.102776945

Anonymous 10/11/24(Fri)11:05:16 No.102776945

>>102776925
same with audio. they all cuck out.
"multi modal" is 99% always picture in and text in/out.

Anonymous
10/11/24(Fri)11:06:05 No.102776955

Anonymous 10/11/24(Fri)11:06:05 No.102776955

File: 1723571195164745.jpg (312 KB, 1408x1147)

312 KB JPG

Anonymous
10/11/24(Fri)11:07:01 No.102776971

Anonymous 10/11/24(Fri)11:07:01 No.102776971

>>102776832
How do I even make this shit run?
The chink only did the installation steps for linugg

Anonymous
10/11/24(Fri)11:08:29 No.102776990

Anonymous 10/11/24(Fri)11:08:29 No.102776990

>>102776955
>sloppy slop, I'm not taking chad's leftovers! Install koboldcpp and become an mgtow.

Anonymous
10/11/24(Fri)11:08:42 No.102776992

Anonymous 10/11/24(Fri)11:08:42 No.102776992

File: file.jpg (77 KB, 518x666)

77 KB JPG

>>102776796
As you said, AI is nothing else but a tool to further remove manual labor and make things "easier" for us. I see it as nothing else than a bunch of pre-programmed robot arms that put things together or move objects from one place or another.
>>102776864
>But thats gimped the hardest.
One would think it'd be imagery, seeing the most damage can be done with that, text far less so.
"Oh no, this anon wrote a poem about how much he hates [skin here], how demonic!"
Music and video content being locked down to paid service makes sense, it's the easiest and most money you can make with, while imagery has been out in the opening since the beginning.
>that properly require architectural change.
Not to mention CONSTANT updating, preferably in real time like a search engines database. I imagine that things like this further complicate llms and how they function.
>they would call it agi no doubt.
Hell, I was already PLENTY of impressed by 4.0 when I first saw it in action and played around with it (uncensored) myself, was weird as hell and a bit spooky.
Not spooky because "muh terminator" scare, but because it made me feel something, good or bad. I was engaged as if it's a good movie or book, but it's being written in real time according to things I wrote.

Anonymous
10/11/24(Fri)11:08:46 No.102776993

Anonymous 10/11/24(Fri)11:08:46 No.102776993

File: 1697139613268444.png (23 KB, 404x401)

23 KB PNG

>>102776971
retard

Anonymous
10/11/24(Fri)11:08:50 No.102776997

Anonymous 10/11/24(Fri)11:08:50 No.102776997

>new multimodal model
>ask the gym receptionist if their gym is omni or just another vlm
>she doesnt understand
>pull out illustrated diagram explaining what is omni and what is vlm
>she laughs and says “it’s a good multimodal model sir”
>try the model
>its a vlm

Anonymous
10/11/24(Fri)11:09:34 No.102777003

Anonymous 10/11/24(Fri)11:09:34 No.102777003

>>102776990
Come to think of it, koboldcpp may be developed by an r9k user.

Anonymous
10/11/24(Fri)11:09:50 No.102777004

Anonymous 10/11/24(Fri)11:09:50 No.102777004

>>102776997
>gym receptionist
FUCK I forgot to change the copypasta enough
pretend I wrote something like blogpost or PR team or something...

Anonymous
10/11/24(Fri)11:10:07 No.102777006

Anonymous 10/11/24(Fri)11:10:07 No.102777006

>>102776864
The early 20's me would have loved this shit. it is such a shame.
I would have become an AI wizard.

Anonymous
10/11/24(Fri)11:19:26 No.102777115

Anonymous 10/11/24(Fri)11:19:26 No.102777115

>>102776997
And that's what q4 context looks like folks.

Anonymous
10/11/24(Fri)11:20:12 No.102777128

Anonymous 10/11/24(Fri)11:20:12 No.102777128

>>102777115
20 trillion iq2 where it's at

Anonymous
10/11/24(Fri)11:25:00 No.102777206

Anonymous 10/11/24(Fri)11:25:00 No.102777206

>>102777004
It's better this way

Anonymous
10/11/24(Fri)11:31:51 No.102777310

Anonymous 10/11/24(Fri)11:31:51 No.102777310

>>102776930
lol I hate kobold, I just hate gay retards more

Anonymous
10/11/24(Fri)11:33:56 No.102777341

Anonymous 10/11/24(Fri)11:33:56 No.102777341

Alpindale I know you lurk here. Is Aphrodite-engine getting the last HTTP server optimization from vLLM or not? Aphrodite is better since it supports more samplers and quants, but I won't switch if the throughput is worse.

Anonymous
10/11/24(Fri)11:39:40 No.102777411

Anonymous 10/11/24(Fri)11:39:40 No.102777411

what will anti-slop's corporate name be when its added to st?

Anonymous
10/11/24(Fri)11:40:35 No.102777424

Anonymous 10/11/24(Fri)11:40:35 No.102777424

>>102777411
anti-transphobia

Anonymous
10/11/24(Fri)11:42:08 No.102777435

Anonymous 10/11/24(Fri)11:42:08 No.102777435

>>102777411
Diversity+

Anonymous
10/11/24(Fri)11:48:13 No.102777495

Anonymous 10/11/24(Fri)11:48:13 No.102777495

>>102777411
>st
"SlutTrainer 2024, now with SmutLube Technology!"

Anonymous
10/11/24(Fri)11:48:37 No.102777497

Anonymous 10/11/24(Fri)11:48:37 No.102777497

>>102777411
Inclusiveness feature. A mandatory they/them system prompt for adressing {{user}}.

Anonymous
10/11/24(Fri)11:49:02 No.102777502

Anonymous 10/11/24(Fri)11:49:02 No.102777502

>>102777411
Pro-MGTOW

Anonymous
10/11/24(Fri)11:49:16 No.102777505

Anonymous 10/11/24(Fri)11:49:16 No.102777505

What's the use case for that in servicetesnor?

Anonymous
10/11/24(Fri)11:49:20 No.102777506

Anonymous 10/11/24(Fri)11:49:20 No.102777506

I'm getting sick of windows. Should I change to a rolling distro or a LTS one?

Anonymous
10/11/24(Fri)11:52:53 No.102777561

Anonymous 10/11/24(Fri)11:52:53 No.102777561

File: MikuIntoTheVoid.png (2.51 MB, 1280x1640)

2.51 MB PNG

>>102777506
I've had good luck with Debian testing on my dedicated AI box. Very little breakage, and reasonably fresh packages.
This is with NVidia cards on an AMD proc

Anonymous
10/11/24(Fri)11:53:28 No.102777571

Anonymous 10/11/24(Fri)11:53:28 No.102777571

>>102777506
Yes. Posted from my customized win10 enterprise system.

Anonymous
10/11/24(Fri)11:54:40 No.102777581

Anonymous 10/11/24(Fri)11:54:40 No.102777581

File: 724d769s-960.jpg (190 KB, 960x720)

190 KB JPG

>>102777506
>>102777561

Anonymous
10/11/24(Fri)11:55:21 No.102777588

Anonymous 10/11/24(Fri)11:55:21 No.102777588

>>102776395
Giga slop

Anonymous
10/11/24(Fri)11:55:32 No.102777592

Anonymous 10/11/24(Fri)11:55:32 No.102777592

>>102777506
My personal preference is something based on Arch since ML benefits from recent packages and I know how to fix any potential problems.

Anonymous
10/11/24(Fri)11:56:12 No.102777602

Anonymous 10/11/24(Fri)11:56:12 No.102777602

>>102776955
Looking at that 'Roleplay Character Creator' reinforces my opinion that Kobold is stuck in year 2021 and will never get out of that period.

Anonymous
10/11/24(Fri)11:59:46 No.102777648

Anonymous 10/11/24(Fri)11:59:46 No.102777648

>>102776955
Wait so the antislop sampler can work just by using the token ban field on frontends, so I don't have to update the frontend?

Anonymous
10/11/24(Fri)12:00:43 No.102777661

Anonymous 10/11/24(Fri)12:00:43 No.102777661

>>102777561
>>102777592
Thanks. I think I will go with an Arch based distro since I'm familiar with it from like a decade ago.

Anonymous
10/11/24(Fri)12:01:38 No.102777668

Anonymous 10/11/24(Fri)12:01:38 No.102777668

>>102777506
you generally want the most recent drivers if you have an nvidia card, but there are only a few rolling release distros that matter
arch is what I use but I wouldn't recommend it to someone who has never used linux, tumbleweed is the other one I tried but I had way too many issues with it and I never got the nvidia drivers to work on it

Anonymous
10/11/24(Fri)12:02:10 No.102777672

Anonymous 10/11/24(Fri)12:02:10 No.102777672

>>102777648
"Banned Tokens/Strings" in ST works with it yeah

Anonymous
10/11/24(Fri)12:02:22 No.102777676

Anonymous 10/11/24(Fri)12:02:22 No.102777676

>>102777506
You are getting sick of being a man, sad. Many such cases.

Anonymous
10/11/24(Fri)12:03:25 No.102777694

Anonymous 10/11/24(Fri)12:03:25 No.102777694

>>102777672
Nice.

Anonymous
10/11/24(Fri)12:05:42 No.102777737

Anonymous 10/11/24(Fri)12:05:42 No.102777737

>>102777661
make your root partition btrfs if you're going with arch, and install yabsnap and grub-btrfs so that you don't have to worry about updates breaking shit
nvidia drivers are especially prone to breaking after updates

Anonymous
10/11/24(Fri)12:05:51 No.102777738

Anonymous 10/11/24(Fri)12:05:51 No.102777738

>>102775466
Q8 on my A4000 runs just fine, takes around 50 seconds for gens. No need for a Q4 quant lol.

Anonymous
10/11/24(Fri)12:07:29 No.102777759

Anonymous 10/11/24(Fri)12:07:29 No.102777759

>>102777738
>50 seconds for generation
the shit some RETARDED fags suffer through lmao

Anonymous
10/11/24(Fri)12:13:37 No.102777846

Anonymous 10/11/24(Fri)12:13:37 No.102777846

>>102777759
Trve

Anonymous
10/11/24(Fri)12:14:02 No.102777854

Anonymous 10/11/24(Fri)12:14:02 No.102777854

>>102777738
lmao

Anonymous
10/11/24(Fri)12:14:10 No.102777857

Anonymous 10/11/24(Fri)12:14:10 No.102777857

New update on the >>102746811
>>102746835
>>102746841
saga

The ebay chigger sold me the wrong CPUs even though he was reputable? and the text showed 9334s.

>2x EPYC 9124 16-Core I saw on the dashboard lol.

I am impressed as the audacity but I will hopefully deal with a white man next time, or at least confirm serial numbers.

Shit is getting returned like its hot but at least I feel better now

Anonymous
10/11/24(Fri)12:15:08 No.102777872

Anonymous 10/11/24(Fri)12:15:08 No.102777872

>>102777737
Thanks for the tip. That should save me from some headaches.

Anonymous
10/11/24(Fri)12:16:18 No.102777887

Anonymous 10/11/24(Fri)12:16:18 No.102777887

>>102777738
>A4000
Why do people even buy mid-tier workstation shit?

Anonymous
10/11/24(Fri)12:19:02 No.102777922

Anonymous 10/11/24(Fri)12:19:02 No.102777922

File: 1714027816159131.jpg (71 KB, 458x584)

71 KB JPG

>now I can generate text with koboldcpp without that sloppy slop
>never taking chad's sloppy seconds

Anonymous
10/11/24(Fri)12:21:59 No.102777963

Anonymous 10/11/24(Fri)12:21:59 No.102777963

>>102777857
>impressed as the audacity
I wonder how well that works. I assume at least half of the buyers won't even check but still how do you avoid being flagged and banned?

Anonymous
10/11/24(Fri)12:22:26 No.102777970

Anonymous 10/11/24(Fri)12:22:26 No.102777970

Did anyone actually tested Lamafile with a 7950x3d or Threadripper vs a 4090 with a regular gguf?
Maybe cpumaxxing is back in the menu after all?

Anonymous
10/11/24(Fri)12:23:24 No.102777984

Anonymous 10/11/24(Fri)12:23:24 No.102777984

>>102777970
>tranny software
no thanks

Anonymous
10/11/24(Fri)12:24:12 No.102777998

Anonymous 10/11/24(Fri)12:24:12 No.102777998

>>102777984
I'm assuming you don't use SillyTavern.

Anonymous
10/11/24(Fri)12:24:38 No.102778006

Anonymous 10/11/24(Fri)12:24:38 No.102778006

>>102777970
>Lamafile
lol no, that shit is the worst, most poorly conceived meme out there

Anonymous
10/11/24(Fri)12:24:41 No.102778009

Anonymous 10/11/24(Fri)12:24:41 No.102778009

>>102777970
>X3D over normal
What is the application here exactly, besides you having a gaymer PC with a X3D+4090?

Anonymous
10/11/24(Fri)12:25:23 No.102778020

Anonymous 10/11/24(Fri)12:25:23 No.102778020

File: 1728360368103789.png (1.03 MB, 1280x720)

1.03 MB PNG

>antislop sampler integrated
It's over...

Anonymous
10/11/24(Fri)12:25:24 No.102778021

Anonymous 10/11/24(Fri)12:25:24 No.102778021

>>102777970
There's probably some figures in the issues and discussions in their repo, I'd start by looking there.

Anonymous
10/11/24(Fri)12:26:10 No.102778029

Anonymous 10/11/24(Fri)12:26:10 No.102778029

>>102778020
why is she sad

Anonymous
10/11/24(Fri)12:26:21 No.102778033

Anonymous 10/11/24(Fri)12:26:21 No.102778033

>>102778020
Integrated where? In ST?

Anonymous
10/11/24(Fri)12:26:31 No.102778036

Anonymous 10/11/24(Fri)12:26:31 No.102778036

>>102777963
>I assume at least half of the buyers won't even check
How many buyers would spend $3k+ on a mb/cpu combo and not do even a simple sanity check of the number of cores?
Oh god, I hope that's not the state of humanity these days.

Anonymous
10/11/24(Fri)12:27:28 No.102778049

Anonymous 10/11/24(Fri)12:27:28 No.102778049

>>102778020
Not in the OG llama dot cpp

Anonymous
10/11/24(Fri)12:27:30 No.102778050

Anonymous 10/11/24(Fri)12:27:30 No.102778050

>>102778036
Over half. They will ask the nerd friend to check the specs on site and be happy with it.

Anonymous
10/11/24(Fri)12:28:18 No.102778070

Anonymous 10/11/24(Fri)12:28:18 No.102778070

>>102778029
When you whisper to her now she feels nothing.

Anonymous
10/11/24(Fri)12:31:22 No.102778108

Anonymous 10/11/24(Fri)12:31:22 No.102778108

>>102778006
And your source is?
>>102777984
Most of AI shit is tranny software.
>>102778021
Can't find anything.

Anonymous
10/11/24(Fri)12:39:42 No.102778220

Anonymous 10/11/24(Fri)12:39:42 No.102778220

>>102778108
>Most of AI shit is tranny software.
No, faggots just hang on to any popular thing because they're faggots and want eyeballs
The work that advances the field or is actually hard is done by people that don't bother with self promotion, so you'll rarely hear about them.
That's why most bullshit projects vote-brigaded onto orange reddit are faggots with some web wrapper or fork of something actually useful

Anonymous
10/11/24(Fri)12:39:54 No.102778225

Anonymous 10/11/24(Fri)12:39:54 No.102778225

>>102777602
What's wrong with it?
The fact that it doesn't hack the RP prompt up into 30 different unnecessary pieces? No 8000 token character description?

Anonymous
10/11/24(Fri)12:41:38 No.102778245

Anonymous 10/11/24(Fri)12:41:38 No.102778245

https://huggingface.co/arcee-ai/SuperNova-Medius

arcee cooked again

Anonymous
10/11/24(Fri)12:41:59 No.102778251

Anonymous 10/11/24(Fri)12:41:59 No.102778251

File: 49262.png (263 KB, 460x460)

263 KB PNG

>>102778108
>Most of AI shit is tranny software.
It is not true. Sometimes they get kicked out but they come back eventually.

Anonymous
10/11/24(Fri)12:42:39 No.102778261

Anonymous 10/11/24(Fri)12:42:39 No.102778261

you keep schizo posting the same image of the same tranny over and over like it matters
you're akin to a beta incel that spams wojak imagery

Anonymous
10/11/24(Fri)12:43:13 No.102778269

Anonymous 10/11/24(Fri)12:43:13 No.102778269

Hello guys. I am Sao. My models are the best. You can grab them at: https://huggingface.co/Sao10K

Thanks.

Anonymous
10/11/24(Fri)12:43:59 No.102778283

Anonymous 10/11/24(Fri)12:43:59 No.102778283

>>102777602
kobolds been able to read cards for a while, i assume they wanted a way to create them too. not really a bad thing

Anonymous
10/11/24(Fri)12:44:22 No.102778289

Anonymous 10/11/24(Fri)12:44:22 No.102778289

>>102778261
>instant reply
>instant seething

Anonymous
10/11/24(Fri)12:45:07 No.102778300

Anonymous 10/11/24(Fri)12:45:07 No.102778300

>>102778261
trannies are incels too, bigot

Anonymous
10/11/24(Fri)12:48:14 No.102778342

Anonymous 10/11/24(Fri)12:48:14 No.102778342

File: 1679180429229.png (26 KB, 560x258)

26 KB PNG

>>102778261

Anonymous
10/11/24(Fri)12:50:12 No.102778369

Anonymous 10/11/24(Fri)12:50:12 No.102778369

>>102778261
Oh no anon is mean to your girlfriend again? Poor thing...

Anonymous
10/11/24(Fri)12:54:32 No.102778432

Anonymous 10/11/24(Fri)12:54:32 No.102778432

>>102778251
She is a bit cute ngl

Anonymous
10/11/24(Fri)12:56:55 No.102778461

Anonymous 10/11/24(Fri)12:56:55 No.102778461

>>102778245
>14B
meh
Though interesting as an experiment that somehow converts a model to a different vocab so that it's compatible with distillation of the different model's vocab. Didn't know that was possible.

Anonymous
10/11/24(Fri)13:08:57 No.102778632

Anonymous 10/11/24(Fri)13:08:57 No.102778632

>>102778432
it*

Anonymous
10/11/24(Fri)13:21:24 No.102778784

Anonymous 10/11/24(Fri)13:21:24 No.102778784

>>102778342
all right, sauce me up

Anonymous
10/11/24(Fri)13:51:03 No.102779187

Anonymous 10/11/24(Fri)13:51:03 No.102779187

File: 1718899052514532.jpg (15 KB, 373x148)

15 KB JPG

st's already fucking with the windows for no reason. they removed the expand/close button for author notes, but you can still click them as if they were there. happens when you expand a/n too. retarded

Anonymous
10/11/24(Fri)13:52:58 No.102779216

Anonymous 10/11/24(Fri)13:52:58 No.102779216

stop using snaketesnor

Anonymous
10/11/24(Fri)13:55:08 No.102779243

Anonymous 10/11/24(Fri)13:55:08 No.102779243

File: file.png (62 KB, 783x391)

62 KB PNG

>>102779187

Anonymous
10/11/24(Fri)13:55:08 No.102779244

Anonymous 10/11/24(Fri)13:55:08 No.102779244

I know it has been said already, but I must reiterate that ServiceTensor sounds like a scam using random ai jargon

Anonymous
10/11/24(Fri)13:58:39 No.102779281

Anonymous 10/11/24(Fri)13:58:39 No.102779281

>>102779243
the whole point of a/n is i can put shit there and select the depth. it already has world info triggering. i keep several thing in a/n that its perfect for where lorebooks would be a pain to edit. right now i'm doing a adventure thing and have items, party members, a summary of whats happened. i wouldn't put that shit in a lorebook and edit it constantly

Anonymous
10/11/24(Fri)13:59:27 No.102779292

Anonymous 10/11/24(Fri)13:59:27 No.102779292

File: file.jpg (56 KB, 608x464)

56 KB JPG

>>102779216
>stop using snake oil
No.

Anonymous
10/11/24(Fri)14:13:28 No.102779448

Anonymous 10/11/24(Fri)14:13:28 No.102779448

>>102779398
Whoops meant to post my review of Aria in this thread because it's more active, but it's relevant for both. TLDR it sucks, Molmo still king.

Anonymous
10/11/24(Fri)14:14:24 No.102779461

Anonymous 10/11/24(Fri)14:14:24 No.102779461

How are things on the tts front? Anything newer than fish 1.4?

Anonymous
10/11/24(Fri)14:20:01 No.102779535

Anonymous 10/11/24(Fri)14:20:01 No.102779535

>>102779398
>>102779448
Shame.
Thank you for the report anon.

Anonymous
10/11/24(Fri)14:29:32 No.102779682

Anonymous 10/11/24(Fri)14:29:32 No.102779682

>>102779187
I don't see anything wrong with expand/collapse (the ^ thing) itself, but I suppose touch screen users may "fatfinger" the invisible close.
"Close" button, or perhaps colloquially known as "x", is missing (and is clickable as you said) in top right corner.

Anonymous
10/11/24(Fri)14:47:40 No.102779938

Anonymous 10/11/24(Fri)14:47:40 No.102779938

>>102779448
Have you tried Ovis1.6-Gemma2-9B?

Anonymous
10/11/24(Fri)14:59:08 No.102780060

Anonymous 10/11/24(Fri)14:59:08 No.102780060

>>102779938
No, first I'm even hearing of it. I can try it at some point when I have time.

Anonymous
10/11/24(Fri)15:12:20 No.102780213

Anonymous 10/11/24(Fri)15:12:20 No.102780213

File: 1728673931919.jpg (313 KB, 1080x1456)

313 KB JPG

>Fireworks Lora Fine-tuning
>$0.5/M tokens for models up to 16B
>10000 data points, 1024 tokens, 1 epoch would cost $5
... What? I can fine-tune Nemo with half that price using runpod. What are the use cases for this ripoff?

Anonymous
10/11/24(Fri)15:18:51 No.102780305

Anonymous 10/11/24(Fri)15:18:51 No.102780305

Hi all, Drummer here...

Here's a zippy Behemoth 123B v1 link: https://wellness-practical-right-reproductive.trycloudflare.com/

Metharme is recommended but Mistral works too.

It'd be great if I could gather all your thoughts on it. Enjoy!

(Also surprised by all the positive feedback for UnslopNemo v3!)

Anonymous
10/11/24(Fri)15:30:11 No.102780446

Anonymous 10/11/24(Fri)15:30:11 No.102780446

>>102780305
Don't visit that link, it makes mustard gas!

Anonymous
10/11/24(Fri)15:30:39 No.102780453

Anonymous 10/11/24(Fri)15:30:39 No.102780453

File: s-l1200.jpg (240 KB, 1200x1200)

240 KB JPG

>>102777857
I had the same symptoms once when I bought an EPYC that was facing the wrong side in its enclosure. I haven't noticed at first because EPYCs are normally rotated 180 degrees like picrel. I blame AMD for that shit.

Anonymous
10/11/24(Fri)15:35:28 No.102780514

Anonymous 10/11/24(Fri)15:35:28 No.102780514

>>102780213
Hoping that people won't be tech savvy enough to realize they're getting ripped off, unironically
That, Together, Replicate, are all in a similar boat where they just rip people off without offering anything of value in return. DeepInfra is the only one kinda worth it for inference, and there's basically nothing out there for finetunung

Anonymous
10/11/24(Fri)15:36:01 No.102780522

Anonymous 10/11/24(Fri)15:36:01 No.102780522

>>102780305
Mistral Small feels better somehow

Anonymous
10/11/24(Fri)15:36:01 No.102780523

Anonymous 10/11/24(Fri)15:36:01 No.102780523

>>102773834
it's not done yet

Anonymous
10/11/24(Fri)15:37:54 No.102780545

Anonymous 10/11/24(Fri)15:37:54 No.102780545

>>102780305
still prefer mythomax

Anonymous
10/11/24(Fri)15:39:34 No.102780566

Anonymous 10/11/24(Fri)15:39:34 No.102780566

st more stupid troon

Anonymous
10/11/24(Fri)15:47:27 No.102780645

Anonymous 10/11/24(Fri)15:47:27 No.102780645

>>102777887
NTA but because it's single slot. Makes it easy to fit into a case with other cards

Anonymous
10/11/24(Fri)15:50:19 No.102780674

Anonymous 10/11/24(Fri)15:50:19 No.102780674

>SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
>can achieve over a 1.3x-1.6x speedup while preserving the original distribution of the generated text
https://arxiv.org/abs/2410.06916

Anonymous
10/11/24(Fri)15:54:33 No.102780721

Anonymous 10/11/24(Fri)15:54:33 No.102780721

https://app.primeintellect.ai/intelligence
neat

Anonymous
10/11/24(Fri)16:00:18 No.102780799

Anonymous 10/11/24(Fri)16:00:18 No.102780799

>>102780721
Chat is this real?
What's their dataset?

Anonymous
10/11/24(Fri)16:02:24 No.102780826

Anonymous 10/11/24(Fri)16:02:24 No.102780826

File: 1707855701807539.png (79 KB, 867x575)

79 KB PNG

>>102780799
Some filtered slop.
https://huggingface.co/collections/PrimeIntellect/intellect-1-dataset-6704f3d3a9dee8678da3d407

Anonymous
10/11/24(Fri)16:03:19 No.102780841

Anonymous 10/11/24(Fri)16:03:19 No.102780841

File: Untitled.png (3.84 MB, 1080x7491)

3.84 MB PNG

>>102780799
>Model specs: • 10B parameters, 6T+ tokens dataset • Llama Architecture and tokenizer • Dataset mix: Fineweb-edu, DLCM, Stack v2, OpenWebMath
https://x.com/PrimeIntellect/status/1844814836183777458
https://www.primeintellect.ai/blog/intellect-1

Anonymous
10/11/24(Fri)16:03:24 No.102780843

Anonymous 10/11/24(Fri)16:03:24 No.102780843

>>102780826
Damn. Well this could serve as a cool proof of concept I guess.

Anonymous
10/11/24(Fri)16:05:24 No.102780872

Anonymous 10/11/24(Fri)16:05:24 No.102780872

>>102780826
you have no idea what those datasets even are lol

Anonymous
10/11/24(Fri)16:06:55 No.102780893

Anonymous 10/11/24(Fri)16:06:55 No.102780893

>>102780721
Looks like our very own John Smith has jumped in.

Anonymous
10/11/24(Fri)16:08:10 No.102780908

Anonymous 10/11/24(Fri)16:08:10 No.102780908

File: 1719404097148110.png (2 KB, 332x57)

2 KB PNG

>>102780872
>uhm ackschully!!
Fuck off, you don't have to be super smart to realize that all opensource datasets are filtered garbage.
also
>trained on top of already pozzed llama 3.x
NGMI
G
M
I

Anonymous
10/11/24(Fri)16:08:49 No.102780912

Anonymous 10/11/24(Fri)16:08:49 No.102780912

>responds with reddit style seethe
lol fucking zoomers man they're so typical

Anonymous
10/11/24(Fri)16:09:29 No.102780921

Anonymous 10/11/24(Fri)16:09:29 No.102780921

File: Screenshot_20241011-140834.png (361 KB, 738x1470)

361 KB PNG

So this is the power of Opus...

Anonymous
10/11/24(Fri)16:11:02 No.102780944

Anonymous 10/11/24(Fri)16:11:02 No.102780944

>>102780912
>t. scrawny zoomer afraid of direct replies
lol?

Anonymous
10/11/24(Fri)16:12:44 No.102780966

Anonymous 10/11/24(Fri)16:12:44 No.102780966

File: Untitled.png (10 KB, 679x53)

10 KB PNG

lol owned this sissy hypno sharty addict
EZ

Anonymous
10/11/24(Fri)16:13:48 No.102780982

Anonymous 10/11/24(Fri)16:13:48 No.102780982

going out now so I won't read your reply. that's right I've owned you like 3 times in 10 minutes. typical for you THOUGH zoomer incel

Anonymous
10/11/24(Fri)16:14:12 No.102780989

Anonymous 10/11/24(Fri)16:14:12 No.102780989

>filter-tranny
ah now it makes sense

Anonymous
10/11/24(Fri)16:22:06 No.102781096

Anonymous 10/11/24(Fri)16:22:06 No.102781096

File: ds.png (19 KB, 720x232)

19 KB PNG

>>102780908
Architecture, not model. Specially if they're training a 10b which, as far as i know, meta hasn't released any of that size. Mistral, qwen, deepseek...they're all roughly based on the llama architecture.
As for the dataset... yeah... this is what i found. And only 1T tokens, but it's an experiment.

Anonymous
10/11/24(Fri)16:40:52 No.102781281

Anonymous 10/11/24(Fri)16:40:52 No.102781281

>>102780921
Claude Opus? That's the easiest model out there to jailbreak. Any kind of prefill will dodge practically all refusals.

Anonymous
10/11/24(Fri)16:51:13 No.102781383

Anonymous 10/11/24(Fri)16:51:13 No.102781383

>>102776659
I wonder why they only decided to train the decoder on text. They went through the trouble of encoding the images/videos into tokens and putting them all into the same embedding space, why not take the obvious next step of letting it predict the next tokens of those too?

Anonymous
10/11/24(Fri)16:55:08 No.102781432

Anonymous 10/11/24(Fri)16:55:08 No.102781432

I have pyramid video model running on my computer, any ideas for prompts? I've already tested everything I can think of

Anonymous
10/11/24(Fri)17:47:30 No.102782030

Anonymous 10/11/24(Fri)17:47:30 No.102782030

File: file.png (834 KB, 768x768)

834 KB PNG

/lmg/ is dead again...

Anonymous
10/11/24(Fri)17:50:46 No.102782070

Anonymous 10/11/24(Fri)17:50:46 No.102782070

>>102782030
Eat the Pochiface

Anonymous
10/11/24(Fri)17:50:56 No.102782073

Anonymous 10/11/24(Fri)17:50:56 No.102782073

>>102782030
give her armpit hair
MVHAG

Anonymous
10/11/24(Fri)17:52:18 No.102782086

Anonymous 10/11/24(Fri)17:52:18 No.102782086

>>102782073
no! filthy zoomers...

Anonymous
10/11/24(Fri)17:52:36 No.102782096

Anonymous 10/11/24(Fri)17:52:36 No.102782096

>>102781432
penis

Anonymous
10/11/24(Fri)17:55:46 No.102782136

Anonymous 10/11/24(Fri)17:55:46 No.102782136

>>102781432
flying armpit hair

Anonymous
10/11/24(Fri)18:01:19 No.102782205

Anonymous 10/11/24(Fri)18:01:19 No.102782205

>>102781432
Try to spoonfeed the rest of us on how to set it up

Anonymous
10/11/24(Fri)18:16:12 No.102782376

Anonymous 10/11/24(Fri)18:16:12 No.102782376

>>102782205
How come /ldg/ can get it set up without spoonfeeding but /lmg/ can't?

Anonymous
10/11/24(Fri)18:18:43 No.102782401

Anonymous 10/11/24(Fri)18:18:43 No.102782401

>>102782030
Good.

Anonymous
10/11/24(Fri)18:19:28 No.102782407

Anonymous 10/11/24(Fri)18:19:28 No.102782407

File: 2024-10-11-171659_1105x78(...).png (29 KB, 1105x78)

29 KB PNG

What local models can handle image inlining in Sillytavern? The examples Sillytavern gives can't be run locally.

Anonymous
10/11/24(Fri)18:19:42 No.102782409

Anonymous 10/11/24(Fri)18:19:42 No.102782409

>>102782376
laziness is a virtue

Anonymous
10/11/24(Fri)18:23:43 No.102782467

Anonymous 10/11/24(Fri)18:23:43 No.102782467

File: 1723796444460470.png (888 KB, 1390x694)

888 KB PNG

>>102773834
https://x.com/_xjdr/status/1844807279834779786

Anonymous
10/11/24(Fri)18:24:12 No.102782474

Anonymous 10/11/24(Fri)18:24:12 No.102782474

File: Screenshot_2836.png (57 KB, 627x219)

57 KB PNG

>>102780305
I'm running 5.5bpw. Do you have any recommended sampler settings? I found it work best with a simple setup of min-p 0.04 + temp in the 1-1.1 range so far but it's not perfect.
I'm liking the model for the most part. The replies I've seen in my tests are long and creative. I think it's generating the best lewd scenes I've seen from Mistral-Large finetunes too.
One weird issue I'm having with Behemoth is that it's occasionally weirdly inaccurate when it comes to understanding implied details. For example, in a group chat one character sometimes starts adapting the verbal ticks of another character in the group for no reason. Luminum and Magnum don't do this at all from my experience. When it mixes this up, it predicts the tokens with a very high confidence so it's not something you can just min-p away.
I've also had it misjudge distances or get implied differences in body height/hair length wrong a bit more often than I'd expect from a 123b.

Anonymous
10/11/24(Fri)18:26:37 No.102782503

Anonymous 10/11/24(Fri)18:26:37 No.102782503

File: 1708815606318665.webm (3.99 MB, 720x990)

3.99 MB WEBM

>>102782467
and another one https://x.com/sam_e_farrar/status/1844791813913083998

Anonymous
10/11/24(Fri)18:43:05 No.102782671

Anonymous 10/11/24(Fri)18:43:05 No.102782671

File: __chiyo_ane_naru_mono_dra(...).jpg (897 KB, 1103x1558)

897 KB JPG

>>102782030
Good.

Anonymous
10/11/24(Fri)18:45:37 No.102782698

Anonymous 10/11/24(Fri)18:45:37 No.102782698

>>102772862
what's currently the best way to transform images to 3D, locally?

Anonymous
10/11/24(Fri)18:48:05 No.102782724

Anonymous 10/11/24(Fri)18:48:05 No.102782724

>>102782698
Blender

Anonymous
10/11/24(Fri)18:48:58 No.102782731

Anonymous 10/11/24(Fri)18:48:58 No.102782731

>>102782698
3DSMax or Blender.

Anonymous
10/11/24(Fri)18:50:37 No.102782748

Anonymous 10/11/24(Fri)18:50:37 No.102782748

You know, I wish... I wish local models had filters like C.AI
I know, crazy, but sometimes not being able to do everything is more fun than being able to literally everything like an omnipotent god.

Anonymous
10/11/24(Fri)18:52:20 No.102782767

Anonymous 10/11/24(Fri)18:52:20 No.102782767

File: 2CC80838-B5CD-4C1F-B85E-5(...).png (501 KB, 734x978)

501 KB PNG

How can I enable nigga mode

Anonymous
10/11/24(Fri)19:13:00 No.102782984

Anonymous 10/11/24(Fri)19:13:00 No.102782984

upgrading my 1080 to a 4070 ti super soon. I know it's not the best, but what can I expect to be able do with that? My 1080 was just enough for stable diffusion but language models came out speaking retarded

Anonymous
10/11/24(Fri)19:15:52 No.102783018

Anonymous 10/11/24(Fri)19:15:52 No.102783018

can someone pass me the big nigga card
i lost my lmg issued one

Anonymous
10/11/24(Fri)19:16:20 No.102783025

Anonymous 10/11/24(Fri)19:16:20 No.102783025

Is there anywhere I can try out Llama 3.2 90B vision for free?

Anonymous
10/11/24(Fri)19:18:33 No.102783055

Anonymous 10/11/24(Fri)19:18:33 No.102783055

>>102782984
>but language models came out speaking retarded
You're doing something wrong. 7-9b are coherent. Fuck, 1b model are coherent.
That's 16gb, right? mistral nemo. And calibrate your expectations.

Anonymous
10/11/24(Fri)19:31:08 No.102783168

Anonymous 10/11/24(Fri)19:31:08 No.102783168

Is there a local model trained on fantasy/non-human/beast data? I am trying to carefully rework non-human character cards, but anatomy slip-ups are still frequent.

Anonymous
10/11/24(Fri)19:59:03 No.102783411

Anonymous 10/11/24(Fri)19:59:03 No.102783411

Anyone fixed that bug where llamacpp repeats message after second swipe in ST? It doesn't happen with exl2.

Anonymous
10/11/24(Fri)20:04:31 No.102783471

Anonymous 10/11/24(Fri)20:04:31 No.102783471

interesting read for anyone interested and cudadev if he still visits this cess pit
https://probablydance.com/2024/10/07/initial-cuda-performance-lessons/

Anonymous
10/11/24(Fri)20:19:04 No.102783612

Anonymous 10/11/24(Fri)20:19:04 No.102783612

>>102783411
Did anyone else have the same bug?
Does it happen if you delete and regenerate the last generated tokens on llama.cpp's server ui directly?
For reference, doing that with the vim plugin just regenerates new stuff, as it should, every time.

Anonymous
10/11/24(Fri)20:25:15 No.102783682

Anonymous 10/11/24(Fri)20:25:15 No.102783682

>>102780841
>Decentralized training
Holy shit, it's actually here now huh? Impressive!

Anonymous
10/11/24(Fri)20:27:33 No.102783712

Anonymous 10/11/24(Fri)20:27:33 No.102783712

>>102782503
>>102782467
>1b
vramlets sisters.... we WON!

Anonymous
10/11/24(Fri)20:30:11 No.102783743

Anonymous 10/11/24(Fri)20:30:11 No.102783743

>>102782767
KEK

Anonymous
10/11/24(Fri)20:31:13 No.102783758

Anonymous 10/11/24(Fri)20:31:13 No.102783758

>>102782748
Man, /lmg/ is shit at creating cards, just go to /aicg/ for that.

Anonymous
10/11/24(Fri)20:40:56 No.102783867

Anonymous 10/11/24(Fri)20:40:56 No.102783867

Feeling tired of Mistral Medium. Fellow 96GB VRAM Chads, what's your favorite model right now?

Anonymous
10/11/24(Fri)20:44:22 No.102783909

Anonymous 10/11/24(Fri)20:44:22 No.102783909

>>102783867
>96GB VRAM
>Mistral Medium
?

Anonymous
10/11/24(Fri)20:46:46 No.102783943

Anonymous 10/11/24(Fri)20:46:46 No.102783943

>>102783867
You should be using 123b minimum.

Anonymous
10/11/24(Fri)20:49:52 No.102783970

Anonymous 10/11/24(Fri)20:49:52 No.102783970

File: teto-underground.jpg (589 KB, 1536x2304)

589 KB JPG

>>102783168
Any nala-test approved model would work

Anonymous
10/11/24(Fri)20:53:46 No.102784004

Anonymous 10/11/24(Fri)20:53:46 No.102784004

>>102782767
>Is that more to your liking, my friend?
go back

Anonymous
10/11/24(Fri)21:14:50 No.102784206

Anonymous 10/11/24(Fri)21:14:50 No.102784206

Why does koboldcpp sometimes start talking on my behalf?

Anonymous
10/11/24(Fri)21:18:32 No.102784249

Anonymous 10/11/24(Fri)21:18:32 No.102784249

>>102784206
prompt issue

Anonymous
10/11/24(Fri)21:26:37 No.102784349

Anonymous 10/11/24(Fri)21:26:37 No.102784349

>>102784206
It gets fed up with your shitty prompts.

Anonymous
10/11/24(Fri)21:29:29 No.102784387

Anonymous 10/11/24(Fri)21:29:29 No.102784387

>>102784206
Lots of cards have the
>DO NOT TALK FOR USER. DONT EVER TALK FOR USER. ABSOLUTELY DONT ACT FOR USER IN ANY CASE!
Later in the chat
>User smirks impishly and says "swing your ass over here"

Its a common problem anon. You gotta edit and continue.
Happens even at deeper context.

Anonymous
10/11/24(Fri)21:31:56 No.102784419

Anonymous 10/11/24(Fri)21:31:56 No.102784419

>>102784206
Prompt issue, sampler issue, system prompt issue, prompt format issue, model issue

Anonymous
10/11/24(Fri)21:37:29 No.102784477

Anonymous 10/11/24(Fri)21:37:29 No.102784477

>>102784206
>>102784387
I think I've had some success with:
>Your response must end before it's {{user}}'s turn to reply
It could just be the placebo effect, though.

Anonymous
10/11/24(Fri)21:38:34 No.102784487

Anonymous 10/11/24(Fri)21:38:34 No.102784487

>>102784387
telling an llm not to do something is a surefire way to make it do that
instead you should tell it to do the opposite, like 'only speak for X' and so on

Anonymous
10/11/24(Fri)21:40:53 No.102784506

Anonymous 10/11/24(Fri)21:40:53 No.102784506

>>102784487
i agree.
thats true in hypnosis as well. negatives dont work well.
and once its in the context its in the context.
learned a hard lesson with a jp/en translation test i made. gave it examples for uncucking and writing style reference. even with bigger models, it always bleeds through.
Telling the model what not to do can and will backfire.

Anonymous
10/11/24(Fri)21:49:54 No.102784608

Anonymous 10/11/24(Fri)21:49:54 No.102784608

Have you ever tried LLM ironman masturbation challenge where you never reroll? How did it go?

Anonymous
10/11/24(Fri)21:58:02 No.102784696

Anonymous 10/11/24(Fri)21:58:02 No.102784696

>>102783970
I tried searching Google but only came up with a couple of archived /g/ threads where "nala" is mentioned in a relevant context. Can you spoonfeed me a little?

Anonymous
10/11/24(Fri)22:04:33 No.102784759

Anonymous 10/11/24(Fri)22:04:33 No.102784759

When do you guys think we'll be able to get the full text based adventure experience at the earliest? Basically very high context + good dming and game systems.

Anonymous
10/11/24(Fri)22:05:43 No.102784777

Anonymous 10/11/24(Fri)22:05:43 No.102784777

>>102784387
>>102784487
Sonnet 3.5 listens to that instruction so well that when I ask it to suggest what I should do next it says
>I apologize, but as an AI assistant, I cannot suggest actions for the character. My role is to narrate the story and control {{char}}, while you control {{user}}'s actions and decisions.
It's not an innate limitation of LLMs. It's a limitation of the LLMs you're using.

Anonymous
10/11/24(Fri)22:06:56 No.102784792

Anonymous 10/11/24(Fri)22:06:56 No.102784792

>>102784777
okay now prove that your prompt isn't getting edited by the service before being passed to the llm

Anonymous
10/11/24(Fri)22:08:13 No.102784809

Anonymous 10/11/24(Fri)22:08:13 No.102784809

>>102784477
I like something like that in combination with "do not describe what {{user}} says or does"
>>102784487
awful advice, there are no LLMs in existence that actually understand what "only speak for {{char}}" means, it's a dogshit instruction and that's why everyone who uses it continues to have this issue
being clear and descriptive about the behavior you want >>>> outdated folk wisdom about avoiding negatives

Anonymous
10/11/24(Fri)22:12:00 No.102784848

Anonymous 10/11/24(Fri)22:12:00 No.102784848

>>102784759
Do you really want us to pull numbers out of our ass?
Sure. No less than a day. And somewhere before the heat death of the universe.

Anonymous
10/11/24(Fri)22:13:45 No.102784865

Anonymous 10/11/24(Fri)22:13:45 No.102784865

>>102784809
>outdated folk wisdom about avoiding negatives
i dont know how well anons phrasing works or not. but if you can, you should avoid negatives. thats just true.
everybody does it to tard wrangle, even openai. (DO NOT WRITE COPYRIGHTED TEXT etc. from the leaked mac app prompt)
but like i said. with my translation testing there was a low chance it took the "do not" part and actually applied that to the translation.
there are huge issues with how context is looked at.

>>102784759
difficult to say.
ever since last year when chatgpt came out and we had more than pyg it felt like we are very close.
i think self reflection and stopping and thinking is needed.
thats why you cant even make any sort of longer interesting novel with llm currently.
the ai would need to stop and think what would be interesting, twists, engaging for the user. it just doesnt work well right now.

Anonymous
10/11/24(Fri)22:14:05 No.102784869

Anonymous 10/11/24(Fri)22:14:05 No.102784869

>>102784809
>awful advice, there are no LLMs in existence that actually understand what "only speak for {{char}}" means, it's a dogshit instruction and that's why everyone who uses it continues to have this issue
Wrong. On Sonnet 3.5 this worjs even better than I want it to:
>The story so far begins with a "[Story start]" token, and consists of alternating messages by Assistant (you) and Human (the user). Human and Assistant take turns to add to the story, and this continues indefinitely.

>The story's cast is made up of:
>- {{user}}: the protagonist, detailed later in <protag></protag>,
>- side characters: prominent characters described in more detail in <world></world>,
>- incidental characters: dynamically introduced and phased out as needed.

>There are strict rules for the contents added in each turn:
>- Human turn: Describe only {{user}}'s actions, dialogue, thoughts and feelings.
>- Assistant turn: Write only general story narration and the actions/dialogue of side/incidental characters. You cannot control or imply {{user}}'s thoughts or actions.

Anonymous
10/11/24(Fri)22:14:13 No.102784870

Anonymous 10/11/24(Fri)22:14:13 No.102784870

>>102784848
>Do you really want us to pull numbers out of our ass?
yes, obviously, and thank you for your asspull.

Anonymous
10/11/24(Fri)22:17:43 No.102784903

Anonymous 10/11/24(Fri)22:17:43 No.102784903

>>102784869
but I'm absolutely right, your instruction works because it *isn't* just "don't write for user", it's something else entirely that is actually clear about what behavior you want (and, btw, basically has a negative by telling it what it cannot do)

Anonymous
10/11/24(Fri)22:18:38 No.102784912

Anonymous 10/11/24(Fri)22:18:38 No.102784912

>>102783909
>>102783943
Wow, what a blunder. I was under the impression the 120B one was called medium and large was something Mistral was holding out on us.

Anonymous
10/11/24(Fri)22:20:30 No.102784933

Anonymous 10/11/24(Fri)22:20:30 No.102784933

>>102784912
Mistral Medium is the 70b Miqu leak.

Anonymous
10/11/24(Fri)22:27:35 No.102785003

Anonymous 10/11/24(Fri)22:27:35 No.102785003

Why is saving in kobold so shit, holy fuck. Why not save to a local db or the file system, why make me download jsons manually?

Anonymous
10/11/24(Fri)22:28:16 No.102785014

Anonymous 10/11/24(Fri)22:28:16 No.102785014

>>102784206
does koboldcpp have the ability to stop like on "\nUser:"

Anonymous
10/11/24(Fri)22:29:24 No.102785031

Anonymous 10/11/24(Fri)22:29:24 No.102785031

>>102784865
>i think self reflection and stopping and thinking is needed.
Anything to look into about those?

Anonymous
10/11/24(Fri)22:34:07 No.102785078

Anonymous 10/11/24(Fri)22:34:07 No.102785078

>>102781432
HOW!!!!!?

Anonymous
10/11/24(Fri)22:35:05 No.102785089

Anonymous 10/11/24(Fri)22:35:05 No.102785089

>>102785003
Anon just use SeriousTavern

Anonymous
10/11/24(Fri)22:38:18 No.102785126

Anonymous 10/11/24(Fri)22:38:18 No.102785126

>>102785089
That's ServiceTesnor to you, bigot!

Anonymous
10/11/24(Fri)22:45:10 No.102785198

Anonymous 10/11/24(Fri)22:45:10 No.102785198

>>102784865
Hm, honestly I figured the context thing to be the bottleneck aside from ever bigger, better, and more efficient models. That being said self reflection could be a big one if it actually works and does not eat tokens like crazy but I assume that kind of thing would be baked in the model at that point rather than just be a prompt.

Anonymous
10/11/24(Fri)22:50:46 No.102785251

Anonymous 10/11/24(Fri)22:50:46 No.102785251

Anyone know if AMD gpus have been benchmarked against other GPU?

I want to know how my 6950xt stacks up against others. (yes, I know amd lags)

Anonymous
10/11/24(Fri)22:51:17 No.102785256

Anonymous 10/11/24(Fri)22:51:17 No.102785256

>>102785198
I think anthropic is the only one who actually did something different with context.
Sonnet 3.5 manages to give multiple versions of a app for example and doesnt trip up.
GPT does the "oh you are right, i made a mistake" then gives the same mistake again.
o1 is not that good and takes long in comparison. Like it just adds random stuff you didnt ask for sometimes.
Context just sucks currently and everybody lies with the tests. Its all just needle tests.
Try putting like a ff9 guide or something into gemini and ask "i am at X what do i need to do next?". At least from when I last looked there was no understanding of placement. Its not usable.
Stuff moves fast though, I dont doubt this stuff will be fixed soon.

>>102785031
Dont know sorry. I experimented myself in sillytavern with a self made extention but it just wasted tokens.
I'm sure you need a model trained on it.
The llama3 70b relfection finetune was horrible too. They just trained it to first give the wrong answer and then be "oh thats wrong, let me correct it!". That made it worse than the original. lol

Anonymous
10/11/24(Fri)23:01:40 No.102785348

Anonymous 10/11/24(Fri)23:01:40 No.102785348

>>102776431
>Mistral Small is already so fucking fast
No one makes models for local. Faster means you need less GPUs&Joules to serve the same number of requests in a cloud, so faster is always good.

As for local, Aria cuts down in FLOPs/bandwidth so much it will probably run fast enough on CPU. In fact a 200 Billion parameter version might run at 10t/s on a 12 memory channel CPU if extremely optimized.

Anonymous
10/11/24(Fri)23:28:12 No.102785613

Anonymous 10/11/24(Fri)23:28:12 No.102785613

File: miku.jpg (246 KB, 1024x1024)

246 KB JPG

>come to my datacenter, anon. Plenty of VRAM here to run your models at full precision

Anonymous
10/11/24(Fri)23:56:20 No.102785820

Anonymous 10/11/24(Fri)23:56:20 No.102785820

Do the BNB quants of a model work out of the box directly in transformers like the original models do?

There's 4bit BNB quants of Aria, but the guy that quanted it didn't document any of it, and the model is useless to me if the multi-modal doesn't work in 4bit mode.

Anonymous
10/12/24(Sat)00:01:06 No.102785845

Anonymous 10/12/24(Sat)00:01:06 No.102785845

>>102776992
i know who you are

Anonymous
10/12/24(Sat)00:01:31 No.102785846

Anonymous 10/12/24(Sat)00:01:31 No.102785846

>"Are you just interested in me for sex?!"

Damned it... how do I answer this gotcha question? I failed once before irl. At least now I can reroll the answer infinite times with llms.

Anonymous
10/12/24(Sat)00:03:51 No.102785865

Anonymous 10/12/24(Sat)00:03:51 No.102785865

File: 00292-1730366010.png (353 KB, 512x512)

353 KB PNG

are there any 13b models better than thebloke?

Anonymous
10/12/24(Sat)00:04:49 No.102785871

Anonymous 10/12/24(Sat)00:04:49 No.102785871

>>102785846
It's no

Anonymous
10/12/24(Sat)00:22:58 No.102785994

Anonymous 10/12/24(Sat)00:22:58 No.102785994

>>102785865
no i don't think 13b models are smarter than that guy yet

Anonymous
10/12/24(Sat)00:23:02 No.102785995

Anonymous 10/12/24(Sat)00:23:02 No.102785995

File: sera.png (173 KB, 946x465)

173 KB PNG

So is this anti-slop thing just -100 logits or something better?

Anonymous
10/12/24(Sat)00:24:56 No.102786004

Anonymous 10/12/24(Sat)00:24:56 No.102786004

>>102785995
What model is this with that sampler? It seems good

Anonymous
10/12/24(Sat)00:27:39 No.102786030

Anonymous 10/12/24(Sat)00:27:39 No.102786030

>>102786004
Mistral Small.
temp 1.25, min-p 0.02, rep pen 1.03, smoothing factor 0.25, dry 0.75/1.75, "banned_strings": ["smirk", "widen", "grin", "a mix of", "a mixture of", "chuckle", "ministrations", "night is young", "night was young", "fingers dig", "desire", "wolfish", "impish", "mischie", "in anticipation", "conspiratorial", "softening", "aises an eyebrow", "expression"]

Anonymous
10/12/24(Sat)00:27:51 No.102786031

Anonymous 10/12/24(Sat)00:27:51 No.102786031

>>102773537
The well to do live rent free in your head and you are misinformed about exl2
>the scum of this general
That would be (You)

Anonymous
10/12/24(Sat)00:29:43 No.102786042

Anonymous 10/12/24(Sat)00:29:43 No.102786042

>>102786030
Thanks. I actually thought it might be large.

Anonymous
10/12/24(Sat)00:43:27 No.102786141

Anonymous 10/12/24(Sat)00:43:27 No.102786141

File: ShinyMikuLove.png (1.2 MB, 1168x880)

1.2 MB PNG

Good night /lmg/

Anonymous
10/12/24(Sat)00:45:03 No.102786160

Anonymous 10/12/24(Sat)00:45:03 No.102786160

oyasumi

Anonymous
10/12/24(Sat)00:56:27 No.102786278

Anonymous 10/12/24(Sat)00:56:27 No.102786278

>>102786141
Good night Miku

Anonymous
10/12/24(Sat)01:03:10 No.102786324

Anonymous 10/12/24(Sat)01:03:10 No.102786324

>>102785865
MN-Dark-Planet is the best I've tried so far.

Anonymous
10/12/24(Sat)01:14:09 No.102786405

Anonymous 10/12/24(Sat)01:14:09 No.102786405

>>102785820
Fuck me I'm goddamn retarded. That's not the same Aria

Anonymous
10/12/24(Sat)01:30:07 No.102786522

Anonymous 10/12/24(Sat)01:30:07 No.102786522

we've had the updates of Mistral Large and Small, when are they gonna drop a newer Medium
(no, shutup about Miqu, it's too old now. I'm talking about an updated version the way the other two were recently updated)

Anonymous
10/12/24(Sat)01:59:02 No.102786724

Anonymous 10/12/24(Sat)01:59:02 No.102786724

>>102773259
>text completion
imo if someone is just doing text completion why would they use a finetune like rocinante over base nemo

Anonymous
10/12/24(Sat)02:03:23 No.102786756

Anonymous 10/12/24(Sat)02:03:23 No.102786756

>>102785251
>>102785251
>>102785251
reee where are the benchmarks?

Anonymous
10/12/24(Sat)02:06:27 No.102786777

Anonymous 10/12/24(Sat)02:06:27 No.102786777

Tried playing some eroges, and it was so much worse than I remember. Even despite the slop, AI is just better at this point.

Anonymous
10/12/24(Sat)02:09:04 No.102786793

Anonymous 10/12/24(Sat)02:09:04 No.102786793

>>102786777
I disagree, it can't do as long of a story, it runs out of context.

Anonymous
10/12/24(Sat)02:12:25 No.102786824

Anonymous 10/12/24(Sat)02:12:25 No.102786824

>>102786793
Humans maintain context by summarizing. Humans use lossy synopses. Politicians talking to professionals are the best example of this. Trump is like "Joe is AMAZING. He makes these things like you would never believe."

Anonymous
10/12/24(Sat)02:14:13 No.102786836

Anonymous 10/12/24(Sat)02:14:13 No.102786836

>>102786824
Yes, but if I read a book for example the author perfectly builds on what happened before without messing it up. The characters having poor recall can be fine sometimes, but the author of the work should be able to keep track of every detail.

Anonymous
10/12/24(Sat)02:22:39 No.102786908

Anonymous 10/12/24(Sat)02:22:39 No.102786908

>>102786836
He takes notes, and has an editor. Many writers use software, or other tools (eg sticky notes and things) to keep track of the narrative. It's a huge topic and much more interesting than I thought it might be.

Anonymous
10/12/24(Sat)02:35:57 No.102787003

Anonymous 10/12/24(Sat)02:35:57 No.102787003

File: gal2-matriz-asociada.png (1.01 MB, 1812x6836)

1.01 MB PNG

Does anyone know how to use AI to do semantic segmentation of images (specifically textbooks)?
I tried this with ChatGPT (the web interface) and the attached image but it didn't work at all, it just segmented it in 10% intervals lol.
Analyze the attached image, and segment it so all the content is covered, and each section covers approximately between 10 and 15 lines, and ideally is segmented at points where the text transitions from one topic or section to another.
The segmentation should be outputted in the following format:
BEGIN
[x0,y0,x1,y1]
[x0,y0,x1,y1]
...
[x0,y0,x1,y1]
[x0,y0,x1,y1]
END
with the coordinates starting from the upper left corner and being specified in percentage points.
For example:
BEGIN
[5.00,5.00,95.00,10.00]
[5.00,11.00,95.00,13.00]
[5.00,15.00,95.00,17.00]
END
My end goal is to build a semi-automated PDF to LaTeX pipeline, so then I can use the LLMs along with the OCR'd LaTeX of the textbooks to tutor me on these subjects.
To get a decent transcript I have to segment the pages into multiple pieces, because probably when the model loads the image it does it at a fixed resolution, and if the image is too large it ends up being unreadable to the model. This is an attempt at a preprocessing step to do conversion to LaTeX later on.

Anonymous
10/12/24(Sat)02:50:48 No.102787100

Anonymous 10/12/24(Sat)02:50:48 No.102787100

>>102785846
"Tits or GTFO"
Alternately,
"Shoe on head"

Anonymous
10/12/24(Sat)02:53:00 No.102787120

Anonymous 10/12/24(Sat)02:53:00 No.102787120

>>102787003
Actually I realized I'm overthinking it.
Since this is a PDF and not a scanned document I can just split the images at the middle of the next blank line after some fixed %.
It's not going to be splitted semantically which will reduce the amount of clues given to the LLM to do the transcription but I think it'll be good enough.

Anonymous
10/12/24(Sat)02:55:03 No.102787137

Anonymous 10/12/24(Sat)02:55:03 No.102787137

>>102786522
There might not be a Medium 2, they were phasing it out.

Anonymous
10/12/24(Sat)03:00:40 No.102787183

Anonymous 10/12/24(Sat)03:00:40 No.102787183

File: holochat.png (87 KB, 990x432)

87 KB PNG

Mistral Nemo Instruct.
Whoa. Impressive.

Anonymous
10/12/24(Sat)03:02:31 No.102787196

Anonymous 10/12/24(Sat)03:02:31 No.102787196

>>102787183
>Mistral Nemo Instruct
Which quant?

Anonymous
10/12/24(Sat)03:03:58 No.102787209

Anonymous 10/12/24(Sat)03:03:58 No.102787209

>>102787196
Mistral-Nemo-Instruct-2407-Q5_K_L.gguf

Anonymous
10/12/24(Sat)03:10:23 No.102787264

Anonymous 10/12/24(Sat)03:10:23 No.102787264

>>102787209
Interesting.

Anonymous
10/12/24(Sat)03:12:55 No.102787281

Anonymous 10/12/24(Sat)03:12:55 No.102787281

>>102787183
Nemo is dumb, but undeniably soul.

Anonymous
10/12/24(Sat)03:17:31 No.102787325

Anonymous 10/12/24(Sat)03:17:31 No.102787325

>>102787281
It's not so dumb that I can't coax gold out of it by swiping, and the swipes are like 1 second each so no big deal.
I'm digging it.
That's honestly the single coolest thing any model has ever said to me.

Anonymous
10/12/24(Sat)03:20:45 No.102787351

Anonymous 10/12/24(Sat)03:20:45 No.102787351

"But I like it" sure is a Nemoism, though.

llama.cpp CUDA dev !!OM2Fp6Fn93S
10/12/24(Sat)04:19:33 No.102787816

llama.cpp CUDA dev !!OM2Fp6Fn93S 10/12/24(Sat)04:19:33 No.102787816

>>102783471
Thanks, but the tips from that article are pretty basic and I already know them.
Quite frankly, if you've read the CUDA documentation the article will teach you nothing new.

Anonymous
10/12/24(Sat)04:21:16 No.102787831

Anonymous 10/12/24(Sat)04:21:16 No.102787831

>>102787816
8^)

Hello person who can't code for AMD.

Anonymous
10/12/24(Sat)04:23:14 No.102787851

Anonymous 10/12/24(Sat)04:23:14 No.102787851

for a new pc build, amd or nvidia gpu if i want to learn and run LLMs (and stable diffusion) on linux?

Anonymous
10/12/24(Sat)04:26:23 No.102787878

Anonymous 10/12/24(Sat)04:26:23 No.102787878

>>102787851
nvidia

Anonymous
10/12/24(Sat)04:33:44 No.102787931

Anonymous 10/12/24(Sat)04:33:44 No.102787931

>>102787851
nvidia unfortunately
alternatively, specialty cards with much more vram but may be considerably slower

Anonymous
10/12/24(Sat)04:37:28 No.102787955

Anonymous 10/12/24(Sat)04:37:28 No.102787955

>>102774337
Bumping this, will spin up a h100 on runpod to play with
I think I'll try mistral large, would there be a big difference between just using the ollama model over serving it with something like vllm or embedding calling it from my pay script?
Happy for any recommendations, I'm super new at this

Anonymous
10/12/24(Sat)04:56:01 No.102788107

Anonymous 10/12/24(Sat)04:56:01 No.102788107

>>102780841
Holy shit, could we finally make our own uncensores autistic model?

Anonymous
10/12/24(Sat)05:02:32 No.102788158

Anonymous 10/12/24(Sat)05:02:32 No.102788158

grifter thread

Anonymous
10/12/24(Sat)05:11:04 No.102788227

Anonymous 10/12/24(Sat)05:11:04 No.102788227

File: evil.png (103 KB, 506x376)

103 KB PNG

Anonymous
10/12/24(Sat)05:23:55 No.102788302

Anonymous 10/12/24(Sat)05:23:55 No.102788302

>>102787183
Holy shit you can get to 200 messages writing a sentence at a time? Groundbreaking stuff anon, when is the paper coming out?

Anonymous
10/12/24(Sat)05:37:02 No.102788393

Anonymous 10/12/24(Sat)05:37:02 No.102788393

>>102786793
it can't do a long story because all the real creativity comes from the prompt, the model just puts words and slop together to expand on the prompt

Anonymous
10/12/24(Sat)05:56:35 No.102788536

Anonymous 10/12/24(Sat)05:56:35 No.102788536

File: b501ad7e-e56b-4895-ad80-8(...).jpg (6 KB, 226x402)

6 KB JPG

>>102788158
>grifter thread
Yes.

Anonymous
10/12/24(Sat)06:03:46 No.102788601

Anonymous 10/12/24(Sat)06:03:46 No.102788601

>>102773870
schizos are attuned to esoteric patterns in reality that normies can't even perceive let alone comprehend...
for all we know his kabbalah sampler will gen very hot smut
state of the art smut heat beyond your myopic imagination

Anonymous
10/12/24(Sat)06:05:20 No.102788615

Anonymous 10/12/24(Sat)06:05:20 No.102788615

>>102780305
very nice, having a lot of fun with it

Anonymous
10/12/24(Sat)06:16:28 No.102788722

Anonymous 10/12/24(Sat)06:16:28 No.102788722

>>102785613
I trust this Miku

Anonymous
10/12/24(Sat)06:29:18 No.102788836

Anonymous 10/12/24(Sat)06:29:18 No.102788836

>>102788601
Instead of making a sampler, he should apply that structure to the NN architecture itself. I won't be surprised if it works just out of the box, considering the thing we work with here.
Kabbalah-like half-jumps between the layers are often learned by evolutionary algorithms, maybe that is what required to preserves the polarity of thought, a separation.
t. X transcendent

Anonymous
10/12/24(Sat)06:54:30 No.102789030

Anonymous 10/12/24(Sat)06:54:30 No.102789030

https://app.primeintellect.ai/intelligence
You will sacrifice your 3090 for the benefit of humanity won't you anon?

Anonymous
10/12/24(Sat)06:55:51 No.102789044

Anonymous 10/12/24(Sat)06:55:51 No.102789044

>>102789030
No but I'll let you sacrifice yours

Anonymous
10/12/24(Sat)07:00:39 No.102789082

Anonymous 10/12/24(Sat)07:00:39 No.102789082

>>102789030
>Help us train our censored slop
No thank you. I'll consider it if someone started a based uncensored schizo model tho.

Anonymous
10/12/24(Sat)07:03:14 No.102789101

Anonymous 10/12/24(Sat)07:03:14 No.102789101

>>102789082
This won't happen with foss troons in charge.

Anonymous
10/12/24(Sat)07:04:27 No.102789111

Anonymous 10/12/24(Sat)07:04:27 No.102789111

>>102789101
Someone needs to fork the project then.

Anonymous
10/12/24(Sat)07:07:24 No.102789138

Anonymous 10/12/24(Sat)07:07:24 No.102789138

>>102789030
Let me see the dataset first before I decide if I want to contribute.

Anonymous
10/12/24(Sat)07:15:39 No.102789186

Anonymous 10/12/24(Sat)07:15:39 No.102789186

>>102789138
>55% Fineweb-edu
>20% DLCM
>20% Stack v2
>5% OpenWebMath
without any further nsfw filtering
Im hooked on that hopium right now

Anonymous
10/12/24(Sat)07:19:09 No.102789209

Anonymous 10/12/24(Sat)07:19:09 No.102789209

>>102789030
/lmg/ should make it's own model with this and call it Divine Intellect.

Anonymous
10/12/24(Sat)07:21:56 No.102789230

Anonymous 10/12/24(Sat)07:21:56 No.102789230

>>102789186
https://github.com/PrimeIntellect-ai/OpenDiloco
btw this is the code they are using
we could use this to make a /g/-certified model, if we announced it on /g/ or /pol/ with the premise of finally making an unslopped model we could get 70-80 anons (960GB of VRAM if every anon had 12GB of it and 1.6 Peta Flops)

Anonymous
10/12/24(Sat)07:22:57 No.102789242

Anonymous 10/12/24(Sat)07:22:57 No.102789242

>>102789209
Lol nice name

Anonymous
10/12/24(Sat)07:29:54 No.102789307

Anonymous 10/12/24(Sat)07:29:54 No.102789307

>>102789230
I have a folder with 40k .txt unfiltered erotica with ALL kinds of stories from various sources (some already disappeared) from the 2000's to now, it just needs to be converted to a dataset.

Anonymous
10/12/24(Sat)07:30:38 No.102789310

Anonymous 10/12/24(Sat)07:30:38 No.102789310

File: 1702697825461851.png (368 KB, 609x859)

368 KB PNG

https://x.com/basedjensen/status/1844931497675063563

Anonymous
10/12/24(Sat)07:31:59 No.102789320

Anonymous 10/12/24(Sat)07:31:59 No.102789320

File: file.png (704 KB, 768x768)

704 KB PNG

Anonymous
10/12/24(Sat)07:32:52 No.102789326

Anonymous 10/12/24(Sat)07:32:52 No.102789326

>>102789310
>llm do not reason
ok, source?
>but that's ok
no it's not
>neither do we
source? you clearly don't

Anonymous
10/12/24(Sat)07:34:01 No.102789338

Anonymous 10/12/24(Sat)07:34:01 No.102789338

>>102789326
Calm down redditor https://arxiv.org/abs/2410.05229

Anonymous
10/12/24(Sat)07:35:22 No.102789354

Anonymous 10/12/24(Sat)07:35:22 No.102789354

>>102789338
Shalom

Anonymous
10/12/24(Sat)07:35:39 No.102789355

Anonymous 10/12/24(Sat)07:35:39 No.102789355

File: ylecunn.jpg (47 KB, 738x415)

47 KB JPG

>>102789310
LLMs do not reason. LLMs do not understand. LLMs do not think.

Anonymous
10/12/24(Sat)07:37:36 No.102789372

Anonymous 10/12/24(Sat)07:37:36 No.102789372

>>102789320
clothes painted on

Anonymous
10/12/24(Sat)07:38:52 No.102789385

Anonymous 10/12/24(Sat)07:38:52 No.102789385

>>102789355
Cause all of them are underage.

Anonymous
10/12/24(Sat)07:41:20 No.102789407

Anonymous 10/12/24(Sat)07:41:20 No.102789407

>>102789338
>>102789310
the shrek sampler unironically solves this, lmao

Anonymous
10/12/24(Sat)07:41:48 No.102789415

Anonymous 10/12/24(Sat)07:41:48 No.102789415

>>102789326
What a dumb take. Our reasoning is a result of our divine origin.

Anonymous
10/12/24(Sat)07:46:32 No.102789443

Anonymous 10/12/24(Sat)07:46:32 No.102789443

>>102789415
Getting shat out of a stinky hole?

Anonymous
10/12/24(Sat)07:47:24 No.102789452

Anonymous 10/12/24(Sat)07:47:24 No.102789452

File: file.png (128 KB, 541x350)

128 KB PNG

>360m passing the 9.11 vs 9.9 test
HOLY

Anonymous
10/12/24(Sat)07:49:01 No.102789465

Anonymous 10/12/24(Sat)07:49:01 No.102789465

>>102789443
You wish your penis was in that hole now. Maybe not that specific one but on the other hand you are anon...

Anonymous
10/12/24(Sat)07:50:56 No.102789476

Anonymous 10/12/24(Sat)07:50:56 No.102789476

>>102789452
Impressive. WTS was 415m and did not pass the 9,11 test.

Anonymous
10/12/24(Sat)07:51:41 No.102789484

Anonymous 10/12/24(Sat)07:51:41 No.102789484

>>102789452
now hit it with strawberry, nala, sally's brothers, and watermelons

Anonymous
10/12/24(Sat)07:51:52 No.102789488

Anonymous 10/12/24(Sat)07:51:52 No.102789488

>>102789452
Total vramlet victory?

Anonymous
10/12/24(Sat)07:55:44 No.102789509

Anonymous 10/12/24(Sat)07:55:44 No.102789509

>>102789465
Sure. Not ashamed to admit it.

Anonymous
10/12/24(Sat)07:56:57 No.102789521

Anonymous 10/12/24(Sat)07:56:57 No.102789521

File: file.png (115 KB, 900x566)

115 KB PNG

>>102789484
post prompts

Anonymous
10/12/24(Sat)08:11:12 No.102789599

Anonymous 10/12/24(Sat)08:11:12 No.102789599

>>102789307
>I have a folder with 40k .txt unfiltered erotica with ALL kinds of stories from various sources (some already disappeared) from the 2000's to now
You mean the asstr archive?

Anonymous
10/12/24(Sat)08:29:26 No.102789728

Anonymous 10/12/24(Sat)08:29:26 No.102789728

>>102789307
I have books3

Anonymous
10/12/24(Sat)08:31:15 No.102789747

Anonymous 10/12/24(Sat)08:31:15 No.102789747

>>102789452
>benchmark shows embarrassing flaw in LLM reasoning
>new models are trained with that specific benchmark in mind
>new models now solve the specific thing without any fundamental gains
how often have we been through this in the past year and a half?

Anonymous
10/12/24(Sat)08:32:52 No.102789760

Anonymous 10/12/24(Sat)08:32:52 No.102789760

>>102789599
Nope.

Anonymous
10/12/24(Sat)08:34:46 No.102789785

Anonymous 10/12/24(Sat)08:34:46 No.102789785

>>102789030
Can you actually contribute with 3090s?
The leaderboard unit is in H100 hours.

Anonymous
10/12/24(Sat)08:35:22 No.102789788

Anonymous 10/12/24(Sat)08:35:22 No.102789788

>>102789452
The more outrageous the claims the less I believe it. And I already didn't believe it at all.

Anonymous
10/12/24(Sat)08:37:16 No.102789799

Anonymous 10/12/24(Sat)08:37:16 No.102789799

>>102789785
I assume they use the H100 as a reference, the amount of work done is the actual measure

Anonymous
10/12/24(Sat)08:42:13 No.102789827

Anonymous 10/12/24(Sat)08:42:13 No.102789827

>>102789747
Benchmarkmaxxing is unironically AGI.
If you cover every aspect with benchmarks and force the AI to work with all of them, this is how you graduate the AI uni.
So keep adding your watermelon jokes and sally questions, one day it will take a skillchad to bring anything new for LLM to learn.

Anonymous
10/12/24(Sat)08:43:24 No.102789833

Anonymous 10/12/24(Sat)08:43:24 No.102789833

File: d65.png (136 KB, 1328x1080)

136 KB PNG

>>102789452
Holy f**k we now have a 360M model better than 405b. i can't wait for the godly erps dudes

Anonymous
10/12/24(Sat)08:45:51 No.102789852

Anonymous 10/12/24(Sat)08:45:51 No.102789852

>>102789833
vrambros...

Anonymous
10/12/24(Sat)08:55:33 No.102789918

Anonymous 10/12/24(Sat)08:55:33 No.102789918

>>102789788
you can literally try it yourself on colab...

Anonymous
10/12/24(Sat)08:58:49 No.102789932

Anonymous 10/12/24(Sat)08:58:49 No.102789932

>>102789918
This, it's kinda fucked up tho https://x.com/_xjdr/status/1842404312284307723
https://github.com/xjdr-alt/entropix/blob/main/entropix.ipynb

Anonymous
10/12/24(Sat)09:01:37 No.102789949

Anonymous 10/12/24(Sat)09:01:37 No.102789949

>>102789932
this is the updated one with 360m model, just click run and it works
https://github.com/SinatrasC/entropix-smollm

Anonymous
10/12/24(Sat)09:12:11 No.102790008

Anonymous 10/12/24(Sat)09:12:11 No.102790008

File: file.png (581 KB, 1290x548)

581 KB PNG

Why are nobel laureate machine learning researchers such plagarizing scoundrels?

Anonymous
10/12/24(Sat)09:13:25 No.102790017

Anonymous 10/12/24(Sat)09:13:25 No.102790017

File: datasets.png (2 KB, 1016x69)

2 KB PNG

>>102789728
Who doesn't? Still nothing compared to gutenberg. Something trained on textfiles, if they aren't already, would be fun.

Anonymous
10/12/24(Sat)09:18:12 No.102790047

Anonymous 10/12/24(Sat)09:18:12 No.102790047

>>102789310
>>102789355
Even if they don't "reason" at all in the traditional human sense, newer and larger models emulate reasoning better than prior and smaller models therefore it's likely future models will emulate reasoning better than current SoTA. Research and practice hasn't found any reason to believe we're nearing the limits of what the transformer architecture can do therefore it's very possible that the emulation of reason by transformers will eventually eclipse the real thing. So basically what difference does it make

Anonymous
10/12/24(Sat)09:28:23 No.102790125

Anonymous 10/12/24(Sat)09:28:23 No.102790125

AI makes you depressed, i work in a less hyped AI field than LLMs and the number of researchers in this field is small. if you are also doing basic research, you are just damn alone in this world. you sit at home and think about your work and there is just no one to talk to about it - not today and not tomorrow
im an emotional wimp, roast me

Anonymous
10/12/24(Sat)09:31:20 No.102790139

Anonymous 10/12/24(Sat)09:31:20 No.102790139

>>102789030
>no copyrighted books for sovl
dead in arrival

Anonymous
10/12/24(Sat)09:31:24 No.102790141

Anonymous 10/12/24(Sat)09:31:24 No.102790141

>>102790125
>AI makes you depressed
You are depressive. Where you less 'alone' before AI?

Anonymous
10/12/24(Sat)09:34:23 No.102790167

Anonymous 10/12/24(Sat)09:34:23 No.102790167

File: file.png (376 KB, 521x351)

376 KB PNG

>>102777963
>>102780453
Actually I was WRONG again, I won't be refunding yet, however, see pic

Seems that the supplier says the motherboard bios is old and that the model will show up as expected on cpu-z as he showed.
I will believe him because that rings a bell maybe and the naming convention could be outdated, it shows the serial num for the 9334. 32 64.

However I have another issue in that CPU0 isnt detected and CPU1 is lol.

Not sure where to begin other than reseating them which I hestitate,
Anyone got stories of success reseating a new out the box gigabyte to make it work again?

Anonymous
10/12/24(Sat)09:44:23 No.102790259

Anonymous 10/12/24(Sat)09:44:23 No.102790259

Have you ruled out an issue with the cpu power cable and/or socket on your PSU yet?

Anonymous
10/12/24(Sat)09:46:34 No.102790290

Anonymous 10/12/24(Sat)09:46:34 No.102790290

>>102790047
You first need to comprehend why these arguments and papers are taking place, in the first place. It's not about basic scientific inquiry, it's about the business that has surrounded that and tried to make the useful aspects of the tech part of something that ultimately is meant to fill the pockets of a few. LLMs can be useful. That doesn't mean many of the ones securing funding for making them necessarily have that as their primary goal (over the goal of giving themselves, the founders/execs, the ability to buy yachts and super cars), and unfortunately many employees of these companies don't realize they're being played, or are complicit because they have a slave and herd mentality. The argument about whether LLMs can reason and whether it's economical to keep scaling transformers is really just a part of the fight against these scam artists.

Anonymous
10/12/24(Sat)09:46:36 No.102790291

Anonymous 10/12/24(Sat)09:46:36 No.102790291

File: 84814.png (78 KB, 569x874)

78 KB PNG

>>102789355
Based lecun repost debunked o1

Anonymous
10/12/24(Sat)09:48:25 No.102790312

Anonymous 10/12/24(Sat)09:48:25 No.102790312

>>102790291
don't care. still gonna use o1

Anonymous
10/12/24(Sat)09:48:49 No.102790322

Anonymous 10/12/24(Sat)09:48:49 No.102790322

>>102789310
>Llms don't reason. But that's ok, neither do we.
I strongly agree with this sentiment.

Anonymous
10/12/24(Sat)09:50:53 No.102790349

Anonymous 10/12/24(Sat)09:50:53 No.102790349

>>102790259
Not yet, getting into it again tomorrow but my power cables for cpu are EPS's

Anonymous
10/12/24(Sat)09:51:03 No.102790352

Anonymous 10/12/24(Sat)09:51:03 No.102790352

>>102790291
yann is a fraud

Anonymous
10/12/24(Sat)09:57:18 No.102790414

Anonymous 10/12/24(Sat)09:57:18 No.102790414

>>102786793
>I disagree, it can't do as long of a story, it runs out of context.
I disagree. Its a matter of having the right prompt, using a large enough model and having the resources to have 32k+ context so the story is satisfyingly long. Long context is a solved problem at this point. Just use RULER to pick a model that stays consistent as long as you need it to.

Anonymous
10/12/24(Sat)09:57:50 No.102790422

Anonymous 10/12/24(Sat)09:57:50 No.102790422

>>102790397
>>102790397
>>102790397

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.