/g/ - Technology


/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>107741641 & >>107731243

►News
>(12/31) HyperCLOVA X SEED 8B Omni released: https://hf.co/naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B
>(12/31) IQuest-Coder-V1 released with loop architecture: https://hf.co/collections/IQuestLab/iquest-coder
>(12/31) Korean A.X K1 519B-A33B released: https://hf.co/skt/A.X-K1
>(12/31) Korean VAETKI-112B-A10B released: https://hf.co/NC-AI-consortium-VAETKI/VAETKI
>(12/31) LG AI Research releases K-EXAONE: https://hf.co/LGAI-EXAONE/K-EXAONE-236B-A23B
>(12/31) Korean Solar Open 102B-A12B released: https://hf.co/upstage/Solar-Open-100B

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
►Recent Highlights from the Previous Thread: >>107741641

--Supermicro motherboard sales policy and replacement challenges:
>107744417 >107744467 >107744480 >107744500 >107744548 >107744553 >107744590 >107744611 >107744699 >107744717 >107744768 >107744844
--Meta AI scandal: Llama 4 benchmark manipulation exposed:
>107742242 >107742271 >107743133 >107743280 >107742275
--IQuest-Coder-V1 benchmark integrity issues and practical applications:
>107742235 >107743431 >107743614
--Prompt engineering techniques for improved roleplay interactions:
>107747567 >107747780 >107747825 >107747855 >107747860 >107747908 >107747923 >107747911 >107747968 >107748017 >107748036
--Debugging context retention issues in sillytavern with llamacpp:
>107746907 >107747135 >107747532
--CPU offloading optimizations and performance trade-offs in MoE inference:
>107741871 >107741943 >107742076 >107742177 >107742352 >107742998 >107743060 >107741954 >107741968 >107742100 >107742280 >107744368
--Critique of model reasoning policies and distillation practices:
>107745990 >107746077 >107746089 >107746109 >107746163 >107746209
--EPYC CPU upgrades and memory bandwidth limits:
>107746355 >107746362 >107746385 >107746406 >107746464 >107746620 >107746633 >107746778 >107746721 >107746799
--GLM Air's positivity bias during violent roleplay scenarios:
>107744624 >107744639 >107744697 >107745042 >107745153 >107746662
--ERP model recommendations for 24GB VRAM users and Gemma critiques:
>107743438 >107743616 >107743728 >107743769 >107743803 >107743827 >107744179 >107744257 >107743816 >107743998 >107743949
--AMD PC setup for image-reactive chatbot using JoyCaption and Qwen3-VL:
>107745647 >107745702 >107745883 >107745933 >107745959 >107745971 >107746329 >107746340 >107746380 >107746618 >107746699 >107746774 >107746842 >107747353 >107745966
--Miku (free space):
>107746043 >107746206

►Recent Highlight Posts from the Previous Thread: >>107741646

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
I wanna preggu the migu
>>
What is the reason why this thread separated from /aicg/? Seems like there are 5 posters here.
>>
>>107749596
>(12/31) Korean A.X K1 519B-A33B
>not on OR
>no official way to talk to it
>some super special original architecture that nobody's ever going to bother to implement in llama.cpp
I guess we'll never know if this model is good or not
>>
>>107749641
One's a thread about running AI models, the other is about drinking your own piss to get access to a proxy. They evolved into two very different things after the llama1/3.5-turbo era split.
>>
>>107749667
What do you mean? Any time someone outside of the 5 autists posts here they'll get scorned.
>>
>>107749650
>>some super special original architecture that nobody's ever going to bother to implement in llama.cpp
>A.X K1 incorporates an additional RMSNorm applied after the MLP (MoE) block in each Transformer layer.
Would that really be so difficult to implement?
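Probably not. For reference, an RMSNorm is only a handful of lines; here's a minimal C sketch of what one extra post-MoE norm amounts to (my own toy code, names made up, not the actual llama.cpp implementation):

#include <math.h>

// y = x / rms(x) * w over one hidden vector of size n, eps is a small constant like 1e-6
void post_moe_rms_norm(const float *x, const float *w, float *y, int n, float eps) {
    float ss = 0.0f;
    for (int i = 0; i < n; i++) ss += x[i] * x[i];           // sum of squares
    const float scale = 1.0f / sqrtf(ss / n + eps);          // 1 / rms
    for (int i = 0; i < n; i++) y[i] = x[i] * scale * w[i];  // normalize + learned weight
}

llama.cpp already has the norm op itself, so assuming the model card is accurate the work would mostly be wiring one extra weight tensor into the graph and the convert script.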
>>
It's also strange that llama cuda dev (he is not employed by a company btw) uses a trip and posts in chronic masturbation threads as himself. Not a good look for his future career.
>>
>>107749695
>he posts chronic masturbation threads
???? did I miss something
but yeah it's crazy to post with a known identity in this thread considering his real name is on his github account
I personally wouldn't want to be known to be around your degenerate lot
>>
>>107749723
Yeah he often posts without his trip. I guess he's too autistic.
>>
>oh no he posts in le 4chan
are you serious guys?
>>
>>107749748
>are you serious guys?
yes.
this thread is full of sexual degenerates that belong to the ovens
I wouldn't be here if it wasn't one of the few places worth visiting for LLM news and conversations, disgusting filth
>>
>>107749650
all of the other korean models were benchmaxxed copycat scams of chinese models, what would make this one any different?
>>
>>107749695
As of right now I am indeed not doing paid work for any companies but the primary reason is that for my goals I don't think more capital would currently be very useful.
Long-term I intend to keep working in particle physics, the key to employment there (and probably other fields as well) is to have connections, so I'm not particularly concerned.

>>107749742
I have never been diagnosed with autism though I would not be surprised if I was.
>>
>>107749763
The best models are accidents that come out of nowhere. There is a 99.9% chance that this is more benchmaxx'd trash but the chance that it's actually decent exists.
It's also at a size where it can be a contender against all the other big 30~40b active parameter MoE models which makes it interesting.
>>
>>107749797
Are they though? Most of the actually good models had plenty of good research beforehand. This isn't the llama2 days when no one knew anything about anything. You can copy a successful arch like kimi did with deepseek and scale it up a bit, for example, but I wouldn't say it's better in every single case.
>>
Can I offload like half of the kv cache and keep the other half on the gpu?
>>
>>107749792
Very cute answer.
>>
>>107749792
llama.cpp split mode graph when?
>>
>>107747567
>>107747860
>a whole fucking sillytavern addon just to mimic what you get in mikupad for no effort
I've been telling you for ages that text completion in mikupad is the superior experience.
>>
quick start guide
>go to huggingface and download nemo 12b instruct gguf. Start with Q4.
Which one is 12b? I don't see it on the list. If that's a typo for 12gb there isn't one that size either.
>>
>>107749866
mistral nemo is a 12 billion parameter model
>>
>>107749866
the recommended models list has the links you need
>>
>>107749866
https://huggingface.co/bartowski/Mistral-Nemo-Instruct-2407-GGUF/tree/main
Just download whichever fits on your gpu with a few gb to spare.
>>
>>107749857
I'm not looking at the IK repository so I don't know the exact features that make up "split mode graph".
If I can I'll produce a working prototype for better parallelization of multiple GPUs by January 12.
>>
File: 1756786919372712.png (43 KB, 577x280)
>>107749870
Thanks
>>107749872
>>107749880
Yeah, that's where I got the huggingface link. There's multiple Q4 options here. I have an 8gb card. Are they targeting different vram specs? If that answer is in the image then I can't read it.
>>
>>107749907
>I have an 8gb card
Rough.
Download Q4_K_M. Part of it will have to stay in ram and it will be slow but that's the best you're getting with that kind of hardware.
>>
>>107749920
How fast is fast? Seems like your machine is so slow that you are afraid to run even a 14b model.
>>
>>107750001
Depends on how many layers you can put on the GPU which in turn depends on how much context you want.
4k context with all layers takes up 8.2GB here. You probably need some space for other programs as well.
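If you end up on llama.cpp, the two knobs are -ngl (layers on the gpu) and -c (context size); something like this is a reasonable starting point for 8GB, then nudge -ngl up or down until it stops OOMing (the 30 is just a guess, nemo has 40 layers total):

llama-server -m Mistral-Nemo-Instruct-2407-Q4_K_M.gguf -c 4096 -ngl 30

kobold's launcher exposes the same thing as the GPU layers setting.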
>>
File: 1747010051687818.png (101 KB, 779x686)
>>107749920
Alright I've got everything installed and running (I think). The guide says:
>connect it to SillyTavern using the API link provided by the back end.
Where on ST do I plug in the /api link the backend spat out?
>>
>>107750141
That's beneath the preset window you have open right now.
>>
File: file.png (53 KB, 977x410)
llm.c, claude paypiggie, fishboy i think this one's for you
>picrel is solar 100b open
>>
File: file.png (157 KB, 708x798)
trying to make a consistent jailbreak for solar open, got this
>>
>>107750213
UGH THE NOSTALGIA
TAKE ME BACK
>>
>>107750213
>disallowed content
>policy
why the fuck is everyone distilling from gpt-oss
>>
What metric would you use to compare different architecture models' performance on the same text dataset? I suspect that nemo would perform better on human-written fiction than any recent chinese moe.
>>
>https://rentry.org/lmg-lazy-getting-started-guide
>I recommend starting with the official instruct nemo tune before moving onto other tunes or merges.
So am I ready to go or is a "tune" another thing I have to learn about and install before beginning?
I just want to plug in characters from chub and go.
>>
>>107750301
no, tuning is just additional training and it's not something regular users do
this nemo tune is the gold standard but it makes some anons mad
https://huggingface.co/bartowski/Rocinante-12B-v1.1-GGUF
>>
File: file.png (174 KB, 971x846)
>>107750213
I don't have solar, how does it react to something like this? >>107749858
Also why even bother when GLM exists?
>>
>>107750247
there's a crazy coterie of benchmaxxing hacks who distill from small models that should never have been considered a target for distillation
even NVIDIA is a member of that retard club, their nemotrons are made with data that was genned by models like Qwen 30BA3B lmao
>>
File: file.png (154 KB, 652x1080)
>>107750339
>Also why even bother when GLM exists?
solar 10.7b was sexooooo, i want a different model so badly, ive been on air since august
fimbulvetr v2 was super sex
Rating: Explicit
Characters: Brother, Sister
Summary: This is a roleplay transcript. The sister molests her brother after he falls asleep.

---

Brother: OOC: The scene starts with my falling asleep on the couch with my dick in hand and cum drying on my stomach.
had me writing this shit anon, fuck you
>>
>>107750247
maybe in their delusions they think that toss is somewhere close to the mainline gpt models
>>
>>107750324
fuck off drummer youre not fooling anyone
>>
I can't tell which is worse between drummer spammers and NAI users
>>
>>107750484
if I were drummer I would shill more recent models I don't like
>>
>>107750463
Damn it went straight to it.
>>
>koboldcpp-nocuda
>0.4% GPU usage in task manager
Is this normal nocuda behavior or did I fuck something up?
>>
>>107750540
maybe the gui is eating some gpu performance, nocuda just means it won't touch your gpu for inference
>>
>>107750206
>i don't have personal life
How rude, they are little brains living in your computer.
>>
intel can't run MoE to save its life, I've quantized several larger MoEs to generally 44~72gb and none of them fit on 96gb split across four cards, vLLM just shits its pants
>>
>>107750565
install linux
>>
>>107750540
The framebuffer will always reserve some three hundred+ mb of gpu vram and show a little usage.
>>
>>107750579
I could've been clearer but you're also retarded anon, I am on linux, intel = intel GPUs.
>>
>>107750588
at least im not australian
>>
>>107750595
How's that working out for you? still cheap in my region, most things are.
>>
File: file.png (50 KB, 722x185)
>>107750612
:3
>>
srbe na vrbe
>>
>>107750620
zamn
>>
>>107750550
>>107750585
My problem is I'm trying to find the one that actually uses my GPU. Neither version goes very far above 0%, and they're quite slow.
>>
>>107750648
You're definitely doing it wrong with nocuda then if you're on nvidia. Did you configure the offload layers?
>>
>>107750648
If you just used llama-server like a sane person you'd get some useful output to help you figure out the issue.
>>
>>107749596
Dumb question: What's the best model to run locally if I have a 5800X3D, 128GB RAM and an RTX 4070 (12GB VRAM) for Vibecoding?
>>
>>107750660
The best model you could theoretically run is GLM 4.7 but it's going to be too slow to be useful for coding.
>>
>>107750660
a proxy/API, you won't get good enough token/sec for a large coding model with those specs.

maybe devstral2 123b but it will be painfully slow.
>>
File: solar.png (2.41 MB, 2002x9009)
SOLAR... i kneel
>>
>>107750665
>>107750671
Fuark.
What hardware would I need to run some proper state-of-the-art vibecoding setup?
>>
>>107750648
Get the vulkan or cuda binary. If you're on linux you'll need to compile the cuda build on your own.
Sometimes the gpu usage % isn't that high but if it's utilized in the right way the vram should be nearly full.
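If you go the llama.cpp route instead, the cuda build is just this (assuming the CUDA toolkit is already installed; the binaries land in build/bin):

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j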
>>
>>107750676
a credit card and a copilot subscription for free grok fast or 10 bucks on openrouter to use devstral2 free

you're looking at like a BWP 6000 or four-ish 3090s, maybe a mac unironically if that MLX shit gets any better
>>
File: file.png (11 KB, 366x195)
>>107750676
I'd argue that this gets you 90% of the way there in terms of what is possible locally.
Proprietary stuff is still better.
>>
I wasted a lot of time watching an LLM talk to itself using a browser tool
>>
>>107750698
I'm using Gemini already, the point is to get off of the grid and start using my own hardware. But if this is BWP 6000 Territory I think I might stick to Gemini.
>>107750704
SWEET BABY JESUS, that's more VRAM than my system memory. What did you pay for that, 30k?
>>
>>107750704
>Proprietary stuff is still better.
Not that guy but if I have 12G vram, do I just pay for claude?
>>
https://youtu.be/ILtz5nX3_fc
>>
>>107750745
'fraid so, unless you are fine with nemo
>>
>>107750745
qwen 30b a3b coder is as good as claude and can run entirely in ram and give fast speeds
>>
File: file.png (63 KB, 623x255)
pedoanon.. i kneel
>>
>>107750757
Yes, NUMA is also on the list.
I will do a generic implementation that is agnostic to the specific ggml backends being used to set a standard for how the implementation should work.
So for example, things like parallelizing multiple machines via the RPC server should work out-of-the-box.
But I will use the hardware that I already have and that I'm more experienced with for the initial development.
NUMA in particular will probably need some extra support to properly set up multiple CPU backends.
I've already contacted a seller on Alibaba for DDR5 memory (and also an NVIDIA A16).
>>
>>107750790
It sucks watching only 50gb/s out of 200gb/s bandwidth being used during hybrid inference. Didn't matter so much when most models were dense and had to fit on GPU.
>>
>>107750790
Thank you my dear. You are very gifted anon.
>>
>>107750790
Which parts of the inference are not currently parallelized but could be?
>>
verdict: solar open 100b's pretraining data is not prefiltered, however it's heavily censored and positivity slopped on top
salvageable? maybe
>>
>>107750822
For a very large number of concurrent prompts I think the current pipelining setup (--split-mode layer) could be utilized better by running multiple evals in the llama.cpp HTTP server.
For a single concurrent prompt the attention should be parallelized by attention head and the FFN similar to the current --split-mode row but fused to reduce synchronization.
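To illustrate the head split with toy numbers (not ggml code, just the partitioning idea; compile and run it if you want):

#include <stdio.h>

// 32 heads spread over 4 GPUs: each device gets a contiguous slice of
// heads and computes attention for its slice independently; the outputs
// then only need one concat/sync at the end instead of a sync per row split.
int main(void) {
    const int n_head = 32, n_gpu = 4;
    for (int d = 0; d < n_gpu; d++) {
        const int h0 = d * n_head / n_gpu;        // first head on this device
        const int h1 = (d + 1) * n_head / n_gpu;  // one past the last head
        printf("gpu %d: heads %d..%d\n", d, h0, h1 - 1);
    }
    return 0;
}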
>>
File: file.png (74 KB, 971x445)
im getting convinced this is actually llama4 continued training instead of glm kek
>>
>>107750916
That's hilarious.assistant
>>
>>107750704
>Proprietary stuff is still better.
it's a trillion times better
frankly in real use (not the stupid benchmarks) I find the current crop of deepseek, glm and qwen to be vastly inferior to Gemini 2, and let's not even begin to compare to 3 which completely obliterates them.
local for this is beyond stupid
even online is still somewhat stupid, I vibe a lot for quick throw away scripts but not so much for real work, I'm still often flabbergasted by the sort of dumb shit they pull out, even when the code does work it might do things that just aren't idiomatic in the language / less maintainable and flexible in the long term
>>
>>107750950
They are fine if you treat them as autocomplete.
Describe the code that implements feature x instead of asking it to implement feature x.
The size of the model mostly determines how much you can leave unspecified before the model starts producing unmaintainable garbage.
>>
>>107750950
they are all distills of gemini and claude, it's just too tempting for these labs since it's so easy and cheap
the last time we've had actual innovation was the first r1 release
hopefully v4 will be huge
>>
>>107750950
I'm finding LLMs to plateau myself. If you use cloud models enough they give you just as much retardation.
>>
>>107750950
I use devstral 24b for boilerplate and it works perfectly. The amount of retards in here that say "its shit cos it won't completely do my job for me" astounds me; not even the best SOTA API model is that good and by the looks of it they never will be. Vibecoding is a meme for nocode retards. Small coding models work fine if you know how to code and thus know how to prompt for code. Big API models are still more useful for more complex things like experimental refactors, prototyping and bugfixing, but even then you still need to nudge them in the right direction and unstick them when they get stuck (you still need to work)
>>
There is an alternate hellscape universe where Google never released the transformers paper and kept working on it themselves.
>>
>>107751197
There is an alternate hellscape universe where resnet and u-net weren't discovered so Google didn't work on transformers
>>
>>107751197
>higher ups don't see the potential
>gets shelved and goes to google graveyard
>nothing ever happens
the only bad thing is that we don't get the coom machines of today, otherwise it would be a net positive and sam's jewish ass would remain obscure and hidden
>>
>>107751243
I would legit make the trade of LLM not existing if I could ensure sam altman would never succeed at anything in life.
>>
>>107751197
>>107751238
There is an alternate hellscape where NVIDIA didn't invest into CUDA in the early 2000s and everyone is using ROCm instead.
>>
>>107751197
yes and it's a world where we're 5 years ahead of where we are right now because the slopped llm dark ages never happened
>>
>>107751243
Maybe someone comes up with a different actually good architecture instead of focusing on transformers. We have sentient waifus stroking our dicks and post scarcity.
>>
>>107745181
I just loaded the bf16 with 98k context and I couldn't bring myself to even get to 4k tokens, this garbage is what anons were cooming to in 2023?
>>
>>107751376
No, 8b 3.3 is a pathetic model that Meta released earlier this year as a proprietary exclusive for their AI/finetuning service. No fucking clue why they picked this of all things to keep locked up before it got "leaked" though.
But yeah, llama2 70b probably wouldn't hold up very well these days either. You wouldn't believe how nice we have it today with the big MoEs.
>>
>>107751385
>Earlier this year
Anon I...
>>
>>107751484
he hasn't processed the new year yet, his brain runs like the original deepseek R1 on a cpu maxxer celeron config
>>
If you cut off the cock and balls from a character and fuck it in the ass with it, that character will still somehow cum from being ass fucked like a faggot, because AI is retarded.
>>
>>107751640
Should've cut the prostate out too.
>>
>>107751640
can you not be a normal person
even if you have to be female brained and coom to text
>>
File: d0b459c235c80c41.png (75 KB, 1000x1000)
https://files.catbox.moe/ld4kax.patch
Wildcards for Kccp antislop sampler, used like this
"sen{10} down{3}back"

thanks opus
>>
>>107751788
>even if you have to be female brained and coom to text
Name a SINGLE of masculine AI usage
>>
>>107752204
Using an uncensored local model to generate instructions on how to cook [REDACTED] for the purpose of [REDACTED]
>>
solar open keeps crashing for me. RIP llms are gay anyway
>>
File: file.png (1.71 MB, 1280x1280)
>>107752204
The masculine urge to generate cute pictures of Migu.
>>
>start RP with a smarter model at low context
>Switch to another model with longer context when I hit the first's limit
>It picks up the prose of the original model from the context
GG ez
>>
>>107752251
Figured out the issue, kind of. So before I went to take a dump, during my suno prompt test, it spent more time pondering how to follow the simple fucking instruction and zero time pondering what stylistic elements would provide the desired sound. Not very promising.
>>
>>107752290
>cute
half way to ahegao not cute
>>107752335
>test-time distillation lol
In early days anons were priming first responses on API before going local
It's all the same hardware ops just many more of them as models increase in size, imagine a new arch that can scale low level ops fluidly and "intelligently" adjust during pass (perhaps uncertainty/DESIRE TO COMPUTE MORE can be trained after a couple initial layers, like moe gating)
>>
>>107751683
That's just weird, though you have a small point.

>>107751788
No. I'm a psycho that likes to inflict suffering on humans. That's what AI is for. Simulating how to best make my victims suffer.
>>
Is it even worth using glm 4.7/4.6 for RP if the best I can run is a q2 quant
>>
>>107752749
Yeah
>>
>>107752749
unironically yes. retard quant of 4.6 has so much more sovl than a q6 of glm air
>>
Hugging face won't let me download models right now :(
>>
>>107752749
If it's IQ2_M or better, sure. If it's lower than that, then it depends.
>>
>>107752832
Download what? You already have Nemo on your computer.
>>
>>107752749
>inb4 4.6 vs 4.7 aggro
vs. running what? Compare for yourself but probably yeah, tho I feel shame running a <4bpw quant; satisfied with 4.7 IQ3_M
With some patience you can likely squeeze better performance. What's your bottleneck/specs?
>>
>>107752873
want to try the new nemotron nano
>>
File: cockbench.png (1.9 MB, 1131x6568)
>>107752906
Why?
>>
>>107752916
That's exactly why. I like what it wrote there, and I'm curious to see what it will write with more guidance.
>>
File: 1737694546160700.jpg (269 KB, 928x1232)
>>107749596
>>
about to pull the trigger on an rtx pro 6000. I think with the ram prices going up, it will be the next 3090. thoughts?
>>
If you paralyze a character from the neck down and then kick that character in the ribs, the AI will write that the character winces in pain like the fucking retarded piece of shit it is. AI is a fucking joke.
>>
>>107752983
>it will be the next 3090
Except for the price of a single 6000 you can buy 10x 3090s and get 240GB of VRAM.
>>
>>107753015
>Save money on vram
>Lose all your belongings in a house fire
Does a PC motherboard even exist that can run that many gpus at full speed
>>
>>107753048
Any old bitcoin mining rig?
>>
>>107753015
yeah but it's slower and you can use the single large vram for things you can't do with multiple cards
>>
>>107752883
96gb ram/24gb vram
>>
>>107752916
I never understood what the point of the cockbench is other than spotting the obviously censored models

Is more cocks better? Do you love cocks?
>>
>>107753109
the overall distribution can tell you quite a bit about the model
>>
File: nemotron nano.png (115 KB, 844x713)
>>107752916
See, it ain't so bad?
>>
>>107753109
any sane person knows that it's not thigh or ass or ...
models that don't are worse for anything soulful.
>>
>>107753135
did you read the log before dumping this here?
>>
File: nemotron futa.png (161 KB, 853x1168)
>>107753135
lol.
>>
>>107753060
Mining rigs run at pcie x1 because they don't saturate memory, they stress compute; LLMs are kind of the opposite of that
>>
File: file.png (61 KB, 968x425)
>>107753135
>OAC
Turn off rep pen.

>>107753005
Model issue.

>>107753189
The memory is on the gpu. There is very little traffic on the pcie bus during inference.
>>
>>107753189
For the actual token generation step, PCIe bandwidth matters very little. The data being transferred is tiny. For prompt processing, it only matters if the model doesn't fit in VRAM and you have to shuffle weights back and forth.
>>
>>107753232
>>107753238
I might be the retarded one here but doesn't pcie get saturated when it's run in parallel?
>>
>>107753253
If you mean the row mode in llama.cpp yes but >>107750844
>>
>>107753127
>>107753140
Have you tried doing a pussybench? Something not spicy? Then compared them?

How do you know you aren't just finding the models that love cock the most
>>
>>107752916
What's the verdict on devstral 2 for RP and general fun now that it's been out for a lil while?
>>
>>107753232
>Turn off rep pen.
This just breaks the output. why?
>>
>>107753294
Try it and let us know.
>>
>>107753447
I will but I want you faggots to colour my opinion first

>>107753448
Is that devstral large?
What about small for us weirdos who want to run on a consumer GPU
>>
>>107753483
>What about small
it's pretty bad
>>
>>107753526
Hey, I tried the largestral some time ago too, but it was completely fucked on chat completion, it repeated one sentence over and over. Does it need a corrected jinja template or something?
>>
File: m2.jpg (61 KB, 735x720)
I only have drummer slop downloaded
>>
>>107753622
iirc they were being cheeky with EU regulations, declaring it as a code-only model to get around getting cucked, so it may have been on purpose
>>
>>107752749
How much context? If it's not very much I'd say it's not worth it
>>
>>107751197
Are you not seeing how they're keeping a lot private now
We now won't get a lot of stuff local any time soon
>>
wow, ERNIE-4.5-21B-A3B-PT is surprisingly good at translation when running it on some of my personal test prompts. Wonder why I hadn't heard about that model before; this is going to replace gemma for me as it is smaller than the 27b and much faster.
>>
>>107753232
>I action
you know nothing about roleplaying you utterly retarded inbred mongoloid nigger
>>
>>107753724
don't tell me you cuck yourself in your own rps?
>>
>>107753698
Mistral small my beloved is all I need.
>>
>>107753724
I'm sorry I didn't write half a page of flowery prose to test anon's retarded scenario.
>>
>>107753736
Mistrall Small and Nemo are FUCKING RETARDED
>>
>>107753749
Nemo is retarded I won't lie, but mistral small-chan does her best to remember parameters!!
>>
>>107753747
Did you never take an elementary school language arts class? You would have gotten an F for starting every fucking sentence with "I". Where do you people come from?
>>
>>107753772
>Where do you people come from?
not the third world of A.
>>
>>107753772
And you would get an F for comprehension. Why do you feel like you need to write a book, when it's pretty much an exchange between two entities?
It's more of playing a text based game rather than co-writing literature.
>>
>>107752832
>>107752906
Well... we're not supposed to talk about it, but
>ollama run nemotron-3-nano:30b-a3b-q4_K_M
>>
>>107753294
It's still free on OpenRouter. I thought it was too slopped even as a cope option.
>>
>>107753772
English is a retarded language. Proper languages have more advanced verb conjugation that lets you omit pronouns.
Without cucking yourself by using third person, how would you describe your actions without using "I"?
>>
>>107752906
>>107753806
imagine being paid to shill the pajeet model trained on another 30b's outputs lmao
saar did the needful
>>
>>107753830
*puts sac on your mom's lips*
>>
Anyone tried Mullein 24B? Same guy who made snowdrop.
>>
>>107753832
*imagines*
Yeah, it would be pretty sweet.
>>
>>107753857
which one?
>>
>>107753885
3.2 v2, mrader has quants. Seems to have flown relatively under the radar. Found it by chance while trying to diversify from drummerslop.
>>
File: 1730190929175085.jpg (86 KB, 500x569)
>>107753747
>>
File: file.png (50 KB, 1052x389)
>>107753923
>we don't know what we're doing, but we're still going to do it
about as much info as a beaver release, no clue if it's a tune or a merge, might try it tho
>>
File: file.png (647 KB, 800x600)
>>107753772
>>
>>107753971
>TRANSlator is traded who knew
>>
>I want models trained on VNs
They said...
>>
>>107753971
>His fist was like an unstoppable force, no that would be a wrong description. It was like a slow moving tractor, inching towards me. Its force would be enough crush twin towers twice over.
So anyways, I dodge it.
>>
File: hah.jpg (612 KB, 1162x1200)
GAHAHAHAA, nevermind, look at the v0 documentation for mullein, These d*scord morons trained the model on fucking communist and trans rights datasets!!!!
>estrogen/woke-identity
>>
File: donot.jpg (50 KB, 500x283)
>>107754096
>>
wake me up when cydonia isn't the meta anymore, because honestly i'm tired of waiting
>>
I would like to apologise to drummer. I looked in the abyss of other finetunes, and commie discord users looked back at me. Magidonia forever I suppose.
>>
What do you guys use, oobabooga/text-generation-webui, koboldcpp, or something else?
>>
>>107754268
kobo
>>
>>107754268
Not even a mention of llama.cpp?
llama.cpp, btw.
>>
>>107754268
koboldcpp/sillytavern just werks
>>
>>107753690
Will a boomer bureaucrat do that or would they have an intern check the webUI and report back that it's nothing to worry about
>>
>>107754268
ik_llama
>>
>>107754287
Depends if they want to fuck mistral or not.
>>
File: 1764445590144924.png (123 KB, 475x475)
>>107754268
Oobabooga
>>
>>107754268
Ooba with API enabled for mikupad
>>
>>107754096
learning to pretend troons are women helps it learn to pretend (You) are chad, it's good for rp
>>
>unga bunga in 2026
really?
>>
>>107754503
>>107754570
this but unironically. it is actually good software. being able to hotswap models is a great feature.
>>
Does anyone have a confirmed non-scammy alibaba seller for MI50s? I bought one on ebay when the price was still down, and I'm thinking about buying a spare. But the prices look fishy...
>>
the amount of times glm used "unwashed" is enough to make an uninformed man think it was trained by indians
>>
>>107754583
>being able to hotswap models is a great feature.
pretty sure both kobo and llama have ways to do that now
>>
>>107754619
do they? i thought you had to shut it down and relaunch in order to swap models with those.
>>
File: 666.png (152 KB, 1590x995)
>>107754619
https://github.com/ggml-org/llama.cpp/tree/master/tools/server
yes.
>>
>>107754195
the meta is kissing your local anon
>>
>>107754643
Just recently yea. ooba has a lot of backends tho, if you want GPTQ and the like: llama.cpp and exllama. Plus good tokenization features in the interface.
>>
File: friendship.png (875 KB, 1179x660)
What was the most kino model release of 2025 in your opinion?
>>
>>107754751
deepseek r1 for dooming us all
>>
File: 1738395242549.png (489 KB, 2191x2325)
>>107754751
>>
>>107754268
For single user llama.cpp / ik_llama.cpp
>>
>>107754781
+ SillyTavern ofc
>>
File: 1746876927751600.png (547 KB, 2016x1952)
I had an LLM fumble something related to keyboard keys in an RP. So I tried to ask a bunch of models a really basic question related to that.
Turns out the proprietary SOTA models are too retarded to understand how keyboards work while Kimi K2 gets it right.
>>
>>107753443
>>107753448
>>107753526
>>107753622
>>107753690
>>107753747
>>107753801
LMAO REKT
>>
File: 1739172967301884.png (854 KB, 801x1039)
>>107754751
I think R1 as well. I really didn't like the model for RP but it did set things in motion for our current local SOTA.
Also the burgers panicking over it was funny.
>>
I'm guessing an RTX 2070 is not sufficient to host a chatbot locally.
>>
>>107755192
lol the Gemini 3 Pro answer is the funniest
it really is a dumbfuck
>>
>>107755218
probably a proxykek got banned
>>
File: file.png (150 KB, 890x678)
>>107755192
I really thought this would be one of the things where dense would have an advantage, but 405B doesn't get it either.
>>
>>107755192
this is sad
>>
>>107755240
Not a very smart one, but you definitely can.
>>
>>107755280
it's just such a nonsense question. you're just tricking the AI by suggesting that the broken keyboard is causing your issues with using ctrl-alt-del.
>>
File: 1764631270568757.gif (1.99 MB, 340x223)
>>107755192
You know, it will never cease to amaze me how they can't program one of these things to just be like "what the fuck are you talking about" and ASK FOR CLARIFICATION.

Every single fucking time they just take a crack shot at it and end up outputting some retarded garbage when all they had to do was ask a simple fucking question from the start to clear up the confusion.
>>
>>107755280
>>107755329
Do you get the same answers if you rephrase it so it's more like "I've heard it's ctrl-alt-del but I can't do that because those keys are broken"? Now it sounds a bit like you already tried but it was impossible due to the missing keys. Not that the models aren't stupid here, but I'm curious if it makes a difference.
>>
When will the blackwell 6000 be superseded? I could maybe buy one in the summer but at that point it'll be a year and a half old; if the next gen has more vram I'd wait
>>
>>107755354
>if the next gen has more vram
lol
>>
>>107755192
Interesting and funny. This does sound like it partly has to do with the way the models are overly RLHF'd to be people pleasing, 1-turn assistants. Perhaps when combined with >>107755353, it can be disentangled, so we'd have two things we could learn about models from this test (though not with statistical power). One is the degree of RLHF overcooking, and the other is the general intelligence.
>>
>>107755396
it will be benched moving forward of course
>>
>>107755339
contrary to the massive amount of retards who think llms are conscious or intelligent or possible simulations of qualia which we've seen in previous threads recently
LLMs are in fact nothing but a next token predictor, they have no actual world model and no capability for true introspection
they do not have the ability to "understand" that they're ignorant about a topic and need to educate themselves further before emitting AN OPINION
>>
What exactly is a "world model"?
>>
>>107755439
ask lecunt
>>
>>107755425
Hello schizo, you don't need any metaphysics or philosophy of the mind to program the damn thing to just ask some fucking questions, no "qualia" or whatever the fuck you're talking about necessary. No bullshit about consciousness or introspection or anything else.
>>
>>107755439
A fancy name for video models
>>
>>107755446
still doesn't change the fact that they're a dumb thing people are putting too many expectations on, and trying to band-aid every special case with RL is not going to make it better
>>
>>107755425
Ok, let's turn this around. Forget using it to gauge how intelligent a model is. As a user, I want my assistant to correct me when it knows things that I don't. How would you go about prompting for it to scrutinize the user's input before responding? Thinking models would probably be better for this.
>>
>>107755459
So what does that have to do with text generation?
>>
>>107755468
We need to plug an LLM into a video model to achieve AGI
>>
>>107755439
Basically it's a model that can be trained on any kind of data, not just text, potentially. For now when they say world model they mostly mean it can train on video.
>>
>>107755423
Yes, of course. That's why we have to continuously come up with more tests.
>>
I gave solar another shake after deleting it because I have fond memories of the 10.7b from ye olden age, but the 100b is very stupid. If you list "preferred style of clothes" in a character profile, it will just assume those articles of clothes are what the character wears 24/7 even when it makes no sense. Even rewriting it so they're not wearing overalls to sleep like a psychopath, the overalls will appear in the room and they'll put them on even if it's early in the morning and the story involves them going to eat breakfast in their pajamas. These issues are more prominent with their official template, testing chatml as a fallback at least somewhat delays it but it happens eventually anyways. Even mistral small models handle this better, although it's retarded in other subtle ways.
>>
>>107755279
You're absolutely correct!
Btw, foreskin-chan is doing me a favor and having my messages auto-burn. "anonymous image board" Just gib clearnet IP.
>>
There used to be a ban evasion report option, where has that gone?
>>
>>107755468
Eventually world models will be able to simulate text generators within themselves and they'll have an inherent understanding of the world that LLMs fundamentally lack.
>>
>>107755668
They ban you for nothing. I was almost gonna buy 4chan pass till I found out. Even if you don't troll. Congratulations jannies.. you make me wanna pay ecker instead.
>>
>>107755668
Don't know but it has been gone for a while.
>>
>>107755685
>They ban you for nothing.
In almost two decades of using this site I was never unexpectedly banned. Sounds like a (You) problem.
>>
LLM Ice Age. There hasn't been a good model in so damn long. I'm tired of using GLM Air, please... SOS.
>>
>>107755742
i got banned messing with simps in /vt/. didn't say anything that bad. and board ban = site ban. it was some shit I wouldn't have been banned off reddit for. you must not be saying anything at all.
>>
>>107755685
i have only been banned once. i said a word that was on the automatic ban list and got a 3 day ban with no explanation.
>>
>>107755742
Same.
The few times I was banned, it was pretty obvious why.
Been posting since 2010, lurking since 2008.
>>
>>107755758
>/vt/
I post a fair bit but not on trash boards.
>>
>>107755756
you can run GLM 4.7 on 128GB ram and 24GB VRAM.
it's slow af.. however, it does have better output
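the usual llama.cpp trick is to keep everything except the experts on the gpu, something roughly like this (filename and regex are placeholders, match them to your quant's actual tensor names):

llama-server -m glm-4.7-q2_k.gguf -ngl 99 -c 16384 -ot "exps=CPU"

-ot / --override-tensor sends whatever matches the pattern to the CPU buffer, so attention and shared weights stay on the card while the fat expert tensors sit in system ram.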
>>
>>107755446
>just ask some fucking questions
It doesn't know that it doesn't know.
>>
>>107755742
I get randomly banned all the time because of dynamic IPs. Apparently I'm sharing my ISP with some schizo and occasionally get a banned warning for some unhinged post someone made on a board I never visit. Usually restarting my router fixes it but still.
>>
>>107755742
Mods are more hesitant to ban pass users. I know cause I used to have one and I shitposted on /v/ hard enough to get mods to ban posters in threads one by one but I always came out unscathed. Haven't used or renewed it since the leak.
>>
>>107755780
Buddy I don't think he's got 128GB.
>>
>>107755769
i actually lurked since before the site existed, back when people openly posted CP on /b/.
They block all non rezzy-IP though and I'd rather not.
>>
>>107755791
yeah these fucking ram prices don't help either, i got lucky buying before the price hike
>>
>>107755790
Maybe but my pass is only a few months old.
>>
>>107755742
>Sounds like a (You) problem.
It's exactly that. He has been catching bans for trolling in /ldg/.
>>
>>107755852
I wish, that wasn't me. I bitched about comfy in LDG once. The guy who replied to me got deleted.
>>
>>107750674
>settles on biology on the sub-organism-level as the appropriate level to describe things at
I don't know if that's awesome or what.
>>
>>107755852
eat your medication
>>
>>107755425
you are nothing but a word creator. all you create are words.
There is nothing intelligent about you, you're not introspecting, you're just creating words.
you don't have any capacity to understand that you are ignorant about a topic, all you are doing is creating words.
>educate themselves, lol, so you think LLMs are people that simply need education do you?
>>
>>107755949
people worry about this whole thing too much. just enjoy LLMs. have sex.
>>
>>107755972
yeah sure i worry about it. i know that these systems we are creating are synthetic.
however eventually, we will rely on AI so much we will forget skills that we need.
you can see this in schools and universities already, people are not attempting to learn they are simply asking chatGPT.
And they will continue to do so wherever they go because they will need to, because they didn't learn.
what we decide to do about this is quite important.
>>
>>107756006
nah it's fine
>>
>>107756006
I dunno man, I don't trust it for serious work because LLMs are wrong so much. Probably why I'd rather talk to it. Like with randos, it doesn't matter if it's actually correct. If I truly need to know I'll look it up. Normies can't even keep from losing their minds over TV.
>>
>>107756006
>people are not attempting to learn they are simply asking chatGPT
The future will own nothing, not even their own capacity for knowledge or intelligence.
>>
>>107756006
Schizo detected.
>>
>>107756063
scary isn't it?
>>
File: Summer-eternal-llm.jpg (487 KB, 1080x1104)
>>107755949
The calculator is alive
>>
>>107756159
>ability to provoke emotion
Is an orgasm an emotion?
>>
>>107755425
Holy retarded n*gge*
>>
>>107756006
People simply do not read anything that occupies more than five minutes of their attention span these days. That lack of attention/comprehension and literacy already feeds into going "grok, please tell me how to breathe manually" or whatever retarded garbage is readily available to them
>>
>>107755425
that's a whole lotta words but it's really quite simple
>>
>>107756159
>pic rel
They don't need to. They just need to convince you that they can.
>>
>>107756159
Yi gave me one of the strongest emotions of my life. Shit went dark really fast, and it was all my fault. I deleted the chat log and couldn't sleep that night, thinking I'm a bad person
>>
>>107756227
i had similar brother. and i deleted the entire installation, lol.
>>
>>107756211
where's the inverse of this bell curve where someone calls all parties retarded, because that's the unrepresented aspect
>>
>>107756227
Learned something about yourself and can use that to do better in future? Hope so anon
>>
>>107756211
The interesting thing is that any intelligence they do have is a reflection of intelligence behind the text in the training data. It's crazy that you can just recycle past acts of intelligence the way you can with llms in order to apply transformations to future text.
I don't get why people need to over sensationalize these things to find them interesting.
>It's alive
>No it's not
>*2 way goalpost moving war over what it means to be alive or aware*
Lame.
Vs.
>Every piece of knowledge, every abstract idea, etc can be boiled down to linear algebra/matmul
>Mathematics is literally the language of God.
The demystified version is way more interesting.
>>
>>107756267
>Mathematics is literally the language of God.
Yea, holy shit it kinda is. Everything runs on math. Your entire conscious experience. Even if it's not implemented yet.
>>
>>107756285
Isn't that physics or quantum mechanics or is that math
>>
>>107756227
>yi
>llama2 derivative released months later and still was outright ass
>doing anything to your mental state
either actual weakling or just straight up shitpost
>>
>>107756300
I'm talking about the fact that you can take something stupidly abstract. Like the cadence of a seinfeld skit, and you can boil that down into a mathematical transformation. "Make a seinfeld skit where Kramer introduces the gang to Donald Trump" shit like that. Like maybe you need a certain level of combined IQ and autism to see just how fucking wild that is.
>>
>>107756211
you got the curve reversed.
retards think it's soulless.
le midwits think it is le alive.
and the mystic knows it's soulless.
>>
/aicg/ might be the worst general I've ever seen outside of /vt/'s generals
>>
>>107756330
It was comfy until the key proxies came and drew reddit in.
>>
>>107756312
And our brains probably do that too in some other bio-mechanical way. All associations between concepts managed by some formula that got "trained".
>>
>>107756310
It has an amazing ICL for its time. If you give it examples, it'll get the idea and stick to it. Ideally, you throw heavily edited RP logs into context and it follows nicely
>>
>>107756341
I hate calling what a biological brain does calculation/computation though. And yet a neural network would be the closest computational equivalent. But our brain is many orders of magnitude more energy efficient about it. What it does can obviously be described with similar mathematics though. We're just farming probability gradients but with fat cells and hormones and yet maybe something more? But there's apparently nothing we can do that you can't just do with a neural network instead, albeit not as well.
>>
Human intelligence is a mix of neuron activations and something else... something uniquely human.
>>
>>107756402
>>107756341
>>107756312
>all there is to the human mind is le brain which is le computer
go back to r3ddit seriously.
>>
>>107756393
I'm not sure why you're attempting to defend a dead release from an outdated era by a company that has no interest anymore
>>
>>107756461
Don't be ashamed. It takes a somewhat high IQ to fully understand the concept.
>>
>>107756461
I literally said the exact opposite of that, you low iq shitskin
>>
>>107756468
Because I figured out how to use it and had an amazing time with it
>>
>>107756475
parts of the human mind are non-computable, not everything can be run by a computer you know.
i know how seductive the idea can be.
>>107756475
it isn't, physicalism makes absurd assumptions that are akin to magic as well.
>>
>>107756475
>iq mentioned
why are midwits so obsessed with it.
>>
>>107756519
Since you made literal garbage work, please enlighten me on what model you are currently using that is relevant so I might somehow glimpse upon your wisdom
>>
>>107756534
We don't know.. nor what we don't know. Just as tempting to copium the other way. It makes you think you don't just simply die.
>>
File: IMG_5506.gif (128 KB, 680x510)
>>107756259
>>107756320
it's pattern recognition systems all the way up or down
>>
>>107756558
>It makes you think you don't just simply die.
if you assume physicalism subjective death is an impossibility.
it's not even about muh not wanting to die.
heck eternity is a lot scarier than ceasing to exist.

anyway, there are good arguments for what i said to be true.
and not that i want to do an appeal to authority but many important physicists dead and alive think the same (that the human mind has non computable aspects).
>>
>>107756542
Midwits are ashamed of their IQ and will always respond with one of: "it doesn't matter", "it doesn't provide the whole picture", "there are other types of intelligence", etc.
>>
>>107756574
>pattern recognition
LLMs do not exhibit even the slightest form of intelligence.
there is more to intelligence than just "pattern recognition", it's just one aspect of it.

besides even if you went for the "muh computation framework" good luck solving the hard problem (you can't because you made a false assumption).
>>
>>107756585
i literally said that those that care the most about it are in fact midwits.

i was tested above 3 sigma and i still think it's a retarded metric, it can give you a rough idea but that's about it, you really should take it with a grain of salt.
>>
>>107756574
>pic
ass
>>
>>107756608
>i was tested above 3 sigma
proofs?
>>
File: file.png (598 KB, 1595x1069)
>>
>>107756553
Ironically, I can't make use of Air and have to run 4.6 at 3t/s like everyone else
>>
>>107756534
>parts of the human mind are non-computable
even if that's what you believe, it won't stop equivalents being made.
even the most educated neuroscientists admit that we don't fully know how neurons or the brain works.
neural networks are the closest we've come to creating something similar to it.
If we can create something that pretty much mimics or performs exactly what the brain outputs, then people will use it.
And you can't assume it won't ever be made, it's been 40 years since the 1980s, imagine the next 40, 100 or 1000 years.
>>
>>107756654
Me at one of the ends
>>
>>107756741
>neural networks are the closest we've come to creating something similar to it.
whilst i agree, we are still so fucking far away it's kinda laughable, if you've studied the brain at all you'd know the parallels are very small.
>If we can create something that pretty much mimics or performs exactly what the brain outputs, then people will use it.
true, but we are at least decades away from that if it's even possible at all.
>And you can't assume it won't ever be made
i don't know for sure but i'm very dubious that it could be achieved on silicon alone, or at least not in a purely digital way, i don't see any issue with doing it artificially but that doesn't necessarily mean silicon / digital circuits.

i'm sure we'll get there eventually, i'm just extremely skeptical of the idea that you can get there with just digital circuits / silicon.
>>
>>107756654
Me at the other end
>>
>>107756771
>we are at least decades away from that
We literally went from zero to star trek computer in the span of a few years.
>>
llms without interleaved thinking - 50IQ
llms with interleaved thinking - 100IQ
>>
you get a system that can run the llm of your choice and you get to go back 25 years in time, how would you use your llm to your advantage?
>>
>>107756806
>We literally went from zero to star trek computer in the span of a few years.
lol, and yet we are not any closer, these simulacra do not have any shred of intelligence.

i'd argue we are in fact further away from agi than we were a few years ago as we are wasting resources going in the wrong direction, transformers are architecturally incapable of ever leading to agi.
>>
I believe agi will be a revolution, not an evolution. Some bright paper, not braindead scaling
>>
>>107756815
Masturbate furiously for the next 25 years.
>>
>>107756815
making shit tons of money on investments and then roleplaying from a penthouse suite for 25 years
>>
>>107756832
and i think that you are right.
>>
>>107756815
I would rather pick imggen model and squeeze thousands from furries to buy bitcoins
>>
>>107756822
They are more intelligent than the average human.
>>
I'm running glm air at 23k ctx and after around 70 responses each response takes a long time; the t/s is still around eleven but it takes a long time before it starts. Is it reloading the context? Is there a setting I'm supposed to use when loading the model to avoid this?
>>
>>107756855
You have something messing with the beginning of the context. Probably context shift to make space for new text.
>>
>>107756853
>They are more intelligent than the average human.
lmao, even a fucking cat is smarter.
again, these models have 0 intelligence.
them being able to spit out information is not a metric of intelligence, by that metric you'd say a book is smarter than the average human.

as much as the average human is retarded, a llm does not even compare, no learning ability, no long term memory, no real time processing, missing dozens of modalities, incapable of counting letters in a word or the next sentence due to the sequential architecture.

a human can change his whole world model with a single piece of information, and also learn autonomously.
llms "believe" what was the most repeated in their training set, not what is the most consistent, they also are unable to learn anything without datasets carefully curated by humans, you can't just drop them in a new environment and have them learn and improve autonomously.

i mean, i was just scratching the surface, but they are so limited you'd have to be retarded to even think a comparison can be made.
a chess engine beating us at chess does not mean it is more intelligent, let alone generally.
>>
>>107755229
I still maintain running contraband chinese AI on my 128 core supercomputer in my bedroom is the most cyberpunk as fuck moment of my life.
>>
>>107756881
how many t/s lmao?
>>
File: question mark.png (100 KB, 846x442)
m2.1 made me laugh
>>
>you are bhagawat benchod son. you mom have illness where she must show vagene
>slop pony image of woman in bikini
literally every sillytavern character card available for download. LITERALLY
>>
>>107757159
Lies, about half of them are just complete wiki copy pastes (sometimes with the top fandom bar included in the text)
>>
>>107756881
I like the creatively janky rigs some people put together to run theirs. Really epitomizes the high-tech low-life aspect.
>>
File: stupid nigger cunt.jpg (21 KB, 697x231)
faggots
>>
>>107756475
>godel's incompleteness theorem
I understand you're a pseud who doesn't even know what a computer is.
>>
>>107757263
i think they probably were working on 4.6-air.
However, it likely wasn't as good as 4.5 air, therefore they didn't release it.
we don't know what happened, but pressuring them to release something doesn't help with quality control, which likely contributed to why it failed imo.
>>
>>107756330
The AI image generals are pretty bad too and probably have the same schizos ruining them
>>
File: 1740845591721598.jpg (72 KB, 969x1024)
>>107757165
>sometimes with the top fandom bar included in the text
made me laugh
>>
>>107757351
/ldg/ is aight, they actually try new models instead of staying chained to sdxl
>>
>>107755396
>>107755353
>>107755192
Trying to develop a better version for this prompt was interesting, on 27B. It can, sometimes, answer that you can still press those keys because they're actually separate keys from the letters. However, I noticed that this only happens when you ask "is it possible to open task manager with blah blah blah when blah blah" and the model starts its answer with "yes". The "yes" seems to prime it to answer correctly. When you ask it instead with "how", then, because Gemma is trained to be agreeable, it almost always makes a comment like "You're absolutely right to notice that blah blah blah", which seems to hard-set and prime the model to fail the question. However, perhaps we can do something with prefill, right? Well, kind of. When you prefill with "1." (basically you are making it go directly to listing solutions), Gemma will actually list CTRL + SHIFT + ESC, however it will then fail, as this is what it generates:
CTRL + SHIFT + ESC: You are absolutely right to point out the "C", "L", and "R" are missing! This method is unusable.

However, when you prefill with this, it succeeds and mentions that they're separate keys.
1.  **CTRL + SHIFT + ESC:** Actually,

So it would seem that certain keywords are driving the model's vectors towards believing the user's words. Specifically, "you are absolutely right", or I suppose other similar expressions of agreement.
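If you want to reproduce this, any raw text-completion endpoint works; against a llama.cpp server the prefill is literally just the tail of the prompt string, roughly like this (the gemma turn tags and everything before the prefill are up to you, the prompt here is truncated to just the prefill for brevity):

curl http://localhost:8080/completion -d '{"prompt": "<start_of_turn>model\n1.  **CTRL + SHIFT + ESC:** Actually,", "n_predict": 128}'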

1/2
>>
>>107757487
So here's the rub. The model really really wants to generate "you are absolutely right". If you ban the agreement tokens, then you are inherently driving the model towards disagreement and thus priming it to answer the question correctly. So at least for now, it would seem impossible or just difficult to develop a prompt that truly disentangles RLHF from model intelligence in the context of natural discussion/conversation. Said another way, RLHF inherently decreases a model's intelligence (in certain contexts), which is something we already know, so this is just supporting evidence.

The best way to disentangle the behavior is to prompt in a way that is not a natural conversation. For instance, pic related. But even then, the model is still affected by RLHF in that it is still prone to being gaslit, in general. Therefore, we also want to limit how misleading the prompt is. The model answers correctly when the second sentence in the test question isn't there. Since this is just a 27B, we might want to develop a more challenging question.

2/2
>>
>>107756832
only because the "agi" will be a failure and people will end up moving goalposts to redefine agi.
>>
>>107757487
>>107757494
i mean this is basically just a variation of the strawberry problem right?
if so then it's not surprising, it's well known that with tokenization there are often problems with individual characters.
Using a token visualizer like tokviz might help you understand this further.
>>
i used glm4.7 to find an algorithm to convert a hexadecimal digit into a number, assuming the input digit is already a valid hex digit.
return (Digit & 15) + ((Digit >> 6) * 9);

this is significantly better (compiles smaller and faster) than what shatgpt or deepsneed gave me, which was surprising. it's the only LLM that gave an answer better than what I came up with.
Looks like the chinks benchmaxxxxxing on coding actually yields results
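for anyone squinting at why that works, here's a quick check walking the three ascii ranges through it (just anon's expression wrapped so you can actually run it):

#include <assert.h>

static int hex_val(unsigned char d) { return (d & 15) + ((d >> 6) * 9); }

int main(void) {
    /* '0'..'9' = 0x30..0x39: &15 gives 0..9, >>6 gives 0, result 0..9
       'A'..'F' = 0x41..0x46: &15 gives 1..6, >>6 gives 1, result 1..6 + 9 = 10..15
       'a'..'f' = 0x61..0x66: &15 gives 1..6, >>6 gives 1, result 10..15 */
    assert(hex_val('0') == 0 && hex_val('9') == 9);
    assert(hex_val('A') == 10 && hex_val('F') == 15);
    assert(hex_val('a') == 10 && hex_val('f') == 15);
    return 0;
}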
>>
>>107757494
>For instance, pic related. But even then, the model is still affected by RLHF in that it is still prone to being gaslit, in general. Therefore, we also want to limit how misleading the prompt is. The model answers correctly when the second sentence in the test question isn't there. Since this is just a 27B, we might want to develop a more challenging question.
you talk in the way that thinking models think
>Since ___, we might want to ___. But wait,
>>
>>107757584
That's what I immediately was reminded of when I saw the post, but when I thought about it, no, not really, this problem doesn't actually become impossible or a coin flip by tokenization, although it would likely help if the model saw words like we do, or was beneficially/natively multimodal and saw images of actual keyboards. The problem is more about whether the model understands how keyboards work and that pressing modifier keys does not involve pressing the regular letter keys. And in fact, the model does know that, as I discussed, it just gets thrown off by trusting the user's claims instead of what it learned.

>>107757643
Very funny anon. I never even said "But wait". Anyway, this is how I usually type in more technical contexts.
>>
>>107756159
I am using my model to work on my sexual hangups. Is it real? No. But it is real enough to work for that. It is crazy to me that at least for that specific application LLMs are already superior to humans. Good luck getting your the rapist to erp with you so you can unfuck your brain.
>>
>>107757659
you can thank women and their insatiable lust for this type of smut for the model being good
>>
>>107756006
>shit goes down, need to make campfire
>hey gpt how do I make fire
>sorry I can't assist with arson
>>
>>107755949
Assuming what you said is true, that doesn't make LLMs conscious, it just means that we're about as dumb as LLMs.
>>
>>107757510
the goalpost has never moved.
>>
>>107757720
>>107757744
no, you're dumber than an LLM, you can't read.
clearly wasn't talking about how to create a campfire, or saying that AI was conscious.
>>
>>107757763
no mammal is dumber than an LLM, they have no intelligence, NONE.
>>
File: 1743365418851866.webm (2.85 MB, 640x640)
>>107757778
>no mammal is dumber than an LLM
>>
>>107757763
>clearly wasn't talking about how to create a campfire
It was a joke about retarded kids asking chatgpt (and failing to prompt for a campfire at that), rather than an argument.
>or saying that AI was conscious.
It didn't say that directly but to be fair it's one part of "conscious or intelligent", and it can be implied we're as unintelligent as an LLM, simply creating words. I simply asserted that equating unintelligence doesn't suddenly flip LLMs into consciousness just because it's commonly considered that we're conscious, in case someone gets the idea.
>>
File: WomenWinVote.png (1.04 MB, 1600x613)
>>107757778
>no mammal is dumber than an LLM
>>
>another thread spammed to death by literal retards who think LLMs have soul/conscious
>still no gemmy 4
>air is lacking
saars.
>>
I wonder, is there a language in which LLMs are less prone to the NotXButY spam, or is it so deeply ingrained in the model that they will do it no matter what? As a French speaker I tried talking to them in French a little and unfortunately noticed they have the same exact tics as in English.
>>
>>107757996
nope, everyone distills from 'not x but y' models so we're fucked
>>
>>107757997
Ironically it gets worse the larger a model is. I briefly skimmed /aicg/ and the level of slop in logs from giant claude and gemini models was unbelievable.
>>
>>107758088
the bigger/newer the model, the more slop is in its dataset
claude 1 and 2 were pure human data, so were extremely high quality
everything onwards was incest
>>
>>107757996
Anything pre-GPT-4. That's when the slop began. Earlier models were ultra retarded but they would try to mimic your writing style more.
>>
>>107758111
>>107758111
>>107758111


