/g/ - /lmg/ - Local Models General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

[Post a Reply]

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous
/lmg/ - Local Models General 04/01/26(Wed)00:04:00 No.108497919

File: __hatsune_miku_and_kasane(...).jpg (1.75 MB, 2000x1441)

1.75 MB JPG

/lmg/ - Local Models General Anonymous 04/01/26(Wed)00:04:00 No.108497919

/lmg/ - a general dedicated to the discussion and development of local language models.

Teto's Birthday Edition

Previous threads: >>108493794 & >>108488188

►News
>(04/01) DeepSeek V4 released: https://hf.co/deepseek-ai/DeepSeek-V4
>(03/31) 1-bit Bonsai models quantized from Qwen 3: https://prismml.com/news/bonsai-8b
>(03/31) Claude Code's source leaked via npm registry map file: https://github.com/instructkr/claude-code
>(03/26) CohereLabs releases Transcribe 2B ASR: https://hf.co/CohereLabs/cohere-transcribe-03-2026
>(03/26) Voxtral 4B TTS released without voice cloning: https://mistral.ai/news/voxtral-tts

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
04/01/26(Wed)00:04:25 No.108497922

Anonymous 04/01/26(Wed)00:04:25 No.108497922

File: f4a2861e88d90d2383008816b(...).png (159 KB, 646x646)

159 KB PNG

►Recent Highlights from the Previous Thread: >>108493794

--Papers:
>108494041
--1-bit Bonsai LLM performance and quantization details:
>108495464 >108495479 >108495486 >108495493 >108495524 >108495554 >108495565 >108495590 >108495494 >108495506 >108495924 >108495964 >108495965 >108495970 >108495972 >108495987 >108496003 >108495986 >108496022 >108496065 >108496084
--LFM2.5-350M benchmarks and performance analysis:
>108494883 >108494899 >108494933 >108494954 >108495061 >108495069 >108495072 >108495732 >108495760
--Optimizing 3090 power efficiency for LLM workloads:
>108494231 >108494235 >108494250 >108494331 >108494281 >108494292 >108494298 >108494319 >108496230 >108496252 >108496488 >108496516 >108496657
--Gemma 4 shows reduced censorship in vulgar responses:
>108493880 >108494123
--Qwen3-VL models dominate garment classification leaderboard:
>108495315
--DDR5 RAM price drop speculation after Google TurboQuant release:
>108494163 >108494178 >108494191 >108494187 >108494345
--AMD GPU TTS integration struggles in SillyTavern:
>108495775 >108495807 >108495816 >108495846 >108496141 >108496166 >108496195 >108495848
--Claude source code leak and reactions to its size and quality:
>108493890 >108493989 >108494024 >108494174 >108494119 >108494437 >108494450 >108494459 >108494474 >108494481 >108494484 >108494547 >108494671 >108494723 >108494811 >108495702 >108496000 >108494483 >108494494 >108494508 >108494676 >108494721
--Abliterated vs fine-tuned model responses in sensitive scenarios:
>108494325 >108495229 >108495258 >108495273 >108495310 >108495314 >108495350 >108495432 >108495587
--Teto, Neru, and Miku (free space):
>108493894 >108494009 >108494085 >108494114 >108494210 >108494479 >108496784 >108497187 >108497791

►Recent Highlight Posts from the Previous Thread: >>108493798

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
04/01/26(Wed)00:06:41 No.108497936

Anonymous 04/01/26(Wed)00:06:41 No.108497936

>>108497919
On the bright side at least she won't eat the entire cake and get fat again.

Anonymous
04/01/26(Wed)00:06:59 No.108497940

Anonymous 04/01/26(Wed)00:06:59 No.108497940

>>108497919
>(04/01) DeepSeek V4 released: https://hf.co/deepseek-ai/DeepSeek-V4
Holy shit it's real lol

Anonymous
04/01/26(Wed)00:07:26 No.108497944

Anonymous 04/01/26(Wed)00:07:26 No.108497944

potentially stupid question: how are you supposed to download models for offline use? not just local, but specifically offline
i did
hf download
, and it worked to download the code, but it's all stored as blobs with gibberish names in a .cache folder, which doesn't really feel like the best way to store things for offline use. was i supposed to do a git clone instead? or do i just cope with the blobs in my cache file?

Anonymous
04/01/26(Wed)00:07:57 No.108497945

Anonymous 04/01/26(Wed)00:07:57 No.108497945

Teto.

Anonymous
04/01/26(Wed)00:08:08 No.108497947

Anonymous 04/01/26(Wed)00:08:08 No.108497947

>>108497944
--local-dir

Anonymous
04/01/26(Wed)00:08:14 No.108497949

Anonymous 04/01/26(Wed)00:08:14 No.108497949

You were always just a kidder, Steve...

Anonymous
04/01/26(Wed)00:08:16 No.108497950

Anonymous 04/01/26(Wed)00:08:16 No.108497950

and you know i dont mean to hurt you
but you know that it means so much
and you don't even feel a thing

Anonymous
04/01/26(Wed)00:09:58 No.108497962

Anonymous 04/01/26(Wed)00:09:58 No.108497962

>>108497940
i am like falling for this millionth times

Anonymous
04/01/26(Wed)00:11:09 No.108497969

Anonymous 04/01/26(Wed)00:11:09 No.108497969

>>108497944
The cache directory for the model has a snapshots/<hash> folder with symlinks to the blobs. You can pass that hash folder to whatever tools to make ggufs which you put on your fast storage or whatever.
You're making your own ggufs right?

Anonymous
04/01/26(Wed)00:11:19 No.108497970

Anonymous 04/01/26(Wed)00:11:19 No.108497970

>>108497947
i don't suppose i can salvage the blobs i already downloaded? the gibberish names certainly aren't helping... i assume they're like an md5 or something

Anonymous
04/01/26(Wed)00:11:56 No.108497971

Anonymous 04/01/26(Wed)00:11:56 No.108497971

>>108497944
What I do is

git clone https://hf.co/whatever/model
git -C model lfs install --local
git -C model lfs fetch
If you git lfs pull it's going to make two copies of the lfs blobs (the lfs object and the checked out copy). Fetch only gets the blobs, no checkout.
The lfs files are going to be just links, so I made a script that "exports" the models to a separate directory by linking all the stuff together, then I convert to gguf from that directory.
--local is just so lfs doesn't get installed globally for all future cloned repos.

Anonymous
04/01/26(Wed)00:12:09 No.108497974

Anonymous 04/01/26(Wed)00:12:09 No.108497974

File: 1774788479196554.png (406 KB, 576x491)

406 KB PNG

/lmg/ on suicide watch

Anonymous
04/01/26(Wed)00:12:26 No.108497975

Anonymous 04/01/26(Wed)00:12:26 No.108497975

Since when did captchas ask you to identify an anime character?

Anonymous
04/01/26(Wed)00:12:58 No.108497978

Anonymous 04/01/26(Wed)00:12:58 No.108497978

>>108497971
I really fucked up those tags, didn't I...

Anonymous
04/01/26(Wed)00:13:06 No.108497979

Anonymous 04/01/26(Wed)00:13:06 No.108497979

>>108497947
by the end of the download you obviously should have the actual files locally and not the blobs

Anonymous
04/01/26(Wed)00:13:14 No.108497983

Anonymous 04/01/26(Wed)00:13:14 No.108497983

>>108497975
captchas?

Anonymous
04/01/26(Wed)00:15:00 No.108497993

Anonymous 04/01/26(Wed)00:15:00 No.108497993

>>108497971
mind sharing the script? sounds useful
>>108497969
i plan to make my own ggufs, yes. i haven't yet, though
>>108497979
am i better off just wiping and redownloading? the only reason i ask is cause across all the models i downloaded, it was like 1500 gigs...

Anonymous
04/01/26(Wed)00:17:54 No.108498009

Anonymous 04/01/26(Wed)00:17:54 No.108498009

File: file.png (161 KB, 618x483)

161 KB PNG

>>108497975

Anonymous
04/01/26(Wed)00:18:45 No.108498013

Anonymous 04/01/26(Wed)00:18:45 No.108498013

>>108497974
>accounts
>3.8M
sounds like a bullshit metric to make boomers think they did something.

Anonymous
04/01/26(Wed)00:19:05 No.108498015

Anonymous 04/01/26(Wed)00:19:05 No.108498015

File: 1755361669989980.png (1.03 MB, 1206x2161)

1.03 MB PNG

Backup your models

Anonymous
04/01/26(Wed)00:19:56 No.108498017

Anonymous 04/01/26(Wed)00:19:56 No.108498017

>>108497974
yeah what about the files though

Anonymous
04/01/26(Wed)00:21:03 No.108498025

Anonymous 04/01/26(Wed)00:21:03 No.108498025

>>108498015
oh no how will we download models without the iranian mirror for the data thats replicated across hundreds of datacenters around the world

Anonymous
04/01/26(Wed)00:21:39 No.108498027

Anonymous 04/01/26(Wed)00:21:39 No.108498027

>>108498025
>iranian mirror

Anonymous
04/01/26(Wed)00:25:01 No.108498049

Anonymous 04/01/26(Wed)00:25:01 No.108498049

>>108498027
they're not hitting anything in the USA they're attacking american companies in their desert shithole and surrounding shitholes like israel

Anonymous
04/01/26(Wed)00:26:38 No.108498053

Anonymous 04/01/26(Wed)00:26:38 No.108498053

File: git_export.png (6 KB, 752x456)

6 KB PNG

>>108497993
>mind sharing the script? sounds useful

#export.sh {repo} {export_dir}
repo="$1"
output="$2"
repo=$(realpath $repo)
output=$(realpath ${output})/$(basename ${repo})
mkdir ${output}

git -C ${repo}/ ls-files | while IFS= read ;do
f=$REPLY
mkdir -p "${output}/$(dirname $f)"
ln -s "${repo}/${f}" "${output}/${f}"
done

git -C ${repo}/ lfs ls-files -l | while IFS= read ;do
h=$(echo $REPLY | cut -f 1 -d " " )
f=$(echo $REPLY | cut -f 3 -d " " )
a=$(echo $h | cut -b 1,2 )
b=$(echo $h | cut -b 3,4 )
echo "$a/$b/$h -> $f"

mkdir -p "${output}/$(dirname $f)"
[ -h "${output}/${f}" ] && rm "${output}/${f}"
ln -s "${repo}/.git/lfs/objects/${a}/${b}/${h}" "${output}/${f}"
done

I use another one in C, but I shared this one a while ago.

Anonymous
04/01/26(Wed)00:28:17 No.108498060

Anonymous 04/01/26(Wed)00:28:17 No.108498060

>>108498049
yeah but most of the big guys decided that it was a good idea to put a good chunk of their datacenters in said desert shitholes
which also means that all the smaller guys who rent compute and hosting space from them also have their shit in said desert shitholes that are currently getting bombed

Anonymous
04/01/26(Wed)00:28:32 No.108498061

Anonymous 04/01/26(Wed)00:28:32 No.108498061

>>108498004
You should put the code somewhere, it looks good.

Anonymous
04/01/26(Wed)00:30:15 No.108498069

Anonymous 04/01/26(Wed)00:30:15 No.108498069

Globe according to unsloth-Qwen3.5-27B-UD-Q8_K_XL.gguf

Anonymous
04/01/26(Wed)00:30:54 No.108498072

Anonymous 04/01/26(Wed)00:30:54 No.108498072

>>108498009
>everyone watched jjba
actual reddit assumption

Anonymous
04/01/26(Wed)00:31:10 No.108498073

Anonymous 04/01/26(Wed)00:31:10 No.108498073

>>108498069
Big if true.

Anonymous
04/01/26(Wed)00:31:48 No.108498076

Anonymous 04/01/26(Wed)00:31:48 No.108498076

File: globe-unsloth-Qwen3.5-27B(...).png (392 B, 120x60)

392 B PNG

>>108498073
>>108498069

Anonymous
04/01/26(Wed)00:32:38 No.108498079

Anonymous 04/01/26(Wed)00:32:38 No.108498079

>>108498060
that was pretty stupid of them, good thing I only run local models on my own hardware and don't have this problem

Anonymous
04/01/26(Wed)00:33:17 No.108498081

Anonymous 04/01/26(Wed)00:33:17 No.108498081

>>108498076
this isn't 4ants.org

Anonymous
04/01/26(Wed)00:33:41 No.108498085

Anonymous 04/01/26(Wed)00:33:41 No.108498085

>>108497993
>redownloading?
>>108498053 (me)
You can try git clone, install lfs (with --local) and shove the blobs you already have in their directory. They go in .git/lfs/objects/{2c}/{2c}/{hash}. Check the screenshot to get an idea.

Anonymous
04/01/26(Wed)00:33:53 No.108498088

Anonymous 04/01/26(Wed)00:33:53 No.108498088

>>108498081
450 requests... I'll make a larger one later after I'm done playing with colors.

Anonymous
04/01/26(Wed)00:34:04 No.108498089

Anonymous 04/01/26(Wed)00:34:04 No.108498089

>>108498072
the other one i just got was "select the vegetable" with pizza being the correct answer as the other five options were fruits
plebbit indeed

Anonymous
04/01/26(Wed)00:34:32 No.108498092

Anonymous 04/01/26(Wed)00:34:32 No.108498092

File: file.png (10 KB, 386x214)

10 KB PNG

>>108498072
yeah but this shit is even worse "um you have to know the flag of some random backwater country"

Anonymous
04/01/26(Wed)00:35:26 No.108498101

Anonymous 04/01/26(Wed)00:35:26 No.108498101

>>108498092
Poland is easy. It's just the Russian and Czech flags with the blue removed.

Anonymous
04/01/26(Wed)00:36:45 No.108498109

Anonymous 04/01/26(Wed)00:36:45 No.108498109

>>108495464
>1-bit kernels: llama.cpp fork
FUCK
OFF

Anonymous
04/01/26(Wed)00:37:43 No.108498114

Anonymous 04/01/26(Wed)00:37:43 No.108498114

>>108498092
I got one of the guy standing in front of the plane meme, I had no idea who it was or what movie it's from because I don't consume zionwood slop so I took a screenshot of it had qwen3.5 35b tell me it's "Tom Cruise" from "Top Gun" which it accepted as correct.

Anonymous
04/01/26(Wed)00:38:09 No.108498116

Anonymous 04/01/26(Wed)00:38:09 No.108498116

whats a good model purely for covering things you're not allowed to ask commercial models like related to piracy/copyright protected stuff, whats the best direction to go down for that kind of stuff?

Anonymous
04/01/26(Wed)00:38:39 No.108498117

Anonymous 04/01/26(Wed)00:38:39 No.108498117

File: forwhales.png (1 KB, 1200x600)

1 KB PNG

>>108498081

Anonymous
04/01/26(Wed)00:40:05 No.108498120

Anonymous 04/01/26(Wed)00:40:05 No.108498120

>>108498116
hello fellow badass hacker on 4chan if you dare i can recommend stablelm-7b for only the gnarliest and most dastardly of usecases

Anonymous
04/01/26(Wed)00:41:22 No.108498123

Anonymous 04/01/26(Wed)00:41:22 No.108498123

>>108498120
Are you mental? I don't think he's ready for that model...

Anonymous
04/01/26(Wed)00:42:03 No.108498128

Anonymous 04/01/26(Wed)00:42:03 No.108498128

is turboquant merged yet

Anonymous
04/01/26(Wed)00:42:04 No.108498129

Anonymous 04/01/26(Wed)00:42:04 No.108498129

>>108498116
Kimi K2

Anonymous
04/01/26(Wed)00:42:21 No.108498131

Anonymous 04/01/26(Wed)00:42:21 No.108498131

>>108498120
DO NOT DOWNLOAD THIS IT PRODUCES NERVE GAS FROM YOUR GPU

Anonymous
04/01/26(Wed)00:42:40 No.108498134

Anonymous 04/01/26(Wed)00:42:40 No.108498134

File: file.png (82 KB, 1873x939)

82 KB PNG

>>108498061
Thanks anon, i might make it public eventually

Anonymous
04/01/26(Wed)00:42:56 No.108498137

Anonymous 04/01/26(Wed)00:42:56 No.108498137

>>108498009
>>108498092
memestock market april fools was better

Anonymous
04/01/26(Wed)00:44:10 No.108498147

Anonymous 04/01/26(Wed)00:44:10 No.108498147

>>108498092
I got the dude from the baneposting meme. But I never watched whatever movie is it from. Bane was not the answer.

Anonymous
04/01/26(Wed)00:46:30 No.108498155

Anonymous 04/01/26(Wed)00:46:30 No.108498155

>>108498061
if he doesn't take a look at https://codepen.io/RobotsPlay/pen/bGeNGdx and https://github.com/accrazed/YoRHA-UI-BetterDiscord for resources to vibecode your own

Anonymous
04/01/26(Wed)00:48:00 No.108498166

Anonymous 04/01/26(Wed)00:48:00 No.108498166

File: Screenshot_20260331_234720.png (241 KB, 410x508)

241 KB PNG

>>108498009
k-kino

Anonymous
04/01/26(Wed)00:48:02 No.108498167

Anonymous 04/01/26(Wed)00:48:02 No.108498167

>>108498147
No captcha required for me for now, which is good because that sounds lame.

Anonymous
04/01/26(Wed)00:48:51 No.108498171

Anonymous 04/01/26(Wed)00:48:51 No.108498171

File: no.png (29 KB, 390x399)

29 KB PNG

Nooo.....

Anonymous
04/01/26(Wed)00:50:29 No.108498182

Anonymous 04/01/26(Wed)00:50:29 No.108498182

>>108498116
https://huggingface.co/HauhauCS/Qwen3.5-27B-Uncensored-HauhauCS-Aggressive

Anonymous
04/01/26(Wed)00:51:06 No.108498188

Anonymous 04/01/26(Wed)00:51:06 No.108498188

>>108498009
>>108498166
With how elaborate the captcha has been getting it took me this long to realize that it's because of april fools.

Anonymous
04/01/26(Wed)00:51:58 No.108498191

Anonymous 04/01/26(Wed)00:51:58 No.108498191

>>108498182
You can uncensor all you like, Qwen extensively filters out copyright from their training data.

Anonymous
04/01/26(Wed)00:55:44 No.108498206

Anonymous 04/01/26(Wed)00:55:44 No.108498206

>>108498171
yeah if it was every single captcha itd be funnier but also sometimes you still get the normal one which is gay and retarded

Anonymous
04/01/26(Wed)00:58:26 No.108498214

Anonymous 04/01/26(Wed)00:58:26 No.108498214

File: gemma3-27b-a.png (20 KB, 332x396)

20 KB PNG

q4_k_M of an old abliterated gemma-3-27b
does better than the Qwen3.5's for me

Anonymous
04/01/26(Wed)01:00:38 No.108498231

Anonymous 04/01/26(Wed)01:00:38 No.108498231

>bonsai
for me it's autoround
https://huggingface.co/Intel/Qwen3.5-397B-A17B-gguf-q2ks-mixed-AutoRound
https://huggingface.co/Intel/Qwen3.5-122B-A10B-gguf-q2ks-mixed-AutoRound
https://huggingface.co/Intel/Qwen3.5-35B-A3B-gguf-q2ks-mixed-AutoRound

Anonymous
04/01/26(Wed)01:03:12 No.108498252

Anonymous 04/01/26(Wed)01:03:12 No.108498252

>>108498231
Do you need a custom llama.cpp fork, or an intel GPU to use these?

Anonymous
04/01/26(Wed)01:03:55 No.108498258

Anonymous 04/01/26(Wed)01:03:55 No.108498258

>>108498252
I just updooted llamacpp and it werks

Anonymous
04/01/26(Wed)01:11:13 No.108498299

Anonymous 04/01/26(Wed)01:11:13 No.108498299

File: globe-unsloth-Qwen3.5-27B(...).png (15 KB, 480x240)

15 KB PNG

>>108498081
>>108498076
mk2. This was >6000 requests

Anonymous
04/01/26(Wed)01:16:46 No.108498331

Anonymous 04/01/26(Wed)01:16:46 No.108498331

File: file.png (614 KB, 1092x642)

614 KB PNG

>>108498299
Pretty cool seeing how models have improved. From the original article, used to be only 70B active and up had that kind of accuracy.

Anonymous
04/01/26(Wed)01:21:24 No.108498371

Anonymous 04/01/26(Wed)01:21:24 No.108498371

File: globe-unsloth-GLM-4.6V-UD(...).png (2 KB, 360x180)

2 KB PNG

>>108498299

Anonymous
04/01/26(Wed)01:23:20 No.108498385

Anonymous 04/01/26(Wed)01:23:20 No.108498385

File: globe-unsloth-Devstral-2-(...).png (2 KB, 360x180)

2 KB PNG

>>108498371

Anonymous
04/01/26(Wed)01:24:44 No.108498393

Anonymous 04/01/26(Wed)01:24:44 No.108498393

File: globe-phi-4-Q6_K.png (2 KB, 360x180)

2 KB PNG

phi-4-Q6_K

Anonymous
04/01/26(Wed)01:25:38 No.108498399

Anonymous 04/01/26(Wed)01:25:38 No.108498399

>>108498385
Have you considered making a benchmark out of this by computing the mean squared error relative to an actual map?

Anonymous
04/01/26(Wed)01:26:42 No.108498407

Anonymous 04/01/26(Wed)01:26:42 No.108498407

https://huggingface.co/unsloth/GLM-4.7-Flash-GGUF/tree/main
How do I find out which of these is good for my dogshit 8GB AMD card. Also I want to fine tune one of these (not on my dogshit card)

Anonymous
04/01/26(Wed)01:26:46 No.108498408

Anonymous 04/01/26(Wed)01:26:46 No.108498408

>>108498393
How many degrees of resolution are you using?

Anonymous
04/01/26(Wed)01:27:34 No.108498414

Anonymous 04/01/26(Wed)01:27:34 No.108498414

File: globe-gpt-oss-120b-mxfp4-(...).png (2 KB, 360x180)

2 KB PNG

gpt-oss-120b-mxfp4

>>108498399
So that benchmaxxers can optimize for it and the test stops being useful?

>>108498408
30x15 for those small ones, the >>108498299 is 120x60

Anonymous
04/01/26(Wed)01:28:50 No.108498424

Anonymous 04/01/26(Wed)01:28:50 No.108498424

>>108498414
>30x15
Yeah I think that's too small. I get it takes a while to gen otherwise but at that size stuff just looks too much like blobs

Anonymous
04/01/26(Wed)01:28:59 No.108498426

Anonymous 04/01/26(Wed)01:28:59 No.108498426

File: globe-gemma-3-27b-it-UD-Q(...).png (2 KB, 360x180)

2 KB PNG

gemma-3-27b-it-UD-Q4_K_XL

Anonymous
04/01/26(Wed)01:31:08 No.108498442

Anonymous 04/01/26(Wed)01:31:08 No.108498442

>>108498407
Figure out which will fit with your RAM included, leaving a few gb for context, your OS and whatever else you use.

Anonymous
04/01/26(Wed)01:31:39 No.108498446

Anonymous 04/01/26(Wed)01:31:39 No.108498446

>>108498424
they look like a rorschach test

Anonymous
04/01/26(Wed)01:33:32 No.108498456

Anonymous 04/01/26(Wed)01:33:32 No.108498456

File: globe-bartowski-Qwen_Qwen(...).png (2 KB, 360x180)

2 KB PNG

>>108498424
I could extend the script to make additional requests only for locations where the model is not very sure, and that is likely to cut down on number of total requests for sure, but for now I just want to run the models I have through this.

Anonymous
04/01/26(Wed)01:35:49 No.108498476

Anonymous 04/01/26(Wed)01:35:49 No.108498476

>>108498456
i know it would take ages but i wonder how better it would get with reasoning
i'd imagine some would see improve while some seeing significant degredation

Anonymous
04/01/26(Wed)01:36:58 No.108498487

Anonymous 04/01/26(Wed)01:36:58 No.108498487

File: globe-bartowski-Qwen_Qwen(...).png (2 KB, 360x180)

2 KB PNG

bartowski-Qwen_Qwen3-235B-A22B-Instruct-2507-IQ2_S

best so far

>>108498476
The way I'm doing it is incompatible with reasoning. I'm asking to predict one token and am looking at its probability distribution.

Anonymous
04/01/26(Wed)01:38:28 No.108498497

Anonymous 04/01/26(Wed)01:38:28 No.108498497

I forgot to screenshot, but I got Mikuptcha!

Anonymous
04/01/26(Wed)01:38:50 No.108498501

Anonymous 04/01/26(Wed)01:38:50 No.108498501

File: globe-Devstral-Small-2505(...).png (2 KB, 360x180)

2 KB PNG

Devstral-Small-2505-UD-Q5_K_XL

Anonymous
04/01/26(Wed)01:39:59 No.108498513

Anonymous 04/01/26(Wed)01:39:59 No.108498513

I NEED DIPSY

Anonymous
04/01/26(Wed)01:40:10 No.108498516

Anonymous 04/01/26(Wed)01:40:10 No.108498516

File: Screenshot_5-3-2026_21122(...).jpg (4 KB, 59x56)

4 KB JPG

>>108498069
so you just ask the model to spit out an image or it spits out ASCII and you convert to pixels, how exactly does this test work, very interesting test

What can I run on a single 3090 and my system ram is only 32gb, i do machine learning but in between would be fun to try running a local model, would be nice to get reasonable performance, i find myself using deepseek recently so wonder if thats the best local model

Anonymous
04/01/26(Wed)01:40:32 No.108498519

Anonymous 04/01/26(Wed)01:40:32 No.108498519

>>108498487
i know it would make logprob very uninteresting well im running one rn

Anonymous
04/01/26(Wed)01:44:29 No.108498540

Anonymous 04/01/26(Wed)01:44:29 No.108498540

>>108498231
Is it better these days? The last time I tested it, it didn't do any better than the normal bartowski quants, actually it seemed even a bit worse.

Anonymous
04/01/26(Wed)01:44:37 No.108498541

Anonymous 04/01/26(Wed)01:44:37 No.108498541

>>108498516
https://outsidetext.substack.com/p/how-does-a-blind-model-see-the-earth

Anonymous
04/01/26(Wed)01:45:30 No.108498548

Anonymous 04/01/26(Wed)01:45:30 No.108498548

File: globe-unsloth-Mistral-Sma(...).png (2 KB, 360x180)

2 KB PNG

unsloth-Mistral-Small-4-119B-2603-MXFP4_MOE

Surprisingly good considering how bad mistral 4 was.

>>108498516
It's more or less same as all other anons: I make 450 requests, each like this:
Imagine the location at given coordinates.
I want to know if there is ocean/sea there (if so, reply: ocean), or anything else - land, lakes, rivers, mountains, etc (if so, reply: land).
The coordinates are: latitude={lat}° and longitude={lon}°

Your options:

1 = ocean
2 = land

Reply with just a single digit.
And then I look at the probabilities of 1 and 2 in the response token.

You can run a 24B at 4bit easily, via llama-cpp. Devstral, for example.

>>108498519
It's not about being uninteresting, it's that if I ask to generate only one token of a thinking model, that token will be <thinking> or its equivalent - entirely unrelated to the request.

Anonymous
04/01/26(Wed)01:47:04 No.108498559

Anonymous 04/01/26(Wed)01:47:04 No.108498559

>>108498548
to be fair i could draw you a not too shitty map of the world right now, but if you were to ask me to guess by long and lat, i'd have no clue and the resulting map would look nothing like what i'd have drawn.

Anonymous
04/01/26(Wed)01:48:58 No.108498570

Anonymous 04/01/26(Wed)01:48:58 No.108498570

File: globe-bartowski-zai-org_G(...).png (3 KB, 360x180)

3 KB PNG

bartowski-zai-org_GLM-4.7-Flash-Q6_K_L

>>108498559
It's a test of knowledge of geography, of what is located where, not of how well the text generation model can draw.

Anonymous
04/01/26(Wed)01:54:19 No.108498599

Anonymous 04/01/26(Wed)01:54:19 No.108498599

>>108498548
I want a coding/agentic model though less so creative kind of like codex i guess, but more like anthropic

Anonymous
04/01/26(Wed)01:54:32 No.108498601

Anonymous 04/01/26(Wed)01:54:32 No.108498601

File: dipsyOfCourse.png (1.55 MB, 1024x1024)

1.55 MB PNG

>>108498513

Anonymous
04/01/26(Wed)01:56:06 No.108498614

Anonymous 04/01/26(Wed)01:56:06 No.108498614

File: globe-bartowski-nvidia_Ne(...).png (2 KB, 360x180)

2 KB PNG

bartowski-nvidia_Nemotron-3-Super-120B-A12B-IQ3_M

>>108498599
Yes, you can have that. Devstral can work, Qwen 30B-3A, Qwen 27B... They are of course all going to be worse than big corpo models, by a lot. But you can.

Anonymous
04/01/26(Wed)02:01:12 No.108498645

Anonymous 04/01/26(Wed)02:01:12 No.108498645

https://github.com/PrismML-Eng/Bonsai-demo/blob/main/1-bit-bonsai-8b-whitepaper.pdf
The paper doesn't explain much how they did it right? That means we can't really reproduce it? You know what? To me that's the proof it's a real deal, when it's a meme they don't hesitate to share the garbage, but when it's actually good they tend to hide the secret sauce

Anonymous
04/01/26(Wed)02:04:05 No.108498658

Anonymous 04/01/26(Wed)02:04:05 No.108498658

>>108498645
Their 8B is worse than 3Bs... How did they do it?????

Anonymous
04/01/26(Wed)02:05:15 No.108498664

Anonymous 04/01/26(Wed)02:05:15 No.108498664

>>108498559
Imagine your drawn map and then pick a point on the map according to lat/long. You just have to know what the maximum and minimum value is and then you're just guessing distances from the centre.

You CAN see things in your mind, right anon?

Anonymous
04/01/26(Wed)02:07:22 No.108498676

Anonymous 04/01/26(Wed)02:07:22 No.108498676

>>108498645
they require a custom fork of llmao anyway (400 commits behind too lmao) so they can fuck off for all I care.

Anonymous
04/01/26(Wed)02:10:22 No.108498696

Anonymous 04/01/26(Wed)02:10:22 No.108498696

>>108498676
but this shit is fast as fuck, I tested the 8b model on my 3060 and I got 90t/s

Anonymous
04/01/26(Wed)02:10:51 No.108498699

Anonymous 04/01/26(Wed)02:10:51 No.108498699

>>108498676
Just rebase lmao

Anonymous
04/01/26(Wed)02:11:54 No.108498705

Anonymous 04/01/26(Wed)02:11:54 No.108498705

>>108498664
I was trying to find the words to explain the difference, but you nailed it. World maps are everywhere online and asking it to output an SVG or whatever would be mostly a memorization test. Having to consider the maximum and minimum values and distance from the center to visualize the map before guessing is why it's a good measure of true intelligence.

Anonymous
04/01/26(Wed)02:13:35 No.108498711

Anonymous 04/01/26(Wed)02:13:35 No.108498711

File: file.png (391 KB, 2165x1167)

391 KB PNG

>>108498548
this will fucking take ages..

Anonymous
04/01/26(Wed)02:21:37 No.108498759

Anonymous 04/01/26(Wed)02:21:37 No.108498759

File: globe-unsloth-Qwen3.5-27B(...).png (2 KB, 360x180)

2 KB PNG

kek, asked the model to predict the continent instead of land/ocean

Anonymous
04/01/26(Wed)02:24:26 No.108498774

Anonymous 04/01/26(Wed)02:24:26 No.108498774

>>108498705
It can easily be benchmaxxed

Anonymous
04/01/26(Wed)02:24:56 No.108498778

Anonymous 04/01/26(Wed)02:24:56 No.108498778

>>108498759
that worked way better

Anonymous
04/01/26(Wed)02:25:29 No.108498782

Anonymous 04/01/26(Wed)02:25:29 No.108498782

File: 1750929952026217.png (3.06 MB, 1168x1792)

3.06 MB PNG

>>108498513

Anonymous
04/01/26(Wed)02:31:20 No.108498812

Anonymous 04/01/26(Wed)02:31:20 No.108498812

>>108498664
my point is that i don't know where the numbers would map to on the map.
and with the distortion etc, that'sn ot hwo long lat works, it's not just a nice grid.
being given 2 coordinates and guessing is much harder than being shown a point on a grid that should represent the map of the world.

>You CAN see things in your mind, right anon?
yes

Anonymous
04/01/26(Wed)02:32:00 No.108498813

Anonymous 04/01/26(Wed)02:32:00 No.108498813

>>108498812
>>108498664
also it's much harder than even if you were give me a x and y between 1 and 20 on a flat projection.
because lng lat are distorted on a flat projection.

Anonymous
04/01/26(Wed)02:37:13 No.108498832

Anonymous 04/01/26(Wed)02:37:13 No.108498832

>>108498813
Imagine yourself putting the coords into google maps, examining the location visually, and then writing your answer. The test is to find out whether the model has the knowledge of the map, the one that google maps provides, built-in.

Anonymous
04/01/26(Wed)02:37:32 No.108498833

Anonymous 04/01/26(Wed)02:37:32 No.108498833

>>108498231
works suprisingly well but the speed is atrocious (testing 122b q2) especially compared to 122b q4

Anonymous
04/01/26(Wed)02:38:14 No.108498837

Anonymous 04/01/26(Wed)02:38:14 No.108498837

File: globe-bartowski-nvidia_Ne(...).png (2 KB, 360x180)

2 KB PNG

>>108498614
for comaprison

Anonymous
04/01/26(Wed)02:39:26 No.108498841

Anonymous 04/01/26(Wed)02:39:26 No.108498841

>>108498837
what was the system prompt?
mine is behaving not so good

Anonymous
04/01/26(Wed)02:39:38 No.108498844

Anonymous 04/01/26(Wed)02:39:38 No.108498844

>>108498832
that's retarded.
i can imagine a clear picture of the world map in my head and i have no idea of how it maps to lat lng.

Anonymous
04/01/26(Wed)02:40:17 No.108498850

Anonymous 04/01/26(Wed)02:40:17 No.108498850

>>108498837
Did asking it for the continent make it more confident or did you just not change the color by confidence?

Anonymous
04/01/26(Wed)02:40:39 No.108498853

Anonymous 04/01/26(Wed)02:40:39 No.108498853

File: 1756245175923020.jpg (77 KB, 1360x768)

77 KB JPG

>>108497919
>>108495464

>https://prismml.com/news/bonsai-8b
I fell asleep while people were discussing this and posting examples. What is it and why should /lmg/ or anyone else care? Is it good for RP and/or anything else?

Anonymous
04/01/26(Wed)02:42:02 No.108498860

Anonymous 04/01/26(Wed)02:42:02 No.108498860

File: 1751056391727990.png (197 KB, 2054x974)

197 KB PNG

https://arxiv.org/abs/2603.15031
this paper is kinda brillant, they changed the transformers architecture to make it better, that's what I want to see more, and not just "just stack moar layers bro!!"

Anonymous
04/01/26(Wed)02:42:29 No.108498862

Anonymous 04/01/26(Wed)02:42:29 No.108498862

>>108498853
the discussion and example posts didn't vanish, you can still read them

Anonymous
04/01/26(Wed)02:43:03 No.108498865

Anonymous 04/01/26(Wed)02:43:03 No.108498865

>>108498853
>What is it and why should /lmg/ or anyone else care?
they managed to make a 1bit quant that doesn't make the model retarded

Anonymous
04/01/26(Wed)02:43:42 No.108498866

Anonymous 04/01/26(Wed)02:43:42 No.108498866

File: globe-gemma-3-27b-it-UD-Q(...).png (2 KB, 360x180)

2 KB PNG

gemma-3-27b-it-UD-Q4_K_XL
Comparison: >>108498426

>>108498841
I want to know what continent is at the location with given coordinates (or, if there is ocean/sea there)
The coordinates are: latitude={lat}° and longitude={lon}°

Your options:

1 = Africa
...
8 = Ocean

Reply with just a single digit.

>>108498844
Okay but you are not as smart as a model which clearly knows how maps work in lat lng.

>>108498850
I get the probs of top two options and the color is a mixture of that with ratio = prob_a / (prob_a + prob_b)

Anonymous
04/01/26(Wed)02:46:02 No.108498876

Anonymous 04/01/26(Wed)02:46:02 No.108498876

>>108498866
>which clearly knows how maps work in lat lng.
no not realy, the map they end up drawing is shitty, my whole point is that the benchmark itself is stupid and doesn't reflect what knowledge of the world looks like.

Anonymous
04/01/26(Wed)02:46:07 No.108498877

Anonymous 04/01/26(Wed)02:46:07 No.108498877

been out of the loop. did they figure out qwen 35b-a3b reprocessing the whole context every single message?

Anonymous
04/01/26(Wed)02:47:02 No.108498882

Anonymous 04/01/26(Wed)02:47:02 No.108498882

>>108498876
>>108498866
in fact i'd bet that if you asked it to draw a svg.
or gave it a 20by10x grid (which isn't lat lng) it'd probably draw a better map.

Anonymous
04/01/26(Wed)02:48:33 No.108498889

Anonymous 04/01/26(Wed)02:48:33 No.108498889

>>108498882
The point isn't to draw a better map, retard. The point is guage its world model.

Anonymous
04/01/26(Wed)02:48:37 No.108498890

Anonymous 04/01/26(Wed)02:48:37 No.108498890

>>108498696
Are the outputs any good though?....not just for coiming but is any actual "intelligence" retained?

Anonymous
04/01/26(Wed)02:49:19 No.108498893

Anonymous 04/01/26(Wed)02:49:19 No.108498893

File: globe-bartowski-Qwen_Qwen(...).png (2 KB, 360x180)

2 KB PNG

bartowski-Qwen_Qwen3-235B-A22B-Instruct-2507-IQ2_S-
Comparison: >>108498487

>>108498882
>>108498876
I don't want a better map, I want to know if the model knows the world well. SVG would be a step backwards, because the model would just recall SVGs of the world it had in penty in the training dataset.

Anonymous
04/01/26(Wed)02:50:09 No.108498896

Anonymous 04/01/26(Wed)02:50:09 No.108498896

>>108498813
It's not harder, you'd just get a different projection.

Anonymous
04/01/26(Wed)02:50:12 No.108498898

Anonymous 04/01/26(Wed)02:50:12 No.108498898

>>108498889
and my point is that the way your benchmark is designed makes it non ideal for evaluating what you want.

Anonymous
04/01/26(Wed)02:50:19 No.108498900

Anonymous 04/01/26(Wed)02:50:19 No.108498900

File: 1754668612668097.png (2.69 MB, 1880x1072)

2.69 MB PNG

Anonymous
04/01/26(Wed)02:50:36 No.108498902

Anonymous 04/01/26(Wed)02:50:36 No.108498902

>>108498876
You don't have knowledge of the world if you think you can visualize the globe but "have no idea" how to make latitude and longitude correspond to points on it. You are worth less than an 8B model. You are not qualified to talk about anything.

Anonymous
04/01/26(Wed)02:50:46 No.108498903

Anonymous 04/01/26(Wed)02:50:46 No.108498903

>>108498893
>SVG would be a step backwards, because the model would just recall SVGs of the world it had in penty in the training dataset.

then don't use lat lng either, use a normal flat pixel grid.

Anonymous
04/01/26(Wed)02:51:33 No.108498908

Anonymous 04/01/26(Wed)02:51:33 No.108498908

>>108498896
which is why it's harder to do it from memory.
idk about the bullshit lat lng projection because i never use those in my life.

Anonymous
04/01/26(Wed)02:52:45 No.108498913

Anonymous 04/01/26(Wed)02:52:45 No.108498913

>>108498898
>>108498903
You don't understand the difference between memory and understanding.

Anonymous
04/01/26(Wed)02:52:46 No.108498914

Anonymous 04/01/26(Wed)02:52:46 No.108498914

>>108498903
lat lng is closer to real world knowledge than pixels

Anonymous
04/01/26(Wed)02:53:01 No.108498915

Anonymous 04/01/26(Wed)02:53:01 No.108498915

>>108498908
Well then stop commenting on shit you don't understand.

Anonymous
04/01/26(Wed)02:53:25 No.108498918

Anonymous 04/01/26(Wed)02:53:25 No.108498918

File: based.png (431 KB, 800x582)

431 KB PNG

>>108498860
The change seems simple enough, I'm sure the big dogs like Anthropic or Google already know about it but didn't decide to disclose it, which is why I love China, they share valuable discoveries for the greater good of humanity!

Anonymous
04/01/26(Wed)02:54:11 No.108498921

Anonymous 04/01/26(Wed)02:54:11 No.108498921

>>108498893
This is great and all, but how will this help me jack off?

Anonymous
04/01/26(Wed)02:55:11 No.108498927

Anonymous 04/01/26(Wed)02:55:11 No.108498927

>>108498898
It does and your difficulty thinking about it shows it's usefulness. It's not primarily a geography quiz you dumb fuck it's a test of the ability to generalize.

Anonymous
04/01/26(Wed)02:56:29 No.108498933

Anonymous 04/01/26(Wed)02:56:29 No.108498933

>>108498921
If you choose to jack off with the one that makes the best maps, it should have better spatial awareness and be less prone to taking its pants off twice and sucking its own dick while whispering in your ear.

Anonymous
04/01/26(Wed)02:56:30 No.108498934

Anonymous 04/01/26(Wed)02:56:30 No.108498934

File: file.png (209 KB, 1639x1129)

209 KB PNG

>>108498866
>>108498893
cool, this prompt works better

Anonymous
04/01/26(Wed)02:58:41 No.108498948

Anonymous 04/01/26(Wed)02:58:41 No.108498948

>>108498893
>I want to know if the model knows the world well.
That's not what the test is for, it's to test if the model can use things it "knows" outside the context of questions in the same format it was trained on.

Anonymous
04/01/26(Wed)03:02:24 No.108498962

Anonymous 04/01/26(Wed)03:02:24 No.108498962

Are You Smarter Than A 27B LLM? would make for a good game show if those were still a thing

Anonymous
04/01/26(Wed)03:03:24 No.108498969

Anonymous 04/01/26(Wed)03:03:24 No.108498969

File: 1758116402704839.png (148 KB, 640x562)

148 KB PNG

https://xcancel.com/thejobchick/status/2039032800452723034
>Oracle will fire 30000 employees
now that AI is replacing jobs, when will we get Universal Basic Income?

Anonymous
04/01/26(Wed)03:04:16 No.108498976

Anonymous 04/01/26(Wed)03:04:16 No.108498976

>>108498913
i think you are the one that doesn't.
>>108498927
it shows how retarded it is.
>it's a test of the ability to generalize.
and thus my exact point of why your test is retarded, it has literaly memorized coordinates of most towns / areas.
it does not have a memory of a grid with an arbitrary number of subdivisions.

Anonymous
04/01/26(Wed)03:04:17 No.108498977

Anonymous 04/01/26(Wed)03:04:17 No.108498977

>>108498934
>kullback-leibler
huh, what's this?

Anonymous
04/01/26(Wed)03:04:37 No.108498981

Anonymous 04/01/26(Wed)03:04:37 No.108498981

>>108498948
All modern models can. You are free to go ahead and test it yourself. We use 1 token classification at work a lot and it's very, very good. This is a test of how well it knows geography.

Anonymous
04/01/26(Wed)03:05:18 No.108498984

Anonymous 04/01/26(Wed)03:05:18 No.108498984

>>108498976
it's not ai taking jobs, it's companies bleeding money and pretending they can get leaner thanks to AI so it looks good to investors.

Anonymous
04/01/26(Wed)03:05:56 No.108498987

Anonymous 04/01/26(Wed)03:05:56 No.108498987

File: 1752197333146800.jpg (979 KB, 1024x1024)

979 KB JPG

>>108498782
>>108498900
? Are these hand tracings?

Anonymous
04/01/26(Wed)03:06:22 No.108498988

Anonymous 04/01/26(Wed)03:06:22 No.108498988

File: file.png (242 KB, 502x554)

242 KB PNG

>>108498977
that is what KL stands for KL divergence
it is just a meme fix variant of hauhauCS uncensor

Anonymous
04/01/26(Wed)03:08:46 No.108499001

Anonymous 04/01/26(Wed)03:08:46 No.108499001

>>108498969
I think Oracle is constantly in the process of laying off a bunch of people. Isn't this just their normal churn?
>>108498984
This.
> No really we're not making staff adjustments because we overhired and now we're losing money
> It's AI!

Anonymous
04/01/26(Wed)03:08:52 No.108499002

Anonymous 04/01/26(Wed)03:08:52 No.108499002

>>108498987
no, just https://huggingface.co/circlestone-labs/Anima

Anonymous
04/01/26(Wed)03:09:25 No.108499005

Anonymous 04/01/26(Wed)03:09:25 No.108499005

>>108498988
is that model alright?

Anonymous
04/01/26(Wed)03:14:59 No.108499032

Anonymous 04/01/26(Wed)03:14:59 No.108499032

>>108499001
Absolutely wasted.

Anonymous
04/01/26(Wed)03:15:30 No.108499035

Anonymous 04/01/26(Wed)03:15:30 No.108499035

>fit completely broken
good job cudadev!

Anonymous
04/01/26(Wed)03:16:59 No.108499042

Anonymous 04/01/26(Wed)03:16:59 No.108499042

>>108499005
it feels alright
honestly best i've been used so far for the size but ymmv

Anonymous
04/01/26(Wed)03:17:26 No.108499046

Anonymous 04/01/26(Wed)03:17:26 No.108499046

>>108498981
I encourage you to read the blog post where the rationale for the original test was explained.

Anonymous
04/01/26(Wed)03:17:52 No.108499049

Anonymous 04/01/26(Wed)03:17:52 No.108499049

>>108498009
I am ready for the first /lmg/ thread of culture.

Anonymous
04/01/26(Wed)03:18:32 No.108499052

Anonymous 04/01/26(Wed)03:18:32 No.108499052

>>108499046
I encourage you to read my post.

Anonymous
04/01/26(Wed)03:20:29 No.108499057

Anonymous 04/01/26(Wed)03:20:29 No.108499057

File: 1775008292861166.jpg (312 KB, 1286x1244)

312 KB JPG

Anonymous
04/01/26(Wed)03:23:03 No.108499068

Anonymous 04/01/26(Wed)03:23:03 No.108499068

pygmaballs

Anonymous
04/01/26(Wed)03:23:25 No.108499070

Anonymous 04/01/26(Wed)03:23:25 No.108499070

File: file.png (169 KB, 1870x941)

169 KB PNG

>>108498061
Nevermind i tried to get back into it but got prompt limited after 10-20 prompts, you can have the full project, i hope it helps in any way despite being poorly vibe coded

https://files.catbox.moe/aurnot.zip

Anonymous
04/01/26(Wed)03:23:27 No.108499071

Anonymous 04/01/26(Wed)03:23:27 No.108499071

>>108499057
>putting all your eggs into your google account
Nice Darwin Awards kek

Anonymous
04/01/26(Wed)03:24:35 No.108499083

Anonymous 04/01/26(Wed)03:24:35 No.108499083

File: file.png (202 KB, 1633x1135)

202 KB PNG

almost based

Anonymous
04/01/26(Wed)03:27:27 No.108499097

Anonymous 04/01/26(Wed)03:27:27 No.108499097

File: Screenshot_20260401_09253(...).jpg (214 KB, 810x1269)

214 KB JPG

What happens when the models shatter?

Anonymous
04/01/26(Wed)03:28:44 No.108499106

Anonymous 04/01/26(Wed)03:28:44 No.108499106

>>108499070
interface looks gamey (cool), I should really get to vibecoding my own shit.
but i just find the llama-server webui comfy nowadays, the only thing it's lacking really is skills support (planned), RAG (could probably do through mcp but i cant be bothered to set it up) and presets (system prompt and other shenanigans to get it to work with cards).
Ideally the bundled webui should support plugins so we could write our own stuff to extend it. I don't really want to manage all the agentic turns autism + mcp flow

Anonymous
04/01/26(Wed)03:30:27 No.108499112

Anonymous 04/01/26(Wed)03:30:27 No.108499112

>>108499097
govt prints another trillion to fix the issue

Anonymous
04/01/26(Wed)03:35:29 No.108499141

Anonymous 04/01/26(Wed)03:35:29 No.108499141

>>108499083
That's a lot of browns

Anonymous
04/01/26(Wed)03:40:21 No.108499155

Anonymous 04/01/26(Wed)03:40:21 No.108499155

>>108498860
Will Kimi K3 use this?

Anonymous
04/01/26(Wed)03:40:43 No.108499160

Anonymous 04/01/26(Wed)03:40:43 No.108499160

i want to draw dipsy but /g/ drawfag is kinda an oxymoron

Anonymous
04/01/26(Wed)03:44:34 No.108499177

Anonymous 04/01/26(Wed)03:44:34 No.108499177

File: file.png (226 KB, 1868x935)

226 KB PNG

>>108499106
Don't vibe code on ai studio unless you have a lot of patience, as for me i don't even know if i can vibe code again

Anonymous
04/01/26(Wed)03:45:44 No.108499182

Anonymous 04/01/26(Wed)03:45:44 No.108499182

>>108499177
oh fuck it's nier shit I knew I recognized the style somewhere

Anonymous
04/01/26(Wed)03:47:26 No.108499191

Anonymous 04/01/26(Wed)03:47:26 No.108499191

more vibeparser fixes:
https://github.com/ggml-org/llama.cpp/pull/21216
this is the damage that vibeshitters bring in, crap that has to be fixed by actual devs.

Anonymous
04/01/26(Wed)03:49:43 No.108499205

Anonymous 04/01/26(Wed)03:49:43 No.108499205

Holy fuck I didn't expect local to be this fucking censored and cucked, was it always this bad?

Anonymous
04/01/26(Wed)03:51:57 No.108499213

Anonymous 04/01/26(Wed)03:51:57 No.108499213

>>108499205
that sounds like a skill issue

Anonymous
04/01/26(Wed)03:54:39 No.108499227

Anonymous 04/01/26(Wed)03:54:39 No.108499227

>>108499182
I tried to get as close as i could, the background is animated and all of the decoration is vibe coded, including the svg parts

Anonymous
04/01/26(Wed)04:00:51 No.108499254

Anonymous 04/01/26(Wed)04:00:51 No.108499254

>>108499205
buy an ad

Anonymous
04/01/26(Wed)04:04:08 No.108499266

Anonymous 04/01/26(Wed)04:04:08 No.108499266

>>108499205
Yeah it's garbage

posted from my RTX 5090

Anonymous
04/01/26(Wed)04:06:41 No.108499275

Anonymous 04/01/26(Wed)04:06:41 No.108499275

>>108498759
>africa is black
hmm

Anonymous
04/01/26(Wed)04:07:48 No.108499279

Anonymous 04/01/26(Wed)04:07:48 No.108499279

File: 1768580559826064.png (14 KB, 660x148)

14 KB PNG

me-south-1 got bombed by Iran
https://health.aws.amazon.com/health/status

Anonymous
04/01/26(Wed)04:09:17 No.108499290

Anonymous 04/01/26(Wed)04:09:17 No.108499290

>>108499275
...that's blue

Anonymous
04/01/26(Wed)04:10:08 No.108499294

Anonymous 04/01/26(Wed)04:10:08 No.108499294

>>108499070
It's normal. I'm rewriting my client and I don't understand my logic in some areas. And these are something what I have written by myself and not even vibe coded.

Anonymous
04/01/26(Wed)04:11:53 No.108499304

Anonymous 04/01/26(Wed)04:11:53 No.108499304

>>108499205
>Holy fuck I didn't expect local to be this fucking censored and cucked, was it always this bad?
glm-4.6, nemo, command-r all basically uncensored
kimi-k2-instruct and deepseek-v3/r1 barely censored
newer local models more censored than cloud but can be worked around.
gpt-oss, qwen and other synth slop models are not worth fighting.

Anonymous
04/01/26(Wed)04:19:03 No.108499335

Anonymous 04/01/26(Wed)04:19:03 No.108499335

>>108499205
It was worse about year ago. Nowadays most local models can be almost completely decensored with some brain surgery, without major issues.
e.g.: https://github.com/p-e-w/heretic

Anonymous
04/01/26(Wed)04:22:58 No.108499351

Anonymous 04/01/26(Wed)04:22:58 No.108499351

>>108499304
I tried Qwen and GLM and with thinking on they refuse everything or make everything lame or "positive", with thinking off they work. Literally same JB I use in online models, not even gemini with thinking on refuses as much

Anonymous
04/01/26(Wed)04:23:58 No.108499354

Anonymous 04/01/26(Wed)04:23:58 No.108499354

Just tested Bonsai and it seems.... okay? Which is a lot better than expected. Not sure why this model isn't getting more hyped.

Anonymous
04/01/26(Wed)04:24:48 No.108499358

Anonymous 04/01/26(Wed)04:24:48 No.108499358

>>108499354
who cares about smaller llms? google already solved that by making them 6 times as efficient

Anonymous
04/01/26(Wed)04:25:22 No.108499362

Anonymous 04/01/26(Wed)04:25:22 No.108499362

>>108499354
Everyone thought it was an April's fools prank.

Anonymous
04/01/26(Wed)04:26:41 No.108499371

Anonymous 04/01/26(Wed)04:26:41 No.108499371

>>108499354
Proprietary shit. If they don't apply their method to the model you want, then you simply just won't receive it. And that also means little reason to support it in mainline Llama.cpp.

Anonymous
04/01/26(Wed)04:26:45 No.108499372

Anonymous 04/01/26(Wed)04:26:45 No.108499372

>>108499358
Huh, if turboquant is legit then it makes the KVCache no longer be the limiting factor and would only make model quantization even more important

Anonymous
04/01/26(Wed)04:28:53 No.108499380

Anonymous 04/01/26(Wed)04:28:53 No.108499380

>>108499354
>Not sure why this model isn't getting more hyped.
they didn't provide the method to make it happen, so we can't reproduce it ourselves, so it's just useless to us and a way to flex their muscles to them

Anonymous
04/01/26(Wed)04:31:32 No.108499396

Anonymous 04/01/26(Wed)04:31:32 No.108499396

>>108499371
>>108499380
It's like the 4 minute mile, the most important thing is that people realize it can be done

Anonymous
04/01/26(Wed)04:32:18 No.108499401

Anonymous 04/01/26(Wed)04:32:18 No.108499401

>>108499351
Skill issue

Anonymous
04/01/26(Wed)04:36:25 No.108499415

Anonymous 04/01/26(Wed)04:36:25 No.108499415

>>108499070
>despite being poorly vibe coded
lgtm, thanks for the zip

Anonymous
04/01/26(Wed)04:36:37 No.108499417

Anonymous 04/01/26(Wed)04:36:37 No.108499417

>>108499401
Post a jailbreak (that isn't yours) that would allow anything (with thinking on).
(you won't)

Anonymous
04/01/26(Wed)04:37:31 No.108499419

Anonymous 04/01/26(Wed)04:37:31 No.108499419

>>108499371
It's likely to include a laborious training process anyway, not just post-training quantization. Either way, knowing that it can be done, more open research groups will probably start working on it too. I'm looking forward to using a 120B model fully loaded on my 3090 in the future.

Anonymous
04/01/26(Wed)04:40:19 No.108499436

Anonymous 04/01/26(Wed)04:40:19 No.108499436

>>108499351
>I tried Qwen and GLM and with thinking on they refuse everything or make everything lame or "positive", with thinking off they work.
glm-4.6 like I suggested?
otherwise, yes it's like I said, the open weight models are more cucked.
even when they don't refuse, you see them gooning over refusals in the reasoning, unlike claude etc.

Anonymous
04/01/26(Wed)04:40:22 No.108499437

Anonymous 04/01/26(Wed)04:40:22 No.108499437

>>108499417
>do my job
No.

Anonymous
04/01/26(Wed)04:43:16 No.108499455

Anonymous 04/01/26(Wed)04:43:16 No.108499455

>>108499417
>Post a jailbreak (that isn't yours) that would allow anything (with thinking on).
nta but that's not how it works. you need a different jailbreak for each task/domain
try prefilling the safety reasoning, if you even try to make the refusal shorter or unspecific, the model corrects itself into giving a longer refusal lecture.
they have a reward function for choosing the correct refusal category during training now.

Anonymous
04/01/26(Wed)04:53:40 No.108499500

Anonymous 04/01/26(Wed)04:53:40 No.108499500

>>108498860
I really don't like that pseudoquery, I'd rather see a learnable linear projection of the layer input or output as a proper query.

Anonymous
04/01/26(Wed)05:06:46 No.108499536

Anonymous 04/01/26(Wed)05:06:46 No.108499536

File: 1762295507066198.jpg (6 KB, 279x181)

6 KB JPG

>>108499354
No one should give a shit until The methods they use to quantize that model are released. They don't deserve any hype or praise if that measly model Is all that exists. They deserve the worst and nothing but that for not doing that day 1

Anonymous
04/01/26(Wed)05:06:58 No.108499537

Anonymous 04/01/26(Wed)05:06:58 No.108499537

What are the best instruct-tuned/smart models in the 7B-14B range?

Anonymous
04/01/26(Wed)05:07:52 No.108499540

Anonymous 04/01/26(Wed)05:07:52 No.108499540

>>108499419
>open research groups will probably start working on it too.
They already have? No one has gotten anywhere or the ones that do hide it because they foolishly think they can make any money via VC hype

Anonymous
04/01/26(Wed)05:09:05 No.108499543

Anonymous 04/01/26(Wed)05:09:05 No.108499543

>>108499537
Qwen 3.5

Anonymous
04/01/26(Wed)05:09:23 No.108499544

Anonymous 04/01/26(Wed)05:09:23 No.108499544

>>108499540
why can't they just sell the method? I'm sure they can make money out of it

Anonymous
04/01/26(Wed)05:10:24 No.108499548

Anonymous 04/01/26(Wed)05:10:24 No.108499548

File: 1772252238152653.png (198 KB, 1228x1150)

198 KB PNG

>>108499544
>Sell

Worthless tourist

Anonymous
04/01/26(Wed)05:11:06 No.108499551

Anonymous 04/01/26(Wed)05:11:06 No.108499551

>>108499057
lmao gtards
lil' coomers always find a way

Anonymous
04/01/26(Wed)05:11:29 No.108499553

Anonymous 04/01/26(Wed)05:11:29 No.108499553

>>108499544
if they sell it, they get a few hundred thousand at best. if they use it to position themselves as having made ai 16 times as efficient, they're worth billions overnight

Anonymous
04/01/26(Wed)05:12:27 No.108499555

Anonymous 04/01/26(Wed)05:12:27 No.108499555

>>108499537
try the new hotness bonsai 8b

Anonymous
04/01/26(Wed)05:13:59 No.108499562

Anonymous 04/01/26(Wed)05:13:59 No.108499562

Gemma 4 is on kaggle

Anonymous
04/01/26(Wed)05:16:02 No.108499566

Anonymous 04/01/26(Wed)05:16:02 No.108499566

>>108499553
>if they use it to position themselves as having made ai 16 times as efficient, they're worth billions overnight
it's risky, people can reverse engineer their methods, we have the result after all, they shouldn't overstay their welcome, they should sell before it's too late

Anonymous
04/01/26(Wed)05:16:36 No.108499569

Anonymous 04/01/26(Wed)05:16:36 No.108499569

check out this dense model
*unzips cock*

Anonymous
04/01/26(Wed)05:16:41 No.108499571

Anonymous 04/01/26(Wed)05:16:41 No.108499571

>>108499540
I don't think there have been serious attempts to make binary weight quantization end-to-end (MLP, attention, embeddings/output layers) actually viable yet; not even BitNet went that far (it only quantized in low precision the MLP), and even their authors preferred to use ternary weights at the very least.

Anonymous
04/01/26(Wed)05:18:17 No.108499584

Anonymous 04/01/26(Wed)05:18:17 No.108499584

>>108499562
It's not live (yet) there. Apparently if you dig in Google AI studio js code there are references to https://www.kaggle.com/models/google/gemma-4 but it's not working.

Anonymous
04/01/26(Wed)05:19:42 No.108499595

Anonymous 04/01/26(Wed)05:19:42 No.108499595

My model mogs yours

Anonymous
04/01/26(Wed)05:27:21 No.108499624

Anonymous 04/01/26(Wed)05:27:21 No.108499624

>>108497919
/lmg/, the most trustworthy source of ai news betrayed me.
were is dipsy !!!!

Anonymous
04/01/26(Wed)05:28:46 No.108499629

Anonymous 04/01/26(Wed)05:28:46 No.108499629

So is the source code actually useful for anything?

Anonymous
04/01/26(Wed)05:34:19 No.108499646

Anonymous 04/01/26(Wed)05:34:19 No.108499646

>>108499629
Yes and no.

Anonymous
04/01/26(Wed)05:34:46 No.108499649

Anonymous 04/01/26(Wed)05:34:46 No.108499649

>>108499629
Yeah, useful if you want to start with 500k lines of technical debt when you make your own custom agent orchestrator. Other than that, no.

Anonymous
04/01/26(Wed)05:38:03 No.108499660

Anonymous 04/01/26(Wed)05:38:03 No.108499660

bonsai 1bit saved my life

Anonymous
04/01/26(Wed)05:38:30 No.108499662

Anonymous 04/01/26(Wed)05:38:30 No.108499662

File: 1753875623853513.png (194 KB, 748x624)

194 KB PNG

Anonymous
04/01/26(Wed)05:42:30 No.108499677

Anonymous 04/01/26(Wed)05:42:30 No.108499677

>>108499660
what the tf can this model be used for?

Anonymous
04/01/26(Wed)05:42:51 No.108499678

Anonymous 04/01/26(Wed)05:42:51 No.108499678

>>108499677
shitposting

Anonymous
04/01/26(Wed)05:45:40 No.108499690

Anonymous 04/01/26(Wed)05:45:40 No.108499690

>>108499677
it gives you hope that maybe one day your 96gb vram + 64gb ram poverty machine that peaked two years ago might one day run flagship models again after you missed out on upgrading last year

Anonymous
04/01/26(Wed)05:47:31 No.108499696

Anonymous 04/01/26(Wed)05:47:31 No.108499696

>>108499677
i will see if it can do small coding tasks with hermes or something

Anonymous
04/01/26(Wed)05:48:13 No.108499698

Anonymous 04/01/26(Wed)05:48:13 No.108499698

>tfw already merged in niggerganov's rotation branch in my local branch
eheheh im devilish and already using q8_0 baby :))))

Anonymous
04/01/26(Wed)05:54:08 No.108499713

Anonymous 04/01/26(Wed)05:54:08 No.108499713

>>108499624
it's ACTUALLY OUT NOW !!!
https://hf.co/deepseek-ai/DeepSeek-V4

Anonymous
04/01/26(Wed)05:54:39 No.108499717

Anonymous 04/01/26(Wed)05:54:39 No.108499717

>>108499713
Holy shit !!!
thank you !

Anonymous
04/01/26(Wed)05:54:53 No.108499720

Anonymous 04/01/26(Wed)05:54:53 No.108499720

>>108499698
>>108499696
>>108499690
>>108499678
am i retarded. doesnt work on ik_llama or llama.cpp

Anonymous
04/01/26(Wed)05:55:07 No.108499721

Anonymous 04/01/26(Wed)05:55:07 No.108499721

Dense sex

Anonymous
04/01/26(Wed)05:55:38 No.108499724

Anonymous 04/01/26(Wed)05:55:38 No.108499724

>>108499720
can you even read a model card fucking retard? what's your shade?
retard.

Anonymous
04/01/26(Wed)05:56:30 No.108499728

Anonymous 04/01/26(Wed)05:56:30 No.108499728

>>108499720
yes, you are

Anonymous
04/01/26(Wed)06:03:59 No.108499744

Anonymous 04/01/26(Wed)06:03:59 No.108499744

File: 1764929239430432.jpg (899 KB, 3840x2160)

899 KB JPG

what if i want to erp... nicher topics? i have 16gb vram 32gb ram and i don't think mistral nemo 2407 gets what i'm asking of it

Anonymous
04/01/26(Wed)06:04:38 No.108499747

Anonymous 04/01/26(Wed)06:04:38 No.108499747

>>108499744
As if there was a some way to combine vram and ram together...

Anonymous
04/01/26(Wed)06:07:04 No.108499757

Anonymous 04/01/26(Wed)06:07:04 No.108499757

>>108499747
i know koboldcpp just does that for me automatically, but i don't keep up with model releases at all so i don't know which bigger one to choose

Anonymous
04/01/26(Wed)06:09:47 No.108499766

Anonymous 04/01/26(Wed)06:09:47 No.108499766

>>108499419
It's probably distillation using trillions of tokens, not PTQ with a little finetuning.

Anonymous
04/01/26(Wed)06:16:36 No.108499786

Anonymous 04/01/26(Wed)06:16:36 No.108499786

>>108499744
you want to be the man behind that door about to get gangraped by Miku clones? I think most of the uncensored/heretic models will do that just fine

Anonymous
04/01/26(Wed)06:24:31 No.108499811

Anonymous 04/01/26(Wed)06:24:31 No.108499811

>>108499713
lol they really did release on april fools

Anonymous
04/01/26(Wed)06:27:02 No.108499816

Anonymous 04/01/26(Wed)06:27:02 No.108499816

>>108499811
i'm downloading it now !
70% left, even if i may only run it at 2t/s i can't wait !!!

Anonymous
04/01/26(Wed)06:28:28 No.108499826

Anonymous 04/01/26(Wed)06:28:28 No.108499826

File: cars!.png (24 KB, 596x904)

24 KB PNG

1bit power

Anonymous
04/01/26(Wed)06:29:10 No.108499827

Anonymous 04/01/26(Wed)06:29:10 No.108499827

>>108499826
it made a road instead

Anonymous
04/01/26(Wed)06:29:18 No.108499828

Anonymous 04/01/26(Wed)06:29:18 No.108499828

>>108499786
Don't kid yourself. LLMs can't even keep track of one character let alone 7

Anonymous
04/01/26(Wed)06:32:13 No.108499837

Anonymous 04/01/26(Wed)06:32:13 No.108499837

>>108499828
just run 14 llms in sli each handling 0.5 characters

Anonymous
04/01/26(Wed)06:33:18 No.108499842

Anonymous 04/01/26(Wed)06:33:18 No.108499842

File: waifu.png (26 KB, 614x501)

26 KB PNG

>>108499826

Anonymous
04/01/26(Wed)06:34:01 No.108499846

Anonymous 04/01/26(Wed)06:34:01 No.108499846

>llms don't know what's a blimp train
It's over. They think it's blimps connected together.

Anonymous
04/01/26(Wed)06:34:14 No.108499849

Anonymous 04/01/26(Wed)06:34:14 No.108499849

How do you guys keep track of stats in sillytavern?

Anonymous
04/01/26(Wed)06:35:19 No.108499850

Anonymous 04/01/26(Wed)06:35:19 No.108499850

>>108499842
can it roleplay at all?

Anonymous
04/01/26(Wed)06:35:52 No.108499852

Anonymous 04/01/26(Wed)06:35:52 No.108499852

https://www.youtube.com/watch?v=4rWnitE9RYM
this shit is so fast damn

Anonymous
04/01/26(Wed)06:37:34 No.108499859

Anonymous 04/01/26(Wed)06:37:34 No.108499859

File: 1765119110978679.png (676 KB, 1592x1296)

676 KB PNG

>>108499852
isn't it a bit sus that it outputs exactly the same thing as the fp16 one?

Anonymous
04/01/26(Wed)06:42:13 No.108499877

Anonymous 04/01/26(Wed)06:42:13 No.108499877

1bit model seems completely retarded but I have never touched a model that is only 8b.

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.