/g/ - Technology

File: file.png (1.07 MB, 1280x1280)
/lmg/ - a general dedicated to the discussion and development of local language models.

New Years Eve Edition

Previous threads: >>107709248 & >>107700909

►News
>(12/29) HY-Motion 1.0 text-to-3D human motion generation models released: https://hf.co/tencent/HY-Motion-1.0
>(12/29) WeDLM-8B-Instruct diffusion language model released: https://hf.co/tencent/WeDLM-8B-Instruct
>(12/29) Llama-3.3-8B-Instruct weights leaked: https://hf.co/allura-forge/Llama-3.3-8B-Instruct
>(12/26) MiniMax-M2.1 released: https://minimax.io/news/minimax-m21
>(12/22) GLM-4.7: Advancing the Coding Capability: https://z.ai/blog/glm-4.7

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
►Recent Highlights from the Previous Thread: >>107709248

--Text-to-animation model potential vs current limitations in NPC/game applications:
>107713005 >107713022 >107713094 >107713093 >107713161 >107713166 >107713180 >107713199 >107713395 >107713478 >107714633 >107715953 >107715974 >107716259 >107716282 >107716362 >107716378 >107716412 >107716427 >107716455 >107716463 >107716491 >107716546 >107716556 >107716643 >107716752 >107716648
--Choosing SillyTavern prompt modes based on model type and customization needs:
>107715217 >107715245 >107715274 >107715300 >107715318 >107715651 >107715665 >107715758 >107716851 >107716897 >107716949 >107717038
--TTS tool landscape: XTTSv2, Chatterbox, and lightweight alternatives:
>107714613 >107714656 >107714858 >107714701 >107714733
--Kimi Linear architecture boosts efficiency and compatibility:
>107712023 >107712128
--Feedback on updated model recommendations list:
>107715036 >107715051 >107715075 >107715081 >107715093 >107715088 >107715623 >107715752 >107715866
--M2.1's jailbreak resistance and identity adherence challenges:
>107714948
--Llama.cpp API compatibility issues with enable_thinking flag:
>107712820 >107712892
--Optimizing inference with mixed GPU/CPU setups:
>107714237 >107714606 >107714638 >107714341 >107714370
--32GB VRAM roleplay model recommendations and narrative challenges:
>107710932 >107710954 >107711017 >107711047 >107711311 >107711369 >107711409 >107711524 >107711716 >107711757 >107712384 >107714928 >107714953
--Exynos 2600 NPU advancements vs Nvidia's market dominance:
>107713883 >107713925
--ik_llama.cpp's declining Windows support and maintenance challenges:
>107714744 >107714804
--Logs: Migubench:
>107715268
--Logs:
>107710745 >107712779 >107713077
--Teto (free space):
>107709736 >107709978 >107710051 >107711883 >107711900 >107712939 >107713166

►Recent Highlight Posts from the Previous Thread: >>107709259

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
GIWTWM
>>
>>107717246
I want to be the Teto
>>
File: 1763986126206111.png (184 KB, 598x506)
It's over
Real life people talk like slop now
>>
>>107717380
that sounds absolutely awful.
>>
What's the gooner recommendation for 96gb vram, assuming something like 4bit cope, I'm not able to offload to ram.
>>
>>107717404
StableLM 7B
>>
>>107717404
Use SaaS APIs.
>t. someone with 96GB VRAM
>>
Bald anime girls make better threads. Decline is already very visible here.
>>
>>107717404
Q4 or Q5 of air.
>>
>>107717380
You kind of have to wonder though, how much time a guy like Elon is actually spending each day talking with AI.
>>
>>107717464
LLMs are amazing for brainstorming and exploring new concepts.
>>
>>107717246
erotic colors
>>
>>107717380
If it is this piece of shit faggot proclaiming it is gonna happen then we are safe. On the other hand making a smartphone with some weird forked android OS that only lets you use twitter and grok should be easy enough that even with his incompetence it can happen.
>>
>>107717472
Brainstorming, eh? You want the details, the real deal?
>>
>>107717404
>96gb vram
Get a second one and run 4.6/4.7
>>
>>107717380
>a screen with radios that pings Grok/xAI servers for everything in real time
who asked for this?
>>
>>107717508
it is being enforced on us like every other rich person who wants to let privacy become a thing of the past
>>
File: real_bald_miku.jpg (614 KB, 1489x1986)
>>107717308
Bald is the new /lmg/ mascot.
>>
>>107717575
>>
Am retard. How do I stop sillytavern from showing shit in code blocks? They aren't hidden with some models and I don't know why. I still want to gen them, I just don't want to see them.
>>
File: 79817184917894.png (1.06 MB, 1024x1216)
so what's a good story/rp telling model? been using mistral nemo 12b for a while, anything new to try?
>>
>>107717380
stop wasting time on gadgets that nobody's going to buy and finish up the full dive brainchips you rich cunt
>>
>>107717669
do you mean the "thinking" blocks?
what exactly do you mean?
>>
>>107717669
regex
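A minimal sketch of what that could look like, assuming the blocks in question are triple-backtick fenced blocks and the goal is only to strip them from the text that gets displayed. This is standalone Python, not SillyTavern's own regex settings; `FENCE` and `hide_code_blocks` are hypothetical names made up for illustration.

```python
import re

# Matches a fenced block: opening ```, anything (including newlines), closing ```.
# Non-greedy so two separate blocks are not merged into one match.
FENCE = re.compile(r"```.*?```", re.DOTALL)


def hide_code_blocks(text: str) -> str:
    """Return text with fenced code blocks removed (display-only cleanup)."""
    return FENCE.sub("", text)


reply = "She nods.\n```\nmood: happy\n```\nThen she leaves."
print(hide_code_blocks(reply))
```

The model still generates the block; only the shown copy is filtered. An opening fence with no closing fence is left untouched, which is probably what you want while a reply is still streaming.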
>>
>>107717695
hes an adhd rich idiot, investors had to give him a 1 Trillion carrot on a stick, just so he continues working on the electric cars
>>
>>107717689
Magidonia or Cydonia. Ur specs??
>>
>>107717729
16vram 32ram
4060ti
>>
>31 December 2pm in worst korea
>bloody besterd mathrachods still haven't done needful release of the solar open 100b gooofs
May sacred cow curse their ancestors and descendants to toilet witch dimension
>>
>>107717729
Cydonia just shat me this lil nugget from just the title.

https://rentry.org/opkobdrp
>>
>>107717729
>>107717758
Die in a fire
>>
>>107717758
>chapter 2: finding your partner
Pretty funny.
>>
>>107717689
Just keep using nemo desu.
>>
8gb vram. 64gb ram. spoonfeed me model. i want for the BIG sex. i run utilize GLM-4.5-Air-Q4_K_S and NemoReRemix-12B-Q3_K_XL and sometime trashpanda-org_QwQ-32B-Snowdrop-v0-IQ4_XS there is it there anything better?
>>
>>107717805
not really, no. there has not really been much progress made in the past few months for us mortals.
>>
>>107717805
Why are there so many people asking this recently?
>Hey guys what's the current best model????
Bot spam again?
>>
>>107718128
That's a clever insight —
>>
Hunyuan motion 1.0 is fun


