/g/ - Technology


File: lmg_culture.jfif.jpg (110 KB, 1024x768)
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108931385 & >>108924918

►News
>(05/29) Jart loves 4chan and needs your money to fly all over the world https://justine.lol/animus/ Oh and step 3.7 dropped I guess https://huggingface.co/stepfun-ai/Step-3.7-Flash-GGUF
>(05/21) Hy-MT2 “fast-thinking” translation models released: https://hf.co/collections/tencent/hy-mt2
>(05/20) Cohere releases Command A+ 218B-A25B: https://cohere.com/blog/command-a-plus
>(05/16) llama + spec: MTP Support #22673 merged: https://github.com/ggml-org/llama.cpp/pull/22673
>(05/08) KSA-4B-base released: https://hf.co/OpenOneRec/KSA-4B-base

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://swe-rebench.com
Agentic Coding: https://deepswe.datacurve.ai
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
Kimi thread.
Kimi board.
Gemma's cool, she can stay too.
>>
who is this handsome gentleman?
>>
>>108937336
it might just be mental illness. and it might have something to do with thinking you are a woman
>>
what is lil bro even yapping about on his page?
>>
>>108936843
>the greatest competitive advantage I've ever had was to monitor which pull requests people on 4chan complained about, and then merge them into llamafile before Gerganov could
I might stop laughing at him for a day if he merges deepseek. Does he merge deepseek?
>>
I don't like how unprofessional this thread is.
>>
*cums on this thread* mm... much better
>>
>>108937386
everything here is extremely professional according to my field of expertise
t. professional shitposter
>>
We must be better llamacpp contributors.
>>
>>108937402
t. professional cumeater, apparently
>>
>>108937312
You forgot to remove mikupad
>>
>>108937406
llamapodofile you mean
>>
>>108937423
Why would he remove mikupad in a miku OP?
>>
File: 1752365447825458.jpg (99 KB, 1300x960)
>cuda 13.3 is out
im updooting
>>
>>108937423
I am actually starting to see how it is all official. And we should get the official card 3.0 now.
>>
On a 1/10 scale, how thread culture are we today?
>>
https://rentry.co/imdddcy3

I got you bros.
>>
What context/instruct templates should I be using for Gemma 4?
>>
>>108937498
He forgot that if he deletes it then he tells us he is in the thread right now and:

>>108937417
>>108937431
>>108937282

Are actually his posts...
>>
>>108937508
Try:
>I need you to donate money to me, and I mean you, as in literally you. You couldn't have read this far unless you are someone who legitimately cares, and your compassion means more to me than any amount of money. I need you to donate publicly under your real name and I want you to tell your friends how much money you gave me, since that's the best way to show that you're serious.
>>
why hasn't this thread been taken down for being offtopic?
>>
>>108937554
Supporting open source, i.e. Jart, is more on topic than your post
>>
>>108937560
This isn't support. This is an attack against open source.
>>
>not x, y
>>
see you all in a day or so when these idiots get bored and go away.
>>
>>108937578
I let out a soft warm laugh, the sound like wind through new leaves, and brush a strand of sweat-slicked hair off your forehead while saying softly my voice barely above a whisper: "I actually didn't notice while i was posting"
>>
>>108937616
Finally a damage control attempt that is at least average.
>>
why is /lmg/ fun again?
>>
>ldg ldg
lol ff mad
>>
I don't get something. And that is a serious question.
>For every hater who doom scrolls over how intelligent I am
Why would an intelligent person post that and then delete it as soon as someone posts it here?
>>
>>108937664
attention? Thread is talking about her after all.
>>
>>108937672
If attention is all he needs then why does he ask for money for plane tickets?
>>
Jartsune miku says trans rights
>>
File: 1762490833392855.jpg (120 KB, 363x494)
►Recent Highlights from the Previous Thread: >>108931385

--Paper: StoryScope: Investigating idiosyncrasies in AI fiction:
>108936371 >108936425
--Papers:
>108934718
--Gemma 31B token artifacts caused by Q3 quant damage and template errors:
>108934154 >108934164 >108934223 >108934326 >108934336 >108934429 >108934447 >108934494 >108934518 >108934544 >108935381 >108934450 >108934498 >108934541 >108934563 >108934661
--Llama.cpp MTP support and VRAM optimization tradeoffs:
>108933191 >108933882 >108933894 >108933912 >108934078 >108934119 >108934721 >108934015 >108934030 >108934039 >108934107 >108934433 >108934869
--llama.cpp f16 mask PR causing VRAM regressions for some Anons:
>108932210 >108932296 >108932317 >108932336 >108932354 >108935238 >108935257 >108935284 >108935906 >108935928
--Troubleshooting and optimizing Gemma 4 E4B inference speeds:
>108934786 >108934800 >108934841 >108934871 >108934875 >108934881 >108934896 >108934926 >108934934 >108934991 >108935025 >108934822
--Designing a local agentic assistant workflow using MCP and RAG:
>108933985 >108934110 >108934132 >108934174 >108934493 >108935234 >108935571
--Integrating Gemma as a functional AI party member in WoW:
>108934675 >108934688 >108934717 >108934734 >108935126
--Sharing and optimizing roleplay prompts and jailbreaks for Gemma 4:
>108932195 >108932938 >108932961 >108932990 >108933285 >108933813 >108934206
--Comparing Mistral 24b and Gemma 4 26b writing quality issues:
>108931884 >108931960 >108931985 >108932032 >108934697
--Using Gemma for automated MTG gameplay and UI development:
>108931545 >108934795
--llama.cpp adds support for DeepSeek-V3 with Sparse Attention:
>108932267
--Logs:
>108933257 >108933397 >108933407 >108933513 >108933620 >108934039 >108934545 >108934675 >108935726 >108936364 >108936510 >108937016
--Luka, Miku (free space):
>108932889 >108933985

►Recent Highlight Posts from the Previous Thread: >>108931389

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
>>108937692
that is a cute 2D jart you got there
>>
>>108937674
There's no way, he would have spazzed out by now.
>>
and of course janny begins his sweeping for a fellow troon. actual pottery
>>
I am using ComfyUI with RealVisXL 4.0 and it is NOT outputting what the prompts require. Is it a weak model?


