/g/ - Technology

File: llm.png (508 KB, 1600x856)
can anyone explain in layman terms how LLMs of today differ from shit like autosuggest, the Akinator web genie or Siri etc from the past
is it not fundamentally the same tech, just super scaled up to be mega inefficient and brute forced?
>>
>>108512057
Scale. It's amazing what you can do when you throw enough hardware at the problem.
>>
>>108512057
Akinator is a whole different beast.
>>
I watched an intro from a course by Andrew Ng and his point was kind of >>108512401. When you have lots of data the magic starts to happen, but there was also a breakthrough with the invention of the transformer architecture. LLMs aren't just looking at the previous n words to calculate the next one, they look at the "context" (which is the layman term for attention) of the conversation and use a previously built mathematical construct of meaning (turning words into numbers and then creating an invisible layer of connections during training), which can then be used to predict the next word. Also Akinator was just a complex decision tree, was it not?
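The "look at the whole context" part can be sketched in a few lines. This is a toy scaled dot-product self-attention in numpy — toy sizes, a single head, and no learned projections, so a sketch of the idea rather than a real model (which uses thousands of dimensions and many heads and layers):

```python
import numpy as np

def attention(q, k, v):
    # score every token against every other token in the context
    scores = q @ k.T / np.sqrt(k.shape[-1])
    # softmax: turn scores into per-token weights that sum to 1
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    # each output vector is a weighted mix of ALL value vectors,
    # i.e. the token "attends" to the whole context at once
    return w @ v

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))  # 5 tokens, each a "word turned into numbers"
out = attention(x, x, x)     # self-attention over the 5-token context
print(out.shape)             # (5, 8): one mixed vector per token
```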
>>
>>108512487
>>108512436
so what was the innovation that pushed this slop into being so profitable and "Revolutionary", the whole context thing?
>>
>>108512493
Yes and no. Yes, the revolution happened because of big data + the transformer architecture, but no, it's not so "profitable". It's pretty fucking expensive and they are currently losing lots of money to stay competitive while praying to God that the competition just can't keep up... eventually.
>>
>>108512526
by profitable i mean it's still profitable for them to scam investors with the fake promises, i know they lose money due to the inefficiency
but if this is really how llms work, that they always have to rely on mega data and brute force, won't they always stay inefficient and unprofitable even with economies of scale?
>>
>>108512493
>what was the innovation
There were multiple innovations plus simple brute force at scale. It's nowhere near being profitable, and it's still uncertain how useful it will turn out to be. A lot of AI output is solidly in the uncanny valley territory right now (e.g. the ballyhooed AI-written C compiler that can supposedly compile Linux but couldn't compile a simple hello world program), and it's uncertain if they'll be able to pull out of it.
>>
>>108512526
>they are currently losing lots of money to stay competitive while praying to God that the competition just can't keep up... eventually.
Which btw was Uber's prime strategy during the 2010s (and usually is the whole startup game). They drove prices down so aggressively that it made the competition look like a bunch of kids playing tic-tac-toe, unable to compete, while of course expanding like maniacs.
Unlike Uber, though, it's not that hard to catch up when you are an AI company, hence the famous leaked Google memo, "We have no moat, and neither does OpenAI". You can even train on your competitors' tokens (a practice called distillation, which Anthropic recently tried to poison).
Btw that's why Altman and co were seething and shitting themselves in fear, calling for AI safety and talking about the end of the world, the risk of AGI etc. They were trying to regulate the market so they could curb-stomp any other companies trying to get in.
>>
>>108512535
>even with economy of scale won't they always stay inefficient and unprofitable?
We are in a phase of the tech, as with almost any tech race, where they are burning crazy capex to stay relevant and competitive. There's nothing saying that after the competition dies out (which sort of happened to companies like Mistral, and I think DeepSeek seems to be in deep shit too) and they get their exit, the engineers won't go gaga on optimizing everything to make it cheaper to run.
Google did exactly that recently. I can't give you the details, but they made context window tokens fairly compact in a recent paper, something like that.
So my point is, they just need to set their minds to it, but right now is not the time yet. Or at least that's my reading.
>>
>>108512590
>engineers
>>
>>108512057
basically same but more layers, more parameters
>>
>>108512487
Attention is certainly the innovation that made the current AI boom possible, but it is also quite literally a pure function of the last n words/tokens, where n is often (but not always) the entire context size.
It basically still is just calculating the next word/token as a function of the previous words/tokens, but done so in a clever way and with a metric bloatload of parameters compared to the size of neural networks from the previous AI booms.
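That "pure function of the previous tokens" is easiest to see at n = 1, i.e. the old autosuggest tech from the OP. A toy sketch with a made-up corpus (nothing real assumed):

```python
from collections import Counter

corpus = "the cat sat on the mat the cat ate".split()

def bigram_next(prev_word):
    # pure function: output depends only on the single previous word,
    # via the follower counts baked in from the corpus
    followers = Counter(b for a, b in zip(corpus, corpus[1:]) if a == prev_word)
    return followers.most_common(1)[0][0] if followers else None

print(bigram_next("the"))  # "cat": the most frequent follower of "the"
```

An LLM is the same kind of function in principle, just over the whole window and with billions of learned parameters instead of raw counts.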
>>
>>108512057
I can smell the tears behind this post
>>
>>108512057
LLM was first conceived of in like 1800s anon. Nothing is new with the AI tech, it's just that now in the year 2026 we have more powerful memory and other hardware
>>
>>108512057
Scale, and the emergent behavior that results from it. While it's still a neural network, it's so large that a lot of what goes into inference and such gets obfuscated.
>>
>>108513461
>LLM was first conceived of in like 1800s
"Yo what if there were a talking machine"
Wow I just conceived of LLM
>>
>>108512057
LLMs are basically the same thing as text prediction and markov chains, but they can look at the whole context, rather than a handful of previous words.
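For comparison, the whole of that older tech fits in a few lines: an order-1 Markov chain conditions on exactly one previous word, versus an LLM's full context window. A toy sketch with an assumed corpus:

```python
import random
from collections import defaultdict

text = "the cat sat on the mat and the cat slept on the mat".split()
chain = defaultdict(list)
for a, b in zip(text, text[1:]):
    chain[a].append(b)           # transitions learned from one-word windows

random.seed(1)
word, out = "the", ["the"]
for _ in range(5):
    word = random.choice(chain[word])  # next word depends ONLY on current word
    out.append(word)
print(" ".join(out))             # locally plausible, globally incoherent
```

Every step forgets everything except the current word, which is exactly why Markov text drifts into nonsense while an attention model can stay on topic.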
>>
>>108513435
>a pure function
For example the Game of Life: its rule set is also a pure function, yet the system has the power of a Turing machine.
In the case of an LLM, let's say it was trained with these rules:
>A leads to C, C+B leads to D
>A and D are incompatible
Given an input containing A and B, the LLM starts *reasoning* and generates C, then D. Then it realizes A+B are incompatible - a rule it was never directly trained on - and starts looking at alternative reasoning lines like A+E, A+F etc.
"muh token predictor" is like the most artfag cope ever, I don't believe anyone who studied CS would fall for this shit
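The Game of Life point is easy to make concrete: the whole rule set is one pure function of the grid, and iterating that function is Turing-complete. A minimal sketch:

```python
from collections import Counter

def step(live):
    """One generation of Conway's Game of Life; `live` is a set of (x, y) cells."""
    # count, for every position, how many live neighbours it has
    counts = Counter(
        (x + dx, y + dy)
        for x, y in live
        for dx in (-1, 0, 1)
        for dy in (-1, 0, 1)
        if (dx, dy) != (0, 0)
    )
    # birth on exactly 3 neighbours, survival on 2 or 3
    return {cell for cell, n in counts.items()
            if n == 3 or (n == 2 and cell in live)}

blinker = {(0, 0), (1, 0), (2, 0)}    # horizontal line of three cells
vertical = step(blinker)              # flips to a vertical line
print(step(vertical) == blinker)      # True: a period-2 oscillator
```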


