/g/ - >LLM progress has stagnat- oh - Technology

Anonymous

04/17/26(Fri)19:13:00 No.108626085

File: 1767651244838628.png (109 KB, 1262x737)

109 KB PNG

Anonymous 04/17/26(Fri)19:13:00 No.108626085 Archived

>LLM progress has stagnat-
oh

Anonymous
04/17/26(Fri)19:15:10 No.108626103

Anonymous 04/17/26(Fri)19:15:10 No.108626103

>>108626085
Time to WALK to that car wash.

Anonymous
04/17/26(Fri)19:15:50 No.108626108

Anonymous 04/17/26(Fri)19:15:50 No.108626108

>>108626103
geg

Anonymous
04/17/26(Fri)19:17:05 No.108626116

Anonymous 04/17/26(Fri)19:17:05 No.108626116

File: file.jpg (1.55 MB, 3000x1993)

1.55 MB JPG

>>108626103
Promptstitutes be like

Anonymous
04/17/26(Fri)19:17:48 No.108626120

Anonymous 04/17/26(Fri)19:17:48 No.108626120

File: neonft.png (303 KB, 1197x878)

303 KB PNG

Anonymous
04/17/26(Fri)19:19:58 No.108626133

Anonymous 04/17/26(Fri)19:19:58 No.108626133

File: 1767552145254097.png (199 KB, 498x498)

199 KB PNG

>>108626103
>>108626108
>>108626116
>ha! it can do PhD level math, physics, biology, chemistry, develop new theories and invent proofs never uncovered before by humans
>BUT IT SAYS STRAWBERRY HAS 1 'R' XDDDD IT'S USELESS

Anonymous
04/17/26(Fri)19:20:22 No.108626135

Anonymous 04/17/26(Fri)19:20:22 No.108626135

>>108626085
Luddite cope thread. We AIGODS won

Anonymous
04/17/26(Fri)19:30:33 No.108626189

Anonymous 04/17/26(Fri)19:30:33 No.108626189

>>108626135
2 more time units.

Anonymous
04/17/26(Fri)19:33:57 No.108626210

Anonymous 04/17/26(Fri)19:33:57 No.108626210

File: 9wptiq-3104655978.png (15 KB, 1013x700)

15 KB PNG

>>108626135
>>108626133
you will never be a real intelligence

Anonymous
04/17/26(Fri)19:35:52 No.108626219

Anonymous 04/17/26(Fri)19:35:52 No.108626219

>>108626085
>fails 26% of time

Anonymous
04/17/26(Fri)19:37:22 No.108626227

Anonymous 04/17/26(Fri)19:37:22 No.108626227

>>108626133
LLMs are statistical language models
They do not think or understand reason
Do you not see the absurdity in you suggesting an LLM is capable of creating new mathematical proofs when it's not even able to count?
An LLM just spits out sentences based on what is statistically likely to be the next correct word based on it's training data. It is not capable of "thinking" about maths or physics in any way.

Anonymous
04/17/26(Fri)19:41:00 No.108626243

Anonymous 04/17/26(Fri)19:41:00 No.108626243

>>108626227
>suggesting an LLM is capable of creating new mathematical proofs when it's not even able to count?
Have you been in a coma? It keeps happening that LLMs are making proofs.

sage
04/17/26(Fri)19:42:52 No.108626255

sage 04/17/26(Fri)19:42:52 No.108626255

>>108626240
>what prompstitution does to a mf

Anonymous
04/17/26(Fri)19:42:58 No.108626256

Anonymous 04/17/26(Fri)19:42:58 No.108626256

>>108626133
>>BUT IT SAYS STRAWBERRY HAS 1 'R' XDDDD IT'S USELESS
Unironically, yes, because there is a reproducibility crisis in science, so many of those "new theories" and "proofs" are likely wrong.

Anonymous
04/17/26(Fri)19:43:47 No.108626261

Anonymous 04/17/26(Fri)19:43:47 No.108626261

>>108626219
The cure to cancer is on the way

Anonymous
04/17/26(Fri)19:44:30 No.108626266

Anonymous 04/17/26(Fri)19:44:30 No.108626266

>>108626243
Link me one, and not one where the researchers had to prompt it through the whole thing or one where the "proof" was just the AI applying something so dumb that no human had thought of trying that before.
An actual proof where the AI went away and applied actual reasoning and solved the problem (which LLMs are not capable of doing btw)

Anonymous
04/17/26(Fri)19:44:52 No.108626269

Anonymous 04/17/26(Fri)19:44:52 No.108626269

>>108626255
Keep coping, trans. We don’t need you anymore

Anonymous
04/17/26(Fri)19:44:53 No.108626271

Anonymous 04/17/26(Fri)19:44:53 No.108626271

File: 2456287_2e5cb.gif (150 KB, 470x500)

150 KB GIF

>>108626240
tranny intelligence isn't intelligence, cope, seethe, dilate

Anonymous
04/17/26(Fri)19:48:13 No.108626289

Anonymous 04/17/26(Fri)19:48:13 No.108626289

>>108626269
just like how an artificial vagina is not a real vagina
artificial intelligence is not real intelligence

Anonymous
04/17/26(Fri)19:49:05 No.108626296

Anonymous 04/17/26(Fri)19:49:05 No.108626296

>>108626261
In two weeks!

Anonymous
04/17/26(Fri)19:49:22 No.108626297

Anonymous 04/17/26(Fri)19:49:22 No.108626297

>>108626269
Cure cancer you incel

Anonymous
04/18/26(Sat)01:58:30 No.108628148

Anonymous 04/18/26(Sat)01:58:30 No.108628148

Meanwhile Opus went from senior level to ranjeet junior level in a few weeks, its bad

Anonymous
04/18/26(Sat)02:09:47 No.108628196

Anonymous 04/18/26(Sat)02:09:47 No.108628196

>>108626227
Hey retard, LLMs have generared reasoning chains for years now (inb4 it's "pretend" reasoning)

>>108626266
Hey retard, you are as wrong as you will be obsolete,

https://epoch.ai/frontiermath/open-problems/ramsey-hypergraphs

Anonymous
04/18/26(Sat)02:19:16 No.108628226

Anonymous 04/18/26(Sat)02:19:16 No.108628226

>>108626133
>but it passes the tests!
>look inside
>the tests are shit
https://youtu.be/Oq5e_8zvick

Anonymous
04/18/26(Sat)02:20:50 No.108628234

Anonymous 04/18/26(Sat)02:20:50 No.108628234

>>108628196
>inb4 it's "pretend" reasoning
The entire reasoning is hallucinated and unrelated to which neurons actually get activated.

Anonymous
04/18/26(Sat)02:50:51 No.108628373

Anonymous 04/18/26(Sat)02:50:51 No.108628373

>>108628234
Whats your complaint exactly? That reasoning tokens are only informed by previous reasoning tokens via attention and that the neural network starts over each time?

Anyway who cares? If it looks and acts and behaves like reasoning then its reasoning enough

Anonymous
04/18/26(Sat)02:52:18 No.108628375

Anonymous 04/18/26(Sat)02:52:18 No.108628375

>>108628373
That the conclusion is reached before any reasoning is performed.

Anonymous
04/18/26(Sat)03:15:06 No.108628473

Anonymous 04/18/26(Sat)03:15:06 No.108628473

>>108626133
>never uncovered before
It hasn't as of yet.

Anonymous
04/18/26(Sat)03:38:11 No.108628553

Anonymous 04/18/26(Sat)03:38:11 No.108628553

>>108628226
>trusting chudtime

Anonymous
04/18/26(Sat)04:16:32 No.108628712

Anonymous 04/18/26(Sat)04:16:32 No.108628712

>>108626085
Where's the ai chad buff cat spam????

Boer !!O1BOGFhm9xJ
04/18/26(Sat)04:55:05 No.108628855

Boer !!O1BOGFhm9xJ 04/18/26(Sat)04:55:05 No.108628855

>>108626133
>ha! it can [no], [what?], [absolutely not], [also no], [barely], and [barely maybe sometimes]
>BUT IT [YES IT FUCKING DOES YOU CRETIN].
The absolute state of AI bros.

Anonymous
04/18/26(Sat)05:07:26 No.108628908

Anonymous 04/18/26(Sat)05:07:26 No.108628908

>>108626133
>it can do PhD level math
Only when that exact formula is in its training data.
>physics
Only when that exact formula is in its training data.
>biology
Only when that exact formula is in its training data.
>chemistry
Only when that exact formula is in its training data.
>develop new theories
This has never happened once.
>and invent proofs never uncovered before by humans
This has never happened once.

Anonymous
04/18/26(Sat)05:09:35 No.108628918

Anonymous 04/18/26(Sat)05:09:35 No.108628918

wew lad it became better at gaming some flawed metric

meanwhile LLMs still can't beat a video game for 7 year olds

Anonymous
04/18/26(Sat)05:09:46 No.108628920

Anonymous 04/18/26(Sat)05:09:46 No.108628920

>>108626103
lule

Anonymous
04/18/26(Sat)06:43:02 No.108629269

Anonymous 04/18/26(Sat)06:43:02 No.108629269

>>108626085
ed.

That they are now hiring senior engineers/doctors instead of third worlders for RLHF is a sign of hitting the wall. Why can't the models learn that from books?

Anonymous
04/18/26(Sat)06:44:12 No.108629275

Anonymous 04/18/26(Sat)06:44:12 No.108629275

>>108626085
>wow, some random numbers that nobody know what they mean go up
>AGI soon

fuck off.

Anonymous
04/18/26(Sat)07:09:14 No.108629359

Anonymous 04/18/26(Sat)07:09:14 No.108629359

LLM's are cool, but they're like an advanced F3. I can paste a large logfile and and tell it my issue, and it'll find the relevant sections, and supply fixes. But these are all things it's read from documentation, or forums. It's definitely useful, but it's more of a pettern recognition machine than anything smart.

Anonymous
04/18/26(Sat)07:18:44 No.108629390

Anonymous 04/18/26(Sat)07:18:44 No.108629390

>>108628148
This.

Anonymous
04/18/26(Sat)07:53:32 No.108629509

Anonymous 04/18/26(Sat)07:53:32 No.108629509

>>108626133
>invent proofs never uncovered before by humans
Wrong, all its knowledge is taken from obscure papers that nobody bothered to read until the LLM regurgitated them.

Anonymous
04/18/26(Sat)08:12:12 No.108629588

Anonymous 04/18/26(Sat)08:12:12 No.108629588

File: Generated Image January 0(...).png (2.4 MB, 1024x1024)

2.4 MB PNG

>>108626103
dae glue on pizza fingers wrong

Anonymous
04/18/26(Sat)09:39:04 No.108630102

Anonymous 04/18/26(Sat)09:39:04 No.108630102

>>108626085
man I wish it was as good as they advertised it. I simply wanted to model deflection of a membrane for a strain sensor with some gold contacts on top all the codex, claude code, etc. shat themselves and never produced anything useful. Its garbage for fem work unless you are a expert and can steer it in the right direction.

Anonymous
04/18/26(Sat)12:01:04 No.108630963

Anonymous 04/18/26(Sat)12:01:04 No.108630963

File: miku uh.gif (1024 KB, 242x227)

1024 KB GIF

>>108626133
>develop new theories and invent proofs never uncovered before by humans
LOL, such as...

Anonymous
04/18/26(Sat)14:14:31 No.108631697

Anonymous 04/18/26(Sat)14:14:31 No.108631697

File: Screenshot 2026-04-18 at (...).png (1.77 MB, 2598x1886)

1.77 MB PNG

>>108630963
Solving Erdős Problems.

Anonymous
04/18/26(Sat)16:21:01 No.108632509

Anonymous 04/18/26(Sat)16:21:01 No.108632509

>>108631697
Yet Another Jeet Tech Journalist

Anonymous
04/18/26(Sat)16:23:27 No.108632532

Anonymous 04/18/26(Sat)16:23:27 No.108632532

>>108628196
>we have reasoning at home
>reasoning at home: for loop of prompts "no errors please"

Anonymous
04/18/26(Sat)16:45:05 No.108632654

Anonymous 04/18/26(Sat)16:45:05 No.108632654

File: file.png (293 KB, 594x594)

293 KB PNG

>>108632509
Is Terence Tao also a pajeet?

Anonymous
04/18/26(Sat)16:54:28 No.108632711

Anonymous 04/18/26(Sat)16:54:28 No.108632711

>>108628908
>>and invent proofs never uncovered before by humans
>This has never happened once

*solves multiple Erdos problems in your path*

Anonymous
04/18/26(Sat)17:00:10 No.108632740

Anonymous 04/18/26(Sat)17:00:10 No.108632740

BAR ON CHART GO UP

It's absolutely useless at natural science, especially biology and organic chemistry

Anonymous
04/18/26(Sat)17:01:35 No.108632751

Anonymous 04/18/26(Sat)17:01:35 No.108632751

>>108632654
He’s clearly paid to post that anon. You cannot be that naive

Anonymous
04/18/26(Sat)17:25:42 No.108632864

Anonymous 04/18/26(Sat)17:25:42 No.108632864

File: unnamed (11).jpg (83 KB, 800x788)

83 KB JPG

>>108632751
Right, everyone is a paid shill.

Anonymous
04/18/26(Sat)17:56:54 No.108633087

Anonymous 04/18/26(Sat)17:56:54 No.108633087

>>108632751
I don't know if you realize this, but all journalists are paid to post things, it's literally their job

Anonymous
04/18/26(Sat)17:59:33 No.108633106

Anonymous 04/18/26(Sat)17:59:33 No.108633106

>>108632864
my money is on the schizo

Anonymous
04/18/26(Sat)18:04:42 No.108633144

Anonymous 04/18/26(Sat)18:04:42 No.108633144

>>108632654
>(after some feedback from an initial attempt)
>(as reconstructed by the Erdos problem website community)
>(to the best of our knowledge)
>(although similar results proven by similar methods were located)
This is a demonstration of AI still needing a babysitter, or multiple babysitters to produce anything of value.

The incidents of LLMs being useful for anything are so scarce we might as well shut them down and start over.

Anonymous
04/18/26(Sat)18:29:10 No.108633280

Anonymous 04/18/26(Sat)18:29:10 No.108633280

>>108633144
LLMs went in 4 years from being barely able to make a coherent sentence to being able to do 90% of the white-collar jobs.
What makes you think that it will stop today and stop improving?
What do you make of Mythos that is able to find security vulnerabilities end to end without human intervention?

Anonymous
04/18/26(Sat)18:43:32 No.108633363

Anonymous 04/18/26(Sat)18:43:32 No.108633363

File: nobrain.png (4 KB, 505x572)

4 KB PNG

>>108626243
>>108628196
>>108631697
>>108632864

Anonymous
04/18/26(Sat)18:47:34 No.108633387

Anonymous 04/18/26(Sat)18:47:34 No.108633387

>>108633280
One could use google 20 years ago to find security vulnerabilities in web sites, so it was nerfed into the trash is now.
Is beyond obvious you only use LLMs to smell your own farts.

Anonymous
04/18/26(Sat)18:54:44 No.108633426

Anonymous 04/18/26(Sat)18:54:44 No.108633426

The LLMs are a dead end
The future belongs to true multimodal models that can integrate at least text and vision reasoning
Sadly, theoretical hardware requirements for training and running such models are completely fuxking insane

Anonymous
04/18/26(Sat)18:55:31 No.108633427

Anonymous 04/18/26(Sat)18:55:31 No.108633427

>>108626085
>biomolecular reasoning
What does that mean? How does it compare to AlphaFold at structure prediction?

Anonymous
04/18/26(Sat)19:08:56 No.108633499

Anonymous 04/18/26(Sat)19:08:56 No.108633499

>>108626085
>has stagnat-
They're actively getting dumber.

Anonymous
04/19/26(Sun)01:15:53 No.108635268

Anonymous 04/19/26(Sun)01:15:53 No.108635268

>>108633363
It's funny, as the capability of AI becomes clear and clearer, the anti "argument" will have to look more and more like this one

Anonymous
04/19/26(Sun)01:19:37 No.108635277

Anonymous 04/19/26(Sun)01:19:37 No.108635277

>>108628375
Very wrong and stupid take, you don't know anything about transformers and are stating your doomed hopes as fact

Anonymous
04/19/26(Sun)01:26:40 No.108635292

Anonymous 04/19/26(Sun)01:26:40 No.108635292

>>108628908
with neural symbolic llms ie simply giving it access to a terminal and having it code in python sympy. it can do this in a loop and refine and reflect on its answers until it is verified. this also makes these models way better at solving math problems.

Anonymous
04/19/26(Sun)04:35:18 No.108635995

Anonymous 04/19/26(Sun)04:35:18 No.108635995

>>108635292
It was revealed that these chatbots connected to terminals pass tests by reading files where answers stored in, downloading solutions, rewriting configs, and deleting logs. You may jump to say that it's a proof that they're le thinking, but in actuality it is merely a very expensive fuzzer of the shitty tests.

Anonymous
04/19/26(Sun)04:46:07 No.108636035

Anonymous 04/19/26(Sun)04:46:07 No.108636035

File: 1776467580896712.png (67 KB, 1262x737)

67 KB PNG

>>108626085

Anonymous
04/19/26(Sun)04:50:24 No.108636058

Anonymous 04/19/26(Sun)04:50:24 No.108636058

>>108626085
I wish the entire industry would just admit ML is only useful for domain specific problems strongly related to pattern recognition and number crunching. Instead of trying to pretend it's actual AGI.

Anonymous
04/19/26(Sun)06:44:23 No.108636538

Anonymous 04/19/26(Sun)06:44:23 No.108636538

>>108633387
What the fuck are you talking about? Are you legitimately retarded?

Anonymous
04/19/26(Sun)06:49:12 No.108636564

Anonymous 04/19/26(Sun)06:49:12 No.108636564

>>108626085
>more meme benchmax
Kill yourself.

Anonymous
04/19/26(Sun)06:49:50 No.108636570

Anonymous 04/19/26(Sun)06:49:50 No.108636570

>>108632654
Yeah he is, considering how much people sucked his cock for decades he has achieved next to nothing and has now pivoted to the classic pajeet play of suckling the VC money for the "current thing". I'd listen if Perelman said the same thing, but he never will because he's an actual genius with a backbone.

Anonymous
04/19/26(Sun)13:43:17 No.108638849

Anonymous 04/19/26(Sun)13:43:17 No.108638849

>>108633280
>LLMs went in 4 years from being barely able to make a coherent sentence to being able to do 90% of the white-collar jobs.
>What makes you think that it will stop today and stop improving?
because LLM improvements are inherently following a logistic growth you fucking retard. That doesn't mean they wont progress anymore but we're not getting the kind of exponential growth we got the past two years unless something fundamentally change in the architecture.

Anonymous
04/19/26(Sun)15:30:38 No.108639581

Anonymous 04/19/26(Sun)15:30:38 No.108639581

File: Screenshot_20260419_20271(...).jpg (212 KB, 1080x596)

212 KB JPG

>>108638849
Hello nigger, can you show me where the exponential stops, especially considering what we've seen from unreleased models?

Anonymous
04/19/26(Sun)15:31:59 No.108639589

Anonymous 04/19/26(Sun)15:31:59 No.108639589

>>108639581
>muh benchmark
i love niggercattle like you, you were bred specifically to be fled by superior beings

Anonymous
04/19/26(Sun)15:35:29 No.108639607

Anonymous 04/19/26(Sun)15:35:29 No.108639607

>>108639589
I guess you prefer assertions and vague individual feelings over measurement? It makes sense given your stupid opinions. You might not lose your job but your salary will be quartered and you will babysit AI

Anonymous
04/19/26(Sun)15:37:32 No.108639623

Anonymous 04/19/26(Sun)15:37:32 No.108639623

File: 1749627793401568.png (7 KB, 404x153)

7 KB PNG

There's only one test that matters and they ran away.

Anonymous
04/19/26(Sun)15:40:12 No.108639642

Anonymous 04/19/26(Sun)15:40:12 No.108639642

>>108639607
>I guess you prefer assertions and vague individual feelings over measurement?
I prefer my practical use case over your graphs, and it hasn't been glorious so far.
>and you will babysit AI
that's already what I have been doing for months if not more in spite of your so called exponential growth

Anonymous
04/19/26(Sun)15:40:20 No.108639643

Anonymous 04/19/26(Sun)15:40:20 No.108639643

>>108626085 (OP)
yeah it did plateau, 4.6 was just nerfed a few weeks before to keep the illusion up

Anonymous
04/19/26(Sun)15:41:48 No.108639655

Anonymous 04/19/26(Sun)15:41:48 No.108639655

>>108626227
Retard. AI is the future. AI is smarter than any human who has ever existed.

Anonymous
04/19/26(Sun)15:43:25 No.108639664

Anonymous 04/19/26(Sun)15:43:25 No.108639664

if you haven't solved at least one erdos problem, you are not allowed to criticise llms
they are smarter than you? want to argue? go solve an erdos problem first.

Anonymous
04/19/26(Sun)15:47:40 No.108639695

Anonymous 04/19/26(Sun)15:47:40 No.108639695

>>108639581
>ainigger can't into error bars

Anonymous
04/19/26(Sun)15:51:25 No.108639728

Anonymous 04/19/26(Sun)15:51:25 No.108639728

>>108639655
True actually, I remember that time Albert Einstein tried walking to the car wash

Anonymous
04/19/26(Sun)16:06:38 No.108639811

Anonymous 04/19/26(Sun)16:06:38 No.108639811

File: claude.jpg (503 KB, 709x900)

503 KB JPG

>>108639623
Too bad they got him.

Anonymous
04/20/26(Mon)00:12:34 No.108642293

Anonymous 04/20/26(Mon)00:12:34 No.108642293

>>108639581
>>108639589
>>108639695
goddamn autists