/g/ - At first, I thought GPT-5 had cracked those math p - Technology

Anonymous

10/18/25(Sat)15:19:23 No.106931905

File: llms_hype.jpg (888 KB, 1320x1690)

Anonymous 10/18/25(Sat)15:19:23 No.106931905 Archived

At first, I thought GPT-5 had cracked those math problems on its own.
Turns out (as Demis pointed out) GPT-5 just looked up the answers via web search.
We really need better peer review for these “AI discovers science/math” claims.

Anonymous
10/18/25(Sat)15:20:18 No.106931915

Anonymous 10/18/25(Sat)15:20:18 No.106931915

File: AI_SLOP.mp4 (1.22 MB, 1080x1080)

1.22 MB MP4

The last few months have been devastating for LLM dreams:
> Apple reasoning paper and the ASU mirage paper and many others confirmed that LLMs still can’t solve distribution shift.
> GPT-5 came late and fell short.
> Karpathy just said agents aren’t anywhere close, and that AGI is a decade away.
> And Hassabis just blew up some wildly overhyped claims from OpenAI about math.
Game over, man.
LLMs have their place, but anyone expecting the current paradigm to be close to AGI is delusional.

Anonymous
10/18/25(Sat)16:25:23 No.106932535

Anonymous 10/18/25(Sat)16:25:23 No.106932535

People who are in charge of selling you their product and purposefully block access for people who want to test and validate their results are in fact full of shit.
Who knew?

Anonymous
10/18/25(Sat)16:53:37 No.106932739

Anonymous 10/18/25(Sat)16:53:37 No.106932739

>>106931905
how about you post where he said something useful and not just "this is embarassing"?

Anonymous
10/18/25(Sat)17:51:47 No.106933175

Anonymous 10/18/25(Sat)17:51:47 No.106933175

>>106932739
this. embarassing desu

Anonymous
10/18/25(Sat)17:54:35 No.106933203

Anonymous 10/18/25(Sat)17:54:35 No.106933203

>>106931915
>distribution shift
Explain that without consulting Gary Marcus

Anonymous
10/18/25(Sat)17:54:38 No.106933204

Anonymous 10/18/25(Sat)17:54:38 No.106933204

File: 1000016877.png (398 KB, 1080x1475)

398 KB PNG

>>106932739
>>106933175
Not op.

I guess Google must be fairly confident about Gemini 3.

Anonymous
10/18/25(Sat)17:55:19 No.106933208

Anonymous 10/18/25(Sat)17:55:19 No.106933208

>>106931905
Do these people even understand how GPT works? It doesn't solve or invent anything. It predicts the next most likely token.

Anonymous
10/18/25(Sat)17:56:21 No.106933214

Anonymous 10/18/25(Sat)17:56:21 No.106933214

>>106933208
Something something emergent abilities something give me a trillion dollars

Anonymous
10/18/25(Sat)17:59:49 No.106933246

Anonymous 10/18/25(Sat)17:59:49 No.106933246

>>106933208
still with their CoT (chain of thought) stuff basically, lets prompt the model with software 1000x times ourselves, from the initial users prompt.
it does do a form of thinking, in the sense that it can coagulate a mishmash of info and maybe combine it into an interesting novel thing.
this is extremely inefficient though and theyre never using 'consumer grade' models for these hot-shot question answerings, its always highly trained shit, highly specialized, highly expensive. so even if it can create some novel soups with a bunch of ingredients and math autism, its not feasible to depend on it rather than some scientists eating pizza and grinding the solution out.