At first, I thought GPT-5 had cracked those math problems on its own.Turns out (as Demis pointed out) GPT-5 just looked up the answers via web search.We really need better peer review for these “AI discovers science/math” claims.
The last few months have been devastating for LLM dreams:> Apple reasoning paper and the ASU mirage paper and many others confirmed that LLMs still can’t solve distribution shift. > GPT-5 came late and fell short. > Karpathy just said agents aren’t anywhere close, and that AGI is a decade away.> And Hassabis just blew up some wildly overhyped claims from OpenAI about math.Game over, man.LLMs have their place, but anyone expecting the current paradigm to be close to AGI is delusional.
People who are in charge of selling you their product and purposefully block access for people who want to test and validate their results are in fact full of shit.Who knew?
>>106931905how about you post where he said something useful and not just "this is embarassing"?
>>106932739this. embarassing desu
>>106931915>distribution shiftExplain that without consulting Gary Marcus
>>106932739>>106933175Not op.I guess Google must be fairly confident about Gemini 3.
>>106931905Do these people even understand how GPT works? It doesn't solve or invent anything. It predicts the next most likely token.
>>106933208Something something emergent abilities something give me a trillion dollars
>>106933208still with their CoT (chain of thought) stuff basically, lets prompt the model with software 1000x times ourselves, from the initial users prompt.it does do a form of thinking, in the sense that it can coagulate a mishmash of info and maybe combine it into an interesting novel thing. this is extremely inefficient though and theyre never using 'consumer grade' models for these hot-shot question answerings, its always highly trained shit, highly specialized, highly expensive. so even if it can create some novel soups with a bunch of ingredients and math autism, its not feasible to depend on it rather than some scientists eating pizza and grinding the solution out.