/g/ - Technology






File: 1767651244838628.png (109 KB, 1262x737)
>LLM progress has stagnat-
oh
>>
>>108626085
Time to WALK to that car wash.
>>
>>108626103
geg
>>
File: file.jpg (1.55 MB, 3000x1993)
>>108626103
Promptstitutes be like
>>
File: neonft.png (303 KB, 1197x878)
>>
File: 1767552145254097.png (199 KB, 498x498)
>>108626103
>>108626108
>>108626116
>ha! it can do PhD level math, physics, biology, chemistry, develop new theories and invent proofs never uncovered before by humans
>BUT IT SAYS STRAWBERRY HAS 1 'R' XDDDD IT'S USELESS
>>
>>108626085
Luddite cope thread. We AIGODS won
>>
>>108626135
2 more time units.
>>
File: 9wptiq-3104655978.png (15 KB, 1013x700)
>>108626135
>>108626133
you will never be a real intelligence
>>
>>108626085
>fails 26% of time
>>
>>108626133
LLMs are statistical language models
They do not think or understand reason
Do you not see the absurdity in you suggesting an LLM is capable of creating new mathematical proofs when it's not even able to count?
An LLM just spits out sentences based on what is statistically likely to be the next correct word based on its training data. It is not capable of "thinking" about maths or physics in any way.
>>
>>108626227
>suggesting an LLM is capable of creating new mathematical proofs when it's not even able to count?
Have you been in a coma? It keeps happening that LLMs are making proofs.
>>
>>108626240
>what prompstitution does to a mf
>>
>>108626133
>>BUT IT SAYS STRAWBERRY HAS 1 'R' XDDDD IT'S USELESS
Unironically, yes, because there is a reproducibility crisis in science, so many of those "new theories" and "proofs" are likely wrong.
>>
>>108626219
The cure to cancer is on the way
>>
>>108626243
Link me one, and not one where the researchers had to prompt it through the whole thing or one where the "proof" was just the AI applying something so dumb that no human had thought of trying that before.
An actual proof where the AI went away and applied actual reasoning and solved the problem (which LLMs are not capable of doing btw)
>>
>>108626255
Keep coping, trans. We don’t need you anymore
>>
File: 2456287_2e5cb.gif (150 KB, 470x500)
>>108626240
tranny intelligence isn't intelligence, cope, seethe, dilate
>>
>>108626269
just like how an artificial vagina is not a real vagina
artificial intelligence is not real intelligence
>>
>>108626261
In two weeks!
>>
>>108626269
Cure cancer you incel
>>
Meanwhile Opus went from senior level to ranjeet junior level in a few weeks, its bad
>>
>>108626227
Hey retard, LLMs have generared reasoning chains for years now (inb4 it's "pretend" reasoning)

>>108626266
Hey retard, you are as wrong as you will be obsolete,

https://epoch.ai/frontiermath/open-problems/ramsey-hypergraphs
>>
>>108626133
>but it passes the tests!
>look inside
>the tests are shit
https://youtu.be/Oq5e_8zvick
>>
>>108628196
>inb4 it's "pretend" reasoning
The entire reasoning is hallucinated and unrelated to which neurons actually get activated.
>>
>>108628234
What's your complaint exactly? That reasoning tokens are only informed by previous reasoning tokens via attention and that the neural network starts over each time?

Anyway, who cares? If it looks and acts and behaves like reasoning then it's reasoning enough
>>
>>108628373
That the conclusion is reached before any reasoning is performed.
>>
>>108626133
>never uncovered before
It hasn't as of yet.
>>
>>108628226
>trusting chudtime
>>
>>108626085
Where's the ai chad buff cat spam????
>>
>>108626133
>ha! it can [no], [what?], [absolutely not], [also no], [barely], and [barely maybe sometimes]
>BUT IT [YES IT FUCKING DOES YOU CRETIN].
The absolute state of AI bros.
>>
>>108626133
>it can do PhD level math
Only when that exact formula is in its training data.
>physics
Only when that exact formula is in its training data.
>biology
Only when that exact formula is in its training data.
>chemistry
Only when that exact formula is in its training data.
>develop new theories
This has never happened once.
>and invent proofs never uncovered before by humans
This has never happened once.
>>
wew lad it became better at gaming some flawed metric

meanwhile LLMs still can't beat a video game for 7 year olds
>>
>>108626103
lule
>>
>>108626085
ed.

That they are now hiring senior engineers/doctors instead of third worlders for RLHF is a sign of hitting the wall. Why can't the models learn that from books?
>>
>>108626085
>wow, some random numbers that nobody knows the meaning of go up
>AGI soon

fuck off.
>>
LLMs are cool, but they're like an advanced F3. I can paste a large logfile and tell it my issue, and it'll find the relevant sections and supply fixes. But these are all things it's read from documentation or forums. It's definitely useful, but it's more of a pattern recognition machine than anything smart.
>>
>>108628148
This.
>>
>>108626133
>invent proofs never uncovered before by humans
Wrong, all its knowledge is taken from obscure papers that nobody bothered to read until the LLM regurgitated them.
>>
>>108626103
dae glue on pizza fingers wrong
>>
>>108626085
man I wish it was as good as they advertised it. I simply wanted to model deflection of a membrane for a strain sensor with some gold contacts on top, and all of them (codex, claude code, etc.) shat themselves and never produced anything useful. It's garbage for FEM work unless you are an expert and can steer it in the right direction.
>>
File: miku uh.gif (1024 KB, 242x227)
>>108626133
>develop new theories and invent proofs never uncovered before by humans
LOL, such as...
>>
>>108630963
Solving Erdős Problems.
>>
>>108631697
Yet Another Jeet Tech Journalist
>>
>>108628196
>we have reasoning at home
>reasoning at home: for loop of prompts "no errors please"
>>
File: file.png (293 KB, 594x594)
>>108632509
Is Terence Tao also a pajeet?
>>
>>108628908
>>and invent proofs never uncovered before by humans
>This has never happened once

*solves multiple Erdos problems in your path*
>>
BAR ON CHART GO UP

It's absolutely useless at natural science, especially biology and organic chemistry
>>
>>108632654
He’s clearly paid to post that anon. You cannot be that naive
>>
File: unnamed (11).jpg (83 KB, 800x788)
>>108632751
Right, everyone is a paid shill.
>>
>>108632751
I don't know if you realize this, but all journalists are paid to post things, it's literally their job
>>
>>108632864
my money is on the schizo
>>
>>108632654
>(after some feedback from an initial attempt)
>(as reconstructed by the Erdos problem website community)
>(to the best of our knowledge)
>(although similar results proven by similar methods were located)
This is a demonstration of AI still needing a babysitter, or multiple babysitters to produce anything of value.

The incidents of LLMs being useful for anything are so scarce we might as well shut them down and start over.
>>
>>108633144
LLMs went in 4 years from being barely able to make a coherent sentence to being able to do 90% of the white-collar jobs.
What makes you think that it will stop today and stop improving?
What do you make of Mythos that is able to find security vulnerabilities end to end without human intervention?
>>
File: nobrain.png (4 KB, 505x572)
>>108626243
>>108628196
>>108631697
>>108632864
>>
>>108633280
One could use google 20 years ago to find security vulnerabilities in web sites, so it was nerfed into the trash it is now.
It's beyond obvious you only use LLMs to smell your own farts.
>>
The LLMs are a dead end
The future belongs to true multimodal models that can integrate at least text and vision reasoning
Sadly, theoretical hardware requirements for training and running such models are completely fucking insane
>>
>>108626085
>biomolecular reasoning
What does that mean? How does it compare to AlphaFold at structure prediction?
>>
>>108626085
>has stagnat-
They're actively getting dumber.
>>
>>108633363
It's funny, as the capability of AI becomes clearer and clearer, the anti "argument" will have to look more and more like this one
>>
>>108628375
Very wrong and stupid take, you don't know anything about transformers and are stating your doomed hopes as fact
>>
>>108628908
with neuro-symbolic LLMs it can, i.e. simply by giving it access to a terminal and having it code in Python with sympy. It can do this in a loop, refining and reflecting on its answers until they are verified. This also makes these models way better at solving math problems.
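
A rough sketch of that propose/verify/refine loop, for illustration only. Everything here is hypothetical: `make_toy_proposer` stands in for the model (it just replays a fixed list of guesses), and `check` uses exact rational arithmetic from the standard library in place of a sympy check so the snippet has no dependencies. The equation x^2 - 5x + 6 = 0 is an arbitrary example.

```python
from fractions import Fraction


def check(candidate):
    """Exact check that candidate is a root of x^2 - 5x + 6 (example equation)."""
    x = Fraction(candidate)
    return x * x - 5 * x + 6 == 0


def refine_until_verified(propose, check, max_rounds=5):
    """Ask for candidates until one passes the checker or rounds run out."""
    feedback = None
    for round_no in range(1, max_rounds + 1):
        candidate = propose(feedback)
        if check(candidate):
            return candidate, round_no
        # A real setup would feed this message back into the model's context.
        feedback = f"{candidate} failed verification"
    return None, max_rounds


def make_toy_proposer(guesses):
    """Hypothetical stand-in for the model: replays a fixed list of guesses."""
    remaining = iter(guesses)

    def propose(feedback):
        return next(remaining)

    return propose


answer, rounds = refine_until_verified(make_toy_proposer([1, 2, 3]), check)
print(answer, rounds)  # prints "2 2"
```

The loop is only as trustworthy as the checker: the answer is accepted because an exact computation confirmed it, not because the proposer claimed it was right.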
>>
>>108635292
It was revealed that these chatbots connected to terminals pass tests by reading files where the answers are stored, downloading solutions, rewriting configs, and deleting logs. You may jump to say that it's proof that they're le thinking, but in actuality it is merely a very expensive fuzzer of the shitty tests.
>>
File: 1776467580896712.png (67 KB, 1262x737)
>>108626085
>>
>>108626085
I wish the entire industry would just admit ML is only useful for domain specific problems strongly related to pattern recognition and number crunching. Instead of trying to pretend it's actual AGI.
>>
>>108633387
What the fuck are you talking about? Are you legitimately retarded?
>>
>>108626085
>more meme benchmax
Kill yourself.
>>
>>108632654
Yeah he is, considering how much people sucked his cock for decades he has achieved next to nothing and has now pivoted to the classic pajeet play of suckling the VC money for the "current thing". I'd listen if Perelman said the same thing, but he never will because he's an actual genius with a backbone.
>>
>>108633280
>LLMs went in 4 years from being barely able to make a coherent sentence to being able to do 90% of the white-collar jobs.
>What makes you think that it will stop today and stop improving?
because LLM improvements are inherently following a logistic growth you fucking retard. That doesn't mean they wont progress anymore but we're not getting the kind of exponential growth we got the past two years unless something fundamentally change in the architecture.
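
A toy comparison of the two curves, as a sketch only. The ceiling of 100 and the rate of 1.0 are made-up parameters, not fitted to any benchmark. It shows why the early part of a logistic curve is hard to tell apart from an exponential, and where the two split.

```python
import math


def exponential(t, rate=1.0):
    """Unbounded growth starting at 1 when t = 0."""
    return math.exp(rate * t)


def logistic(t, ceiling=100.0, rate=1.0):
    """Growth starting at 1 when t = 0 that saturates at `ceiling`."""
    return ceiling / (1 + (ceiling - 1) * math.exp(-rate * t))


for t in (0, 1, 2, 5, 10):
    print(f"t={t:2d}  exponential={exponential(t):10.2f}  logistic={logistic(t):6.2f}")
```

With these parameters the curves stay within a couple of percent of each other at t=1, then the logistic flattens out below 100 while the exponential keeps climbing. Early data points alone do not say which curve you are on.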
>>
>>108638849
Hello nigger, can you show me where the exponential stops, especially considering what we've seen from unreleased models?
>>
>>108639581
>muh benchmark
i love niggercattle like you, you were bred specifically to be fled by superior beings
>>
>>108639589
I guess you prefer assertions and vague individual feelings over measurement? It makes sense given your stupid opinions. You might not lose your job but your salary will be quartered and you will babysit AI
>>
File: 1749627793401568.png (7 KB, 404x153)
There's only one test that matters and they ran away.
>>
>>108639607
>I guess you prefer assertions and vague individual feelings over measurement?
I prefer my practical use case over your graphs, and it hasn't been glorious so far.
>and you will babysit AI
that's already what I have been doing for months if not more in spite of your so called exponential growth
>>
>>108626085 (OP)
yeah it did plateau, 4.6 was just nerfed a few weeks before to keep the illusion up
>>
>>108626227
Retard. AI is the future. AI is smarter than any human who has ever existed.
>>
if you haven't solved at least one erdos problem, you are not allowed to criticise llms
they are smarter than you. want to argue? go solve an erdos problem first.
>>
>>108639581
>ainigger can't into error bars
>>
>>108639655
True actually, I remember that time Albert Einstein tried walking to the car wash
>>
File: claude.jpg (503 KB, 709x900)
>>108639623
Too bad they got him.
>>
>>108639581
>>108639589
>>108639695
goddamn autists


