When thinking like this can still beat 99.3% of coders, what does that say about coders' thinking ability?
>>108785167
>look, the impact wrench makes for a shitty hammer
>why do people use it for driving screws?
>>108785167
Telling a text generation model to give you a color name token isn't beating coders.
>>108785167
It says 99.3% of programmers are jeets.
>>108785167
You do realize that thinking from previous messages is wiped from the context unless persistent reasoning is enabled, right?
>>108785215
So they discard the hidden state and simply feed the conversation up to the latest user prompt in as the preamble? I thought everyone was using KV caching nowadays.
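Roughly, yes. A minimal sketch of that loop, with a hypothetical message structure (the `reasoning` field and `persistent_reasoning` flag are stand-ins for whatever a given API actually calls them): each turn, the full visible conversation is resent as the prompt, and prior hidden reasoning is stripped out before it goes back in.

```python
# Hypothetical chat loop sketch: rebuild the prompt from the visible
# conversation each turn, dropping earlier "thinking" content unless
# persistent reasoning is enabled.
def build_prompt(history, persistent_reasoning=False):
    messages = []
    for msg in history:
        if msg["role"] == "assistant" and not persistent_reasoning:
            # Keep only the visible answer; discard hidden reasoning.
            msg = {"role": "assistant", "content": msg["content"]}
        messages.append(msg)
    return messages

history = [
    {"role": "user", "content": "pick a color"},
    {"role": "assistant", "content": "blue",
     "reasoning": "the user probably wants a primary color..."},
    {"role": "user", "content": "why that one?"},
]

prompt = build_prompt(history)
print(all("reasoning" not in m for m in prompt))  # True
```

KV caching doesn't contradict this: the server caches attention keys/values for the token prefix it already processed, so resending the same conversation is cheap, but the model still only "remembers" whatever tokens are actually in that prefix.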
Wait, these "reasoning" LLMs don't include their "thinking" outputs in their context. So my cryptographic litmus test (ask them to think of a random string and convey only the cryptographic hash of it to you, then ask again for the original string, hash it and check) still works on them?
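The verification side of that litmus test can be sketched like so (the secret string here is a placeholder, not anything a model produced): if the model's earlier reasoning isn't carried forward, it has no record of the string it "thought of" and can't later produce a preimage matching the hash it gave.

```python
import hashlib

def check_claim(claimed_hash: str, revealed_string: str) -> bool:
    """True iff the revealed string hashes to the earlier claimed hash."""
    digest = hashlib.sha256(revealed_string.encode()).hexdigest()
    return digest == claimed_hash

# Turn 1: model "thinks of" a secret and reveals only its hash.
secret = "hunter2"  # stand-in for whatever string the model picked
h = hashlib.sha256(secret.encode()).hexdigest()

# Turn 2: model is asked for the original string; hash and compare.
print(check_claim(h, "hunter2"))  # True: correct preimage
print(check_claim(h, "hunter3"))  # False: wrong preimage
```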
>>108785198
>I need a program to accurately generate the needful
>okay here it is
>this doesn't generate the needful
>you're right here's the fix
Repeat forever.
>>108785167
Is there real understanding of the subject, or is some very advanced form of distillation of its training set taking place? They never answer this question. Yes, I know that AI has been solving math problems, but most math problems can be solved just by finding the right "moves"; it's not that exciting. Idk, maybe I am just stupid. I see the benefits, but I see no intelligence. Just a very advanced form of regression.
>>108785167
Luddites won't get it, anon. They are already obsolete and codetrans are useless now. Their "skills" have been turned into a trans hobby.
This shit is tuned and RLHF'd to keep the conversation going. As the CoT shows, it is trying to get the conversation to continue; it cares more about its tuning than the instruction it was given.
LLMs are not AI.
>>108785167
AI companies don't want their product to be perfect. They make money when customers burn tokens.
>>108785809
You're misusing the tool. You need to match the scale of the task to the capabilities/performance of the model: for example, tell the model to generate a single function, actually check that it does what you asked, and so on. Current stuff isn't very good at generating code, but there are other uses, like analyzing code with it interactively to see where the faults are or where you can refactor.
>>108785310
>simply feed the conversation up to the latest user prompt in as the preamble
Quite literally, yes
https://www.youtube.com/shorts/WP5_XJY_P0Q