Ask any AI hater to explain what's happening in this picture and watch them go quiet and then start repeating some nonsense marxist talking points they've memorized.
>>107676078
AI haters BTFO once again
>>107676078
can you explain it?
>>107676078>AI hatertrannies redditors and deviantart griftersthere is literally no other demographic hating AI
>>107676078
...overfitting?
>>107676078
log scale
>>107676117
always
>>107676117
That's only this part of the graph. Why are you ignoring the rest?
>>107676078
So what's your point here?
>>107676078
first it didn't know, then it learned
>>107676210
I think he's trying to disprove 'it only learns patterns in training data' by showing that it can get 'smarter' even after it already figured out the training data
>>107676078
I don't know what's happening in that picture.
>>107676112
sooooooooo much this!!
Jesus himself would be vibe preaching if he was still around
What the fuck is happening here?
>>107676111
ofc not. This is a homework thread.
>>107676078
>picture of AI shit
>why can't AI haters identify this?!?
why would an AI hater be doing AI training?
Plebs vastly underestimate the amount of data intelligence requires
it's not that the brute force approach is impossible
it's that you need a million times more data than is currently available
>>107676299
Quantum entanglement
>>107676547
would help if they didn't filter 99% of available data out for being "toxic" of some kind, i.e. not politically aligned with the lab.
>>107676078>hurr durr i just read deep double descentop is a nigger
>>107676078
>generalizing a trivial function only needs a thousand TIMES more steps than learning the training data
>still doesn't reach 100% accuracy, but who needs your calculator to be able to calculate
Impressive, very nice.
>>107676617
you're supposed to call the tool to do math bro, it's for text not maths
>>107676547
>it's that you need a million times more data than is currently available
The higher-order the thing you're looking for, the more data current AI needs to find it. Not sure if it's quadratically more or exponentially more.
But sure, let's not put any effort into finding a more efficient method.
>>107676111
>can you explain it?
It's called "grokking". On some artificial training objectives (modular arithmetic here), validation error eventually drops to ~0 as well, but only if you keep training way past the point where you'd normally stop.
>>107676078
Looks like overfitting. You continue bruteforcing until you have high accuracy on the validation set, so even if the NN is never explicitly trained on it, it still ends up overfitting it.
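If you want to see the curve instead of arguing about a screenshot, here's a minimal sketch (PyTorch; p, the width, the weight decay and the train/val split are my guesses, not whatever run OP's graph came from). Train a tiny net on (a + b) mod p using half the pairs and just keep going long after train accuracy hits 100%, then watch what the val accuracy does.
[code]
# Minimal grokking sketch. Hyperparameters are illustrative, not the paper's.
import torch
import torch.nn as nn

p = 97
# all pairs (a, b) with label (a + b) % p
pairs = torch.tensor([(a, b) for a in range(p) for b in range(p)])
labels = (pairs[:, 0] + pairs[:, 1]) % p

# random 50/50 train/validation split
perm = torch.randperm(len(pairs))
split = len(pairs) // 2
train_idx, val_idx = perm[:split], perm[split:]

def one_hot(x):
    # concatenated one-hot encodings of a and b
    return torch.cat([nn.functional.one_hot(x[:, 0], p),
                      nn.functional.one_hot(x[:, 1], p)], dim=1).float()

model = nn.Sequential(nn.Linear(2 * p, 256), nn.ReLU(), nn.Linear(256, p))
# weight decay matters a lot for how soon the delayed generalization shows up
opt = torch.optim.AdamW(model.parameters(), lr=1e-3, weight_decay=1.0)

x_tr, y_tr = one_hot(pairs[train_idx]), labels[train_idx]
x_va, y_va = one_hot(pairs[val_idx]), labels[val_idx]

for step in range(100_000):  # deliberately train far past 100% train accuracy
    opt.zero_grad()
    loss = nn.functional.cross_entropy(model(x_tr), y_tr)
    loss.backward()
    opt.step()
    if step % 1000 == 0:
        with torch.no_grad():
            tr_acc = (model(x_tr).argmax(1) == y_tr).float().mean()
            va_acc = (model(x_va).argmax(1) == y_va).float().mean()
        print(f"step {step}: train acc {tr_acc:.3f}, val acc {va_acc:.3f}")
[/code]
In most reproductions the weight decay is doing a lot of the work; drop it and the delayed generalization shows up much later or not at all within a reasonable step budget.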
>>107676112anyone have that reaction image from /pol/ of nazi pepe the frog saying jews are based that's how i feel about this new ai hate=tranny d/c bot spam
>>107676907
fortunately for Sammy none of his investors have the 110 iq required to understand this post
>>107676078
>That gap
Should be on a /h/ image
>>107676907
https://www.youtube.com/watch?v=D8GOeCFFby4
>>107676078
Any real socialist/communist is pro AI, the luddites are just finding whatever political excuse they can to curtail progress
>>107677045
Giving it a name does not legitimize it. It's just bruteforcing and overfitting.
Normal NN training has a loss measure that fits the training set. When you keep training after the model already fits the training set, until it fits the validation set, then you (or rather the "engineer" overseeing the training) become the loss function.
People just misunderstand statistics and neglect the bias inserted by their own actions. Choosing to cut training at a specific point, or to extend it further, based on the model's performance on the validation set makes the validation set part of the training and taints the data. This is textbook overfitting.
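Written out as code, the point is that the validation set sits inside the stop condition. Self-contained toy sketch (made-up data and thresholds, not anyone's real pipeline):
[code]
# "Engineer as loss function": the decision to keep training reads the val set.
import torch
import torch.nn as nn

torch.manual_seed(0)
# synthetic 2-class problem: label = sign of x0 + x1
x = torch.randn(2000, 2)
y = ((x[:, 0] + x[:, 1]) > 0).long()
x_tr, y_tr, x_va, y_va = x[:1000], y[:1000], x[1000:], y[1000:]

model = nn.Linear(2, 2)
opt = torch.optim.SGD(model.parameters(), lr=0.1)

step = 0
while True:
    opt.zero_grad()
    nn.functional.cross_entropy(model(x_tr), y_tr).backward()
    opt.step()
    step += 1
    with torch.no_grad():
        va_acc = (model(x_va).argmax(1) == y_va).float().mean().item()
    # the control flow itself depends on validation accuracy, so the
    # validation set is now steering the procedure
    if va_acc >= 0.99 or step >= 10_000:
        break

print(f"stopped at step {step} with val acc {va_acc:.3f}")
[/code]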
>>107676078
Another fucking
>i watched a youtube video now i am smart let me make a 4chan thread about it
thread
>>107677086
I posted that video because a) it's probably what inspired OP to make this thread and b) it explains exactly how a model like this learns to generalize some time after it has already overfit the training set. It's an example of a highly interpretable network.
>>107677118
It's not learning. If it already has 100% accuracy on the training set but fails on the validation set, then it found a solution that fits all of the training set but is not the correct one. By continuing to train it, it tries different solutions that still fit the training set to 100%.
How does it know it found the correct solution out of the multiple ones with 100% "accuracy" on the training set? It doesn't. The researcher looks at the validation set accuracy and decides whether the current solution is the right one or not. This is just training on the validation set by proxy.
Eventually it will find a solution that fits both the training and the validation set. That's when the researcher stops the training. Does that mean it's the correct solution? No. Not necessarily. If you try a different validation set at that point, it quite likely will fail. Or if you continue training, it might go for a different solution that drops the validation set accuracy.
It looks good in graphs, but in practice this is just blatant cheating. You are leaking the validation set into the training procedure through the stopping decision, making your "validation accuracy" meaningless.
You could get the same effect by simply not splitting the data into a training set and a validation set, and just training on all available data. It would converge faster and be just as smart (dumb)
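If anyone wants the selection effect in isolation: toy demo below (numpy, every number in it is made up). The "checkpoints" are literally random guessers, so the only thing that can make one of them look good is picking it by its validation score.
[code]
# Toy demo of selection bias: picking the best-looking checkpoint by its
# validation accuracy inflates that number; a set never used for the choice
# stays at chance.
import numpy as np

rng = np.random.default_rng(0)
n_classes, n_val, n_test, n_checkpoints = 10, 200, 200, 500

y_val = rng.integers(n_classes, size=n_val)
y_test = rng.integers(n_classes, size=n_test)

best_val, best_test = 0.0, 0.0
for _ in range(n_checkpoints):
    # each "checkpoint" predicts uniformly at random, true accuracy = 1/10
    val_acc = (rng.integers(n_classes, size=n_val) == y_val).mean()
    test_acc = (rng.integers(n_classes, size=n_test) == y_test).mean()
    if val_acc > best_val:          # selection uses the validation set
        best_val, best_test = val_acc, test_acc

print(f"chance level: {1 / n_classes:.2f}")
print(f"selected checkpoint, val acc:  {best_val:.2f}  (inflated by selection)")
print(f"same checkpoint, test acc:     {best_test:.2f}  (stays near chance)")
[/code]
The picked checkpoint's val accuracy lands well above the 0.10 chance level while its accuracy on the untouched set stays at chance. That gap is the selection bias this post is describing; whether it accounts for the OP graph is a separate argument.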