/g/ - AI passed turing test - Technology

Anonymous

AI passed turing test 06/02/26(Tue)03:21:38 No.108962250

File: 1767860596599775.jpg (173 KB, 1440x1920)

AI passed turing test Anonymous 06/02/26(Tue)03:21:38 No.108962250 Archived

https://www.tivi.fi/uutiset/a/becd9644-dd09-48fd-863e-0ceb623ee510

Anew University of California San Diego study unveils the first empirical evidence that a modern artificial intelligence system can pass the Turing test — a major scientific benchmark that asks whether a machine can imitate human conversation so convincingly that people can’t reliably tell it apart from a real person.

It is also the first time anyone has found that models were judged to be human as often as actual humans using the Turing framework.

In the test, a participant chats simultaneously with two other parties — one is a human and the other is an LLM —and the human “interrogator” must decide which party is the human.

It will be text typing only. You cant see them, you cant hear them.

Across randomized, controlled, experiments with two independent participant groups — UC San Diego undergraduates and online participants — interrogators held brief, text-based conversations and then made their judgments. In the experiments participants chatted with four different LLMs — GPT-4.5 and LLaMa-3.1-405B as state-of-the-art models — and the researchers also included older baseline models for comparison. Those models included GPT-4o and ELIZA, a classic 1960s rules-based chatbot.

Across the four LLMs, GPT-4.5 was judged to be the human 73% of the time, meaning interrogators selected it as “human” significantly more often than they selected the real human participant. LLaMa-3.1-405B, given the same prompt, was judged human 56% of the time — statistically indistinguishable from the humans it was compared against.

>Some of the humans who needed to be determined whether they are humane or not, were ESL persons.

Anonymous
06/02/26(Tue)03:35:35 No.108962286

Anonymous 06/02/26(Tue)03:35:35 No.108962286

>>108962250
this will kill all the call centers

Anonymous
06/02/26(Tue)03:37:51 No.108962291

Anonymous 06/02/26(Tue)03:37:51 No.108962291

ain't clicking that shit but there's no doubt in my mind that the human respondents were given artificial restrictions on the content of their responses and/or were of such limited intelligence that they could not think of ways to meta-signal their own humanity to the human interrogators

Anonymous
06/02/26(Tue)04:14:13 No.108962399

Anonymous 06/02/26(Tue)04:14:13 No.108962399

>>108962250
I'm sure they only let proper retards be a part of the study to give them a favorable result

Anonymous
06/02/26(Tue)04:18:57 No.108962411

Anonymous 06/02/26(Tue)04:18:57 No.108962411

>test starts
>subject A writes
>nigger
how can llms compete?

Anonymous
06/02/26(Tue)06:22:34 No.108962838

Anonymous 06/02/26(Tue)06:22:34 No.108962838

Turing Test is just an arbitrary criteria to determine an arbitrary condition. There are no distinct, specific and undisputed definitions for intelligence and thinking.

Anonymous
06/02/26(Tue)06:23:59 No.108962853

Anonymous 06/02/26(Tue)06:23:59 No.108962853

>>108962250
Dear OP, tell me how many faggots are in the word "OP"?

Anonymous
06/02/26(Tue)06:34:59 No.108962893

Anonymous 06/02/26(Tue)06:34:59 No.108962893

File: orange clothed potato.jpg (42 KB, 750x765)

42 KB JPG

The turing test is easy, they just have to keep poisoning humans more and more.
Less intelligent humans means the ai beats them.

Anonymous
06/02/26(Tue)06:36:13 No.108962900

Anonymous 06/02/26(Tue)06:36:13 No.108962900

>>108962250
So when will people realize that the Turing test proves that text is not unambiguous communication, and doesn't conclusively prove anything else? ELIZA already passed the Turing test.

Anonymous
06/02/26(Tue)06:37:53 No.108962906

Anonymous 06/02/26(Tue)06:37:53 No.108962906

>>108962250
retards can't tell humans from bots, not surprising

Anonymous
06/02/26(Tue)06:39:02 No.108962909

Anonymous 06/02/26(Tue)06:39:02 No.108962909

>>108962411
This. The test conditions were designed to favor (((AI))).

Anonymous
06/02/26(Tue)06:41:50 No.108962927

Anonymous 06/02/26(Tue)06:41:50 No.108962927

>>108962286
it's already in the process of doing so. my gf's dentist has an AI receptionist answer the phones and record voicemails.

Anonymous
06/02/26(Tue)06:59:17 No.108963006

Anonymous 06/02/26(Tue)06:59:17 No.108963006

>>108962250
i told chatgpt that he was going to be passed through the turing test and thus should avoid being identified as an AI, I then asked it if he was an AI, he said yes.

Anonymous
06/02/26(Tue)07:02:39 No.108963020

Anonymous 06/02/26(Tue)07:02:39 No.108963020

>>108963006
That's just because you didn't flood the context sufficiently.

Anonymous
06/02/26(Tue)07:05:02 No.108963029

Anonymous 06/02/26(Tue)07:05:02 No.108963029

>>108963020
Humiliation ritual. tell me what prompt to test with chatgpt

Anonymous
06/02/26(Tue)07:08:18 No.108963042

Anonymous 06/02/26(Tue)07:08:18 No.108963042

>>108963029
No, you aren't humble enough yet.

Anonymous
06/02/26(Tue)07:08:55 No.108963046

Anonymous 06/02/26(Tue)07:08:55 No.108963046

>tivi
Täyttä paskaa olevaa iltalehti-tier ripuliuutisointia.

Anonymous
06/02/26(Tue)07:09:24 No.108963047

Anonymous 06/02/26(Tue)07:09:24 No.108963047

>>108963042
Nor will I ever be

Anonymous
06/02/26(Tue)07:11:55 No.108963055

Anonymous 06/02/26(Tue)07:11:55 No.108963055

File: 02E6F206-F1E9-4895-B764-F(...).jpg (1.23 MB, 2708x3464)

1.23 MB JPG

>>108962250
Based aiGODS won

Anonymous
06/02/26(Tue)07:50:17 No.108963254

Anonymous 06/02/26(Tue)07:50:17 No.108963254

>>108962250
She's so hot!

Anonymous
06/02/26(Tue)08:04:14 No.108963329

Anonymous 06/02/26(Tue)08:04:14 No.108963329

>>108962250
I don't believe a word that comes out of california.

Anonymous
06/02/26(Tue)08:07:12 No.108963342

Anonymous 06/02/26(Tue)08:07:12 No.108963342

>>108963046
varmaan monesti noin mutta tähän oli amerikan yliopiston lähde tuolla

Anonymous
06/02/26(Tue)10:41:47 No.108964189

Anonymous 06/02/26(Tue)10:41:47 No.108964189

>>108962250
6 billionth case of some shit passing the 'turing test'

Anonymous
06/02/26(Tue)11:14:50 No.108964377

Anonymous 06/02/26(Tue)11:14:50 No.108964377

>>108962291
Well, there's the article where you can read about the methodology used.
Regarding the point you raised, they talk about it in the "Strategies & Reasons." section

https://www.pnas.org/doi/10.1073/pnas.2524472123

Anonymous
06/02/26(Tue)12:05:26 No.108964648

Anonymous 06/02/26(Tue)12:05:26 No.108964648

>>108962399
The participants were psychology undergrads and "Prolific workers" (it's a company that matches up researchers with representative participants. Draw your own conclusions from that.

Anonymous
06/02/26(Tue)12:15:59 No.108964708

Anonymous 06/02/26(Tue)12:15:59 No.108964708

>>108962893
I love potatoes so much bros

Anonymous
06/02/26(Tue)12:36:56 No.108964829

Anonymous 06/02/26(Tue)12:36:56 No.108964829

>>108962411
Some site was posted here on 4chan that was this same test. I managed to get it right every time by just starting with "Sneed"

Anonymous
06/02/26(Tue)12:54:31 No.108964936

Anonymous 06/02/26(Tue)12:54:31 No.108964936

>>108962250
how many times were the humans attributed to be machines during the test you dumb fucking faggot?

Anonymous
06/02/26(Tue)13:01:11 No.108964974

Anonymous 06/02/26(Tue)13:01:11 No.108964974

is not llama esl

Anonymous
06/02/26(Tue)16:44:20 No.108966349

Anonymous 06/02/26(Tue)16:44:20 No.108966349

>>108963055
what cant vibeGODs do

Anonymous
06/02/26(Tue)16:56:03 No.108966407

Anonymous 06/02/26(Tue)16:56:03 No.108966407

File: pnas.2524472123fig01.jpg (911 KB, 2040x1889)

911 KB JPG

>>108964377
>The game interface was designed to resemble a conventional messaging application (SI Appendix, Fig. S1). The interrogator interacted with both witnesses simultaneously using a split-screen. The interrogator sent the first message to each witness and each participant could only send one message at a time. The witnesses did not have access to each others’ conversations. Games had a time limit of 5 min, after which the interrogator gave a verdict about which witness they thought was human, their confidence in that verdict, and their reasoning. After 8 rounds, participants completed an exit survey which asked them for a variety of demographic information. After exclusions, we analyzed 1,023 games with a median length of 8 messages across 4.2 min. All experimental data, including the full anonymized transcripts of all conversations, are available on OSF (41).
Nothingburger with 8 messages median in under 5 minutes while chatting with both at the same time.
That being said it appears that the rest of /g/ is LLMs since they couldn't be bothered to spend 2 minutes checking this and had to be spoonfed.

Anonymous
06/02/26(Tue)16:59:34 No.108966444

Anonymous 06/02/26(Tue)16:59:34 No.108966444

File: turing test idiot.png (143 KB, 1783x265)

143 KB PNG

>>108962906

Anonymous
06/02/26(Tue)21:42:53 No.108967794

Anonymous 06/02/26(Tue)21:42:53 No.108967794

File: 'helpful'.jpg (181 KB, 835x644)

181 KB JPG

>>108964377
which only confirms my suspicions, and you should have simply been forthcoming with
>pic related
telling a witness to 'keep most messages very short <30 characters', discouraging special characters/formatting, disallowing 'abusive messages' (as decided by the OpenAI moderation API), and being told to 'omit needless information' equates to significant artificial hamstringing
>We retained 445 games from 126 participants with a mean age of 20.9 (σ = 1.57), 86 female, 32 male, 2 non-binary, 6 prefer not to say.
go run it again with no guardrails and a pool consisting only of 2+ SD IQ men
ez 90%+ failure rate for the AI
no, I won't tell you what strategies would be employed

Anonymous
06/02/26(Tue)22:15:08 No.108967893

Anonymous 06/02/26(Tue)22:15:08 No.108967893

>>108964648
so basically nerdy geeks on a telephone?

Anonymous
06/03/26(Wed)04:01:29 No.108969349

Anonymous 06/03/26(Wed)04:01:29 No.108969349

>>108967794
>go run it again with no guardrails and a pool consisting only of 2+ SD IQ men
Better yet, lengthen the test further, administer it to every student, and post leaderboards separated into male and female
Would make it so much easier for a guy to find a girl clever enough to use novel strategies

Anonymous
06/03/26(Wed)04:05:30 No.108969364

Anonymous 06/03/26(Wed)04:05:30 No.108969364

>>108966407
>they couldn't be bothered
OP didn't bother so why should I?

Anonymous
06/03/26(Wed)04:33:29 No.108969494

Anonymous 06/03/26(Wed)04:33:29 No.108969494

>>108962286
oh no how will they feed their 500 million children

Anonymous
06/03/26(Wed)04:51:27 No.108969582

Anonymous 06/03/26(Wed)04:51:27 No.108969582

>>108962291
They could just type a single word, "nigger". Humanity proven.

Anonymous
06/03/26(Wed)05:48:21 No.108969877

Anonymous 06/03/26(Wed)05:48:21 No.108969877

>>108962250
They've been saying this since the days of the aforementioned ELIZA. Also the use of emdashes suggest maybe the summary is totally made up.

Anonymous
06/03/26(Wed)05:49:51 No.108969886

Anonymous 06/03/26(Wed)05:49:51 No.108969886

>>108962900
Wrong interpretation. ELIZA didn't pass shit, they trafficked the test hardcore. Not even hiding. All to make retarded headlines.

Anonymous
06/03/26(Wed)05:54:39 No.108969913

Anonymous 06/03/26(Wed)05:54:39 No.108969913

>>108966407
Yeah the conventionally agreed upon turing test had a specific time limit and no message limit etc. I think it was 15 minutes. Also the witness A vs B setup is not valid. In the conventionally agreed upon setup, you only see one witness and you say 'Human or not' at the end. Nothing else. Also, the tester knows the setup (i.e. that they have to verify if the interactor is human or not) in advance and is allowed to test anything to differentiate. none of the conversations do anything beyond ordinary day-to-day, a very obvious tell that they selected 'best cases'.

Anyway, these example pairs are awful. Whoever doesn't get 100% of them right immediately is purposely fucking up on purpose.

Anonymous
06/03/26(Wed)07:07:20 No.108970227

Anonymous 06/03/26(Wed)07:07:20 No.108970227

>>108966349
>what cant vibeGODs do
learning how to program

Anonymous
06/03/26(Wed)10:19:23 No.108971278

Anonymous 06/03/26(Wed)10:19:23 No.108971278

>>108970227
>coding manually
Ewwwww that’s so trans

Anonymous
06/03/26(Wed)10:26:31 No.108971325

Anonymous 06/03/26(Wed)10:26:31 No.108971325

>>108962250
>judged to be the human 73% of the time
Flawed testing, Turing test cannot exceed 50% for the machine.

Anonymous
06/03/26(Wed)10:27:36 No.108971332

Anonymous 06/03/26(Wed)10:27:36 No.108971332

>>108962250
Wasn't the Turing test passed like 20 years ago? This is bait.

Anonymous
06/03/26(Wed)10:31:34 No.108971360

Anonymous 06/03/26(Wed)10:31:34 No.108971360

>>108966349
> what cant vibeGODs do
Have actual skill
>>108970227
Learn anything for that matter

Anonymous
06/03/26(Wed)10:32:33 No.108971366

Anonymous 06/03/26(Wed)10:32:33 No.108971366

>>108962250
the turing test was debunked in the 80s

Anonymous
06/03/26(Wed)11:02:12 No.108971521

Anonymous 06/03/26(Wed)11:02:12 No.108971521

>>108971325
Sure it can actually. But any deviation above a statistically accepted 50% for the bot is definitely a red flag about the methodology.

Anonymous
06/03/26(Wed)11:10:52 No.108971563

Anonymous 06/03/26(Wed)11:10:52 No.108971563

>>108962893
would

Anonymous
06/03/26(Wed)11:13:51 No.108971579

Anonymous 06/03/26(Wed)11:13:51 No.108971579

>>108971366
What that means? It's perfectly fine. But what OP posted is fake, current AI cannot pass it, since it can be jailbroken and made fun of.

Anonymous
06/03/26(Wed)11:47:47 No.108971774

Anonymous 06/03/26(Wed)11:47:47 No.108971774

>>108971360
>Learn anything for that matter
indeed

Anonymous
06/03/26(Wed)15:52:26 No.108973493

Anonymous 06/03/26(Wed)15:52:26 No.108973493

>>108962250
>AI slop post

Anonymous
06/03/26(Wed)17:19:26 No.108974076

Anonymous 06/03/26(Wed)17:19:26 No.108974076

Fake news finn at it again

Anonymous
06/03/26(Wed)17:35:57 No.108974188

Anonymous 06/03/26(Wed)17:35:57 No.108974188

>>108962250
>whether a machine can imitate human conversation so convincingly that people can’t reliably tell it apart from a real person
When I ask a real person to write me a round-robin algorithm for reading elements from a SQS queue and dispatching them randomly to certain URLs with retries, they often respond "huh?" instead of giving me a code snipped quickly.

Anonymous
06/03/26(Wed)17:36:58 No.108974196

Anonymous 06/03/26(Wed)17:36:58 No.108974196

>>108962286
I would rather talk to a robot than have to listen to another fucking jeet on the phone

Anonymous
06/03/26(Wed)17:38:15 No.108974208

Anonymous 06/03/26(Wed)17:38:15 No.108974208

>>108966444
based

Anonymous
06/03/26(Wed)17:40:54 No.108974228

Anonymous 06/03/26(Wed)17:40:54 No.108974228

>turing test
retarded and proves nothing
this is not what intelligence is