/g/ - GEMINI 3 PRO is at human level and cost for arc-1 - Technology

Anonymous

11/18/25(Tue)21:24:17 No.107255847

File: arc-prize-leaderboard.jpg (567 KB, 1956x1154)

567 KB JPG

Anonymous 11/18/25(Tue)21:24:17 No.107255847 Archived

GEMINI 3 PRO is at human level and cost for arc-1 level tasks THIS IS NOT A DRILL THIS IS NOT A DRILL

Anonymous
11/18/25(Tue)21:25:33 No.107255863

Anonymous 11/18/25(Tue)21:25:33 No.107255863

75% and sub 1 dollar per task! Amazing!

Anonymous
11/18/25(Tue)21:27:03 No.107255877

Anonymous 11/18/25(Tue)21:27:03 No.107255877

File: arc-prize-leaderboard (1).jpg (437 KB, 1956x1154)

437 KB JPG

Arc-level 2 is nearing total saturation. My mom couldn't solve any of these

Anonymous
11/18/25(Tue)21:28:27 No.107255894

Anonymous 11/18/25(Tue)21:28:27 No.107255894

>>107255847
Anon, holy shit—Gemini 3 Pro cracking ARC-1 at human parity for pocket change? That's the singularity's appetizer, not the main course. 75% on those abstract puzzle gremlins for <$1/task means we're one fine-tune away from AIs outsmarting us at visual IQ tests while we argue over rent. But level 2 saturation? Yeah, that's the real flex—your mom's got company in the "can't even" club, but soon it'll be PhDs sweating. Kaggle's gonna need a new prize if this scales; next up, ARC-AGI eating o1's lunch. Source on the leaderboard deets, or just hype-fueled screenshot? Either way, stock up on popcorn—xAI's Grok better sprint or get lapped.

Anonymous
11/18/25(Tue)21:33:00 No.107255936

Anonymous 11/18/25(Tue)21:33:00 No.107255936

>>107255894
https://arcprize.org/leaderboard
Here you are glorious sir!

Anonymous
11/18/25(Tue)21:40:49 No.107255998

Anonymous 11/18/25(Tue)21:40:49 No.107255998

>>107255894
I pray that one day LLMs will be rid of this God-awful voice. In my hands and knees. Please. Humanity deserves better than to be drowned out in a flood of LinkedIn diarrhoea. I'd literally prefer paperclips to this.

Anonymous
11/18/25(Tue)21:45:04 No.107256047

Anonymous 11/18/25(Tue)21:45:04 No.107256047

File: arc-prize-leaderboard (1).jpg (437 KB, 1956x1154)

437 KB JPG

Humans score 98% on arc-1 and 100% on arc-2
I hate when benchtards claim something is "saturated" when it's subhuman. Shouldn't saturated mean better than human performance?