ARC-AGI-3: The $10,000 Challenge

Michal Mikulasi

Aug 18, 2025

Artificial intelligence is advancing fast, but a new benchmark called ARC-AGI-3 reminds us just how far today’s systems are from true general intelligence. Designed by François Chollet, creator of the original ARC test, this benchmark doesn’t just measure knowledge or pattern recognition; it measures the ability to learn from scratch.

What Makes ARC-AGI-3 Different?

Unlike traditional benchmarks that test models on static data, ARC-AGI-3 introduces interactive grid-world mini-games. Here, an AI agent must figure out the rules of the game by exploring, experimenting, and planning, just as a child would when encountering something new.
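To make the explore-then-plan idea concrete, here is a minimal toy sketch in Python. It is not the ARC-AGI-3 API or any of its games: the `ToyGrid` environment, the action names `A` to `D`, and the assumption that the agent can see its own position, the grid size, and the goal are all hypothetical, chosen only for illustration. The agent starts with no idea what each action does, probes each one, records the effect it observes, and then plans a route with the rules it has learned.

```python
from collections import deque


class ToyGrid:
    """Hypothetical 5x5 grid world (illustration only, not the ARC-AGI-3 API).

    The mapping from action name to movement is hidden from the agent.
    """

    _HIDDEN_RULES = {"A": (0, 1), "B": (1, 0), "C": (0, -1), "D": (-1, 0)}

    def __init__(self, size=5, start=(0, 0), goal=(4, 3)):
        self.size, self.pos, self.goal = size, start, goal

    def step(self, action):
        dr, dc = self._HIDDEN_RULES[action]
        r, c = self.pos[0] + dr, self.pos[1] + dc
        if 0 <= r < self.size and 0 <= c < self.size:  # walls block movement
            self.pos = (r, c)
        return self.pos, self.pos == self.goal


def explore(env, actions, max_rounds=10):
    """Try each unknown action and record the displacement it causes."""
    learned = {}
    for _ in range(max_rounds):
        for action in actions:
            if action in learned:
                continue
            before = env.pos
            after, _done = env.step(action)
            delta = (after[0] - before[0], after[1] - before[1])
            if delta != (0, 0):  # a blocked move reveals nothing yet
                learned[action] = delta
        if len(learned) == len(actions):
            break
    return learned


def plan(start, goal, size, learned):
    """Breadth-first search over the learned rules; returns a list of actions."""
    queue, parents = deque([start]), {start: None}
    while queue:
        cur = queue.popleft()
        if cur == goal:
            break
        for action, (dr, dc) in learned.items():
            nxt = (cur[0] + dr, cur[1] + dc)
            if 0 <= nxt[0] < size and 0 <= nxt[1] < size and nxt not in parents:
                parents[nxt] = (cur, action)
                queue.append(nxt)
    if goal not in parents:
        return None  # goal unreachable with the rules learned so far
    path, node = [], goal
    while parents[node] is not None:
        node, action = parents[node]
        path.append(action)
    return path[::-1]


def run_demo():
    env = ToyGrid()
    learned = explore(env, "ABCD")                      # 1. experiment
    path = plan(env.pos, env.goal, env.size, learned)   # 2. plan
    done = False
    for action in path:                                 # 3. act
        _pos, done = env.step(action)
    return learned, path, done


if __name__ == "__main__":
    print(run_demo())
```

Real ARC-AGI-3 games are far harder than this sketch suggests: the agent is not told which observations matter, what the goal is, or even that actions correspond to movement, which is precisely the gap the benchmark is designed to expose.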

For humans, the challenge is surprisingly simple. Most people can solve these games within minutes. But for AI? The results are humbling. Current agents score zero points, failing even the developer preview of just three games.

Chollet highlights the gap: “AI can do many things, but it cannot have general intelligence as long as this fundamental divide exists.”

The Human vs. Machine Gap

The benchmark excludes trivia, cultural knowledge, and linguistic tricks. Instead, it focuses on core cognitive skills such as causality, object permanence, and flexible problem-solving. These are areas where humans excel intuitively but where machines still struggle.

Interestingly, there is a glimmer of hope. OpenAI researcher Sun reported that a new ChatGPT agent was able to solve the first game in the preview. Progress? Yes, but far from a breakthrough.

A $10,000 Incentive to Push AI Forward

To encourage innovation, Hugging Face is sponsoring a four-week coding sprint with a $10,000 prize pool. Participants can develop their own AI agents and submit them via a public API. The full benchmark, including around 100 games, is expected to launch in early 2026, split into public and private test sets.

The idea is simple: move AI forward not just in narrow tasks, but in general learning ability, a critical step toward artificial general intelligence (AGI).

Why This Matters

AI benchmarks are plentiful, but ARC-AGI-3 stands out because it doesn’t reward memorization or pre-trained knowledge. Instead, it forces systems to interact, adapt, and learn without prior context.

For researchers, it’s a reminder that while today’s AI models can dazzle us with fluent text, stunning images, or code generation, they still lack the kind of flexible, adaptive intelligence that even small children take for granted.

ARC-AGI-3 may not have all the answers, but it asks the right questions, and that could be the key to the next leap in AI development.

Let’s Collaborate!

I’m currently open to exciting opportunities in AI, robotics, and tech writing.

If you’d like to work together, discuss a project, or share ideas, feel free to connect with me on LinkedIn or reach out via email: michalmikulasi@gmail.com.

Also, please consider supporting me on:

https://buy.stripe.com/fZu14nffsgn85p67o43Ru00
