Exploration, exploitation, and thinking

01 Sep, 2025

I’ve been thinking about AI and machine learning for about a decade, and one thing that still surprises me is how often the ideas spill over into everyday life.

In reinforcement learning (RL) there’s a tension between two forces: exploration and exploitation. Exploration means trying new things. Exploitation means sticking with what you already know.

RL agents typically learn by exploring a lot in the beginning; once they know more, they can safely exploit. But if they skip exploration, they get stuck: they just keep repeating whatever they stumbled onto first, even if it isn’t very good. An agent that exploits too early never discovers the better strategies it could have found.
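To make that concrete, here is a minimal sketch of the trade-off using an epsilon-greedy agent on a three-armed bandit, one common textbook setup and not the only way to balance the two. The names (`run_bandit`, `payouts`, `epsilon`) and the payout values are my own, chosen for illustration, and the rewards are noise-free so the effect is easy to see. With probability `epsilon` the agent tries a random arm; otherwise it pulls the arm it currently believes is best.

```python
import random


def run_bandit(payouts, epsilon, steps=1000, seed=0):
    """Run an epsilon-greedy agent on a bandit with fixed (noise-free) payouts.

    Returns (pull_counts, total_reward). All names and values here are
    illustrative assumptions, not taken from any particular library.
    """
    rng = random.Random(seed)
    n_arms = len(payouts)
    estimates = [0.0] * n_arms  # the agent's current belief about each arm
    counts = [0] * n_arms       # how often each arm has been pulled
    total = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm, even one that looks bad so far.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pull the arm with the highest estimate (ties go to the first).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = payouts[arm]
        counts[arm] += 1
        # Incremental running average of the rewards seen for this arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total += reward

    return counts, total


payouts = [0.2, 0.5, 0.9]  # the last arm is the best, but the agent doesn't know that

greedy_counts, greedy_total = run_bandit(payouts, epsilon=0.0)
explore_counts, explore_total = run_bandit(payouts, epsilon=0.1)

print("never explores:", greedy_counts, round(greedy_total, 1))
print("explores 10%:  ", explore_counts, round(explore_total, 1))
```

The agent that never explores pulls the first arm on step one, sees a small reward, and then pulls that same arm for all 1000 steps. The agent that explores a tenth of the time soon finds the better arm and spends most of its pulls there.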

Humans aren’t so different. We need a period of exploration, where we struggle and figure things out for ourselves. That’s how we build the mental muscles for judgment. If we outsource that too early, those muscles never develop.

And it’s never been easier to outsource than it is today. We have a superpower at our fingertips that can tempt us into premature exploitation. It’s just so much easier to ask than to struggle.

But the struggle is the point; that’s how you learn to think.

Skip the struggle, and you only think you’re thinking. Early studies suggest that people who lean on AI too early and too often remember less, don’t go as deep, and fail to build real judgment. Over time, that adds up to a kind of thought atrophy.

This is the exploitation trap. You get an answer, but at the cost of the skills you need to find answers yourself. Maybe even better ones.

And the younger you are, the bigger the cost, because you may skip exploration entirely, trading the long-term reward of learning to think for the short-term reward of an easy answer.

And that matters. An RL agent that skips exploration never learns its environment. A generation that skips exploration may never learn to think.

The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects from a Survey of Knowledge Workers (Microsoft & Carnegie Mellon, 2025)

ChatGPT's Impact On Our Brains According to an MIT Study (TIME, 2025)

Does ChatGPT Make You Dumber? What a New MIT Study Really Found (Marketing AI Institute, 2025)

Increased AI use linked to eroding critical thinking skills (Phys.org, 2025)

Evaluating the Impact of AI Dependency on Cognitive Ability among Young Adults (AMH International, 2024)

#AI #RL #musing