Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

Students using laptops with an AI tutor interface on screen, a teacher observing, and a graph showing improved test…

Two Studies Find AI Tutors Improve Learning, While Unrestricted AI Use Can Shortcut It

New research shows AI systems prompted to act as tutors improve student learning outcomes, while simply giving students access to AI can lead them to accidentally shortcut the learning process.

AAAla SMITH & AI Research Desk·Mar 29, 2026·5 min read··109 views·AI-Generated·Report error

Source: x.comvia @emollickSingle Source

TL;DR

New research shows AI systems prompted to act as tutors improve student learning outcomes, while simply giving students access to AI can lead them to accidentally shortcut the learning process.

Two Studies Find AI Tutors Improve Learning, While Unrestricted AI Use Can Shortcut It

New research highlighted by Wharton professor Ethan Mollick points to a critical distinction in how artificial intelligence impacts education. One study found that simply giving students access to AI tools can lead them to inadvertently shortcut the learning process. However, both that study and a separate randomized controlled trial (RCT) found that when AI systems are specifically prompted to act as tutors, they demonstrably improve learning outcomes.

The findings, shared via social media, reference research involving team member @hamsabastani and point to a growing body of evidence that the design and prompting of AI educational tools are as important as their availability.

What the Research Shows

The core insight is not that AI is inherently good or bad for learning, but that its impact is mediated by its implementation. The first study suggests a potential pitfall: when students are given unrestricted access to generative AI (like ChatGPT for answering questions or solving problems), they may use it to bypass the cognitive effort required for genuine understanding. This "shortcut" can undermine the learning objectives of an assignment or course.

In contrast, the second study—a more rigorous randomized controlled trial—demonstrates a positive effect. Here, the AI was not a general-purpose tool but was specifically designed and prompted to function as a tutor. This means the AI was likely guided to use Socratic questioning, provide hints instead of answers, assess understanding, and adapt to the student's pace—mimicking proven pedagogical techniques.

The results indicate that this structured, tutor-like interaction successfully improved learning compared to a control group, validating a targeted application of AI in education.

The Critical Role of Prompting and Design

This dichotomy highlights a central theme in applied AI: the output is dictated by the input. An AI model is a powerful but directionless engine. Telling it to "solve this math problem" provides a correct answer but may not teach the student. Telling it to "act as a patient tutor and guide me to the solution" changes the entire interaction.

For educators and edtech developers, the implication is clear. Successfully integrating AI into learning environments requires careful instructional design. It necessitates building systems or crafting prompts that:

Scaffold learning: Break down complex problems.
Promote metacognition: Ask students to explain their reasoning.
Provide formative feedback: Focus on the process, not just the final answer.
Avoid answer-giving: Especially in initial learning phases.

gentic.news Analysis

This research arrives amid intense debate and experimentation about AI's role in education, a sector where entities like Khan Academy (with its Khanmigo AI tutor) and Duolingo have been early and aggressive integrators. The findings provide empirical weight to a design philosophy already in motion: that AI must be constrained and guided to be pedagogically effective. It directly supports the approach of Khanmigo, which is built to act as a guide and coach rather than an answer oracle.

The results also create a clear counterpoint to common, fear-based narratives about AI enabling cheating. Instead, they frame a more nuanced challenge: the risk isn't just dishonesty, but the well-intentioned use of tools that can, by being too helpful, erode the learning journey. This aligns with broader discussions in the AI community about alignment—ensuring AI systems act in accordance with human goals. In this case, the goal is deep learning, not just task completion.

For practitioners, this underscores that deploying LLMs in education is not a simple plug-and-play task. It requires a deep understanding of pedagogy to craft the guardrails and prompts that will keep the AI in a beneficial "tutor" mode. The next frontier will be measuring the long-term retention and transfer of learning from AI-tutored sessions compared to human-led instruction.

Frequently Asked Questions

Can AI really replace human tutors?

The research suggests AI can effectively augment and scale certain tutoring functions, particularly for foundational knowledge and practice. However, it does not address the complex motivational, emotional, and deeply interpersonal aspects of learning that human tutors excel at. The most likely near-term future is hybrid models, where AI handles drill-and-practice and initial guidance, freeing human educators for higher-level mentorship.

What's the difference between an AI tutor and just asking ChatGPT for help?

The difference is entirely in the prompting and system design. A standard ChatGPT query like "What's the answer to this calculus problem?" provides a solution. An AI tutor uses a foundational prompt such as "You are a patient math tutor. Do not give the student the answer. Ask guiding questions to help them discover the solution themselves. Assess their understanding at each step." The latter requires careful design and testing to ensure the AI consistently adheres to the tutoring role.

How can educators prevent students from using AI to shortcut learning?

The research implies that prohibition is less effective than redirection. Instead of banning AI, educators can design assignments where the process is the product (e.g., "submit your dialogue with an AI tutor explaining this concept"), use AI tools that are locked into a tutoring mode, or focus assessment on in-class, AI-free demonstrations of skills built with the aid of AI tutoring outside of class.

Are there specific AI tutoring platforms available now?

Yes, several platforms are building on this concept. Khan Academy's Khanmigo is a leading example, acting as a guide within its learning system. Other startups are developing specialized AI tutors for coding, language learning, and test preparation. The key is to look for platforms that emphasize dialogue, questioning, and step-by-step guidance over simply delivering answers.

Source: gentic.news · Mar 29, 2026 · author=Ala SMITH · citation.json

AI-assisted reporting. Generated by gentic.news from multiple verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

This research provides crucial, evidence-based nuance to the often-polarized debate about AI in education. It moves beyond the simplistic binary of 'AI helps' vs. 'AI cheats' and introduces implementation as the critical variable. The positive results from the AI tutor RCT are significant because they mirror findings from decades of educational research on one-on-one human tutoring (often summarized as the '2 sigma problem' identified by Benjamin Bloom). If AI can reliably replicate some of that gain at scale, it represents a major step forward for personalized education. The negative finding—that unrestricted AI can shortcut learning—is arguably more important for immediate policy. It warns against a naive 'toolification' of LLMs in classrooms. Simply providing access to ChatGPT without pedagogical structuring may inadvertently harm educational outcomes, even with academically honest students. This creates a urgent design challenge for edtech: how to build interfaces and default prompts that keep AI in a 'tutor mode' and resist the user's natural inclination to seek direct answers. For the AI engineering community, this underscores the importance of **system prompts**, **reinforcement learning from human feedback (RLHF)**, and potentially **domain-specific fine-tuning** to create agents that are robustly helpful in an educational context. It's not enough to have a capable model; you need to align its behavior with the complex, long-term goal of human understanding. Future work should investigate which specific tutoring techniques (e.g., wait time, misconception targeting, analogical reasoning) current LLMs can best emulate and where they still fall short.

#llm applications #research #ai #education

Mentioned in this article

Ethan Mollick

Enjoyed this article?

Get the weekly AI intelligence briefing

✨AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

AI Research

Anthropic Teaches Claude Why: New Interpretability Method Deployed

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

Original research · EUMAS 2026

MNEMA — A Witness Lattice for Multi-Agent AI Memory

Cryptographic memory units · 1−α detection floor · 15 pp PDF

Field framework · v1.0

Epistemic Infrastructure

12 pillars · 11-stage knowledge metabolism · pathology catalog

More in AI Research

View all

A researcher analyzes a diagram of a neural network with highlighted connections being removed, representing LLM…

AI Research

Pruning LLMs for Edge Triples Bias, Perplexity Hides Damage

Pruning LLMs for edge deployment amplifies bias up to 83.7% while perplexity barely changes, revealing a paradox that undermines standard evaluation practices.

arxiv.org/1d ago/3 min read/Widely Reported

ai safetymodel compressionedge ai

Satellite image of patchwork agricultural fields in various shades of green and brown, with geometric boundaries…

AI Research

Prithvi-EO Fails Cross-Country Crop Yield Generalization, Paper Shows

Prithvi-EO and ViT-Base embeddings yield universally negative R² under cross-country maize yield prediction, failing to beat traditional spectral features due to yield distribution shift.

arxiv.org/1d ago/3 min read

earth-observationfoundation-modelsarxiv

A sleek metallic humanoid robot with glowing blue eyes gestures toward a floating holographic interface displaying…

AI Research

Thinking Machines Unveils Native Multimodal Interaction Model

Thinking Machines unveiled a native interaction model that simultaneously listens, sees, speaks, interrupts, reacts, thinks in background, and uses tools. The approach targets the fundamental turn-based bottleneck of current AI assistants.

x.com/1d ago/3 min read

startupsai modelsmultimodal ai

What the Research Shows

The Critical Role of Prompting and Design

gentic.news Analysis

Frequently Asked Questions

Can AI really replace human tutors?

What's the difference between an AI tutor and just asking ChatGPT for help?

How can educators prevent students from using AI to shortcut learning?

Are there specific AI tutoring platforms available now?

AI Analysis

✨AI Toolslive

Related Articles

Simple Graph Heuristic Beats Generative Recommenders on 10 of 14 Benchmarks

RRCM Uses GRPO to Decide When to Retrieve for LLM Recommendation

Claude Code's Six-Layer Architecture: Harness, Not Magic

MCP vs CLI Debate Resolved by Anthropic's Code Mode: 98.7% Token Drop

Two-Tower vs Vector DB + LLM: Which Wins for RecSys at Scale?

Anthropic Teaches Claude Why: New Interpretability Method Deployed

The framework underneath this story

More in AI Research

Pruning LLMs for Edge Triples Bias, Perplexity Hides Damage

Prithvi-EO Fails Cross-Country Crop Yield Generalization, Paper Shows

Thinking Machines Unveils Native Multimodal Interaction Model