
Post-Test Coach
Post-Test Coach
AI Math Tutors: 99% Accuracy with Solution Injection
TL;DR
The solution is: Achieved 99% accuracy on 4th-8th grade math problems by pre-computing solutions and injecting them into GPT-3.5 prompts, overcoming LLM reasoning limitations
Reduced feedback delivery from 48 hours to seconds through real-time Edulastic API integration, enabling students to reflect while reasoning is fresh
Pilot results showed 75-97% test score improvements with AI-powered Post-Test Coach providing immediate, personalized tutoring at scale
The Challenge
Students forget their reasoning within hours of taking a test. Traditional coaching models deliver feedback 48 hours later, when the mental context is gone. Human coaches can't scale to provide immediate, personalized feedback for every student on every problem. This timing gap undermines learning effectiveness.
Partnered with Alpha to build Post-Test Coach, an AI-powered tutoring system that delivers immediate feedback the moment students complete assessments. The challenge wasn't just speed. GPT-3.5 struggles with mathematical reasoning, especially multi-step problems. Off-the-shelf LLMs give wrong answers or skip steps, making them unreliable for education.
GPT-3.5 can't reliably solve multi-step math problems. It hallucinates steps, skips logic, and produces plausible-sounding wrong answers. This isn't acceptable in education where accuracy matters.
We needed 99%+ accuracy to match or exceed human coach consistency. Testing revealed that direct prompting failed on complex problems. The model would get arithmetic right but lose track of algebraic manipulation or skip validation steps.
Key Results
99% accuracy on 4th-8th grade math problems
100% accuracy on pilot questions
75-97% test score improvements
Feedback delivery reduced from 48 hours to seconds
92% accuracy for away-from-seat detection
30% decrease in off-task time with monitoring
3-6 second response times
F1 scores close to 100% on engagement classifiers
Frequently Asked Questions
Published: · Last updated:
Ready to build something amazing?
Let's discuss how we can help transform your ideas into reality.