Phi 4 reasoning
microsoft/Phi-4-reasoning
Open Source · chat · open-weights
Context
32.8K
Max output
—
Pricing
Open weights
Modalities
text
Released
09 Apr 2025
License: MIT
AI summary
Microsoft releases Phi-4-reasoning, 14B parameter model for structured reasoning tasks
Phi-4-reasoning is a 14-billion-parameter dense decoder-only transformer, fine-tuned from Phi-4 using supervised fine-tuning on chain-of-thought traces and reinforcement learning. The model targets math, science, and code reasoning tasks, has a 32k context window, and is optimized for two-part responses: a reasoning trace followed by a solution. It achieves strong results on reasoning benchmarks such as AIME, OmniMath, and LiveCodeBench, and outperforms many larger models on these tasks.
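Because the reasoning trace and the solution are both generated tokens, they share the context window with the prompt. The following is a minimal sketch of that budget arithmetic, assuming the 32.8K figure listed above means 32,768 tokens; the function name and the example prompt length are hypothetical and not part of any Phi-4 tooling.

```python
# Assumption: "32.8K" in the listing corresponds to 32 * 1024 = 32,768 tokens.
CONTEXT_WINDOW = 32_768


def max_new_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for the reasoning trace plus the solution."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # Prompt and generated tokens share one window, so subtract and clamp at zero.
    return max(context_window - prompt_tokens, 0)


# Hypothetical 1,200-token prompt.
print(max_new_tokens(1_200))  # 31568
```

Long reasoning traces can consume most of this budget, so it may be worth checking the remaining allowance before sending lengthy prompts.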
What's new
- Fine-tuned from Phi-4 with supervised fine-tuning on chain-of-thought traces and reinforcement learning
- Targets math, science, and code reasoning tasks
- 32k context window with high inference efficiency
- Released under MIT license
- Optimized for two-part format: reasoning trace followed by solution
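The two-part format in the last item means downstream code usually wants to separate the reasoning trace from the final solution. The following is a minimal sketch, assuming the trace is wrapped in `<think>` and `</think>` tags; that delimiter is an assumption not stated in this listing, and `split_response` and the sample text are hypothetical.

```python
import re

# Assumption: the reasoning trace is delimited by <think> ... </think>.
_THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_response(text: str) -> tuple[str, str]:
    """Return (reasoning, solution); reasoning is empty if no trace is found."""
    match = _THINK_BLOCK.search(text)
    if match is None:
        # No trace found: treat the whole response as the solution.
        return "", text.strip()
    # Everything after the closing tag is taken as the solution.
    return match.group(1).strip(), text[match.end():].strip()


# Hypothetical model output, written by hand for illustration.
raw = "<think>2 + 2 is 4, and 4 * 3 is 12.</think>\nThe answer is 12."
reasoning, solution = split_response(raw)
print(solution)  # The answer is 12.
```

Keeping the trace separate lets an application show only the solution to end users while still logging the reasoning for inspection.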
Best for
- Math reasoning tasks
- Science problem solving
- Code reasoning and generation
- Structured step-by-step logic in latency-constrained environments