Phi 4 reasoning

microsoft/Phi-4-reasoning
Open Source · chat · open-weights
GA
Context
32.8K
Max output
Pricing
Open weights
Modalities
text
Released
09 Apr 2025
License: MIT · microsoft/Phi-4-reasoning
AI summary
● machine-written

Microsoft releases Phi-4-reasoning, a 14B-parameter model for structured reasoning tasks

Phi-4-reasoning is a 14-billion-parameter dense decoder-only transformer, fine-tuned from Phi-4 using supervised fine-tuning on chain-of-thought traces and reinforcement learning. It targets math, science, and code reasoning, has a 32k context window, and is optimized to answer in two parts: a reasoning trace followed by a solution. It achieves strong results on AIME, OmniMath, and LiveCodeBench, outperforming many larger models on structured reasoning tasks.

What's new
  • Fine-tuned from Phi-4 with supervised fine-tuning on chain-of-thought traces and reinforcement learning
  • Targets math, science, and code reasoning tasks
  • 32k context window with high inference efficiency
  • Released under the MIT license
  • Optimized for two-part format: reasoning trace followed by solution
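
Because the model answers in two parts, downstream code usually needs to separate the reasoning trace from the solution. The following is a minimal sketch of one way to do that, not the model's official tooling. It assumes the trace is wrapped in `<think>` and `</think>` tags; check the model card for the exact delimiters. The names `ParsedResponse` and `split_response` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple


class ParsedResponse(NamedTuple):
    reasoning: str  # text of the reasoning trace, "" if none was found
    solution: str   # text that follows the reasoning trace


# Assumption: the reasoning trace is delimited by <think> ... </think>.
# DOTALL lets the trace span multiple lines.
_THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_response(text: str) -> ParsedResponse:
    """Split a two-part response into (reasoning, solution)."""
    match = _THINK_BLOCK.search(text)
    if match is None:
        # No complete trace found (for example, the output was truncated):
        # return the whole text as the solution.
        return ParsedResponse("", text.strip())
    return ParsedResponse(match.group(1).strip(), text[match.end():].strip())


example = "<think>2 + 2 is 4, and 4 * 3 is 12.</think>\nThe answer is 12."
parsed = split_response(example)
print(parsed.solution)  # prints "The answer is 12."
```

Keeping the trace separate makes it easy to log the reasoning for debugging while showing only the solution to end users.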
Best for
  • Math reasoning tasks
  • Science problem solving
  • Code reasoning and generation
  • Structured step-by-step logic in latency-constrained environments
Sources

Source: https://huggingface.co/microsoft/Phi-4-reasoning