Phi 4 reasoning

Name: Phi 4 reasoning
Brand: Open Source
Author: Open Source

microsoft/Phi-4-reasoning

Open Source · chat · open-weights

GA Alert me on changes

Context

32.8K

Max output

—

Pricing

Open weights

Modalities

text

Released

09 Apr 2025

License: mit · microsoft/Phi-4-reasoning

AI summary

● machine-written

Microsoft releases Phi-4-reasoning, 14B parameter model for structured reasoning tasks

Phi-4-reasoning is a 14 billion parameter dense decoder-only transformer fine-tuned from Phi-4 using supervised fine-tuning on chain-of-thought traces and reinforcement learning. The model targets math, science, and code reasoning tasks with a 32k context window and is optimized for structured two-part responses. It achieves strong results on specialized benchmarks like AIME, OmniMath, and LiveCodeBench, outperforming many larger models in structured reasoning tasks.

What's new

Fine-tuned from Phi-4 with chain-of-thought and reinforcement learning
Targets math, science, and code reasoning tasks
32k context window with high inference efficiency
Released under MIT license
Optimized for two-part format: reasoning trace followed by solution

Best for

Math reasoning tasksScience problem solvingCode reasoning and generationStructured step-by-step logic in latency-constrained environments

Sources

Source: https://huggingface.co/microsoft/Phi-4-reasoning