United States

Microsoft

Microsoft integrates AI across Azure, Copilot products, and enterprise software, leveraging OpenAI models and its own Phi models for multimodal and enterprise AI.

Products: 0
Models: 4
Available: 0
Benchmarks: 15

Region: United States

Updated: May 14, 2026

Product coverage

Products from this provider

0
No products have been linked to this provider yet.

Model coverage

Models from this provider

4

Phi-3 Mini Instruct

Phi-3 Mini Instruct 3.8B

Phi-3 Mini is a compact, 3.8-billion-parameter language model from Microsoft's Phi-3 family, optimized for efficient operation on resource-constrained devices. It supports a context window of up to 128K tokens and is designed for fast inference, making it suitable for edge deployment and mobile applications.

Coding, Reasoning, Fast, Cheap, Long context

Input / 1M tokens: $0.00
Artificial Analysis Intelligence Index: 10.1

Phi-4

Phi-4 is a small language model from Microsoft's Phi series, designed for strong reasoning and coding capabilities while maintaining low latency and cost. It is optimized for efficiency and practical deployment in resource-constrained environments.

Coding, Reasoning, Fast, Cheap

Input / 1M tokens: $0.125
Output tokens/s: 37.74
First-token seconds: 0.54s
Artificial Analysis Intelligence Index: 10.4

Phi-4

Phi-4 Mini Instruct

Phi-4 Mini is a lightweight, efficient small language model from Microsoft's Phi series, optimized for high performance on resource-constrained devices. It excels at instruction following, reasoning, and code generation tasks while maintaining a small footprint.

Fast, Cheap, Reasoning, Coding

Input / 1M tokens: $0.00
Output tokens/s: 44.19
First-token seconds: 0.34s
Artificial Analysis Intelligence Index: 8.4

Phi-4

Phi-4 Multimodal Instruct

A multimodal model from Microsoft's Phi-4 family, designed for efficient reasoning and instruction following across text, image, and audio inputs. It emphasizes strong performance on complex tasks while maintaining a relatively small and fast architecture.

Multimodal, Fast, Reasoning

Input / 1M tokens: $0.00
Output tokens/s: 14.33
First-token seconds: 2.25s
Artificial Analysis Intelligence Index: 10
