United States

Microsoft

Microsoft integrates AI across Azure, Copilot products, and enterprise software, leveraging OpenAI models and its own Phi models for multimodal and enterprise AI.

Website

Products

Models

Available

Benchmarks

Region

United States

Updated

May 29, 2026

Product coverage

Products from this provider

No products have been linked to this provider yet.

Model coverage

Models from this provider

Phi-3 Mini Instruct

Phi-3 Mini Instruct 3.8B

Phi-3 Mini is a compact, 3.8-billion parameter language model from Microsoft's Phi-3 family, optimized for high efficiency and performance on resource-constrained devices. It supports a long context window of up to 128K tokens and is designed for fast inference, making it suitable for edge deployment and mobile applications.

CodingReasoningFastCheapLong context

Input / 1M tokens

$0.00

Artificial Analysis Intelligence Index

10.1

Phi-4

Phi-4 is a small language model from Microsoft's Phi series, designed for strong reasoning and coding capabilities while maintaining low latency and cost. It is optimized for efficiency and practical deployment in resource-constrained environments.

CodingReasoningFastCheap

Input / 1M tokens

$0.125

Output tokens/s

42.43

First-token seconds

0.5s

Artificial Analysis Intelligence Index

10.4

Phi-4

Phi-4 Mini Instruct

Phi-4 Mini is a lightweight, efficient small language model from Microsoft's Phi series, optimized for high performance on resource-constrained devices. It excels at instruction following, reasoning, and code generation tasks while maintaining a small footprint.

FastCheapReasoningCoding

Input / 1M tokens

$0.00

Output tokens/s

44.19

First-token seconds

0.34s

Artificial Analysis Intelligence Index

8.4

Phi-4

Phi-4 Multimodal Instruct

A multimodal model from Microsoft's Phi-4 family, designed for efficient reasoning and instruction following across text, image, and audio inputs. It emphasizes strong performance on complex tasks while maintaining a relatively small and fast architecture.

MultimodalFastReasoning

Input / 1M tokens

$0.00

Output tokens/s

16.6

First-token seconds

1.33s

Artificial Analysis Intelligence Index

Discussion

Thinking... Make sure you are connected to GitHub server