Skip to main content

Appendix A: Supported Models Reference

Complete list of models available in Azimuth Orion (as of plugin version 1.0):

Note: Not all models may be available for your API key. Access depends on the restrictions and usage tier associated with your key (e.g., free vs. paid, rate limits, model eligibility). If a model fails with a rate limit or access error, try another model or check your provider's dashboard for your key's capabilities.

xAI (Grok)

Model IDDisplay NameContext WindowMax OutputVisionCost TierDefault Timeout
grok-4Grok 4256k tokens131kYesHigh90s
grok-4-1-fast-reasoningGrok 4.1 Fast Reasoning2M tokens131kYesVery Low45s
grok-4-1-fast-non-reasoningGrok 4.1 Fast2M tokens131kYesVery Low30s
grok-code-fast-1Grok Code Fast256k tokens32kNoLow60s

Notes:

  • grok-code-fast-1: Code-optimized, agentic coding (default for xAI)
  • grok-4: Flagship, highest quality reasoning
  • grok-4-1-fast-reasoning / grok-4-1-fast-non-reasoning: 2M context, vision support

OpenAI

Model IDDisplay NameContext WindowMax OutputVisionCost TierDefault Timeout
gpt-5.2GPT-5.2400k tokens128kYesHigh90s
gpt-5-miniGPT-5 mini400k tokens128kYesLow60s
gpt-5-nanoGPT-5 nano400k tokens128kYesVery Low45s
gpt-5.3-codexGPT-5.3 Codex400k tokens128kYesHigh90s
o4-minio4-mini200k tokens100kYesLow60s

Notes:

  • gpt-5.2: Best for coding and agentic tasks (flagship)
  • gpt-5-mini: Faster, cost-efficient for well-defined tasks
  • o4-mini: Fast reasoning model with vision

Anthropic (Claude)

Model IDDisplay NameContext WindowMax OutputVisionCost TierDefault Timeout
claude-opus-4-6Claude Opus 4.6200k tokens128kYesVery High120s
claude-sonnet-4-6Claude Sonnet 4.6200k tokens64kYesMedium75s
claude-haiku-4-5Claude Haiku 4.5200k tokens64kYesLow45s

Notes:

  • claude-sonnet-4-6: Best speed/intelligence balance (default for Anthropic, Feb 2026)
  • claude-opus-4-6: Most intelligent, Adaptive Thinking
  • claude-haiku-4-5: Fastest Claude model, lowest cost

Google (Gemini)

Model IDDisplay NameContext WindowMax OutputVisionCost TierDefault Timeout
gemini-3-pro-previewGemini 3 Pro (Preview)1M tokens65kYesHigh90s
gemini-3-flash-previewGemini 3 Flash (Preview)1M tokens65kYesLow60s
gemini-3.1-pro-previewGemini 3.1 Pro (Preview)1M tokens65kYesHigh90s
gemini-2.5-proGemini 2.5 Pro1M tokens65kYesMedium90s
gemini-2.5-flashGemini 2.5 Flash1M tokens65kYesLow60s

Notes:

  • gemini-2.5-flash: Price-performance (default for Google)
  • gemini-2.5-pro: Advanced reasoning (Stable)
  • gemini-3.x: Preview models with 1M context

Legend

Context Window: Maximum input tokens (your question + attached context)
Max Output: Maximum completion tokens (AI response length)
Vision: Supports image input (screenshots)
Cost Tier: Relative pricing (Very Low < Low < Medium < High < Very High)
Default Timeout: Recommended HTTP timeout for this model