Appendix A: Supported Models Reference

Complete list of models available in Azimuth Orion (as of plugin version 1.0):

xAI (Grok)

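| Model ID | Display Name | Context Window | Max Output | Vision | Cost Tier | Default Timeout |
|---|---|---|---|---|---|---|
| grok-code-fast-1 | Grok Code Fast | 128k tokens | 32k | No | Very Low | 60s |
| grok-3 | Grok 3 | 131k tokens | 32k | No | Low | 75s |
| grok-4 | Grok 4 | 256k tokens | 32k | No | Medium | 90s |
| grok-4-turbo | Grok 4 Turbo | 256k tokens | 32k | No | Medium | 90s |
| grok-vision | Grok Vision | 128k tokens | 32k | Yes | Medium | 75s |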

Notes:

  • grok-code-fast-1: Best value for general UE5 questions
  • grok-vision: Best xAI option for screenshots

OpenAI

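| Model ID | Display Name | Context Window | Max Output | Vision | Cost Tier | Default Timeout |
|---|---|---|---|---|---|---|
| gpt-4o | GPT-4o | 128k tokens | 16k | Yes | High | 60s |
| gpt-4o-mini | GPT-4o Mini | 128k tokens | 16k | Yes | Low | 60s |
| gpt-4-turbo | GPT-4 Turbo | 128k tokens | 4k | Yes | High | 60s |
| gpt-3.5-turbo | GPT-3.5 Turbo | 16k tokens | 4k | No | Very Low | 30s |
| o1 | o1 | 128k tokens | 32k | No | Very High | 90s |
| o1-mini | o1 Mini | 128k tokens | 32k | No | High | 60s |
| o3-mini | o3 Mini | 128k tokens | 32k | No | High | 60s |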

Notes:

  • GPT-4o: Industry standard for vision, excellent quality
  • GPT-4o mini: Budget-friendly with vision support
  • o1 series: Advanced reasoning at High to Very High cost, no vision support
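
The notes above amount to a selection rule: among the models that support vision, prefer the lowest cost tier. A minimal sketch of that rule follows, using the Vision and Cost Tier values from the OpenAI table and the tier ordering given in the Legend. The names `ModelRow`, `OPENAI_MODELS`, and `cheapest_vision_model` are hypothetical and are not part of the plugin.

```python
from dataclasses import dataclass

# Cost tiers in ascending order, as stated in the Legend.
COST_TIERS = ["Very Low", "Low", "Medium", "High", "Very High"]


@dataclass(frozen=True)
class ModelRow:
    """Hypothetical record holding two columns of the table."""
    model_id: str
    vision: bool
    cost_tier: str


# Vision and Cost Tier values copied from the OpenAI table.
OPENAI_MODELS = [
    ModelRow("gpt-4o", True, "High"),
    ModelRow("gpt-4o-mini", True, "Low"),
    ModelRow("gpt-4-turbo", True, "High"),
    ModelRow("gpt-3.5-turbo", False, "Very Low"),
    ModelRow("o1", False, "Very High"),
    ModelRow("o1-mini", False, "High"),
    ModelRow("o3-mini", False, "High"),
]


def cheapest_vision_model(rows):
    """Return the lowest-cost vision-capable row, or None if there is none."""
    candidates = [row for row in rows if row.vision]
    if not candidates:
        return None
    # Rank by position in COST_TIERS; ties keep the first row listed.
    return min(candidates, key=lambda row: COST_TIERS.index(row.cost_tier))


print(cheapest_vision_model(OPENAI_MODELS).model_id)  # gpt-4o-mini
```

The same function could be applied to the rows of any other provider table, since every table shares the same columns.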

Anthropic (Claude)

| Model ID | Display Name | Context Window | Max Output | Vision | Cost Tier | Default Timeout |
|---|---|---|---|---|---|---|
| claude-opus-4 | Claude Opus 4 | 200k tokens | 16k | Yes | Very High | 90s |
| claude-sonnet-4-5 | Claude Sonnet 4.5 | 200k tokens | 16k | Yes | High | 75s |
| claude-3-5-sonnet | Claude 3.5 Sonnet | 200k tokens | 8k | Yes | Medium | 75s |
| claude-haiku-4 | Claude Haiku 4 | 200k tokens | 8k | Yes | Low | 60s |

Notes:

  • Claude Opus 4: Top-tier quality; the 200k context window accommodates very large Blueprints
  • Claude 3.5 Sonnet: Excellent balance of speed, cost, and quality

Google (Gemini)

| Model ID | Display Name | Context Window | Max Output | Vision | Cost Tier | Default Timeout |
|---|---|---|---|---|---|---|
| gemini-pro-1.5 | Gemini Pro 1.5 | 1M+ tokens | 32k | Yes | Medium | 90s |
| gemini-flash-1.5 | Gemini Flash 1.5 | 1M+ tokens | 32k | Yes | Low | 60s |
| gemini-pro-vision | Gemini Pro Vision | 128k tokens | 8k | Yes | Medium | 75s |

Notes:

  • Gemini Pro 1.5: Massive context window (1M+ tokens, enough to fit entire codebases)
  • Gemini Flash 1.5: Fast and low cost, with the same 1M+ token context window

Legend

Context Window: Maximum input tokens (your question + attached context)
Max Output: Maximum completion tokens (AI response length)
Vision: Supports image input (screenshots)
Cost Tier: Relative pricing (Very Low < Low < Medium < High < Very High)
Default Timeout: Recommended HTTP timeout for this model
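
To illustrate how the Legend fields relate to a request, here is a minimal sketch that checks whether a question plus its attached context fits a model's context window, then returns the output cap and timeout to use. It is an illustration only, not the plugin's implementation: `ModelLimits`, `LIMITS`, `fits_context`, and `request_settings` are hypothetical names, token counts are assumed to be supplied by the caller, and "k" is read as 1,000 tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelLimits:
    """Hypothetical record for three Legend columns."""
    context_window: int  # maximum input tokens (question + attached context)
    max_output: int      # maximum completion tokens
    timeout_s: int       # recommended HTTP timeout, in seconds


# Two rows copied from the tables; "k" is assumed to mean 1,000 tokens.
LIMITS = {
    "gpt-3.5-turbo": ModelLimits(16_000, 4_000, 30),
    "claude-opus-4": ModelLimits(200_000, 16_000, 90),
}


def fits_context(limits, question_tokens, context_tokens):
    """True if the question and attached context fit the context window."""
    return question_tokens + context_tokens <= limits.context_window


def request_settings(model_id, question_tokens, context_tokens):
    """Return the output cap and timeout for a request, or raise if it is too large."""
    limits = LIMITS[model_id]
    if not fits_context(limits, question_tokens, context_tokens):
        raise ValueError(
            f"{question_tokens + context_tokens} input tokens exceed the "
            f"{limits.context_window}-token context window of {model_id}"
        )
    return {
        "model": model_id,
        "max_tokens": limits.max_output,
        "timeout": limits.timeout_s,
    }


print(request_settings("gpt-3.5-turbo", 500, 10_000))
# {'model': 'gpt-3.5-turbo', 'max_tokens': 4000, 'timeout': 30}
```

A request with 500 question tokens and 20,000 context tokens would raise `ValueError` for gpt-3.5-turbo but pass for claude-opus-4, which is the kind of case where switching to a larger-context model makes sense.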