Prometheus Spark
A compact model for chat, simple actions, classification, and flows where latency matters.
- Embedded assistants
- Routing and short summaries
- Lightweight automation
Prometheus family
A model line for assistants, agents, transcription, semantic search, and reasoning workflows, offering stable names, predictable latency, and a consistent developer experience.
Model line
Prometheus separates speed, depth, audio, and semantic representation so every product can use the right model without exposing provider names or internal implementation details.
A compact model for chat, simple actions, classification, and flows where latency matters.
A balance of cost and quality for high-volume product responses and agents that run all day.
The main model for complex tasks, deep analysis, planning, and multi-step agents.
Speech transcription and understanding for turning meetings, messages, and calls into actionable data.
Vector representations for search, recommendations, context retrieval, and agent memory.
Designed for production
Prometheus lets teams think in capabilities: speed, depth, voice, or embeddings. Behind the interface, the platform can optimize the engine for each task without changing the public product contract.
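One way to picture that contract is a small routing table: callers ask for a capability and get back a stable public name, while the engine behind it can be replaced. The sketch below is illustrative only and is not the platform's implementation. The `CapabilityRouter` class, the `engine-a` / `engine-b` identifiers, the `prometheus-spark` identifier format, and the `prometheus-embeddings-example` name are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    public_name: str  # stable name that callers depend on
    engine: str       # internal engine id (hypothetical), free to change


class CapabilityRouter:
    """Maps a capability (speed, depth, voice, embeddings) to a route."""

    def __init__(self) -> None:
        self._routes: dict[str, Route] = {}

    def register(self, capability: str, public_name: str, engine: str) -> None:
        self._routes[capability] = Route(public_name, engine)

    def swap_engine(self, capability: str, engine: str) -> None:
        # Replace the engine but keep the public name untouched.
        current = self._routes[capability]
        self._routes[capability] = Route(current.public_name, engine)

    def resolve(self, capability: str) -> Route:
        return self._routes[capability]


router = CapabilityRouter()
# All identifiers below are placeholders chosen for this sketch.
router.register("speed", "prometheus-spark", "engine-a")
router.register("embeddings", "prometheus-embeddings-example", "engine-c")

name_before = router.resolve("speed").public_name
router.swap_engine("speed", "engine-b")  # platform-side optimization
name_after = router.resolve("speed").public_name
```

In this sketch, `name_before` and `name_after` are identical even though the engine changed, which is the property the paragraph above describes: product code keeps addressing the capability by the same name.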
Turn voice into useful text for support, sales, and operations.
Find knowledge with embeddings and semantic context.
Plan, compare, and execute tasks with deeper models.
Deliver fast interactions with models optimized for product use.