Google DeepMind lineup overview: capabilities, latency profiles, and where each model fits inside the 4032.ai bridge.
Lineup
Compare the models from Google DeepMind side by side. Look at tiers, latency, pricing, and where they slot into your workloads.
Interactive latency with streaming enabled by default
Balanced multimodal Gemini model that blends quality, speed, and long-context reasoning.
Very low; designed for real-time experiences
Speed-focused Gemini tier for high-traffic workloads with strong multimodal coverage.