Authorization: Bearer. See the GMI Cloud API docs for provider-specific parameters.
Supports both managed (Lava’s API keys) and unmanaged (bring your own credentials) mode.
Quick Start
Chat Completions
Target URL:https://api.gmi-serving.com/v1/chat/completions
| Content Type | application/json |
| Streaming | Yes (set stream: true in request body) |
| Model | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| deepseek-ai/DeepSeek-R1-0528 | $0.80 | $2.30 |
| deepseek-ai/DeepSeek-V3-0324 | $0.74 | $0.90 |
| deepseek-ai/DeepSeek-Prover-V2-671B | $0.50 | $2.18 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.50 | $0.90 |
| Qwen/QwQ-32B | $0.50 | $1.50 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.25 | $0.75 |
| meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.25 | $0.80 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.20 | $0.20 |
| Qwen/Qwen3-235B-A22B-FP8 | $0.17 | $1.09 |
| Qwen/Qwen3-32B-FP8 | $0.10 | $0.60 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.10 | $0.20 |
Next Steps
All Providers
Browse all supported AI providers
Forward Proxy
Learn how to construct proxy URLs and authenticate requests