CheapTokens.ai sells discounted Venice.ai API credits with transparent, real-time pricing. You get a real Venice INFERENCE API key and call Venice's API directly. Full Venice API docs at docs.venice.ai.
CheapTokens.ai sells discounted Venice.ai API credits using time-decay dynamic pricing. When you purchase credits, we generate a real Venice INFERENCE API key for you via Venice's key management API. You then call Venice's API directly — we are not a proxy.
api.venice.ai/api/v1Use the Venice API key you received from CheapTokens.ai in the Authorization header as a Bearer token.
Your key has a credit usage limit set by your purchase. Venice tracks usage automatically. Credits expire at midnight UTC.
Your key works directly with Venice's API. Use their OpenAI-compatible endpoint:
If you are using the OpenAI SDK, set the base_url (Python) or baseURL (Node.js) configuration option. See the full Venice API docs for all available parameters.
Create a chat completion. The request format is identical to the OpenAI Chat Completions API.
Venice offers a variety of open-source and proprietary models. Use the models endpoint to list available options.
Available models change over time. Use GET /models to fetch the live catalog instead of hardcoding IDs in your app. See the full list at docs.venice.ai.
The underlying credit supply and relay flow has an independent security review available. The public product sells Venice.ai API credits; this audit covers part of the backing infrastructure.
Notable items in this review include one medium-severity operational issue, one low-severity accounting/dust issue, and several hardening recommendations. Customer-facing CheapTokens keys remain direct Venice API keys, not a CheapTokens proxy.
Set "stream": true to receive responses as Server-Sent Events (SSE).
Rate limits are managed by Venice based on your API key type. Your key has a credit usage limit set at purchase time.
| Limit | Details |
|---|---|
| Credit usage | Per your purchase |
| Key expiry | Midnight UTC |
| Request limits | Set by Venice |
Venice uses standard HTTP status codes. Error responses include a JSON body with details.
| Code | Description |
|---|---|
| 401 | Invalid, expired, or revoked API key |
| 402 | Insufficient credits / payment required |
| 403 | Not entitled / forbidden model or feature |
| 429 | Rate limit exceeded |
| 500 | Internal server error |
Buy discounted Venice AI API credits and start making calls in minutes.
Buy Credits