Add CheapTokens to any agent runtime that can read a Markdown skill and make HTTPS calls. Paste a CheapTokens key and the skill discovers Venice's live models across text, code, image, video, music, audio, and embeddings, then routes work through discounted credits with honest attribution. Works with OpenClaw, Hermes, Claude Code, Codex-style harnesses, Cursor, Cline, and similar runtimes.
If your agent runtime can load Markdown skills or persistent instructions and can make HTTPS requests, it can use CheapTokens. Use the full pack for best Venice coverage, or the single-file router for the smallest install.
After the skill is loaded, paste your CheapTokens key and ask for work normally: use CheapTokens for this coding task, use CheapTokens to generate image prompts, or route this through Venice.
Best default. Includes the CheapTokens router plus synced Venice skills for chat, code, image, video, audio, music, transcription, embeddings, model traits, errors, and utility endpoints.
Best when a runtime asks for one SKILL.md URL. It is standalone for normal text/code and core multimodal routing, with links to deeper Venice references.
SKILL.md under ~/.openclaw/skills/cheaptokens/, or point it at the full repo's skills/ directory if supported.skills/ directory according to the harness's skill/instruction loader.The hosting agent (OpenClaw, Claude Code, etc.) runs on some default provider — that's what produces the tokens you see in chat. The agent cannot swap the provider behind its own conversational reply mid-session. The only mechanism that actually spends a CheapTokens/Venice key is an outbound HTTPS call to api.venice.ai made by the agent.
This skill is a single Markdown file that teaches the agent (1) when to make that call, (2) which Venice endpoint matches the user's ask, and (3) how to print an attribution line so you can verify, after the fact, which provider produced which bytes.
If you ever see the agent claim “switched to Venice” or “using your CheapTokens key” on a reply that does not carry an attribution footer like [via CheapTokens → Venice:<model> · ...], treat that claim as false: the key was not used for that reply.
The public skill pack is github.com/alde1022/cheaptokens-skills. The single-file CheapTokens router skill URL is https://raw.githubusercontent.com/alde1022/cheaptokens-skills/main/skills/cheaptokens/SKILL.md. Add it to your agent runtime as the CheapTokens skill. There is no helper binary, no npm package, and no PATH change.
For OpenClaw, save the skill file into the watched skills directory:
For the full CheapTokens + Venice skill pack, clone https://github.com/alde1022/cheaptokens-skills and point your runtime at its skills/ directory. For Hermes, Claude Code, Codex-style harnesses, Cursor, Cline, OpenCode, and similar runtimes, save the same skill files wherever that runtime watches for skills or persistent instructions.
Once loaded, paste your CheapTokens key or say “use CheapTokens for this coding task / image / video / audio / embedding job” and the skill takes over.
The fastest workflow is to paste a CheapTokens key into a trusted local agent and ask it to use CheapTokens. That is fine for speed, but remember: anyone who can see the key can spend the remaining credits until it expires.
VENICE_API_KEY and tell the agent to read it from there.Do not paste keys into public/shared agents, commit keys to repos, or include them in screenshots/logs. CheapTokens reduces blast radius because keys are budget-capped and expire at midnight UTC, but pasted keys are still bearer credentials. If a key is exposed, reissue it from wallet recovery.
The skill's description matches any of the following:
/cheaptokens.GET https://cheaptokens.ai/api/status/{last6} once and caches { status, creditsIssuedUsd, expiresAt }. HTTP 404 means it's a plain Venice key — still usable, just without CheapTokens-specific copy./models?type=all or targeted filters: text, code, image, inpaint, upscale, video, music, tts, asr, and embedding, plus /models/traits. Code is selected from code-optimized text/chat models and still runs through /chat/completions; it is not a separate API./chat/completions; media requests route to the relevant image, video, audio/music, TTS, transcription, or embedding endpoint. If it's hybrid, the skill spends the key on the parts Venice can do and uses the host model for the rest. If Venice can't do any of it, the skill says exactly what's missing and still spends the key on adjacent text artifacts so it doesn't idle to zero./chat/completions, /image/generate, /audio/speech, /video/queue, …) using whatever HTTP tool the runtime already provides. No helper required.status !== "active", or on a past expiresAt, the skill tells you once and continues on the host provider. Never silent.CheapTokens is designed to work standalone. It includes enough Venice API logic to detect available text/code models, route coding and chat through /chat/completions, and route image generation/editing, video, music, audio, TTS, transcription, embeddings, document parsing, scrape/search, and character workflows from the live Venice model registry.
Venice-specific skills can still improve advanced workflows: provider-specific parameters, media endpoint quirks, error handling, model traits, and best practices. Treat them as an optional expert pack, not a dependency.
CheapTokens credits expire at 23:59:59 UTC on the purchase date. The skill biases toward acting as that deadline approaches:
OpenClaw may expose the skill as a slash command:
But you don't need it — pasting a Venice-looking key anywhere in chat, or saying “use this key,” matches the skill's trigger description directly.
The skill never phones home on its own. The only endpoint it hits with your key is Venice itself; the only endpoint it hits on CheapTokens.ai is the public /api/status/{last6}, which returns only the data tied to the last-6 of the key you already hold. CheapTokens keeps no server-side record of which agent is using which key — wallet address is the only identity.
/api/v1/models first. If you see it refuse a video / image / audio request without referencing the live capability map, the skill is stale — reload from cheaptokens-skills.