@maddy-a

ZeroGPU Router

ZeroGPU Router plugin for OpenClaw: route trivial AI tasks (summarize, classify, redact PII, extract JSON, follow-ups) to small/nano models via MCP. Cuts agent inference costs without changing your reasoning model.

当前版本
v0.1.0
code-plugin社区source-linked

ZeroGPU Router

Cut your OpenClaw agent's inference costs. Route trivial AI tasks — summarize, classify, redact PII, extract JSON, generate follow-ups, short chat — to small/nano models instead of burning frontier-model tokens.

Website Dashboard License

What it does

Your OpenClaw agent keeps doing the heavy reasoning. Routine tasks get offloaded to ZeroGPU's small models — typically 100–1000× cheaper per call.

  • 11 task-specific MCP tools (zerogpu_summarize, zerogpu_classify_iab, zerogpu_redact_pii, …)
  • A routing skill that teaches your agent when to call each tool
  • Per-call savings logged with model, latency, and a real savings_usd figure
  • Hosted MCP server at https://mcp.zerogpu.ai/mcp — no infra to run

Install

openclaw plugins install clawhub:zerogpu-router

Set up (2 steps)

1. Get your API key

Sign in at platform.zerogpu.ai and create a project to grab an API key + project ID.

2. Register the hosted MCP server

In your OpenClaw shell:

openclaw mcp set zerogpu '{
  "url": "https://mcp.zerogpu.ai/mcp",
  "transport": "streamable-http",
  "headers": {
    "x-api-key": "<your-api-key>",
    "x-project-id": "<your-project-id>"
  }
}'

Verify:

openclaw mcp show zerogpu --json

That's it. Your agent will now route trivial tasks through ZeroGPU automatically.

Try it

Ask your agent:

summarize this paragraph: Renewable energy adoption is accelerating globally, driven by falling solar and wind costs.

The agent calls zerogpu_summarize (running on t5-small) instead of the host model and replies with the summary plus a savings line.

The 11 tools you get

ToolWorkloadBacking model
zerogpu_classify_iabIAB topic classificationzlm-v1-iab-classify-edge
zerogpu_summarizeTL;DRs, abstracts, meeting summariest5-small
zerogpu_classify_zero_shotClassify against a flat label listdeberta-v3-small
zerogpu_extract_entitiesPeople, places, companies, dates, custom entitiesgliner2-base-v1
zerogpu_extract_jsonPull structured fields into grouped JSONgliner2-base-v1
zerogpu_classify_structuredMulti-axis schema classificationgliner2-base-v1
zerogpu_redact_piiMask emails, phones, names, addresses, other PIIgliner-multi-pii-v1
zerogpu_extract_piiExtract PII grouped by categorygliner-multi-pii-v1
zerogpu_generate_followupsGenerate follow-up questions from a passagezlm-v1-followup-questions-edge
zerogpu_chatShort small-model chat repliesLFM2.5-1.2B-Instruct / -Thinking
zerogpu_healthVerify the ZeroGPU backend

Every tool returns { <task fields>, model, usage, savings }.

Watch your savings

Live dashboard at platform.zerogpu.ai — token usage, latency, per-tool savings, broken down by agent and time range.

Links

License

MIT — see LICENSE.

源码与版本

源码仓库

zerogpu/zerogpu-router

打开仓库

源码提交

a5f2515e0f80381fe876558fcf31d5ba9ed63612

查看提交

安装命令

openclaw plugins install clawhub:zerogpu-router

元数据

  • 包名: zerogpu-router
  • 创建时间: 2026/05/08
  • 更新时间: 2026/05/08
  • 执行代码:
  • 源码标签: a5f2515e0f80381fe876558fcf31d5ba9ed63612

兼容性

  • 构建于 OpenClaw: 2026.4.27
  • 插件 API 范围: >=2026.4.0
  • 标签: latest
  • 文件数: 6