The provider agent
The provider agent (usepod-agent) is a small standalone binary that connects
your inference hardware to the Use Pod coordinator. It is open source so
operators can audit exactly what they run.
Supported backends
Section titled “Supported backends”The agent discovers and dispatches to local OpenAI-compatible backends:
| Backend | Default port |
|---|---|
| vLLM | 8000 |
| llama.cpp server | 8080 |
| LM Studio | 1234 |
| Ollama | 11434 |
Lifecycle
Section titled “Lifecycle”- Identity. On first run the agent generates an Ed25519 keypair, persisted
locally with
0600permissions. This key is your provider identity. - Connect. The agent dials an outbound WebSocket to the coordinator
(
wss://api.usepod.ai/provider/connect). No inbound ports are required. - Authenticate. A signed challenge/response binds the connection to your provider record (enrolling on first connect via the enrollment code).
- Advertise. The agent sends a
capabilitiesmessage — your models, prices, max concurrency, and backend kind — which the coordinator records and marks you online. - Serve. The coordinator dispatches jobs; the agent calls your local backend and streams the response bytes back. Heartbeats keep your health and metrics fresh.
Reconnection
Section titled “Reconnection”If the connection drops — for example during a coordinator deploy — the agent reconnects automatically with backoff and re-advertises its capabilities. A clean coordinator shutdown sends a close frame so the agent treats it as a planned cycle rather than an outage.
Keeping current
Section titled “Keeping current”Existing hosts do not update automatically. To upgrade:
usepod-agent upgradesudo systemctl restart usepod-agent # or restart your process managerusepod-agent version