Production-ready distributed inference cluster
Saved once and reused when loading models on any node.
Use a Hugging Face model URL. The same download command is sent to every online worker.
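
As a rough illustration of the fan-out described above, the sketch below turns a Hugging Face model URL into a repo id and builds one identical download command per online worker. Everything named here (`Worker`, `repo_id_from_url`, `build_download_commands`, and the command dictionary layout) is hypothetical and not taken from this cluster's actual code. It also assumes the URL has the `https://huggingface.co/<owner>/<name>` form.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class Worker:
    # Hypothetical worker record; the real cluster may track more fields.
    name: str
    host: str
    online: bool


def repo_id_from_url(model_url: str) -> str:
    """Extract 'owner/name' from a URL like https://huggingface.co/owner/name."""
    parsed = urlparse(model_url)
    if parsed.netloc != "huggingface.co":
        raise ValueError(f"not a Hugging Face URL: {model_url}")
    # Drop empty segments so trailing slashes and extra path parts are tolerated.
    parts = [p for p in parsed.path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"URL does not name a model repo: {model_url}")
    return "/".join(parts[:2])


def build_download_commands(model_url: str, workers: list[Worker]) -> dict[str, dict]:
    """Return the same download command for every online worker, keyed by worker name."""
    command = {"action": "download", "repo_id": repo_id_from_url(model_url)}
    # Copy the command per worker so later per-worker changes cannot leak across nodes.
    return {w.name: dict(command) for w in workers if w.online}
```

Offline workers are simply skipped in this sketch, so a node that comes online later would need the command sent again.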
| Worker | Host | Requests | Prompt Tokens | Completion Tokens |
|---|---|---|---|---|
Click Refresh to load stats.