About the REST API
The NVIDIA Run:ai Multi-tenant REST APIs provide a programmatic interface for managing and operating multi-tenant NVIDIA Run:ai deployments.
These APIs are used by platform administrators to manage shared control plane resources across tenants.
You can use these APIs to:
Manage tenants and tenant-level configuration
Register and manage clusters
Configure platform-level settings
Monitor and audit multi-tenant operational activity
All API requests are authenticated using access tokens. For details on authentication and access setup, see API access setup.
API Versions
The NVIDIA Run:ai Multi-tenant REST APIs are release-based and are available only for self-hosted deployments.
Each API version aligns with a specific NVIDIA Run:ai product release (for example, 2.22, 2.23, 2.24).
API behavior is fixed for the lifetime of the release
No new features are added after the release is published
Changes are introduced only in newer released versions
Using a versioned API ensures stability and predictability for platform administrators operating multi-tenant environments.
API Conventions
he NVIDIA Run:ai Multi-tenant REST APIs follow RESTful design principles and use JSON for request and response payloads. See Making REST API requests for more details:
Authentication is provided via bearer tokens in the
AuthorizationheaderRequests use standard HTTP methods (
GET,POST,PUT,PATCH,DELETE)Request and response bodies are formatted as JSON
Standard HTTP status codes are used to indicate request success or failure (
2xx,4xx,5xx)
API Lifecycle and Deprecation
NVIDIA Run:ai is committed to providing stable and predictable Multi-tenant APIs. When an API or field is deprecated, the deprecated version remains supported for two additional released versions. After the deprecation period ends, deprecated APIs or fields are removed from both the codebase and the documentation. For full details, see NVIDIA Run:ai API Policy.
Last updated