Workloads

Workloads are trainings, workspaces, and inferences that are fully controlled by NVIDIA Run:ai. Workloads can be native, third party integrations, and typical Kubernetes workload types. For more information, see Workloads overviewarrow-up-right.

List workloads.

get

Retrieve a list of active workloads with details.

Authorizations
AuthorizationstringRequired

Bearer authentication

Query parameters
deletedbooleanOptional

Return only deleted resources when true.

offsetinteger · int32Optional

The offset of the first item returned in the collection.

Example: 100
limitinteger · int32 · min: 1 · max: 500Optional

The maximum number of entries to return.

Default: 50
sortOrderstring · enumOptional

Sort results in descending or ascending order.

Default: ascPossible values:
sortBystring · enumOptional

Sort results by a parameter.

Possible values:
filterBystring[]Optional

Filter results by a parameter. Use the format field-name operator value. Operators are == Equals, != Not equals, <= Less than or equal, >= Greater than or equal, =@ contains, !@ Does not contain, =^ Starts with and =$ Ends with. Dates are in ISO 8601 timestamp format and available for operators ==, !=, <= and >=.

Example: ["name!=some-workload-name","allocatedGPU>=2","createdAt>=2021-01-01T00:00:00Z"]
searchstringOptional

Filter results by a free text search.

Example: test project
Responses
chevron-right
200

Executed successfully.

application/json
get
/api/v1/workloads

List workload categories.

get

Retrieves a list of workload categories. These categories are used to classify and monitor different types of workloads within the NVIDIA Run:ai platform.

Authorizations
AuthorizationstringRequired

Bearer authentication

Responses
chevron-right
200

List of categories retrieved successfully.

application/json
get
/api/v1/workload-categories

Get workload category by id.

get

Retrieves a specific workload category by its ID. Workload categories are used to classify and monitor different types of workloads within the NVIDIA Run:ai platform.

Authorizations
AuthorizationstringRequired

Bearer authentication

Path parameters
categoryIdstring · uuidRequired

The unique identifier of the workload category.

Responses
chevron-right
200

Category retrieved successfully

application/json
get
/api/v1/workload-categories/{categoryId}

List workload types.

get

Retrieves a list of workload types with their configurations - their corresponding workload categories and priorities.

Authorizations
AuthorizationstringRequired

Bearer authentication

Query parameters
externalTypesOnlybooleanOptional

Return only external workload types.

Default: false
Responses
chevron-right
200

List of types retrieved successfully

application/json
get
/api/v1/workload-types

List workload type by id.

get

Retrieves a specific workload type by its ID.

Authorizations
AuthorizationstringRequired

Bearer authentication

Path parameters
workloadTypeIdstring · uuidRequired

The unique identifier of the workload type.

Responses
chevron-right
200

Type retrieved successfully

application/json
get
/api/v1/workload-types/{workloadTypeId}

Update a workload type by id.

put

Update the default category or priority assigned to a workload type.

Authorizations
AuthorizationstringRequired

Bearer authentication

Path parameters
workloadTypeIdstring · uuidRequired

The unique identifier of the workload type to update.

Body
categoryIdstring · uuidOptional

The unique identifier of the workload category.

priorityIdstring · uuidOptional

The unique identifier of the workload priority.

Responses
chevron-right
200

Updated successfully

application/json
put
/api/v1/workload-types/{workloadTypeId}

Get a workload.

get

Retrieve workload data using a workloadId.

Authorizations
AuthorizationstringRequired

Bearer authentication

Path parameters
workloadIdstring · uuidRequired

The Universally Unique Identifier (UUID) of the workload.

Responses
chevron-right
200

Executed successfully.

application/json
get
/api/v1/workloads/{workloadId}

Count workloads.

get

Retrieve the number of workloads.

Authorizations
AuthorizationstringRequired

Bearer authentication

Query parameters
deletedbooleanOptional

Return only deleted resources when true.

filterBystring[]Optional

Filter results by a parameter. Use the format field-name operator value. Operators are == Equals, != Not equals, <= Less than or equal, >= Greater than or equal, =@ contains, !@ Does not contain, =^ Starts with and =$ Ends with. Dates are in ISO 8601 timestamp format and available for operators ==, !=, <= and >=.

Example: ["name!=some-workload-name","allocatedGPU>=2","createdAt>=2021-01-01T00:00:00Z"]
searchstringOptional

Filter results by a free text search.

Example: test project
Responses
chevron-right
200

Executed successfully.

application/json
get
/api/v1/workloads/count

Get the workloads telemetry.

get

Retrieves workload data by telemetry type.

Authorizations
AuthorizationstringRequired

Bearer authentication

Query parameters
clusterIdstring · uuidOptional

Filter using the Universally Unique Identifier (UUID) of the cluster.

Example: d73a738f-fab3-430a-8fa3-5241493d7128
nodepoolNamestringOptional

Filter using the nodepool.

Example: default
departmentIdstringOptional

Filter using the department id.

Example: 1
telemetryTypestring · enumRequired

Select a telemetry type.

Possible values:
Responses
chevron-right
200

Executed successfully.

get
/api/v1/workloads/telemetry

Get workload metrics data.

get

Retrieves workloads data metrics from the metrics database. Use in reporting and analysis tools.

Authorizations
AuthorizationstringRequired

Bearer authentication

Path parameters
workloadIdstring · uuidRequired

The Universally Unique Identifier (UUID) of the workload.

Query parameters
startstring · date-timeRequired

Start date of time range to fetch data in ISO 8601 timestamp format.

Example: 2023-06-06T12:09:18.211Z
endstring · date-timeRequired

End date of time range to fetch data in ISO 8601 timestamp format.

Example: 2023-06-07T12:09:18.211Z
numberOfSamplesinteger · max: 1000Optional

The number of samples to take in the specified time range.

Default: 20Example: 20
Responses
chevron-right
200

Executed successfully.

get
/api/v1/workloads/{workloadId}/metrics

Last updated