Workloads

List workloads.

Retrieve a list of active workloads with details.

Authorizations
Authorization · string · Required

Bearer authentication

Query parameters
deleted · boolean · Optional

Return only deleted resources when true.

offset · integer · int32 · Optional

The offset of the first item returned in the collection.

Example: 100

limit · integer · int32 · min: 1 · max: 500 · Optional

The maximum number of entries to return.

Default: 50

sortOrder · string · enum · Optional

Sort results in descending or ascending order.

Default: asc
Possible values:

sortBy · string · enum · Optional

Sort results by a parameter.

Possible values:

filterBy · string[] · Optional

Filter results by a parameter. Use the format field-name operator value. Supported operators: == (equals), != (not equals), <= (less than or equal), >= (greater than or equal), =@ (contains), !@ (does not contain), =^ (starts with), and =$ (ends with). Dates use the ISO 8601 timestamp format and support only the ==, !=, <=, and >= operators.

Example: ["name!=some-workload-name","allocatedGPU>=2","createdAt>=2021-01-01T00:00:00Z"]

search · string · Optional

Filter results by a free text search.

Example: test project
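
The filterBy format described above can be assembled programmatically. The following is a minimal sketch, not part of the API: the helper names filter_expr and filter_query are hypothetical, and sending the string[] value as one comma-separated parameter is an assumption about how the server expects arrays to be serialized.

```python
from typing import Iterable
from urllib.parse import urlencode

# Operators named in the filterBy description.
OPERATORS = ("==", "!=", "<=", ">=", "=@", "!@", "=^", "=$")
# Operators the description lists as available for ISO 8601 dates.
DATE_OPERATORS = ("==", "!=", "<=", ">=")


def filter_expr(field: str, operator: str, value: object, is_date: bool = False) -> str:
    """Build one filterBy entry in the form field-name operator value."""
    if operator not in OPERATORS:
        raise ValueError(f"unsupported operator: {operator!r}")
    if is_date and operator not in DATE_OPERATORS:
        raise ValueError(f"operator {operator!r} is not available for dates")
    return f"{field}{operator}{value}"


def filter_query(filters: Iterable[str]) -> str:
    """Percent-encode the filters as a single filterBy query parameter.

    Assumption: the string[] value is sent comma-separated. If the server
    expects a repeated filterBy parameter instead, use
    urlencode({"filterBy": list(filters)}, doseq=True).
    """
    return urlencode({"filterBy": ",".join(filters)})
```

Percent-encoding matters here because the operators contain characters such as = and > that are not safe in a raw query string.
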
Responses
200

Executed successfully.

application/json
next · integer · Optional

Example: 1

GET /api/v1/workloads
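
As an illustration of how the parameters of this endpoint fit together, the sketch below builds an authenticated request using only the Python standard library. The function names, the base URL, and the token are placeholders rather than part of the API; the limit bounds and default come from the parameter table.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def list_workloads_request(
    base_url: str,
    token: str,
    offset: int = 0,
    limit: int = 50,
    search: Optional[str] = None,
) -> Request:
    """Build a GET request for /api/v1/workloads (nothing is sent yet)."""
    if not 1 <= limit <= 500:
        raise ValueError("limit must be between 1 and 500")
    params = {"offset": offset, "limit": limit}
    if search is not None:
        params["search"] = search
    url = f"{base_url.rstrip('/')}/api/v1/workloads?{urlencode(params)}"
    # Bearer authentication, as required under Authorizations.
    return Request(url, headers={"Authorization": f"Bearer {token}"}, method="GET")


def fetch_json(request: Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urlopen(request, timeout=30) as response:
        return json.load(response)
```

A call such as fetch_json(list_workloads_request("https://example.invalid", token, offset=100)) would then return the decoded response, where example.invalid stands in for the real host.
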

Get a workload.

Retrieve workload data using a workloadId.

Authorizations
Authorization · string · Required

Bearer authentication

Path parameters
workloadId · string · uuid · Required

The Universally Unique Identifier (UUID) of the workload.

Responses
200

Executed successfully.

application/json
tenantId · integer · int32 · Required

The id of the tenant.

Example: 1001

runningPods · integer · int32 · Required
Example: 1

phaseUpdatedAt · string · date-time · Required
Example: 2022-06-08T11:28:24.131Z

k8sPhaseUpdatedAt · string · date-time · Required
Example: 2022-06-08T11:28:24.131Z

updatedAt · string · date-time · Required
Example: 2022-06-08T11:28:24.131Z

source · string · enum · Required
Possible values:

deletedAt · string · date-time · nullable · Required
Example: 2022-08-12T19:28:24.131Z

type · string · Required
Example: runai-job

name · string · Required
Example: very-important-job

id · string · uuid · Required

priority · integer · int32 · nullable · Required
Example: 50

priorityClassName · string · Required
Example: high-priority

submittedBy · string · Optional
Example: [email protected]

clusterId · string · uuid · Required

The id of the cluster.

Example: 71f69d83-ba66-4822-adf5-55ce55efd210

projectName · string · Required
Example: proj-1

projectId · string · Required
Example: 1

departmentName · string · Required
Example: department-1

departmentId · string · Required
Example: 1

namespace · string · Required
Example: runai-proj-1

createdAt · string · date-time · Required
Example: 2022-01-01T03:49:52.531Z

phase · string · enum · Required
Possible values:

phaseMessage · string · Optional
Example: Not enough resources in the requested nodepool

k8sPhase · string · Required
Example: Pending

requestedNodePools · string[] · Optional
Example: default

currentNodePools · string[] · Optional
Example: default

completedAt · string · date-time · nullable · Optional
Example: 2022-01-01T03:49:52.531Z

images · string[] · Optional
Example: alpine:latest

urls · string[] · Optional

distributedFramework · string · Optional
Example: Pytorch

preemptible · boolean · nullable · Optional

command · string · Optional
Example: sleep

arguments · string · Optional
Example: 1000

phaseReason · string · enum · Optional
Possible values:

idleGpus · integer · nullable · Optional · Deprecated

Deprecated. Use idleAllocatedGpus instead.

Example: 3

idleAllocatedGpus · number · nullable · Optional

Sum of the idle allocated GPUs in the workload.

Example: 1

totalPendingTimeSeconds · integer · nullable · Optional

The total cumulative time, in seconds, that the workload has spent in the Pending phase since submission.

Example: 60

totalRunningTimeSeconds · integer · nullable · Optional

The total cumulative time, in seconds, that the workload has spent in the Running phase since submission.

Example: 60

category · string · Required

The category of the workload.

Example: Train

guaranteedRuntimeEndsAt · string · date-time · nullable · Optional

A timestamp indicating when the workload will reach its minimum guaranteed runtime, as defined by minGuaranteedRuntime. Until this time, the workload is considered non-preemptible and cannot be interrupted by higher-priority workloads.

Example: 2025-08-01T03:49:52.531Z

aiApplicationId · string · Optional

aiApplicationName · string · Optional

sourceApi · string · enum · nullable · Optional
Possible values:

GET /api/v1/workloads/{workloadId}
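
Several response fields are nullable or optional, and timestamps arrive as date-time strings, so client code usually needs a small amount of normalization. The sketch below shows one possible approach; the helper names and the shape of the summary dictionary are hypothetical, and only fields listed in the response above are read.

```python
from datetime import datetime
from typing import Optional
from uuid import UUID


def workload_url(base_url: str, workload_id: str) -> str:
    """Build the URL for one workload; raises ValueError for a malformed UUID."""
    return f"{base_url.rstrip('/')}/api/v1/workloads/{UUID(workload_id)}"


def parse_timestamp(value: Optional[str]) -> Optional[datetime]:
    """Parse a date-time such as 2022-06-08T11:28:24.131Z; None stays None."""
    if value is None:
        return None
    # Older Python versions reject a trailing "Z" in fromisoformat().
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def summarize(workload: dict) -> dict:
    """Reduce a workload response to a few fields, tolerating nullable ones."""
    created = parse_timestamp(workload["createdAt"])
    completed = parse_timestamp(workload.get("completedAt"))
    # Prefer idleAllocatedGpus; fall back to the deprecated idleGpus field.
    idle = workload.get("idleAllocatedGpus")
    if idle is None:
        idle = workload.get("idleGpus")
    return {
        "name": workload["name"],
        "phase": workload["phase"],
        "runningPods": workload["runningPods"],
        "idleGpus": idle,
        "durationSeconds": (completed - created).total_seconds() if completed else None,
    }
```
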

Count workloads.

Retrieve the number of workloads.

Authorizations
Authorization · string · Required

Bearer authentication

Query parameters
deleted · boolean · Optional

Return only deleted resources when true.

filterBy · string[] · Optional

Filter results by a parameter. Use the format field-name operator value. Supported operators: == (equals), != (not equals), <= (less than or equal), >= (greater than or equal), =@ (contains), !@ (does not contain), =^ (starts with), and =$ (ends with). Dates use the ISO 8601 timestamp format and support only the ==, !=, <=, and >= operators.

Example: ["name!=some-workload-name","allocatedGPU>=2","createdAt>=2021-01-01T00:00:00Z"]

search · string · Optional

Filter results by a free text search.

Example: test project
Responses
200

Executed successfully.

application/json
count · integer · int64 · Required

Example: 1

GET /api/v1/workloads/count
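
Because this endpoint accepts the same deleted, filterBy, and search parameters as the list endpoint, the returned count can be used to plan a paginated listing. The sketch below computes the offset values to request; page_offsets is a hypothetical helper, and it assumes the same filters are passed to both endpoints and that the count does not change between calls.

```python
import math
from typing import List


def page_offsets(count: int, limit: int = 50) -> List[int]:
    """Return the offset of each page needed to read `count` workloads."""
    if count < 0:
        raise ValueError("count must not be negative")
    # The list endpoint documents limit as min 1, max 500, default 50.
    if not 1 <= limit <= 500:
        raise ValueError("limit must be between 1 and 500")
    pages = math.ceil(count / limit)
    return [page * limit for page in range(pages)]
```

Each returned value would be passed as offset, together with the same limit, to GET /api/v1/workloads.
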

Get the workloads telemetry.

Retrieves workload data by telemetry type. Optionally filter by specific workload phases.

Telemetry types:

  • WORKLOADS_COUNT: Count of workloads.

  • GPU_ALLOCATION: GPU allocation metrics.

  • PENDING_TIME_DISTRIBUTION: Distribution of workloads by current pending time (the time since entering the Pending phase). For this type, results are automatically grouped by 4 time buckets and by any additional groupBy parameters.

Authorizations
Authorization · string · Required

Bearer authentication

Query parameters
clusterId · string · uuid · Optional

Filter using the Universally Unique Identifier (UUID) of the cluster.

Example: d73a738f-fab3-430a-8fa3-5241493d7128

nodepoolName · string · Optional

Filter using the nodepool.

Example: default

departmentId · string · Optional

Filter using the department id.

Example: 1

telemetryType · string · enum · Required

Select a telemetry type.

Possible values:
Responses
200

Executed successfully.

type · string · Required

Specifies which data is returned.

Example: ALLOCATION_RATIO

timestamp · string · date-time · Required

Example: 2023-06-06 12:09:18.211

GET /api/v1/workloads/telemetry
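
A telemetry request combines the required telemetryType with any of the optional filters. The sketch below builds such a URL; telemetry_url is a hypothetical helper, and the accepted values are taken from the three telemetry types named in the description, which may not be the complete enum.

```python
from typing import Optional
from urllib.parse import urlencode

# Telemetry types named in the endpoint description (possibly not exhaustive).
TELEMETRY_TYPES = ("WORKLOADS_COUNT", "GPU_ALLOCATION", "PENDING_TIME_DISTRIBUTION")


def telemetry_url(
    base_url: str,
    telemetry_type: str,
    cluster_id: Optional[str] = None,
    nodepool_name: Optional[str] = None,
    department_id: Optional[str] = None,
) -> str:
    """Build the URL for GET /api/v1/workloads/telemetry."""
    if telemetry_type not in TELEMETRY_TYPES:
        raise ValueError(f"unknown telemetry type: {telemetry_type!r}")
    params = {"telemetryType": telemetry_type}
    optional = {
        "clusterId": cluster_id,
        "nodepoolName": nodepool_name,
        "departmentId": department_id,
    }
    # Only send the filters that were actually provided.
    params.update({key: value for key, value in optional.items() if value is not None})
    return f"{base_url.rstrip('/')}/api/v1/workloads/telemetry?{urlencode(params)}"
```
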

Get workload metrics data.

Retrieves metrics data for a workload from the metrics database. Intended for use in reporting and analysis tools.

Authorizations
Authorization · string · Required

Bearer authentication

Path parameters
workloadId · string · uuid · Required

The Universally Unique Identifier (UUID) of the workload.

Query parameters
start · string · date-time · Required

Start date of time range to fetch data in ISO 8601 timestamp format.

Example: 2023-06-06T12:09:18.211Z

end · string · date-time · Required

End date of time range to fetch data in ISO 8601 timestamp format.

Example: 2023-06-07T12:09:18.211Z

numberOfSamples · integer · max: 1000 · Optional

The number of samples to take in the specified time range.

Default: 20
Example: 20
Responses
200

Executed successfully.

GET /api/v1/workloads/{workloadId}/metrics
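
The start and end parameters must be ISO 8601 timestamps, which is easy to get wrong when working with local times. The sketch below formats timezone-aware datetimes in the style of the examples above and builds the request URL. The helper names are hypothetical, only the parameters listed in this section are sent, and the lower bound of 1 for numberOfSamples is an assumption since only the maximum is documented.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode
from uuid import UUID


def iso8601(moment: datetime) -> str:
    """Format an aware datetime as UTC, e.g. 2023-06-06T12:09:18.211Z."""
    if moment.tzinfo is None:
        raise ValueError("a timezone-aware datetime is required")
    utc = moment.astimezone(timezone.utc)
    return utc.isoformat(timespec="milliseconds").replace("+00:00", "Z")


def metrics_url(
    base_url: str,
    workload_id: str,
    start: datetime,
    end: datetime,
    number_of_samples: int = 20,
) -> str:
    """Build the URL for GET /api/v1/workloads/{workloadId}/metrics."""
    params = {"start": iso8601(start), "end": iso8601(end)}
    if end <= start:
        raise ValueError("end must be later than start")
    # Documented maximum is 1000; the minimum of 1 is an assumption.
    if not 1 <= number_of_samples <= 1000:
        raise ValueError("numberOfSamples must be between 1 and 1000")
    params["numberOfSamples"] = number_of_samples
    path = f"/api/v1/workloads/{UUID(workload_id)}/metrics"
    return f"{base_url.rstrip('/')}{path}?{urlencode(params)}"
```
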
