The workload priority management feature lets you change the priority of a workload within a project. The priority determines the workload's position in the project's scheduling queue, which is managed by the NVIDIA Run:ai Scheduler. Raising the priority increases the likelihood that the workload is scheduled ahead of others in the same project, so that critical tasks receive resources first.
You can change the priority of a workload by selecting one of the predefined values from the NVIDIA Run:ai priority dictionary. This can be done using the NVIDIA Run:ai UI, API or CLI, depending on the workload type.
Workload priority is defined by selecting a string name from a predefined list in the NVIDIA Run:ai priority dictionary. Each string corresponds to a specific Kubernetes PriorityClass, which in turn determines scheduling behavior, such as whether the workload is preemptible or allowed to run over quota.
Non-preemptible workloads must run within the project’s deserved quota, cannot use over-quota resources, and will not be interrupted once scheduled.
Preemptible workloads can use opportunistic compute resources beyond the project’s quota but may be interrupted at any time.
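As an illustration of the two behaviors above, the following minimal sketch maps each name in the priority dictionary to its preemption and over-quota behavior. The function name `priority_traits` and its output format are hypothetical and not part of any NVIDIA Run:ai tool; only the four priority names and their properties are taken from this page.

```shell
#!/bin/sh
# Hypothetical helper (not part of the runai CLI): print the scheduling
# behavior associated with a priority name from the priority dictionary.
priority_traits() {
  case "$1" in
    inference|build)
      # Must stay within the project's deserved quota; not interrupted once scheduled.
      echo "non-preemptible over-quota-not-available" ;;
    interactive-preemptible|train)
      # May use resources beyond the quota, but can be interrupted at any time.
      echo "preemptible over-quota-available" ;;
    *)
      echo "unknown priority: $1" >&2
      return 1 ;;
  esac
}

priority_traits train   # prints: preemptible over-quota-available
priority_traits build   # prints: non-preemptible over-quota-not-available
```

A lookup like this can help decide, before submission, whether a workload may be interrupted under the chosen priority.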
Both NVIDIA Run:ai and third-party workloads are assigned a default priority. The following table shows the default priority per workload type:
The following table shows the default priority listed in the previous section and the supported override options per workload:
You can override the default priority when submitting a workload through the UI, API, or CLI depending on the workload type.
To use the override options:
UI: Enable "Allow the workload to exceed the project quota" when
API: Set PriorityClass in the
CLI: Use the --priority flag
To use the override options:
API: Set PriorityClass in the
CLI: Use the --priority flag
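The NVIDIA Run:ai priority dictionary contains the following values:

| Priority level | Name | Preemption | Over quota |
|---|---|---|---|
| 1 | inference | Non-preemptible | Not available |
| 2 | build | Non-preemptible | Not available |
| 3 | interactive-preemptible | Preemptible | Available |
| 4 | train | Preemptible | Available |

Remaining priority values from the workload tables (workload type names not recoverable): build, train, inference, train, inference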
runai workspace submit --priority priority-class
runai training submit --priority priority-class
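To make the two submit commands above concrete, the following sketch assembles a command line with a validated `--priority` value and prints it instead of running it. The helper `build_submit_cmd` is hypothetical, and passing the workload name as a positional argument is an assumption; other flags a real submission needs are omitted. The sketch only checks that the name exists in the priority dictionary, not whether a given workload type accepts that override.

```shell
#!/bin/sh
# Hypothetical helper (not part of the runai CLI): print, but do not run,
# a submit command whose --priority value has been checked first.
build_submit_cmd() {
  workload_type=$1   # "workspace" or "training", the two subcommands shown above
  priority=$2        # a name from the NVIDIA Run:ai priority dictionary
  name=$3            # workload name (assumed to be a positional argument)

  case "$workload_type" in
    workspace|training) ;;
    *) echo "unsupported workload type: $workload_type" >&2; return 1 ;;
  esac

  case "$priority" in
    inference|build|interactive-preemptible|train) ;;
    *) echo "unknown priority: $priority" >&2; return 1 ;;
  esac

  echo "runai $workload_type submit $name --priority $priority"
}

build_submit_cmd workspace interactive-preemptible my-workspace
build_submit_cmd training build my-training
```

Printing the command rather than executing it keeps the sketch safe to run on a machine without cluster access; the output can be reviewed and then run manually.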