Workload priority class control

The workload priority class management feature allows you to change the priority class of a workload within a project. The priority class determines the workload's position in the project scheduling queue managed by the NVIDIA Run:ai Scheduler. By adjusting the priority class, you can increase the likelihood that a workload will be scheduled and preferred over others within the same project, ensuring that critical tasks are given higher priority and resources are allocated efficiently.

You can change the priority class of a workload by selecting one of the predefined values from the NVIDIA Run:ai priority dictionary. This can be done using the NVIDIA Run:ai UI, API or CLI, depending on the workload type.

Note

This applies only within a single project. It does not impact the scheduling queues or workloads of other projects.

Priority class

Priority class is represented as a string value (Name) and must be selected from a predefined, closed list of string values defined by NVIDIA Run:ai priority dictionary. These string values correspond to the workload priorities:

  • Priority - Integer value ranging from 1 to 4 (1 being the highest)

  • Name - A descriptive name of the priority class (string value)

Priority dictionary

The following table lists the string values (Name) that must be selected from the predefined set in the NVIDIA Run:ai priority dictionary.

Note

While the table below includes fields such as priority level, preemption, and over-quota status, these are descriptive only. The list of valid priority class values is limited to the string names.

Priority
Name (string)
Preemption Flag
Over Quota

1

inference

Non-preemptible

Not available

2

build

Non-preemptible

Not available

3

interactive-preemptible

Preemptible

Available

4

train

Preemptible

Available

Preemptible vs non-preemptible workloads

  • Non-preemptible workloads must run within the project’s deserved quota, cannot use over-quota resources, and will not be interrupted once scheduled.

  • Preemptible workloads can use opportunistic compute resources beyond the project’s quota but may be interrupted at any time.

Default priority class per workload

Both NVIDIA Run:ai-native workloads and third-party workloads are assigned a default priority class. The below table shows the default PriorityClass per workload type:

Workload Type
Default PriorityClass

Supported priority overrides per workload

Note

Changing a workload’s priority class may impact its ability to be scheduled. For example, switching a workload from a train priority class (which allows over-quota usage) to build priority class (which requires in-quota resources) may reduce its chances of being scheduled in cases where the required quota is unavailable.

The below table shows the default PriorityClass listed in the previous section and the supported override options per workload:

Workload Type
interactive-preemptible
build
train
inference

How to override priority class

You can override the default priority class when submitting a workload through the UI, API, or CLI depending on the workload type.

Workspaces

To use the override options:

Training workloads

To use the override options:

  • API: Set PriorityClass in the Trainings API

  • CLI: Submit training using the --priority flag

    runai training submit --priority priority-class

Last updated