# Workload Priority Control

The workload priority management feature allows you to change the priority of a workload within a project. The priority determines the workload's position in the project scheduling queue managed by the NVIDIA Run:ai [Scheduler](https://run-ai-docs.nvidia.com/self-hosted/2.21/platform-management/runai-scheduler/scheduling/how-the-scheduler-works). By raising a workload's priority, you increase the likelihood that it is scheduled ahead of other workloads in the same project, so critical tasks receive resources first.

You can change the priority of a workload by selecting one of the predefined values from the NVIDIA Run:ai priority dictionary. Depending on the workload type, this can be done using the NVIDIA Run:ai UI, API, or CLI.

{% hint style="info" %}
**Note**

This applies only within a single project. It does not impact the scheduling queues or workloads of other projects.
{% endhint %}

## Priority Dictionary

Workload priority is defined by selecting a **string name** from a predefined list in the NVIDIA Run:ai priority dictionary. Each string corresponds to a specific Kubernetes [PriorityClass](https://run-ai-docs.nvidia.com/self-hosted/2.21/platform-management/runai-scheduler/concepts-and-principles#priority-and-preemption), which in turn determines scheduling behavior, such as whether the workload is preemptible or allowed to run over quota.

{% hint style="info" %}
**Note**

The numeric priority levels (1 = highest, 4 = lowest) are descriptive only and are **not** part of the NVIDIA Run:ai priority dictionary.
{% endhint %}

<table><thead><tr><th width="92.68359375">Priority Level</th><th>Name (string)</th><th>Preemption</th><th>Over Quota</th></tr></thead><tbody><tr><td>1</td><td><code>inference</code></td><td>Non-preemptible</td><td>Not available</td></tr><tr><td>2</td><td><code>build</code></td><td>Non-preemptible</td><td>Not available</td></tr><tr><td>3</td><td><code>interactive-preemptible</code></td><td>Preemptible</td><td>Available</td></tr><tr><td>4</td><td><code>train</code></td><td>Preemptible</td><td>Available</td></tr></tbody></table>

### Preemptible vs Non-Preemptible Workloads

* **Non-preemptible workloads** must run within the project’s deserved quota, cannot use over-quota resources, and will not be interrupted once scheduled.
* **Preemptible workloads** can use opportunistic compute resources beyond the project’s quota but may be interrupted at any time.
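The dictionary and the two behaviors above can be captured in a small lookup. The sketch below is illustrative only; the `PRIORITY_CLASSES` mapping and `is_preemptible` helper are not part of any Run:ai API, and only the class names and their behavior come from the table above:

```python
# Priority dictionary, mirroring the table above:
# name -> (preemptible, over-quota allowed). Illustrative only.
PRIORITY_CLASSES = {
    "inference": (False, False),
    "build": (False, False),
    "interactive-preemptible": (True, True),
    "train": (True, True),
}

def is_preemptible(priority: str) -> bool:
    """Return True if a workload with this priority may be interrupted."""
    try:
        preemptible, _ = PRIORITY_CLASSES[priority]
    except KeyError:
        raise ValueError(f"unknown priority class: {priority!r}")
    return preemptible
```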

## Default Priority per Workload

Both NVIDIA Run:ai and third-party workloads are assigned a default priority. The following table shows the default priority for each workload type:

<table><thead><tr><th width="217.69140625">Workload Type</th><th width="238.765625">Default Priority</th></tr></thead><tbody><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-workspaces/running-workspace">Workspaces</a></td><td><code>build</code></td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-training">Training</a></td><td><code>train</code></td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-inference">Inference</a></td><td><code>inference</code></td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/introduction-to-workloads">Third-party workloads</a></td><td><code>train</code></td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-inference/nvcf">NVIDIA Cloud Functions (NVCF)</a></td><td><code>inference</code></td></tr></tbody></table>
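For scripting against these defaults, the table can be expressed as a plain mapping. This is an illustrative sketch only; the keys are informal labels chosen for this example, not Run:ai API identifiers:

```python
# Default priority per workload type, mirroring the table above.
# Keys are informal labels for this example, not API identifiers.
DEFAULT_PRIORITY = {
    "workspace": "build",
    "training": "train",
    "inference": "inference",
    "third-party": "train",
    "nvcf": "inference",
}
```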

## Supported Priority Overrides per Workload

{% hint style="info" %}
**Note**

Changing a workload’s priority may impact its ability to be scheduled. For example, switching a workload from a `train` priority (which allows over-quota usage) to `build` priority (which requires in-quota resources) may reduce its chances of being scheduled in cases where the required quota is unavailable.
{% endhint %}

The following table shows, for each workload type, the supported priority values, including the default listed in the previous section:

<table><thead><tr><th width="137.11328125">Workload Type</th><th data-type="checkbox">interactive-preemptible</th><th data-type="checkbox">build</th><th data-type="checkbox">train</th><th data-type="checkbox">inference</th></tr></thead><tbody><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-workspaces/running-workspace">Workspaces</a></td><td>true</td><td>true</td><td>false</td><td>false</td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-training">Training</a></td><td>true</td><td>true</td><td>true</td><td>false</td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-inference">Inference</a></td><td>false</td><td>false</td><td>false</td><td>true</td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/introduction-to-workloads">Third-party workloads</a></td><td>true</td><td>true</td><td>true</td><td>false</td></tr><tr><td><a href="../../../workloads-in-nvidia-run-ai/using-inference/nvcf">NVIDIA Cloud Functions (NVCF)</a></td><td>false</td><td>false</td><td>false</td><td>true</td></tr></tbody></table>
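A client-side guard for the matrix above can be sketched as follows. This is illustrative only, using the same informal workload-type labels as above; the authoritative check is performed by the platform itself:

```python
# Supported priority values per workload type, mirroring the override
# matrix above. Illustrative only; keys are informal labels.
ALLOWED_PRIORITIES = {
    "workspace": {"interactive-preemptible", "build"},
    "training": {"interactive-preemptible", "build", "train"},
    "inference": {"inference"},
    "third-party": {"interactive-preemptible", "build", "train"},
    "nvcf": {"inference"},
}

def validate_priority(workload_type: str, priority: str) -> None:
    """Raise ValueError if the priority is not supported for the workload type."""
    allowed = ALLOWED_PRIORITIES.get(workload_type)
    if allowed is None:
        raise ValueError(f"unknown workload type: {workload_type!r}")
    if priority not in allowed:
        raise ValueError(
            f"{priority!r} is not supported for {workload_type!r}; "
            f"supported: {sorted(allowed)}"
        )
```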

## How to Override Priority

You can override the default priority when submitting a workload through the UI, API, or CLI depending on the workload type.

### Workspaces

To use the override options:

* **UI:** Enable **"Allow the workload to exceed the project quota"** when [submitting a workspace](https://run-ai-docs.nvidia.com/self-hosted/2.21/workloads-in-nvidia-run-ai/using-workspaces/running-workspace)
* **API:** Set `PriorityClass` in the [Workspaces](https://app.gitbook.com/s/b5QLzc5pV7wpXz3CDYyp/workloads/workspaces) API
* **CLI:** [Submit a workspace](https://run-ai-docs.nvidia.com/self-hosted/2.21/reference/cli/runai/runai_workspace_submit) using the `--priority` flag

  ```sh
  runai workspace submit <workspace-name> --priority interactive-preemptible
  ```
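If you submit through the Workspaces API instead, the priority travels in the request body. The sketch below only builds such a body; the field names (`priorityClass` under `spec`) and the other keys are assumptions based on the API reference linked above, so verify them against your cluster's API documentation:

```python
import json

# Hypothetical sketch of a Workspaces API request body that overrides the
# default priority. Field names and structure are assumptions; consult the
# Workspaces API reference for the authoritative schema.
def build_workspace_payload(name, project_id, cluster_id,
                            priority="interactive-preemptible"):
    return {
        "name": name,
        "projectId": project_id,
        "clusterId": cluster_id,
        "spec": {
            # One of the priority dictionary names, e.g. "build".
            "priorityClass": priority,
        },
    }

payload = build_workspace_payload("demo-ws", "proj-1", "cluster-1")
print(json.dumps(payload, indent=2))
```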

### Training Workloads

To use the override options:

* **API:** Set `PriorityClass` in the [Trainings](https://app.gitbook.com/s/b5QLzc5pV7wpXz3CDYyp/workloads/trainings) API
* **CLI:** [Submit training](https://run-ai-docs.nvidia.com/self-hosted/2.21/reference/cli/runai/runai_training_submit) using the `--priority` flag

  ```sh
  runai training submit <workload-name> --priority build
  ```
