> For the complete documentation index, see [llms.txt](https://run-ai-docs.nvidia.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://run-ai-docs.nvidia.com/self-hosted/2.23/platform-management/runai-scheduler/scheduling/workload-priority-control.md).

# Workload Priority Control

The workload priority management feature allows you to change the priority of a workload within a project. The priority determines the workload's position in the project scheduling queue managed by the NVIDIA Run:ai [Scheduler](/self-hosted/2.23/platform-management/runai-scheduler/scheduling/how-the-scheduler-works.md). Raising a workload's priority increases the likelihood that it is scheduled ahead of other workloads in the same project, so that critical tasks receive resources first. The workload's priority also affects whether it can consume over-quota resources and whether it is subject to preemption by higher-priority workloads.

You can change the priority of a workload by selecting one of the predefined values from the NVIDIA Run:ai priority dictionary.

{% hint style="info" %}
**Note**

Workload priority applies only within a single project. It does not impact the scheduling queues or workloads of other projects.
{% endhint %}

## Priority Dictionary

Workload priority is defined by selecting a priority from a predefined list in the NVIDIA Run:ai priority dictionary. Each priority name corresponds to a specific Kubernetes [PriorityClass](/self-hosted/2.23/platform-management/runai-scheduler/scheduling/concepts-and-principles.md#priority-and-preemption), which in turn determines scheduling behavior, such as whether the workload is preemptible or allowed to run over quota.

| Priority | Kubernetes Value | Preemption | Over Quota |
| --- | --- | --- | --- |
| `very-low` | 25 | Preemptible | Available |
| `low` | 40 | Preemptible | Available |
| `medium-low` | 65 | Preemptible | Available |
| `medium` | 80 | Preemptible | Available |
| `medium-high` | 90 | Preemptible | Available |
| `high` | 125 | Non-preemptible | Not available |
| `very-high` | 150 | Non-preemptible | Not available |

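As a minimal sketch, the priority dictionary above can be encoded as a lookup table to check what a given priority implies before submitting a workload. The names `PriorityInfo`, `PRIORITY_DICTIONARY`, `describe`, and `loses_over_quota` are hypothetical helpers for illustration and are not part of any NVIDIA Run:ai SDK; only the values come from the table.

```python
from typing import NamedTuple


class PriorityInfo(NamedTuple):
    kubernetes_value: int  # value of the matching Kubernetes PriorityClass
    preemptible: bool      # may be interrupted by higher-priority workloads
    over_quota: bool       # may use resources beyond the project's quota


# Values copied from the priority dictionary table above.
PRIORITY_DICTIONARY = {
    "very-low": PriorityInfo(25, True, True),
    "low": PriorityInfo(40, True, True),
    "medium-low": PriorityInfo(65, True, True),
    "medium": PriorityInfo(80, True, True),
    "medium-high": PriorityInfo(90, True, True),
    "high": PriorityInfo(125, False, False),
    "very-high": PriorityInfo(150, False, False),
}


def describe(priority: str) -> str:
    """Summarize the scheduling behavior implied by a priority name."""
    info = PRIORITY_DICTIONARY[priority]
    preemption = "preemptible" if info.preemptible else "non-preemptible"
    quota = "can run over quota" if info.over_quota else "must run within quota"
    return f"{priority} ({info.kubernetes_value}): {preemption}, {quota}"


def loses_over_quota(old: str, new: str) -> bool:
    """Return True when changing priority removes access to over-quota resources."""
    return PRIORITY_DICTIONARY[old].over_quota and not PRIORITY_DICTIONARY[new].over_quota
```

For example, `loses_over_quota("low", "high")` returns `True`, because `high` workloads must run within the project's quota, while `loses_over_quota("low", "medium")` returns `False`.
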
### Preemptible vs Non-Preemptible Workloads

* **Non-preemptible workloads** must run within the project’s deserved quota, cannot use over-quota resources, and will not be interrupted once scheduled.
* **Preemptible workloads** can use opportunistic compute resources beyond the project’s quota but may be interrupted at any time.

## Default Priority per Workload

NVIDIA Run:ai defines the following default mappings of workload types to priorities. To retrieve the default priority per workload type, refer to the [List workload types](https://run-ai-docs.nvidia.com/api/2.23/workloads/workload-properties#get-api-v1-workload-types) API.

{% hint style="info" %}
**Note**

* For more information on workload support, see [Introduction to workloads](/self-hosted/2.23/workloads-in-nvidia-run-ai/introduction-to-workloads.md).
* [Legacy priority values](/self-hosted/2.21/platform-management/runai-scheduler/scheduling/workload-priority-control.md) remain available for backward compatibility.
* Changing the priority is not supported for NVCF workloads.
{% endhint %}

| | Workload Types | Default Priority |
| --- | --- | --- |
| NVIDIA Run:ai native workloads | Workspaces, Standard training, Distributed training, Custom inference, Hugging Face inference, NVIDIA NIM inference | Workspaces = `high`<br>Training = `low`<br>Inference = `very-high` |
| NVIDIA | NIM services, NVIDIA Cloud Functions (NVCF) | `very-high` |
| Kubernetes | Deployment, StatefulSet, ReplicaSet, Pod, Service, CronJob, Job, JobSet | Deployment, StatefulSet, ReplicaSet, Pod, Service, CronJob = `very-high`<br>JobSet = `low`<br>Job = `high` |
| Kubeflow | TFJob, PyTorchJob, MPIJob, XGBoostJob, Notebook, ScheduledWorkflow | TFJob, PyTorchJob, MPIJob, XGBoostJob = `low`<br>Notebook = `high`<br>ScheduledWorkflow = `very-high` |
| Ray | RayService, RayCluster, RayJob | RayService = `very-high`<br>RayCluster, RayJob = `low` |
| Tekton | PipelineRun, TaskRun | PipelineRun = `very-high`<br>TaskRun = `high` |
| Additional Frameworks | SeldonDeployment, AMLJob, DevWorkspace, VirtualMachineInstance, KServe, Workflow | SeldonDeployment, KServe, Workflow = `very-high`<br>DevWorkspace = `high`<br>AMLJob, VirtualMachineInstance = `low` |

## Setting Priority During Workload Submission

{% hint style="info" %}
**Note**

Changing a workload’s priority may impact its ability to be scheduled. For example, switching a workload from `low` priority (which allows over-quota usage) to `high` priority (which requires in-quota resources) may reduce its chances of being scheduled when the required quota is unavailable.
{% endhint %}

* Set the priority when submitting NVIDIA Run:ai workloads via the UI, CLI, or API:
  * **UI** - Set workload priority under **General** settings (flexible submission only)
  * **API** - Set using the `PriorityClass` field
  * **CLI** - Set using the `--priority` flag
* Set the workload's priority by adding the following label under the `metadata.labels` section of your workload's YAML definition, using one of the values `very-low`, `low`, `medium-low`, `medium`, `medium-high`, `high`, or `very-high`:

  ```yaml
  metadata:
    labels:
      priorityClassName:  # set to one of the priority values listed above
  ```

## Updating the Default Priority Mapping

Administrators can change the default priority and preemptibility assigned to a workload type by updating the mapping using the [NVIDIA Run:ai API](https://run-ai-docs.nvidia.com/api/2.23/).

To update the priority mapping:

1. Retrieve the list of workload types and their IDs using `GET /api/v1/workload-types`.
2. Identify the `workloadTypeId` of the workload type you want to modify.
3. Retrieve the list of available priorities and their IDs using `GET /api/v1/workload-priorities`.
4. Send a request to update the workload type with the new priority using `PUT /api/v1/workload-types/{workloadTypeId}` and include the `priorityId` in the request body.

## Using API

Go to the [Workload priorities](https://run-ai-docs.nvidia.com/api/2.23/workloads/workload-properties#get-api-v1-workload-priorities) API reference to view the available actions.
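
The steps under Updating the Default Priority Mapping can be sketched in Python as follows. This is an illustration under stated assumptions, not the official client: `BASE_URL` and `TOKEN` are placeholders, bearer-token authentication is assumed, and the helper names are hypothetical. Only the three endpoints and the `priorityId` body field come from the steps above; the request body may need additional fields, and the response formats are not shown here, so check the API reference before relying on it.

```python
import json
import urllib.request
from typing import Optional

# Assumptions: replace with your own control-plane URL and API token.
BASE_URL = "https://runai.example.com"
TOKEN = "<api-token>"


def build_request(method: str, path: str, body: Optional[dict] = None) -> urllib.request.Request:
    """Build (but do not send) a JSON request with an assumed bearer token."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {"Authorization": f"Bearer {TOKEN}", "Accept": "application/json"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(BASE_URL + path, data=data, headers=headers, method=method)


def list_workload_types() -> urllib.request.Request:
    """Step 1: request the workload types and their IDs."""
    return build_request("GET", "/api/v1/workload-types")


def list_workload_priorities() -> urllib.request.Request:
    """Step 3: request the available priorities and their IDs."""
    return build_request("GET", "/api/v1/workload-priorities")


def update_workload_type_priority(workload_type_id: str, priority_id: str) -> urllib.request.Request:
    """Step 4: request a new default priority for one workload type."""
    path = f"/api/v1/workload-types/{workload_type_id}"
    return build_request("PUT", path, {"priorityId": priority_id})


def send(request: urllib.request.Request) -> str:
    """Send a prepared request and return the raw response text."""
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

Building the requests separately from sending them makes it possible to inspect the method, URL, and body before anything reaches the cluster. A typical flow would call `send(list_workload_types())` and `send(list_workload_priorities())`, pick the relevant IDs from the responses (step 2), and then call `send(update_workload_type_priority(workload_type_id, priority_id))`.
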