Hotfixes for Version 2.24

This section provides details on all hotfixes available for version 2.24. Hotfixes are critical updates released between our major and minor versions to address specific issues or vulnerabilities. These updates ensure the system remains secure, stable, and optimized without requiring a full version upgrade.

Version
Date (MM/DD/YYYY)
Internal ID
Description

2.24.75

04/30/2026

RUN-38467

Fixed a security vulnerability related to GHSA-pc3f-x583-g7j2 with severity HIGH.

2.24.75

04/30/2026

RUN-37894

Fixed a security vulnerability related to GHSA-xw7x-h9fj-p2c7 with severity CRITICAL.

2.24.75

04/30/2026

RUN-34648

Fixed an issue where an inference workload incorrectly displayed "Running" status when the pod was Pending after a scale-to-zero event.

2.24.74

04/27/2026

RUN-38510

Fixed a security vulnerability related to GHSA-9jj7-4m8r-rfcm with severity CRITICAL.

2.24.74

04/27/2026

RUN-38040

Fixed an issue where distributed workloads submitted with hostNetwork: true did not have dnsPolicy set to ClusterFirstWithHostNet.

2.24.73

04/27/2026

RUN-38569

Fixed an issue where users could not save project quota changes through the UI after upgrading to 2.24 when the over-quota priority feature was disabled.

2.24.73

04/27/2026

RUN-38503

Fixed a security vulnerability related to GHSA-rp42-5vxx-qpwr with severity HIGH.

2.24.73

04/27/2026

RUN-38474

Fixed a security vulnerability related to CVE-2026-27144 with severity HIGH.

2.24.73

04/27/2026

RUN-38430

Fixed an issue where workloads could not be submitted when the NVIDIA GPU Operator was deployed with the NRI plugin enabled.

2.24.73

04/27/2026

RUN-38428

Fixed a security vulnerability related to CVE-2026-40175 with severity CRITICAL.

2.24.73

04/27/2026

RUN-38405

Fixed an issue where uninstalling the cluster Helm chart on OpenShift failed because the runai-operator was missing permission to delete the runai-prometheus Secret in the openshift-monitoring namespace.

2.24.73

04/27/2026

RUN-38358

Fixed a security vulnerability related to CVE-2026-4424 with severity HIGH.

2.24.73

04/27/2026

RUN-38282

Fixed a security vulnerability related to CVE-2026-32283 with severity HIGH.

2.24.73

04/27/2026

RUN-38260

Fixed an issue where the Admin > Event history page returned a 500 error when filtering by Event ID.

2.24.73

04/27/2026

RUN-38240

Fixed a security vulnerability related to GHSA-6v2p-p543-phr9 with severity HIGH.

2.24.73

04/27/2026

RUN-38183

Fixed a security vulnerability related to CVE-2026-21710 with severity HIGH.

2.24.73

04/27/2026

RUN-38175

Fixed a security vulnerability related to CVE-2026-32280 with severity HIGH.

2.24.73

04/27/2026

RUN-38149

Fixed a security vulnerability related to CVE-2026-0994 with severity HIGH.

2.24.73

04/27/2026

RUN-38094

Fixed a security vulnerability related to CVE-2026-27654 with severity HIGH.

2.24.73

04/27/2026

RUN-38084

Fixed a security vulnerability related to GHSA-hfvc-g4fc-pqhx with severity HIGH.

2.24.73

04/27/2026

RUN-38072

Fixed an issue on OpenShift where duplicate ServiceMonitors caused PrometheusRuleFailures alerts.

2.24.73

04/27/2026

RUN-38058

Fixed an issue where inference requests exceeding 30 seconds returned a 502 Bad Gateway.

2.24.73

04/27/2026

RUN-38055

Fixed an issue where the access rules API accepted invalid subjectType values without returning a validation error.

2.24.73

04/27/2026

RUN-38029

Fixed an issue where the GET /api/v1/workloads/pods endpoint ignored project scope for user API tokens.

2.24.73

04/27/2026

RUN-37984

Fixed a security vulnerability related to GHSA-r5fr-rjxr-66jc with severity HIGH.

2.24.73

04/27/2026

RUN-37972

Fixed an issue where ingressClass was missing from the Minimal Cluster resource.

2.24.73

04/27/2026

RUN-37959

Fixed an issue where automatic topology constraints for distributed workloads were applied at the wrong topology level.

2.24.73

04/27/2026

RUN-37898

Fixed a security vulnerability related to GHSA-37ch-88jc-xwx2 with severity HIGH.

2.24.73

04/27/2026

RUN-37895

Fixed a security vulnerability related to GHSA-c2c7-rcm5-vvqj with severity HIGH.

2.24.73

04/27/2026

RUN-37701

Fixed a security vulnerability related to GHSA-p77j-4mvh-x3m3 with severity CRITICAL.

2.24.73

04/27/2026

RUN-37578

Fixed a security vulnerability related to GHSA-25h7-pfq9-p65f with severity HIGH.

2.24.73

04/27/2026

RUN-37532

Fixed an issue where workloads were slow to appear in the UI and API after being submitted.

2.24.73

04/27/2026

RUN-36615

Fixed a security vulnerability related to CVE-2024-12797 with severity HIGH.

2.24.73

04/27/2026

RUN-36456

Fixed an issue where enabling enableWorkloadOwnershipProtection prevented the Workload Overseer from enforcing project scheduling rules such as Idle GPU timeout and Workload Duration limits.

2.24.73

04/27/2026

RUN-35919

Fixed an issue where db-migrations failed during control plane upgrades in the org-unit-service.

2.24.73

04/27/2026

RUN-34639

Fixed an issue where the UI displayed impossible Free GPU values (allocated + free exceeding the node's total capacity) on nodes with fractional GPU allocations.

2.24.66

03/31/2026

RUN-37889

Fixed a security vulnerability related to GHSA-p77j-4mvh-x3m3 with severity HIGH.

2.24.65

03/25/2026

RUN-37702

Fixed a security vulnerability related to GHSA-p77j-4mvh-x3m3 with severity HIGH.

2.24.65

03/25/2026

RUN-36559

Fixed an issue where tenant-level policy permissions could not delete policies belonging to scopes that no longer exist.

2.24.61

03/19/2026

RUN-37611

Fixed an issue in the distributed workload submission form where a project policy with a locked rule on storage instances could result in a failure to submit the workload.

2.24.61

03/19/2026

RUN-37504

Fixed a security vulnerability related to CVE-2026-25679 with severity HIGH.

2.24.61

03/19/2026

RUN-37497

Fixed a security vulnerability related to CVE-2026-27142 with severity HIGH.

2.24.61

03/19/2026

RUN-37170

Fixed a security vulnerability related to GHSA-23c5-xmqv-rm74 with severity HIGH.

2.24.61

03/19/2026

RUN-37169

Fixed a security vulnerability related to GHSA-5rq4-664w-9x2c with severity HIGH.

2.24.58

03/11/2026

RUN-37341

Fixed a security vulnerability related to CVE-2025-61732 with severity HIGH.

2.24.58

03/11/2026

RUN-37174

Fixed a security vulnerability related to GHSA-72hv-8253-57qq with severity HIGH.

2.24.57

03/10/2026

RUN-36407

Fixed an issue where workspace workload submissions intermittently failed with a "Workload failed due to a Network issue" error.

2.24.56

03/09/2026

RUN-37278

Fixed a security vulnerability related to CVE-2024-1013 with severity HIGH.

2.24.56

03/09/2026

RUN-37167

Fixed a security vulnerability related to GHSA-72hv-8253-57qq with severity HIGH.

2.24.56

03/09/2026

RUN-36732

Fixed a security vulnerability related to GHSA-5vv4-hvf7-2h46 with severity HIGH.

2.24.54

03/05/2026

RUN-37515

Fixed a security vulnerability related to GHSA-9h8m-3fm2-qjrq with severity HIGH.

2.24.54

03/05/2026

RUN-36734

Fixed an issue where the Analytics table displayed incorrect GPU Compute Utilization values for Training and Interactive workloads.

2.24.54

03/05/2026

RUN-34564

Fixed an issue where updating a project with node type names in the payload did not return the node type names in the 200 success response.

2.24.52

02/26/2026

RUN-36370

Fixed an issue where NIM and HuggingFace inference templates failed to submit when a policy defined locked storage instances.

2.24.52

02/26/2026

RUN-36560

Fixed an issue where the Connect button did not open the workspace URL for workspaces submitted through YAML.

2.24.52

02/26/2026

RUN-37113

Fixed an issue where image strings that included a port number in the registry URL were not parsed correctly.

2.24.51

02/25/2026

RUN-37060

Fixed an issue where the NVLink total bytes per pod metric was labeled with GPU metrics labels instead of the expected pod labels.

2.24.51

02/25/2026

RUN-35612

Fixed a security vulnerability related to CVE-2025-64756 with severity HIGH.

2.24.51

02/25/2026

RUN-36443

Fixed an issue where the dashboard returned a 500 error instead of an informative error message.

2.24.49

02/17/2026

RUN-34472

Fixed an issue where the "Allocation ratio by node pool" widget in the Overview dashboard aggregated unlimited quotas together with other quotas, resulting in incorrect data.

2.24.49

02/17/2026

RUN-34624

Fixed an issue in Projects and Departments where GPU utilization/allocation metrics were not displayed if only partial data was available.

2.24.48

02/15/2026

RUN-35976

Fixed an issue where workloads submitted with names longer than 63 characters failed to schedule.

2.24.48

02/15/2026

RUN-35834

Fixed an issue where the AI practitioner role did not have read access to policies granted through workload submission permission sets (for example, workspaceEditAccess).

2.24.48

02/15/2026

RUN-36457

Fixed an issue where, on rare occasions, "Allocation ratio by node pool" widget would show incorrect data.

2.24.48

02/15/2026

RUN-35326

Fixed an issue where the Projects/Departments table in the Overview dashboard sometimes showed fewer than 15 projects/departments when their workloads did not have allocated GPUs or were not in Running or Pending status.

2.24.48

02/15/2026

RUN-34017

Fixed an issue where runai template list returned incorrect output when using --page-size and --max-items together.

2.24.48

02/15/2026

RUN-36505

Fixed an issue where, on rare occasions, there was a race condition in some of the metrics causing the average GPU utilization to be above 100%.

2.24.48

02/15/2026

RUn-36257

Fixed an issue in the flexible workload submission form where image pull secret section would present shared credentials instead of shared secrets resulting in a failure to submit the workload.

2.24.48

02/15/2026

RUn-36383

Fixed a security vulnerability related to GHSA-4f99-4q7p-p3gh with severity HIGH.

2.24.48

02/15/2026

RUN-36382

Fixed a security vulnerability related to GHSA-cv78-6m8q-ph82 with severity HIGH.

2.24.48

02/15/2026

RUN-36045

Fixed an issue where inference workload metrics were not being refreshed correctly.

2.24.48

02/15/2026

RUN-36254

Fixed an issue where a race condition during webhook certificate generation caused failures.

2.24.48

02/15/2026

RUN-36413

Fixed a security vulnerability related to CVE-2024-41110 with severity HIGH.

2.24.48

02/15/2026

RUN-36506

Fixed an issue where the UI shows the wrong GPU quotas for node pools associated with the “Default” department.

2.24.48

02/15/2026

RUN-36414

Fixed a security vulnerability related to CVE-2025-14459 and CVE-2025-64324 with severity HIGH.

2.24.48

02/15/2026

RUN-36381

Fixed a security vulnerability related to GHSA-jmp9-x22r-554x with severity HIGH.

2.24.48

02/15/2026

RUN-36122

Fixed an issue where credentials assets were not displayed in the Credentials table.

2.24.48

02/15/2026

RUN-36020

Fixed an issue where, when swap was enabled, the toolkit-reservation pod could enter an OutOfMemory state if the kubelet detected insufficient RAM at startup, and would not automatically recover once memory was freed.

2.24.48

02/15/2026

RUN-35511

Fixed an issue where an incorrect FQDN used during certificate generation caused errors.

2.24.48

02/15/2026

RUN-35443

Fixed a security vulnerability related to CVE-2025-68973 with severity HIGH.

2.24.48

02/15/2026

RUN-36010

Fixed an issue where navigating back to the root level in dashboard widgets caused the dashboard to crash.

2.24.48

02/15/2026

RUN-35620

Fixed an issue where providing an invalid admin password during installation caused the tenant to become permanently stuck.

2.24.17

01/25/2026

RUN-34593

Fixed an issue in the Overview dashboard where the Node pool filter did not work for the Idle workloads table.

2.24.17

01/25/2026

RUN-35148

Fixed an issue where charts in the Overview dashboard did not render data after the node pool filter was changed.

2.24.17

01/25/2026

RUN-35421

Fixed a security vulnerability related to CVE-2025-15284 with severity HIGH.

2.24.17

01/25/2026

RUN-35583

Fixed an issue where the template describe command did not display the master specification for distributed templates when the master and worker configurations differed.

2.24.17

01/25/2026

RUN-35594

Fixed an issue where the workload describe command did not display the master specification for distributed workloads.

2.24.17

01/25/2026

RUN-35637

Fixed an issue where, when CPU quota and Limit projects from exceeding department quota were both enabled, updating department or project memory quotas to very large values failed with incorrect validation errors, even though the values were valid.

2.24.17

01/25/2026

RUN-35922

Fixed a security vulnerability related to CVE-2026-0861 with severity HIGH.

Last updated