Preparations
The following section provides the information needed to prepare for a NVIDIA Run:ai installation.
Software Artifacts
The following software artifacts should be used when installing the control plane and cluster.
Kubernetes
Connected
You will receive a token from NVIDIA Run:ai to access the NVIDIA Run:ai container registry. Use the following command to create the required Kubernetes secret:
kubectl create secret docker-registry runai-reg-creds \
--docker-server=https://runai.jfrog.io \
--docker-username=self-hosted-image-puller-prod \
--docker-password=<TOKEN> \
[email protected] \
--namespace=runai-backendAir-gapped
You will receive a token from NVIDIA Run:ai to access the NVIDIA Run:ai air-gapped installation package. Use the following commands with the token provided by NVIDIA Run:ai to download and extract the package.
Download and Extract the Air-gapped Package
Run the following command to browse all available air-gapped packages:
curl -H "Authorization: Bearer <token>" "https://runai.jfrog.io/artifactory/api/storage/runai-airgapped-prod/?list"Run the following command to download the desired package:
curl -L -H "Authorization: Bearer <token>" -O "https://runai.jfrog.io/artifactory/runai-airgapped-prod/runai-airgapped-package-<VERSION>.tar.gz"SSH into a node with
kubectlaccess to the cluster and Docker installed.Extract the NVIDIA Run:ai package and replace
<VERSION>in the command below and run:tar xvf runai-airgapped-package-<VERSION>.tar.gz
Upload Images
NVIDIA Run:ai assumes the existence of a Docker registry within your organization for hosting container images. The installation requires the network address and port for this registry (referred to as <REGISTRY_URL>).
Upload images to a local Docker Registry. Set the Docker Registry address in the form of
NAME:PORT(do not addhttps):export REGISTRY_URL=<DOCKER REGISTRY ADDRESS>Run the following script. You must have at least 20GB of free disk space to run. If Docker is configured to run as non-root then
sudois not required:sudo ./setup.sh
The script should create a file named custom-env.yaml which will be used during control plane installation.
OpenShift
Connected
You will receive a token from NVIDIA Run:ai to access the NVIDIA Run:ai container registry. Use the following command to create the required Kubernetes secret:
oc create secret docker-registry runai-reg-creds \
--docker-server=https://runai.jfrog.io \
--docker-username=self-hosted-image-puller-prod \
--docker-password=<TOKEN> \
[email protected] \
--namespace=runai-backendAir-gapped
You will receive a token from NVIDIA Run:ai to access the NVIDIA Run:ai air-gapped installation package. Use the following commands with the token provided by NVIDIA Run:ai to download and extract the package.
Download and Extract the Air-gapped Package
Run the following command to browse all available air-gapped packages:
Run the following command to download the desired package:
SSH into a node with
ocaccess to the cluster and Docker installed.Extract the NVIDIA Run:ai package and replace
<VERSION>in the command below and run:
Upload Images
NVIDIA Run:ai assumes the existence of a Docker registry within your organization for hosting container images. The installation requires the network address and port for this registry (referred to as <REGISTRY_URL>).
Upload images to a local Docker Registry. Set the Docker Registry address in the form of
NAME:PORT(do not addhttps):Run the following script. You must have at least 20GB of free disk space to run. If Docker is configured to run as non-root then
sudois not required:
The script should create a file named custom-env.yaml which will be used by the control plane installation.
Private Docker Registry (Optional)
Kubernetes
To access the organization's docker registry it is required to set the registry's credentials (imagePullSecret).
Create the secret named runai-reg-creds based on your existing credentials. For more information, see Pull an Image from a Private Registry.
OpenShift
To access the organization's docker registry it is required to set the registry's credentials (imagePullSecret).
Create the secret named runai-reg-creds in the runai-backend namespace based on your existing credentials. The configuration will be copied over to the runai namespace at cluster install. For more information, see Allowing pods to reference images from other secured registries.
Set Up Your Environment
External Postgres Database (Optional)
If you have opted to use an external PostgreSQL database, you need to perform initial setup to ensure successful installation. Follow these steps:
Create a SQL script file, edit the parameters below, and save it locally:
Replace
<DATABASE_NAME>with a dedicate database name for NVIDIA Run:ai in your PostgreSQL database.Replace
<ROLE_NAME>with a dedicated role name (user) for NVIDIA Run:ai database.Replace
<ROLE_PASSWORD>with a password for the new PostgreSQL role.Replace
<GRAFANA_PASSWORD>with the password to be set for Grafana integration.
Run the following command on a machine where PostgreSQL client (
pgsql) is installed:Replace
<POSTGRESQL_HOST>with the PostgreSQL ip address or hostname.Replace
<POSTGRESQL_USER>with the PostgreSQL username.Replace
<POSTGRESQL_PORT>with the port number where PostgreSQL is running.Replace
<POSTGRESQL_DB>with the name of your PostgreSQL database.Replace
<POSTGRESQL_DB>with the name of your PostgreSQL database.Replace
<SQL_FILE>with the path to the SQL script created in the previous step.
Last updated