Launching workloads with GPU fractions

This quick start provides a step-by-step walkthrough for running a Jupyter Notebook workspace using GPU fractions.

NVIDIA Run:ai’s GPU fractions provides an agile and easy-to-use method to share a GPU or multiple GPUs across workloads. With GPU fractions, you can divide the GPU/s memory into smaller chunks and share the GPU/s compute resources between different workloads and users, resulting in higher GPU utilization and more efficient resource allocation.

Note

If enabled by your Administrator, the NVIDIA Run:ai UI allows you to create a new workload using either the Flexible or Original submission form. The steps in this quick start guide reflect the Original form only.

Prerequisites

Before you start, make sure:

  • You have created a project or have one created for you.

  • The project has an assigned quota of at least 0.5 GPU.

Step 1: Logging in

Browse to the provided NVIDIA Run:ai user interface and log in with your credentials.

Step 2: Submitting a workspace

  1. Go to the Workload manager → Workloads

  2. Click +NEW WORKLOAD and select Workspace

  3. Select under which cluster to create the workload

  4. Select the project in which your workspace will run

  5. Select a preconfigured template or select the Start from scratch to launch a new workspace quickly

  6. Enter a name for the workspace (if the name already exists in the project, you will be requested to submit a different name)

  7. Click CONTINUE

    In the next step:

  8. Select the ‘jupyter-lab’ environment for your workspace (Image URL: jupyter/scipy-notebook)

    • If the ‘jupyter-lab’ is not displayed in the gallery, follow the below steps:

      • Click +NEW ENVIRONMENT

      • Enter a name for the environment. The name must be unique.

      • Enter the jupyter-lab Image URL - jupyter/scipy-notebook

      • Tools - Set the connection for your tool

        • Click +TOOL

        • Select Jupyter tool from the list

      • Set the runtime settings for the environment

        • Click +COMMAND

        • Enter command - start-notebook.sh

        • Enter arguments - --NotebookApp.base_url=/${RUNAI_PROJECT}/${RUNAI_JOB_NAME} --NotebookApp.token=''

        Note: If host-based routing is enabled on the cluster, enter the --NotebookApp.token='' only.

      • Click CREATE ENVIRONMENT

    The newly created environment will be selected automatically

  9. Select the ‘small-fraction’ compute resource for your workspace (GPU % of devices: 10)

    • If ‘small-fraction’ is not displayed in the gallery, follow the below steps:

      • Click +NEW COMPUTE RESOURCE

        • Enter a name for the compute resource. The name must be unique.

        • Set GPU devices per pod - 1

        • Set GPU memory per device

          • Select % (of device) - Fraction of a GPU device’s memory

          • Set the memory Request - 10 (the workload will allocate 10% of the GPU memory)

        • Optional: set the CPU compute per pod - 0.1 cores (default)

        • Optional: set the CPU memory per pod - 100 MB (default)

      • Click CREATE COMPUTE RESOURCE

    The newly created compute resource will be selected automatically

  10. Click CREATE WORKSPACE

Step 3: Connecting to the Jupyter Notebook

  1. Select the newly created workspace with the Jupyter application that you want to connect to

  2. Click CONNECT

  3. Select the Jupyter tool. The selected tool is opened in a new tab on your browser.

Next steps

Manage and monitor your newly created workload using the Workloads table.

Last updated