Logs Collection

This guide provides instructions for IT administrators on collecting NVIDIA Run:ai logs for support, including prerequisites, CLI commands, and log file retrieval. It also covers enabling verbose logging for Prometheus and the NVIDIA Run:ai Scheduler.

Collect Logs to Send to Support

To collect NVIDIA Run:ai logs, follow these steps:

Prerequisites

Ensure that you have administrator-level access to the Kubernetes cluster where NVIDIA Run:ai is installed.

Logs Verbosity

Increase log verbosity to capture more detailed information. The additional detail gives deeper insight into system behavior and makes it easier to identify and resolve issues.

Prerequisites

Before you begin, ensure you have the following:

  • Access to the Kubernetes cluster where NVIDIA Run:ai is installed

  • kubectl installed and configured:

    • The Kubernetes command-line tool, kubectl, must be installed and configured to interact with the cluster.

    • Sufficient privileges to edit configurations and view logs.

  • Monitoring Disk Space

    • When enabling verbose logging, ensure adequate disk space to handle the increased log output, especially when enabling debug or high verbosity levels.
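The prerequisites above can be checked from a terminal before any settings are changed. The following is a minimal sketch rather than part of the official procedure. It assumes NVIDIA Run:ai is installed in a namespace named `runai` and that the RunaiConfig resource can be addressed as `runaiconfig`; adjust both if your installation differs. With `DRY_RUN=1` (the default here) the commands are only printed, not executed.

```shell
#!/bin/sh
# Assumption: Run:ai is installed in the "runai" namespace; override if needed.
NAMESPACE="${NAMESPACE:-runai}"

# Print the command when DRY_RUN=1 (default); execute it when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

check_prerequisites() {
  # kubectl is installed and can reach the cluster (prints client and server versions).
  run kubectl version
  # Permission to edit the RunaiConfig (the resource name is an assumption).
  run kubectl auth can-i update runaiconfig -n "$NAMESPACE"
  # Permission to read pod logs.
  run kubectl auth can-i get pods --subresource=log -n "$NAMESPACE"
}

check_prerequisites
```

Run it with `DRY_RUN=0` to execute the checks against the cluster; each `kubectl auth can-i` call answers `yes` or `no`.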

Adding Verbosity

Adding verbosity to Prometheus

To increase the logging verbosity for Prometheus, follow these steps:

  1. Edit the RunaiConfig to adjust Prometheus log levels. Copy the following command to your terminal:

  2. In the configuration file that opens, add or modify the following section to set the log level to debug:

  3. Save the changes. To view the Prometheus logs with the new verbosity level, run:

    This command streams the last 100 lines of logs from Prometheus, providing detailed information useful for debugging.
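The three steps above can be sketched as the following commands. This is an illustration under stated assumptions, not a confirmed procedure: it assumes the `runai` namespace, a RunaiConfig object named `runai`, a log-level field at `spec.prometheus.spec.logLevel`, and a Prometheus pod named `prometheus-runai-0`. Verify each of these against your installed version before relying on them. With `DRY_RUN=1` (the default here) the commands are only printed.

```shell
#!/bin/sh
# All names below are assumptions; verify them against your installation.
NAMESPACE="${NAMESPACE:-runai}"
RUNAICONFIG="${RUNAICONFIG:-runai}"
PROMETHEUS_POD="${PROMETHEUS_POD:-prometheus-runai-0}"

# Print the command when DRY_RUN=1 (default); execute it when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

prometheus_debug() {
  # Step 1: open the RunaiConfig in your editor.
  run kubectl edit runaiconfig "$RUNAICONFIG" -n "$NAMESPACE"
  # Step 2 (done inside the editor): set the assumed field
  #   spec:
  #     prometheus:
  #       spec:
  #         logLevel: debug
  # Step 3: show the last 100 Prometheus log lines (append -f to keep following).
  run kubectl logs -n "$NAMESPACE" "$PROMETHEUS_POD" --tail=100
}

prometheus_debug
```

If the pod name differs in your cluster, `kubectl get pods -n runai` lists the candidates.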

Adding verbosity to the Scheduler

To enable extended logging for the NVIDIA Run:ai scheduler:

  1. Edit the RunaiConfig to adjust scheduler verbosity:

  2. Add or modify the following section under the scheduler settings:

    This increases the verbosity level of the scheduler logs to provide more detailed output.
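The scheduler steps above follow the same pattern and can be sketched as shown here. The field location (`spec.runai-scheduler.args.verbosity`) and the value `6` are assumptions used for illustration, as are the `runai` namespace and the RunaiConfig object name `runai`; confirm them for your version. With `DRY_RUN=1` (the default here) the commands are only printed.

```shell
#!/bin/sh
# All names below are assumptions; verify them against your installation.
NAMESPACE="${NAMESPACE:-runai}"
RUNAICONFIG="${RUNAICONFIG:-runai}"

# Print the command when DRY_RUN=1 (default); execute it when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

scheduler_verbose() {
  # Step 1: open the RunaiConfig in your editor.
  run kubectl edit runaiconfig "$RUNAICONFIG" -n "$NAMESPACE"
  # Step 2 (done inside the editor): set the assumed field
  #   spec:
  #     runai-scheduler:
  #       args:
  #         verbosity: 6
  # Afterwards: list pods to locate the scheduler pod, then read it with kubectl logs.
  run kubectl get pods -n "$NAMESPACE"
}

scheduler_verbose
```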

Warning: Enabling verbose logging can significantly increase disk space usage. Monitor your storage capacity and adjust the verbosity level as necessary.
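One way to act on this warning is a small disk usage check run on the nodes. The sketch below assumes container logs are stored under `/var/log` on the node, which is common for Kubernetes but worth confirming. The 80 percent threshold is an arbitrary example value, and the function names are illustrative.

```shell
#!/bin/sh
# Print the percentage of used space on the filesystem that holds a path.
disk_used_percent() {
  # df -P prints stable POSIX columns; column 5 is the capacity (for example 42%).
  df -P "$1" | awk 'NR==2 { sub(/%/, "", $5); print $5 }'
}

# Report whether usage has reached a threshold (default 80, an example value).
warn_if_full() {
  path="$1"
  threshold="${2:-80}"
  # Skip paths that do not exist on this machine.
  [ -e "$path" ] || { echo "SKIP: $path not found"; return 0; }
  used="$(disk_used_percent "$path")"
  if [ "$used" -ge "$threshold" ]; then
    echo "WARNING: $path is ${used}% full"
  else
    echo "OK: $path is ${used}% full"
  fi
}

# Assumption: container logs on the node live under /var/log.
warn_if_full "${LOG_PATH:-/var/log}"
```

If usage keeps climbing after verbose logging is enabled, lower the verbosity level again once the needed logs have been captured.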
