Logs collection
This section provides instructions for IT administrators on collecting NVIDIA Run:ai logs for support, including prerequisites, CLI commands, and log file retrieval. It also covers enabling verbose logging for Prometheus and the NVIDIA Run:ai Scheduler.
Collect logs to send to support
To collect NVIDIA Run:ai logs, follow these steps:
Prerequisites
Ensure that you have administrator-level access to the Kubernetes cluster where NVIDIA Run:ai is installed.
The NVIDIA Run:ai Administrator Command-Line Interface (CLI) must be installed.
Step-by-Step Instructions
Run the command from your local machine or a bastion host (secure server). Open a terminal on a machine that has network access to the Kubernetes cluster and has the NVIDIA Run:ai Administrator CLI installed.
Collect the Logs. Execute the log collection command of the NVIDIA Run:ai Administrator CLI. This command gathers all relevant NVIDIA Run:ai logs from the system and generates a compressed file.
Locate the Generated File. After running the command, note the location of the generated compressed log file. You can retrieve and send this file to NVIDIA Run:ai Support for further troubleshooting.
Note
The tar file packages the logs of NVIDIA Run:ai components only. It does not include logs of researcher containers that may contain private information.
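The collection steps above can be sketched as a short shell script. This is a minimal sketch, not the official procedure: it assumes the Administrator CLI binary is named runai-adm and provides a collect-logs subcommand, and that the archive is written to the current directory. The helper functions collect_runai_logs and latest_archive are hypothetical names introduced for illustration.

```shell
#!/bin/sh
# Assumption: the Administrator CLI binary is named runai-adm and provides a
# collect-logs subcommand. Confirm against your installed CLI version.
collect_runai_logs() {
  if ! command -v runai-adm >/dev/null 2>&1; then
    echo "runai-adm not found on PATH" >&2
    return 1
  fi
  runai-adm collect-logs
}

# Hypothetical helper: print the most recently modified tar archive in a
# directory (defaults to the current directory). Prints nothing if none exist.
latest_archive() {
  ls -t "${1:-.}"/*.tar* 2>/dev/null | head -n 1
}

# Assumption: the archive lands in the current working directory.
if collect_runai_logs; then
  echo "Newest archive: $(latest_archive .)"
fi
```

If the CLI prints the archive location itself, prefer that output over the directory scan, which only guesses at the newest tar file.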
Logs verbosity
Increase log verbosity to capture more detailed information. The additional detail provides deeper insight into system behavior and makes it easier to identify and resolve issues.
Prerequisites
Before you begin, ensure you have the following:
Access to the Kubernetes cluster where NVIDIA Run:ai is installed: Including the necessary permissions to view and modify configurations.
kubectl installed and configured: The Kubernetes command-line tool, kubectl, must be installed and configured to interact with the cluster.
Sufficient privileges to edit configurations and view logs.
Monitoring disk space: When enabling verbose logging, ensure adequate disk space to handle the increased log output, especially when enabling debug or high verbosity levels.
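The prerequisites above can be checked before raising verbosity. The following is a rough sketch under stated assumptions: the namespace runai, the configmaps resource used to stand in for "configurations", and the 80% warning threshold are illustrative guesses, and the function names are hypothetical. The disk check inspects the machine the script runs on, so run it wherever the logs are actually stored.

```shell
#!/bin/sh
NAMESPACE="${NAMESPACE:-runai}"   # assumption: namespace of the installation
DISK_WARN_PERCENT=80              # hypothetical warning threshold

# Print the used-space percentage (digits only) of the filesystem holding a path.
disk_used_percent() {
  df -P "${1:-/}" | awk 'NR==2 { gsub("%", "", $5); print $5 }'
}

# Ask the API server whether the current user may read pod logs and update
# ConfigMaps. ConfigMaps are only an illustrative stand-in for configurations.
check_cluster_access() {
  kubectl auth can-i get pods --subresource=log -n "$NAMESPACE" --request-timeout=5s
  kubectl auth can-i update configmaps -n "$NAMESPACE" --request-timeout=5s
}

if command -v kubectl >/dev/null 2>&1; then
  check_cluster_access || echo "Missing permissions or cluster unreachable" >&2
else
  echo "kubectl not found on PATH" >&2
fi

used=$(disk_used_percent /)
if [ "${used:-0}" -ge "$DISK_WARN_PERCENT" ]; then
  echo "Warning: root filesystem is ${used}% full" >&2
fi
```

Each kubectl auth can-i call prints yes or no, so the output shows which permission is missing without changing anything on the cluster.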
Adding verbosity