runai training pytorch exec
execute a command in a pytorch training job
runai training pytorch exec [WORKLOAD_NAME] [flags]
Examples
# Execute bash in the pytorch training's main worker
runai training pytorch exec pytorch-01 --tty --stdin -- /bin/bash
# Execute ls command in the pytorch training's main worker
runai training pytorch exec pytorch-01 -- ls
# Execute a command in a specific pytorch training worker
runai training pytorch exec pytorch-01 --pod pytorch-01-worker-1 -- nvidia-smi
Options
-c, --container string Container name for log extraction
-h, --help help for exec
--pod string Workload pod ID for log extraction, default: master (0-0)
--pod-running-timeout duration Pod check for running state timeout.
-p, --project string Specify the project to which the command applies. By default, commands apply to the default project. To change the default project use ‘runai config project <project name>’
-i, --stdin Pass stdin to the container
-t, --tty Stdin is a TTY
--wait-timeout duration Timeout for waiting for workload to be ready for log streaming
Options inherited from parent commands
--config-file string config file name; can be set by environment variable RUNAI_CLI_CONFIG_FILE (default "config.json")
--config-path string config path; can be set by environment variable RUNAI_CLI_CONFIG_PATH
-d, --debug enable debug mode
-q, --quiet enable quiet mode, suppress all output except error messages
--verbose enable verbose mode
SEE ALSO
runai training pytorch - pytorch management
Last updated