Comprehensive and Detailed Explanation From Exact Extract:
For troubleshooting and performance optimization on NVIDIA DGX systems such as DGX H100, theNVIDIA System Management (nvsm)tool is used to gather system health and diagnostic data. The command nvsm dump health is the correct command to generate and export detailed system logs related to the health report of the DGX system.
nvsm show logs --save is not a recognized command format.
nvsm get logs retrieves logs but does not specifically dump the health report logs.
nvsm health --dump-log is not a standard documented nvsm command.
Therefore, nvsm dump health is the valid and documented command used to generate system logs focused on health reporting, useful for diagnosing performance issues in DGX H100 systems.
This usage aligns with NVIDIA’s system management tools guidance for DGX platforms as described in NVIDIA AI Operations documentation for troubleshooting and performance optimization.
Contribute your Thoughts:
Chosen Answer:
This is a voting comment (?). You can switch to a simple comment. It is better to Upvote an existing comment if you don't have anything to add.
Submit