NVIDIA AI Infrastructure NCP-AII Question # 35 Topic 4 Discussion
NCP-AII Exam Topic 4 Question 35 Discussion:
Question #: 35
Topic #: 4
An engineer is reimaging a DGX system in a large cluster. Which method ensures the most efficient and secure remote installation without physical access?
A.
Use apt-get to upgrade the operating system without rebooting the system.
B.
Create a USB drive with the ISO and manually boot from it on the DGX system.
C.
Build a software image on Base Command Manager and then reimage the system.
D.
Skip ISO verification and directly flash the operating system to the disk via SSH.
Building a software image in NVIDIA Base Command Manager and using it to reimage the DGX system is the most efficient and secure method in a large cluster. In managed DGX infrastructure, remote image provisioning allows administrators to maintain consistent operating system versions, drivers, CUDA components, network settings, security baselines, and cluster management agents across compute nodes. This avoids manual access to each server and reduces configuration drift. Using apt-get to upgrade the operating system is not equivalent to a controlled reimage and may leave inconsistent packages or unsupported driver combinations. Creating a USB drive requires physical access and does not scale for large deployments. Skipping ISO verification and directly flashing the operating system over SSH is unsafe because it bypasses image validation and can leave nodes in an unrecoverable or untrusted state. BCM-based imaging supports repeatable deployment, centralized control, rollback planning, and integration with node categories or software images. This is the preferred operational model for reliable DGX cluster bring-up and lifecycle management.
Contribute your Thoughts:
Chosen Answer:
This is a voting comment (?). You can switch to a simple comment. It is better to Upvote an existing comment if you don't have anything to add.
Submit