NVIDIA AI Infrastructure NCP-AII Question # 15 Topic 2 Discussion
You are validating the environment of an NVIDIA GPU-accelerated data center during post-deployment checks. Which one action is essential to confirm that power and cooling are sufficient for the stable operation of NVIDIA DGX H100 systems?
A. Confirm the system fans are running at 100% under all workloads to prevent overheating.
B. Review the system BIOS to ensure GPU overclocking is enabled for maximum performance.
C. Use NVSM to disable unused PCIe devices to reduce overall system heat output.
D. Verify that each DGX system is connected to redundant, properly rated PDUs and that all power supplies are reporting nominal input.
Stable operation of high-density AI infrastructure like the DGX H100 requires strict adherence to power and thermal specifications. A single DGX H100 system can draw up to 10.2 kW under peak load, so the most essential validation step is confirming that the electrical "infrastructure-to-server" handoff is healthy. This means verifying that the system is connected to redundant PDUs (Power Distribution Units) rated to handle the amperage requirements without tripping breakers. Using NVSM (NVIDIA System Management), an administrator should check that all six power supplies (PSUs) are functional and receiving nominal input voltage (typically 200-240 V). If a PSU reports sub-optimal input or a "Loss of Redundancy," the system may throttle performance or shut down unexpectedly during a heavy training run.
Fans running at 100% at all times (Option A) would actually indicate an inefficient or failed cooling policy, since fans should scale dynamically with thermals. Overclocking (Option B) is not supported or recommended for enterprise DGX systems, which are already factory-tuned for the highest stable performance. Disabling unused PCIe devices (Option C) does nothing to confirm that the facility's power and cooling are adequate.
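To make the two checks in the explanation concrete, here is a minimal Python sketch. It is illustrative only: `PsuReading`, `psu_problems`, and `feed_can_carry` are hypothetical names, the readings are hand-entered rather than parsed from NVSM output, and the 80% derating factor and the "one feed must carry the full load alone" model are assumptions that should be replaced with your site's electrical design rules. The 10.2 kW peak, the six PSUs, and the 200-240 V range come from the explanation above.

```python
from dataclasses import dataclass

PEAK_DRAW_KW = 10.2               # per-system peak draw quoted in the explanation
NOMINAL_INPUT_V = (200.0, 240.0)  # nominal PSU input range quoted in the explanation
EXPECTED_PSU_COUNT = 6            # PSUs per DGX H100, per the explanation


@dataclass
class PsuReading:
    """Hypothetical container for one PSU's status (not an NVSM data type)."""
    name: str
    input_volts: float
    healthy: bool


def psu_problems(readings):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if len(readings) != EXPECTED_PSU_COUNT:
        problems.append(f"expected {EXPECTED_PSU_COUNT} PSUs, got {len(readings)}")
    low, high = NOMINAL_INPUT_V
    for r in readings:
        if not r.healthy:
            problems.append(f"{r.name}: reported unhealthy")
        elif not (low <= r.input_volts <= high):
            problems.append(f"{r.name}: input {r.input_volts} V outside {low}-{high} V")
    return problems


def feed_can_carry(systems_per_rack, feed_capacity_kw, derate=0.8):
    """Check whether a single feed could carry every system at peak draw.

    Assumes the surviving feed must carry the whole rack alone, and that
    only `derate` of the rated capacity is usable for continuous load.
    """
    required_kw = systems_per_rack * PEAK_DRAW_KW
    return required_kw <= feed_capacity_kw * derate


if __name__ == "__main__":
    readings = [PsuReading(f"PSU{i}", 208.0, True) for i in range(6)]
    readings[3] = PsuReading("PSU3", 180.0, True)  # simulated low input
    for problem in psu_problems(readings):
        print(problem)
    print("feed OK:", feed_can_carry(systems_per_rack=4, feed_capacity_kw=60.0))
```

In practice the readings would come from whatever your monitoring stack exposes (NVSM, the BMC, or PDU telemetry); the point of the sketch is only that both the per-PSU input and the per-feed capacity need to pass before the deployment is considered validated.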