NVIDIA AI Infrastructure NCP-AII Question # 25 Topic 3 Discussion

NCP-AII Exam Topic 3 Question 25 Discussion:
Question #: 25
Topic #: 3

A systems engineer is updating firmware across a large DGX cluster using automation. What is the best practice for minimizing risk and ensuring cluster health during and after the process?


A. Drain nodes from the scheduler, run pre-update diagnostics, update firmware in batches, and verify health post-update before scaling to the next batch.

B. To save time, simultaneously update all nodes in the cluster without draining or diagnostics.

C. Update nodes that have reported faults, leaving others on older firmware.

D. Drain nodes from the scheduler, update firmware in batches, skip diagnostics and verify health post-update before scaling to the next batch.
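
The staged workflow described in option A (drain, run pre-update diagnostics, update in batches, verify health before moving on) can be sketched roughly as follows. This is only an illustrative outline, not NVIDIA tooling: `rolling_firmware_update` and `batched` are names made up here, and the `drain`, `diagnose`, `update`, `healthy`, and `resume` callables are hypothetical hooks that would wrap whatever scheduler and firmware tools a given cluster actually uses.

```python
from typing import Callable, Iterator, List, Sequence


def batched(nodes: Sequence[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive batches of at most `size` nodes."""
    if size < 1:
        raise ValueError("batch size must be at least 1")
    for start in range(0, len(nodes), size):
        yield list(nodes[start:start + size])


def rolling_firmware_update(
    nodes: Sequence[str],
    batch_size: int,
    drain: Callable[[str], None],      # hypothetical: remove node from scheduling
    diagnose: Callable[[str], bool],   # hypothetical: pre-update diagnostics pass?
    update: Callable[[str], None],     # hypothetical: apply the firmware update
    healthy: Callable[[str], bool],    # hypothetical: post-update health check pass?
    resume: Callable[[str], None],     # hypothetical: return node to scheduling
) -> List[str]:
    """Update nodes batch by batch, stopping at the first failed check."""
    completed: List[str] = []
    for batch in batched(nodes, batch_size):
        # 1. Take the whole batch out of the scheduler before touching it.
        for node in batch:
            drain(node)

        # 2. Pre-update diagnostics: do not flash a node that is already unhealthy.
        bad = [n for n in batch if not diagnose(n)]
        if bad:
            raise RuntimeError(f"pre-update diagnostics failed on {bad}")

        # 3. Apply the firmware update to this batch only.
        for node in batch:
            update(node)

        # 4. Verify health; a failure halts the rollout before the next batch.
        bad = [n for n in batch if not healthy(n)]
        if bad:
            raise RuntimeError(f"post-update health check failed on {bad}")

        # 5. Only verified nodes go back into service.
        for node in batch:
            resume(node)
        completed.extend(batch)
    return completed
```

Stopping at the first failed batch limits the impact of a bad firmware image to at most `batch_size` nodes, which is the point of not updating the whole cluster at once. Nodes in a failed batch stay drained so they can be inspected manually.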

