NCP-AIO Exam Question 111
You have a Kubernetes cluster with several nodes equipped with NVIDIA GPUs. You want to ensure that pods requesting GPUs are only scheduled on nodes that have the appropriate NVIDIA drivers and the NVIDIA Container Toolkit installed. Which Kubernetes feature(s) can you leverage to achieve this?
NCP-AIO Exam Question 112
You are using a custom topology with NVSwitches, and 'nvsm' is not detecting the correct links. You need to manually define the topology. Which of the following is the correct way to provide a custom topology definition to
NCP-AIO Exam Question 113
A user reports that they are unable to submit jobs to a specific partition. You've verified that the partition exists and is enabled. What are the possible reasons for this?
NCP-AIO Exam Question 114
You have a Kubernetes cluster running AI workloads. The pods are experiencing intermittent storage performance issues, particularly when writing checkpoints. You are using a Container Storage Interface (CSI) driver for your storage. How would you go about troubleshooting this issue, focusing on the CSI driver and Kubernetes interaction?
NCP-AIO Exam Question 115
A data scientist submits a Run.ai job requesting 4 GPUs. However, due to resource constraints, only 2 GPUs are immediately available. You want the job to automatically start running as soon as the remaining 2 GPUs become available, without manual intervention. How do you configure Run.ai to achieve this?
