NCP-AII Exam Question 141
You are configuring a server with multiple GPUs for CUDA-aware MPI. Which environment variable is critical for ensuring proper GPU affinity, so that each MPI process uses the correct GPU?
NCP-AII Exam Question 142
A data center is designed for AI training with a high degree of east-west traffic. Considering cost and performance, which network topology is generally the most suitable?
NCP-AII Exam Question 143
A data scientist reports that training performance on a DGX A100 server has significantly degraded over the past week. 'nvidia-smi' shows all GPUs functioning, but 'nvprof' reveals substantially increased 'cudaMemcpy' times. What is the MOST likely bottleneck?
NCP-AII Exam Question 144
You are tasked with upgrading the NVIDIA driver on a Kubernetes node hosting GPU-accelerated AI workloads. To minimize downtime and ensure a smooth transition, which sequence of steps should you follow?
