NCP-AII Exam Question 86

You are configuring a network for a distributed training job using multiple DGX servers connected via InfiniBand. After launching the training job, you observe that the inter-GPU communication is significantly slower than expected, even though 'ibstat' shows all links are up and active. What is the MOST likely cause of this performance bottleneck?
  • NCP-AII Exam Question 87

    You are installing the NGC CLI using 'pip' behind a corporate proxy. The installation fails due to connection errors. How do you configure 'pip' to use the proxy during the NGC CLI installation?
  • NCP-AII Exam Question 88

    You are developing a distributed deep learning application that uses multiple GPUs across several Docker containers running on different physical servers. How do you ensure that each container can access and utilize the GPUs on its respective host?
  • NCP-AII Exam Question 89

    You are evaluating different parallel file systems for an AI training cluster. You need a file system that is POSIX-compliant and offers high bandwidth and low latency. Which of the following options are viable candidates?
  • NCP-AII Exam Question 90

    You're setting up a cluster with 8 NVIDIA A100 GPUs. Each GPU needs to read 4 GB/s from storage to stay fully utilized. The network connecting the storage and compute nodes has a bandwidth of 25 GB/s. What is the maximum number of GPUs that can be simultaneously saturated with data without exceeding the network bandwidth?
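
The arithmetic behind Question 90 can be sketched as a floor division, capped at the number of GPUs installed. This is a minimal sketch, not an official answer key: the function name `max_saturated_gpus` is hypothetical, and it assumes the network is the only bottleneck, that both figures are in the same units (GB/s), and that protocol overhead is ignored.

```python
def max_saturated_gpus(network_gb_s: float, per_gpu_gb_s: float, total_gpus: int) -> int:
    """Return how many GPUs can be fed at full rate without exceeding the network.

    Assumption: the storage network is the only limit and has no overhead.
    """
    # Only whole GPUs count as "saturated", so round the ratio down.
    fed_by_network = int(network_gb_s // per_gpu_gb_s)
    # The cluster cannot saturate more GPUs than it physically has.
    return min(total_gpus, fed_by_network)


# Figures from the question: 25 GB/s network, 4 GB/s per GPU, 8 GPUs.
print(max_saturated_gpus(25, 4, 8))  # 6
```

Under these assumptions six GPUs consume 24 GB/s, leaving 1 GB/s of headroom. A seventh GPU would need 28 GB/s in total, which exceeds the 25 GB/s link.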