Successfully ran diagnostic for group.
+---------------------------+------------------------------------------------+
| Diagnostic                | Result                                         |
+===========================+================================================+
|-----  Metadata  ----------+------------------------------------------------|
| DCGM Version              | 3.3.8                                          |
| Driver Version Detected   | 535.129.03                                     |
| GPU Device IDs Detected   | 2330,2330,2330,2330,2330,2330,2330,2330        |
|-----  Deployment  --------+------------------------------------------------|
| Denylist                  | Pass                                           |
| NVML Library              | Pass                                           |
| CUDA Main Library         | Pass                                           |
| Permissions and OS Blocks | Pass                                           |
| Persistence Mode          | Pass                                           |
| Environment Variables     | Pass                                           |
| Page Retirement/Row Remap | Pass                                           |
| Graphics Processes        | Pass                                           |
| Inforom                   | Pass                                           |
+-----  Integration  -------+------------------------------------------------+
| PCIe                      | Pass - GPUs: 0, 2, 3, 4, 5, 6, 7               |
|                           | Fail - GPU: 1                                  |
| Warning                   | GPU 1 GPU 1 is running at PCI link width 8X,   |
|                           | which is below the minimum allowed link gener  |
|                           | ation of 16 (parameter 'min_pci_width') Check  |
|                           | DCGM and system configuration. This error ma   |
|                           | y be eliminated with an updated configuration  |
|                           | .                                              |
| Info                      | GPU 0 GPU to Host bandwidth: 55.17 GB/s, GPU   |
|                           | 0 Host to GPU bandwidth: 55.00 GB/s, GPU 0     |
|                           | bidirectional bandwidth: 100.73 GB/s, GPU 0 G  |
|                           | PU to Host latency: 1.964 us, GPU 0 Host to    |
|                           | GPU latency: 2.175 us, GPU 0 bidirectional l   |
|                           | atency: 3.370 us                               |
| Info                      | GPU 1 GPU to Host bandwidth: 28.90 GB/s, GPU   |
|                           | 1 Host to GPU bandwidth: 28.58 GB/s, GPU 1     |
|                           | bidirectional bandwidth: 51.86 GB/s, GPU 1 GP  |
|                           | U to Host latency: 1.929 us, GPU 1 Host to G   |
|                           | PU latency: 2.164 us, GPU 1 bidirectional la   |
|                           | tency: 3.358 us                                |
| Info                      | GPU 2 GPU to Host bandwidth: 55.15 GB/s, GPU   |
|                           | 2 Host to GPU bandwidth: 55.02 GB/s, GPU 2     |
|                           | bidirectional bandwidth: 100.75 GB/s, GPU 2 G  |
|                           | PU to Host latency: 1.949 us, GPU 2 Host to    |
|                           | GPU latency: 2.216 us, GPU 2 bidirectional l   |
|                           | atency: 3.937 us                               |
| Info                      | GPU 3 GPU to Host bandwidth: 55.15 GB/s, GPU   |
|                           | 3 Host to GPU bandwidth: 55.01 GB/s, GPU 3     |
|                           | bidirectional bandwidth: 100.74 GB/s, GPU 3 G  |
|                           | PU to Host latency: 2.120 us, GPU 3 Host to    |
|                           | GPU latency: 2.290 us, GPU 3 bidirectional l   |
|                           | atency: 3.513 us                               |
| Info                      | GPU 4 GPU to Host bandwidth: 55.02 GB/s, GPU   |
|                           | 4 Host to GPU bandwidth: 54.84 GB/s, GPU 4     |
|                           | bidirectional bandwidth: 100.62 GB/s, GPU 4 G  |
|                           | PU to Host latency: 1.908 us, GPU 4 Host to    |
|                           | GPU latency: 2.165 us, GPU 4 bidirectional l   |
|                           | atency: 2.981 us                               |
| Info                      | GPU 5 GPU to Host bandwidth: 55.07 GB/s, GPU   |
|                           | 5 Host to GPU bandwidth: 54.88 GB/s, GPU 5     |
|                           | bidirectional bandwidth: 100.63 GB/s, GPU 5 G  |
|                           | PU to Host latency: 1.871 us, GPU 5 Host to    |
|                           | GPU latency: 2.154 us, GPU 5 bidirectional l   |
|                           | atency: 3.045 us                               |
| Info                      | GPU 6 GPU to Host bandwidth: 55.04 GB/s, GPU   |
|                           | 6 Host to GPU bandwidth: 54.88 GB/s, GPU 6     |
|                           | bidirectional bandwidth: 100.62 GB/s, GPU 6 G  |
|                           | PU to Host latency: 1.896 us, GPU 6 Host to    |
|                           | GPU latency: 2.183 us, GPU 6 bidirectional l   |
|                           | atency: 3.114 us                               |
| Info                      | GPU 7 GPU to Host bandwidth: 55.03 GB/s, GPU   |
|                           | 7 Host to GPU bandwidth: 54.90 GB/s, GPU 7     |
|                           | bidirectional bandwidth: 100.67 GB/s, GPU 7 G  |
|                           | PU to Host latency: 1.890 us, GPU 7 Host to    |
|                           | GPU latency: 2.178 us, GPU 7 bidirectional l   |
|                           | atency: 3.124 us                               |
+-----  Hardware  ----------+------------------------------------------------+
| GPU Memory                | Pass - All                                     |
| Diagnostic                | Pass - All                                     |
| Pulse Test                | Pass - All                                     |
+-----  Stress  ------------+------------------------------------------------+
| Targeted Stress           | Pass - All                                     |
| Targeted Power            | Pass - All                                     |
| Memory Bandwidth          | Pass - All                                     |
| Memtest                   | Pass - All                                     |
| EUD Test                  | Skip - All                                     |
+---------------------------+------------------------------------------------+
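
The Info rows put GPU 1, the device reported at PCI link width 8X, at roughly half the host transfer bandwidth of its peers. A minimal sketch of that comparison follows, using the GPU to Host figures copied from the output. The helper name `slow_gpus` and the 0.75 cut-off are hypothetical choices for illustration and are not DCGM's pass criterion.

```python
from statistics import median

# GPU to Host bandwidth in GB/s, copied from the Info rows of the output.
gpu_to_host = {
    0: 55.17, 1: 28.90, 2: 55.15, 3: 55.15,
    4: 55.02, 5: 55.07, 6: 55.04, 7: 55.03,
}


def slow_gpus(bandwidth_by_gpu, threshold=0.75):
    """Return ids of GPUs whose bandwidth is below threshold * median.

    The threshold is an arbitrary illustrative value (an assumption),
    not a limit taken from DCGM.
    """
    mid = median(bandwidth_by_gpu.values())
    return sorted(
        gpu for gpu, bw in bandwidth_by_gpu.items() if bw < threshold * mid
    )


# Ratio of GPU 1's bandwidth to the median across all eight GPUs.
ratio = gpu_to_host[1] / median(gpu_to_host.values())

print("GPUs below cut-off:", slow_gpus(gpu_to_host))
print(f"GPU 1 / median bandwidth: {ratio:.2f}")
```

A ratio near one half would be consistent with a link that negotiated half of the expected 16 lanes, which matches the PCIe warning for GPU 1.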