Successfully ran diagnostic for group.
+---------------------------+------------------------------------------------+
| Diagnostic                | Result                                         |
+===========================+================================================+
|-----  Metadata  ----------+------------------------------------------------|
| DCGM Version              | 3.3.8                                          |
| Driver Version Detected   | 535.129.03                                     |
| GPU Device IDs Detected   | 2330,2330,2330,2330,2330,2330,2330,2330        |
|-----  Deployment  --------+------------------------------------------------|
| Denylist                  | Pass                                           |
| NVML Library              | Pass                                           |
| CUDA Main Library         | Pass                                           |
| Permissions and OS Blocks | Pass                                           |
| Persistence Mode          | Pass                                           |
| Environment Variables     | Pass                                           |
| Page Retirement/Row Remap | Fail                                           |
| Error                     | GPU 2 had uncorrectable memory errors and 0 r  |
|                           | ows were remapped                              |
| Graphics Processes        | Skip                                           |
| Inforom                   | Skip                                           |
+-----  Integration  -------+------------------------------------------------+
| PCIe                      | Skip - All                                     |
+-----  Hardware  ----------+------------------------------------------------+
| GPU Memory                | Skip - All                                     |
| Pulse Test                | Skip - All                                     |
+-----  Stress  ------------+------------------------------------------------+
| Targeted Stress           | Skip - All                                     |
| Targeted Power            | Skip - All                                     |
| Memory Bandwidth          | Skip - All                                     |
| Memtest                   | Skip - All                                     |
| EUD Test                  | Skip - All                                     |
+---------------------------+------------------------------------------------+