Description:
DGXC SRE at NVIDIA ensures that our internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and at the same time enabling developers to make changes to the existing system through careful preparation and planning while keeping an eye on capacity, latency and performance. We are looking for systems and software engineers who are interested in building tooling, reporting, automation, and ML to enable operational excellence across a highly dynami
Apr 30, 2025;
from:
dice.com