Where

Site Reliability Engineer (NVIDIA and Cisco UCS infrastructure)

ConfigUSA
San Jose Full-day Full-time

Description:

Site Reliability Engineer Responsibilities & Required Skills/Experience: 1) NVIDIA (DGX) A100/ H100/ H200 2) Cisco UCS-C885A 3) Docker 4) NVIDIA certificated professionals preferred 5) Infrastructure knowledge on above skills 6) DevOps Automation CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins) Terraform, Ansible, Jenkins Python 8) Enterprise Grade Kubernetes cluster (RedHat OpenShift preferred) and/or Google Anthos AI Infrastructure SRE Engineer responsible for Technical knowledge of high-
Oct 30, 2025;   from: dice.com

Similar jobs

Description: Systems Reliability Engineer Intern - LOCAL applicant ONLY Hungry, Humble, Honest, with Heart. The Opportunity This is a 12-week internship with start dates beginning in May 2026, contingent on your availability. You will work in our San Jose ...
23 days ago
Description: A premier chip and silicon IP provider is seeking a Principal Reliability Engineer to join our Operations team in San Jose. In this role, you will be working with some of the brightest inventors and engineers in the world developing products ...
2 days ago
Description: Sr. Systems Reliability Engineer - San Jose, CA Hungry, Humble, Honest, with Heart. The Opportunity Are you a proactive problem-solver with a passion for customer success and expertise in troubleshooting technologies like virtualization, ...
20 days ago
  • Jobot
  • San Jose
Description: 100% REMOTE Senior Site Reliability Engineer / Senior Dev Ops Engineer Needed for Growing Fintech Startup! This Jobot Job is hosted by: Reed Kellick Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. ...
8 days ago