Where

Software Engineer, Supercomputing, HPC Infrastructure

OpenAI
San Francisco Full-day Full-time

Description:

About the Team We believe that increasing compute is a huge lever to AI progress. The Supercomputing team owns the entire process of building OpenAI's compute and infrastructure. This includes the deployment of huge clusters using Kubernetes and Azure, and building the internal experiment platform for running/training the world's largest AI models. We work at the very cutting edge of speed and scale, combining the traditions of High-Performance Computing (HPC) in a modern cloud and containeri
Apr 26, 2024;   from: dice.com

Similar jobs

Description: About the Team Supercomputers scale vertically. The workloads are synchronous and cluster-scale. These conditions demand a novel approach to cluster infrastructure, and it is the work of the Supercomputing Scalability Pillar to invent it. The ...
9 days ago
Description: About the Team The Supercomputing Scheduling Pillar at OpenAI is dedicated to ensuring the reliability, scalability, and user-friendliness of job lifecycle management, with an emphasis on efficient and flexible job scheduling, quota ...
9 days ago
Description: About Pinterest: Millions of people across the world come to Pinterest to find new ideas every day. It's where they get inspiration, dream about new possibilities and plan for what matters most. Our mission is to help those people find their ...
18 days ago
  • Meta Platforms, Inc. (f/k/a Facebook, Inc.)
  • San Francisco
Description: Meta Platforms, Inc. (f/k/a Facebook, Inc.) has the following position in San Francisco, CA: Software Engineer, Android: Research, design, develop, and test operating systems-level software, compilers, and network distribution software for ...
9 days ago