HireSleek

Software Engineer – DGX Cloud API Services

  • Full Time
  • Remote

Website Nvidia

About Nvidia

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions.

Job Summary

Join NVIDIA’s DGX Cloud Kubernetes API Services team and be at the forefront of building GPU-accelerated Kubernetes clusters supporting NVIDIA AI, robotics, and scientific computing projects. As an API Services Software Engineer, you will work across the stack with partner teams to bring NVIDIA’s GPUs to life in a cloud or on-prem environment, ensuring end-to-end performance across compute, storage, and networking.

Key Responsibilities

  • Help build out and scale customer-facing APIs and systems for the DGX Cloud Kubernetes Platform.
  • Work with the Runtime and Cluster Architecture teams to provide a complete GPU-accelerated Kubernetes clusters to a wide variety of NVIDIA initiatives.
  • Be the voice of our customers to ensure they have a smooth experience to access the compute they need for the workloads they want.
  • Build platform services for other NVIDIA developers to bring their services to NVIDIA Kubernetes clusters.

Requirements

  • BS/MS in Computer Science or related field (or equivalent experience).
  • 2+ years of relevant work experience.
  • Experience in building foundational SaaS systems at scale, such as API design, user management, or authentication and authorization flows.
  • Proficiency in Go and building Go services at scale.
  • Experience with deploying and maintaining services atop Kubernetes.
  • Experience writing automation with Kubernetes (i.e. Controllers, CustomResourceDefinitions, etc.).
  • Background with AWS or GCP and related technologies like S3, GCS, RDS, etc.
  • Ability to solve issues across multiple layers: infrastructure, Kubernetes, application runtime.
  • Communicate effectively across a big organization, both within and outside the Kubernetes Platform organization.

Preferred Qualifications

  • Experience working on internal tools and services for large engineering organizations.
  • Experience working across multiple layers of cloud infrastructure such as CSP APIs, Terraform, Kubernetes, and custom controllers and automation atop.
  • Experience working deeply in and with the upstream Kubernetes apiserver code.
  • Background with user-facing APIs with a focus on customer and/or developer experience.

To apply for this job please visit nvidia.wd5.myworkdayjobs.com.