HireSleek

Senior Software Engineer, DGX Cloud Orchestration


About NVIDIA

NVIDIA is widely recognized as one of the most desirable employers, and some of the most talented people in the world work here.

Job Summary

We are looking for a Senior Software Engineer to join our DGX Cloud team and build the foundational systems that drive NVIDIA’s high-performance GPU infrastructure.

Key Responsibilities

  • Design and develop APIs (GraphQL/REST) to orchestrate and integrate operational workflows.
  • Build state management and workflow automation systems that streamline infrastructure lifecycle processes.
  • Collaborate across teams to codify business processes into scalable, self-measuring systems.
  • Develop extensible, schema-driven platforms that reduce manual toil and ensure operational consistency.
  • Drive integrations with container orchestration tools such as Kubernetes and observability systems such as Prometheus, OpenTelemetry, and Grafana.
  • Optimize the reliability and efficiency of cloud operations through automated workflows and telemetry systems.
  • Lead and ship impactful technical projects, ensuring quality and scalability at every stage.

Requirements

  • 5+ years of industry experience with a Bachelor’s or Master’s degree (or equivalent experience), or 2+ years with a PhD.
  • Expertise in building GraphQL and REST APIs.
  • Proficiency in programming languages such as Go, Java, or Python.
  • Familiarity with modern JavaScript frameworks (e.g., React, Angular, Next.js).
  • Strong understanding of cloud infrastructure (AWS, GCP, Azure) and container technologies such as Docker and Kubernetes.
  • Experience with high-scale distributed systems, including architectural patterns for APIs and data pipelines.
  • Outstanding communication and collaboration skills, with a focus on solving complex operational challenges.
  • A passion for automating manual processes and driving system efficiency.

Preferred Qualifications

  • A track record of designing workflow orchestration systems for large-scale infrastructure.
  • Proven experience in reducing operational inefficiencies through automation and integration.
  • Strong debugging and problem-solving skills in distributed environments.

NVIDIA is committed to creating an environment where diverse perspectives drive innovation. As part of the DGX Cloud team, you’ll work on groundbreaking technology that powers the future of AI and cloud computing.

To apply for this job, please visit nvidia.wd5.myworkdayjobs.com.