Website Nvidia
About Nvidia
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
Job Summary
NVIDIA is seeking a highly skilled and modern software engineer to develop and prototype brand new advancements in distributed training and inference using NVIDIA’s Spectrum-X AI fabric. This role offers a rare chance to pioneer AI and networking technology, contributing to ground-breaking projects that will define the landscape of large-scale AI systems.
Key Responsibilities
- Prototype end-to-end solutions to improve distributed training and disaggregated inference performance.
- Analyze and optimize communication flows across application, transport, and network layers.
- Develop system software spanning communication libraries, drivers, and firmware integrations.
- Collaborate with hardware, firmware, and SDK teams to co-design network features.
- Validate and integrate prototypes into NVIDIA’s AI infrastructure and products.
Requirements
- BSc/MSc/PhD in Computer Science or Electrical Engineering.
- 5+ years of relevant experience and/or knowledge.
- Deep understanding of networking and communication internals — NCCL, RDMA/RoCE, congestion control.
- Hands-on experience with HW/SW/FW integration and low-level programming (C/C++, kernel, drivers).
- Some background in distributed training systems (such as PyTorch DDP, Megatron-LM, DeepSpeed).
Preferred Qualifications
- Demonstrated innovation and leadership turning prototypes into impactful product features.
- Experience with programmable data planes (P4, eBPF, DOCA SDK, or switch SDKs).
- Familiarity with NIC firmware scheduling, in-network compute, or congestion management.
- Contributions to open-source projects, academic papers, or performance benchmarking tools.
- Strong background in AI factory architectures, distributed inference, or network telemetry.
NVIDIA is known as one of the most sought-after employers globally. You’ll be part of a high-impact team that develops technologies shaping the future of AI networking and distributed computing. If you’re enthusiastic about crafting the future — we look forward to hearing from you!
To apply for this job please visit nvidia.wd5.myworkdayjobs.com.