Sr. HPC Systems Engineer

About SpaceX

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

Job Summary

SpaceX is looking for an HPC Systems Engineer with strong knowledge and experience in a world-class engineering organization. This employee will be a member of the HPC team and will support SpaceX personnel and proprietary systems. The ideal candidate will be flexible and flourish in a fast-paced and challenging environment. They should be a self-starter and self-motivator, and possess the ingenuity to excel in this position.

Key Responsibilities

  • Administer and manage HPC clusters, storage systems, and high-speed networks.
  • Provide application support to SpaceX employees across engineering disciplines.
  • Install and integrate Linux-based compute clusters.
  • Write instructional documentation and convey highly technical ideas in non-technical terms.

Required Qualifications

  • 5+ years of hands-on experience with client and server hardware/software, management tools, enterprise networking, virtualization, and security technologies.
  • Bachelor’s degree in computer science, engineering, math, or a scientific discipline and 5+ years of systems engineering experience; OR 7+ years of professional experience building software in lieu of a degree.
  • Experience with Linux.

Preferred Qualifications

  • 5+ years of professional experience building, deploying, and troubleshooting Linux systems.
  • Experience with a scripting language (Bash, Python) to automate and solve recurring tasks.
  • Experience building, deploying, and troubleshooting HPC clusters.
  • Familiarity with cluster resource managers (Slurm, PBS, LSF).
  • Experience with monitoring and alerting technologies (Prometheus, Grafana, Nagios).
  • Familiarity with scientific and engineering computing (CFD, FEA).
  • Familiarity with ML frameworks (PyTorch, TensorFlow).
  • Familiarity with GPU usage in a compute cluster and CUDA.
  • Experience with containers (Docker, Podman, Singularity).
  • Experience deploying and maintaining automated configuration management software (Puppet, Ansible).
  • Comfortable working with mission-critical and sensitive systems, with a sense of urgency appropriate to the responsibilities.
  • Eligibility for access to classified material up to TS/SCI with Polygraph.

Additional Requirements

  • Must be willing to work extended hours and weekends as needed.

Compensation and Benefits

Pay Range: Sr. HPC Systems Engineer: $160,000.00-$220,000.00 per year. Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience. Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or lo.

To apply for this job, please visit job-boards.greenhouse.io.