DevOps Engineer

California Institute of Technology
vision insurance, parental leave, paid time off, sick time, tuition reimbursement
United States, California, Pasadena
Apr 25, 2025

Caltech is a world-renowned science and engineering institute that marshals some of the world's brightest minds and most innovative tools to address fundamental scientific questions. We thrive on finding and cultivating talented people who are passionate about what they do. Join us and be a part of the diverse Caltech community.

Job Summary

We are seeking a DevOps Engineer with experience managing large-scale, on-premises infrastructure deployments to work on the DSA-2000 project, a world-leading radio telescope that will commence construction in 2026. The array will use 2,000 5-meter dishes to observe at radio wavelengths (0.7 - 2 GHz frequency range) and survey the skies 10x faster than any existing or planned radio telescope. The telescope will conduct a broad range of research, including the study of the formation of galaxies, the search for cosmic explosions, and investigations into the nature of gravity.

The successful applicant will serve as the DevOps Engineer for the DSA-2000 project. This role will support the Software Group as we progress through the final design stage to construction.

The role can be based at the Caltech campus or at the Owens Valley Radio Observatory in Big Pine, CA, or it can be hybrid/remote.

Please contact Dr. Giangi Sacco (gsacco@caltech.edu) with any questions regarding this position.

Application review will begin on May 5, 2025.

Essential Job Duties

  • On-Premises Infrastructure Management: Design, deploy, and manage the on-premises infrastructure for the DSA-2000's processing and monitoring pipelines. This will involve provisioning and configuring thousands of servers with high-performance computing capabilities (including approximately 4,000 Nvidia RTX 4000 Ada GPUs and over 30,000 CPU cores) hosted at the project data center in Nevada, where the telescope is located.
  • Automation: Develop and implement automated processes for server provisioning, configuration management, and testing. This will utilize tools like Ansible, Puppet, Chef, or similar configuration management solutions.
  • Containerization and Orchestration: Implement containerization strategies using Docker and orchestrate container deployments using Kubernetes or similar container orchestration platforms.
  • CI/CD Pipelines: Design and implement robust CI/CD pipelines for the DSA-2000 software to ensure continuous integration and delivery.
  • Monitoring and Logging: Establish comprehensive monitoring and logging systems to track infrastructure health and application performance.
  • Security and Compliance: Ensure compliance with relevant security standards and regulations.
  • Collaboration: Work closely with the Software Development team to understand their needs and provide ongoing support.

Basic Qualifications

  • Bachelor's degree in Computer Science or related field, or equivalent experience.
  • 7+ years of experience as a DevOps Engineer or similar role.
  • Experience in managing large-scale, on-premises infrastructure deployments.
  • In-depth knowledge of Linux server administration.
  • Proficiency with infrastructure as code (IaC) tools like Terraform, Ansible, Puppet, or Chef.
  • Strong understanding of containerization technologies like Docker and container orchestration platforms like Kubernetes.
  • Experience with CI/CD pipelines and tools like Jenkins, GitLab CI/CD, or similar solutions.
  • Excellent scripting skills (Bash, Python).
  • Excellent communication and collaboration skills.
  • Experience provisioning bare-metal servers with tools like MAAS or xCAT.

Preferred Qualifications

  • Experience with monitoring and logging tools like Prometheus, Grafana, ELK Stack, or similar solutions.
  • Experience working with high-performance computing (HPC) systems.
  • Experience with time-series databases like Prometheus, InfluxDB, or similar solutions.

Required Documents

  • Resume
  • CV
  • Cover letter

Hiring Range

$132,000 - $159,000 Per Year

The salary of the finalist(s) selected for this role will be set based on a variety of factors, including but not limited to, internal equity, experience, education, specialty and training.

As one of the largest employers in Pasadena, CA, Caltech is committed to providing comprehensive benefits to eligible employees and their eligible dependents. Our benefits package includes competitive compensation, health, dental, and vision insurance, retirement savings plans, generous paid time off (vacation, holidays, sick time, parental leave, bereavement, etc.), tuition reimbursement, and more. Non-benefit eligible employees will have access to some benefits such as onsite counseling and sick time. Learn more about our benefits and staff perks.


EEO Statement

We are an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law.

Caltech is a VEVRAA Federal Contractor.

To read more about Equal Employment Opportunity (EEO), go to eeoc_self_print_poster.pdf.

Disability Accommodations

If you would like to request an accommodation in completing this application, interviewing, or otherwise participating in the employee selection process, please direct your inquiries to Caltech Recruiting at employment@caltech.edu.

