Description
As part of a growing team of experts, you will provide the data vital to design, build and launch rockets faster, cheaper and with continually higher quality. We accomplish this by building state-of-the art software, and analyzing data to uncover patterns for quick decision making. We design systems that track millions of physical parts and complex manufacturing activities in remote locations. We build systems that process massive amounts of data and engineering tools that enable rapid design and iteration. We are seeking team members of all backgrounds who are passionate about space and who have a strong desire to serve on a team that is the backbone of the company. This position will directly impact the history of space exploration and will require your dedicated commitment and detailed attention towards safe and repeatable spaceflight.
As a Site Reliability Engineer you will work on rewarding problems and interesting technologies. You will bring a software engineering approach to ensuring our systems are operational and scalable. You will implement the infrastructure that allows for rapid development and iteration of software throughout the company, including distributed systems and embedded software on-board our rockets and space vehicles. You will make decisions and implement systems that affect the capabilities of thousands of rocket scientists and engineers at Blue Origin and beyond. At Blue Origin we rely heavily on Kubernetes for rapid software development and deployment. A candidate for this position will need to have a deep understanding and experience supporting containerized workloads and services.
We are looking for someone to apply their technical expertise, leadership skills, and commitment to quality to positively impact safe human spaceflight. Passion for our mission and vision is required!
Our tech stack at glance:
- Amazon Web Services
- Kubernetes and Docker
- Datadog
- Gitlab
- Linux
- Ansible
- Java, JavaScript, and Python
What makes our SRE’s successful?
- Technical breadth and depth with a strong understanding of emerging trends and technologies
- A strong bias for automating everything and reducing toil
- Humility and the enthusiasm to learn unfamiliar domains
- A strong “customer first” personality and desire to be a domain expert
- High Judgement with technical design decisions
Responsibilities:
- Engage in and improve the whole lifecycle of software in the Cloud – from inception and design, through deployment, operation, and refinement
- Support services before they go to production through activities such as system design, consulting, developing software platforms and frameworks, scaling and launch reviews.
- Maintain software once it is live by measuring and monitoring availability, latency and overall system health.
- Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
- Practice sustainable incident response and blameless postmortems.
- Configure, deploy, scale, and administer open source and commercial software
Qualifications:
- Understanding of and experience with modern software development practices
- Experience in analyzing and troubleshooting distributed systems
- Familiarity with “infrastructure as code” and technologies used to achieve this
- Knowledge of software defined networking (VPC, Subnets, Firewalls, VPNs, etc.)
- Experience in one or more of the following: Java, Python, JavaScript, C, or C++.
- Ability to earn trust, maintain positive and professional relationships, and contribute to a culture of inclusion
- Must be a U.S. citizen or national, U.S. permanent resident (current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum.
Desired:
- Terraform and AWS experience
- Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
- API design experience
Please mention the word **UNCOMPLICATED** and tag RMzQuMTY4LjE0NS4yMjY= when applying to show you read the job post completely (#RMzQuMTY4LjE0NS4yMjY=). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.