Senior Infrastructure Engineer
Astronomer
Remote - India
Please let Astronomer know you found this job on JobsCollider. Thanks!
Job highlights
Summary
Join Astronomer's R&D team as a Software Engineer and contribute to the development and maintenance of our industry-leading data orchestration platform, Astro. You will enhance the scalability, performance, and reliability of our flagship Enterprise product. Collaborate with cross-functional teams to drive continuous improvement, implement robust security measures, and optimize system performance. Leverage your expertise in Kubernetes, cloud platforms, and automation to streamline our infrastructure and support seamless on-premises installations. This role requires significant experience in operating Kubernetes clusters and managing distributed systems. Astronomer is a remote-first company.
Requirements
- 5 years of hands-on experience operating Kubernetes clusters in a production environment
- Experience in managing and scaling distributed systems in one of the three major cloud providers (AWS, Azure, GCP)
- Strong experience with at least one Continuous Integration system, such as CircleCI or Jenkins
- Understanding of the Linux Operating System, standard networking protocols, and components
- Experience with deploying, supporting, and monitoring new and existing services, platforms, and application stacks
- Automation/Scripting experience with Shell, Python, or similar
- Familiarity with Infrastructure as Code (IaC) tools (Terraform, CloudFormation, etc.)
- Strong troubleshooting and problem-solving skills
Responsibilities
- Serve as a primary point of contact responsible for the overall health, performance, and capacity of our platform
- Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and growth
- Develop tools to improve our ability to rapidly deploy and effectively monitor applications in a large-scale environment
- Work closely with development teams to ensure the platform is designed with operability in mind
- Identify and lead efforts to improve automation
- Perform root cause analysis and document results in the form of post-mortems
- Write and maintain documentation around key systems and processes
- Participate in an on-call rotation with some of our customers
- Function well in a fast-paced, rapidly changing environment
Preferred Qualifications
- Experience with scale testing, disaster recovery, and capacity planning
- Experience with at least one of the following languages: Node.js, Go
- Familiarity with Apache Airflow
- Experience with OpenShift and the Red Hat Marketplace
- Experience with the Prometheus/Grafana and ELK stacks
Benefits
Astronomer is a remote-first company