SRE Manager

Altium Logo

Altium

πŸ“Remote - Poland

Summary

Join Altium as a Site Reliability Manager and ensure the reliability and performance of Altium Cloud Platforms. You will pioneer improvements in observability, lead incident response, plan infrastructure upgrades, and manage a high-performing SRE team. This fully remote role, based in Poland, requires 5+ years of experience in leading/management roles and software development, along with a strong understanding of SDLC and microservice architecture. Altium offers various benefits, including private medical insurance, group life insurance, contributions to a MyBenefit account, and professional development support.

Requirements

  • 5+ years experience in Leading/Management roles
  • 5+ years of experience in Software Development
  • Strong understanding of SDLC, microservice architecture
  • Observability - NewRelic, Elastic, Grafana, PagerDuty, OTEL
  • Knowledge Kubernetes clusters in production setting, AWS, IaaC
  • Knowledge of CI-CD tooling Jenkins, Gitlab, GitHub, ArgoCD or similar

Responsibilities

  • Understand how an Altium Cloud Platform works
  • Pioneer improvements in observability, including logging, monitoring, and application performance management (APM), ensuring system reliability and proactive issue detection
  • Lead incident response and management, ensuring rapid resolution, clear stakeholder communication, and post-incident analysis for continuous improvement
  • Plan and overview infrastructure upgrades, patching, and maintenance activities while consistently managing and meeting agreed SLA targets
  • Recruit, mentor, and develop a high-performing SRE team, fostering professional growth and a collaborative culture
  • Participate in system design consulting, platform management, and capacity planning
  • Improve reliability, quality, and time-to-market of our software solutions, including software development
  • Partner closely with engineering and development teams to enhance product stability, observability, and manageability through best practices in reliability engineering
  • Partner closely with DevOps/Operations, drive automation initiatives, promote Infrastructure as Code (IaC), and streamline deployment processes to improve operational efficiency and scalability

Preferred Qualifications

5+ years relational databases (mysql, postgres)

Benefits

  • Private medical insurance
  • Group life insurance
  • Contributions to your Kafeteria MyBenefit account
  • Nilo.health, mental health and wellbeing support
  • Professional development support
  • Employee referral and employee-of-the-month programs
  • Home internet allowance
  • Flexible working arrangements available based on role and location
  • Free lunch on Tuesdays, snacks, and drinks in the office
  • Free parking

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Remote Jobs