Summary
Join Jamf as a Senior Site Reliability Engineer and contribute to the stability and reliability of our systems. You will leverage SRE best practices and automation to balance development velocity with customer needs. This role involves creating and leading projects focused on service measurement and observability, working with production systems to identify areas for automation. You will collaborate with various teams, including Cloud Operations, Engineering, and Technical Support, within an Agile framework. This remote position is based in Poland, with occasional on-site requirements. Jamf offers a flexible and supportive work environment.
Requirements
- Minimum of 5 years of experience in IT
- Experience identifying, tuning, and fixing issues with software
- Experience working with containerization and Kubernetes
- Experience utilizing system monitoring tools, such as Grafana & Prometheus
- Experience working within a form of the Agile development framework process
- Experience optimizing SQL queries and database engine tuning
- Experience using and troubleshooting AWS services, including: EC2, S3, CloudFront, EKS (Kubernetes), RDS (Aurora)
- Experience creating clear and concise technical documentation that is targeted at both technical and non-technical audiences
- Bachelor's degree or a combination of relevant experience and education may be considered
Responsibilities
- Identify improvements in both the platform and processes by implementing established SRE concepts with the goal of improving product and system reliability
- Proactively engage and collaborate with other individuals and teams as issues arise by serving as an escalation point of customer issues to ensure successful outcomes
- Perform root cause analysis for customer impacting issues and be able to clearly document the solution and advise others from the results of those findings
- Create technical documentation based upon new technology proof of concepts, project work, root cause issue analysis, identification of alerting patterns, and proactively sharing this knowledge with other teams as part of the Continuous Improvement Model
- Participate in team ceremonies to identify and refine potential work, communicate findings, and drive opportunities to collaborate
- Assign and communicate the business value and benefit hypothesis of new projects, initiatives, and strategies while being able to break down the technical work require to achieve a successful outcome
- Lead cross-team and cross-department technical collaboration in critical customer escalations
- Advise stakeholders and senior leadership on critical customer escalations
- Occasionally provide off hours support for deployments and customer escalations
Preferred Qualifications
- Experience solving production Java issues through use of native tools such as heap dumps, thread dumps and profilers
- Experience using CI/CD tools (e.g Jenkins, ArgoCD, GitHub Actions)
- Experience writing infrastructure as a code (Terraform)
- Experience defining and implementing Service Level Objectives
Benefits
Our volunteer time off allows employees to support and give back to our communities
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.