Senior Site Reliability Engineer
Tillster
Summary
Join Tillster as a Sr. Site Reliability Engineer (SRE) and be responsible for the availability, performance, monitoring, and incident response of our platforms and services. You will analyze and troubleshoot large-scale distributed systems, scale systems sustainably through automation, and improve monitoring and logging solutions. This remote position requires experience in software engineering, IT operations, and cloud infrastructure. You will need proficiency in programming languages, configuration management, and monitoring tools. Tillster offers competitive compensation, benefits including health insurance, paid time off, and professional development opportunities. The position is based in Portugal.
Requirements
- Ability to program in one or more high-level languages (e.g., TypeScript, Python)
- Configuration management and infrastructure as code (e.g., CloudFormation, Ansible)
- Monitoring and alerting tools (e.g., AWS CloudWatch, New Relic)
- Incident management and on-call tools (e.g., PagerDuty)
- Bachelor's degree from a four-year college or university, or three to four years related experience and/or training; or equivalent combination of education and experience
Responsibilities
- Analyze and troubleshoot large-scale distributed systems in the public cloud
- Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity
- Improve and maintain monitoring and logging solutions that measure availability, latency and overall system health of production systems
- Provision and manage cloud infrastructure through automation and infrastructure as code
- Restore healthy operation of applications and services through sustainable incident response and blameless postmortems
- Follow and monitor security and compliance best practices
- Take a proactive approach to spotting problems, areas for improvement, and performance bottlenecks
- Gather and analyze metrics to assist in performance tuning and fault finding
Preferred Qualifications
- 3+ years of software engineering and/or IT operations and infrastructure experience
Benefits
- Compensation competitive with the market and geographic location
- Meal allowance for each day worked available through meal card
- Home/Office allowance reimbursement per calendar month, pro-rated based on employment start date
- Health insurance: Tillster pays the premium for employee private health insurance. Employees have the option to add their spouse/dependents at the employee's cost
- Holidays: Up to 14 federal and local/municipal holidays in accordance with applicable Portuguese Labour laws, dependent on your employment start date
- Vacation: Up to 22 days of vacation every holiday year, pro-rated based on employment start date
- Education, Learning & Development: We offer Udemy Learning courses and ongoing learning and development opportunities