Remote Staff Site Reliability Engineer
Illumio
πRemote - Australia
Please let Illumio know you found this job on JobsCollider. Thanks! π
Job highlights
Summary
Join Illumio's Cloud Operations team as a Cloud Platform SRE Engineer to design and deploy scalable, reliable, and secure cloud infrastructure based on Kubernetes. As an essential member of the Operations team, you will collaborate with Platform and Data engineers to deliver the latest Illumio products.
Requirements
- Bachelor's degree in Computer Science, Engineering, or related field; or equivalent work experience
- 6+ years of relevant SRE, DevOps, Platform or Infrastructure Engineering experience
- 4+ years in production support role in a fast-paced industry/organization
- Experience deploying, tuning, and maintaining Linux-based, highly available, fault-tolerant web platforms in public cloud providers such as AWS, Azure, and GCP
- Common monitoring, log aggregation, and metrics gathering platforms experience (Icinga, Sensu, Splunk, Telegraf/InfluxDB, et. al.)
- Configuration management & orchestration tools experience like Chef, Ansible, and AWS Services & APIs, or equivalent
- Experience scripting/coding with Python, Java, Ruby and/or Go
- Experience with MySQL, PostgreSQL, Redis, or similar
- Solid knowledge of Linux operating system, Ubuntu, RHEL, OEL7 is required
- EKS and/or AKS frameworks
- Knowledge/Experience of Incident Management/on-call: PagerDuty
- Knowledge of Database Technologies, Release Management, REST, SRE, etc
- Load balancers/ Traffic manager knowledge
- Experience working with Kubernetes, Docker, or other virtualization & containerization technologies
- Networking basics and trouble shooting skills
- Good understanding of Production deployment, Distributed Environments required
- Strong problem solving and operational process skills, attention to detail
- Application support and debugging experience in a dynamic fast-paced production environment
- Experience with SDLC principles, architecture and operations
- Experience working with senior leadership both inside and outside of engineering
- Ability to manage multiple tasks and competing priorities to deliver projects on schedule
Responsibilities
- Driving reliability improvements back into applications
- Building code to resolve reliability/resiliency issues
- Mentor and educate team members to aid in strengthening technical expertise
- Collaborate closely with cloud architects to drive cloud solutions
- Curating proper SLI/SLOs to accurately measure or assess error budgets
- Embed with the development teams to assist with cloud methodologies when developing products to ensure that the deliverable is as reliable as possible
- Work with development teams to build and strengthen application security and compliance
- Manage high impact situations that involve technically challenging issues across diverse audiences and drive to find the root cause, mitigate, and identify a solution
- Focus on observability
Benefits
- Medical, Dental, Vision Coverage β Health and Dependent Savings Accounts
- Life and Disability Programs
- Paid Parental Leave
- Voluntary Benefit Programs
- Company Sponsored Wellness Program
- Wellness Reimbursement Program
- Retirement Savings
- Equity Opportunities
- Paid time off and Paid Holidays
- Employee Incentive Program
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
- π°$198k-$270kπUnited States
- π°$172k-$215kπUnited States
- π°$148k-$204kπUnited States
- πEurope
- πUnited States
- πBrazil
- πWorldwide
- π°$165k-$210kπUnited States
- πCosta Rica
Please let Illumio know you found this job on JobsCollider. Thanks! π