πMexico
Mid Site Reliability Engineer

Zipdev
πRemote - Brazil
Please let Zipdev know you found this job on JobsCollider. Thanks! π
Summary
Join Zipdev's team of Latin American developers as a remote Site Reliability Engineer! You will be an integral part of product teams, building, deploying, and monitoring cloud services. This role involves developing code, building monitoring frameworks, and troubleshooting critical issues in production systems. You will collaborate with various teams and engage in capacity planning and standardization efforts. The ideal candidate possesses a Bachelor's degree in Computer Science or equivalent experience, along with 3-4 years of SRE experience and strong programming skills. Zipdev offers a remote work environment with various benefits, including paid time off, parental leave, and several reimbursements.
Requirements
- Bachelorβs degree in Computer Science or equivalent work experience as System Administrator with programming skills
- 3 -4 years of proven professional experience as a Site Reliability Engineer
- Experience with one or more general-purpose programming/scripting languages including but not limited to: Python, Bash, Perl or Go
- Fundamental knowledge of technologies across a broad range of disciplines: virtualization storage, networking, server, and security
- Understanding of systems and application design, including the operational trade-offs of various designs
- Demonstrable knowledge of Unix, TCP/IP, HTTP, web application security, and experience supporting multi-tier web application architectures
- Experience in analyzing logs and troubleshooting large-scale distributed systems
- Excellent organization, time management, and communication skills
- Currently living in Latin America
Responsibilities
- Build systems and infrastructure to monitor complex, large-scale distributed systems
- Identify stability/performance issues and collaborate with developers to triage critical issues in production systems
- Represent the SRE organization in design reviews and operational readiness exercises for new and existing services
- Devise ways to actively monitor system throughput, capacity and reliability
- Ability to debug complex systems and evolve a running environment without downtime
- Engage in service capacity planning and demand forecasting, software performance analysis and system tuning
- Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization
Preferred Qualifications
- Experience with instrumenting and monitoring production systems (ELK stack, Zabbix, Nagios, Statsd/Graphite, APM, etc.)
- Experience with Amazon AWS Infrastructure (EC2, S3, VPC, Security Groups, RDS) and related services desired
- A working understanding of Docker, Vagrant, Ansible/Chef/Puppet
Benefits
- Work remotely Monday - Friday, 40 hours a week (no weekends)
- Vacation: 10 business days a year
- Holidays: 5 National Holidays a year
- Company Holidays: 5 Company Holidays a year (Christmas Eve, Christmas Day, New Year's Eve, New Year's Day, Zipdev Day)
- Parental Leave
- Health Care Reimbursement
- Active Lifestyle Reimbursement
- Quarterly Home Office Reimbursement
- Payroll Deduction Purchase Plans
- Longevity Bonus
- Continuous Learning Bonus
- Access to Training and Professional Development Platforms
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
πMexico
πMexico
πFrance
πFrance
π°$183k-$304k
πUnited States
πWorldwide
πFrance
πIndia