πTaiwan
Site Reliability Engineer
closed
DaCodes
πRemote - Mexico
Summary
Join DaCodes, a leading software and digital transformation firm, as a Site Reliability Engineer (SRE)! You will leverage cutting-edge technologies to solve operational and development challenges, ensuring system reliability and optimized performance. This role involves automating infrastructure management, developing CI/CD pipelines, and collaborating with diverse teams. You'll work with global brands and disruptive startups, contributing your expertise to impactful projects. DaCodes offers a remote work option and various benefits, including health insurance, life insurance, professional development opportunities, and a flexible work schedule.
Requirements
- 5+ years of experience in Site Reliability Engineering or similar roles
- Proficiency in cloud computing platforms like AWS, with advanced expertise in network infrastructure (load balancers, subnets, gateways, NAT, etc.)
- Strong experience with container orchestration tools like Kubernetes, ECS, and Docker
- Advanced skills with CI/CD tools (Jenkins, ArgoCD, Terraform, CloudFormation)
- Experience with monitoring tools such as Prometheus, Grafana, and Elasticsearch
- Proficient in scripting and development languages (Go, Python, Ruby, Bash)
- Experience with system and application debugging, and ensuring high availability
- Strong problem-solving and troubleshooting abilities in cloud and on-prem environments
- In-depth understanding of networking (IPv4, IPv6, BGP, etc.)
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)
- Excellent communication and interpersonal skills to collaborate effectively with teams
Responsibilities
- Automate infrastructure management using tools such as Terraform, Ansible, and CloudFormation
- Develop and manage CI/CD pipelines using tools like Jenkins
- Architect and maintain scalable systems in data centers and cloud environments
- Manage containerized environments, with hands-on experience in Kubernetes and ECS
- Automate routine tasks, optimize deployments, and ensure reliability of production systems
- Collaborate with cross-functional teams to improve performance, reliability, and scalability
- Analyze and debug issues, ensuring timely resolutions and minimal downtime
- Monitor applications, systems, and databases using tools like Prometheus, Grafana, and Elasticsearch
- Troubleshoot network issues and automate network configurations with pipeline tools
- Participate in technical discussions, bringing real-world solutions and contributing to architectural decisions
Preferred Qualifications
- AWS Certified Solutions Architect or SysOps Administrator
- Familiarity with Agile software development methodologies, such as Scrum or Kanban
- Experience with application monitoring and alerting systems
- Familiarity with Machine Learning applications for infrastructure optimization
Benefits
- Remote work / Home office
- Work schedule aligned with the assigned project/team
- Monday to Friday schedule
- Legal benefits (Applicable for Mexico)
- Day off on your birthday
- Private health insurance (Applicable for Mexico)
- Life insurance (Applicable for Mexico)
- Multicultural teams
- Access to courses and certifications
- Meetups with industry experts and top universities
- Virtual networking events and interest groups
- English classes
- Opportunities within our different business lines
- Proudly certified as a Great Place to Work
This job is filled or no longer available
Similar Remote Jobs
πChina
πSingapore
πWorldwide
πJapan
π°$60k-$120k
πAsia
πIndia
π°$140k-$190k
πUnited States
πIndia
π°$140k-$175k
πUnited States