Site Reliability Engineer

closed
DaCodes Logo

DaCodes

πŸ“Remote - Mexico

Summary

Join DaCodes, a leading software and digital transformation firm, as a Site Reliability Engineer (SRE)! You will leverage cutting-edge technologies to solve operational and development challenges, ensuring system reliability and optimized performance. This role involves automating infrastructure management, developing CI/CD pipelines, and collaborating with diverse teams. You'll work with global brands and disruptive startups, contributing your expertise to impactful projects. DaCodes offers a remote work option and various benefits, including health insurance, life insurance, professional development opportunities, and a flexible work schedule.

Requirements

  • 5+ years of experience in Site Reliability Engineering or similar roles
  • Proficiency in cloud computing platforms like AWS, with advanced expertise in network infrastructure (load balancers, subnets, gateways, NAT, etc.)
  • Strong experience with container orchestration tools like Kubernetes, ECS, and Docker
  • Advanced skills with CI/CD tools (Jenkins, ArgoCD, Terraform, CloudFormation)
  • Experience with monitoring tools such as Prometheus, Grafana, and Elasticsearch
  • Proficient in scripting and development languages (Go, Python, Ruby, Bash)
  • Experience with system and application debugging, and ensuring high availability
  • Strong problem-solving and troubleshooting abilities in cloud and on-prem environments
  • In-depth understanding of networking (IPv4, IPv6, BGP, etc.)
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)
  • Excellent communication and interpersonal skills to collaborate effectively with teams

Responsibilities

  • Automate infrastructure management using tools such as Terraform, Ansible, and CloudFormation
  • Develop and manage CI/CD pipelines using tools like Jenkins
  • Architect and maintain scalable systems in data centers and cloud environments
  • Manage containerized environments, with hands-on experience in Kubernetes and ECS
  • Automate routine tasks, optimize deployments, and ensure reliability of production systems
  • Collaborate with cross-functional teams to improve performance, reliability, and scalability
  • Analyze and debug issues, ensuring timely resolutions and minimal downtime
  • Monitor applications, systems, and databases using tools like Prometheus, Grafana, and Elasticsearch
  • Troubleshoot network issues and automate network configurations with pipeline tools
  • Participate in technical discussions, bringing real-world solutions and contributing to architectural decisions

Preferred Qualifications

  • AWS Certified Solutions Architect or SysOps Administrator
  • Familiarity with Agile software development methodologies, such as Scrum or Kanban
  • Experience with application monitoring and alerting systems
  • Familiarity with Machine Learning applications for infrastructure optimization

Benefits

  • Remote work / Home office
  • Work schedule aligned with the assigned project/team
  • Monday to Friday schedule
  • Legal benefits (Applicable for Mexico)
  • Day off on your birthday
  • Private health insurance (Applicable for Mexico)
  • Life insurance (Applicable for Mexico)
  • Multicultural teams
  • Access to courses and certifications
  • Meetups with industry experts and top universities
  • Virtual networking events and interest groups
  • English classes
  • Opportunities within our different business lines
  • Proudly certified as a Great Place to Work
This job is filled or no longer available