Infrastructure Engineer

Masabi

πŸ“Remote - Colombia

Summary

Join Masabi, a global leader in fare payment technology, as a Site Reliability Engineer. You will play a crucial role in ensuring the reliability, performance, and security of our platform. This fully remote position, based in Colombia, involves automating operations, improving processes, and maintaining security compliance. You will collaborate with various teams, manage incidents, and optimize cloud infrastructure costs. The ideal candidate possesses significant SRE experience, AWS expertise, and proficiency in tools like Terraform and Grafana. Masabi offers a competitive salary, benefits, and a collaborative work environment.

Requirements

  • Significant experience in SRE or related roles, with a proven track record in building and maintaining reliable systems
  • Expertise in AWS Cloud technologies
  • Hands-on experience with Terraform and Grafana, along with strong knowledge of security principles and networking components
  • Experience in building pipelines and robust CI/CD infrastructure
  • A collaborative team player who approaches projects with an open mind and prioritises security
  • Passionate about leveraging technology to drive advancements while ensuring reliability and security
  • Excellent communication skills, a collaborative mindset, and a willingness to learn and contribute to team success
  • Self-sufficient and capable of working independently, while also knowing when to seek support or input

Responsibilities

  • Drive automation to reduce operational overhead and human error. Build CI/CD pipelines, develop Infrastructure as Code (IaC) using tools like Terraform and CloudFormation, and design scalable systems to handle high traffic while optimising resource utilisation. Drive the effort to scale up new environments as we expand globally
  • Refine processes, tools, and workflows to enhance system reliability, scalability, and efficiency. Plan capacity to anticipate future needs and support high-performance systems
  • Ensure infrastructure meets organisational security standards and supports compliance frameworks like SOC 2 and PCI
  • Maintain real-time monitoring systems aligned with SLIs and SLOs, ensuring uptime and performance meet or exceed SLAs. Set up proactive alerting mechanisms to address issues before they escalate
  • Monitor and optimise cloud infrastructure costs through autoscaling, rightsizing, and architectural reviews to balance cost-effectiveness with reliability
  • Implement failover strategies, disaster recovery plans, and redundancy to ensure system resilience under all conditions
  • Respond to production incidents, minimise downtime, and restore availability. Perform root cause analysis, implement preventive measures, and contribute to post-incident reviews to share lessons learned
  • Partner with developers to design reliable, maintainable systems. Coach teams on best practices for reliability, scalability, and observability, fostering a culture of ownership
  • Maintain detailed documentation for infrastructure, incident response, and workflows. Develop playbooks and runbooks to ensure seamless knowledge transfer

Preferred Qualifications

  • Familiarity with PCI DSS v4 compliance requirements
  • AWS Cloud certification
  • Experience with container orchestration

Benefits

  • Competitive salary package
  • 15 days of paid vacation per year, plus 18 public holidays
  • Private Healthcare
  • Monthly team bonding allowance
  • Menopause support
  • Choice of a workstation
  • Ability to work for up to 3 months per year from any country in the world
  • Fun and collaborative environment with a focus on making a difference in the world
  • Training allowance of up to 750 USD and 250 USD to spend on your home office every year
