Site Reliability Engineer

Commercetools

📍 Remote - Spain

Summary

Join our Platform Engineering team as a Site Reliability Engineer and help build the next generation of cloud infrastructure and developer platforms! You will develop and implement infrastructure automation using Terraform and Crossplane, manage Kubernetes clusters across multiple cloud providers, and build self-service platforms and workflows for product teams. You will ensure reliability by participating in on-call rotations and collaborate with product teams to understand their needs. You will also support continuous improvement by developing scalable automation tools and promoting knowledge sharing within the team. This fully remote role, open to candidates based in Spain, suits an engineer with cloud and automation experience who is passionate about platform engineering and innovative technologies.

Requirements

  • Solid hands-on experience with at least one major cloud provider (AWS or GCP), and familiarity with another
  • Demonstrated experience with Infrastructure as Code, particularly Terraform; familiarity with Crossplane is a plus
  • Proven experience managing Kubernetes clusters, including workload configuration, optimization, and troubleshooting
  • Understanding of GitOps practices, CI/CD pipelines, and experience with automation tools like Spacelift
  • Strong automation and scripting capabilities (e.g., Python, Bash, Go)
  • Experience with monitoring and observability tools such as Prometheus and Grafana
  • Excellent problem-solving abilities, including expertise in root cause analysis
  • Clear written and verbal communication skills in English
  • Enthusiasm for working in diverse, distributed international teams with a commitment to continuous improvement

Responsibilities

  • Develop and Implement Automation: Develop infrastructure automation using Terraform and Crossplane
  • Manage Kubernetes Clusters: Optimize Kubernetes environments across multiple cloud providers
  • Contribute to Self-Service Tooling: Help build and refine self-service platforms and workflows for our product teams, utilizing Spacelift and GitOps practices
  • Ensure Reliability: Participate in on-call rotations for infrastructure and platform services as part of our commitment to operational excellence
  • Collaborate with Teams: Work closely with product teams to understand their needs and develop platform solutions that enhance productivity
  • Support Continuous Improvement: Help develop scalable automation tools, work with the team to address infrastructure drift, and implement established security best practices
  • Promote Knowledge Sharing: Engage in pair programming, provide constructive code reviews, and foster knowledge sharing within the team and organization

Preferred Qualifications

  • Experience with Crossplane or similar Kubernetes-based infrastructure tools
  • Background in platform engineering, particularly with self-service workflows
  • Comfortable developing automation tooling in Go
  • Experience working with multi-cloud environments
  • Familiarity with security tools and standards like HashiCorp Vault and CIS Benchmarks
  • Knowledge of building and maintaining CI/CD pipelines

Benefits

  • Competitive Compensation Package: Generous compensation structure consisting of salary, a competitive stock option package, and various benefits and perks
  • Workation: Work up to 60 days per year in a country different from your home country, with up to 20 working days per trip
  • Learning & Development Budget
  • Academy: Regular training sessions, access to Coursera and Babbel training courses
  • Flexibility: Morning person or night owl? We believe in outcomes and motivated employees
