Senior Site Reliability Engineer

CloudLinux Logo

CloudLinux

πŸ“Remote - Cyprus

Summary

Join CloudLinux's Release Engineering Department as a Senior Site Reliability Engineer (SRE) and play a critical role in maintaining internal and external infrastructure related to package repositories. This remote position, ideal for professionals in Europe and CIS, offers collaboration with multiple development teams, focusing on delivering and managing repository distribution. You will design, implement, and manage scalable repository infrastructure, automate software operations, monitor system performance, and automate deployment processes. The role requires strong Linux experience, development background, and proficiency in agile SDLC practices and CI/CD tooling. CloudLinux offers professional development opportunities, flexible remote work, paid time off, medical insurance, and other benefits.

Requirements

  • Strong background in development: an ideal candidate had started a career as a developer, then rolled to infrastructure-based projects on a large scale
  • Proven experience as a leading SRE or in a similar role, with a strong focus on Linux environments
  • Proficiency in modern agile SDLC practices and principles, orchestration, and CI/CD tooling i.e. Python, Java, Terraform, Ansible, Cloudformation, Puppet, Chef, or similar
  • Knowledge of the Grafana ecosystem or similar, building dashboards, alert rules, PromQL, as well as frontend observability
  • Excellent technical knowledge of IT Infrastructure, including network and application load balancers, switches, routers, and IP addressing
  • Strong analytical and problem-solving skills with a focus on root cause analysis and mitigation
  • Excellent communication and teamwork skills with the ability to collaborate effectively across engineering teams
  • English: at least Intermediate level required

Responsibilities

  • Design, implement, and manage scalable, resilient, and secure wide company repository infrastructure for CloudLinux products as a first assignment
  • Automate software operations for re-usability and consistency across private and public clouds, taking into consideration the complexities of distributed systems
  • Monitor system performance and troubleshoot issues proactively to ensure optimal uptime and reliability
  • Automate deployment processes using Infrastructure as Code (IaC) principles
  • Share your experience, know-how, and best practices with other team members in design sessions, system architecture discussions, mentorship, and "doing work together"

Benefits

  • A focus on professional development
  • Interesting and challenging projects
  • Fully remote work with flexible working hours, that allows you to schedule your day and work from any location worldwide
  • Paid 24 days of vacation per year, 10 days of national holidays, and unlimited sick leaves
  • Compensation for private medical insurance
  • Co-working and gym/sports reimbursement
  • Budget for education
  • The opportunity to receive a reward for the most innovative idea that the company can patent

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Remote Jobs