Senior Site Reliability Engineer

DocPlanner

πŸ“Remote - Spain

Summary

Join Docplanner as a Site Reliability Engineer (SRE) and play a key role in ensuring the reliability and performance of our platform. You will operate production environments, optimize system performance, improve software solutions, and provide operational support for large-scale applications. Responsibilities include ensuring system reliability, investigating and resolving incidents, defining SLOs/SLIs, improving performance, collaborating with developers, and automating tasks. The ideal candidate possesses experience with monitoring stacks (DataDog/OTEL/Prometheus), a detective mindset, .NET and AWS experience, and Kubernetes expertise. Docplanner offers a competitive salary, flexible remuneration and benefits (restaurant card, transportation card, kindergarten, training tax savings), share options, remote/hybrid work, flexible hours, generous paid time off, and comprehensive health benefits.

Requirements

  • Monitoring and observability - Experience with a monitoring stack such as DataDog, OTEL, or Prometheus
  • Detective mindset - Strong investigative mindset with a detective-like approach to troubleshooting and resolving complex issues
  • .NET experience - Familiarity with the .NET environment and the ability to code
  • AWS experience - Experience working with AWS services and cloud-native architectures
  • Kubernetes - Practical experience deploying, managing, and troubleshooting applications in Kubernetes; understanding of containers, Helm, and scaling strategies
  • Think like an owner - Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
  • Communicator - Equally fluent when talking to humans or machines; clear, effective communication across teams and tools

Responsibilities

  • Operate production environments by monitoring availability and taking a holistic view of system health
  • Measure and optimize system performance to stay ahead of customer needs and drive continuous innovation
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Provide primary operational support and engineering expertise for multiple large-scale, distributed software applications
  • Ensure reliability and availability of systems through monitoring, alerting, and incident response
  • Investigate and resolve incidents, perform root cause analysis, and implement long-term fixes
  • Define and maintain SLOs/SLIs to measure and drive service quality
  • Continuously improve performance and optimize infrastructure cost and resource usage
  • Collaborate with developers to build scalable, fault-tolerant systems and improve deployment practices
  • Automate operational tasks to reduce manual toil and improve efficiency

Preferred Qualifications

  • Proficiency in scripting or programming with languages such as Python or Go – to support automation and tooling development
  • Hands-on experience in Site Reliability Engineering practices – including incident management and service-level objectives
  • Understanding of microservices architecture – with experience in designing, observing, and troubleshooting distributed systems

Benefits

  • A salary that matches your experience and skills
  • Flexible remuneration and benefits system via Flexoh, which includes: restaurant card, transportation card, kindergarten, and training tax savings
  • Share options plan after 6 months of working with us
  • Remote or hybrid work model with our hub in Barcelona
  • Flexible working hours (fully flexible, as in most cases you only have to attend a couple of meetings a week)
  • Summer intensive schedule during July and August (work 7 hours, finish earlier)
  • 23 paid holidays, with exchangeable local bank holidays
  • Additional paid holiday on your birthday or work anniversary (you choose what you want to celebrate)
  • Private healthcare plan with Adeslas (medical and dental) for you, subsidized for your family
  • Access to hundreds of gyms for a symbolic fee for you and your family, in partnership with Wellhub
  • Access to iFeel, a technological platform for mental wellness offering online psychological support and counseling
