Remote DevOps Developer, Observability

Belmont Lavan

πŸ“Remote - Sweden

Job highlights

Summary

Join our team as a DevOps Developer, Observability, and help design and implement observability solutions for microservices running on AWS. As a key member of the team, you will set up and optimize monitoring tools, implement distributed tracing, establish alerting mechanisms, create dashboards and reports, investigate and troubleshoot issues, collaborate with other teams, automate manual processes, and embed observability best practices into CI/CD pipelines and infrastructure as code (IaC).

Requirements

  • Required skills: Observability / ArgoCD / Grafana / GitLab / Prometheus / Harbor
  • TypeScript / Log Management / Cloud Infrastructure / Jira / Git / Dependency Tracker
  • Slack / Jaeger / OTEL / Python / Incident Management / JavaScript / Terraform / Ansible
  • PagerDuty / Apica / CloudWatch / OpenTelemetry / Splunk

Responsibilities

  • Design and Implement Observability Solutions: Build and maintain observability tools (monitoring, logging, tracing) to ensure the health and performance of microservices running on AWS
  • Monitoring & Logging: Set up and optimize monitoring using tools like Prometheus, Grafana, CloudWatch, OTEL and Splunk stacks for real-time insights into the AWS infrastructure
  • Distributed Tracing: Implement distributed tracing solutions (e.g., OpenTelemetry, Jaeger) to trace and debug service interactions across multiple microservices
  • Proactive Alerting: Establish alerting mechanisms to detect performance anomalies and potential failures in real time
  • Dashboards & Reporting: Create dashboards and reports to monitor service-level objectives (SLOs), key performance indicators (KPIs), and overall system health
  • Incident Management: Investigate and troubleshoot issues, identify root causes, and provide insights to reduce mean time to detection (MTTD) and mean time to resolution (MTTR)
  • Collaboration with Teams: Work with DevOps and development teams to ensure observability best practices are embedded into CI/CD pipelines and infrastructure as code (IaC)
  • Automation & Optimization: Automate manual monitoring and incident management processes to reduce operational overhead
