Senior Engineer - Observability Platform

Logo of IT Scout

IT Scout

πŸ“Remote - Mexico

Job highlights

Summary

Join Indeed Inc. as a Senior Engineer in Guadalajara, Mexico, working remotely on a 6-month contract (with potential extension) focusing on the DataDog observability platform. You will be primarily responsible for monitoring and troubleshooting issues within the DataDog environment, integrating it with cloud environments, and ensuring platform scalability and observability. The role requires proficiency in monitoring tools (DataDog, Prometheus, Grafana), CI/CD pipelines, and cloud platforms (AWS). Experience with containerized environments (Kubernetes) and scripting (Python) is essential. The ideal candidate will possess strong operational skills, including installation, configuration, and user management within DataDog. The hourly rate is $23.

Requirements

  • Proficiency with Data Dog, Prometheus, and Grafana for tracking system performance and health
  • Knowledge of integrating Data Dog with CI/CD pipelines
  • Hands-on experience with Data Dog's monitoring and logging
  • Experience with AWS and integrating cloud services with Data Dog
  • Expertise in monitoring containerized environments using Kubernetes, integrated with Data Dog
  • Ability to automate monitoring tasks and configure Data Dog via scripts using Python
  • Experience installing Data Dog agents, configuring integrations, and managing API keys or tokens
  • Familiarity with managing user roles, permissions, and best practices within Data Dog
  • English language proficiency

Responsibilities

  • Monitor and troubleshoot issues within the Data Dog environment
  • Integrate Data Dog seamlessly with cloud environments
  • Ensure the platform's observability and scalability
  • Create and manage dashboards in DataDog, Prometheus, and Grafana
  • Configure alerts and handle application performance monitoring (APM) in DataDog, Prometheus, and Grafana
  • Integrate Data Dog with CI/CD pipelines
  • Utilize Data Dog's monitoring and logging features
  • Integrate cloud services (AWS) with Data Dog for unified monitoring
  • Monitor containerized environments using Kubernetes, integrated with Data Dog
  • Automate monitoring tasks and configure Data Dog via scripts using Python
  • Install Data Dog agents
  • Configure Data Dog integrations
  • Manage API keys or tokens for secure access to the Data Dog platform
  • Manage user roles, permissions, and best practices within Data Dog

Preferred Qualifications

  • Experience migrating teams from proprietary tracing models (like DataDog's APM) to OpenTelemetry for distributed tracing
  • Ability to make the platform capable of using OpenTelemetry and guide teams through the transition
  • Experience working with teams to migrate and consolidate individual keys to a service account model

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let IT Scout know you found this job on JobsCollider. Thanks! πŸ™