Monitoring And Observability Administrator

Alter Solutions Portugal

πŸ“Remote - Spain

Summary

Join the UNICC Monitoring team as a Monitoring & Observability Administrator in Valencia, Spain or remotely within the CET/GMT time zone. This role requires proficiency in SCOM, Checkmk, and Elastic monitoring tools within a complex enterprise environment. You will administer, extend, and develop the monitoring infrastructure, integrate APIs, create monitors and management packs, and develop automations using Ansible. Strong scripting and programming skills are essential, along with excellent communication and problem-solving abilities. The position offers a teleworking option with on-call requirements of one week per month on a rotational basis.

Requirements

  • Good experience with SCOM, Checkmk, Elastic Observability (Elasticsearch & APM) Monitoring tools
  • Good scripting knowledge in Bash, PowerShell, Python, or similar
  • Programming skills in .NET, C#, Python
  • Proficient in English
  • Customer-facing experience and oral communication skills
  • Ability to write documentation & reports
  • Creativity and the ability to find innovative solutions
  • Willingness to learn on the job
  • Conflict management & cooperation
  • Experience with and understanding of complex enterprise environments
  • Any of the following certifications: Elasticsearch Engineer, Elastic Observability Engineer, System Monitoring with Checkmk, System Center Operations Manager

Responsibilities

  • Administer the monitoring infrastructure (SCOM / Checkmk / Elastic), ensuring that it is stable, up to date, well designed, properly tuned and properly maintained
  • Extend the current infrastructure and/or implement a new infrastructure following capacity management plans
  • Develop, improve, configure, and maintain monitoring
  • Develop integration through APIs
  • Develop ad-hoc monitors when required
  • Develop SCOM management packs
  • Develop and maintain automations using Ansible
  • Develop reports
  • Identify and troubleshoot known problems and document solutions
  • Provide guidance to Tier 1 workforce
  • Provide guidance and coach team members when needed
  • Create relevant statistics for the several monitoring tools
  • Perform required tasks in maintenance windows
  • Provide “stand-by” services on a rotation basis during weekends, holidays and outside of normal working hours
  • Perform other duties as required

Preferred Qualifications

  • Knowledge of other monitoring tools such as OEM or Prometheus is also desirable
  • Experience with a full-stack observability platform is also desirable
  • Experience with automation tools and runbooks, such as Ansible and YAML, is also desirable
  • Experience with KQL, ESQL, PromQL, JSON and dashboards in Kibana is also desirable
  • Experience with reporting tools such as SSRS and Power BI is also desirable
  • DevOps practice is desirable
  • Relevant industry certifications
  • Monitoring tools certifications
  • Programming language skills, preferably in .NET, C#, or Python

Benefits

  • Teleworking Option: Yes
  • On-call requirements: One week per month (rotation is subject to the number of team members)
