Cloud Operations Engineer

closed
Extreme Networks Logo

Extreme Networks

πŸ“Remote - Canada

Summary

Join Extreme Networks as a Cloud Operations Engineer and be part of a talented team building highly reliable, scalable, and secure cloud solutions. You will design, develop, and implement deployment automation solutions, participate in continuous cloud service operations, troubleshoot complex issues, and collaborate with various teams. The role requires managing and maintaining infrastructure across AWS, GCP, and Azure, and involves working with Kubernetes, Docker, and other technologies. The position is based in Toronto, Ontario, but remote work within Canada is possible. Extreme Networks offers a hybrid work approach, combining remote work with access to office resources.

Requirements

  • BS level technical degree required; Computer Science or Engineering background preferred
  • 5+ years of experience in a CloudOps / DevOps role
  • Hands on experience with AWS or any public cloud (Azure, GCP etc)
  • Knowledge of Linux, security and networking fundamentals
  • Working knowledge of container-based architecture and deployment (Docker, Kubernetes.)
  • Working knowledge of deployment automation development (Argo Workflows, Terraform, Helm)
  • Experience in diagnosing and resolving complex application problems
  • Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Kafka and RabbitMQ
  • Experience with monitoring tools (Nagios, Kibana, Prometheus)
  • Strong follow-through and initiative to stay with issues until they are resolved
  • Comfortable working within a distributed team located in multiple time zones

Responsibilities

  • Manage and maintain ExtremeCloud service infrastructure in AWS, GCP & Azure
  • Participate in continuous cloud service operations with US, EU, and China teams
  • Troubleshoot and follow up on production infrastructure / application related issues
  • Driving root cause analysis and resolution
  • Communicate with Dev/QA as well as external carriers to resolve and prevent issues
  • Participate in release deployment, system maintenance and cloud expansion
  • Design and implement deployment automation platform for Kubernetes based microservices
  • Improve service availability and scalability through tuning, automation, tools, and process
  • Analyze service performance, identify bottleneck and provide actionable improvement plans
  • Improve service monitoring coverage, accuracy and efficiency
  • Participate in cloud security and compliance implementation

Preferred Qualifications

  • Experience with cloud security and compliance implementation is a plus
  • Location: Toronto, Ontario is preferred - our site is located in Thornhill. We're utilizing a FlexFirst (hybrid) approach to allow employees to work both remotely and have access to our offices for collaboration, labs and office resources. We're also open to remote hire in Canada in order to find the right skills for this role
This job is filled or no longer available