Site Reliability Engineer

closed

iManage

📍 Remote - Worldwide

Summary

Join a supportive, experienced team with an inclusive, encouraging, and vibrant culture. Have flexible work hours that allow me to balance my 'me time' with my work commitments. Collaborate in a modern open-plan workspace, with a gaming area, free snacks, drinks and regular social events.

Requirements

  • Extensive knowledge working in a public cloud and/or hosted datacenter environment (Azure preferred)
  • Experience with highly available and scalable systems
  • Familiarity with Google SRE concepts (as defined by Google: https://sre.google/sre-book/table-of-contents/)
  • Ability to manage Windows Servers, AD, and .NET applications (C#.NET/ASP)
  • Experience with IIS and MS SQL configuration and support
  • Familiarity with Linux Server stacks (Ubuntu/Debian distributions preferred)
  • Basic to intermediate knowledge of networking (subnetting, CIDR)
  • Experience with at least one cloud provisioning platform (HashiCorp Terraform preferred)
  • Experience with at least one configuration management platform (Chef preferred)
  • Familiarity with containerization/clustering technologies (e.g., Docker, Azure Kubernetes)
  • Familiarity with alerting and monitoring tools (Prometheus/Grafana or ELK/EFK preferred)
  • Working knowledge of CI/CD
  • Experience writing technical documentation and SOPs for internal stakeholders
  • Willingness to collaborate with other teams, providing root cause analysis and problem analysis as needed
  • A Bachelor's degree in Computer Engineering or related field (or equivalent experience)
  • Proficiency in at least one scripting language (PowerShell, Python, Ruby, etc.)

Responsibilities

  • Participating in, and facilitating, agile sprints and associated ceremonies
  • Driving innovation and platform evolution
  • Scaling cloud infrastructure to support our growing ecosystem based on Kubernetes
  • Providing reliable, predictable deployment and maintenance of distributed systems
  • Adhering to security best practices
  • Writing and designing automation, monitoring, diagnosing, and debugging tooling
  • Coordinating and participating in production support and on-call rotations
  • Conducting incident management and contributing to associated retrospectives/postmortems as needed
  • Working cross-functionally with cloud operations, development, and product teams

Benefits

  • Creating an inclusive environment where I can help shape the culture not just by fitting in, but by adding to it
  • Providing a market competitive salary that is applied through a consistent process, equitable for all our employees, and regularly reviewed based on industry data
  • Rewarding me with an annual performance-based bonus
  • Offering comprehensive Health/Vision/Dental/Life Insurance, and a 401k Retirement Savings Plan with a company match up to 4%
  • Giving access to HealthJoy, a healthcare concierge service, to help me maximize my health benefits
  • Granting enhanced leave for expecting parents; 20 weeks 100% paid for primary leave, and 10 weeks 100% paid for secondary leave
  • Providing me with a flexible time off policy to take the time off that I need, be it for vacation, volunteering, celebrating holidays, spending time with family, or simply taking time to recharge and reset
  • Caring for my mental health and well-being with multiple company wellness days and free access to the Healthy Minds app for mindfulness, meditation and more
This job is filled or no longer available