Principal Site Reliability Engineer

Accela

💵 $170k-$190k
📍 Remote - Worldwide

Summary

Join Accela's Cloud Engineering & Operations team as a Site Reliability Engineer to ensure the availability, performance, security, and scalability of our SaaS offerings. You will lead platform modernization projects, drive containerization efforts, and improve system reliability and efficiency through automation, collaborating with various teams across the full software lifecycle. This strategic role also involves technical leadership and mentorship, deep troubleshooting, designing and maintaining observability dashboards, and shaping future cloud architecture. Strong collaboration skills in a remote environment are required.

Requirements

  • 10+ years in software engineering or production systems engineering, ideally in a SaaS environment
  • Proven leadership in platform modernization and containerized architecture (e.g., Kubernetes)
  • Deep experience with Azure cloud services and infrastructure (CLI/API usage)
  • Strong command of Git/GitHub and version control best practices
  • Expertise in troubleshooting complex, distributed systems (full stack)
  • Proficiency in Infrastructure as Code tools, particularly Terraform
  • Experience with Ansible or similar configuration management tools
  • Knowledge of scripting languages such as Bash, Python, Ruby, or Go
  • Strong understanding of system internals (OS-level) and system administration
  • Familiarity with production monitoring, alerting, and logging tools
  • Demonstrated experience using AI tools (e.g., GitHub Copilot) to enhance development
  • Excellent written and verbal communication skills, with the ability to present to senior leadership
  • Strong problem-solving skills and a systematic approach to incident management

Responsibilities

  • Lead and execute platform modernization and refactoring projects in alignment with CI/CD best practices and strategic goals
  • Drive the full containerization of the Accela platform, including orchestration using Kubernetes or similar technologies
  • Improve the reliability, scalability, and efficiency of systems by developing automation and infrastructure code
  • Ensure the availability, performance, and security of Accela's SaaS cloud services through proactive engineering and operational excellence
  • Provide technical leadership and mentorship to junior engineers, fostering a culture of continuous learning and growth
  • Perform deep troubleshooting during production incidents, conduct root cause analysis, and implement long-term corrective actions
  • Act as a senior escalation point for high-impact incidents, complex deployments, and critical change management events
  • Design, build, and maintain observability dashboards and key performance metrics to monitor system health and support data-driven decisions
  • Collaborate and lead cross-functionally in a remote environment, using collaboration tools such as Teams and Slack

Preferred Qualifications

  • Experience in Linux environments
  • Experience with PowerShell
  • Deep familiarity with Kubernetes deployments
  • Experience with Cloudflare and Harness.io
  • Hands-on experience with Python scripting
  • Background in implementing and maturing Incident, Problem, and Change Management processes

Benefits

  • Flexible time off
  • Comprehensive medical, dental, and vision plans
  • Family planning benefits
  • 401(k) retirement savings plan with company match
  • Health savings account with company contributions
  • Flexible spending account
  • Life, accident, and disability coverage
  • Business travel insurance
  • Employee assistance programs
  • Other well-being benefits
