Remote Principal Site Reliability Engineer

closed

ScienceLogic

πŸ“Remote - United States

Job highlights

Summary

Join our team as a Principal Site Reliability Engineer and contribute to building the foundation for Autonomic IT. We're looking for someone with experience in cloud technologies, an automation mindset, and SRE discipline. As a key member of our Site Reliability Engineering team, you'll design, deploy, and maintain the cloud infrastructure that runs the company's revenue-generating, go-forward SaaS product line.

Requirements

  • Must be a U.S. Citizen
  • 7-10 years of site reliability engineering or cloud operations experience, or equivalent
  • Proven track record of operating production SaaS environments within security standards like FedRAMP, SOC2, ISO, PCI
  • Bachelor's or Master's degree in Computer Science, Information Systems, or a similar field
  • Skilled at problem solving, algorithms, and data structures, applied in line with modern SaaS security requirements
  • Experience building tools and scripting frameworks from scratch
  • Experience working with cloud automation tools like CloudFormation, Terraform, CDK, aws-cli
  • Proficiency in scripting languages like Python, Groovy, PowerShell, Bash, Perl, etc.
  • Exposure to Windows and Linux administration
  • Familiarity with basic networking, security and cloud engineering concepts
  • Highly collaborative with effective written and verbal communication skills

Responsibilities

  • Enhance the company's SaaS infrastructure security protocols
  • Collaborate across the organization to design, build, and operationalize SaaS services conforming to various security standards like FedRAMP, SOC2, ISO, etc.
  • Participate in architecture, security, and operations reviews
  • Lead design reviews and buildout of secure systems for delivering various SaaS services with 99.99% uptime
  • Design, automate, test, and monitor the use of cloud native technologies as a foundation for a service platform
  • Investigate and resolve customer and operational issues with a mindset of fixing root causes, not just mitigating symptoms
  • Identify operations SLAs and SLOs and automate their measurement
  • Triage incidents, document SOPs and runbooks, and train NOC team members
  • Write automation that can be easily supported and extended by others
  • Work on special projects as assigned

Benefits

  • A remote-first culture - work from home or come into the office, it's totally up to you
  • Comprehensive medical, dental and vision plans
  • 401(k) plan with employer match
  • Flexible Paid Time Off (FTO) so that you can take the time that you need to re-energize
  • Volunteer Time Off (VTO) - take two days off per calendar year to volunteer with your preferred charitable organization
  • 5-year Service Milestone Sabbatical
  • Paid parental leave
  • Generous employee referral bonus program
  • Pet insurance

This job is filled or no longer available