ScienceLogic is hiring a
Principal Site Reliability Engineer

ScienceLogic

💵 ~$176k-$314k
📍 Remote - United States

Summary

Join our team as a Principal Site Reliability Engineer and contribute to building the foundation for Autonomic IT. We're looking for someone with experience in cloud technologies, an automation mindset, and SRE discipline. As a key member of our Site Reliability Engineering team, you'll design, deploy, and maintain the cloud infrastructure that runs the company's revenue-generating, go-forward SaaS product line.

Requirements

  • Must be a U.S. Citizen
  • 7-10 years of site reliability engineering or cloud operations experience, or equivalent
  • Proven track record of operating production SaaS environments within security standards such as FedRAMP, SOC2, ISO, and PCI
  • Bachelor's or Master's degree in Computer Science, Information Systems, or a similar field
  • Skilled at problem solving, algorithms, and data structures, applied in line with modern SaaS security requirements
  • Experience building tools and scripting frameworks from scratch
  • Experience working with cloud automation tools such as CloudFormation, Terraform, CDK, and aws-cli
  • Experience with scripting languages such as Python, Groovy, PowerShell, Bash, and Perl
  • Exposure to Windows and Linux administration
  • Familiarity with basic networking, security and cloud engineering concepts
  • Highly collaborative with effective written and verbal communication skills

Responsibilities

  • Enhance the company's SaaS infrastructure security protocols
  • Collaborate across the organization to design, build, and operationalize SaaS services that conform to security standards such as FedRAMP, SOC2, and ISO
  • Participate in architecture, security, and operations reviews
  • Lead design reviews and the buildout of secure systems that deliver SaaS services with 99.99% uptime
  • Design, automate, test, and monitor the use of cloud native technologies as a foundation for a service platform
  • Investigate and resolve customer and operational issues with a mindset of fixing issues, not just mitigating them
  • Identify and automate measurement of operations SLAs and SLOs
  • Triage incidents, document SOPs and runbooks, and train NOC team members
  • Write automation that can be easily supported and extended by others
  • Work on special projects as assigned

Benefits

  • A remote-first culture - work from home or come into the office, it's totally up to you
  • Comprehensive medical, dental and vision plans
  • 401(k) plan with employer match
  • Flexible Paid Time Off (FTO) so that you can take the time that you need to re-energize
  • Volunteer Time Off (VTO) - take two days off per calendar year to volunteer with your preferred charitable organization
  • 5-year Service Milestone Sabbatical
  • Paid parental leave
  • Generous employee referral bonus program
  • Pet insurance

Please let ScienceLogic know you found this job on JobsCollider. Thanks! 🙏