Lumin Digital is hiring a
Senior Site Reliability Engineer, Remote - United States

Logo of Lumin Digital

Senior Site Reliability Engineer closed

🏢 Lumin Digital

💵 $170k-$200k
📍United States

Summary

The job is for a Site Reliability Engineer at Lumin Digital. The SRE will ensure high application availability, work with Software Engineers, engage in capacity planning, change management, and uptime reporting. They should have operational expertise, excellent troubleshooting skills, knowledge of configuration management systems, networking protocols, and cloud services like AWS, Google Cloud, and Azure. A bachelor's degree or higher in Computer Science is required.

Requirements

  • Cultural fit
  • Humility
  • Strong sense of ownership, customer service, and integrity
  • Willing to walk in the mud
  • Commitment to continually improving yourself
  • Operational expertise with a desire to eliminate manual tasks: DevOps approach - Automation and resilient systems are key
  • Monitoring and Alerting - Monitor the right things. Alert appropriately: Self heal. Involve people when needed. Log tickets when no immediate action is required
  • Remain calm in trying circumstances
  • Exceptional full stack and environment troubleshooting skills
  • Expert-level knowledge of at least one configuration management system (Chef, Ansible, Puppet, etc.)
  • Understanding of standard networking protocols and components such as: HTTP, DNS, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing
  • Security mindset. Data cannot and will not be compromised
  • Driven to ensure that being on-call is boring
  • Exceptional written and verbal communication skills
  • Past history working on an agile scrum team
  • Expert hosting in the Cloud. AWS preferred, but Google Cloud and Azure are also of interest
  • Experience with a microservice architecture running in containers (Docker or other containerization technology)
  • Experience with Terraform and Kubernetes
  • Understand CI / CD and ability to architect the workflow
  • Willing to participate in a 24x7 on-call rotation

Responsibilities

  • CI/CD: Monitor and resolve issues in all environments. Ensure SLO and uptime are met
  • Ensure SRE concerns are addressed from the design of a feature through its deployment to production
  • Work on the SRE scrum team
  • Engage in capacity planning and demand forecasting, anticipating performance bottlenecks and scaling the environment as needed
  • Change management
  • Uptime and SLO reporting

Preferred Qualifications

  • 2+ years of experience as a software engineer. C#, Angular, JavaScript preferred
  • AWS Certification preferred but not essential: SysOps and/or Solutions Architect ideal
  • Experience with Amazon RDS, EKS, CloudWatch, etc
  • Experience with Docker tooling and ecosystem

Benefits

  • Bachelor’s degree or higher in Computer Science, or equivalent experience
  • $170,000 - $200,000 a year
This job is filled or no longer available

Similar Jobs