Scout Motors is hiring a
Senior Site Reliability Engineer

Logo of Scout Motors

Scout Motors

πŸ’΅ $150k-$180k
πŸ“Remote - United States

Summary

Join us at Scout Motors and be part of shaping the future of transportation. If you're ready to drive change and make history, apply now!

Requirements

  • Bachelor's degree in computer science, information technology, or related field or equivalent work experience
  • 8+ years of hands-on experience as a Site Reliability, DevOps, or Cloud engineer
  • Proficient in building automation using languages such as Python, Shell, Ruby, and others
  • Strong experience with containerization technologies (Docker, Kubernetes)
  • Expertise in configuration management tools (e.g., Ansible, Chef, Puppet)
  • Solid understanding of CI/CD concepts and tools (GitHub Actions, GitLab Pipelines, Harness.io, ArgoCD)
  • Multiple years of experience working with cloud platforms such as AWS, Azure, or Google Cloud
  • Experience with monitoring and alerting tools such as Datadog, New Relic, SignalFX, Prometheus, AWS CloudWatch etc
  • Experience with logging solutions AWS CloudWatch, ELK, Splunk or equivalent
  • Experience with infrastructure as code (Terraform, Pulumi, or CDK)
  • Excellent problem-solving and troubleshooting skills. When a problem occurs, you run towards it not away
  • Effective communication and collaboration skills. You treat colleagues with respect. You have a desire for clean implementations but are also humble in discussing alternative solutions and options
  • A teaching and coaching approach to guiding engineers and teams in approaches

Responsibilities

  • Contribute to the design, implementation, and maintenance of the overall cloud infrastructure platform using modern IaC (Infrastructure as Code) practices
  • Work closely with software development and systems integration teams to build end-to-end solutions
  • Design and build infrastructure utilizing container orchestration such as EKS/K8S
  • Provide and participate in an Incident Response process for establishing disaster recovery practices
  • Ensure high uptime of critical systems
  • Design and implement an availability reporting framework working with engineering teams to develop SLO and SLI measurements and targets
  • Participate in scaling and performance testing of critical components and services
  • Design and implement cloud infrastructure components, ensuring high availability, reliability, scalability, and performance
  • Implement monitoring solutions to proactively identify and address potential issues
  • Implement logging solutions to facilitate efficient troubleshooting and analysis
  • Collaborate with security teams to ensure the platform meets industry standards and compliance requirements
  • Collaborate with cross-functional teams, including product managers, developers, and QA engineers to ensure robust and reliable systems

Benefits

  • Competitive insurance including: Medical, dental, vision and income protection plans
  • 401(k) program with: An employer match and immediate vesting
  • Generous Paid Time Off including: 20 days planned PTO, as accrued 40 hours of unplanned PTO and 14 company or floating holidays, annually Up to 16 weeks of paid parental leave for biological and adoptive parents of all genders Paid leave for circumstances related to bereavement, jury duty, voting time, or military leave

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Jobs

Please let Scout Motors know you found this job on JobsCollider. Thanks! πŸ™