Summary
Join Motional's Site Reliability Engineering (SRE) team as a Senior Engineer to enhance the reliability, performance, and scalability of our infrastructure platforms. You will manage complex systems, deliver high-quality service, and collaborate with various teams. Responsibilities include developing and implementing reliability strategies, leading incident responses, optimizing AWS spend, mentoring junior engineers, and collaborating on automation. The ideal candidate possesses a BS in Computer Science or Engineering, 5+ years of relevant experience, strong AWS expertise, and experience with various tools and technologies. Motional offers a competitive salary, benefits including medical, dental, vision, 401k, and more.
Requirements
- BS in Computer Science, Engineering, or equivalent
- AWS Certifications and work experience
- 5+ years in SRE, DevOps or related roles
- Strong experience with AWS Cloud Platforms inclusive of DevOps, Automation, Networking, Connectivity and Cost Optimization
- Experience with infrastructure-as-code tools (e.g. Terraform , CloudFormation)
- Knowledge of CI/CD tools (e.g. GitLab CI, Jenkins)
- Strong expertise in containerization and orchestration technologies (e.g., Docker, Kubernetes)
- Solid understanding of networking topologies and concepts
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana , Cloudwatch, Datadog)
- Strong communication and interpersonal skills
- Exceptional problem solving skills
- Ability to thrive in a fast-paced, dynamic environment and manage multiple priorities
Responsibilities
- Develop and implement strategies to enhance system reliability, performance, and scalability
- Monitor system performance and health, proactively identifying and resolving issues before they impact users
- Lead the response to high-severity incidents, coordinating cross-functional teams to resolve issues and minimize downtime
- Develop or implement systems to facilitate incident management and troubleshooting
- Partner with the DevOps and other engineering teams to analyze and optimize AWS spend by implementing cost-effective strategies and identifying cost-saving opportunities and efficiency improvements in cloud infrastructure
- Mentor and guide junior team members on developing technical problem-solving skills and adopting industry best practices
- Collaborate closely with development and research teams around the world (Singapore, US) to drive the automation of operational tasks and processes to improve efficiency and reduce manual intervention
- Stay abreast of the latest industry developments to ensure that internal SRE practices align with Motionalβs overall business objectives and industry trends
Preferred Qualifications
- Experience in the AV industry or robotics
- Proficient in other Cloud Platforms such as GCP
- Experience designing tooling to simplify the operational management of SaaS/PaaS systems
- Experience with various programming languages (e.g. GO, Python, Java, C++, or Bash)
- Experience with Linux environments and software
- Experience with build tools (e.g. Bazel, CMake)
- Knowledge of ArgoCD or FLUX
Benefits
- Medical
- Dental
- Vision
- 401k with a company match
- Health saving accounts
- Life insurance
- Pet insurance
- Bonus
- Company equity
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.