📍Worldwide
Senior Engineer, NSO

Life360
💵 $90k-$172k
📍Remote - United States, Canada
Please let Life360 know you found this job on JobsCollider. Thanks! 🙏
Summary
Join Life360's Network and Systems Operations team as a Site Reliability Engineer and contribute to the observability and reporting capabilities of our large-scale distributed systems. Monitor the day-to-day operations of Life360's services, onboard new services, and maintain visibility of metrics for existing services. Respond to alerts, execute runbooks, and escalate major issues. Collaborate with a strong team, leveraging your expertise in large-scale systems, observability tools (Prometheus, Grafana, Datadog), and automation. This remote-first position offers competitive compensation and a comprehensive benefits package. Life360 is committed to diversity and inclusion, encouraging applications from all backgrounds.
Requirements
- Bachelor's in Computer Science, Engineering, related field or equivalent practical experience
- 5+ years experience writing/reading/debugging code in one or more languages, such as: Java, Python, Shell, Ruby
- 5+ years experience working with large-scale distributed systems and managing Linux-based systems in a cloud like AWS
- In depth experience with large scale observability and reporting systems (New Relic, Datadog, ElasticSearch, Prometheus, etc)
- 3+ year(s) experience with solutions such as Docker, Kubernetes, system virtualization, cloud monitoring and logging
- 3+ years experience with IaC and config management tools such as Terraform, Cloudformation, Chef, Ansible, and similar
- Experience working as part of a team, using analytical, problem-solving skills
- Excellent troubleshooting and attention to detail
- Ability to quickly learn new technologies and follow industry trends
- Ability to analyze and optimize high-traffic internet applications
Responsibilities
- Use tools such as Prometheus, Grafana, and Datadog to create and maintain observability infrastructure and tooling, including creating alerts, production reporting, and writing documentation
- Manage observability infrastructure
- Serve as a member of “follow the sun” L1 on-call support, working alone or with teammates to answer pages for all onboarded services and resolve or escalate issues in a timely manner
- Utilize anomaly detection and alerting, respond to alerts in PagerDuty, drive incidents to their conclusion, and lead the effort to strengthen the system based on post-mortem action items
- Coordinate cross-team and cross-functional efforts with processes, documentation, and tooling to ensure operational excellence
Benefits
- Competitive pay and benefits
- Medical, dental, vision, life, and disability insurance plans (100% paid for US employees). We offer supplemental medical and dental plans for Canadian employees
- RRSP plan with DPSP company matching program
- Employee Assistance Program (EAP) for mental wellness
- Flexible PTO and 12 company-wide days off throughout the year
- Learning & Development programs
- Equipment, tools, and reimbursement support for a productive remote environment
- Free Life360 Platinum Membership for your preferred circle
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.