Newsela is hiring a
Site Reliability Engineer, Remote - Worldwide

Site Reliability Engineer closed

🏢 Newsela

💵 $95k-$105k
📍 Worldwide

Summary

This role is for a Site Reliability Engineer in Newsela's Technology team. Responsibilities include joining the on-call rotation, maintaining and extending infrastructure, building monitoring, improving operational processes, debugging production issues, supporting developers with infrastructure, and designing and maintaining cloud infrastructure. Candidates should have 2+ years of experience as a Site Reliability Engineer and be familiar with Terraform, GitHub CI/CD, containerization, cloud technologies, monitoring and instrumentation, and engineering practices. They should also be able to work in a variety of languages, plan using agile methodologies, and collaborate well within a team.

Requirements

  • 2+ years of experience as a Site Reliability Engineer
  • Background in infrastructure as code: using Terraform and GitHub CI/CD for automation, containerizing our environments (Docker, ECS), and leveraging cloud technologies to meet our goals
  • Systems experience: managing, configuring, and troubleshooting operating systems, storage (block and object), and networking (VPCs, proxies, and CDNs), and administering high-availability datastores (MySQL, Postgres, Neo4j) and Redis clusters
  • Monitoring and instrumentation: implement metrics in Datadog, Sentry, log management and related systems, and Slack/JIRA integrations
  • Understanding of engineering practices: availability, reliability and scalability, as well as disaster recovery
  • Ability to work in a variety of languages: Shell, IaC, Python, and SQL
  • Ability to plan using agile methodologies, with epics and issues to drive projects
  • Personal and team workload organization and ability to self-organize and accomplish tasks asynchronously
  • Contributing to Newsela architecture diagrams, process diagrams and runbook documentation
  • Completing Root Cause Analysis (RCA) investigations and performing readiness reviews
  • Improving team practices through code reviews, handoffs of work, and incidents
  • Self-awareness, handling conflict in the team, providing and receiving feedback, and maintaining good relationships with other engineering teams

Responsibilities

  • Be on an on-call rotation to respond to incidents that impact Newsela.com availability and provide support for developers during internal and external incidents
  • Maintain and assist in extending our infrastructure with Terraform, GitHub Actions CI/CD, Prefect, and AWS services
  • Build monitoring that alerts on symptoms rather than outages using Datadog, Sentry and CloudWatch
  • Look for ways to turn repeatable manual actions into automations to reduce on-call toil
  • Improve operational processes (such as deployments, releases, and migrations) to make them run seamlessly with fault tolerance in mind
  • Design, build and maintain core cloud infrastructure on AWS and GCP that enables scaling to support thousands of concurrent users
  • Debug production issues across services and levels of the stack
  • Provide infrastructure and architectural planning support as an embedded team member within a domain of Newsela’s application developers

Benefits

  • Health & Wellness: Access to the world’s leading medical experts for healthcare (pets included!). Discounts and resources to stay healthy: mind, body, and soul
  • Work From Home: Almost all of our roles are fully remote - tech stipend included!
  • Supporting ALL Families: Supplemental programs and time off to take care of your family and yourself
  • Time Off: Flexible PTO to recharge, including Sabbatical Leave
  • Professional Development: Annual stipends for continued learning and education
  • Make A Difference: No matter your role or department, the work you do each day helps shape the future of education and improves the lives of students and teachers
This job is filled or no longer available
