Burq is hiring a
SRE/ Platform Engineer

Logo of Burq

Burq

πŸ’΅ ~$117k-$210k
πŸ“Remote - United States

Summary

The job is for a Senior Platform Software Engineer at Burq, a fast-growing delivery network company backed by leading venture capitalists. The role involves investigating and resolving technical issues related to the software platform, leading automation efforts, and contributing to release management. The candidate should have proficiency in application and infrastructure monitoring, experience with Node.js, Docker, AWS, and other technologies, and a solid understanding of continuous integration, delivery, and microservices architecture.

Requirements

  • Proficiency in application and infra monitoring, using tools such as DataDog, Grafana, and Sentry. Drive organizational best practice on service instrumentation and alerting
  • Experience managing and maintaining foundational services such as MySQL, SQS, Kafka, Redis, etc. Know enough to ramp up quickly and triage production issues involving these services
  • Experience in Node.js and commitment to becoming an expert in managing Node processes. Read and contribute to application code, understand Node performance characteristics and common issues
  • Proven experience in operating Docker-based containers on AWS using terraform
  • Solid understanding of continuous integration, continuous delivery, microservices architecture, and infrastructure as code
  • Ownership and strategic vision to chart your own roadmap. Work with various stakeholders to identify different types of platform risks, prioritize them, and ensure timely mitigation before the risks turn into incidents

Responsibilities

  • Ensure the critical success of the scaling and growth of the software platform and transform the way businesses offer on-demand & same-day delivery
  • Investigate random ECS container crushes in production both in the infra layer and application layer, and identify issues such as problematic Docker image layering, insufficient infrastructure provisioning, and application issues in Node.js
  • Lead automation efforts and establish engineering practice to monitor, alert, and triage database performance issues (we use MySQL) caused by inefficient queries, imperfect indexes, unoptimized query planner, and under-provisioned database resources
  • Investigate memory leak by instrumenting/profiling the application code and infrastructure layer. Lead collaborative efforts with application devs and devops to identify root cause
  • Identify bottlenecks in SQS messaging throughput, and work with application devs to identify root cause and remediation
  • Contribute to release management and help move us towards continuous delivery with canary release

Benefits

  • Competitive salary and opportunity for equity
  • Option to work fully remotely or in-person
  • Medical, dental and vision insurance
  • Reimbursement for educational courses
  • Generous Time Off 🏝

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.

Similar Jobs

Please let Burq know you found this job on JobsCollider. Thanks! πŸ™