Summary
Join Fleetio's Platform Engineering team as a Senior Site Reliability Engineer to maintain and enhance the performance of our Ruby on Rails stack and infrastructure. You will scale our application using best-in-class architecture and software design, proactively identifying and owning initiatives to improve performance, reliability, and scalability. This role involves collaborating with product engineers, leading database initiatives, and managing disaster recovery planning. The position is remote and open to candidates in the US, Canada, or Mexico. Fleetio fosters a culture of 'Product Engineers' and offers a competitive benefits package.
Requirements
- 5+ years of Ruby/Rails experience
- 3+ years of AWS experience
- Kubernetes experience
- Experience with profiling and benchmarking source code
- Effective at code review and at identifying potential performance problems before they reach production
- Experience with Datadog or other APM tools
- Excellent written and verbal communication skills
Responsibilities
- Proactively identify, triage, and resolve performance issues
- Enhance system observability by monitoring performance metrics across Ruby, Rails, and database systems, including SLOs and SLIs
- Guide product engineers on Ruby/Rails performance and database best practices through code reviews and pair programming
- Optimize performance through instance configuration and monitoring
- Collaborate with other SREs to proactively identify and address performance bottlenecks
- Lead database capacity planning and upgrade initiatives
- Manage the database-specific components of disaster recovery planning and execution
- Oversee backup systems and pre-production databases
- Create and maintain infrastructure and operations documentation
- Participate in the on-call rotation
Preferred Qualifications
- Experience with Infrastructure as Code tools (Terraform)
- Deep understanding of cloud network fundamentals (routing, firewalls, load balancers, CDNs, VPCs, etc.)
- Experience with distributed event and data stores, such as Kafka, Redis, Elasticsearch, Memcached, and TimescaleDB
- You know a thing or two about the fleet management industry
Benefits
- Multiple health/dental coverage options
- Vision insurance
- Incentive stock options
- 401(k) match of 4%
- PTO - 4 weeks
- 12 company holidays + 2 floating holidays
- Parental leave - birthing parent (16 weeks paid), non-birthing parent (4 weeks)
- FSA & HSA options
- Short- and long-term disability (short-term 100% paid)
- Community service funds
- Professional development funds
- Wellbeing fund - $150 quarterly
- Business expense stipend - $125 quarterly
- Mac laptop + new hire equipment stipend
- Monthly catered lunches
- Fully stocked kitchen with tons of drinks & snacks
- Remote working friendly since 2012