Summary
Join our team as a Site Reliability Engineer at Forma. You will design and implement foundational infrastructure and developer experience, and shape the direction of our infrastructure and tooling as we grow.
Requirements
- 8+ years of backend software development experience
- Strong understanding of operating systems (Linux, Windows, etc.), networking, and cloud infrastructure (AWS, GCP, and/or Azure)
- Knowledge of containerization and container orchestration technologies (e.g., Kubernetes)
- Expertise in at least one programming language (Python, Go, Java, etc.) and a scripting language such as Bash
- Experience with CI/CD pipelines and tools (CircleCI, GitHub Actions, etc.) to automate software delivery and deployments
- Working knowledge of infrastructure-as-code (IaC) tools such as Terraform
Responsibilities
- Build and maintain on-call, monitoring and alerting systems to proactively detect issues before they impact users
- Troubleshoot and resolve outages, conduct post-incident reviews (postmortems), and implement changes to prevent future occurrences
- Develop tools and scripts to streamline operations, reduce manual toil, and improve efficiency
- Build, maintain, and troubleshoot CI/CD infrastructure, and promote best practices
- Analyze complex problems, identify root causes, and develop effective solutions
- Mentor Engineering team members to promote a culture of technical excellence and innovation
- Apply software engineering principles to infrastructure and operations
- Conduct chaos engineering experiments to identify system weaknesses and improve resilience
Preferred Qualifications
- Experience at an early-stage startup is a plus
- Fintech experience is a plus
Benefits
- Remote-first working environment
- Medical, dental and vision insurance plans
- Employee wellness program
- One-time home office stipend
- 401(k) savings plan
- Flexible PTO policy
- 12 weeks of parental leave, plus 4 additional weeks for the birthing parent