Senior Engineering Manager, Site Reliability
Coalition, Inc.
Job highlights
Summary
Join Coalition as a Senior Engineering Manager to lead our Platform Site Reliability Engineering (SRE) team. This pivotal role involves designing, building, and operating the foundational platform powering our core products and services. You will manage a talented team of engineers, aligning engineering efforts with business goals, and fostering a collaborative, innovative environment. The ideal candidate possesses strong technical breadth, people management expertise, and agile project leadership. This high-impact role requires a strong leader who can manage a talented team of engineers, align engineering efforts with business goals, and foster a culture of collaboration, innovation, and excellence. You will be responsible for ensuring the reliability, scalability, and efficiency of our cloud-based infrastructure.
Requirements
- 7+ years of overall SRE or Software engineering experience, with 3+ years managing a team of at least 5 engineers
- Demonstrated success in hiring, growing, and managing diverse engineering teams
- Broad technical depth across modern cloud-based technologies and architectures (e.g., AWS, Kubernetes, Terraform)
- Strong foundational knowledge in software engineering principles, reliability engineering, and distributed systems
- Experience building and operating platform infrastructure at scale, including CI/CD pipelines, observability, and incident response
- Proficiency in agile planning and execution, with a strong track record of delivering features and platform capabilities
- A customer-centric approach to solving platform challenges and driving developer enablement through the use of tooling and/or internal development platforms
- Excellent written and verbal communication skills, with the ability to influence across teams and functions
- Technical background to objectively evaluate complex project risks and issues
- Strong analytical, planning, and organizational skills with an ability to manage competing priorities
Responsibilities
- Lead and develop a high-performing SRE team, hiring and growing world-class talent
- Foster team growth by mentoring engineers, identifying skill gaps, and managing career development plans
- Manage performance effectively, balancing recognition of achievements with constructive feedback and coaching
- Drive the development and operation of reliable, scalable, and secure platform infrastructure
- Provide guidance on cloud-based architectures, distributed systems, and service-oriented design
- Partner with engineering teams to enhance operational readiness, ensure observability, and optimize for reliability and scalability
- Balance managing technical debt with delivering new features and meeting business priorities
- Collaborate with stakeholders to prioritize team deliverables and align on roadmaps
- Track and communicate progress, risks, and dependencies effectively across all levels of the organization
- Be a leader for reliability, operational best practices, and customer-focused solutions
Preferred Qualifications
- Experience in reliability engineering, platform engineering, or DevOps roles
- Proficiency with service-oriented architectures, Kubernetes, or service mesh technologies
- In-depth knowledge of cloud cost management and optimization strategies
- Familiarity with cybersecurity, risk management, or insurance-related platforms
Benefits
- 100% medical, dental, and vision coverage
- Flexible PTO
- Annual home office stipend and WeWork access
- Mental & physical health wellness programs like Headspace, Lumino, and more!
- Competitive compensation and opportunity for advancement
Share this job:
Similar Remote Jobs
- π°$154k-$227kπUnited States
- π°$60k-$120kπAsia
- π°$177k-$213kπUnited States
- πUnited States
- π°$78k-$135kπUnited States
- πAustralia
- πPoland
- πUkraine
- πWorldwide