Director of Engineering, Infrastructure

Summary
Join Ditto, a rapidly expanding startup, as the Director of Engineering - Infrastructure, leading and unifying Site Reliability Engineering (SRE), Platform, and CI Infrastructure teams. You will design and execute the vision for platform excellence, transforming Ditto's engineering culture to prioritize reliability and resilience. This critical role involves consolidating infrastructure knowledge, developing a comprehensive infrastructure strategy for massive scale, and strengthening collaboration between SRE and Platform teams. You will establish a sustainable working cadence, partner with the Cloud team, implement cloud best practices, optimize Kubernetes, and build a high-performing team. The position also involves collaborating with other teams to align infrastructure capabilities with business needs, managing budgets, making strategic technology decisions, and establishing robust monitoring and alerting strategies.
Requirements
- 10+ years of experience in cloud SRE/Platform engineering, with deep expertise in distributed systems and infrastructure at scale
- 5+ years of experience with Kubernetes in production environments
- 5+ years of experience managing SRE/Platform engineering teams
- Proven track record of managing teams of 10+ people and scaling engineering organizations
- Strong AWS experience is required; experience with GCP and Azure is highly desired
- Advanced knowledge of cloud best practices and infrastructure automation
- Excellent verbal and written communication skills with the ability to explain complex technical concepts to diverse audiences
- Strong conflict management and leadership skills
- Experience managing through rapid growth phases and organizational change
- Strategic thinking abilities with a focus on business outcomes
- Track record of successfully reducing operational overhead while improving system reliability
- Experience with CI/CD pipelines and developer productivity tooling
- Understanding of security best practices and compliance requirements
Responsibilities
- Lead and manage our SRE, Platform, and CI Infrastructure teams
- Manage managers and key ICs such as architects and senior staff engineers
- Design, own, and execute the vision for platform excellence at Ditto
- Play a central role in transforming Ditto's engineering culture into one that prioritizes the reliability and resilience of our mission-critical software
- Communicate and articulate this mission across the entire company in all-hands meetings, presentations, and working sessions, and through the enactment of strategic objectives
- Consolidate infrastructure knowledge and expertise to reduce cognitive load across the organization and create scalable processes
- Develop and execute a comprehensive infrastructure strategy that prepares Ditto for massive scale based on current sales trajectory
- Strengthen the bond between SRE and Platform teams, fostering collaboration and shared ownership
- Establish and maintain a healthy, sustainable working cadence for all infrastructure teams while reducing on-call incident frequency
- Partner with the upcoming Cloud team leadership to ensure seamless integration of infrastructure services
- Implement best practices for cloud infrastructure management, focusing on AWS with consideration for multi-cloud strategies
- Lead the optimization of Kubernetes across our infrastructure stack
- Build and mentor a high-performing team of infrastructure engineers and managers
- Collaborate with Product, Sales, and other Engineering teams to align infrastructure capabilities with business needs
- Manage infrastructure budgets and make strategic decisions about tooling and technology investments
- Establish SLOs, monitoring, and alerting strategies that ensure reliability at scale
- Champion infrastructure as code, automation, and self-service capabilities
Preferred Qualifications
- Experience in companies going through hyper-growth phases
- Background in edge computing or distributed database technologies
- Experience with multi-region and multi-cloud architectures
- Knowledge of mesh networking or peer-to-peer systems
- Previous experience working with enterprise customers with high reliability requirements
Benefits
- Health, dental, vision, life, and disability insurance
- A 401(k) and flexible spending accounts
- Private healthcare through Vitality
- A pension plan
- Flexible time off
