Senior Infrastructure Engineer

SentinelOne
Summary
Join SentinelOne's Cloud Fabric team as a Senior Infrastructure Engineer and build the platform powering our AI cybersecurity solutions. Lead the design and automation of critical cloud systems, embracing an "Infrastructure as Code" philosophy using Terraform and Python. You will build innovative tooling, troubleshoot complex problems, and mentor other engineers. This role requires proven cloud expertise, mastery of Infrastructure as Code, strong programming skills, and deep networking knowledge. The position offers a competitive benefits package including stock and bonuses, flexible time off, insurance and health benefits, and work perks.
Requirements
- Proven Cloud Expertise: 5+ years of experience designing, building, and operating business-critical infrastructure services in a large-scale public cloud environment (e.g., AWS, GCP, Azure)
- Mastery of Infrastructure as Code: Deep, hands-on expertise building and scaling infrastructure with Terraform. You should be comfortable creating reusable modules and maintaining complex state
- Strong Programming Skills: Proficiency in Python or another high-level language (e.g., Golang, Java). You write clean, maintainable code for automation and building services, and you're always willing to learn
- Deep Networking Knowledge: A solid understanding of cloud networking principles, including VPC architecture, routing, DNS, load balancing, and network security in a multi-account environment
- Observability Mindset: Practical experience building and operating modern monitoring and alerting stacks (e.g., Prometheus, Grafana, Thanos) to ensure operational excellence
- Collaborative Communication: Excellent written and verbal communication skills, with a demonstrated ability to partner effectively with other teams and influence technical decisions
Responsibilities
- Build a World-Class Platform: Take ownership of critical components within our next-generation Cloud Infrastructure Platform. You will design, build, and evolve the core control planes and paved-road solutions that our developers use every day
- Drive Platform Adoption and Enablement: Treat our infrastructure as a product. You will partner with engineering teams as your customers, helping them adopt our platform to ship features with greater speed, reliability, and confidence
- Own Reliability for Core Infrastructure: Define and own the SLOs for our core cloud infrastructure by deeply instrumenting its components within our observability platform. The insights you generate will directly inform the proactive engineering required to improve performance, latency, and availability. You will participate in a sustainable on-call rotation to transform operational incidents into lasting reliability improvements
- Automate to Eliminate Toil: Build robust, scalable automation and tooling that simplifies complex processes, improves system reliability, and increases developer velocity
- Lead the design and automation of critical cloud systems to support hyper-growth
- Embrace an "Infrastructure as Code" philosophy, using tools like Terraform and Python to create declarative, repeatable, and secure environments
- Dive deep to research, troubleshoot, and resolve complex, distributed systems-level problems
- Build innovative tooling and automation that eliminates toil and accelerates the entire engineering organization
- Mentor other engineers and champion best practices in cloud architecture and operational excellence
Preferred Qualifications
- Production Kubernetes Experience: Hands-on experience running Kubernetes in a production environment is a plus. You are familiar with its core architecture, APIs, and networking models. Experience with managed offerings (GKE, EKS) is highly valued
Benefits
- Stock & Bonuses: Grant of Restricted Stock Units with a 4-year vesting plan, annual performance-based bonuses, and an employee stock purchase plan
- Time Off & Well-being: Flexible Time Off on top of the standard 5 weeks of vacation, flexible paid sick days, fully paid Short Term Sick/Nursing Leave, 16-week parental leave, grandparent leave, and additional company holidays
- Insurance & Health: Pension insurance contribution, premium life insurance, private medical care (for you and a plus-one), and a Global Employee Assistance Program
- Work Perks: Monthly meal and well-being allowance, high-end MacBook/Windows laptop, work-from-home support, and in-office refreshments
- Growth & Community: LinkedIn Learning, internal mentoring, educational support, generous referral bonuses, and optional company events (sports, BBQs, charity)