Staff Artificial Intelligence Infrastructure Engineer

SentinelOne
$170k-$234k
Remote - United States
Summary
Join SentinelOne, a leading cybersecurity company, as a Staff AI Infrastructure Engineer. You will play a crucial role in designing, building, and maintaining scalable infrastructure for AI products and models, ensuring their efficient and secure deployment across diverse cloud environments. This position involves automating infrastructure management, optimizing Kubernetes clusters, implementing CI/CD pipelines, and ensuring compliance with security standards. You will collaborate with AI engineering, product teams, and DevOps to meet infrastructure requirements, monitor performance, and drive best practices.
Requirements
- A degree in Computer Science, Information Technology, or related field, or equivalent practical experience
- 7+ years of experience managing scalable, secure, and resilient infrastructure for AI and machine learning applications
- Deep proficiency with infrastructure-as-code and deployment tools like Helm, Terraform, and ArgoCD
- Extensive hands-on experience with Kubernetes for deploying containerized workloads
- Demonstrated experience with major cloud platforms (AWS, GCP, Azure), specifically with services related to AI model hosting (e.g., Azure OpenAI)
- Experience implementing and managing CI/CD pipelines (GitHub Actions, Jenkins)
- Familiarity with compliance frameworks, particularly FedRAMP, and security best practices
- Strong scripting and automation skills using Python, Bash, or similar languages
- Excellent problem-solving skills, creativity, and self-driven motivation
Responsibilities
- Architect, build, and maintain scalable infrastructure to host and serve AI products and models reliably
- Automate infrastructure deployment and management using Helm, ArgoCD, and Terraform
- Manage and optimize Kubernetes clusters to support high-performance AI workloads
- Implement and manage CI/CD pipelines utilizing GitHub Actions and Jenkins
- Ensure infrastructure compliance with security standards including FedRAMP and related guidelines
- Collaborate closely with AI engineering, product teams, and DevOps to meet infrastructure requirements
- Monitor infrastructure health and performance, implementing optimizations proactively
- Drive infrastructure best practices and mentor team members to foster technical excellence
Preferred Qualifications
- Previous experience as a Site Reliability Engineer (SRE), particularly in AI or ML contexts
- Experience with monitoring and logging tools (Prometheus, Grafana, Datadog, Jaeger)
- Knowledge of networking concepts and security best practices within cloud infrastructure
- Professional certifications in Kubernetes or cloud platforms (AWS, Azure, GCP)
Benefits
- Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA
- Unlimited PTO
- Industry-leading gender-neutral parental leave
- Paid Company Holidays
- Paid Sick Time
- Employee stock purchase program
- Disability and life insurance
- Employee assistance program
- Gym membership reimbursement
- Cell phone reimbursement
- Numerous company-sponsored events, including regular happy hours and team-building events