Senior Platform Engineer

Collibra
Summary
Join Collibra's Platform Infrastructure Engineering team and become a key member responsible for building and operating the cloud foundation for all Collibra services. This role is crucial in evolving the multi-cloud, Kubernetes, IaC, Golang automation, and GitOps infrastructure environment. You will contribute to developer enablement, platform architecture, operational excellence, and continuous improvement. The ideal candidate possesses 3+ years of experience in Platform Engineering, SRE, or related fields, along with proven experience in Kubernetes, GitOps, IaC, and major cloud platforms. Collibra offers a collaborative culture focused on continuous improvement and provides opportunities for professional growth. This position requires US citizenship and eligibility to work in the USA without sponsorship.
Requirements
- 3+ years of experience in Platform Engineering, SRE, or infrastructure-focused roles with a Bachelor's degree in Computer Science or a related technical field, OR equivalent practical experience demonstrating the skills below
- Proven experience designing, building, and managing production services using Kubernetes and gitops / IaC at a scale of between tens and hundreds of Kubernetes clusters
- Experience managing production workloads and infrastructure on major cloud platforms (AWS, GCP, Azure)
- Hands-on experience operating Kubernetes clusters and managing containerized services in production
- Demonstrable experience writing and maintaining Infrastructure as Code (IaC), preferably with Terraform, and proficiency in Golang or Python for automation
- Must be eligible to work in the USA without requiring sponsorship
- Because this role supports the US government, it is required that this candidate be a US citizen who resides on US soil
- Experienced in applying systematic troubleshooting and critical thinking to diagnose root causes within distributed cloud infrastructure and propose effective solutions
- Able to demonstrate initiative in learning and utilizing evolving technologies related to cloud platforms, container orchestration, GitOps, IaC and automation
- An effective communicator in articulating complex technical details, designs, and trade-offs clearly to both technical peers and potentially other stakeholders within a distributed team setting
- Possess a mindset geared towards efficiency, proactively seeking and evaluating ways to automate manual processes and improve system reliability
- Independently manage and complete assigned work, ensuring deliverables consistently meet defined requirements and acceptance criteria
Responsibilities
- Develop controllers and automations, work with development teams on refinements to platform capabilities
- Contribute to the overall architecture of the platform infrastructure, collaborating with other infrastructure engineers using GitOps, IaC and Kubernetes
- Participate in on-call rotations, troubleshoot complex service issues, implement security best practices, and maintain clear documentation (architecture, procedures)
- Stay current with platform engineering trends and infrastructure automation, identifying and implementing improvements
Preferred Qualifications
- CKA / CKAD
- Istio
- ArgoCD
- Deep experience with networks, linux and Kubernetes
- Experience with monitoring/logging tools and observability spans / traces (e.g., Datadog, Grafana, Honeycomb)
- Proficient in creating controllers and other automation patterns to manage Kubernetes resources
Benefits
- Equity ownership at every level
- Bonus potential
- A Flex Fund monthly stipend
- Pension/401k plans