
Infrastructure Engineer, Platform

Canvas
Summary
Join Canvas, a leading-edge AI and computer vision technology company revolutionizing the remodeling, architecture, and interior design industry, as an experienced Infrastructure Engineer. Reporting to the Platform Team Lead, you will play a crucial role in scaling, securing, and maintaining our business-powering systems. This role demands collaboration with various engineering teams to ensure high-performing and resilient services. You will monitor, maintain, and improve our infrastructure, focusing on reliability, observability, performance, and security. Your expertise in Linux system administration, cloud infrastructure (AWS), CI/CD systems, and infrastructure-as-code will be essential. We are a global virtual-first company with a distributed team.
Requirements
- Itβs a joy to communicate with you in both written and spoken English
- You have excellent knowledge of Linux, including performance tuning and security hardening
- You are experienced with cloud platforms (especially AWS) and their associated best practices
- You have strong knowledge of modern information security principles and how to apply them effectively
- You are comfortable working with multiple monitoring and observability tools, both cloud solutions and on-prem (e.g.: Prometheus and Grafana, NewRelic, Datadog, Etc)
- You have experience writing infrastructure automation using tools such as Ansible and Terraform
- You are proficient with Git and have experience working with GitLab CI/CD pipelines
- You have a good understanding of container technologies and orchestration, including Docker and Kubernetes
- You can write or maintain automation scripts and tooling in Python
Responsibilities
- Monitor, maintain, and improve our infrastructure across various environments and services
- Ensure reliability and uptime of our systems by developing robust observability and alerting practices
- Lead Linux system administration and hardening efforts to enhance security posture
- Manage and improve cloud infrastructure (AWS-focused), including compute, storage, and networking
- Own and enhance our CI/CD systems and deployment automation (GitLab CI/CD)
- Implement and maintain infrastructure-as-code using tools like Terraform and Ansible
- Participate in incident response and postmortems, ensuring learnings are translated into action
- Provide technical support and ongoing maintenance for infrastructure-related projects and initiatives
- Stay ahead of emerging security risks and proactively identify opportunities to improve our infrastructure security
- Collaborate cross-functionally with engineering teams to support their infrastructure needs and deployments
Preferred Qualifications
- Experience setting up Machine Learning operations
- Experience working with Datadog, Prometheus, Grafana, or similar observability stacks
- Experience with relational databases such as PostgreSQL
- Familiarity with managing production-grade Kubernetes clusters
- A passion for clean documentation and systems architecture clarity
- Experience supporting compliance and security audits through infrastructure best practices
Share this job:
Similar Remote Jobs
