Remote Senior Site Reliability Engineer

Logo of Acquird.io

Acquird.io

📍Remote - Canada

Job highlights

Summary

Join our team as a Senior Site Reliability Engineer / Cloud Engineer to play a key role in running our Cloud Ops, ensuring uptime, reliability, CI/CD, Containerization, and automation. Our platform is growing, and we're looking for an outstanding engineer with experience in Azure, SRE, DevOps, and automation.

Requirements

  • 5 years plus of experience as a Site Reliability Engineer or DevOps Engineer, working with software and infrastructure
  • Bachelor’s degree in Computer Science, a related technical field, or equivalent practical experience
  • Experience in one or more of the following: Python, Javascript, Ruby, Groovy, PHP, or Bash
  • Experience in one of the cloud platforms: Azure, AWS, or GCP. (Azure is our main Cloud Platform)
  • Led and built cloud infrastructure projects (zero to one, automation, start up experience)

Responsibilities

  • Collaborate with software engineering teams to design, implement, and maintain CI/CD pipelines, enabling rapid and reliable software releases
  • Automate and optimize our infrastructure provisioning, configuration, and management processes using industry-standard tools and best practices
  • Implement and manage containerization and orchestration technologies to enhance scalability and resource utilization
  • Take ownership of the end-to-end availability and performance of our cloud infrastructure; proactively identifying potential issues, and implementing automation to prevent the recurrence of problems
  • Participate in an on-call rotation, ensuring our systems remain stable and responsive even during off-hours
  • Lead the development, implementation, and achievement of service-level objectives that are instrumental in maintaining product reliability
  • Maintain and enhance version control systems and repositories for codebase management
  • Steer and drive the SRE / DevOps roadmap, assuming full ownership while actively engaging in negotiation and strategic planning to ensure its successful execution
  • Stay current with industry trends, emerging technologies, and best practices in SRE, DevOps, and automation

Preferred Qualifications

  • Experience with high availability systems
  • Experience troubleshooting and debugging production code
  • Experience with application deployment and data pipelines
  • Understanding of distributed computing systems
  • Experience with Snowflake and/or relational databases

Benefits

  • Meritocracy
  • Leadership opportunities as we scale
  • Equity/Options grants
  • Generous Time Off
  • Flexible remote work policies
  • Platinum Benefits
  • Solid Health Insurance Plan on Day 1
  • Learn and Grow
  • Coaching/mentoring for your professional development

Job description

A Few Notes:

  • Profitable B2B SaaS company, teams are based out of North America

  • Role is 95% remote in Toronto (we meetup 1x a month).

  • Must be able to legally work in Canada (visa or sponsorship won’t be provided)

  • Our Platform is growing and we are looking to hire a Senior Site Reliability Engineer (SRE) / Cloud Engineer

  • Our main Cloud Platform is Azure(those with Azure will be prioritized first)

About Us:

We’re one of the top retail analytic platforms that help marketing teams/brands understand their retail data and run targeted media campaigns without writing code. We help our clients better understand their customers and improve their ROI on campaigns. One of our main customers is Home Depot.

  • Modern Cloud Stack: Azure is our primary cloud. CI/CD, Containerization, Distributed computing.

About You:

We are looking for an outstanding Senior SRE/Cloud Engineer who wants to play a key role in running our Cloud Ops (ensuring uptime, reliability, CI/CD, Containerization, and automation)

Example Responsibilities:

  • Collaborate with software engineering teams to design, implement, and maintain CI/CD pipelines, enabling rapid and reliable software releases.

  • Automate and optimize our infrastructure provisioning, configuration, and management processes using industry-standard tools and best practices.

  • Implement and manage containerization and orchestration technologies to enhance scalability and resource utilization.

  • Take ownership of the end-to-end availability and performance of our cloud infrastructure; proactively identifying potential issues, and implementing automation to prevent the recurrence of problems.

  • Participate in an on-call rotation, ensuring our systems remain stable and responsive even during off-hours.

  • Lead the development, implementation, and achievement of service-level objectives that are instrumental in maintaining product reliability.

  • Maintain and enhance version control systems and repositories for codebase management.

  • Steer and drive the SRE / DevOps roadmap, assuming full ownership while actively engaging in negotiation and strategic planning to ensure its successful execution.

  • Stay current with industry trends, emerging technologies, and best practices in SRE, DevOps, and automation.

Qualifications:

  • 5 years plus of experience as a Site Reliability Engineer or DevOps Engineer, working with software and infrastructure.

  • Bachelor’s degree in Computer Science, a related technical field, or equivalent practical experience.

  • Experience in one or more of the following: Python, Javascript, Ruby, Groovy, PHP, or Bash.

  • Experience in one of the cloud platforms: Azure, AWS, or GCP. (Azure is our main Cloud Platform)

  • Led and built cloud infrastructure projects (zero to one, automation, start up experience)

  • Nice to have: Experience with high availability systems.

  • Nice to have: Experience troubleshooting and debugging production code.

  • Nice to have: Experience with application deployment and data pipelines.

  • Nice to have: Understanding of distributed computing systems.

  • Nice to have: Experience with Snowflake and/or relational databases

Target FT Salary Range:

  • $130,000 - $180,000* base CDN a year, with Equity & Health Benefits

  • *Comp range is higher for Staff level

Benefits:

Meritocracy 💰

- Leadership opportunities as we scale

- Equity/Options grants

Generous Time Off  🏝

- Flexible remote work policies

Platinum Benefits 🧑🏻

- Solid Health Insurance Plan on Day 1

Learn and Grow 💻

- Coaching/mentoring for your professional development

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let Acquird.io know you found this job on JobsCollider. Thanks! 🙏