Remote Site Reliability Engineer

Logo of Pacifica Continental

Pacifica Continental

πŸ“Remote - Worldwide

Job highlights

Summary

Join our engineering team as a Site Reliability Engineer to maintain shared cloud resources, improve internal processes, and collaborate with cross-functional teams. We're passionate about DevOps, security, and automation, and we want you to be too.

Requirements

  • Hands-on Engineering
  • 5+ years of hands-on experience with a majority of the following technologies, along with a willingness to become proficient in the remaining areas
  • Windows and Linux Servers
  • VMware
  • Cloud platforms, preferably with Azure
  • Active Directory
  • Secrets management with Consul and Vault or similar systems
  • Configuration management tools like Salt, Ansible and Terraform
  • Firewalls and load balancers such as F5
  • Web servers, including IIS and NGINX
  • Database Server Infrastructure like Microsoft SQL Server and PostgreSQL
  • Application Performance Monitoring with tools like New Relic
  • Infrastructure monitoring with tools like Sensu, SolarWinds, Nagios, or Azure App Insights
  • CI/CD tools like TeamCity, Octopus Deploy, Concourse, Azure DevOps, or GitHub Actions
  • Log Aggregation tools like SumoLogic or Splunk
  • Network theory and protocols such as DNS, DHCP, proxy servers, and firewalls
  • Security operations with tools for SAST, DAST, RAST, and WAF
  • Infrastructure as Code or automation experience
  • Proficiency, high-comfort, and familiarity with
  • One or more programming languages, such as C#, JavaScript, Python or Go
  • One or more scripting languages, such as PowerShell and BASH
  • Command line tools such as (git, netcat, npm, terraform, etc.)

Responsibilities

  • Make improvements to internal processes to reduce lead time and increase deployment frequency
  • Identify improvements to the quality, security, and performance of our infrastructure
  • Increase the velocity with which teams deliver, leveraging expertise from various functional disciplines
  • Identify how to remediate production incidents more quickly and safely while reducing the frequency of outages
  • Actively engage with other teams and departments to collaborate on best practices and implementation strategy
  • Adhere to and advocate for best practices, including Infrastructure as Code, monitoring, high availability, disaster recovery, security, and DevOps methodologies
  • Create SLIs, SLOs, and SLAs
  • Contribute to capacity planning, advise and consult with teams who will be load/stress testing
  • Keep up with industry innovations, recommending new tools or practices when appropriate
  • Actively mentor peers, developing their expertise and inspiring others to innovate
  • Provide timely assistance and remediation solutions during critical situations and production incident
  • Document and share β€œlessons learned” from production, including root cause analysis
  • Explore new ways of improving communication between other Site Reliability Engineers and with other teams
  • Write and maintain architectural, stakeholder, and policy documentation

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let Pacifica Continental know you found this job on JobsCollider. Thanks! πŸ™