Summary

Join us to help empower commerce brands with the best end-to-end customer and delivery experience. Stord is looking for a mission-driven Senior SRE to be a driving force behind an exceptionally resilient, efficient, and secure infrastructure and platform.

Requirements

Proven experience as a Senior DevOps Engineer or Senior Site Reliability Engineer
Strong expertise in cloud platforms such as AWS, GCP or Azure
Strong experience with CI/CD tools (Github Actions, GitLab CI, CircleCI) and version control systems (Git)
Proficiency with infrastructure-as-code tools (e.g., Terraform, Ansible, Cloudformation)
Hands-on experience with container orchestration tools like Docker and Kubernetes
Solid understanding of networking, security, and system engineering
Experience with monitoring and logging tools (e.g., Datadog, Prometheus, Grafana, ELK stack)
Strong scripting skills in languages such as Python, Shell or similar
Familiarity with security best practices and compliance requirements
Excellent problem-solving and troubleshooting skills
Ability to work collaboratively in a fast-paced, agile environment
Passion for building the highest-quality solutions for the long term that delight the customer (both internal and external customers)
Automation first mindset
High degree of ownership and pride for work

Responsibilities

Collaborate with cross-functional teams to design and implement CI/CD pipelines that automate fast and safe delivery of software to our customers, enable experimentation, create fast feedback loops and developer self-service capabilities
Lead efforts in automating deployment, monitoring, and infrastructure management
Proactively identify and resolve performance bottlenecks, system failures, and security vulnerabilities
Minimize or eliminate degradations and failures related to fault tolerance, security, availability, and performance
Develop SLOs and SLIs to manage risk through continuous monitoring and measurement of system performance
Build, manage and deploy highly available, self-healing, customer facing production infrastructure and applications (microservice and event based architectures) using Docker, Kubernetes, Helm and Terraform
Leverage 12 Factor App methodology when building and deploying all our services and systems
Implement best practice infrastructure as code (IaC) principles for configuration management and deployment of infrastructure
Enhance operational efficiency by identifying repetitive tasks and developing automation to eliminate toil work
Implement robust metrics, monitoring and alerting for proactive issue identification and resolution
Participate in incident response, on-call rotation and post-incident reviews to ensure 24/7 availability of critical systems and to learn from failures and continuously improve system reliability
Implement and enforce security best practices for infrastructure and applications
Collaborate with security teams to ensure compliance with industry standards and regulations
Empower others by sharing knowledge through documentation, training, and mentorship

Benefits

401(k)
Medical, Dental, and Vision Insurance
Life and Disability Insurance
Health Savings Account (HSA) option
Employee Assistance Program (EAP) - Mental Health Resources
Paid Parental Leave
Gym Stipend
Paid Time Off
Paid holidays

Remote Site Reliability Engineer

STORD

Job highlights

Summary

Requirements

Responsibilities

Benefits

Remote

DevOps

Senior

Share this job:

Similar Remote Jobs

Senior Infrastructure Engineer, Site Reliability Engineer

Flex

Remote

DevOps

Senior

Software Engineer, Site Reliability Engineer

Tailor

Remote

Software Development

Mid-level

Senior Site Reliability Engineering Engineer

Binance

Remote

DevOps

Senior

Site Reliability Engineer, DevOps Engineer

Wizeline

Remote

DevOps

Mid-level

Lead Site Reliability Engineer, Infrastructure Security

MongoDB

Remote

DevOps

Senior

Lead Site Reliability Engineer, Infrastructure Security

MongoDB

Remote

DevOps

Senior

Site Reliability Engineer

Sezzle

Remote

DevOps

Mid-level

SRE Senior/Expert Site Reliability Engineer

Lucca

Remote

DevOps

Senior

Site Reliability Engineer

Masabi

Remote

DevOps

Mid-level

Senior Site Reliability Engineer

Input Output

Remote

DevOps

Senior