Senior Cloud Ops Engineer
Entrata
Remote - United States
Summary
Entrata, a global leader in property management software, is hiring a Senior Cloud Operations Engineer. The role involves enhancing the reliability and scalability of production systems, mentoring teams, collaborating with other departments, and optimizing incident response processes. The company offers flexible work options, comprehensive health benefits, retirement plans, wellness initiatives, family-centric leave policies, and various other employee benefits.
Requirements
- 7+ years of software development experience, including at least 2 years in a senior role focused on Site Reliability Engineering (SRE), DevOps, or platform automation
- Strong expertise in building and expanding Application Performance Monitoring (APM) systems such as New Relic or Dynatrace
- In-depth understanding of modern cloud-native architecture, with experience in building, deploying, and managing distributed systems on AWS or other cloud providers
- Proficiency with CI/CD tools such as GitHub Actions, CircleCI, Jenkins, or similar
- Hands-on experience with Kubernetes (K8s) for orchestration and Argo CD for continuous deployment in cloud environments
- Strong analytical skills for debugging, troubleshooting, and resolving complex technical problems
- Fluency in one or more programming languages, along with familiarity with scripting languages
- Ability to manage on-call duties and respond to out-of-band requests as needed
Responsibilities
- Lead efforts to enhance the reliability, repeatability, and flexibility of our production systems by developing and utilizing software tools that streamline operations
- Mentor and promote a CloudOps mindset across the development organization, fostering a culture of site reliability engineering and DevOps best practices
- Collaborate with Engineering, Architecture, and InfoSec teams to improve operational health, security, growth, usability, and quality of our applications
- Develop and implement comprehensive monitoring, logging, tagging, and other feedback mechanisms to ensure transparency and improve the customer experience
- Continuously enhance system performance and reliability by creating and maintaining frameworks for Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs)
- Optimize incident response processes through alerting, troubleshooting, automation, playbooks, and root-cause analysis, and actively participate in the Incident Response team
- Leverage cloud technologies to improve performance, reliability, quality, and cost-efficiency
- Drive the deployment, scaling, and management of distributed systems on cloud platforms like AWS, with a focus on cloud-native architecture and application performance
Preferred Qualifications
- 5+ years of experience as a Site Reliability Engineer in a cloud environment, preferably with AWS
- AWS Certifications
- Extensive experience in a high-volume or critical production environment
- Expertise in networking, network analysis, and performance troubleshooting using tools like tcpdump
- Proven ability to analyze and troubleshoot large-scale distributed systems effectively
Benefits
- Flexible and transparent culture with remote and hybrid work options, generous vacation time, and frequent company recharge days for work-life balance
- Comprehensive medical, dental, and vision coverage, including fertility benefits, available for eligible employees and their families
- HSA/FSA options and employer-paid disability benefits provided for eligible employees
- Access to 401(k) or similar retirement plans with employer matching for eligible employees, ensuring long-term financial security
- Wellness initiatives promoting physical and mental well-being, access to an onsite gym at HQ, mental health resources, wellness challenges, and employee assistance programs
- Family-centric leave policies supporting new parents during significant life events
- Entrata Cares programs offering opportunities for volunteerism, charity events, and giving back to our community
- Exclusive Previ cell phone plan, plus discounts on services and through local business partnerships
- Bi-annual swag drops for employees
This job is filled or no longer available