Senior DevOps Engineer

closed
3Pillar Global Logo

3Pillar Global

πŸ“Remote - Romania

Summary

Join Our Mission at 3Pillar: Elevate Your Impact! As a Senior DevOps Engineer, you are responsible for ensuring that our platform is stable and healthy. We break down barriers to run our products by fostering developer-run ownership and empowering developers to build resilient products.

Requirements

  • Bachelor’s degree in computer science, software engineering, or a similar field
  • Experience in Splunk and SignalFx
  • Experience with Amazon Web Services including RDS
  • Relevant data DevOps, SRE, or general systems engineering experience
  • Experience in managing large production platforms
  • Experience architecting and implementing data governance processes and tooling (data catalogues, lineage tools, role-based access control, PII handling)
  • Strong coding ability in Python or other languages like Java, C#, Golang, C, C++, Perl Ruby etc

Responsibilities

  • Plan, manage, and oversee all aspects of the production environment for all merchant loyalty use cases
  • Define strategies for all facets of observability
  • Identify areas of improvement in production
  • Ability to understand MTTR, SLO, SLI definitions and apply them to services
  • Respond to Incidents and improvise platform based on feedback and measure the reduction of incidents over time
  • Ensure reliable, fault-tolerant, efficiently scalable and cost-effective services and infrastructure
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health
  • Practice sustainable incident response and blameless postmortems
  • Ensures that batch production scheduling and process are accurate and timely
  • Able to create and execute queries to big data platforms and relational data tables to identify process issues or to perform mass updates, preferred
  • Ability to isolate problems between hardware and software
  • Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns
  • Support services before they go live through activities such as system design consulting, capacity planning and launch reviews
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity

Benefits

  • Flexible work environment – whether it's the office, your home, or a blend of both
  • Remote-first approach
  • Global team, learning from top talent around the world and across cultures, speaking English every day
  • Flexible time off
  • Mental health plans (country-dependent)
  • Professional services model enables to accelerate career growth and development opportunities - across projects, offerings, and industries
This job is filled or no longer available

Similar Remote Jobs