Remote Site Reliability Engineer

Logo of TRG Research and Development

TRG Research and Development

πŸ“Remote - Worldwide

Job highlights

Summary

The job is for a Site Reliability Engineer at TRG Research and Development, where the employee will collaborate with teams to ensure production stability, automate deployment processes, develop tools, manage incidents, and implement strategies to maintain application uptime. The candidate should have relevant experience and skills in IT engineering.

Requirements

  • At least 2 years of experience in a similar role (DevOps, SRE, System Engineer)
  • Experience with IaC practices (Terraform)
  • Experience with Docker and Kubernetes
  • Experience with one of the major cloud providers (AWS, Azure)
  • Worked with Linux Administrative Skills
  • Proven work experience with Python is mandatory

Responsibilities

  • Collaborate with Customer Support and DevOps teams to establish SLA, SLO, and SLI, ensuring clear expectations for internal and external customers
  • Maintain 24/7 production stability year-round
  • Deploy, configure, and monitor production environments
  • Automate production deployments, validations, and reporting processes
  • Develop and maintain tools for production operations
  • Manage and document incidents
  • Develop disaster recovery automation
  • Handle Mean Time to Respond (MTTR) and Mean Time to Detect (MTTD) metrics
  • Implement strategies to ensure 100% application uptime
  • Work with development and QA teams to enhance code quality and resilience

Preferred Qualifications

  • Experience with monitoring tools like Prometheus, New Relic or similar
  • Experience with web-related technologies (Web applications, Web Services, Service Oriented Architecture) and network/web-related protocols
  • Being able to understand and implement complex networking solutions between different cloud providers and/or bare metal infrastructure
  • Configure and manage data sources like Mongo, Elasticsearch, Redis, ArangoDB, etc

Benefits

  • Working from home
  • Flexible hours
  • Yearly performance bonus
  • Paid medical insurance
  • Daily lunch allowance
  • Sport/Gym(Exercise) allowance
  • Udemy unlimited subscription
  • Onboarding plan and training
  • Equipment support
  • No dress code
  • Gifts and rewards for celebrating birthdays, anniversaries, and personal milestones
  • Happy hours, coffee time, online team building, company events, and much more to promote team bonding and of course to have fun!
  • Fresh fruit, snacks, coffee, and tea at the office

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let TRG Research and Development know you found this job on JobsCollider. Thanks! πŸ™