Summary
Join Loadsmart, a rapidly growing logistics technology company valued at over $1 billion, as a Sr. Site Reliability Engineer. In this role, you will be responsible for building and maintaining the company's internal platform, ensuring operational excellence and empowering the engineering team. You will analyze, propose, and implement safer systems and processes, collaborating closely with engineering squads across platform engineering to guarantee the reliability and security of our applications. This position offers the opportunity to work remotely from anywhere in Brazil and be part of a creative and collaborative development team.
Requirements
- Over 5 years of experience in cloud computing and SRE/DevOps
- Proficient in English communication (both written and spoken) to collaborate in an international team with native and non-native English speakers
- Detail-oriented with high initiative and self-motivation
- Strong understanding of software engineering principles and how systems work under the hood
- In-depth knowledge of modern networking and operating systems
- Proficiency in AWS, cloud environments, containers, Kubernetes, Docker, and DevOps engineering, including managing tests and CI/CD pipelines
- Familiarity with automation tools and provisioners like Terraform, Ansible, or Chef
- Solid troubleshooting and system engineering experience in UNIX/Linux production environments
- Experience with monitoring, alerting, and incident management
- Proficiency in automating tasks with scripting languages such as Python or Bash
Responsibilities
- Collaborate with and support our creative, tight-knit development team
- Design, deploy, and operate Loadsmart's critical systems while balancing reliability, cost, and agility
- Play a key role in driving reliability projects with engineering teams
- Utilize your intuitive problem-solving skills and contagious positive attitude to tackle challenging and exciting issues, inspiring those around you
- Collect metrics and understand their business impact, encouraging the team to do the same
- Perform troubleshooting and root-cause analysis of system operation issues
- Be accountable for the platform's Service Level Agreements (SLAs) and Service Level Objectives (SLOs)
- Provide infrastructure support during off-hours as needed
- Take ownership of software infrastructure projects
- Seek, give, and receive constructive feedback through code and specification reviews
Preferred Qualifications
- Experience with or exposure to PostgreSQL and DBA responsibilities is a plus
Benefits
- Competitive base salaries - we believe in rewarding top talent
- Extremely competitive equity package - become a shareholder in our company!
- Loadie Time Off - flexible PTO