📍Taiwan
Site Reliability Engineer

Appspace
📍Remote - Spain
Please let Appspace know you found this job on JobsCollider. Thanks! 🙏
Summary
Join Appspace's Cloud Operations team as a Site Reliability Engineer and play a key role in maintaining our cloud platform, which includes Kubernetes, Microservices, MongoDB, RabbitMQ, MySQL, and more. You will automate maintenance tasks, deploy new features, troubleshoot performance issues, and collaborate with other teams. This mission-critical role requires strong experience in Python, shell scripting, Kubernetes, and Helm. The position offers opportunities for growth and development within a rapidly growing company. While flex time is offered, on-call coverage is required weekly. This role involves working with a global team and may include occasional travel.
Requirements
- Must be able to learn new technologies quickly and a desire to be a life-long learner
- Must communicate well and adapt to working well with others across different countries and cultures
- Strong background in Containers, Kubernetes, Helm, Linux, Python coding, and some experience with Windows Server OS and MacOS are a must
- Solid troubleshooting experience and the ability to reason through a process workflow to identify a fault or odd behavior (i.e., spending time following log trails) is a must
- Must be flexible on occasionally attending “off-hour” meetings (we’re a global team supporting a global customer base!)
- Open to quarterly travel up to 5%
Responsibilities
- Automating maintenance tasks for our Cloud Platform, therefore strong experience in Python and shell scripting is a must
- Deploying new features and releases of our software into Kubernetes via Helm, so strong experience in Kubernetes and Helm is a must
- Troubleshooting performance issues or errors thrown by the cloud platform or application, and either resolving the underlying cause, or forwarding your research to Engineering to address in the product
- Actioning Request Tickets from other teams in support of their needs to enable and prepare for upcoming releases
- Monitoring the application’s performance, uptime, and cloud infrastructure’s performance, looking for improvement opportunities, and proactively taking action to solve any negative trends before they become issues
- Lead, Participate, or Execute within the incident management process when alerts fire, and quickly ascertain root cause, resolve the issue, and find new and creative solutions to prevent recurrence
- Configure, Monitor, Research, and Evaluate workload performances both on Google Cloud Platform and Microsoft Azure Clouds
- Collaborating with our Development and Quality Assurance teams to address issues in the product and platform
- Documenting new or updating existing processes and procedures to share knowledge and improve on standardized approaches to solution
Preferred Qualifications
- Experience with Google Cloud Platform, Google Kubernetes Engine, Google Compute Engine, and Google Storage is highly desired, but comparable experience with AWS or Azure will be considered
- Experience with administering MySQL & MongoDB preferred
- Experience with administering message brokering systems like RabbitMQ preferred
- Experience with Build pipeline tools and the Atlassian suite (JIRA, Confluence, Bitbucket/Git, Bamboo, Octopus)
- Experience with monitoring and alerting platforms, especially StackDriver
- Experience with HashiCorp Terraform
- Experience with IIS
Benefits
- Competitive salaries
- Employer paid medical, dental and vision coverage
- Mental health resources
- Flexible work schedules
- Remote work opportunities
- A casual dress work environment
- Reduced working hours in August
- Appspace Quiet Fridays (No non-essential internal meetings scheduled)
- Gym allowance
- Training allowance
- Training days off
Share this job:
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Similar Remote Jobs
📍China
📍Singapore
📍Worldwide
📍Japan
💰$60k-$120k
📍Asia
📍India
📍Australia
📍Latin America