Summary
Join Printful and Printify, a global on-demand powerhouse, as a Site Reliability Engineer. You will design, develop, and maintain highly available and distributed solutions and platforms. This role involves working across multiple platforms and brands in an international environment. Daily tasks include configuring container platforms, building services across various environments, collaborating with application teams, troubleshooting issues, and participating in on-call support. You will be responsible for your own initiatives and contribute to knowledge sharing. Staying up-to-date with the latest technologies and promoting a DevOps culture are essential.
Requirements
- Solid experience in systems administration and maintenance in UNIX/Linux environments
- Strong Kubernetes administrator experience, building and operating multi-cluster setup running production workloads
- Solid experience with the development, configuration, maintenance, and enhancements of both relational databases like PostgreSQL, MySQL, and NoSQL databases as MongoDB
- Solid experience in at least two areas within AWS IAM, Networking, Logging, or security
- Experience with AWS file storage, block storage, and distributed object storage solutions
- Experience with setup and configuration of Elasticstack, Prometheus, Grafana, or similar
- Skills to maintain the application, infrastructure, and system health awareness using monitoring tools
- Knowledge of at least one scripting language, such as Bash, Python, Go, etc
- Excellent troubleshooting skills in the system, network, and application related are a must
- High self-management and ability to prioritize in a dynamic start-up environment
Responsibilities
- Configure and operate our container platform and data storage solutions in AWS
- Build and configure various services across SAAS, PAAS, and IAAS environments, like monitoring and logging solutions and CI/CD
- Work closely with Applications Teams on various operational and improvement cases – develop, configure, and continuously improve solutions based on the team’s needs
- Be responsible for the design and execution of your own initiatives
- Troubleshoot and resolve any service issues related to operating systems and servers
- Maintain response and resolution of cases/tickets received in queue against established SLAs
- Take part in on-call support
- Support other groups of system engineers, contribute to knowledge sharing and meetups
- Stay up to date with the latest technologies and trends, propose improvements, and be an example of DevOps culture
Preferred Qualifications
- Experience with setting up ServiceMesh is a plus
- Love all things automation, have a passion for open-source technologies and tools, always learning and developing skills and knowledge
- A team player with plenty of ideas and enthusiasm, a proactive and can-do attitude
Benefits
- Be part of a friendly, inclusive, and global team
- An opportunity to work remotely or in a modern and welcoming office in Rīga or Tallinn
- Flexible working hours (start your day as late as 11 a.m.)
- Health insurance
- Access to mentorship, internal meetups, and hackathons both on-site and online
- Exciting team-building events and parties you’ll never forget!
- Free and healthy lunch if you work from the Rīga office
- Design and order your own merch using our platforms with employee discount
- Apple MacBook laptop as your standard work equipment
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.