Site Reliability Engineer

Argus Labs
Summary
Join Argus Labs, a company building the next generation of massively multiplayer online (MMO) games, and help empower players with extensive freedom to build, extend, and influence game worlds. We use World Engine, a state-of-the-art onchain game server framework leveraging a novel sharded rollup blockchain architecture. This role focuses on enhancing user experience for both developers and end-users, designing and building operational infrastructure, spearheading company-wide security, and ensuring the delivery, scalability, and reliability of backend infrastructure. The ideal candidate will have extensive experience in software deployment pipelines, cloud environments, and various programming languages. This position is open to APAC-based candidates only.
Requirements
- APAC-based candidates only: must reside in and have the legal right to work in an APAC country
- 4+ years of experience managing software deployment pipelines in a production cloud environment
- Proficiency in Go, JavaScript, Python, or other object-oriented programming languages
- Strong scripting skills with Bash
- Hands-on experience with writing and maintaining complex Infrastructure-as-Code (Terraform, Pulumi, etc.)
- Expertise in CI/CD: building and maintaining performant pipelines using GitHub Actions
- Production Kubernetes management: deployment best practices with Helm, etc.
- Database infrastructure management: setup, maintenance, and migration coordination
- Excellent communication and time management skills
- Ability to design and implement highly available, reliable systems
Responsibilities
- Work closely with stakeholders company-wide to provide services that enhance the user experience for both the development team and our end-users
- Design and build operational infrastructure to support games, automating where possible
- Spearhead company-wide security culture and architecture to keep our platform secure
- Own delivery, scalability, and reliability of our backend infrastructure
- Advise and collaborate with the rest of the engineering team to ensure we are building safe, secure, and reliable products
Preferred Qualifications
- Experience in game development and game server hosting, ensuring high-performance and scalable infrastructure
- Hands-on expertise with Buildkite for CI/CD automation and pipeline optimization
- Knowledge of secrets management systems like HashiCorp Vault, Infisical, and similar tools to safeguard sensitive data
- Experience in securing cryptographic keys using KMS or equivalent technologies to enhance security protocols
- Proficiency in Layer 3 network optimization, including geo-based routing and traffic management with Cloudflare
- Familiarity with deploying and maintaining blockchain infrastructure, such as full nodes, validator nodes, and other blockchain-related services
Benefits
- Flexible PTO (2 weeks required) + holidays
- 100% employer-covered medical, dental, and vision insurance (US)
- 401k (US)
- Up to $1500 desk set-up stipend
- Company retreats