Senior DevOps Engineer, SRE

Fundraise Up
Summary
Join Fundraise Up, a global fundraising platform, as a DevOps Engineer/SRE to make a significant impact on our infrastructure. You will be a generalist working across our full stack, from server provisioning and CI/CD pipelines to observability and database management. Ensure our platform's scalability, security, and reliability. The role involves managing and automating infrastructure, building and maintaining CI/CD pipelines, administering data stores and messaging systems, enhancing our observability stack, troubleshooting complex issues, contributing to Kubernetes rollout, and securing our infrastructure. We offer a collaborative environment, a clear product vision, and a transparent company culture. The team is distributed, with members across various countries, and values thoughtful collaboration and strong engineering practices.
Requirements
- 5+ years of experience as a DevOps Engineer, SRE, or Linux Systems Administrator
- A strong foundation in Linux (we use Ubuntu), including core CLI troubleshooting tools
- Solid experience with configuration management tools, particularly Ansible
- Proficiency in building and maintaining complex CI/CD pipelines (Jenkins experience is a major plus)
- A good understanding of networking fundamentals, including TCP/IP and firewall configuration (iptables)
- Experience with monitoring and observability principles (Prometheus/VictoriaMetrics stack preferred)
- Scripting ability in Bash or Python
- A high sense of ownership, responsibility, and attention to detail. We value professionals who are proactive and reliable
Responsibilities
- Manage and automate our bare-metal and VM infrastructure using Ansible and custom scripting (Bash/Python)
- Build and maintain complex CI/CD pipelines in Jenkins to ensure smooth and reliable deployments
- Administer and optimize our data stores (MongoDB, ClickHouse) and messaging systems (Kafka, Redis)
- Enhance our observability stack (VictoriaMetrics, Grafana, Graylog) to improve monitoring, alerting, and troubleshooting capabilities
- Troubleshoot complex issues at the OS, network, and application levels
- Contribute to the rollout and management of Kubernetes on our bare-metal servers
- Secure our infrastructure, including configuring firewalls (iptables) and managing disk encryption
Preferred Qualifications
- Data Systems: Managing ClickHouse, MongoDB, Kafka, JupyterHub, or Airflow
- Observability: VictoriaMetrics or Graylog at scale
- Storage: Software RAID, LVM, and Full Disk Encryption (Clevis/Tang)
Benefits
- 31 days off
- 100% paid telemedicine plan
- Home Office Setup Assistance: the company offers assistance with purchasing furniture (office chair, office desk, monitor) and other items to create a comfortable workspace
- English learning courses
- Relevant professional education
- Gym or swimming pool
- Co-working
- Remote working