Staff Software Engineer, Core

OneSignal
Summary
Join OneSignal, a leading omnichannel customer engagement solution, as a Scylla- and database-focused Software Engineer. You will primarily own our Scylla clusters, ensuring their performance and reliability at scale. Responsibilities include optimizing Scylla clusters, designing schemas and queries, conducting data analysis, collaborating with engineers, participating in code reviews, resolving production issues, staying updated on industry trends, and participating in the on-call rotation. You'll need significant experience optimizing Scylla at scale and at least 8 years of software engineering experience. The role requires expertise in distributed systems, Docker, Kubernetes, and Rust, Golang, or a similar language. The New York and California base salary is between $190,000 and $210,000, with additional benefits.
Requirements
- Significant experience optimizing Scylla at scale
- At least 8 years of experience working as a software engineer
- Ability to work on a self-motivated, high-performing team
- Experience operating reliable production systems at scale
- Easily bored by running tasks by hand, and able to automate such tasks
- Experience with a distributed event streaming framework such as Apache Kafka
- Experience with Rust, Golang, or other lower-level languages. Those with backgrounds in interpreted languages may have a steeper learning curve with Rust, but if you have worked on core components of very high-throughput systems, please apply
- Ability to reason about how data flows through distributed systems
- Experience with Docker and Kubernetes
Responsibilities
- Optimize a scaling strategy for our high-performance Scylla clusters
- Design and optimize schemas, queries, indices, caches and more, primarily for Scylla, eventually for Postgres and ClickHouse
- Conduct data analysis and performance monitoring to identify areas for optimization and enhancement
- Collaborate closely with fellow engineers to architect and implement services required to back OneSignal's product offerings
- Actively participate in peer code reviews and Technical Design Spec reviews, providing valuable technical insights to continuously improve our code base
- Work with the team to efficiently resolve production issues and ensure the system scales smoothly to meet the growing demands of our customers
- Stay up-to-date with the latest industry trends and technologies, incorporating new ideas into our engineering processes
- Participate in the production on-call rotation
- Debug production issues by utilizing metrics, logs, and distributed traces
- Design synchronous and asynchronous APIs for communicating between services in a large distributed system
Preferred Qualifications
- Experience with any of Redis, Kafka, Postgres, ClickHouse, gRPC
- Experience profiling applications to improve performance in terms of time/CPU/memory
- Experience debugging issues in distributed systems
Benefits
- Competitive equity program
- Comprehensive and inclusive benefits