Summary
Join Sayari's data team and contribute to the development and maintenance of Sayari Graph, the company's flagship product. You will collaborate with product and software engineering teams to collect global data, maintain existing data pipelines, and build new ones. Responsibilities include writing and deploying crawling scripts, transforming data using Scala Spark, parsing data using Python, diagnosing and fixing bugs, and analyzing internal datasets. The ideal candidate possesses professional experience with Python and a JVM language, along with experience in data pipeline design and maintenance. Sayari offers a competitive compensation package, including comprehensive health benefits, generous time off, bonuses, and equity.
Requirements
- Professional experience with Python and a JVM language (e.g., Scala, Java, Kotlin)
- 2+ years of experience designing and maintaining data pipelines
- Experience using Apache Spark and Apache Airflow
- Experience with SQL and NoSQL databases (e.g., columns stores, graph, etc.)
- Experience working on a cloud platform like GCP, AWS, or Azure
- Experience working collaboratively with Git
- Understanding of Docker/Kubernetes
- Interest in learning from and mentoring team members
- Experience supporting and working with cross-functional teams in a dynamic environment
- Passionate about open source development and innovative technology
Responsibilities
- Write and deploy crawling scripts to collect source data from the web
- Write and run data transformers in Scala Spark to standardize bulk data sets
- Write and run modules in Python to parse entity references and relationships from source data
- Diagnose and fix bugs reported by internal and external users
- Analyze and report on internal datasets to answer questions and inform feature work
- Work collaboratively on and across a team of engineers using agile principles
- Give and receive feedback through code reviews
Preferred Qualifications
- Experience working with BI tools like BigQuery and Superset
- Understanding of knowledge graphs
Benefits
- 100% fully paid medical, vision, and dental for employees and their dependents
- Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
- Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions
- A strong commitment to diversity, equity, and inclusion
- Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
- A collaborative and positive culture - your team will be as smart and driven as you
- Limitless growth and learning opportunities
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.