Summary
Join OpenTable's Data Engineering team as a remote Data Engineer, contributing to various initiatives for diners, restaurants, and internal stakeholders. You will build and maintain data pipelines, develop generative AI for content summarization and question answering, and collaborate with cross-functional teams. This role requires proven data engineering knowledge, experience with production systems, and proficiency in Python and SQL. The position offers a competitive salary, comprehensive benefits, and opportunities for professional growth. Initially remote, the role will transition to a hybrid model in downtown Toronto. OpenTable is committed to fostering a welcoming and inclusive work environment.
Requirements
- Proven knowledge of the technologies and principles of data engineering
- Experience building, testing, deploying, monitoring, and maintaining production systems
- Proficient in Python, at least one strongly typed language, and SQL
- Experience in building scalable batch and streaming data pipelines
Responsibilities
- Generative AI for content summarization and question answering
- Data pipelines which provide insights to restaurants on their guests, revenue, and operations
- Diner analytics projects which power ML, Sales, and Marketing workstreams
- Integrations with partners to ingest and export data to support business critical partnerships
- As a Data Engineer you will work closely with Product Managers, Business Intelligence Analysts, Infrastructure SREs, and Product Engineers to design robust and well governed solutions
- You will lead both data pipelines and production services to deliver data based products to both internal and external customers
Preferred Qualifications
- Familiarity with our stack (PySpark, Databricks, Airflow, Python, K8s, AWS)
- Machine Learning expertise (especially ranking, recommendations, classification, NLP, LLMs, RAG, Agentic development)
- Data/ML Ops Experience
- Java/Python proficiency
- Familiarity with A/B Testing best practices
- Track record of project leadership, cross-team collaboration, and coordination with Product Managers and other stakeholders
- Pipelines: Spark, Airflow, Databricks, Snowflake, S3, Delta Lake
- ML: PyTorch, XGBoost / CatBoost, Large Language Models
- Deployment: Docker, Kubernetes, Helm, Prometheus, Graphite/Grafana
- Infrastructure: Kafka, ElasticSearch, Postgres, MongoDB, Redis, Qdrant
- Build: Flask, Gunicorn, Poetry, TeamCity, Spring, Maven
Benefits
- Paid Time Off - 20 days a year
- Birthday/celebration PTO - 1 day
- Annual company weeks off
- Flexible sick time off
- Paid volunteer time
- Parental Leave Benefits
- Dental & Vision Insurance
- Life & Disability Insurance
- Group RRSP and DPSP
- Major Medical Insurance (dependent care options)
- Annual cash bonus
- Equity grant
- Health benefits
- Flexible spending account
- Retirement benefits
- Life insurance
- Paid time off (including PTO, paid sick leave, medical leave, bereavement leave, floating holidays and paid holidays)
- Parental leave and benefits