Data Engineer

Emplifi Logo

Emplifi

📍Remote - Czech Republic

Summary

Join Emplifi's Data Engineering team and build the data platform powering company-wide decision-making. You will build and maintain reliable data pipelines using PySpark, SQL, and AWS, develop and scale the company's Data Lake, and work with diverse data sources. Collaborate with engineers, analysts, and product teams to deliver data solutions, contribute to internal tooling, and write clean, tested code. The ideal candidate possesses strong Python, Apache Spark, and SQL skills, along with experience in distributed systems and Git. This role offers professional growth, cutting-edge technology exposure, and a collaborative international environment.

Requirements

  • Python – clean code, testing, and ability to read existing codebases
  • Apache Spark – development and basic performance tuning
  • SQL – good understanding and hands-on experience
  • Git – solid version control habits
  • Strong English – comfortable working and communicating in an international team
  • Distributed systems mindset – solid understanding of fault tolerance, data partitioning, shuffling, and parallel processing

Responsibilities

  • Build and maintain reliable data pipelines (batch and streaming) using PySpark, SQL, and AWS
  • Help develop and scale our company-wide Data Lake on AWS and Databricks (operating at petabyte scale)
  • Work with data from diverse sources: APIs, file systems, databases, event streams
  • Contribute to internal tooling (e.g., schema registries) to improve workflows
  • Write clean, tested code and participate in code reviews
  • Collaborate closely with other engineers, analysts, and product teams to deliver data solutions
  • Learn and experiment with new tools and best practices in modern data engineering

Preferred Qualifications

  • Delta Lake , Databricks
  • Apache Airflow or similar orchestration tools
  • Amazon S3 , AWS experience overall
  • Streaming & messaging technologies – Kafka, Kinesis, RabbitMQ
  • Python libraries for RESTful APIs
  • Data modeling
  • PostgreSQL , ElasticSearch
  • Familiarity with JVM languages (e.g., Java, Scala)

Benefits

  • Unlimited PTO
  • Multisport card
  • Home office working

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.