Senior Data Engineer

Brightwheel Logo

Brightwheel

πŸ’΅ $81k-$93k
πŸ“Remote - Brazil

Summary

Join Brightwheel's Data Engineering team as a Staff Data Engineer and play a key role in building and scaling a web scraping platform. You will be responsible for crafting and implementing a best-in-class web scraping strategy and infrastructure, building pipelines to gather millions of records, and ensuring data quality. This role requires strong technical skills in Python, web scraping tools, and data processing technologies. You will work with a fully remote team and enjoy competitive compensation, equity, and premium benefits, including comprehensive healthcare, generous parental leave, flexible PTO, 401(k), and a wellness stipend. Brightwheel is committed to creating a diverse and inclusive work environment.

Requirements

  • 3+ years of work experience as a data engineer/full stack engineering, coding in Python
  • 3+ years of experience building web scraping tools in python, using Beautiful Soup, Scrapy, Selenium, or similar tooling
  • 1-3 years of deployment experience with CI/CD
  • Strong experience of HTML, CSS, JavaScript, and browser behavior
  • Experience with RESTful APIs and JSON/XML data formats
  • Knowledge of cloud platforms and containerization technologies (e.g., Docker, Kubernetes)
  • Advanced understanding of how at least one big data processing technology works under the hood (e.g. Spark / Hadoop / HDFS / Redshift / BigQuery / Snowflake)
  • Excellent analytical, problem solving, and troubleshooting skills to manage complex process and technology issues without guidance

Responsibilities

  • Use modern tooling to build robust, extensible, and performant web scraping platform
  • Build thoughtful and reliable data acquisition and integration solutions to meet business requirements and data sourcing needs
  • Deliver best in class infrastructure solutions for flexible and repeatable applications across disparate sources
  • Troubleshoot, improve and scale existing data pipelines, models and solutions
  • Build upon data engineering's CI/CD deployments, and infrastructure-as-code for provisioning AWS and 3rd party (Apify) services

Preferred Qualifications

  • 2+ years of experience developing in Airflow
  • 2+ deploying Infrastructure as Code within AWS or similar
  • 2+ deploying microservices and/or APIs within cloud environment
  • 1+ years using ML / AI workflows for data enrichment and/or sentiment analysis by integrating scraped data into ML pipelines

Benefits

  • Healthcare Coverage: Medical, dental, and vision benefits typically valued at $15,000+, with brightwheel providing high coverage for both employees and families
  • Generous Paid Parental Leave for growing families
  • Flexible Paid Time Off (PTO) to recharge and relax
  • 401(k) Enrollment to help you plan for the future
  • Monthly Wellness & Productivity Stipend to support your well-being
  • Competitive compensation, benchmarked against similar-stage growth companies
  • Equity & Ownership
  • Work from where you thrive

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.