Web Scraping Specialist

Logo of SEON

SEON

πŸ“Remote - Hungary

Job highlights

Summary

Join SEON, a leading fraud prevention company, as a Web Scraping Specialist to build cutting-edge anti-money laundering (AML) solutions. You will extract data from open-source web platforms to improve SEON’s fraud prevention and risk detection tools. This remote role, based in the European Union (CET), involves developing and maintaining a scalable scraping pipeline using Python, implementing scraping solutions with tools like Selenium and BeautifulSoup, and collaborating with data scientists and engineers. You will manage proxy services, apply advanced client-faking techniques, and stay updated on the latest web scraping technologies. SEON offers competitive benefits including flexible hours, generous holiday allowance, learning and development opportunities, private health insurance, language courses, and enhanced parental leave.

Requirements

  • 2-4 years of experience in web scraping, with a strong focus on data extraction from complex, dynamic websites and unstructured resources
  • Proficient in Python and libraries such as Selenium, BeautifulSoup, Scrapy, or equivalent frameworks
  • Experience working with third-party proxy providers and rotating proxies to handle scraping challenges
  • Knowledge of client faking techniques (e.g., user-agent manipulation, cookie management, header spoofing)
  • Familiarity with handling common web scraping challenges like CAPTCHAs, rate limiting, and bot detection
  • Experience with API interaction and extracting data from both public and private APIs
  • Strong problem-solving skills, attention to detail, and the ability to handle large-scale scraping projects
  • Familiarity with data cleaning and processing best practices
  • Fluent English

Responsibilities

  • Develop and maintain a scalable in-house built scraping pipeline using Python
  • Implement web scraping solutions using tools like Selenium, BeautifulSoup, or similar libraries
  • Troubleshoot, optimize and enhance existing scraping workflows and tools
  • Cooperation with data scientists and colleagues in developing in-house built data consolidation tools to clean and organize scraped data to ensure it is accurate, reliable, and ready for analysis
  • Manage and utilize third-party proxy services to ensure effective data extraction, bypassing anti-scraping mechanisms
  • Apply advanced client-faking techniques (e.g., user-agent rotation, CAPTCHA solving, IP masking) to avoid detection
  • Collaborate with data engineers and other team members to integrate data into pipelines or systems
  • Stay updated on the latest developments in web scraping, proxies, and anti-scraping techniques

Preferred Qualifications

  • Experience with cloud services like AWS, Google Cloud, or Azure
  • Knowledge of database systems and handling large datasets (SQL/NoSQL)
  • Understanding of ethical data scraping practices and legal considerations (e.g., complying with website terms of service)
  • Experience with containerization (e.g., Docker, kubernetes) and workflow automation

Benefits

  • Employee stock ownership plan (ESOP)
  • Flexible hours
  • Generous Holiday allowance
  • Access to significant opportunities for learning and development
  • Private health insurance including dependants (inc. employee assistance & mental health support)
  • Complimentary weekly language courses
  • Enhanced Parental leave

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let SEON know you found this job on JobsCollider. Thanks! πŸ™