Data Engineering Intern

RippleMatch

💵 $41k-$72k
📍Internship - United States

Job highlights

Summary

Join our Enterprise Data & Analytics organization as a 2025 Guardian Summer Intern, Data Engineering. You will be immersed in the daily environment of a thriving global financial services company and gain invaluable industry and organizational knowledge through daily business interactions and job assignments.

Requirements

  • Pursuing a master's degree with 6+ months of experience or a PhD with 1+ years of experience in Computer Science, Engineering, Mathematics, or a related field (graduation date between December 2025 and June 2026)
  • Hands-on experience with Python, SQL, PySpark, and bash scripts. Proficient in the software development lifecycle and software engineering practices
  • Experience in developing and maintaining robust data pipelines for both structured and unstructured data to be used by Data Scientists to build ML Models
  • Experience working with cloud data warehousing platforms (Redshift, Snowflake, Databricks SQL, or equivalent) and with distributed frameworks such as Spark
  • Experience with machine learning frameworks (like Keras or PyTorch) and libraries (like scikit-learn, xgboost)
  • Proven understanding of machine learning life cycle, data mining, and ETL techniques

Responsibilities

  • Work in an innovative, fast-paced environment, collaborating with bright minds while enjoying a balance between strategic and hands-on work
  • Help enable cutting-edge AI & machine learning solutions that contribute to enhancing the wellbeing of our customers, fostering growth, maintaining competitive advantage, and improving customer satisfaction
  • Collaborate with data scientists and data analysts to understand data requirements and translate them into scalable, high-performance data pipeline solutions
  • Design, implement, validate, and prepare datasets for AI models
  • Support data discovery & data preparation for model development. Perform detailed analysis of raw data sources by applying business context and collaborate with cross-functional teams to transform raw data into curated & certified data assets to be used for ML and BI use cases
  • Extract text data from a variety of sources, such as documents (Word, PDF, text files, JSON, etc.), logs, and text notes stored in databases, and from web pages using web scraping, to support development of NLP / LLM solutions
  • Collaborate with data science and data engineering teams to build scalable and reproducible machine learning pipelines for training and inference
  • Assist in developing and maintaining robust tools, frameworks, and libraries that standardize and streamline the data & machine learning lifecycle
  • Collaborate with cross-functional teams of Data Science, Data Engineering, business units and various IT teams
  • Build and maintain effective documentation for projects and practices, ensuring transparency and effective team communication
  • Adapt standard machine learning methods to best exploit modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU)
  • Have the opportunity to work with and learn from supportive leaders, mentors, and team members across the organization who will help coach you as you develop your professional career
  • Learn about Guardian’s purpose, values, how we work, and our suite of product and service offerings
  • Build a network of colleagues and have a sense of community with other interns and other parts of the business
  • Think broadly and ask questions about data, facts and other information
  • Be a self-starter – someone who enjoys “rolling up their sleeves and getting things done”, has high energy and a strong work ethic, can work independently, and is creative

Benefits

  • Choice of high-deductible or copay medical plans with prescription drugs, including coverage for fertility and transgender-inclusive benefits
  • Dental plan
  • Vision plan
  • Health care accounts – flexible spending, health reimbursement, and health savings accounts
  • Critical illness insurance
  • Company-paid Life and Disability insurance plus voluntary supplemental coverage
  • Accident insurance
  • 401(k) retirement plan with a company match, plus an annual age/service-based Company contribution and an annual profit-sharing contribution, if applicable
  • Complimentary 1:1 financial guidance with a licensed Fidelity representative
  • Flexible work arrangements (part in-person/part remote)
  • Unlimited paid time off for most roles plus time off for volunteering, jury duty, voting, and bereavement
  • Personal holidays for colleagues to use in recognition of religious, cultural, or civic days
  • Paid parental leave and paid family and medical leave policies
  • Emotional well-being, mental health, and work/life resources powered by Spring Health
  • Wellness programs, including fitness program and equipment reimbursement
  • Child, adult, and elder back-up care support through Bright Horizons
  • Adoption assistance
  • College planning
  • Tuition reimbursement
  • Student loan assistance
  • Commuter benefits in select metropolitan areas
