Data Engineer - Intern

closed

Hypersonix

📍Remote - Worldwide

Summary

Join Hypersonix.ai, a company disrupting the e-commerce space with AI and ML, as a Data Engineer. You will be responsible for creating and maintaining optimal data pipeline architecture, assembling large datasets, identifying and implementing process improvements, and building infrastructure for data extraction, transformation, and loading. The role involves running ad-hoc analysis, working with stakeholders across various teams, ensuring data security across multiple data centers, and collaborating with analytics and data science teams. You will build and optimize big data pipelines and perform root cause analysis. This position requires SQL knowledge, experience with relational databases, and the ability to work with large datasets.

Requirements

  • SQL knowledge, including query authoring, and experience working with relational databases, as well as working familiarity with a variety of other databases
  • Able to build and optimize ‘big data’ data pipelines, architectures and data sets
  • Perform root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
  • Basic analytic skills related to working with unstructured datasets
  • Build processes supporting data transformation, data structures, metadata, dependency and workload management
  • Able to manipulate, process and extract value from large disconnected datasets
  • Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores
  • Able to support and work with cross-functional teams in a dynamic environment

Responsibilities

  • Create and maintain optimal data pipeline architecture
  • Assemble large, complex data sets that meet functional and non-functional business requirements, and write complex queries in an optimized way
  • Identify, design, and implement internal process improvements, such as automating manual processes, optimizing data delivery, and re-designing infrastructure for greater scalability
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies
  • Run ad-hoc analysis utilizing the data pipeline to provide actionable insights
  • Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs
  • Keep our data separated and secure across national boundaries through multiple data centers and AWS regions
  • Work with analytics and data science team members to help build and optimize our product into an innovative industry leader
This job is filled or no longer available