Data Engineer
![Red Canary Logo](https://cdn.jobscollider.com/logo/redcanary-78e0-0.webp)
Red Canary
Summary
Join Red Canary's Business Data Infrastructure (BDI) team as a Data Engineer and play a pivotal role in designing, building, and maintaining the data infrastructure that powers the organization's insights and decisions. You will build and maintain scalable data infrastructure, develop and manage data products, ensure data quality and governance, collaborate with cross-functional teams, and continuously optimize for scalability and efficiency. This role requires a Bachelor's or Master's degree or equivalent experience in a relevant field, 3+ years of experience as a Data Analyst, Data Engineer, or similar, and strong knowledge of relational databases, distributed storage, and cloud computing platforms. The position offers a competitive salary, bonus program, and stock options.
Requirements
- Hold a Bachelor's or Master's degree or equivalent work experience in a relevant field such as Data Engineering, Mathematics, Statistics, Computer Science, or a related engineering discipline
- Have 3+ years of proven experience as a Data Analyst, Data Engineer, data-focused Software Engineer, or similar, preferably in a startup or fast-paced environment
- Possess strong knowledge of relational databases, distributed storage, and semi-structured and unstructured data
- Have substantial experience working within cloud computing platforms, such as AWS, GCP or Azure
- Have strong experience with data pipeline design and execution, using tools such as AWS SageMaker, AWS Glue, Apache Airflow, Prefect or similar
- Demonstrate excellent problem-solving and critical-thinking skills, with the ability to assess a complex problem set and distill insight into a focused, durable and understandable data product
- Possess strong communication skills, with the ability to effectively present complex findings and recommendations to non-technical stakeholders
- Be self-motivated and proactive, with the ability to work independently and collaborate effectively within a team and across a distributed customer base
Responsibilities
- Build and Maintain Scalable Data Infrastructure: Design, implement, and optimize data pipelines and storage solutions to support the ingestion, transformation, and distribution of high-quality data across the organization
- Develop and Manage Data Products: Create reusable, well-documented, and user-centric data products tailored to the needs of specific teams, ensuring they meet defined performance and quality standards
- Ensure Data Quality and Governance: Implement rigorous validation, monitoring, and governance processes to ensure data accuracy, consistency, and compliance with enterprise policies and regulatory requirements
- Collaborate with Cross-Functional Teams: Partner with analysts, data scientists, and business stakeholders in the spokes to understand their needs and deliver solutions that empower data-driven decision-making
- Continuously Optimize for Scalability and Efficiency: Stay updated on emerging technologies and industry best practices, using them to enhance the hub's infrastructure and improve the usability and reliability of data products
Preferred Qualifications
- Exposure to distributed computing, such as PySpark or similar
- Experience with some of our primary data sources (Intacct, Salesforce, Zendesk, Jira)
Benefits
- Base salary for this role is $117,000 - $135,000 per year
- This role is also eligible for participation in the company's bonus program
- This role is eligible for a grant of stock options, subject to the approval of the company's board of directors