Remote Data Engineer

closed
Logo of CDC Foundation

CDC Foundation

πŸ’΅ $103k-$143k
πŸ“Remote - United States

Job highlights

Summary

Join the CDC Foundation as a Data Engineer to design, build, and maintain data infrastructure for a public health organization. This role will collaborate with data content experts, analysts, and other partners to deliver architecture needed for data generation, storage, processing, and analysis.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Data Science, or a related field is preferred
  • Minimum of 5 years of relevant experience in data engineering
  • Proficiency in programming languages commonly used in data engineering, such as Python, Java, Scala, or SQL
  • Experience with large-scale projects using Amazon Web Services is required. Certification is preferred
  • Strong technical writing skills for creating documentation, policies, and procedures
  • Experience with project planning, including developing timelines, setting milestones, and managing resources
  • Knowledge of data warehousing concepts and tools
  • Experience with cloud computing platforms
  • Experience with data security and data governance
  • Expertise in data modeling, ETL (Extract, Transform, Load) processes, and data integration techniques
  • Strong analytical thinking and problem-solving abilities
  • Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively
  • Flexibility to adapt to evolving project requirements and priorities
  • Outstanding interpersonal and teamwork skills; and the ability to develop productive working relationships with colleagues and partners
  • Experience working in a virtual environment with remote partners and teams
  • Proficiency in Microsoft Office

Responsibilities

  • Collaborate with data scientists, analysts, and other partners to understand their data needs and requirements
  • Implement and maintain ETL processes to ensure the accuracy, completeness, and consistency of data
  • Implement security measures to protect sensitive information
  • Design and manage data storage systems, including relational databases, NoSQL databases, and data warehouses
  • Create and manage the systems and pipelines that enable efficient and reliable flow of data
  • Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency
  • Optimize data pipelines, infrastructure, and workflows for performance and scalability
  • Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them
  • Provide technical guidance to other staff
  • Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings

Preferred Qualifications

  • Experience with big data technologies and frameworks like Hadoop, Spark, Kafka, and Flink
  • Strong understanding of database systems, including relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra)
  • Familiarity with agile development methodologies, software design patterns, and best practices
  • Previous experience working with or within government agencies is preferred
This job is filled or no longer available