Cloud Data Integration Engineer

closed
Logo of VetsEZ

VetsEZ

πŸ“Remote - United States

Job highlights

Summary

VetsEZ is seeking a Cloud Data Integration Engineer to support the Electronic Health Record Modernization Integration Office (EHRM-IO) project. The candidate will design and implement a cloud strategy for data integration, test and validate data pipelines, communicate technical concepts, and more.

Requirements

  • Bachelor’s degree in Computer Science, Electronics Engineering, or a related technical field, plus 5+ years of experience
  • Data integration activities could include, the use of the following tools and languages: SSIS, T-SQL, P-SQL, BIML Studio, Visual Studio, PowerBI, Python, Scala, YAML scripting for data pipelines, Azure Data Factory, C#, Talend, AWS Glue, PowerShell, Databricks, DeltaLake, and/or Microsoft Synapse
  • Knowledge of how to secure the data lake using role-based access controls (RBACs) and Access Control Lists (ACLs)
  • Experience with Agile Frameworks, DevSecOps, and CI/CD Pipelines
  • Ability to work in a fast paced and agile development environment
  • Takes ownership of tasks and assignments to completion with the ability to delegate amongst a team effectively
  • Knowledge of database architecture, administration, and security for on-premise and cloud-hosted database systems
  • Communicates and leads effectively in detailed technical discussions with the customer and among cross functional stakeholders

Responsibilities

  • Utilize a diverse set of tools and systems to support easy access to the data
  • Handle incremental updates to databases with near-real time data Management of the data through parquet files or other file formats
  • Create and manage ETL and Extract, Load, transform (ELT) scripts to populate EHRM data model tables with data
  • Test and Validate data pipelines and data quality improvement
  • Utilize data standards, terminologies and regulations in data modeling
  • Generate Database entity diagrams and data dictionaries using erwin data modeler or similar tool
  • Access, query, read, write and transform data to and from multiple data sources and varying database applications
  • Create and update databases comprised of various data types, formats, constraints and storage options over multiple platforms (such as Azure Cloud, etc.)
  • Communicate complex technical concepts to non-technical stakeholders
  • Monitor and optimize Databricks jobs and clusters to ensure efficient and scalable performance
  • Troubleshoot and resolve issues related to data integration using Databricks
  • Build and optimize β€˜big data’ data pipelines, architectures and data sets
  • Update documentation/Wiki pages to document work and updates as directed by the Government Project Manager
  • Update the Data Lake/Analytics Database Design Document with the enhancements and changes

Preferred Qualifications

  • Experience in the VA or other federal organizations desired
  • Experience with Data Migration and Data Syndication within the VA desired

Benefits

  • Medical/Dental/Vision
  • 401k with Employer Match
  • PTO + Federal Holidays
  • Corporate Laptop
  • Training opportunities
  • Remote Opportunity
This job is filled or no longer available