Summary
Join the Data Management and BI Team as a Sr. Staff Data Engineer! We build data pipelines and applications using AWS and open-source technologies like Scala, Python, and Spark. You will design, build, and maintain data solutions, collaborate with business partners, and leverage AI/ML techniques. The role requires strong experience in data modeling, AWS technologies, and CI/CD pipelines. Full-stack expertise is a plus. This is a fully remote position with competitive benefits.
Requirements
- Strong Computer Science/Engineering/Information Systems background
- 10+ Years of Experience in Data Modeling, Data architecture, Data Quality, Metadata, ETL, and Data Warehouse methodologies and technologies
- 5+ years experience with AWS technologies
- Proven experience (3+ years) in designing and managing CI/CD pipelines, specifically using GitHub Actions
- Demonstrated experience with Python, APIs, Spark, and Scala
- Experience with with advanced SQL, Linux, MicroStrategy, Tableau, and Pandas
- Strong problem-solving skills
- Strong oral and written communication and influencing skills, with the ability to communicate new concepts and drive change in processes and behaviors and to communicate complex technical topics to management and non-technical audiences
- Bachelor’s degree in Engineering, Computer Science, Information Systems or related field with 10+ years of relevant experience
Responsibilities
- Designs, builds, and oversees the deployment and operation of technology architecture, solutions and software to capture, manage, store, and utilize structured and unstructured data from internal and external sources
- Contributor to the overall Data Product roadmap by working closely with our business partners to understand their challenges and develop analytical tools to help drive business decisions
- Develops technical tools and programming that leverage artificial intelligence, machine learning, and big-data techniques to cleanse, organize and transform data and to maintain, defend and update data structures and integrity on an automated basis
- Leverage prototyping methodologies to propose and design creative business solutions that exploit our broad toolset of technologies (Big Data, MicroStrategy, Tableau, Python, Spark etc)
- Creates and establishes design standards and assurance processes for software, systems, and applications development to ensure compatibility and operability of data connections, flows, and storage requirements
- Reviews internal and external business and product requirements for data operations and activity and suggests changes and upgrades to systems and storage to accommodate ongoing needs
- Design, develop, and maintain CI/CD pipelines using GitHub Actions to automate deployment, testing, and monitoring of applications
- Implement and manage serverless solutions (e.g., AWS Lambda, EMR Serverless, Kafka, SNS, SQS, Athena etc.) as part of the application architecture
- Implement infrastructure as code (IaC) practices using tools like Terraform, AWS CloudFormation, or similar to manage cloud infrastructure
- Work with development teams to set up automated testing frameworks, ensuring high test coverage and code quality
- Must understand the basics of relational data modeling and be able to clearly articulate the reasons for using to non-relational systems in our architecture
- Educate and inform business partners on architecture, capabilities, best practices and solutions to build out future enhancements
- Assist in analyzing business requirements, source systems, understand underlying data sources, transformation requirements, data mapping, data model and metadata for reporting solutions
- Writing easily understood documentation and architecture diagrams and keeping them up to date as code and frameworks change over time
Preferred Qualifications
- 1+ years in Digital Media Publisher Industry with a solid understanding of Digital Research
- Experience with various digital platforms such as Omniture (Site Catalyst), Rentrak, comScore, Operative One, Google DoubleClick, Freewheel, Ad-Juster, MOAT, Nielsen, Facebook, Twitter, etc
- Understanding of how to manage code in the Enterprise Git repository with appropriate branching and documentation skills
- Ability to design optimized, performant, and visually appealing reports, user interfaces, mockups, and documentation
- Ability to read external API documentation and write pipelines to extract data from our partners’ systems
- Ability to write and stand up internal API endpoints to share data with other internal teams
- Strong analytical focus, results-oriented, and execution driven
- Ability and desire to work within a cross-functional team environment with people from multiple business units, vendors, countries, and cultures
- Self-driven/self-initiator and resourceful to achieve goals independently as well as in teams and promotes an open flow of information so that all stakeholders are well informed
- Flexibility to adjust to changing requirements, schedules, and priorities
- Ability to work independently under minimum supervision and be proactive in solving issues
- Energetic, committed, and solution-focused with the ability to perform under pressure and meet targets
Benefits
- Medical, dental and vision insurance
- 401(k)
- Paid leave
- Tuition reimbursement
- A variety of other discounts and perks
Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.