Summary

Join Coupa as a Data Engineer and play a key role in designing, building, and maintaining the data infrastructure that powers our business. You will collaborate with cross-functional teams to develop data pipelines, transform raw data, and ensure data quality. Responsibilities include designing robust data architectures, creating data warehouses and lakes, and optimizing Spark clusters for efficiency. You will also build analytics tools, work with stakeholders on data-related issues, and ensure data security. Coupa offers a collaborative culture, pioneering technology, and the opportunity to make a global impact. The ideal candidate will have a strong background in data engineering, experience with various technologies, and a graduate degree in a quantitative field.

Requirements

Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases
Experience with processing large workloads and complex code on Spark clusters
Proven experience in setting up monitoring for Spark clusters and driving optimization based on insights and findings
Experience in designing and implementing scalable Data Warehouse solutions to support analytical and reporting needs
Experience with API development and design with REST or GraphQL experience building and optimizing ‘big data’ data pipelines, architectures, and data sets
Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
Strong analytic skills related to working with unstructured datasets
Build processes supporting data transformation, data structures, metadata, dependency, and workload management
Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores
Strong project management and organizational skills
Experience supporting and working with cross-functional teams in a dynamic environment
6-10 years of experience in a Data Engineer role
Graduate degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field
Experience with object-oriented/object function scripting languages: Python, Java, Etc
Expertise with Python
Experience with big data tools: Spark, Kafka, etc
Experience with relational SQL and NoSQL databases, including Postgres and Cassandra
Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc
Experience with AWS cloud services: EC2, EMR, RDS, Redshift
Working knowledge of stream-processing systems: Storm, Spark-Streaming, etc

Responsibilities

Create and maintain optimal data pipeline architecture
Optimize Spark clusters for efficiency and performance by implementing robust monitoring systems to identify bottlenecks using data and metrics. Provide actionable recommendations for continuous improvement
Optimize Spark clusters for efficiency and performance
Assemble large, complex data sets that meet functional / non-functional business requirements
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies
Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics
Work with stakeholders including the Executive, Product, Data, and Design teams to assist with data-related technical issues and support their data infrastructure needs
Keep our data separated and secure across national boundaries through multiple data centers and AWS regions
Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
Work with data and analytics experts to strive for greater functionality in our data systems

Benefits

Based in Bay Area, California: $142,000 - 167,000
Based in California: $136,000 - 160,000
Based in Colorado: $120,000 - 162,000
Based in New Jersey: $136,000 - 160,000
Based in New York: $136,000 - 160,000
Based in Washington: $120,000 - 162,000

Senior Data Engineer

Coupa Software

Summary

Requirements

Responsibilities

Benefits

Remote

Data

Senior

Share this job:

Similar Remote Jobs

Remote

Data

Principal

Remote

Data

Senior

Fetch

Remote

DevOps

Senior

Remote

Software Development

Senior

Remote

Data

Mid-level

Remote

Software Development

Senior

Remote

Data

Principal

Remote

Data

Senior

M3 USA

Remote

Data

Senior