Remote Software Engineer

Logo of Blue River Technology

Blue River Technology

💵 $142k-$250k
📍Remote - Worldwide

Job highlights

Summary

Join us to help create a data-driven future as a Sr Software Engineer on our Data/CVML platform team at Blue River. The ideal candidate is an experienced data platform builder and data wrangler who enjoys optimizing data systems and building them from the ground up.

Requirements

  • 10+ years of experience building data platforms/data backends
  • 5+ years of experience working with Python
  • Experience building and optimizing ‘big data’ data pipelines using Spark, architectures, and data sets
  • Familiar with best practices in building CVML pipelines
  • Strong analytic skills related to working with unstructured datasets
  • Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores
  • Experience supporting and working with cross-functional teams in a dynamic environment
  • They should also have experience using the following software/tools: Experience with relational SQL and NoSQL databases, including MongoDB
  • Experience with data pipeline and orchestration platforms such as Airflow
  • Experience with AWS cloud services: EC2, RDS DBs
  • Experience with Terraform
  • Strong problem-solving skills and ability to troubleshoot complex data-related issues
  • Excellent communication skills to collaborate effectively with technical and non-technical stakeholders
  • Attention to detail and commitment to producing high-quality, well-documented code

Responsibilities

  • Create and maintain optimal data platforms for ingesting machine logs, image data, and various other types of datasets
  • Assemble large, complex data sets that meet functional / non-functional business requirements
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using Python, SQL, Databricks Spark, and AWS ‘big data’ technologies
  • Build and enhance CVML pipelines, and integrate data within Kubeflow and Databricks using pyTorch and its ecosystem
  • Work with stakeholders including the Product, Data, and Infra teams to assist with data-related technical issues and support their data infrastructure needs
  • Create data tools for analytics and data scientist team members that assist them in building and optimizing their workflows
  • Triage and fix support issues related to data anomalies
  • Collaboration: Work closely with cross-functional teams, including data scientists, analysts, software engineers, and product managers, to understand data requirements and deliver data solutions that align with business goals
  • Documentation: Create and maintain technical documentation, including data flow diagrams, architecture designs, and standard operating procedures
  • Technology Evaluation: Stay up-to-date with industry trends and emerging technologies related to data engineering, recommending and implementing new tools and frameworks as appropriate

Preferred Qualifications

  • Experience with image processing and labeling platforms
  • Can understand some C++ or Go, or talk with people that do
  • Prior experience in the autonomy and robotics space is a huge plus
  • Experience with image processing pipelines
  • Familiarity with robotics logs
  • Knows how to use git and/or other versioning systems

Share this job:

Disclaimer: Please check that the job is real before you apply. Applying might take you to another website that we don't own. Please be aware that any actions taken during the application process are solely your responsibility, and we bear no responsibility for any outcomes.
Please let Blue River Technology know you found this job on JobsCollider. Thanks! 🙏