Scientific Data Engineer

TetraScience
Summary
Join TetraScience, a leader in Scientific Data and AI Cloud, and contribute to the Scientific AI revolution. Research data acquisition strategies for scientific lab instrumentation and productionize file parsers for various instrument output files. Design and build data models and pipelines, alongside unit and integration tests. Cross-analyze instrument data to design common data model components and build visualizations using tools like Streamlit and Tableau. Ensure solutions meet customer requirements and deliver value. This role requires extensive wet lab experience, proficiency in Python and SQL, and a PhD in a relevant field. Competitive salary, equity, generous PTO, flexible working arrangements, and a supportive team culture are offered.
Requirements
- Must have >3 years experience in Python and SQL
- PhD degree in biology or chemistry or relevant fields
- Must have extensive wet lab experience, preferably with HPLC and Mass Spec experience and worked with one or more of the following instruments
- Waters LCs with Empower software and MassLynx software
- Thermo Fisher LCs with Chromeleon software and Xcalibur software
- Cytiva LCs with Unicorn software
- Shimadzu LCs with LabSolution software
- Sciex LCs with Sciex software
- Agilent LCs with OpenLab software and ChemStation software
Responsibilities
- Research data acquisition strategy for scientific lab instrumentation
- Research and productionize file parsers for instrument output files (.xlsx, .pdf, .txt, .raw, .fid, many other vendor binaries)
- Design and build data models and the corresponding data pipelines, unit tests, integration tests, and reusable utility functions
- Cross-analyze instrument data with the same instrument type or scientific workflow to design common data model components
- Build visualization, report, and dashboards using Streamlit, Tableau, Jupyter notebook, etc
- Drive value for the customers - test and make sure the solution fulfills their requirements and provides value
Preferred Qualifications
- Experience with data plotting dashboarding tools like Streamlit, Tableau, Jupyter Notebook is a plus
- Passionate about data problems and using AI and LLM to come up with creative ideas
- Excellent communication skills, attention to detail, and the confidence to take control of project delivery
- Quickly understand a highly technical product and effectively communicate with product management and engineering
- Proactive problem-solving skills
- High-bandwidth: thrives when managing multiple simultaneous projects
- Intellectually curious: Unwavering drive to learn and know more every day
- Ability to think creatively about how to solve project risks without reducing quality
- Team player and ability to "roll up your sleeves" and do what it takes to make the team successful
Benefits
- Competitive Salary and equity in a fast-growing company
- Supportive, team-oriented culture of continuous improvement
- Generous paid time off (PTO)
- Flexible working arrangements - Remote work