ML Data Engineer
Role Overview
Develop systems that integrate diverse scientific data sources into cohesive AI training and inference pipelines.
What You'll Do
- Build data processing pipelines that harmonize theoretical models, scientific literature, and experimental data
- Design feature extraction and representation learning approaches for heterogeneous scientific data
- Implement and optimize multimodal fusion techniques for scientific discovery applications
- Develop evaluation metrics and validation procedures for multimodal scientific AI systems
What You Bring
- Experience with multimodal deep learning architectures and fusion techniques
- Interest in natural language processing for scientific literature analysis
- Background in processing structured scientific data (molecular structures, experimental measurements)
- Software engineering skills with experience in ML systems
Nice to Have
- Experience with knowledge graphs or scientific databases
- Background in information retrieval or search systems
- Understanding of chemistry or materials science concepts
Benefits
- Competitive salary and equity
- Comprehensive health, dental, and vision insurance
- Generous PTO and parental leave
Last Updated: May 20, 2025