Battery Data Scientist
- Employer
- Chemix, Inc.
- Location
- Sunnyvale, California
- Posted
- Nov 01, 2024
- Closes
- Nov 13, 2024
- Ref
- 2840197367
- Discipline
- Engineering, Battery
- Specialty
- Electrification, Information Technology, Data Management
- Hours
- Full Time
- Organization Type
- Corporate
Chemix is seeking a highly motivated data scientist to develop and expand our AI platform for battery materials discovery. Our AI platform is the core of Chemix. Though data is first and foremost in any application of AI, it is typically very scarce in materials development. We've designed our R&D operation to generate large, high-quality battery materials datasets. As a data scientist at Chemix, your mission is to help develop machine learning models, perform statistical analyses, and ultimately design and implement the pipelines that turn our data into actionable results. You'll make a fundamental contribution to developing the batteries that will power the electrification revolution in transportation and beyond.
As an early employee at a fast-moving startup, you will be expected to quickly and creatively solve all kinds of technical problems, including those beyond your core expertise. An ideal candidate is able to learn quickly, is eager to stretch their knowledge of the ML and data software stack, takes pride in the quality of their work, and wants to make a real impact in energy storage technologies for electric transportation.
Responsibilities:
Develop machine learning and data pipelines for a wide variety of applications and types of battery data
Discover and introduce new ML models, statistical methods, software frameworks, and libraries
Contribute code to Chemix's internal codebase (Python)
Interface with our machine learning scientists, battery engineers, and customers
Implement best practices for code development, MLOps, experiment tracking, etc.
Inform the optimization of the R&D process that generates our data
Requirements:
Bachelor's degree in computer science, or the physical, chemical, or biological sciences or engineering, combined with 3+ years of work experience in data science for the physical sciences
Fluency in a variety of data science and statistics concepts
Extensive experience with the Python data science stack: pandas, numpy, sklearn, plotly, scipy
Experience with the fundamentals of data science and software ops: git, testing, CI/CD
Clear communication and good people skills
Strong organization and ability to manage parallel projects
Nice to have:
Experience with workflow orchestration tools, e.g. Airflow, Prefect, Luigi, and scaling tools such as Dask
Experience with various modern neural network architectures such as transformers, GCNN, etc.
Experience with physics-based modeling of batteries (e.g. DFN model) and/or chemistry (DFT, MD, QC, etc.)
Experience with cloud web services (AWS, Google Cloud, Azure, etc.), Docker, Kubernetes
Familiarity with experimental chemistry/materials science
Benefits:
Stock Option Plan
Health Care Plan (Medical, Dental & Vision)
Retirement Plan (401k)
Paid Time Off (Vacation, Sick & Public Holidays)
Family Leave (Maternity, Paternity)