MolCryst-MLIPs: A Machine-Learned Interatomic Potentials Database for Molecular Crystals

๐Ÿ“… 2026-04-15
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

206K/year
๐Ÿค– AI Summary
This work addresses the scarcity of high-quality, reproducible, and open-source machine learning interatomic potential (MLIP) databases for molecular crystals, which has hindered their application in polymorph and thermodynamic simulations. We present the first open MLIP database specifically designed for molecular crystals, built upon the MACE-MH-1 pretrained model and an automated machine learning pipeline (AMLP). Covering nine representative chemical systems, our framework encompasses the entire workflowโ€”from generation of high-fidelity reference data to model fine-tuning and validation. The resulting models achieve mean absolute errors of 0.141 kJ/mol/atom for energy and 0.648 kJ/mol/ร… for forces, demonstrating excellent energy conservation and structural stability in molecular dynamics simulations. This approach significantly enhances model generalizability, development efficiency, and reproducibility, enabling efficient simulation of polymorphic behavior under diverse thermodynamic conditions.

Technology Category

Application Category

๐Ÿ“ Abstract
We present an open Molecular Crystal (MC) database of Machine-Learned Interatomic Potentials (MLIP) called MolCryst-MLIPs. The first release comprises fine-tuned MACE models for nine molecular crystal systems -- Benzamide, Benzoic acid, Coumarin, Durene, Isonicotinamide, Niacinamide, Nicotinamide, Pyrazinamide, and Resorcinol -- developed using the Automated Machine Learning Pipeline (AMLP), which streamlines the entire MLIP development workflow, from reference data generation to model training and validation, into a reproducible and user-friendly pipeline. Models are fine-tuned from the MACE-MH-1 foundation model (omol head), yielding a mean energy MAE of 0.141 kJ/mol/atom and a mean force MAE of 0.648 kJ/mol/Angstrom across all systems. Dynamical stability and structural integrity, as assessed through energy conservation, P2 orientational order parameters, and radial distribution functions, are evaluated using molecular dynamics simulations. The released models and datasets constitute a growing open database of validated MLIPs, ready for production MD simulations of molecular crystal polymorphism under different thermodynamic conditions.
Problem

Research questions and friction points this paper is trying to address.

Molecular Crystals
Machine-Learned Interatomic Potentials
Polymorphism
Molecular Dynamics
Database
Innovation

Methods, ideas, or system contributions that make the work stand out.

Machine-Learned Interatomic Potentials
Molecular Crystals
Automated Machine Learning Pipeline
MACE Models
Polymorphism Simulation
๐Ÿ”Ž Similar Papers
A
Adam Lahouari
Department of Chemistry, New York University, New York, NY 10003, USA
S
Shen Ai
Department of Physics, New York University, New York, NY 10003, USA
J
Jihye Han
Department of Chemistry, New York University, New York, NY 10003, USA
J
Jillian Hoffstadt
Department of Chemistry, New York University, New York, NY 10003, USA
P
Philipp Hoellmer
Department of Chemistry, New York University, New York, NY 10003, USA
C
Charlotte Infante
Department of Chemistry, New York University, New York, NY 10003, USA
P
Pulkita Jain
Department of Chemical and Biomolecular Engineering, Tandon School of Engineering, New York University, Brooklyn, New York 11201, United States
S
Sangram Kadam
Department of Chemistry, New York University, New York, NY 10003, USA
M
Maya M. Martirossyan
Department of Chemistry, New York University, New York, NY 10003, USA
A
Amara McCune
Department of Physics, New York University, New York, NY 10003, USA
H
Hypatia Newton
Simons Center for Computational Physical Chemistry, New York University, New York, NY 10003, USA
S
Shlok J. Paul
Department of Chemical and Biomolecular Engineering, Tandon School of Engineering, New York University, Brooklyn, New York 11201, United States
W
Willmor Pena
Department of Chemistry, New York University, New York, NY 10003, USA
J
Jonathan Raghoonanan
Department of Physics, New York University, New York, NY 10003, USA
S
Sumon Sahu
Department of Physics, New York University, New York, NY 10003, USA
O
Oliver Tan
Department of Chemistry, New York University, New York, NY 10003, USA
A
Andrea Vergara
Department of Physics, New York University, New York, NY 10003, USA
Jutta Rogal
Jutta Rogal
Flatiron Institute
enhanced samplingdimensionality reductionmachine learning for molecular physicsmaterials
M
Mark E. Tuckerman
Department of Chemistry, New York University, New York, NY 10003, USA