Physics-Informed Weakly Supervised Learning for Interatomic Potentials

📅 2024-07-23

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Machine-learned interatomic potentials (MLIPs) often generalize poorly and can yield unphysical energy and force predictions. To address this, we propose a physics-informed, weakly supervised training framework built on two loss terms: (i) a Taylor-expansion term that extrapolates the potential energy to nearby configurations, and (ii) a term based on the concept of conservative forces. Together, these terms improve accuracy when training data are sparse and reduce the need to pretrain on large datasets. Across several baseline models and benchmark datasets, energy and force errors are often lower by about a factor of two. The approach also supports training when forces cannot be computed at the reference level of theory, for example with complete-basis-set extrapolation.

📝 Abstract

Machine learning plays an increasingly important role in computational chemistry and materials science, complementing computationally intensive ab initio and first-principles methods. Despite their utility, machine-learning models often lack generalization capability and robustness during atomistic simulations, yielding unphysical energy and force predictions that hinder their real-world applications. We address this challenge with a physics-informed, weakly supervised approach for training machine-learned interatomic potentials (MLIPs). We introduce two novel loss functions: one extrapolates the potential energy via a Taylor expansion, and the other uses the concept of conservative forces. Our approach improves the accuracy of MLIPs trained on sparse data sets and reduces the need for pre-training computationally demanding models with large data sets. In particular, we perform extensive experiments demonstrating reduced energy and force errors (often lower by a factor of two) for various baseline models and benchmark data sets. Finally, we show that our approach facilitates training MLIPs in settings where computing forces is infeasible at the reference level, such as those employing complete-basis-set extrapolation.
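
The abstract names the two ingredients without writing them out. As a reading aid, the textbook relations they rest on are given below. The symbols ($E$ for the potential energy, $\mathbf{r}$ for atomic positions, $\delta\mathbf{r}_i$ for a small displacement of atom $i$, $\mathbf{F}_i$ for the force on atom $i$) are notation chosen here, not taken from the paper, and the exact loss definitions used by the authors may differ.

```latex
% First-order Taylor expansion of the energy around r, using F_i = -\nabla_{r_i} E
E(\mathbf{r} + \delta\mathbf{r}) \approx E(\mathbf{r})
    - \sum_i \mathbf{F}_i(\mathbf{r}) \cdot \delta\mathbf{r}_i ,
\qquad
\mathbf{F}_i = -\nabla_{\mathbf{r}_i} E

% Conservative forces: the work done around any closed path vanishes
\oint \mathbf{F} \cdot d\mathbf{r} = 0
```

Both relations can be evaluated on displaced configurations that carry no reference label, which is presumably what makes the supervision "weak".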

Problem

Research questions and friction points this paper is trying to address.

Limited generalization of machine-learned interatomic potentials

Large energy and force errors when training data are sparse

Lack of robustness in molecular dynamics simulations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Physics-informed weakly supervised learning approach

Loss function that extrapolates the potential energy via a Taylor expansion

Loss function built on the concept of conservative forces
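
The two loss ideas listed above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: a toy harmonic pair potential stands in for an MLIP, and the function names (`pair_energy`, `pair_forces`, `taylor_consistency_loss`, `two_path_loss`) and the precise form of each loss are hypothetical rather than the paper's definitions.

```python
import math

def pair_energy(positions, k=1.0, r0=1.0):
    """Toy harmonic pair potential; a stand-in for an MLIP energy prediction."""
    energy = 0.0
    for i in range(len(positions)):
        for j in range(i + 1, len(positions)):
            d = math.dist(positions[i], positions[j])
            energy += 0.5 * k * (d - r0) ** 2
    return energy

def pair_forces(positions, k=1.0, r0=1.0):
    """Analytic forces F_i = -dE/dr_i of the toy potential."""
    forces = [[0.0, 0.0, 0.0] for _ in positions]
    for i in range(len(positions)):
        for j in range(len(positions)):
            if i == j:
                continue
            d = math.dist(positions[i], positions[j])
            for a in range(3):
                diff = positions[i][a] - positions[j][a]
                forces[i][a] -= k * (d - r0) * diff / d
    return forces

def displace(positions, shifts):
    """Return a new configuration with each atom moved by its shift."""
    return [[p + s for p, s in zip(pos, sh)] for pos, sh in zip(positions, shifts)]

def dot(forces, shifts):
    """Sum over atoms and Cartesian components of F_i . dr_i."""
    return sum(f * s for fo, sh in zip(forces, shifts) for f, s in zip(fo, sh))

def taylor_consistency_loss(positions, shifts):
    """Squared gap between the energy predicted at the displaced configuration
    and its first-order Taylor estimate from the original one (assumed form)."""
    predicted = pair_energy(displace(positions, shifts))
    extrapolated = pair_energy(positions) - dot(pair_forces(positions), shifts)
    return (predicted - extrapolated) ** 2

def two_path_loss(positions, shifts, detour):
    """Squared gap between two first-order energy estimates for the same end
    point: one direct step versus a step via an intermediate configuration.
    For conservative forces the gap shrinks as the steps get smaller
    (assumed form)."""
    base = pair_energy(positions)
    direct = base - dot(pair_forces(positions), shifts)
    midpoint = displace(positions, detour)
    remainder = [[s - d for s, d in zip(sh, de)] for sh, de in zip(shifts, detour)]
    via_detour = (base - dot(pair_forces(positions), detour)
                  - dot(pair_forces(midpoint), remainder))
    return (direct - via_detour) ** 2

# Example: three atoms, with small displacements that need no reference labels.
positions = [[0.0, 0.0, 0.0], [1.1, 0.0, 0.0], [0.0, 0.9, 0.0]]
shifts = [[1e-3, 0.0, 0.0], [0.0, -1e-3, 0.0], [0.0, 0.0, 1e-3]]
detour = [[0.0, 1e-3, 0.0], [0.0, 0.0, 0.0], [1e-3, 0.0, 0.0]]
print(taylor_consistency_loss(positions, shifts))
print(two_path_loss(positions, shifts, detour))
```

In a real training loop the energy and forces would come from the model being trained, and terms like these would be added to the usual supervised loss. Because the toy potential here is exactly conservative, both values are small and shrink further as the displacements shrink.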
