🤖 AI Summary
Predicting protein stability changes (ΔΔG) induced by single-point mutations is a fundamental challenge in computational biology. Conventional energy-based methods often neglect the contribution of the unfolded state’s free energy change, violating thermodynamic mass balance and introducing systematic bias; while deep learning approaches achieve high accuracy, they are computationally expensive and difficult to deploy. This paper introduces the Mass-Balance Correction (MBC) framework—the first to explicitly incorporate thermodynamic mass-balance constraints into energy-based models, rigorously accounting for the unfolded-state contribution. MBC integrates statistical potentials, structure-derived feature engineering, and physical principles, requiring no training and enabling lightweight, efficient inference. On standard benchmarks, MBC significantly improves prediction correlation for ΔΔG, outperforming mainstream energy-based methods and approaching the accuracy of certain deep learning models—thereby achieving a favorable trade-off among physical consistency, predictive accuracy, and computational efficiency.
📝 Abstract
The prediction of protein stability changes following single-point mutations plays a pivotal role in computational biology, particularly in areas like drug discovery, enzyme reengineering, and genetic disease analysis. Although deep-learning strategies have pushed the field forward, their use in standard workflows remains limited due to resource demands. Conversely, potential-like methods are fast, intuitive, and efficient. Yet, these typically estimate Gibbs free energy shifts without considering the free-energy variations in the unfolded protein state, an omission that may breach mass balance and diminish accuracy. This study shows that incorporating a mass-balance correction (MBC) to account for the unfolded state significantly enhances these methods. While many machine learning models partially model this balance, our analysis suggests that a refined representation of the unfolded state may improve the predictive performance.