MatMind: A Structure-Activity Knowledge-Driven Generative Foundation Model for Materials Science

📅 2026-06-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes the first generative foundation model for crystalline materials, addressing the limitations of conventional task-specific AI models that struggle to jointly handle structural representation, property prediction, and structure–activity reasoning. By integrating structure–activity knowledge injection, a dual-head joint training architecture, and multi-objective physics-informed reinforcement learning, the model achieves synergistic optimization of symbolic reasoning and numerical regression within a unified framework. It attains state-of-the-art accuracy in predicting formation energy above the convex hull, bulk modulus, and bandgap, while achieving a 65.3% S.U.N. rate for unconditional crystal generation. Furthermore, it significantly improves success rates in conditional generation tasks involving rare magnetization densities, outperforming existing specialized models across multiple benchmarks.
📝 Abstract
Progress in AI-driven crystal materials science has so far been carried by narrow architectures purpose-built for individual tasks -- graph neural networks for property prediction, diffusion and flow-matching models for crystal generation -- each excelling within its niche yet unable to act as a shared backbone across the full spectrum of materials problems. Generative large language models offer a fundamentally different paradigm, in which structural representation, quantitative prediction, and structure-activity reasoning can be unified within one model, but the materials community has yet to see this paradigm realized at a level competitive with established narrow specialists. Here we present MatMind, a generative foundation model purpose-built for crystal materials science under this paradigm, developed through the coordinated activation of structure-activity knowledge and physics-informed feedback within a progressive training framework -- combining structure-activity knowledge injection, a dual-head architecture that jointly trains language reasoning and numerical regression in a shared representation space, and multi-objective physics-informed reinforcement learning over stability, novelty, and structural diversity. Across three task families, MatMind attains the lowest mean absolute error on energy above hull, bulk modulus, and band gap -- surpassing graph neural network predictors purpose-built for these tasks -- reaches an S.U.N. rate of 65.3% on unconditional crystal generation, and achieves a comparable multiplicative improvement on magnetization-density-conditioned generation, where only 21 positive samples exist within over 600000 training entries. By matching or surpassing narrow specialists on their own ground while operating within a single unified model, MatMind shows that the LLM-based paradigm can serve as a viable backbone for crystal materials science going forward.
Problem

Research questions and friction points this paper is trying to address.

foundation model
crystal materials science
structure-activity reasoning
generative AI
unified model
Innovation

Methods, ideas, or system contributions that make the work stand out.

generative foundation model
structure-activity knowledge
physics-informed reinforcement learning
dual-head architecture
crystal generation
Z
Zhan'ao Yao
State Key Laboratory of High Performance Ceramics, Shanghai Institute of Ceramics, Chinese Academy of Sciences, 1295 Dingxi Road, Shanghai, 200050, China.; Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China.
Boxuan Zhang
Boxuan Zhang
Beijing Sport University
Spiking Neural NetworksAI4Sports
J
Jingyuan Shu
Beijing Wenge Technology Co., Ltd., Room 717, 7th Floor, Building 9, No. 9 West Beisihuan Road, Beijing, 100080, Beijing, China.
Xiaoyu Wu
Xiaoyu Wu
Central University of Finance and Economics
development economicslabor economicshealth economics
R
Rongyan Wang
State Key Laboratory of High Performance Ceramics, Shanghai Institute of Ceramics, Chinese Academy of Sciences, 1295 Dingxi Road, Shanghai, 200050, China.; Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China.
L
Linjing Li
State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, 95 Zhongguancun East Road, Beijing, 100190, China.; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 100049, China.
D
Dajun Zeng
State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, 95 Zhongguancun East Road, Beijing, 100190, China.; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 100049, China.
Y
Yudong Yao
College of Medicine and Biological Information Engineering, Northeastern University, No. 3-11, Wenhua Road, Heping District, Shenyang, 110819, Liaoning, China.
T
Tingwei Chen
Faculty of Information, Liaoning University, 66 Chongshan Middle Road, Huanggu Qu, 110031, Shenyang, China.
Y
Youwei Wang
State Key Laboratory of High Performance Ceramics, Shanghai Institute of Ceramics, Chinese Academy of Sciences, 1295 Dingxi Road, Shanghai, 200050, China.; Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China.; School of Chemistry and Materials Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, 1 Sub-lane Xiangshan, Hangzhou, 310024, China.
X
Xiaolin Zhao
State Key Laboratory of High Performance Ceramics, Shanghai Institute of Ceramics, Chinese Academy of Sciences, 1295 Dingxi Road, Shanghai, 200050, China.; Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China.; School of Chemistry and Materials Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, 1 Sub-lane Xiangshan, Hangzhou, 310024, China.
J
Jiahui Shi
State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, 95 Zhongguancun East Road, Beijing, 100190, China.; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 100049, China.
J
Jianjun Liu
State Key Laboratory of High Performance Ceramics, Shanghai Institute of Ceramics, Chinese Academy of Sciences, 1295 Dingxi Road, Shanghai, 200050, China.; Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing, 100049, China.; School of Chemistry and Materials Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, 1 Sub-lane Xiangshan, Hangzhou, 310024, China.