🤖 AI Summary
This work proposes the first generative foundation model for crystalline materials, addressing the limitations of conventional task-specific AI models that struggle to jointly handle structural representation, property prediction, and structure–activity reasoning. By integrating structure–activity knowledge injection, a dual-head joint training architecture, and multi-objective physics-informed reinforcement learning, the model achieves synergistic optimization of symbolic reasoning and numerical regression within a unified framework. It attains state-of-the-art accuracy in predicting formation energy above the convex hull, bulk modulus, and bandgap, while achieving a 65.3% S.U.N. rate for unconditional crystal generation. Furthermore, it significantly improves success rates in conditional generation tasks involving rare magnetization densities, outperforming existing specialized models across multiple benchmarks.
📝 Abstract
Progress in AI-driven crystal materials science has so far been carried by narrow architectures purpose-built for individual tasks -- graph neural networks for property prediction, diffusion and flow-matching models for crystal generation -- each excelling within its niche yet unable to act as a shared backbone across the full spectrum of materials problems. Generative large language models offer a fundamentally different paradigm, in which structural representation, quantitative prediction, and structure-activity reasoning can be unified within one model, but the materials community has yet to see this paradigm realized at a level competitive with established narrow specialists. Here we present MatMind, a generative foundation model purpose-built for crystal materials science under this paradigm, developed through the coordinated activation of structure-activity knowledge and physics-informed feedback within a progressive training framework -- combining structure-activity knowledge injection, a dual-head architecture that jointly trains language reasoning and numerical regression in a shared representation space, and multi-objective physics-informed reinforcement learning over stability, novelty, and structural diversity. Across three task families, MatMind attains the lowest mean absolute error on energy above hull, bulk modulus, and band gap -- surpassing graph neural network predictors purpose-built for these tasks -- reaches an S.U.N. rate of 65.3% on unconditional crystal generation, and achieves a comparable multiplicative improvement on magnetization-density-conditioned generation, where only 21 positive samples exist within over 600000 training entries. By matching or surpassing narrow specialists on their own ground while operating within a single unified model, MatMind shows that the LLM-based paradigm can serve as a viable backbone for crystal materials science going forward.