A Transformer Model for Predicting Chemical Reaction Products from Generic Templates

📅 2025-03-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Reaction product prediction faces a dual challenge: template-based methods suffer from poor generalizability, while template-free approaches exhibit limited accuracy. This paper introduces the Benchmark Reaction Set (BRS)—a compact, high-coverage set of 20 universal reaction templates—and ProPreT5, a customized T5 model that, for the first time, integrates template-guided decoding with end-to-end sequence-to-sequence learning while preserving chemical validity. Built upon SMILES representations, ProPreT5 leverages large-scale pretraining followed by template-constrained fine-tuning. Experiments across multiple benchmarks demonstrate that ProPreT5 significantly outperforms state-of-the-art methods in top-k accuracy, chemical validity, and reaction realism. Crucially, it overcomes the long-standing trade-off between template rigidity and model generalizability, establishing a new paradigm for data-efficient, chemically grounded reaction prediction.

Technology Category

Application Category

📝 Abstract
The accurate prediction of chemical reaction outcomes is a major challenge in computational chemistry. Current models rely heavily on either highly specific reaction templates or template-free methods, both of which present limitations. To address these limitations, this work proposes the Broad Reaction Set (BRS), a dataset featuring 20 generic reaction templates that allow for the efficient exploration of the chemical space. Additionally, ProPreT5 is introduced, a T5 model tailored to chemistry that achieves a balance between rigid templates and template-free methods. ProPreT5 demonstrates its capability to generate accurate, valid, and realistic reaction products, making it a promising solution that goes beyond the current state-of-the-art on the complex reaction product prediction task.
Problem

Research questions and friction points this paper is trying to address.

Predicting chemical reaction outcomes accurately
Overcoming limitations of specific and template-free methods
Generating valid and realistic reaction products efficiently
Innovation

Methods, ideas, or system contributions that make the work stand out.

ProPreT5: T5 model tailored for chemistry
Broad Reaction Set: 20 generic reaction templates
Balances rigid templates and template-free methods
🔎 Similar Papers
No similar papers found.
D
Derin Ozer
Univ Angers, LERIA, SFR MATRIX, F-49000 Angers, France
S
Sylvain Lamprier
Univ Angers, LERIA, SFR MATRIX, F-49000 Angers, France
T
Thomas Cauchy
Univ Angers, CNRS, MOLTECH-ANJOU, SFR MATRIX, F-49000 Angers, France
Nicolas Gutowski
Nicolas Gutowski
Université d'Angers / LERIA
Recommendation systemsMulti-armed banditsMulticriteria optimizationGenerative AIPINNs
B
Benoit Da Mota
Univ Angers, LERIA, SFR MATRIX, F-49000 Angers, France