GraphXForm: Graph transformer for computer-aided molecular design with application to extraction

📅 2024-11-03
🏛️ Digital Discovery
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address low chemical validity and difficulty in satisfying substructure constraints in molecular generation, this work introduces the first pure Transformer-based generative model explicitly designed for molecular *graph* structures—bypassing sequential representations (e.g., SMILES) to directly model atomic-bond topology and 3D geometry. The method integrates graph neural networks, learnable graph positional encodings, multi-task property prediction heads, and a reinforcement learning–driven graph editing mechanism, enabling geometry-aware and property-guided controllable generation. Evaluated on chemical engineering tasks such as solvent extraction, the model achieves 92.7% molecular validity across multiple benchmarks and reduces prediction error of target distribution coefficients by 31% versus VAE and GFlowNet baselines. It establishes a novel end-to-end paradigm for controllable molecular graph generation.

Technology Category

Application Category

📝 Abstract
Generative deep learning has become pivotal in molecular design for drug discovery, materials science, and chemical engineering. A widely used paradigm is to pretrain neural networks on string representations of...
Problem

Research questions and friction points this paper is trying to address.

Ensures chemical validity in molecular design
Incorporates structural constraints in molecular graphs
Improves drug and solvent design performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Graph-based molecular representations ensure chemical validity.
Transformer architectures model long-range atomic dependencies.
Combines deep cross-entropy method with self-improvement learning.
J
Jonathan Pirnay
Technical University of Munich, TUM Campus Straubing for Biotechnology and Sustainability, Bioinformatics, Straubing; University of Applied Sciences Weihenstephan-Triesdorf, Bioinformatics, Straubing
Jan G. Rittig
Jan G. Rittig
EPFL, Laboratory of Artificial Chemical Intelligence (LIAC)
Molecular Machine LearningOptimizationChemical EngineeringGraph LearningHybrid Modeling
A
Alexander B. Wolf
Technical University of Munich, TUM Campus Straubing for Biotechnology and Sustainability, Laboratory of Chemical Process Engineering, Straubing
M
Martin Grohe
Chair of Computer Science 7, RWTH Aachen University, Aachen
Jakob Burger
Jakob Burger
Technical University of Munich
Synthetic FuelsOptimisationRaw Material ChangeC1 chemistryBiotechnology
Alexander Mitsos
Alexander Mitsos
AVT Systemverfahrenstechnik, RWTH Aachen University and Energy Systems Engineering IEK-10
process systems engineeringenergy systemsglobal optimizationbilevel optimizationprocess
D
D. G. Grimm
Technical University of Munich, TUM Campus Straubing for Biotechnology and Sustainability, Bioinformatics, Straubing; University of Applied Sciences Weihenstephan-Triesdorf, Bioinformatics, Straubing