🤖 AI Summary
To address low chemical validity and difficulty in satisfying substructure constraints in molecular generation, this work introduces the first pure Transformer-based generative model explicitly designed for molecular *graph* structures—bypassing sequential representations (e.g., SMILES) to directly model atomic-bond topology and 3D geometry. The method integrates graph neural networks, learnable graph positional encodings, multi-task property prediction heads, and a reinforcement learning–driven graph editing mechanism, enabling geometry-aware and property-guided controllable generation. Evaluated on chemical engineering tasks such as solvent extraction, the model achieves 92.7% molecular validity across multiple benchmarks and reduces prediction error of target distribution coefficients by 31% versus VAE and GFlowNet baselines. It establishes a novel end-to-end paradigm for controllable molecular graph generation.
📝 Abstract
Generative deep learning has become pivotal in molecular design for drug discovery, materials science, and chemical engineering. A widely used paradigm is to pretrain neural networks on string representations of...