Is Sequence Information All You Need for Bayesian Optimization of Antibodies?

📅 2025-09-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates the necessity of structural information and surrogate model selection in antibody Bayesian optimization. Method: We propose the first Bayesian optimization framework integrating structural features with protein language models (PLMs), introducing a novel PLM-driven soft-constraint mechanism that implicitly encodes structural preferences to guide sequence-space search. We systematically compare three modalities—sequence-only, sequence-with-soft-constraints, and sequence-with-explicit-structure—on antibody affinity and stability optimization tasks. Results: Soft constraints enable sequence-only optimization to match the peak performance of structure-dependent methods, lagging only marginally in early optimization stages. Crucially, they substantially reduce reliance on costly 3D structure prediction or experimental determination. Our core contribution challenges the “structural necessity” assumption, establishing PLM-based soft constraints as an efficient, high-fidelity alternative paradigm for antibody design.

Technology Category

Application Category

📝 Abstract
Bayesian optimization is a natural candidate for the engineering of antibody therapeutic properties, which is often iterative and expensive. However, finding the optimal choice of surrogate model for optimization over the highly structured antibody space is difficult, and may differ depending on the property being optimized. Moreover, to the best of our knowledge, no prior works have attempted to incorporate structural information into antibody Bayesian optimization. In this work, we explore different approaches to incorporating structural information into Bayesian optimization, and compare them to a variety of sequence-only approaches on two different antibody properties, binding affinity and stability. In addition, we propose the use of a protein language model-based ``soft constraint,'' which helps guide the optimization to promising regions of the space. We find that certain types of structural information improve data efficiency in early optimization rounds for stability, but have equivalent peak performance. Moreover, when incorporating the protein language model soft constraint we find that the data efficiency gap is diminished for affinity and eliminated for stability, resulting in sequence-only methods that match the performance of structure-based methods, raising questions about the necessity of structure in Bayesian optimization for antibodies.
Problem

Research questions and friction points this paper is trying to address.

Optimizing antibody properties using Bayesian optimization methods
Incorporating structural versus sequence-only information in optimization
Evaluating necessity of structural data for antibody engineering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Incorporating structural information into Bayesian optimization
Using protein language model as soft constraint
Comparing sequence-only and structure-based optimization methods
🔎 Similar Papers
No similar papers found.