BAnG: Bidirectional Anchored Generation for Conditional RNA Design

πŸ“… 2025-02-28
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Protein-specific RNA design is hindered by reliance on extensive experimental data or high-resolution structural priors. Method: This paper proposes a conditional generative framework that requires neither labeled data nor pre-built structural models. We introduce Bidirectional Anchored Generation (BAnG), a novel paradigm jointly modeling the locality and context-dependency of functional motifs. Built upon a deep sequence generation architecture, BAnG integrates bidirectional attention, anchor-aware decoding, and a motif-aware loss function. Contribution/Results: On both synthetic benchmarks and real-world protein–RNA binding tasks, our method substantially outperforms existing approaches. It successfully generates multiple novel RNA sequences computationally validated to exhibit high binding affinity. The framework establishes a new, efficient, general-purpose, and data-light paradigm for targeted RNA design.

Technology Category

Application Category

πŸ“ Abstract
Designing RNA molecules that interact with specific proteins is a critical challenge in experimental and computational biology. Existing computational approaches require a substantial amount of experimentally determined RNA sequences for each specific protein or a detailed knowledge of RNA structure, restricting their utility in practice. To address this limitation, we develop RNA-BAnG, a deep learning-based model designed to generate RNA sequences for protein interactions without these requirements. Central to our approach is a novel generative method, Bidirectional Anchored Generation (BAnG), which leverages the observation that protein-binding RNA sequences often contain functional binding motifs embedded within broader sequence contexts. We first validate our method on generic synthetic tasks involving similar localized motifs to those appearing in RNAs, demonstrating its benefits over existing generative approaches. We then evaluate our model on biological sequences, showing its effectiveness for conditional RNA sequence design given a binding protein.
Problem

Research questions and friction points this paper is trying to address.

Design RNA sequences for protein interactions
Overcome limitations of existing computational approaches
Generate RNA sequences without extensive experimental data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep learning model for RNA sequence generation
Bidirectional Anchored Generation (BAnG) method
Generates RNA sequences without experimental data
πŸ”Ž Similar Papers
No similar papers found.