OceanBase Bacchus: a High-Performance and Scalable Cloud-Native Shared Storage Architecture for Multi-Cloud

📅 2026-02-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of balancing cost and performance in shared-storage databases under high concurrency, where traditional B+ trees suffer from inefficient in-place updates and existing LSM-tree designs are hindered by complex cross-node shared logging mechanisms. The authors propose a novel LSM-tree architecture tailored for cloud object storage, which innovatively offloads a Paxos-based append-only log file system (PALF) as a dedicated shared log service. By integrating stateless compute nodes, a shared block cache, and asynchronous background tasks, the design achieves elastic decoupling of compute, caching, and storage layers. This approach effectively resolves the log-sharing bottleneck in disaggregated architectures and supports multi-cloud deployment. Experimental results demonstrate that the system matches or exceeds HBase in OLTP performance while significantly outperforming StarRocks in OLAP workloads, simultaneously reducing storage costs by 59% for OLTP and 89% for OLAP.

Technology Category

Application Category

📝 Abstract
Although an increasing number of databases now embrace shared-storage architectures, current storage-disaggregated systems have yet to strike an optimal balance between cost and performance. In high-concurrency read/write scenarios, B+-tree-based shared storage struggles to efficiently absorb frequent in-place updates. Existing LSM-tree-backed disaggregated storage designs are hindered by the intricate implementation of cross-node shared-log mechanisms, where no satisfactory solution yet exists. This paper presents OceanBase Bacchus, an LSM-tree architecture tailored for object storage provided by cloud vendors. The system sustains high-performance reads and writes while rendering compute nodes stateless through shared service-oriented PALF (Paxos-backed Append-only Log File system) logging and asynchronous background services. We employ a Shared Block Cache Service to flexibly utilize cache resources. Our design places log synchronization into a shared service, providing a novel solution for log sharing in storage-compute-separated databases. The architecture decouples functionality across modules, enabling elastic scaling where compute, cache, and storage resources can be resized rapidly and independently. Through experimental evaluation using multiple benchmark tests, including SysBench and TPC-H, we confirm that OceanBase Bacchus achieves performance comparable to or superior to that of HBase in OLTP scenarios and significantly outperforms StarRocks in OLAP workloads. Leveraging Bacchus's support for multi-cloud deployment and consistent performance, we not only retain high availability and competitive performance but also achieve substantial reductions in storage costs by 59% in OLTP scenarios and 89% in OLAP scenarios.
Problem

Research questions and friction points this paper is trying to address.

shared-storage architecture
LSM-tree
log sharing
storage-compute disaggregation
high-concurrency workloads
Innovation

Methods, ideas, or system contributions that make the work stand out.

shared-storage architecture
LSM-tree
cloud-native
log disaggregation
multi-cloud
🔎 Similar Papers
No similar papers found.
Quanqing Xu
Quanqing Xu
Ant Group
Cloud ComputingCloud StorageLarge-scale Hybrid Storage Systems
M
Mingqiang Zhuang
OceanBase, Ant Group
C
Chuanhui Yang
OceanBase, Ant Group
Q
Quanwei Wan
OceanBase, Ant Group
F
Fusheng Han
OceanBase, Ant Group
F
Fanyu Kong
OceanBase, Ant Group
H
Hao Liu
OceanBase, Ant Group
H
Hu Xu
OceanBase, Ant Group
J
Junyu Ye
OceanBase, Ant Group