OceanBase分布式关系数据库架构与技术

阳振坤; 杨传辉; 韩富晟; 王国平; 杨志丰; 成肖君

doi:10.7544/issn1000-1239.202330835

OceanBase分布式关系数据库架构与技术

Architecture and Technology of OceanBase Distributed Relational Database

摘要

摘要: 关系数据库是当今社会的关键信息基础设施，互联网和数字化带来了高并发和海量数据，传统关系数据库均为集中式架构，处理能力和存储容量都捉襟见肘. OceanBase分布式关系数据库基于通用PC服务器，不仅实现了在线水平伸缩，还实现了机房故障自动无损容灾以及高倍率数据压缩等，已经应用于金融、政务、通信和互联网等行业. 介绍了OceanBase分布式关系数据库的系统架构和关键技术，包括分布式事务处理、基于LSM-tree 的存储系统以及分布式SQL优化器. 详细阐述了OceanBase数据库的高可用和数据一致性，包括RPO为0和RTO小于8 s. 也介绍了OceanBase数据库多租户机制，即采用了集群内原生多租户设计，在集群内实现多个互相独立的数据库服务. 基于Sysbench和TPC-H评测基准，对比实验结果表明：1）在单机模式下，OceanBase的性能是MySQL的1.27倍至2倍多；2）在单主模式下，OceanBase的性能是MySQL的1.25倍至近2倍；3）在多主模式下，OceanBase的性能是MySQL的1.09倍至3.1倍，对于OLAP的复杂查询，OceanBase 的性能是MySQL 的6 倍到327倍.

Abstract: Relational database is the key information infrastructure of today’s society. The Internet and digitization have brought high concurrency and massive data. Due to their centralized architectures, the processing power and storage capacity of traditional relational databases are stretched. OceanBase is a distributed relational database based on commodity PC servers. It achieves online horizontal scalability, automatic lossless disaster recovery from data center failure and high-ratio data compression. It has been used in finance, government affairs, telecommunication systems, Internet, etc. We introduce the architecture and some key technologies of OceanBase, including distributed transaction processing, LSM-tree-based storage system and distributed SQL optimizer. In addition, we explain in detail the high availability and data consistency of OceanBase, which can ensure that RPO is 0 and RTO is less than 8 seconds. At the same time, it also introduces OceanBase’s multi-tenant mechanism, which adopts a native multi-tenant design within the cluster to implement multiple independent database services in the cluster. Based on the Sysbench and TPC-H evaluation benchmarks, comparative experimental results show that 1) in a stand-alone mode, the performance of OceanBase is 1.27 times to over 2 times that of MySQL; 2) in a single-master mode, the performance of OceanBase is 1.25 times to nearly 2 times that of MySQL; 3) in a multi-master mode, the performance of OceanBase is 1.09 to 3.1 times that of MySQL, and for complex OLAP queries, the performance of OceanBase is 6 to 327 times that of MySQL.

HTML全文

参考文献(30)

施引文献

资源附件(1)