MariaDB 和 GreatSQL 性能差异背后的真相

作者：GreatSQL

2024-10-23
福建
本文字数：3823 字
阅读完需：约 13 分钟

MariaDB 和 GreatSQL 性能差异背后的真相

前言

最近项目上遇到了两次 MariaDB 和 GreatSQL 的对比，GreatSQL 受到客户质疑，最后经过排查抓到性能差异背后的真相。基于此做个分享。

版本

MariaDB 版本:10.3.39 该版本为麒麟 V10 yum 安装

GreatSQL 版本:GreatSQL-8-0-32-25

问题一:MariaDB 和 GreatSQL 使用 sysbench 压测性能相差 100 倍

某天某客户反馈他们的 sysbench 压测结果，MariaDB 和 GreatSQL 压测性能相差 100 倍。

信息收集

架构:均为单机

版本如上文版本所示

配置文件:MariaDB yum 安装后/etc/my.cnf 未曾更改

GreatSQL 配置文件为 GreatSQL 用户手册中的配置文件链接如下:

https://GreatSQL.cn/docs/8.0.32-25/3-quick-start/3-4-quick-start-with-cnf.html

两者在同一台机器上轮流运行（即开启 MariaDB 时关闭 GreatSQL，开启 GreatSQL 时关闭 MariaDB）

ps:没收集硬件信息是因为都是运行在麒麟 V10 上，相同规格硬件运行，这部分已经排除影响。

思考

遇到这么夸张的性能差距，第一反应是 innodb_buffer_pool，binlog 开关，sync_binlog 和 innodb_flush_log_at_trx_commit 这几个和性能关系较大的参数

复现测试

MariaDB 10.3.39 安装

$ yum -y install MariaDB$ yum -y install MariaDB.server #启动MariaDB$ systemctl start MariaDB #查看MariaDB的状态$ systemctl status MariaDB

复制代码

查看相关参数

# 登录，初始化首次登录MariaDB的密码为空$ mysql -uroot -p -S /var/lib/mysql/mysql.sock
#查看binlogMariaDB [(none)]> show variables like 'log_bin';+---------------+-------+| Variable_name | Value |+---------------+-------+| log_bin       | OFF   |+---------------+-------+1 row in set (0.003 sec)
# 查看sync_binlog MariaDB [(none)]> show variables like 'sync_binlog';+---------------+-------+| Variable_name | Value |+---------------+-------+| sync_binlog   | 0     |+---------------+-------+1 row in set (0.002 sec)
# 查看innodb_flush_log_at_trx_commitMariaDB [(none)]> show variables like '%trx_commit%';+--------------------------------+-------+| Variable_name                  | Value |+--------------------------------+-------+| innodb_flush_log_at_trx_commit | 1     |+--------------------------------+-------+1 row in set (0.001 sec)

复制代码

查到这里其实问题已经明朗。GreatSQL 用户手册中配置为 binlog 开启，sync_binlog 和 innodb_flush_log_at_trx_commit 均设置为 1。

GreatSQL 安装

参考用户手册进行安装

https://GreatSQL.cn/docs/8.0.32-25/3-quick-start/3-2-quick-start-with-tarball.html

sysbench 测试相关说明

针对这个情况使用 sysbench 在相同情况下做了对比测试:

innodb_buffer_pool 大小设置一致，测试组分别为:

MariaDB 和 GreatSQL 均关闭 binlog，sync_binlog 设置为 0;

MariaDB 和 GreatSQL 均开启 binlog，sync_binlog 设置为 1。

MariaDB 开启 binlog，并设置 sync_binlog=1（反向操作注释掉相应内容即可）

配置文件加入

[mysqld]log_bin=onbinlog_format=ROWsync_binlog=1

复制代码

GreatSQL 关闭 binlog，并设置为 sync_binlog=0

在配置文件中加入，并注释掉相关 binlog 参数

[mysqld]skip-log-binsync_binlog=0

复制代码

测试结果

由于文章篇幅原因这里附上测试结果，sysbench 安装测试请参考:

https://greatsql.cn/docs/8.0.32-26/10-optimze/3-1-benchmark-sysbench.html

10 张表每张表 1 千万数据，8 线程，测试结果均取第四次的结果。测试命令样例:

$ sysbench  ./oltp_read_only.lua --mysql-db=database --mysql-host=ip --mysql-port=port --mysql-user=root --mysql-password=password --tables=10 --table_size=1000000 --report-interval=2 --threads=8 --db-driver=mysql --skip-trx=off --db-ps-mode=disable --create-secondary=off --time=60 --simple-ranges=0 --sum-ranges=0 --order-ranges=0 --distinct-ranges=0 --mysql-ignore-errors=9001,9002,9000,1062 run

复制代码

MariaDB 和 GreatSQL 均关闭 binlog，sync_binlog 设置为 0

MariaDB 和 GreatSQL 均开启 binlog，sync_binlog 设置为 1

ps:在 MariaDB 和 GreatSQL 均开启 binlog，sync_binlog 设置为 1 的测试场景下，只读还存在少量的性能差异。因为测试机器为虚拟机性能有限，磁盘 I/O 等并不稳定，无法排除硬件的影响，有兴趣的朋友可以自行测试得到更为精确的性能报告

问题一小结

实际生产环境出于对数据安全的考虑，建议开启 binlog，sync_binlog 设置为 1。

开启 binlog 和 sync_binlog=1 的情况下，MariaDB 和 GreatSQL 大多数场景下性能相当，只有在只读场景下 MariaDB 略优于 GreatSQL

关闭 binlog 和 sync_binlog=0 的情况下，在 update_index 和 delete 场景下 GreatSQL 略优于 MariaDB，而只读场景下 MariaDB 略优于 GreatSQL，其余场景下二者性能相当

问题二:GreatSQL 插入大事务 hang 住，MariaDB 不会

某天某客户反馈 GreatSQL 插入大事务会 hang 住，而 MariaDB 不会。在论坛帖子求助也无果

原贴链接https://GreatSQL.cn/forum.php?mod=viewthread&tid=827&extra=&highlight=MariaDB&page=2

信息收集:

可看原贴

show processlsit 显示

update 的线程 State 整齐卡在 waiting for handler commit

hang 住的时候，错误日志打印 Warning 信息:

[Warning] [MY-013865] [InnoDB] Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity.[Warning] [MY-013865] [InnoDB] Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity.[Warning] [MY-013865] [InnoDB] Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity.[Warning] [MY-013865] [InnoDB] Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity.[Warning] [MY-013865] [InnoDB] Redo log writer is waiting for a new redo log file. Consider increasing innodb_redo_log_capacity.

复制代码

问题定位及分析：

1.其配置文件中

"innodb_io_capacity"设置为 200 过小，可以调整为 2000 或根据物理 I/O 适当调整。写负载越高的系统建议设置更高值，但不建议使用高于 20000 的值。

"innodb_redo_log_capacity"设置为 100M 过小，可调整到 1G~8G 之间，在写负载高场景中，可以调整的更大。

2.GreatSQL 中的宽表插入大量二进制数据（疑似图片或文件附件二进制内容），易造成更多的 page 页分裂，造成 redo 膨胀，redo 的写入加剧，消耗更多 innodb 的 I/O，redo 的总容量过小，InnoDB 在受到重做日志容量压力的情况下被迫把脏页刷新到磁盘，同样需要更多 innodb 的 I/O，innodb 的 I/O 瓶颈导致正常提交的事务 hang 住。

参数详解:

innodb_io_capacity:

控制 innodb 可用的总 I/O 容量

The innodb_io_capacity variable defines the overall I/O capacity available to InnoDB. It should be set to approximately the number of I/O operations that the system can perform per second (IOPS). When innodb_io_capacity is set, InnoDB estimates the I/O bandwidth available for background tasks based on the set value.
You can set innodb_io_capacity to a value of 100 or greater. The default value is 200.

译文

innodb_io_cacapacity 变量定义了 innodb 可用的总 I/O 容量。它应该设置为大约系统每秒可以执行的 I/O 操作数（IOPS）。当设置 innodb_io_capacity 时，innodb 会根据设置值估计后台任务可用的 I/O 带宽。

您可以将 innodb_io_capacity 设置为 100 或更大的值。默认值为 200。

innodb_redo_log_capacity：

redo 日志文件的总容量。

The Innodb_redo_log_capacity_resized server status variable indicates the total redo log capacity for all redo log files.
If innodb_redo_log_capacity is not defined, and if innodb_log_file_size and/or innodb_log_files_in_group is defined, then the InnoDB redo log capacity is calculated as (innodb_log_files_in_group * innodb_log_file_size). This calculation does not modify the unused innodb_redo_log_capacity setting's value.

译文

Innodb_redo_log_capacity_resized 服务器状态变量表示所有重做日志文件的总重做日志容量。

如果未定义 innodb_redo_log_capacity，并且定义了 innodb_log_file_size 和/或 innodb_log.files_in_group，则 innodb 重做日志容量计算为（innodb_log-files_in_group*innodb_log_file_size）。此计算不会修改未使用的 innodb_redo_log_capacity 设置的值。

验证

客户把对应的参数调大，hang 住的情况消失，业务能正常使用，达到预期。

问题二小结

该次问题貌似是简单的参数调整，其实其中关联到了以下内容，有兴趣的可以去了解

1.开发规范，是否允许往库宽表插入大量二进制数据，查询怎么办？

2.了解 redo 是什么，innodb master 线程有什么作用(脏页回收线程也合并到 master 中了)

3.innodb 的 IO 由哪些参数控制，调整为多大是合理值?

4.redo 的容量规格由哪些参数控制，调整为多大是合理值？

全文总结

无论是问题一中的 100 倍性能背后还是插入 hang 住，其实背后都是对产品的不了解，MariaDB 10.3.39 其实对应的是 5.7 的产品，而且其本身的默认参数调整有作弊嫌疑，默认关闭 binlog ，sync_binlog 设置为 0，checkpoint 阈值降低，提高 checkpoint 的频率等设置，面向测试环境数据是很好看的，但是增加了生产环境的风险和硬件成本。

应知其然，知其不然。

诸君共勉

发布于: 刚刚阅读数: 3

原文链接:【http://xie.infoq.cn/article/7d1e972c8c6dbb7934a742355】。文章转载请联系作者。

GreatSQL

关注

GreatSQL社区 2023-01-31 加入

GreatSQL是由万里数据库维护的MySQL分支，专注于提升MGR可靠性及性能，支持InnoDB并行查询特性，是适用于金融级应用的MySQL分支版本。社区：https://greatsql.cn/ Gitee: https://gitee.com/GreatSQL/GreatSQL

发布

暂无评论

创作场景

MariaDB 和 GreatSQL 性能差异背后的真相

MariaDB 和 GreatSQL 性能差异背后的真相

前言

版本

问题一:MariaDB 和 GreatSQL 使用 sysbench 压测性能相差 100 倍

信息收集

思考

复现测试

MariaDB 10.3.39 安装

GreatSQL 安装

sysbench 测试相关说明

MariaDB 开启 binlog，并设置 sync_binlog=1（反向操作注释掉相应内容即可）

GreatSQL 关闭 binlog，并设置为 sync_binlog=0

测试结果

MariaDB 和 GreatSQL 均关闭 binlog，sync_binlog 设置为 0

MariaDB 和 GreatSQL 均开启 binlog，sync_binlog 设置为 1

问题一小结

问题二:GreatSQL 插入大事务 hang 住，MariaDB 不会

信息收集:

问题定位及分析：

参数详解:

innodb_io_capacity:

innodb_redo_log_capacity：

验证

问题二小结

全文总结

GreatSQL

评论