innodb_flush_method value method (example explanation)

innodb_flush_method value method (example explanation)

Several typical values ​​of innodb_flush_method

fsync: InnoDB uses the fsync() system call to flush both the data and log files. fsync is the default setting.

O_DSYNC: InnoDB uses O_SYNC to open and flush the log files, and fsync() to flush the data files. InnoDB does not use O_DSYNC directly because there have been problems with it on many varieties of Unix.

O_DIRECT: InnoDB uses O_DIRECT (or directio() on Solaris) to open the data files, and uses fsync() to flush both the data and log files. This option is available on some GNU/Linux versions, FreeBSD, and Solaris.

How to get the value, the MySQL official document recommends this

How each setting affects performance depends on hardware configuration and workload. Benchmark
your particular configuration to decide which setting to use, or whether to keep the default setting.
Examine the Innodb_data_fsyncs status variable to see the overall number of fsync() calls for
each setting. The mix of read and write operations in your workload can affect how a setting performs.
For example, on a system with a hardware RAID controller and battery-backed write cache, O_DIRECT
can help to avoid double buffering between the InnoDB buffer pool and the operating system's file
system cache. On some systems where InnoDB data and log files are located on a SAN, the default
value or O_DSYNC might be faster for a read-heavy workload with mostly SELECT statements. Always
test this parameter with hardware and workload that reflect your production environment

In other words, the specific value depends on the hardware configuration and workload, and it is best to do a stress test to decide. However, generally speaking, in a Linux environment with a raid controller and write-back write policy, o_direct is a better choice; if the storage medium is SAN, it may be better to use the default fsync or osync.

Generally speaking, it seems that most people use the value o_direct, with a raid card at the bottom and the read-write policy set to write-back. When using sysbench to stress test the oltp type, I found that o_direct does have better performance than fsync, and it seems to be suitable for most scenarios. However, I recently encountered such a SQL statement, and the customer feedback was very slow. Under the same memory, the cloud host built by it executed relatively faster. Later, I found that the huge performance difference was mainly caused by the different setting values ​​of innodb_flush_method.

Test scenario 1

innodb_flush_method is the default value, i.e. fsync, the cache pool is 512M, the table data volume is 1.2G, excluding the influence of the cache pool, the stable result

mysql> show variables like '%innodb_flush_me%';
+---------------------+-------+
| Variable_name | Value |
+---------------------+-------+
| innodb_flush_method | |
+---------------------+-------+
1 row in set (0.00 sec)


mysql> SELECT sql_no_cache SUM(outcome)-SUM(income) FROM journal where account_id = '1c6ab4e7-main';
+--------------------------+
| SUM(outcome)-SUM(income) |
+--------------------------+
|-191010.51 |
+--------------------------+
1 row in set (1.22 sec)


mysql> SELECT sql_no_cache SUM(outcome)-SUM(income) FROM journal where account_id = '1c6ab4e7-main';
+--------------------------+
| SUM(outcome)-SUM(income) |
+--------------------------+
|-191010.51 |
+--------------------------+
1 row in set (1.22 sec)
mysql> explain SELECT sql_no_cache SUM(outcome)-SUM(income) FROM journal where account_id = '1c6ab4e7-main';
+----+-------------+---------+------+---------------+------------+---------+-------+--------+-----------------------+
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
+----+-------------+---------+------+---------------+------------+---------+-------+--------+-----------------------+
| 1 | SIMPLE | journal | ref | account_id | account_id | 62 | const | 161638 | Using index condition |
+----+-------------+---------+------+---------------+------------+---------+-------+--------+-----------------------+
1 row in set (0.03 sec)

Test scenario 2

innodb_flush_method is changed to o_direct, eliminating the influence of the cache pool, and the result after stabilization

mysql> show variables like '%innodb_flush_me%';
+---------------------+----------+
| Variable_name | Value |
+---------------------+----------+
| innodb_flush_method | O_DIRECT |
+---------------------+----------+
1 row in set (0.00 sec)


mysql> SELECT sql_no_cache SUM(outcome)-SUM(income) FROM journal where account_id = '1c6ab4e7-main';
+--------------------------+
| SUM(outcome)-SUM(income) |
+--------------------------+
|-191010.51 |
+--------------------------+
1 row in set (3.22 sec)


mysql> SELECT sql_no_cache SUM(outcome)-SUM(income) FROM journal where account_id = '1c6ab4e7-main';
+--------------------------+
| SUM(outcome)-SUM(income) |
+--------------------------+
|-191010.51 |
+--------------------------+
1 row in set (3.02 sec)


mysql> explain SELECT sql_no_cache SUM(outcome)-SUM(income) FROM journal where account_id = '1c6ab4e7-main';
+----+-------------+---------+------+---------------+------------+---------+-------+--------+-----------------------+
| id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra |
+----+-------------+---------+------+---------------+------------+---------+-------+--------+-----------------------+
| 1 | SIMPLE | journal | ref | account_id | account_id | 62 | const | 161638 | Using index condition |
+----+-------------+---------+------+---------------+------------+---------+-------+--------+-----------------------+
1 row in set (0.00 sec)

Comparison of results:

The execution plans of the two are exactly the same, but the performance is very different. The query results when the database is first started are also very different, and o_direct is also very different ( test results omitted). I don't quite understand why in this case, the reading efficiency is much higher because of the extra layer of operating system cache. The production environment settings must be based on the stress test results and the actual effects, and you cannot blindly trust the experience values.

Improvement measures:

Without changing innodb_flush_method, this SQL statement can be further optimized by adding a composite index (account_id, outcome, income) to perform a covering index scan, which can greatly reduce the response time.

The above innodb_flush_method value method (example explanation) is all the content that the editor shares with you. I hope it can give you a reference. I also hope that you will support 123WORDPRESS.COM.

<<:  Introduction to Jenkins and how to deploy Jenkins with Docker

>>:  How to solve the problem of FileZilla_Server:425 Can't open data connection

Recommend

Nginx location matching rule example

1. Grammar location [=|~|~*|^~|@] /uri/ { ... } 2...

How to configure tomcat server for eclipse and IDEA

tomcat server configuration When everyone is lear...

Learn one minute a day to use Git server to view debug branches and fix them

Debug branch During the normal development of a p...

How to create a test database with tens of millions of test data in MySQL

Sometimes you need to create some test data, base...

Detailed explanation of MySQL binlog usage

binlog is a binary log file that records all DML ...

Vue-cli creates a project and analyzes the project structure

Table of contents 1. Enter a directory and create...

How to use CocosCreator to create a shooting game

Analyze the production steps: 1. Prepare resource...

Implementation of Docker deployment of Django+Mysql+Redis+Gunicorn+Nginx

I. Introduction Docker technology is very popular...

Vue uses GraphVis to develop an infinitely expanded relationship graph

1. Go to the GraphVis official website to downloa...

Linux service monitoring and operation and maintenance

Table of contents 1. Install the psutil package S...

Detailed explanation of the use of state in React's three major attributes

Table of contents Class Component Functional Comp...