Implementation of transactionsThe redo log ensures the persistence of transactions, and the undo log is used to help with transaction rollback and MVCC functions. InnoDB storage engine architecture redo logWrite Ahead Log Strategy When a transaction is committed, the redo log is written first and then the page is modified; when data is lost due to a crash, the data can be recovered through the redo log.
Redo log files: By default, there are two files named ib_logfile1 and ib_logfile2 in the data directory of the InnoDB storage engine. Each InnoDB storage engine has at least one redo log file group, and each file group has at least two redo log files. The following figure 1 shows that the redo log group runs in a circular writing mode. The InnoDB storage engine writes ib_logfile1 first, and when it reaches the end of the file, it switches to the redo log file ib_logfile2. As shown in Figure 2, adding an OS Buffer helps to understand the fsync process. The log group is called the redo log group, which is a logical concept. The InnoDB storage engine actually has only one log group. The first redo log file in the log group stores four 512-byte blocks in its first 2KB: Redo log buffer flushed to disk The following three situations refresh:
To supplement the second of the three situations mentioned above, the triggering of the disk write process is controlled by the parameter innodb_flush_log_at_trx_commit, which indicates how to handle the redo log when the commit operation is performed. The valid values of the parameter innodb_flush_log_at_trx_commit are 0, 1, and 2.
0, when the database crashes, some logs are not flushed to disk, so the transactions of the last period of time will be lost. The following diagram helps to understand Redo log blocksIn the InnoDB storage engine, redo logs are stored in 512 bytes. This means that the redo log cache and redo log files are saved in blocks, with each block being 512 bytes. The redo log header is 12 bytes and the redo log tail is 8 bytes, so each redo log block can actually store 492 bytes. Redo log formatThe redo log is recorded in page-based format. By default, the page size of InnoDB is 16KB (controlled by the innodb_page_size variable). A page can store a large number of log blocks (each 512 bytes), and the log blocks record the changes in the data pages. The format of the log body is divided into 4 parts:
The following figures show the general recording methods of insert and delete. redo log recoveryThe following LSN (Log Sequence Number) represents the checkpoint. When the database crashes at LSN 10000, the recovery operation only recovers logs in the range of LSN10000-LSN13000. undo logThe role of undo log Undo is a logical log that simply restores the database to its original state logically; all modifications are logically canceled, but the data structure and the page itself may be different after the rollback. Undo log has two functions: providing rollback and multiple row version control (MVCC). When the InnoDB storage engine rolls back, for each INSERT, a DELETE is completed; for each DELETE, an INSERT is executed; for each UPDATE, a reverse UPDATE is executed to put the previous row back. MVCC: When a user reads a row of records, if the record has been occupied by other transactions, the current transaction can read the previous row version information through undo, thereby achieving non-locking reading. Undo log storage methodThe innodb storage engine manages undo in segments. The rollback segment is called a rollback segment, and each rollback segment has 1024 undo log segments. In the old version, only one rollback segment was supported, so only 1024 undo log segments could be recorded. Later, MySQL 5.5 can support 128 rollback segments, that is, support 128*1024 undo operations. You can also customize the number of rollback segments through the variable innodb_undo_logs (before version 5.6, this variable was innodb_rollback_segments). The default value is 128. Undo logs are stored in the shared tablespace by default. Transaction commit undo log processing When a transaction commits, the InnoDB storage engine does the following two things:
When a transaction is committed, the undo log is first put into the linked list, and then it is determined whether the used space of the undo page is less than 3/4. If so, it means that the undo page can be reused, and then the new undo log is recorded after the current undo log. Undo log is divided into:
Because of transaction isolation, the insert undo log is not visible to other transactions, so the undo log can be deleted directly after the transaction is committed without the need for a purge operation. Update undo log records the undo logs generated by delete and update operations. The undo log may need to provide an MVCC mechanism, so it cannot be deleted when it is submitted. There are two cases for update:
When InnoDB purges, it first searches for the undo log in the history list, and then searches for the undo log in the undo page; this can avoid a large number of random read operations, thereby improving purge efficiency. MVCC (Multi-version Concurrency Control)MVCC actually adds two hidden columns after each row of records to record the creation version number and the deletion version number. When each transaction is started, it has a unique incrementing version number. MVCC only works at REPEATABLE READ and READ COMMITTED isolation levels. There is no version problem for read uncommitted, and serialization locks all read rows. Example: Insert operation: The creation version number of the record is the transaction version number If you insert a record, the transaction id is assumed to be 1, and the creation version number is also 1 |
id | name | create version | delete version |
---|---|---|---|
1 | test | 1 |
Update operation: first mark the old version number as deleted, the version number is the current version number, and then insert a new record
For example, transaction 2 updates the name field
update table set name = 'new test' where id = 1;
The original record is marked for deletion, with a deletion version of 2, and a new record is inserted, with a creation version of 2
id | name | create version | delete version |
---|---|---|---|
1 | test | 1 | 2 |
1 | new test | 2 |
Deletion operation: Use the transaction version as the deletion version number
For example, transaction 3 deletes the record
delete from table where id = 1;
id | name | create version | delete version |
---|---|---|---|
1 | test | 2 | 3 |
Records that meet the following two conditions can be queried by the transaction:
MVCC benefits: reduce lock contention and improve performance
Concept and function of binary files
The binary log records all operations that change the MySQL database (excluding SELECT, SHOW, etc., because the data is not modified)
The main functions of binary files are:
Three binary file formats
MySQL 5.1 introduced the binlog_format parameter, which can be set to STATEMENT, ROW, and MIX.
(Binary files are used for POINT-IN-TIME (PIT) recovery and the establishment of a master-slave replication environment.
group commit
If the transaction is not a read-only transaction, an fsync operation is required each time the transaction is committed to ensure that the redo log has been written to disk. However, the disk fsync performance is limited. To improve the disk fsync efficiency, current databases provide a group commit function, which means that multiple transaction logs can be refreshed at one time to ensure that they are written to the file.
For InnoDB group commit, a two-stage operation is performed:
Before InnoDB1.2, the group commit function fails when binary files are opened:
After opening the binary file, the steps are as follows:
1) When a transaction is committed, the InnoDB storage engine performs a prepare operation
2) MySQL database upper layer writes binary files
3) InnoDB writes logs to redo log files
a) Modify the information corresponding to the transaction in memory and write the log to the redo log buffer b) Call fsync to ensure that the log is written from the redo log buffer to disk
To ensure that the writing order of the upper binary files of the MySQL database is consistent with the InnoDB transaction commit order, MySQL uses the prepare_commit_mutex lock internally. Therefore, step a) in step 3) cannot be performed when other transactions are executing step b), which causes the row commit function to fail.
The solution is BLGC (Binary Log Group Commit)
The MySQL 5.6 BLGC implementation is divided into three stages:
This is the end of this article about the detailed explanation of MySQL redo log, undo log and binlog. For more relevant MySQL redo log, undo log and binlog content, please search 123WORDPRESS.COM's previous articles or continue to browse the following related articles. I hope everyone will support 123WORDPRESS.COM in the future!
<<: Some references about colors in HTML
>>: Summary of 10 amazing tricks of Element-UI
Request logic Front-end --> Request nginx via ...
<br />When uploading on some websites, a [Se...
Table of contents 1. Write in front 2. Overlay to...
I have been learning porters recently. I feel lik...
The GtkTreeView component is an advanced componen...
Writing a Dockerfile Taking the directory automat...
Web Services are concerned with application-to-ap...
This article shares with you how to use thinkphp5...
count(*) accomplish 1. MyISAM: Stores the total n...
Make a blank space for Taobao: When you shrink th...
Because the version I used when I was learning wa...
cause The reason for writing this blog is that I ...
1. Overview The information_schema database is th...
1. Add MySQL Yum repository MySQL official websit...
A WeakMap object is a collection of key/value pai...