MySQL transaction details

MySQL transaction details

Introduction

A transaction is a logical processing unit consisting of a group of SQL statements.

Four characteristics of transactions

Atomicity:
	Either all succeed or all fail. Undo log achieves consistency:
	For example, the sum of the two amounts before and after the transfer remains unchanged. Isolation: The database provides a certain isolation mechanism to ensure that the transaction runs in an "independent" environment that is not affected by external concurrent operations. Lock, mvcc multi-version concurrency control. Durable: Transaction submission persists to disk redo log

Transaction Isolation Level

There are four transaction isolation levels in the database, namely read uncommitted, read committed, repeatable read, and serialization. Different isolation levels may cause dirty reads, phantom reads, non-repeatable reads, and other related problems. Therefore, when choosing an isolation level, you should decide based on the application scenario and use different isolation levels.

Isolation Level Dirty Read Non-repeatable read Phantom Read
READ-UNCOMMITTED
READ-COMMITTED ×
REPEATABLE-READ × ×
SERIALIZABLE × × ×

Problems caused by transaction isolation levels

Dirty Reads (Dirty Reads) A transaction accesses uncommitted data from another transaction:
	When a transaction is accessing data and modifying the data, and this modification has not yet been committed to the database, another transaction also accesses the data and then uses the data.
Non-Repeatable Reads (a transaction performs the same query twice, but obtains different data):
	A transaction reads some data and then reads the data it read before. It finds that the data is inconsistent with the data it read before. Update and delete phantom reads (a transaction makes the same query twice and obtains different data):
	A transaction re-reads previously queried data according to the same query conditions, but finds that other transactions have inserted new data that meets its query conditions.

verify

View the transaction isolation level show variables like 'tx_isolation';

Check whether the transaction is automatically committed show variables like 'autocommit';

Disable autocommit transaction = 0|OFF

set autocommit = 0;

Dirty read:

Set transaction isolation level A, B
	set session transaction isolation level read uncommitted;
sessionA
Start transaction start transaction;
Insert a data INSERT INTO `db_test`.`t_user`(`id`, `name`) VALUES (5, 'DuQi');
sessionB
Another connection queries select * from t_user;
	+----+----------+
	| id | name |
	+----+----------+
	| 1 | ZhangSan |
	| 2 | LiSi |
	| 3 | WangWu |
	| 4 | LaoWang |
	| 5 | DuQi |
	+----+----------+
	At this time, connection B queries the record id of the uncommitted transaction of connection A, which is 5
	
Here we verify that a session has read uncommitted data from another transaction

Non-repeatable read:

Modify the transaction isolation level set session transaction isolation level read committed;
A starts a transaction start transaction;
Verify the update B executes the query statement MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		| 5 | DuQi |
		+----+----------+
	A executes the update statement update t_user set name = 'duqi' where id = 5;
	B executes the query statement start transaction;
		MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		| 5 | DuQi |
		+----+----------+
	A commits the transaction commit;
	B executes the query statement (the results of two queries for the same transaction are inconsistent)
		MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		| 5 | duqi |
		+----+----------+
Continue to verify the deletion. A starts a transaction. B starts a transaction. start transaction;
	A deletes a record delete from t_user where id = 5;
	Transaction B query is normal, and the deleted records are still in MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		| 5 | DuQi |
		+----+----------+
	A commit;
	B continues to query and finds that the results of multiple queries on the same transaction are inconsistent. MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		+----+----------+

Verify the insertion of A and B. Start transaction. start transaction;
	A inserts records INSERT INTO `db_test`.`t_user`(`id`, `name`) VALUES (5, 'DuQi');
	B queries MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		+----+----------+	
	A commits the transaction commit;
	B query can also query the transaction submitted by A MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		| 5 | DuQi |
		+----+----------+

Phantom reading:

Modify the transaction isolation level set session transaction isolation level repeatable read;
A and B start transaction start transaction;
A inserts a piece of data	
	INSERT INTO `db_test`.`t_user`(`id`, `name`) VALUES (5, 'DuQi');
B Query MySQL [db_test]> select * from t_user;
	+----+----------+
	| id | name |
	+----+----------+
	| 1 | ZhangSan |
	| 2 | LiSi |
	| 3 | WangWu |
	| 4 | LaoWang |
	+----+----------+
A commits the transaction commit;
B transaction query MySQL [db_test]> select * from t_user;
	+----+----------+
	| id | name |
	+----+----------+
	| 1 | ZhangSan |
	| 2 | LiSi |
	| 3 | WangWu |
	| 4 | LaoWang |
	| 5 | DuQi |
	+----+----------+
It may be found that between different transactions, the insertion can be queried. Let's continue to verify the update and deletion. A and B start the transaction. A updates update t_user set name = 'duqi' where id = 5;
	B Query MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		| 5 | DuQi |
		+----+----------+	
	A commits the transaction
	B continues to query MySQL [db_test]> select * from t_user;
		+----+----------+
		| id | name |
		+----+----------+
		| 1 | ZhangSan |
		| 2 | LiSi |
		| 3 | WangWu |
		| 4 | LaoWang |
		| 5 | DuQi |
		+----+----------+
	Let's continue to verify the deletion. A and B start transactions. Transaction A executes the delete operation delete from t_user where id = 5;
		Transaction B executes the query MySQL [db_test]> select * from t_user;
			+----+----------+
			| id | name |
			+----+----------+
			| 1 | ZhangSan |
			| 2 | LiSi |
			| 3 | WangWu |
			| 4 | LaoWang |
			| 5 | duqi |
			+----+----------+
		A commits the transaction, and B continues to query MySQL [db_test]> select * from t_user;
			+----+----------+
			| id | name |
			+----+----------+
			| 1 | ZhangSan |
			| 2 | LiSi |
			| 3 | WangWu |
			| 4 | LaoWang |
			| 5 | duqi |
			+----+----------+
You may find that the REPEATABLE-READ transaction isolation level solves the problems of deletion and update, but the problem of insertion still exists.

MVCC

Multi-version concurrency control. MVCC is a concurrency control method, generally used in database management systems to implement concurrent access to databases and transactional memory in programming languages.

The implementation of mvcc in MySQL INNODB is mainly to improve the concurrent performance of the database and to handle read-write conflicts in a better way, so that even when there is a read-write conflict, non-locking and non-blocking concurrent reading can be achieved.

Before understanding mvcc, you must first understand two concepts: what is current read and what is snapshot read.

Current Reading

Read the latest version of the data

Operations such as select lock in share mode (shared lock), select for update; update, insert, delete (exclusive lock) are all current reads.
Why is it called Current Reading?
	That is, it reads the latest version of the record. When reading, it must ensure that other concurrent transactions cannot modify the current record, and the read record will be locked.

Snapshot Read

Read historical version data

For example, an unlocked select operation is a snapshot read, which is a non-blocking read without locking.
The premise of snapshot read is that the isolation level is not serial level. Snapshot read at serial level will degenerate into current read.
The reason for snapshot read is to improve concurrency performance. The implementation of snapshot read is based on multi-version concurrency control, namely MVCC. MVCC can be considered a variant of row lock, but it avoids locking operations in many cases and reduces overhead.

Current read, snapshot read, and MVCC relationship

MVCC multi-version concurrency control refers to maintaining multiple versions of a data so that there is no conflict in read and write operations. Snapshot read is a non-blocking read function of MySQL to implement MVCC.
The specific implementation of the mvcc module in Mysql is implemented by three implicit fields, undo logs, and readview components.

One more thing to add here: one of the three implicit fields is the unique identifier of the column. Some students must add a primary key (columns depend on the primary key) when designing a table, even if it is almost useless. In fact, for configuration tables, there is no need to add a primary key for tables that are rarely added or deleted. When inserting data, MySQL will determine whether the table has a primary key. If there is a primary key, it will use the primary key as the unique identifier. If there is no primary key, a 7-byte primary key will be automatically generated. Therefore, the rationality of the table should be designed according to different usage scenarios.

Problems that mvcc solves

Concurrent scenarios

1. Read-read: There is no problem and no concurrency control is required. 2. Read-write: There are thread safety issues, which may cause transaction isolation level problems, and may encounter dirty reads, non-repeatable reads, and phantom reads. 3. Write-write: There are thread safety issues and there may be update loss problems.

Problems solved

1. When reading and writing the database concurrently, it is possible to do so without blocking the write operation during the read operation, and the write operation does not need to block the read operation, thereby improving the performance of concurrent reading and writing of the database. 2. It solves transaction isolation problems such as dirty reads, phantom reads, and non-repeatable reads, but it cannot solve the problem of update loss.

MVCC Implementation Principle

The implementation principle of mvcc mainly relies on three hidden fields, undolog, and read view in the record.

Hidden Fields

In addition to our custom fields, row records also have fields implicitly defined by the database, such as DB_TRX_ID, BD_ROLL_PTR, DB_ROW_ID, etc.

DB_TRX_ID Most recently modified transaction id:
	6 bytes, records the transaction ID that created this record or last modified this record
DB_ROLL_PTR rollback pointer:
	7 bytes, pointing to the previous version of this record, used to cooperate with undolog, pointing to the previous old version DB_ROW_ID hidden primary key:
	6 bytes. If the database table does not have a primary key, InnoDB will automatically generate a 6-byte row_id.

undo log

Undo log is called rollback log, which means a convenient rollback log generated when insert, delete, and update operations are performed.

When performing insert operations, the undo log generated is only needed when the transaction is rolled back, and can be discarded immediately after the transaction is committed. When performing update and delete operations, the undo log generated is not only needed when the transaction is rolled back, but also when the snapshot is read, so it cannot be deleted casually. Only when the snapshot read or transaction rollback does not involve the log, the corresponding log will be cleared uniformly by the purge thread (when data is updated or deleted, only the old record is set. If the deleted_id of a record is true, and DB_TRX_ID is visible to the read view of the purge thread, then this record can definitely be cleared)

principle

When performing an insert operation, a corresponding delete statement will be generated. When performing a delete operation, the insert statement of the original data will be backed up. When performing an update, the update statement of the original data will be recorded. This operation is convenient for record rollback.

read View

READ View is a read view generated when a transaction performs a snapshot read operation. At the moment the transaction executes the snapshot, a current snapshot of the data system is generated, which records and maintains the ID of the currently active transaction in the system. The transaction ID value is incremented.

DB_ROW_ID DB_TRX_ID DB_ROLL_PTR c_name i_age
1 1 zhangsan1 18
2 2 1 zhangsan2 19
The biggest function of READ VIEW is to make visibility judgments. That is to say, when a transaction is executing a snapshot read, a Read View is created for the record, and it is used as a condition to determine which version of the data the current transaction can see. It is possible that the latest data is read, or it is possible that the latest data is read, or it is possible that a certain version of the data in the undolog of the current record is read. The visibility algorithm followed by read view is mainly to take out the DB_TRX_ID in the latest record of the data to be modified, and compare it with the ID of other active transactions in the system. If the DB_TRX_ID is compared with the attributes of READ VIEW and does not meet the visibility, then the DB_TRX_ID in the undolog is taken out through the DB_ROLL_PTR rollback pointer for comparison, that is, the DB_TRX_ID in the linked list is traversed until a DB_TRX_ID that meets the conditions is found. The old record where this DB_TRX_ID is located is the latest old version of the data that the current transaction can see.

Visibility rules

Before understanding visibility rules, you must first understand the three global properties in Read View.

trx_list:
	A list of values ​​used to maintain the transaction IDs that are active in the system at the time the Read View is generated
up_limit_id:
	Record the minimum ID of the transaction ID in the trx_list list
low_limit_id:
	The system has not yet allocated the next transaction ID when the Read View is generated

Comparison Rules

1. First, determine whether DB_TRX_ID < up_limit_id. If it is less than up_limit_id, the current transaction can see the record where DB_TRX_ID is located. If it is greater than or equal to, proceed to the next determination. 2. Determine whether DB_TRX_ID >= low_limit_id. If it is greater than or equal to, it means that the record where DB_TRX_ID is located only appears after the Read View is generated, so it is definitely not visible to the current transaction. If it is less than, proceed to the next determination. 3. Determine whether DB_TRX_ID is in an active transaction. If it is, it means that at the time of Read View generation, this transaction is still active and has not been committed. The modified data cannot be seen by the current transaction. If it is not, it means that this transaction has been committed before Read View is generated, so the modified result can be seen.

This is the end of this article about the detailed introduction of MySQL transactions. For more relevant MySQL transaction content, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope everyone will support 123WORDPRESS.COM in the future!

You may also be interested in:
  • MySQL Database Indexes and Transactions
  • A brief analysis of the underlying principles of MySQL transactions and isolation levels
  • MySQL transaction analysis
  • Detailed explanation of transactions and indexes in MySQL database
  • How is MySQL transaction isolation achieved?
  • In-depth analysis of MySQL transactions

<<:  Example of how to implement embedded table with vue+elementUI

>>:  Summary of precautions when writing EDM in HTML (general precautions for sending emails)

Recommend

Methods and steps for Etcd distributed deployment based on Docker

1. Environmental Preparation 1.1 Basic Environmen...

Vendor Prefix: Why do we need a browser engine prefix?

What is the Vendor Prefix? Vendor prefix—Browser ...

Improving the effect of hyperlinks in web design and production

Hyperlinks enable people to jump instantly from pa...

HTML code text box limit input text box becomes gray limit text box input

Method 1: Set the readonly attribute to true. INPU...

mysql method to recursively search for all child nodes of a menu node

background There is a requirement in the project ...

Tutorial on using the frameset tag in HTML

Frameset pages are somewhat different from ordina...

Windows Server 2008 R2 Multi-User Remote Desktop Connection Licensing

At work, we often need remote servers and often e...

Introduction to Computed Properties in Vue

Table of contents 1. What is a calculated propert...

Steps to package and deploy the Vue project to the Apache server

In the development environment, the vue project i...

Tutorial on building file sharing service Samba under CentOS6.5

Samba Services: This content is for reference of ...

Detailed explanation of Mysql transaction processing

1. MySQL transaction concept MySQL transactions a...

MySQL 8.0.18 installation tutorial under Windows (illustration)

Download Download address: https://dev.mysql.com/...

How to quickly build a LAMP environment on CentOS platform

This article uses an example to describe how to q...