The overall architecture of MySQL is divided into the server layer and the storage engine layer. The server layer includes connectors, query cache, analyzer, optimizer, executor, etc. Stored procedures, triggers, views and built-in functions are all implemented in this layer. The data engine layer is responsible for data storage and retrieval, such as InnoDB, MyISAM, Memory and other engines. After the client connects to the Server layer, the Server will call the interface provided by the data engine to make data changes. Connectors Responsible for establishing connections with clients, obtaining user permissions, and maintaining and managing connections. Use Briefly describe the long connection: Advantage: During the connection time, the client always uses the same connection, avoiding resource consumption of multiple connections. Disadvantages: When MySQL is executed, the memory used is managed by the connection object. Since it is not released for a long time, it will cause system memory overflow and be killed by the system. Therefore, it is necessary to disconnect long connections regularly or disconnect after executing large queries. After MySQL 5.7, you can use Query Cache When a query request is received, the query cache (key/value storage) will be queried to see if it has been executed. If not, follow the normal execution process. But in actual situations, it is generally not necessary to set up a query cache. Because the cache will be cleared when the table involved in the query is updated. So it is suitable for static tables. After MySQL 8.0, query cache is abolished. Analyzer Lexical analysis: Such as identifying select, table name, column name, and determining whether they exist. Syntax analysis: Determine whether the statement conforms to MySQL syntax. Optimizer Determine the use of indexes, the order of join tables, etc., and select the most optimized solution. Actuator Before executing a statement, a permission check will be performed first. If the permission is passed, the query will be performed using the interface provided by the data engine. If a slow query is set, you will see
Use profiling to view SQL execution process Turn on profiling to analyze the statement execution process: mysql> select @@profiling; +-------------+ | @@profiling | +-------------+ | 0 | +-------------+ 1 row in set, 1 warning (0.00 sec) mysql> set profiling=1; Query OK, 0 rows affected, 1 warning (0.00 sec) Execute the query statement: mysql> SELECT * FROM s limit 10; +------+--------+-----+-----+ | s_id | s_name | age | sex | +------+--------+-----+-----+ | 1 | z | 12 | 1 | | 2 | s | 14 | 0 | | 3 | c | 14 | 1 | +------+--------+-----+-----+ 3 rows in set (0.00 sec) Get profiles; mysql> show profiles; +----------+------------+--------------------------+ | Query_ID | Duration | Query | +----------+------------+--------------------------+ | 1 | 0.00046600 | SELECT * FROM s limit 10 | +----------+------------+--------------------------+ mysql> show profile; +----------------------+----------+ | Status | Duration | +----------------------+----------+ | starting | 0.000069 | | checking permissions | 0.000008 | Opening tables | 0.000018 | init | 0.000019 | System lock | 0.000010 | optimizing | 0.000004 | statistics | 0.000013 | | preparing | 0.000094 | Preparing | executing | 0.000016 | Sending data | 0.000120 | | end | 0.000010 | | query end | 0.000015 | | closing tables | 0.000014 | | freeing items | 0.000032 | | cleaning up | 0.000026 | +----------------------+----------+ 15 rows in set, 1 warning (0.00 sec) Query specific statements: mysql> show profile for query 1; +----------------------+----------+ | Status | Duration | +----------------------+----------+ | starting | 0.000069 | | checking permissions | 0.000008 | | Opening tables | 0.000018 | | init | 0.000019 | | System lock | 0.000010 | | optimizing | 0.000004 | | statistics | 0.000013 | | preparing | 0.000094 | | executing | 0.000016 | | Sending data | 0.000120 | | end | 0.000010 | | query end | 0.000015 | | closing tables | 0.000014 | | freeing items | 0.000032 | | cleaning up | 0.000026 | +----------------------+----------+ 15 rows in set, 1 warning (0.00 sec) MySQL Logging Module As mentioned earlier, MySQL is divided into the server layer and the data engine layer, and each layer also corresponds to its own log file. If the InnoDB engine is selected, the corresponding file is the redo log file. The server layer corresponds to the binlog file. As for why there are two logging systems, let's look down. redo log Redo log is a log specific to InnoDB. Why do we need to introduce redo log? Imagine a scenario where MySQL needs to write data to disk files to ensure persistence. We know that when writing to disk, file IO and search operations will be performed. If this is the case for every update operation, the overall efficiency will be extremely low and it will be impossible to use it. Since writing directly to disk is not possible, the solution is to write it into memory first, and then update it to disk when the system is idle. However, simply updating the memory is not enough. If the system crashes or restarts abnormally, the data in the memory that has not been written to the disk will be lost, and data consistency will become a problem. This is when the redo log comes into play. When an update operation occurs, InnoDb will first write the redo log (recording how the data has changed), then update the memory, and finally write it to the disk at an appropriate time, usually when the system is idle. The operation of writing logs first and then writing to disk is the commonly mentioned WAL (Write-Ahead-Logging) technology. The emergence of redo log not only greatly improves efficiency, but also ensures that MySQL has crash-safe capabilities, and no data will be lost in abnormal situations. In the specific implementation, the size of the redo log is fixed. It can be configured as a group of 4 files, each file is 1GB, and the four files are written in a loop during updates. write pos records the current writing position and moves back after writing. When the end of the 4th file is written, it is rewritten from position 0. The check point indicates the current position that can be erased. When the data is updated to the disk, the check point moves backward. The position between write pos and check point is the space where update operations can be recorded. When the write pos catches up with the check point, no new operations can be performed, so let the check point write some data first. You can set innodb_flush_log_at_trx_commit to 1 to enable redo log persistence. binlog Binlog is a log at the server layer, mainly used for archiving, and plays a role in backup, master-slave synchronization, and data recovery. Common log formats include row, mixed, and statement. For specific usage, please refer to Binlog Recovery Log. You can enable binlog writing to disk by setting sync_binlog=1. Here we distinguish between binlog and redo:
Two-phase commit The following is the internal process of the executor and InnoDB when executing Update: Take the update T set c=c+1 where ID=2; statement as an example:
Light colors are executed by the executor, and dark colors are executed by the engine. After updating the memory, writing the redo log is split into two steps: prepare and commit, which is often called a two-phase commit. Used to ensure data consistency when unexpected situations occur. Let's assume here that what would happen if two-phase commit was not used?
Let's analyze the two-phase submission process: 1. The process crashes during the redo log prepare phase, at time A. After restarting, it was found that the redo log was not written, so the transaction was rolled back. 2. If the system crashes while writing binlog, after restart, it is found that binlog has not been written, so roll back the operation. 3. Binlog is written, but a crash occurs when submitting the redo log commit status
Complete, commit the transaction. Incomplete, roll back the transaction. How to determine whether the binlog is complete?
How to associate redo log and binlog to represent the same operation? There is a common data field in the structure, XID. During crash recovery, the redo log is scanned sequentially:
After the data is written, is there any relationship between the final disk location and the redo log?
What is the relationship between redo log buffer and redo log? During the update process of a transaction, there are multiple SQL statements, so multiple logs need to be written. Summarize At the beginning of the article, it is explained that the overall architecture of MySQL is divided into the server layer and the engine layer, and the execution process of a statement is briefly described. Then, MySQL chose InnoDB as the default engine after 5.5 because it has more transaction and crash-safe capabilities than the native MyISAM. Crash-safe is achieved by redo log. Similar to redo log, there is binlog, which is the log of the Server engine and is used for archiving and backing up data. Finally, it is mentioned that in order to ensure data consistency, the redo log and binlog are put into the same transaction, which is often referred to as the two-phase commit operation. The above is the detailed introduction of MySQL overall architecture. For more information about MySQL overall architecture, please pay attention to other related articles on 123WORDPRESS.COM! You may also be interested in:
|
<<: Implementation of Docker deployment of MySQL cluster
>>: WeChat applet realizes left-right linkage
Today I saw a case study on MySQL IN subquery opt...
Install zip decompression function under Linux Th...
Table of contents 1. Project folder structure 1. ...
This article mainly summarizes various problems o...
Import and export of Docker images This article i...
I started working on my final project today, but ...
This article mainly introduces three methods of i...
Table of contents 1. Introduction to teleport 1.1...
Operating system: Ubuntu 17.04 64-bit MySQL versi...
Environmental Description Server system: Ubuntu 1...
In actual work, JavaScript regular expressions ar...
We often use click events in the a tag: 1. a href=...
This article mainly introduces how to use the Rea...
Nginx hides version number In a production enviro...
HTTP Status Codes The status code is composed of ...