Imagine a scenario where, when designing a user table, each person's ID number is unique and needs to be searched. However, since the ID number field is large, it is not suitable to be used as the primary key. When the business code has guaranteed that the inserted ID card is unique, you can choose to create a unique index and a normal index. How should you choose? Next, we will analyze the query and update execution process. Query process Assume that k is the index on table t. When searching for For a normal index, after finding the record with k=5, it will continue to search one more time until it encounters the first record that is not 5. For a unique index, since the value is unique, stop searching after finding it. Because InnoDB reads and writes in units of data pages (data pages are 16 KB by default), when a piece of data is read, the entire data page is read into the memory as a whole . In the data page read into the memory, if it contains records with k=5, in the case of query, the unique index has one more search and judgment process than the ordinary index, which can be ignored. If k=5 is the last entry in the current data page, the next data page needs to be read. But the probability of this happening is low and can be ignored. So in general, there is not much difference between ordinary indexes and unique indexes during the query process. change buffer Before analyzing the impact of unique indexes and common indexes, let's first understand the change buffer structure. What is a change buffer? When performing an update operation, if the data page to be updated is in memory, it will be updated directly. Otherwise, without affecting data consistency, InnoDB will cache the update operation in the change buffer, thereby eliminating the process of reading the data page from disk. When the next query operation reads the data page that needs to be updated, the update statement in the change buffer will be executed and written to the data page. The process of applying operations to the hard disk is called merge. The background thread will merge regularly or when the database is closed normally, a merge operation will also be performed. The execution process of merge is as follows:
The change buffer is actually data that can be persisted to the hard disk, which means that the change buffer exists both in memory and on the hard disk. The change buffer was previously called the insert buffer. Initially, only the insert buffer was optimized, but later support for delete and update was added, and the name was changed to change buffer. It can be seen that recording the update operation in the change buffer first reduces the process of reading disk data pages into memory, and the execution speed of the statement will be significantly improved. At the same time, reading data into memory will occupy the buffer pool memory, so reducing read operations also improves memory utilization.
You can use innodb_change_buffer_max_size to set the size of the buffer pool occupied by the change buffer. Change buffer application scenarios? As mentioned above, the change buffer saves update records in advance, reducing the process of reading data pages and thus improving performance. In other words, if the change buffer contains more update records for different data pages, the benefit will be greater. Therefore, for businesses with more writes and less reads (immediate query after update), the change buffer plays a greater role . Such as common billing and logging systems. If the business is to query immediately after the update, although the update record can be placed in the change buffer, the merge process will be triggered immediately because the data page needs to be queried immediately afterwards. This will not reduce the number of random access IOs, but will increase the maintenance cost of the change buffer, which will have the opposite effect. Update Process For unique indexes, all update operations need to determine whether they violate the uniqueness constraint. Therefore, the required data pages must be read into memory and then updated directly without using the change buffer. Therefore, the change buffer is only useful for ordinary indexes. For a specific analysis, insert a new record into a table: If the data page to be updated by the new record is in memory: For a unique index, find the appropriate position, determine if there is any conflict, insert the value, and the statement ends. For a normal index: find the position, insert the value, and the statement ends. Therefore, when the data page is in memory, the only difference between a unique index and a normal index is a judgment process. Can be ignored. If the data page to be updated by the new record is not in memory: For a unique index, the data page is read into memory, conflicts are determined, the data is inserted, and the statement ends. For common indexes, the statement is recorded in the change buffer and the statement ends. Since the random IO access from disk to memory is involved, it is one of the most expensive operations in the database. Ordinary indexes reduce read operations compared to unique indexes, which can significantly improve performance. Choice of unique or normal index By comparing the two in terms of query and update. We know that during the query process, except for extremely special circumstances, the difference between the two is actually not that big. The main difference is the case when during an update, the data page to be updated is not in the content. At this time, the unique index cannot use the change buffer because it needs a uniqueness check. There is an additional process of reading data from the disk to the content, which involves random IO access and is relatively inefficient. Therefore, if the business needs to update good performance , you can choose a normal index. Of course, everything is based on the premise of ensuring data accuracy. If an update is followed by a query, you can consider turning off the change buffer. In other cases, the change buffer can provide a significant improvement.
Comparison of redo log and change buffer The emergence of redo log in InnoDB makes it crash-safe and improves efficiency by writing logs first and then writing to disk through WAL. The change buffer saves the random IO process of reading data pages from disk to memory. Let's analyze the relationship between the two through an insert statement: mysql> insert into t(id,k) values(id1,k1),(id2,k2); Assume k is a normal index, the data page inserted by k1 is in memory, but k2 is not. When performing an insert operation, the following four parts are mainly involved: InnoDB buffer pool: memory area redo log: log system table space (ibdata1): system table space data(t.idb): data table space
The execution process is as follows:
It can be seen that the execution cost of this update statement (including insert, delete, and update operations) is very low, with two writes to memory and one sequential write to disk . The operations marked with dotted lines are background operations and do not affect the response time. Let’s look at another query statement: select * from t where k in (k1, k2) Assume that the read statement occurs shortly after the update statement and the data in memory is still there, then the read operation has nothing to do with the system tablespace and redo log. Execution process:
To summarize the relationship between redo log and change buffer: Storage location: The change buffer is also persisted on the hard disk, but is saved in the system tablespace ibdata1. The redo log is a separate file. Record content: The change buffer records the content of the update operation, while the redo log records the modification of ordinary data pages and the changes in the change buffer. Disk synchronization process: Synchronizing the changes to the data pages in memory is done through a merge operation, not based on the redo log. From the update process point of view: redo log converts random disk write IO into sequential write, while change buffer saves random disk read IO consumption. If the server loses power unexpectedly, will the change buffer be lost? No, because the data in the change buffer has been recorded in the redo log, so it will not be lost. Because part of the change buffer data is on disk and part is in memory. The data on disk has been merged so it will not be lost.
References Buffer Pool The above is which one to choose between MySQL unique index and normal index? For more details about MySQL unique index and common index, please pay attention to other related articles on 123WORDPRESS.COM! You may also be interested in:
|
<<: Apply provide and inject to refresh Vue page method
>>: Detailed explanation of Vue login and logout
Beginners can learn HTML by understanding some HT...
Table of contents Preface - Vue Routing 1. The mo...
The difference between replace into and insert in...
In combination with the scenario in this article,...
Table of contents 1. Brief Overview 2. Detailed e...
In actual projects, the up and down scroll bars a...
cause The way to import external files into a min...
Overview Operations on any one database are autom...
Get the mongo image sudo docker pull mongo Run th...
This article shares with you the graphic tutorial...
Table of contents Preliminary preparation Deploym...
Due to the default bridge network, the IP address...
Preview of revised version This article was writt...
In a recent project, I wanted to align text verti...
1. Introduction to TypeScript The previous articl...