Question: As we all know, binlog can be used for archiving and master-slave synchronization, but what is its content? Why can the standby database be consistent with the primary database after executing binlog? The basic principle of MySQL master-slaveFigure 1 MySQL master-slave switch process In state 1, the client's read and write operations directly access node A, and node B is the backup database of A. It only synchronizes all updates of A and executes them locally. This keeps the data on nodes B and A the same.
I have set the backup database to read-only. How can I keep it updated synchronously with the primary database? Figure 2 Active/standby flow chart After receiving the update request from the client, the master database executes the update logic of the internal transaction and writes the binlog at the same time. A long connection is maintained between the standby database B and the primary database A. There is a thread inside the master database A, which is dedicated to serving the long connection of the standby database B. The complete process of a transaction log synchronization is as follows:
Later, with the introduction of multi-threaded replication solutions, sql_thread evolved into multiple threads . Comparison of three binlog formatsBinlog has two formats, one is statement and the other is row. You may also see a third format called mixed in other materials. In fact, it is a mixture of the first two formats. mysql> CREATE TABLE t ( id int(11) NOT NULL, a int(11) DEFAULT NULL, t_modified timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP, PRIMARY KEY (id), KEY a (a), KEY t_modified(t_modified) )ENGINE=InnoDB; insert into t values(1,1,'2018-11-13'); insert into t values(2,2,'2018-11-12'); insert into t values(3,3,'2018-11-11'); insert into t values(4,4,'2018-11-10'); insert into t values(5,5,'2018-11-09'); If you want to delete a row of data in the table, let's see how the binlog of this delete statement is recorded. mysql> delete from t /comment/ where a>=4 and t_modified<='2018-11-10' limit 1; When binlog_format=statement, the original text of the SQL statement is recorded in binlog . You can use the mysql> show binlog events in 'master.000001'; command to view the contents of the binlog. Figure 3. Example of statement format binlog Now, let's look at the output in Figure 3.
To illustrate the difference between statement and row formats, let's look at the execution effect diagram of this delete command: Figure 4 delete execution warnings Running this delete command generates a warning because the current binlog is set to statement format and there is a limit in the statement , so this command may be unsafe. Why do I say so? This is because the delete command has a limit, which may cause inconsistency between the primary and standby data . If the delete statement uses index a, the first row that meets the condition will be found based on index a, that is, the row with a=4 will be deleted; Because the original text of the statement is recorded in the binlog in the statement format, the following situation may occur: when the SQL statement is executed on the primary database, index a is used; but when the SQL statement is executed on the standby database, index t_modified is used. Therefore, MySQL considers it risky to write like this . If I change the binlog format to binlog_format='row', will this problem go away? Figure 5 Example of row format binlog Compared with the statement format binlog, the BEGIN and COMMIT before and after are the same. However, the original text of the SQL statement is no longer in the row-format binlog. Instead, it is replaced by two events: Table_map and Delete_rows.
With the help of mysqlbinlog tool, use the following command to parse and view the contents of binlog. The binlog of this transaction starts at position 8900, so the start-position parameter can be used to specify that the log parsing starts from this position. mysqlbinlog -vv data/master.000001 --start-position=8900; Figure 6 Details of a row format binlog example From this figure, we can see the following information:
The last Xid event is used to indicate that the transaction was successfully committed. When binlog_format uses the row format, the primary key ID of the actual deleted row is recorded in the binlog. In this way, when the binlog is transferred to the slave database, the row with id=4 will definitely be deleted, and there will be no problem of the master and slave deleting different rows . Why is there a mixed format binlog?Because some statement-format binlogs may cause inconsistencies between the master and slave, the row format should be used . But the disadvantage of the row format is that it takes up a lot of space . For example, if you use a delete statement to delete 100,000 rows of data , a SQL statement will be recorded in the binlog, taking up dozens of bytes of space. But if you use binlog in row format, you need to write all 100,000 records into binlog. Doing so will not only take up more space, but also consume IO resources to write binlog, affecting the execution speed . Therefore, MySQL took a compromise solution, that is, there is a mixed format binlog. The mixed format means that MySQL will determine whether this SQL statement may cause inconsistency between the master and the slave. If possible, it will use the row format; otherwise, it will use the statement format . In other words, the mixed format can take advantage of the advantages of the statment format while avoiding the risk of data inconsistency. Now more and more scenarios require that the MySQL binlog format be set to row . There are many reasons to do this, here's one that's immediately obvious: data recovery . We will look at the data recovery issue from the perspective of three SQL statements: delete, insert, and update.
Data operation errors caused by delete, insert or update statements often require recovery to the state before the operation. MariaDB's Flashback tool rolls back data based on the principles described above. mysql> insert into t values(10,10,now()); If you set the binlog format to mixed, do you think MySQL will record it in row format or statement format? Figure 7 mixed format and now() MySQL actually uses the statement format. Next, let's use the mysqlbinlog tool to take a look: Figure 8 TIMESTAMP command It turns out that when binlog records the event, it records an extra command: SET TIMESTAMP=1546103491. It uses the SET TIMESTAMP command to agree on the return time of the subsequent now() function . Through this SET TIMESTAMP command, MySQL ensures the consistency of the primary and standby data. When replaying binlog data , this is done: use mysqlbinlog to parse the log, and then directly copy the statements in it and execute them . Therefore, the standard way to use binlog to recover data is to parse it using the mysqlbinlog tool and then send the entire parsed result to MySQL for execution . A command similar to the following: mysqlbinlog master.000001 --start-position=2738 --stop-position=2942 | mysql -h127.0.0.1 -P13000 -u$user -p$pwd; This command means to parse out the content from byte 2738 to byte 2942 in the master.000001 file and put it into MySQL for execution. Circular replication problem The binlog feature ensures that executing the same binlog on the slave database can achieve the same status as the master database. We can assume that under normal circumstances the data of the primary and backup devices are consistent . That is to say, the contents of nodes A and B in Figure 1 are consistent. In fact, what I drew in Figure 1 is the MS structure, but the dual-M structure is more commonly used in actual production , which is the active-standby switching process shown in Figure 9. Figure 9 MySQL master-slave switch process - dual M structure Nodes A and B are always in a master-slave relationship with each other. This way, there is no need to modify the master-slave relationship during switching. However, the double-M structure still has a problem to be solved . The business logic updates a statement on node A, and then sends the generated binlog to node B. Node B also generates a binlog after executing the update statement. (I suggest you set the parameter log_slave_updates to on, which means that the standby database generates binlog after executing the relay log). Then, if node A is also the backup database of node B, it is equivalent to taking the newly generated binlog of node B and executing it again. Then, this update statement will be executed repeatedly between nodes A and B, which is circular replication. How to solve this problem ? As can be seen in Figure 6 above, MySQL records the server id of the instance where this command was first executed in the binlog . Therefore, we can use the following logic to solve the problem of circular replication between two nodes:
According to this logic, if we set up a double M structure, the execution flow of the log will become like this:
Summarize:This concludes this article on how MySQL ensures consistency between the primary and backup servers. For more information on how MySQL ensures consistency between the primary and backup servers, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope you will support 123WORDPRESS.COM in the future! You may also be interested in:
|
<<: Pure CSS to adjust Div height according to adaptive width (percentage)
>>: N ways to vertically center a div with single/multi-line text (unknown height/fixed height)
MultiTail is a software used to monitor multiple ...
1. What problems did we encounter? In standard SQ...
Table of contents introduction MySQL High Availab...
1. Create a new configuration file docker_nginx.c...
Each time tomcat is started, the following log fi...
Table of contents How to deploy MySQL service usi...
The MERGE storage engine treats a group of MyISAM...
9 great JavaScript framework scripts for drawing ...
Preface We all know that the import and export of...
Preface: Jenkins' Master-Slave distributed ar...
Table of contents 1. Source code 1.1 Monorepo 1.2...
When installing nginx, mysql, tomcat and other se...
background Today, I was browsing CodePen and saw ...
Preface: I used the official nginx proxy_cache as...
In Linux systems, especially server systems, it i...