1 Master-slave read-write separationMost Internet businesses are more read than write, so the priority is to consider how the DB can support a higher number of queries. First, it is necessary to distinguish between read and write traffic, which makes it convenient to expand the read traffic separately, that is, master-slave read and write separation. If a sudden increase in front-end traffic causes the slave database to be overloaded, the DBA will prioritize expanding the capacity of the slave database. This way, the read traffic to the DB will fall to multiple slave databases, reducing the load on each slave database. The developer will then try their best to block the traffic at the DB layer. Cache VS MySQL read-write separation Due to the difficulty of development and maintenance, the introduction of cache will introduce complexity. Issues such as cache data consistency, penetration, and anti-avalanche need to be considered, and one more type of component needs to be maintained. Therefore, it is recommended to use read-write separation first, and then use Cache when it can't be handled. 1.1 coreMaster-slave read-write separation generally copies the data of a DB into one or more copies and writes them to other DB servers:
Therefore, the key to master-slave read-write separation is:
Master-slave replication
Let developers feel like they are still using a single DB 2 Master-Slave ReplicationMySQL master-slave replication relies on binlog, which records all changes on MySQL and saves them on disk in binary form in binary log files. Master-slave replication is to transfer the data in the binlog from the master database to the slave database, usually asynchronously: the master database operation will not wait for the binlog synchronization to complete. 2.1 Master-slave replication process
Using an independent log dump thread is asynchronous to avoid affecting the main update process of the master database. After receiving the information, the slave database does not write it to the slave database storage, but writes it to a relay log. This is to avoid writing to the actual storage of the slave database, which is more time-consuming and ultimately causes a longer delay between the slave database and the master database.
For performance reasons, the master database write process does not wait for the master-slave synchronization to complete before returning the result. In extreme cases, for example, the disk is damaged or the machine loses power before the binlog on the master database is written to the disk, resulting in binlog loss and inconsistent master-slave data. But the probability is very low and tolerable. After the master database goes down, the inconsistency of master-slave data caused by the loss of binlog can only be restored manually. After master-slave replication, you can:
In this way, even if the write request locks the table or record, it will not affect the execution of the read request. Under high concurrency, multiple slaves can be deployed to share the read traffic, that is, one master and multiple slaves support high concurrency reading. The slave database can also be used as a backup database to avoid data loss due to failure of the master database. Can increasing the number of slaves without limit support higher concurrency? 2.2 Side Effects of Master-Slave ReplicationFor example, the operation of posting to Moments contains data:
If update DB
For example, synchronizing the Moments content to the review system Therefore, after updating the main database, the friend circle ID will be written into MQ, and the Consumer will obtain the friend circle information from the database based on the ID and then send it to the review system. Schematic diagram of the impact of master-slave delay on business 2.3 Avoiding Delays in Master-Slave ReplicationWhat should I do? In fact, there are many solutions, and the core idea is to try not to query data from the database. Therefore, for the above case, there are the following solutions: 2.3.1 Data RedundancyWhen sending MQ, you can send not only the friend circle ID, but all the friend circle information required by the Consumer, avoiding re-querying data from the DB. This solution is recommended because it is simple enough, but it may cause a single message to be larger, thereby increasing the bandwidth and time for message sending. 2.3.2 Using CacheWhile writing to the DB synchronously, write the Moments data to the cache. This way, when the Consumer obtains Moments information, it will first query the cache, which can also ensure data consistency. This solution is suitable for scenarios where new data is added. If you are updating data, updating the cache first may cause data inconsistency. For example, two threads update data at the same time:
The final DB value (1) and Cache value (2) are inconsistent! 2.3.3 Query the main databaseInstead of querying the slave database in the Consumer, you can query the master database. Be cautious when using it. Make sure that the query volume is not large and is within the tolerance of the main database. Otherwise, it will cause great pressure on the main database. Do not use this solution unless it is absolutely necessary. Because an interface for querying the main database is provided, it is difficult to ensure that others do not abuse this method. The master-slave synchronization delay is also easy to overlook when troubleshooting. How to judge the master-slave delay time warning through which indicator in which database? In the slave library, monitor show slave 3 How to access the DBUsing master-slave replication to copy data to multiple nodes also realizes the read-write separation of DB. At this time, the use of DB also changes:
In order to reduce the complexity of implementation, many DB middlewares have emerged in the industry to solve DB access problems, which can be roughly divided into: 3.1 Inside the application For example, TDDL (Taobao Distributed Data Layer) runs embedded in the application in the form of code. It can be regarded as a data source agent. Its configuration manages multiple data sources. Each data source corresponds to a DB, which may be a master database or a slave database. advantageIt is easy to use and has low deployment cost. Because it is embedded inside the application and runs with the program, it is suitable for small teams with weak operation and maintenance capabilities. shortcomingLack of multi-language support, all developed in Java, unable to support other languages. Version upgrades also rely on updates from the user. 3.2 Independently deployed proxy layer solutionSuch as Mycat, Atlas, and DBProxy. This type of middleware is deployed on an independent server. The business code is like using a single DB. In fact, it manages many data sources internally. When there is a DB request, it will make necessary changes to the SQL statement and then send it to the specified data source. advantage
shortcoming
4 ConclusionMaster-slave replication can be extended to a technology that replicates storage data between storage nodes, which can achieve data redundancy to achieve backup and improve horizontal expansion capabilities. When using master-slave replication, consider:
If all slave nodes are guaranteed to be written successfully, the write performance will definitely be affected; if only writing to the master node returns success, data synchronization failure may occur on the slave node, resulting in inconsistency between the master and the slave. Internet projects generally prioritize performance over strong data consistency
This will cause many strange problems where data cannot be read. Many real cases:
Different components have different requirements for replication consistency and latency, and adopt different solutions, but the design ideas are the same. FAQIf there are a large number of orders, hashing them to different databases via userId is beneficial for front-end user order queries, but the back-end system page needs to view all orders and sort them, so SQL execution is very slow. What should I do about this? Since the backend system cannot directly query the data in the sub-databases and sub-tables, you can consider synchronizing the data to a separate backend database or to ES. This concludes this article on how MySQL supports billions of traffic. For more information about MySQL billions of traffic, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope you will support 123WORDPRESS.COM in the future! You may also be interested in:
|
<<: Various front-end printing methods of web: CSS controls web page printing style
>>: Differences in the hr separator between browsers
One-click execution To install Python 3.8 in a vi...
Table of contents 1. Closure 2. Closure usage sce...
Nginx can not only hide version information, but ...
Preface The optional chaining operator (?.) allow...
As shown in the figure: Check port usage: sudo ne...
1. Development environment vue 2. Computer system...
1. Add the viewport tag to the HTML header. At th...
1. Introduction Git is a free, open source distri...
Install vsftpd $ sudo apt-get install vsftpd -y S...
Mac latest version of MySQL 8.0.22 password recov...
webkit scrollbar style reset 1. The scrollbar con...
Table of contents What is Docker deploy 1. Pull t...
1.1. Download: Download the zip package from the ...
1. How to install? 1. [Run] -> [cmd] to open t...
Table of contents 1. Clever use of indexes and va...