Briefly describe mysql monitoring group replication

Briefly describe mysql monitoring group replication

Original text: https://dev.mysql.com/doc/refman/8.0/en/group-replication-monitoring.html
Translator: kun
I am currently translating the official MySQL 8.0 documentation and this article is Section 18.3 "Monitoring Group Replication".

1. Monitoring Group Replication

Assuming MySQL has been compiled with the Performance Schema enabled, use the Perfomance Schema tables to monitor Group Replication. Group Replication adds the following tables:

  • performance_schema.replication_group_member_stats
  • performance_schema.replication_group_members

These existing Performance Schema replication tables also display information about Group Replication:

  • performance_schema.replication_connection_status displays information about Group Replication, for example, transactions that have been received from the group and queued in the applier queue (relay log).
  • performance_schema.replication_applier_status shows the status of channels and threads related to Group Replication. If there are many different worker threads applying transactions, then this table can also be used to monitor what each worker thread is doing.

The replication channels created by the Group Replication plugin are named:

  • group_replication_recovery - This channel is used for replication changes related to the distributed recovery phase.
  • group_replication_applier - This channel is used for incoming changes from the group. And the channel to which transactions directly from the group are applied.

The following sections describe the information available in each table.

2. Group member instance status

Server instances in a group can be in various states. If the servers are communicating normally, all servers will report the same status. However, if there is a network partition, or a group member leaves the group, different information may be reported depending on which server is queried. Note that if a group member has left the group, then it obviously cannot report the latest information about the status of other servers. If a network partition occurs, if the number of servers exceeding the arbitration number are disconnected, the servers will not be able to cooperate with each other. Therefore, they cannot know the status of different server members. Therefore, they will report that some servers are unreachable, rather than guessing about their status.

Server State

Field describe Group Synchronization
ONLINE The member can act as a group member with full capabilities, which means that clients can connect and start executing transactions. yes
RECOVERING The member is becoming a valid member of the group and is in the process of recovery, receiving status information from the data source node (data source node). no
OFFLINE The plugin is loaded, but the member does not belong to any group. no
ERROR The status of the local member. The server enters this state whenever there is an error during the recovery phase or when applying changes. no
UNREACHABLE Whenever the local failure detector suspects that a given server may be unreachable, either because it has crashed or has been accidentally disconnected, the server status is displayed as "UNREACHABLE". no

Important
Once the instance enters the ERROR state, the super_read_only option will be set to ON. To leave the ERROR state, you must manually configure the instance super_read_only=OFF

It is important to note that Group Replication is not synchronous replication, but it is eventually synchronous. More precisely, transactions are delivered to all group members in the same order, but their execution is asynchronous, meaning that after accepting a transaction to be committed, each member commits at its own pace.

3. replication_group_members table

The performance_schema.replication_group_members table is used to monitor the status of the different server instances that are members of a group. The table replication_group_members is updated whenever the view changes, for example, when the configuration of the group changes dynamically. On this basis, server members exchange some of their metadata to stay synchronized and continue to collaborate. Information is shared between Group Replication members, so information about all group members can be queried from any member. This table can be used to get a high-level view of the state of the replication group, for example by issuing:

SELECT * FROM performance_schema.replication_group_members;+---------------------------+--------------------------------------+--------------+-------------+--------------+-------------+----------------+| CHANNEL_NAME | MEMBER_ID | MEMBER_HOST | MEMBER_PORT | MEMBER_STATE | MEMBER_ROLE | MEMBER_VERSION |+---------------------------+--------------------------------------+--------------+-------------+--------------+-------------+----------------+| group_replication_applier | 041f26d8-f3f3-11e8-adff-080027337932 | example1 | 3306 | ONLINE | SECONDARY | 8.0.13 || group_replication_applier | f60a3e10-f3f2-11e8-8258-080027337932 | example2 | 3306 | ONLINE | PRIMARY | 8.0.13 || group_replication_applier | fc890014-f3f2-11e8-a9fd-080027337932 | example3 | 3306 | ONLINE | SECONDARY | 8.0.13 |+---------------------------+--------------------------------------+--------------+-------------+--------------+-------------+----------------+

Based on this result, we can see that the group consists of three members, the host and port number of each member, which the client uses to connect to the member, and the member's server_uuid . The MEMBER_STATE column shows one of the Group Member Instance States, in this case it shows that all three members of the group are ONLINE , and the MEMBER_ROLE column shows that there are two slaves and one master. Therefore, the group must be running in single-primary mode. The MEMBER_VERSION column can be useful when you are upgrading a group and the members in the group are running different MySQL versions.

4. Replication_group_member_stats

Each member in the replication group validates and applies transactions committed by the group. Statistics about validation and application are very useful to understand how the application queue is growing, how many conflicts are triggered, how many transactions are checked, which transactions have been committed by all members, and so on.

The performance_schema.replication_group_member_stats table provides group-level information related to the authentication process, as well as statistics on transactions received and initiated by each member of the replication group. Information is shared between group member instances, so information about all group members can be queried from any member. Note that refreshing of statistics on remote members is controlled by the message period specified in the group_replication_flow_control_period option, so these may differ slightly from the locally collected statistics of the member making the query.

Table replication_group_member_stats

field describe
CHANNEL_NAME The name of the Group Replication channel.
VIEW_ID The current view identifier for this group.
Member_id This value is the UUID of the server member we are currently connected to. Each member of the group has a different value. Because it is unique to each member, it also becomes a keyword.
Count_transactions_in_queue The number of transactions in the queue waiting to be checked for conflict detection. After the conflict check passes, they are queued for application.
Count_transactions_checked Indicates the number of transactions that have been checked for conflicts.
Count_conflicts_detected Represents the number of transactions that failed the conflict detection check.
Count_transactions_rows_validating Indicates the current size of the conflict detection database (the database against which each transaction is validated).
Transactions_committed_all_members Indicates a transaction that has been successfully committed on all members of the current view. This value is updated at regular intervals.
Last_conflict_free_transaction Displays the last transaction identifier that was checked to be conflict-free.
Count_transactions_remote_in_applier_queue The number of transactions received by this member from the replication group that are waiting to be applied.
Count_transactions_remote_applied The number of transactions received by this member from the replication group that have been applied.
Count_transactions_local_proposed The number of transactions originated by this member and sent to the replication group for coordination.
Count_transactions_local_rollback The number of transactions initiated by this member that were rolled back after being sent to the replication group.

These fields are important for monitoring the performance of members in a group. For example, suppose one of the members of a group is experiencing delays and cannot synchronize with the rest of the group. In this case, you might see a large number of transactions in the queue. Based on this information, you can decide to remove members from the group or to delay transaction processing for other members in the group, thereby reducing the number of queued transactions. This information can also help you decide how to tune the Group Replication plugin's flow control.

The above is a brief description of the details of MySQL monitoring group replication. For more information about MySQL monitoring group replication, please pay attention to other related articles on 123WORDPRESS.COM!

You may also be interested in:
  • Detailed explanation of MySQL monitoring tool mysql-monitor
  • Detailed explanation of non-primary key column overflow monitoring in MySQL tables
  • Use Grafana+Prometheus to monitor MySQL service performance
  • Detailed tutorial on how to monitor Nginx/Tomcat/MySQL using Zabbix
  • MySQL index usage monitoring skills (worth collecting!)
  • Detailed explanation of how to monitor MySQL statements
  • Detailed explanation of how Zabbix monitors the master-slave status of MySQL
  • MySQL database monitoring software lepus usage problems and solutions
  • Detailed explanation of performance monitoring of MySQL server using Prometheus and Grafana
  • Introduction to the use of MySQL real-time monitoring tool orztop
  • Summary of the actual simulation monitoring MySQL service shell script
  • Basic tutorial on installing and configuring Zabbix to monitor MySQL

<<:  Tomcat reports an error when starting the springboot project war package: Error when starting the child

>>:  JQuery implements hiding and displaying animation effects

Recommend

HTML page jump passing parameter problem

The effect is as follows: a page After clicking t...

MySQL loop inserts tens of millions of data

1. Create a test table CREATE TABLE `mysql_genara...

Detailed introduction to JS basic concepts

Table of contents 1. Characteristics of JS 1.1 Mu...

10 Best Practices for Building and Maintaining Large-Scale Vue.js Projects

Table of contents 1. Use slots to make components...

Solution to MySQL failure to start

Solution to MySQL failure to start MySQL cannot s...

Docker runs operations with specified memory

as follows: -m, --memory Memory limit, the format...

JS realizes video barrage effect

Use ES6 modular development and observer mode to ...

Detailed explanation of the use of Teleport in Vue3

Table of contents Purpose of Teleport How Telepor...

Vue uses echart to customize labels and colors

This article example shares the specific code of ...

How to remove spaces or specified characters in a string in Shell

There are many methods on the Internet that, alth...

Detailed explanation of the two modes of Router routing in Vue: hash and history

hash mode (default) Working principle: Monitor th...

How to add Vite support to old Vue projects

1. Introduction I have taken over a project of th...