Detailed deployment steps for MySQL MHA high availability configuration and failover

Detailed deployment steps for MySQL MHA high availability configuration and failover

1. Introduction to MHA

1. What is MHA?

MHA (Master High Availability) is an excellent software for failover and master-slave replication in a MySQL high availability environment.
The emergence of MHA is to solve the problem of MySQL single point.
During the MySQL failover process, MHA can automatically complete the failover operation within 0-30 seconds.
MHA can ensure data consistency to the greatest extent during the failover process to achieve true high availability.

2. Composition of MHA

MHA Node (Data Node)
MHA Node runs on each MySQL server.

MHA Manager (Management Node)
MHA Manager can be deployed on a separate machine to manage multiple master-slave clusters; it can also be deployed on a slave node.
MHA Manager will periodically detect the master nodes in the cluster. When the master fails, it can automatically promote the slave with the latest data to the new master, and then redirect all other slaves to the new master. The entire failover process is completely transparent to the application.

3. Characteristics of MHA

  • During the automatic failover process, MHA attempts to save the binary log from the downed primary server to ensure that data is not lost to the greatest extent possible.
  • Using semi-synchronous replication can greatly reduce the risk of data loss. If only one slave has received the latest binary log, MHA can apply the latest binary log to all other slave servers, thus ensuring data consistency on all nodes.
  • Currently, MHA supports a one-master-multiple-slave architecture, with a minimum of three servers, i.e. one master and two slaves.

insert image description here

2. Build MySQL MHA

1. Experimental ideas:

1. MHA Architecture

1) Database installation
2) One master and two slaves
3) MHA construction

2. Fault simulation

1) The main database fails
2) The alternate master database becomes the master database
3) The original failed master database recovers and rejoins MHA to become a slave database

(II) Experimental steps

MHA manager node server: CentOS7.4 (64 bit) manager/192.168.126.10, install MHA node and manager components Master node server: CentOS7.4 (64 bit) mysql1/192.168.126.20, install mysql5.7, MHA node components Slave1 node server: CentOS7.4 (64 bit) mysql2/192.168.126.30, install mysql5.7, MHA node components Slave2 node server: CentOS7.4 (64 bit) mysql3/192.168.126.40, install mysql5.7, MHA node components

Disable firewall on each machine

systemctl stop firewalld
systemctl disable firewalld
setenforce 0

1. Install mysql15.7

Install mysql5.7 on the Master, Slave1, and Slave2 nodes (For details on mysql installation, see the previous blog post)

2. Modify the host names of the Master, Slave1, and Slave2 nodes

hostnamectl set-hostname Mysql1
hostnamectl set-hostname Mysql2
hostnamectl set-hostname Mysql3

insert image description here

3. Modify the MySQL main configuration file /etc/my.cnf of the Master, Slave1, and Slave2 nodes
##Master Node##

vim /etc/my.cnf
[mysqld]
server-id = 1
log_bin = master-bin
log-slave-updates = true

systemctl restart mysqld

insert image description here

##Slave1, Slave2 nodes##

vim /etc/my.cnf
server-id = 2 #The server-ids of the three servers cannot be the same log_bin = master-bin
relay-log = relay-log-bin
relay-log-index = slave-relay-bin.index

systemctl restart mysqld

insert image description here

4. Create two soft links on the Master, Slave1, and Slave2 nodes

ln -s /usr/local/mysql/bin/mysql /usr/sbin/
ln -s /usr/local/mysql/bin/mysqlbinlog /usr/sbin/

5. Configure MySQL with one master and two slaves (1) All database nodes are authorized by mysql

mysql -uroot -p
grant replication slave on *.* to 'myslave'@'192.168.126.%' identified by '123'; #Slave database synchronization use grant all privileges on *.* to 'mha'@'192.168.126.%' identified by 'manager'; #manager use grant all privileges on *.* to 'mha'@'Mysql1' identified by 'manager'; #Prevent the slave library from connecting to the master library through the host name grant all privileges on *.* to 'mha'@'Mysql2' identified by 'manager';
grant all privileges on *.* to 'mha'@'Mysql3' identified by 'manager';
flush privileges;

insert image description here

(2) View binary files and synchronization points on the Master node
show master status;

insert image description here

(3) Perform synchronization operations on Slave1 and Slave2 nodes

change master to master_host='192.168.126.20',master_user='myslave',master_password='123',master_log_file='master-bin.000001',master_log_pos=1747; 

start slave;

(4) Check the data synchronization results on Slave1 and Slave2 nodes

show slave status\G		
// Make sure both IO and SQL threads are Yes, indicating that the synchronization is normal.
Slave_IO_Running: Yes
Slave_SQL_Running: Yes

insert image description here

(5) Both slave libraries must be set to read-only mode:

set global read_only=1; 

insert image description here

6. Install MHA software (1) Install MHA dependent environment on all servers. First install epel source

yum install epel-release --nogpgcheck -y
yum install -y perl-DBD-MySQL \
perl-Config-Tiny \
perl-Log-Dispatch \
perl-Parallel-ForkManager \
perl-ExtUtils-CBuilder \
perl-ExtUtils-MakeMaker \
perl - CPAN

(2) Install the MHA software package. First The node component must be installed on all servers first. It is different for each operating system version. Here, CentOS7.4 must choose version 0.57.
The node component must be installed on all servers first, and the manager component must be installed on the MHA-manager node last, because the manager depends on the node component.

Installation Packages:

The node component must be installed on all servers

cd /opt
tar zxvf mha4mysql-node-0.57.tar.gz
cd mha4mysql-node-0.57
perl Makefile.PL
make && make install

(3) Install the manager component on the MHA manager node

cd /opt
tar zxvf mha4mysql-manager-0.57.tar.gz
cd mha4mysql-manager-0.57
perl Makefile.PL
make && make install

After the manager component is installed, several tools will be generated under /usr/local/bin, mainly including the following:

  • masterha_check_ssh Check the SSH configuration of MHA
  • masterha_check_repl Check MySQL replication status
  • masterha_manger starts the manager script
  • masterha_check_status checks the current MHA running status
  • masterha_master_monitor detects whether the master is down
  • masterha_master_switch controls failover (automatic or manual)
  • masterha_conf_host adds or deletes configured server information
  • masterha_stop shuts down the manager

insert image description here

#After the node component is installed, several scripts will be generated under /usr/local/bin (these tools are usually triggered by the MHAManager script and do not require manual operation). The main ones are as follows:
save_binary_logs saves and copies the master's binary logs
apply_diff_relay_logs identifies differential relay log events and applies the differential events to other slaves
filter_mysqlbinlog removes unnecessary ROLLBACK events (MHA no longer uses this tool)

purge_relay_logs Clear relay logs (does not block the SQL thread)

insert image description here

7. Configure passwordless authentication on all servers

(1) Configure password-free authentication for all database nodes on the manager node

ssh-keygen -t rsa #Press Enter all the way ssh-copy-id 192.168.126.20
ssh-copy-id 192.168.126.30
ssh-copy-id 192.168.126.40 

insert image description here

insert image description here

(2) Configure password-free authentication on mysql1 to the database nodes mysql2 and mysql3

ssh-keygen -t rsa
ssh-copy-id 192.168.126.30
ssh-copy-id 192.168.126.40

(3) Configure password-free authentication on mysql2 to the database nodes mysql1 and mysql3

ssh-keygen -t rsa
ssh-copy-id 192.168.126.20
ssh-copy-id 192.168.126.40

(4) Configure password-free authentication on mysql3 to the database nodes mysql1 and mysql2

ssh-keygen -t rsa
ssh-copy-id 192.168.126.20
ssh-copy-id 192.168.126.30

8. Configure MHA on the manager node

(1) Copy the relevant scripts to the /usr/local/bin directory on the manager node

cp -rp /opt/mha4mysql-manager-0.57/samples/scripts /usr/local/bin
//After copying, there will be four executable files ll /usr/local/bin/scripts/

master_ip_failover #Script for VIP management during automatic switching master_ip_online_change #Vip management during online switching power_manager #Script for shutting down the host after a failure occurs send_report #Script for sending an alarm after a failure switch 

insert image description here

(2) Copy the above VIP management script for automatic switching to the /usr/local/bin directory. Here, the master_ip_failover script is used to manage VIP and failover.

cp /usr/local/bin/scripts/master_ip_failover /usr/local/bin

(3) The modified contents are as follows: (Delete the original contents, directly copy and modify the VIP related parameters)

vim /usr/local/bin/master_ip_failover

#!/usr/bin/env perl
use strict;
use warnings FATAL => 'all';

use Getopt::Long;

my
$command, $ssh_user, $orig_master_host, $orig_master_ip,
$orig_master_port, $new_master_host, $new_master_ip, $new_master_port
);
###############################Add content section#########################################
my $vip = '192.168.126.200'; #Specify the address of vipmy $brdc = '192.168.126.255'; #Specify the broadcast address of vipmy $ifdev = 'ens33'; #Specify the network card bound to vipmy $key = '1'; #Specify the serial number of the virtual network card bound to vipmy $ssh_start_vip = "/sbin/ifconfig ens33:$key $vip"; #Represents that the value of this variable is ifconfig ens33:1 192.168.126.200
my $ssh_stop_vip = "/sbin/ifconfig ens33:$key down"; #This variable value represents ifconfig ens33:1 192.168.126.200 down
my $exit_code = 0; #Specify the exit status code as 0
#my $ssh_start_vip = "/usr/sbin/ip addr add $vip/24 brd $brdc dev $ifdev label $ifdev:$key;/usr/sbin/arping -q -A -c 1 -I $ifdev $vip;iptables -F;";
#my $ssh_stop_vip = "/usr/sbin/ip addr del $vip/24 dev $ifdev label $ifdev:$key";
##################################################################################
GetOptions(
'command=s' => \$command,
'ssh_user=s' => \$ssh_user,
'orig_master_host=s' => \$orig_master_host,
'orig_master_ip=s' => \$orig_master_ip,
'orig_master_port=i' => \$orig_master_port,
'new_master_host=s' => \$new_master_host,
'new_master_ip=s' => \$new_master_ip,
'new_master_port=i' => \$new_master_port,
);

exit &main();

sub main {

print "\n\nIN SCRIPT TEST====$ssh_stop_vip==$ssh_start_vip===\n\n";

if ( $command eq "stop" || $command eq "stopssh" ) {

my $exit_code = 1;
eval {
print "Disabling the VIP on old master: $orig_master_host \n";
&stop_vip();
$exit_code = 0;
};
if ($@) {
warn "Got Error: $@\n";
exit $exit_code;
}
exit $exit_code;
}
elsif ( $command eq "start" ) {

my $exit_code = 10;
eval {
print "Enabling the VIP - $vip on the new master - $new_master_host \n";
&start_vip();
$exit_code = 0;
};
if ($@) {
warn $@;
exit $exit_code;
}
exit $exit_code;
}
elsif ( $command eq "status" ) {
print "Checking the Status of the script.. OK \n";
exit 0;
}
else {
&usage();
exit 1;
}
}
sub start_vip() {
`ssh $ssh_user\@$new_master_host \" $ssh_start_vip \"`;
}

A simple system call that disables the VIP on the old_master

sub stop_vip() {
`ssh $ssh_user\@$orig_master_host \" $ssh_stop_vip \"`;
}

Sub usage {
print
"Usage: master_ip_failover --command=start|stop|stopssh|status --orig_master_host=host --orig_master_ip=ip --orig_master_port=port --new_master_host=host --new_master_ip=ip --new_master_port=port\n";
}

insert image description here

(4) Create the MHA software directory and copy the configuration file. Here, the app1.cnf configuration file is used to manage the MySQL node server.

mkdir /etc/masterha
cp /opt/mha4mysql-manager-0.57/samples/conf/app1.cnf /etc/masterha
vim /etc/masterha/app1.cnf #Delete the original content, directly copy and modify the IP address of the node server [server default]
manager_log=/var/log/masterha/app1/manager.log
manager_workdir=/var/log/masterha/app1
master_binlog_dir=/usr/local/mysql/data
master_ip_failover_script=/usr/local/bin/master_ip_failover
master_ip_online_change_script=/usr/local/bin/master_ip_online_change
password=manager
ping_interval=1
remote_workdir=/tmp
repl_password=123
repl_user=myslave
secondary_check_script=/usr/local/bin/masterha_secondary_check -s 192.168.126.30 -s 192.168.126.40
shutdown_script=""
ssh_user=root
user=mha

[server1]
hostname=192.168.126.20
port=3306

[server2]
candidate_master=1
check_repl_delay=0
hostname=192.168.126.30
port=3306

[server3]
hostname=192.168.126.40
port=3306
[server default]
manager_log=/var/log/masterha/app1/manager.log #manager log manager_workdir=/var/log/masterha/app1.log #manager working directory master_binlog_dir=/usr/local/mysql/data/ #The location where master saves binlog. The path here must be consistent with the path of binlog configured in master so that MHA can find it master_ip_failover_script=/usr/local/bin/master_ip_failover #Set the switching script for automatic failover, which is the script above master_ip_online_change_script=/usr/local/bin/master_ip_online_change #Set the switching script for manual switching password=manager #Set the password of the root user in mysql. This password is the password for creating the monitoring user in the previous article ping_interval=1 #Set the time interval for monitoring the main database and sending ping packets. The default is 3 seconds. Automatically failover after three attempts without response
remote_workdir=/tmp #Set the location where the remote MySQL binlog is saved when switching occurs repl_password=123 #Set the password of the replication user repl_user=myslave #Set the user of the replication user report_script=/usr/local/send_report #Set the script for sending alarms after switching secondary_check_script=/usr/local/bin/masterha_secondary_check -s 192.168.126.30 -s 192.168.126.40 #Specify the IP address of the slave server to be checked shutdown_script="" #Set the script to shut down the faulty host after a failure occurs (the main function of this script is to shut down the host to prevent brain split, which is not used here)
ssh_user=root #Set the ssh login username user=mha #Set monitoring user root

[server1]
hostname=192.168.126.20
port=3306

[server2]
hostname=192.168.126.30
port=3306
candidate_master=1
#Set to candidate master. After setting this parameter, the slave database will be promoted to the master database after the master-slave switch occurs, even if the master database is not the latest slave in the cluster

check_repl_delay=0
#By default, if a slave lags behind the master by more than 100M relay logs, MHA will not select the slave as a new master, because it takes a long time to recover the slave; by setting check_repl_delay=0, MHA triggers the switch to ignore the replication delay when selecting a new master. This parameter is very useful for hosts with candidate_master=1, because the candidate master must be the new master during the switch.

[server3]
hostname=192.168.126.40
port=3306

9. The first configuration requires manually enabling the virtual IP on the Master node

/sbin/ifconfig ens33:1 192.168.126.200/24 

insert image description here

10. Test the ssh passwordless authentication on the manager node. If it is normal, it will output successfully, as shown below.

masterha_check_ssh -conf=/etc/masterha/app1.cnf 

insert image description here

11. Test the MySQL master-slave connection on the manager node. If MySQL Replication Health is OK appears, it is normal. As shown below.

masterha_check_repl -conf=/etc/masterha/app1.cnf 

insert image description here

12. Start MHA on the manager node

nohup masterha_manager --conf=/etc/masterha/app1.cnf --remove_dead_master_conf --ignore_last_failover < /dev/null > /var/log/masterha/app1/manager.log 2>&1 &

–remove_dead_master_conf: This parameter means that when a master-slave switch occurs, the IP address of the old master database will be removed from the configuration file.
–manger_log: Log storage location.

–ignore_last_failover: By default, if MHA detects consecutive downtimes and the interval between two downtimes is less than 8 hours, no failover will be performed. This restriction is to avoid the ping-pong effect. This parameter means ignoring the files generated by the last MHA-triggered switch. By default, after the MHA switch occurs, the directory will be recorded in the log, that is, the log app1.failover.complete file set above. If the file exists in the directory when switching next time, the switch will not be allowed to be triggered unless the file is deleted after the first switch. For convenience, it is set here to –ignore_last_failover.

13. Check the MHA status and you can see that the current master is the Mysql1 node.

masterha_check_status --conf=/etc/masterha/app1.cnf 

insert image description here

14. Check the MHA log and you can see that the current master is 192.168.126.20, as shown below.

cat /var/log/masterha/app1/manager.log | grep "current master" 

insert image description here

Check whether the VIP address 192.168.126.200 of Mysql1 exists. This VIP address will not disappear because the manager node stops the MHA service.

ifconfig

//To shut down the manager service, you can use the following command.
masterha_stop --conf=/etc/masterha/app1.cnf
Or you can directly kill the process ID to shut it down.

insert image description here

(III) Fault simulation

#Monitor and observe log records on the manager node

tail -f /var/log/masterha/app1/manager.log 

insert image description here

#Stop the mysql service on the Master node Mysql1

systemctl stop mysqld
or pkill -9 mysql 

insert image description here

#After a normal automatic switch, the MHA process will exit. HMA will automatically modify the content of the app1.cnf file and delete the downed mysql1 node. Check whether mysql2 takes over VIP
ifconfig

insert image description here

insert image description here

Algorithm for failover of candidate master database:
1. Generally, the quality of slave databases is judged from (position/GTID). If there is a difference in data, the slave closest to the master becomes the candidate master.
2. If the data is consistent, select the alternate master database according to the order of configuration files.
3. Set a weight (candidate_master=1) to forcibly specify the candidate master according to the weight.
(1) By default, if a slave lags behind the master by 100MB of relay logs, it will fail even if it has weights.
(2) If check_repl_delay=0, it is forcibly selected as the backup master even if it lags behind by a large number of logs.

Troubleshooting steps:

1. Repair mysql

systemctl restart mysqld

2. Repair master and slave

#View the binary files and synchronization points on the current master server Mysql2 show master status;
#Execute synchronization operation on the original master server mysql1 change master to master_host='192.168.126.30',master_user='myslave',master_password='123',master_log_file='master-bin.000001',master_log_pos=1747;

start slave;

show slave status\G

insert image description here

3. Modify the configuration file app1.cnf on the manager node (and add this record, because it will automatically disappear when it detects failure)

vim /etc/masterha/app1.cnf
......
secondary_check_script=/usr/local/bin/masterha_secondary_check -s 192.168.126.20 -s 192.168.126.40
......
[server1]
hostname=192.168.126.30
port=3306

[server2]
candidate_master=1
check_repl_delay=0
hostname=192.168.126.20
port=3306

[server3]
hostname=192.168.126.40
port=3306

insert image description here

4. Start MHA on the manager node

nohup masterha_manager --conf=/etc/masterha/app1.cnf --remove_dead_master_conf --ignore_last_failover < /dev/null > /var/log/masterha/app1/manager.log 2>&1 &

insert image description here

insert image description here

#Solve the problem of Chinese and English characters being incompatible and reporting errors dos2unix /usr/local/bin/master_ip_failover

This concludes this article about MySQL MHA high availability configuration and detailed deployment steps for failover. For more information about MySQL MHA high availability configuration, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope you will support 123WORDPRESS.COM in the future!

You may also be interested in:
  • How to use Python to collect MySQL MHA deployment and operation status information
  • A complete explanation of MySQL high availability architecture: MHA architecture
  • Steps to build MHA architecture deployment in MySQL
  • Summary of several error logs about MySQL MHA setup and switching
  • Mysql GTID Mha configuration method
  • Super deployment tutorial of MHA high availability failover solution under MySQL
  • MHA implements manual switching of MySQL master-slave database
  • Introduction to MySQL MHA operation status monitoring

<<:  Let's talk about the issue of passing parameters to React onClick

>>:  Detailed explanation of the application of CSS Sprite

Recommend

How to optimize MySQL performance through MySQL slow query

As the number of visits increases, the pressure o...

Methods and techniques for quickly displaying web page images

1. Use .gifs rather than .jpgs. GIFs are smaller ...

HTML adaptive table method

<body style="scroll:no"> <tabl...

This article teaches you how to play with CSS border

Border Style The border-style property specifies ...

Summary of event handling in Vue.js front-end framework

1. v-on event monitoring To listen to DOM events,...

Detailed explanation of Docker Swarm service orchestration commands

1. Introduction Docker has an orchestration tool ...

How to solve the synchronization delay caused by MySQL DDL

Table of contents Preface Solution Tool Introduct...

Summary of commonly used multi-table modification statements in Mysql and Oracle

I saw this question in the SQL training question ...

Write a shopping mall card coupon using CSS in three steps

Today is 618, and all major shopping malls are ho...

WiFi Development | Introduction to WiFi Wireless Technology

Table of contents Introduction to WiFi Wireless T...

How to solve the Mysql transaction operation failure

How to solve the Mysql transaction operation fail...

Mini Program implements list countdown function

This article example shares the specific code for...