mysql-canal-rabbitmq installation and deployment super detailed tutorial

mysql-canal-rabbitmq installation and deployment super detailed tutorial

original

1.1. Enable MySQL binlog

1. Modify my.cnf or my.ini (windows), add configuration items:

# binlog log storage path log-bin=D:\env\mysql-5.7.28-winx64\binlog
# The log records the format in which each row of data is modified binlog-format=ROW
# The service ID of the current machine. If it is a cluster, it cannot be repeated server_id=1

2. After restarting the MySQL service, check whether the configuration variables are effective:

mysql> show variables like '%log_bin%';
+---------------------------------+----------------------+
| Variable_name | Value |
+---------------------------------+----------------------+
| log_bin | ON |
| log_bin_basename | D:\env\mysql-5 |
| log_bin_index | D:\env\mysql-5.index |
| log_bin_trust_function_creators | OFF |
| log_bin_use_v1_row_events | OFF |
| sql_log_bin | ON |
+---------------------------------+----------------------+
6 rows in set, 1 warning (0.00 sec)

Configure the canal user of the mysql database

mysql -uroot -p log in to mysql, create and authorize user canal ;

CREATE USER canal IDENTIFIED BY 'canal';
GRANT SELECT, REPLICATION SLAVE, REPLICATION CLIENT ON *.* TO 'canal'@'%';
FLUSH PRIVILEGES;

1.2. Configure RabbitMQ Exchanges and Queues

1. Create a new Queue

2. Create a new Exchange

3. Set the Bindings in the Queue, fill in the Exchange name, and the Routing key ;

1.3. Installing a standalone canal

1.3.1. Download and install

Download and unzip;

sudo wget https://github.com/alibaba/canal/releases/download/canal-1.1.4/canal.deployer-1.1.4.tar.gz
sudo tar -zxvf canal.deployer-1.1.4.tar.gz

Installation of the latest version 1.1.5

sudo wget https://github.com/alibaba/canal/releases/download/canal-1.1.5-alpha-1/canal.deployer-1.1.5-SNAPSHOT.tar.gz
sudo tar -zxvf canal.deployer-1.1.5-SNAPSHOT.tar.gz

Configuration Files

1.3.2.1. Node configuration file canal.properties

# tcp bind ip, the IP address of the current node canal.ip = 192.168.2.108
# register ip to zookeeper, register to ZK's IP address, as shown in Figure 1.
canal.register.ip = 192.168.2.108

canal.zkServers = zk cluster # tcp, kafka, RocketMQ, the latest version 1.1.5 can directly connect to rabbitmq
canal.serverMode = rabbitmq

# destinations, the instance list deployed on the current server, corresponding to the name of each instance folder (../conf/<instance_name>) canal.destinations = example2

# Set the mq server address, here is the rabbitmq server address #!! The default configuration after downloading here is to configure the IP: port # rabbitmq does not need to configure the port here canal.mq.servers = 192.168.208.100

# The following items are all new versions 1.1.5 that support rabbitmq configuration canal.mq.vhost=/
canal.mq.exchange=example2-ex # Specify the exchange name on rabbitmq, the name created in the "Create `Exchange`" step canal.mq.username=admin # Username for connecting to rabbitmq canal.mq.password=**** # Password for connecting to rabbitmq canal.mq.aliyunuid= 

1.3.2.2. Instance configuration file instance.properties

# position info, database connection information canal.instance.master.address=192.168.2.108:3306
# The following two configurations require the values ​​of the fields `File` and `Position` obtained by executing `SHOW MASTER STATUS` in the database of the address configured above canal.instance.master.journal.name=mysql-5.7
canal.instance.master.position=674996

# table meta tsdb info, disable tsdb to record the time series version of table meta canal.instance.tsdb.enable=false

# username/password, the username and password for the instance connection data canal.instance.dbUsername=canal
canal.instance.dbPassword=canal

# table regex, regular matching database table to be monitored canal.instance.filter.regex=ysb\\.useropcosttimes_prod

# mq config, specify the routing to which rabbitmq is bound, see the third step of "Configure rabbitmq" for details.
canal.mq.topic=example2-routingkey

1.3.3. Start the canal service

The startup script for Linux ./bin/startup.sh , and the startup script for Windows is ./bin/startup.bat ; Take Windows as an example:

λ .\startup.bat
start cmd : java -Xms128m -Xmx512m -XX:PermSize=128m -Djava.awt.headless=true -Djava.net.preferIPv4Stack=true -Dapplication.codeset=UTF-8 -Dfile.encoding=UTF-8 -server -Xdebug -Xnoagent -Djava.compiler=NONE -Xrunjdwp:transport=dt_socket,address=9099,server=y,suspend=n -DappName=otter-canal -Dlogback.configurationFile="d:\env\green\canal-1.1.5\bin\\..\conf\logback.xml" -Dcanal.conf="d:\env\green\canal-1.1.5\bin\\..\conf\canal.properties" -classpath "d:\env\green\canal-1.1.5\bin\\..\conf\..\lib\*;d:\env\green\canal-1.1.5\bin\\..\conf" java -Xms128m -Xmx512m -XX:PermSize=128m -Djava.awt.headless=true -Djava.net.preferIPv4Stack=true -Dapplication.codeset=UTF-8 -Dfile.encoding=UTF-8 -server -Xdebug -Xnoagent -Djava.compiler=NONE -Xrunjdwp:transport=dt_socket,address=9099,server=y,suspend=n -DappName=otter-canal -Dlogback.configurationFile="d:\env\green\canal-1.1.5\bin\\..\conf\logback.xml" -Dcanal.conf="d:\env\green\canal-1.1.5\bin\\..\conf\canal.properties" -classpath "d:\env\green\canal-1.1.5\bin\\..\conf\..\lib\*;d:\env\green\canal-1.1.5\bin\\..\conf" com.alibaba.otter.canal.deployer.CanalLauncher
Java HotSpot(TM) Server VM warning: ignoring option PermSize=128m; support was removed in 8.0
Listening for transport dt_socket at address: 9099

Finally, manually modify the database data, or wait for other modifications, and then check the monitoring on rabbitmq to see whether the process is successful.

1.4. Installing cluster canal

1.4.1. Install canal-admin

1.4.1.1. Download and install

Download and unzip

sudo wget https://github.com/alibaba/canal/releases/download/canal-1.1.5-alpha-1/canal.admin-1.1.5-SNAPSHOT.tar.gz
sudo tar -zxvf canal.admin-1.1.5-SNAPSHOT.tar.gz

Configuration Files

application.yml

server:
 port: 8089
spring:
 jackson:
  date-format: yyyy-MM-dd HH:mm:ss
  time-zone: GMT+8

spring.datasource:
 address: 192.168.2.108:3306
 database: canal_manager
 Username: canal
 password: canal
 driver-class-name: com.mysql.jdbc.Driver
 # `serverTimezone=UTC` needs to be added to the end of the database connection string, otherwise a time zone exception will be reported during startup;
 url: jdbc:mysql://${spring.datasource.address}/${spring.datasource.database}?useUnicode=true&characterEncoding=UTF-8&useSSL=false&serverTimezone=UTC
 Hikari:
  maximum-pool-size: 30
  minimum-idle: 1

canal:
 # Configure the administrator account and password of canal-adminUser: admin
 adminPasswd: 123456

canal_manager.sql

Execute this SQL script in the database that manages canal-admin data to initialize some tables;

1.4.1.3. Start the canal-admin service

The startup script for Linux ./bin/startup.sh , and the startup script for Windows is ./bin/startup.bat ; Take Windows as an example:

λ .\startup.bat
start cmd : java -Xms128m -Xmx512m -Djava.awt.headless=true -Djava.net.preferIPv4Stack=true -Dapplication.codeset=UTF-8 -Dfile.encoding=UTF-8 -DappName=canal-admin -classpath "D:\env\green\canal-1.1.5-admin\bin\\..\conf\..\lib\*;D:\env\green\canal-1.1.5-admin\bin\\..\conf" com.alibaba.otter.canal.admin.CanalAdminApplication
2020-04-13 20:01:39.495 [main] INFO com.alibaba.otter.canal.admin.CanalAdminApplication - Starting CanalAdminApplication on Memento-PC with PID 50696 (D:\env\green\canal-1.1.5-admin\lib\canal-admin-server-1.1.5-SNAPSHOT.jar started by Memento in D:\env\green\canal-1.1.5-admin\bin)
2020-04-13 20:01:39.527 [main] INFO com.alibaba.otter.canal.admin.CanalAdminApplication - No active profile set, falling back to default profiles: default
2020-04-13 20:01:39.566 [main] INFO osbwscAnnotationConfigServletWebServerApplicationContext - Refreshing org.springframework.boot.web.servlet.context.AnnotationConfigServletWebServerApplicationContext@13a5bf6: startup date [Mon Apr 13 20:01:39 CST 2020]; root of context hierarchy
2020-04-13 20:01:41.149 [main] INFO osboot.web.embedded.tomcat.TomcatWebServer - Tomcat initialized with port(s): 8089 (http)
2020-04-13 20:01:41.166 [main] INFO org.apache.coyote.http11.Http11NioProtocol - Initializing ProtocolHandler ["http-nio-8089"]
2020-04-13 20:01:41.176 [main] INFO org.apache.catalina.core.StandardService - Starting service [Tomcat]
2020-04-13 20:01:41.177 [main] INFO org.apache.catalina.core.StandardEngine - Starting Servlet Engine: Apache Tomcat/8.5.29
...
2020-04-13 20:01:42.996 [main] INFO org.apache.coyote.http11.Http11NioProtocol - Starting ProtocolHandler ["http-nio-8089"]
2020-04-13 20:01:43.007 [main] INFO org.apache.tomcat.util.net.NioSelectorPool - Using a shared selector for servlet write/read
2020-04-13 20:01:43.019 [main] INFO osboot.web.embedded.tomcat.TomcatWebServer - Tomcat started on port(s): 8089 (http) with context path ''
2020-04-13 20:01:43.024 [main] INFO com.alibaba.otter.canal.admin.CanalAdminApplication - Started CanalAdminApplication in 3.919 seconds (JVM running for 5.241)

1.4.1.4. Notes

canal-admin account that connects to the database must have permissions to create tables, read and write data. If you still use the canal account created above, you need to extend the permissions:

GRANT ALL PRIVILEGES ON *.* TO 'canal'@'%';

1.4.2. Add a standalone canal-server node

1.4.2.1. Start canal-server node service

The stand-alone canal-server is started as usual. At this time, the configuration information in ../conf/canal.properties loaded by canal-server by default can be obtained from the ../bin/startup.bat[startup.sh] script, which can be obtained from the command prompt of the executed script;

1.4.2.2. Create a new stand-alone server

Create a stand-alone server in canal-admin

The server will automatically identify the started canal-server node, but after being taken over by admin , it will not automatically load the ../conf/canal.properties configuration file. Click操作-配置on the far right to view the default configuration information loaded by the server.


You need to manually copy the configuration information in ../conf/canal.properties configured in 1.3.2 to this configuration to overwrite it!

1.4.2.3. Create a new instance

Manually create a new instance in canal-admin , corresponding to the instance example2 in the single-machine canal-server configuration; similarly, you need to manually copy ./conf/<實例名稱>/instance.properies configuration file to admin

!!Note that when creating or starting instance , first delete the meta.dat file in the instance folder and update the two configuration items canal.instance.master.journal.name=... and canal.instance.master.position=... ;

1.4.3. Adding cluster canal-server nodes

1.4.3.1. Create a new cluster

You need to specify the cluster name and configure the zookeeper cluster address bound to the cluster;

After the creation is successful, configure the general server configuration information of the cluster in操作-主配置on the far right

You can also copy the previously configured ../conf/canal.properties configuration directly here and modify it slightly before using it.

# canal admin config
canal.admin.manager = 192.168.2.108:8089

canal.instance.global.mode = manager

1.4.3.2. Create a new server

Specify the cluster to which it belongs, which is the cluster name set in 1.4.3.1 ;

If canal-server node service has been started before, the newly created server will be automatically recognized as啟動, otherwise it will be斷開;

There is one thing that needs to be paid special attention to here.

Careful people may find that in addition to the canal.properties configuration file, there is also a canal_local.properties configuration file. The latter has much less content than the former, because this file is used to build the local node configuration file when canal cluster is built, while the other information in the former configuration file is centrally configured and managed by canal-admin ;

In the ./bin/startup.bat[startup.sh] startup script, the canal.properties configuration file is loaded by default, that is, the service is started in stand-alone mode;

When building a canal cluster on Windows, you need to manually modify startup.bat . The blue mark is the configuration file path for loading %canal_conf% variable, so you need to adjust the variable in the red box to:

@rem set canal_conf=...
set canal_conf=%conf_dir%\canal_local.properties

Load the canal_local.properties configuration file at startup

1.4.3.3. Create a new instance

The configuration here can also be adjusted based on the configuration of instance 1.4.2.3 in the stand-alone server;

# 2. position info, specify the binlog position information where MySQL starts to synchronize canal.instance.master.address=192.168.0.25:63306
canal.instance.master.journal.name=mysql-bin.001349
canal.instance.master.position=198213313

# 3. Username/password, set the database username and password for syncing MySQL canal.instance.dbUsername=xxxx
canal.instance.dbPassword=xxx

# 4. table regex, regular expression matching the data table to be synchronized canal.instance.filter.regex=xxxx

# 5. mq config, specify the routing binding on mysql, see `1.2.3`
canal.mq.topic=example2-routingkey

After saving, you can start the instance in the operation

If the instance here cannot be started, try the following steps to check and operate it:

Check whether canal.destinations in主配置of the cluster contains the name of the newly created instance ; check whether canal-server node has loaded the canal_local.properties configuration file; delete the .db and .bat files in the instance folder, update binglog position of canal.instance.master.position in the instance configuration file, and start instance ;

1.5. Canal Configuration Instructions

1.5.1. canal.properties

  1. canal.ip, the node IP
  2. canal.register.ip, IP registered to zookeeper
  3. canal.zkServers, zk cluster
  4. Whether to enable tsdb and enable the time series version recording function of table meta
  5. // 5. canal.serverMode, set to rabbitmq, default is tcp
canal.instance.tsdb.enable = true
canal.instance.tsdb.dir = ${canal.file.data.dir:../conf}/${canal.instance.destination:}
canal.instance.tsdb.url = jdbc:h2:${canal.instance.tsdb.dir}/h2;CACHE_SIZE=1000;MODE=MYSQL;
canal.instance.tsdb.dbUsername = canal
canal.instance.tsdb.dbPassword = canal

5.canal.destinations, a list of instances deployed on the current cluster

6.canal.mq.servers, set the Rabbitmq cluster address, !! You cannot add the port here

1.5.2. instance.properties

  1. canal.instance.master.address, master database address
  2. canal.instance.master.journal.name, the File value of executing show master status in the database
  3. canal.instance.master.position, the Position value of show master status in the database
  4. canal.instance.tsdb.enable=false, disable tsdb
  5. canal.instance.dbUsername, instance database username
  6. canal.instance.dbPassword, instance database password
  7. canal.instance.filter.regex, matches the table that needs to be synchronized
  8. canal.mq.topic, canal registers the topic name of mq

1.5.3. properties configuration file

properties configuration is divided into two parts

canal.properties (system root configuration file)
instance.properties (instance-level configuration file, one for each instance)

1.canal.properties

canal.destinations # List of instances deployed on the current server canal.conf.dir # Path to the conf directory canal.auto.scan # Enable automatic instance scanning # If configured to true, instance configuration changes under the canal.conf.dir directory will be automatically triggered # 1. Added instance directory: triggers instance configuration loading, and automatically starts when lazy is true;
# 2. Instance directory deletion: Uninstall the corresponding instance configuration, and shut down if it has been started;
# 3. Changes in instance.properties file: reload instance configuration, and automatically restart if it has been started;
canal.auto.scan.interval # instance automatic scan interval, unit s
canal.instance.global.mode # Global configuration loading modecanal.instance.global.lazy # Global lazy modecanal.instance.global.manager.address # Link information of global manager configuration modecanal.instance.global.spring.xml # Component file of global spring configuration modecanal.instance.example.mode
canal.instance.example.lazy
canal.instance.example.spring.xml
# Instance-level configuration definition. If configured, it will automatically overwrite the global configuration definition modecanal.instance.tsdb.enable # Whether to enable the time series version recording function of table metacanal.instance.tsdb.dir # Local storage path of the time series version. The default is the instance directorycanal.instance.tsdb.url # Database connection address of the time series version. The default is the local embedded databasecanal.instance.tsdb.dbUsername # Database connection account of the time series versioncanal.instance.tsdb.dbPassword # Database connection password of the time series version

2.instance.properties

canal.id # Unique identifier for each canal server instancecanal.ip # Local IP information bound to the canal server. If not configured, a local IP will be selected by default to start the servicecanal.port # Port for the canal server to provide socket servicecanal.zkServers # Connection address of the canal server to connect to the zookeeper cluster, for example: 10.20.144.22:2181,10.20.144.23:2181
canal.zookeeper.flush.period # The update frequency of canal persistent data to zookeeper, in ms
canal.instance.memory.batch.mode # Data cache mode in canal memory store# 1. ITEMSIZE: limit according to buffer.size, only limit the number of records# 2. MEMSIZE: limit the size of cached records according to the size of buffer.size * buffer.memunit;
canal.instance.memory.buffer.size # The number of buffer records that can be cached in the canal memory store, needs to be an exponent of 2canal.instance.memory.buffer.memunit # The unit size of the memory record, defaults to 1KB, and combines with buffer.size to determine the final memory usage sizecanal.instance.transactions.size # The maximum transaction complete resolution length supported. After exceeding this length, a transaction may be split into multiple submissions to the canal store, and the complete visibility of the transaction cannot be guaranteedcanal.instance.fallbackIntervalInSeconds # When canal switches to MySQL, the time it needs to look forward when looking for binlog on the new MySQL database, unit s
# Description: There may be parsing delays or clock inconsistencies in the MySQL master and slave databases. It is necessary to roll back for a period of time to ensure that data is not lost canal.instance.detecting.enable # Whether to enable heartbeat check canal.instance.detecting.sql # Heartbeat check sql, insert into retl.xdual values(1,now()) on duplicate key update x=now()
canal.instance.detecting.interval.time # Heartbeat check frequency, unit s
canal.instance.detecting.retry.threshold # Heartbeat check failed retry timescanal.instance.detecting.heatbeatHaEnable # After the heartbeat check fails, whether to enable MySQL automatic switching# Note: For example, if the heartbeat check fails more than the threshold, if this configuration is true, canal will automatically connect to the MySQL standby database to obtain binlog datacanal.instance.network.receiveBufferSize # Network connection parameters, SocketOptions.SO_RCVBUF
canal.instance.network.sendBufferSize # Network connection parameters, SocketOptions.SO_SNDBUF
canal.instance.network.soTimeout # Network connection parameters, SocketOptions.SO_TIMEOUT

1.5.4. canal.mq.dynamicTopic

Reference: https://github.com/alibaba/canal/wiki/Canal-Kafka-RocketMQ-QuickStart?tdsourcetag=s_pctim_aiomsg

1.6. Problem Solving

1. When executing startup.bat to start canal under Windows, the following exception occurs

Failed to instantiate [ch.qos.logback.classic.LoggerContext]
Reported exception:
ch.qos.logback.core.LogbackException: Unexpected filename extension of file [file:/D:/env/green/canal/conf/]. Should be either .groovy or .xml
at ch.qos.logback.classic.util.ContextInitializer.configureByResource(ContextInitializer.java:79)
at ch.qos.logback.classic.util.ContextInitializer.autoConfig(ContextInitializer.java:152)
at org.slf4j.impl.StaticLoggerBinder.init(StaticLoggerBinder.java:85)
at org.slf4j.impl.StaticLoggerBinder.<clinit>(StaticLoggerBinder.java:55)
at org.slf4j.LoggerFactory.bind(LoggerFactory.java:141)
at org.slf4j.LoggerFactory.performInitialization(LoggerFactory.java:120)
at org.slf4j.LoggerFactory.getILoggerFactory(LoggerFactory.java:331)
at org.slf4j.LoggerFactory.getLogger(LoggerFactory.java:283)
at org.slf4j.LoggerFactory.getLogger(LoggerFactory.java:304)
at com.alibaba.otter.canal.deployer.CanalLauncher.<clinit>(CanalLauncher.java:29)

Solution:
Comment out the following line of code in startup.bat
@rem set logback_configurationFile=%conf_dir%\logback.xml

Note: This problem does not exist in the new version 1.1.5 . This line in the file 1.1.5 is not commented out.

The following exception occurs when starting the new version of canal-admin 1.1.5 :

2020-04-10 18:55:40.406 [main] ERROR com.zaxxer.hikari.pool.HikariPool - HikariPool-1 - Exception during pool initialization.
java.sql.SQLException: The server time zone value '�й���׼ʱ��' is unrecognized or represents more than one time zone. You must configure either the server or JDBC driver (via the serverTimezone configuration property) to use a more specifc time zone value if you want to utilize time zone support.

Workaround

Add the parameter &serverTimezone=UTC to the mysql connection address configured by spring.datasource.url

Exceptions appear in Instance log errno = 1236, sqlstate = HY000 errmsg = log event entry exceeded max_allowed_packet;

2020-04-13 13:06:09.507 [destination = example3 , address = /192.168.2.108:3306 , EventParser] ERROR com.alibaba.otter.canal.common.alarm.LogAlarmHandler - destination:example3[java.io.IOException: Received error packet: errno = 1236, sqlstate = HY000 errmsg = log event entry exceeded max_allowed_packet; Increase max_allowed_packet on master; the first event 'mysql-5.7' at 671745, the last event read from 'D:\env\mysql-5.7' at 673181, the last byte read from 'D:\env\mysql-5.7' at 673200.
at com.alibaba.otter.canal.parse.inbound.mysql.dbsync.DirectLogFetcher.fetch(DirectLogFetcher.java:102)
at com.alibaba.otter.canal.parse.inbound.mysql.MysqlConnection.dump(MysqlConnection.java:235)
at com.alibaba.otter.canal.parse.inbound.AbstractEventParser$3.run(AbstractEventParser.java:265)
at java.lang.Thread.run(Unknown Source)
]

Workaround

Delete the meta.dat file in the corresponding instance under canal/conf , and let canal-admin automatically regenerate it;

1.7. References

canal (incremental subscription and consumption based on MySQL database binlog)

Canal Admin builds a Canal cluster and experiences

Canal integrates with RabbitMQ

canal series - configuration file introduction

This is the end of this article about the super detailed tutorial on mysql-canal-rabbitmq installation and deployment. For more information about mysql-canal-rabbitmq installation and deployment, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope you will support 123WORDPRESS.COM in the future!

You may also be interested in:
  • Solution to the problem of mysql master-slave switch canal
  • Detailed explanation of the binlog log analysis tool for monitoring MySQL: Canal
  • SpringBoot integrates Elasticsearch to implement indexing and document operations
  • Use elasticsearch to delete index data regularly
  • Use canal to monitor MySQL database to implement real-time update of elasticsearch index

<<:  Eight rules for effective web forms

>>:  100 ways to change the color of an image using CSS (worth collecting)

Blog    

Recommend

Quickly solve the problem that CentOS cannot access the Internet in VMware

Yesterday I installed CentOS7 under VMware. I wan...

How to Run a Command at a Specific Time in Linux

The other day I was using rsync to transfer a lar...

Nginx routing forwarding and reverse proxy location configuration implementation

Three ways to configure Nginx The first method di...

Designing the experience: What’s on the button

<br />Recently, UCDChina wrote a series of a...

Detailed explanation of MySQL 8's new feature ROLE

What problems does MySQL ROLE solve? If you are a...

Detailed process of modifying hostname after Docker creates a container

There is a medicine for regret in the world, as l...

CSS3 flip card number sample code

I received a task from the company today, and the...

A very detailed tutorial on installing rocketmq under Docker Desktop

Install Docker Desktop Download address: Docker D...

Web Theory: Don't make me think Reading Notes

Chapter 1 <br />The most important principl...

CentOS7 installation GUI interface and remote connection implementation

Use the browser (webdriver)-based selenium techno...