Detailed process of installing Presto and connecting Hive in Docker

Detailed process of installing Presto and connecting Hive in Docker

1. Introduction

Presto is an open source distributed SQL query engine for running interactive analytical queries against data sources ranging from gigabytes to petabytes. Presto allows you to query data wherever it resides, including Hive, Cassandra, relational databases, and even proprietary data stores. A single Presto query can combine data from multiple sources, allowing analysis across your entire organization.

2. Download and install

1) Download the image

docker pull starburstdata/presto

2) Download the client jar

Because the pulled image is a server and does not have the presto-cli command, you need to download presto-cli-0.260.1-executable.jar and then pull the downloaded jar package into the Linux system.

After the download is complete, run the container:

#The port number is 8080, please modify it yourself, the name is presto
docker run -d -p 8080:8080 --name presto starburstdata/presto

Copy the client jar package to the bin directory in the container:

docker cp [linux path where the client jar package is located] [container name]:/bin

Go to the hadoop configuration file directory and copy core-site.xml and hdfs-site.xml to the container

cd ${HADOOP_HOME}/etc/hadoop
docker cp core-site.xml presto:/usr/lib/presto/etc/
docker cp core-site.xml presto:/usr/lib/presto/etc/

Enter the container (the following operations not marked are all performed in the container):

#Enter the container as root user to prevent unauthorized file operations docker exec --user=root -it presto /bin/bash

Rename the client jar package and grant executable permissions:

cd bin
mv presto-cli-0.260.1-executable.jar presto-cli
chmod a+x presto-cli

3. Configuration File

Enter the configuration file directory and create the hive configuration file:

cd /usr/lib/presto/etc/catalog
vi hive.properties

hive.properties:

#Connection name, fixed connector.name=hive-hadoop2
#hive metadata uri, can be found in hive-site.xml, please modify the ip and port yourself hive.metastore.uri=thrift://192.168.10.51:9083
#Hive configuration resources core-site.xml and hdfs-site.xml, please modify according to the configuration file path in your cluster, separated by commas hive.config.resources=/usr/lib/presto/etc/core-site.xml,/usr/lib/presto/etc/hdfs-site.xml

wqSave and exit

Restart the container (on the Linux command line):

docker restart presto

4. Test the connection

Enter the container:

#Enter the container as root user to prevent unauthorized file operations docker exec --user=root -it presto /bin/bash
#Enter the bin directory cd bin
#Execute the command line script and test the hive connection, connect to the hive configuration file, the database is default
./presto-cli --server localhost:8080 --catalog hive --schema default
presto>show tables;

This is the end of this article about the detailed process of installing Presto on Docker to connect to Hive. For more information about installing Presto on Docker, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope you will support 123WORDPRESS.COM in the future!

You may also be interested in:
  • Docker installs ClickHouse and initializes data testing
  • How to install docker and portainer in kali
  • Installing Docker Desktop reports an error message: WSL 2 installation is incomplete (Solve the error message)
  • Detailed process of installing and deploying onlyoffice in docker
  • A detailed tutorial on how to install Jenkins on Docker for beginners

<<:  JavaScript exquisite snake implementation process

>>:  Sample code for implementing 3D book effect with CSS

Recommend

Detailed explanation of SSH password-free login configuration under Linux

Assume there are two Linux servers A and B, and w...

How to deploy Confluence and jira-software in Docker

version: centos==7.2 jdk==1.8 confluence==6.15.4 ...

MySQL query sorting and paging related

Overview It is usually not what we want to presen...

Common shell script commands and related knowledge under Linux

Table of contents 1. Some points to remember 1. V...

JavaScript explains the encapsulation and use of slow-motion animation

Implementing process analysis (1) How to call rep...

How to disable foreign key constraint checking in MySQL child tables

Prepare: Define a teacher table and a student tab...

Detailed explanation of GaussDB for MySQL performance optimization

Table of contents background Inspiration comes fr...

Docker connects to the host Mysql operation

Today, the company project needs to configure doc...

Detailed explanation of the installation process of Jenkins on CentOS 7

Install Jenkins via Yum 1. Installation # yum sou...

A QQ chat room based on vue.js

Table of contents Introduction The following is a...

Summary of some situations when Docker container disk is full

Preface This article describes two situations I h...

CSS modular solution

There are probably as many modular solutions for ...

How to split data in MySQL table and database

Table of contents 1. Vertical (longitudinal) slic...

Detailed explanation on how to modify the default port of nginx

First find out where the configuration file is wh...