PrefaceI need to deploy the selenium crawler program to the Linux server for work. I would like to share this with you. If you are interested, you can take a look. 1. What is selenium?Selenium is a tool used for web application testing. Selenium tests run directly in the browser, just like real users are operating, and crawlers use it to crawl some data dynamically loaded by js 2. Usage steps1. Import libraryThe code is as follows from selenium.webdriver import Chrome from selenium.webdriver.chrome.service import Service from selenium.webdriver.chrome.options import Options # Use a headless browser from selenium.webdriver import ChromeOptions chrome_options = Options() options = ChromeOptions() options.add_experimental_option('excludeSwitches', ['enable-automation']) # => Remove the browser being controlled by the automated testing software options.add_experimental_option('useAutomationExtension', False) chrome_options.add_argument("--headless") # => Configure headless mode for Chrome chrome_options.add_argument('--no-sandbox') chrome_options.add_argument('--disable-gpu') chrome_options.add_argument('--disable-dev-shm-usage') 2. Test codeThe code is as follows: s = Service(r"/home/driver/chromedriver") driver = Chrome( service=s, options=chrome_options ) driver.get("https://www.baidu.com") print(diiver.title) 3. Deployment Procedure1. Install ChromeThe command is as follows: yum install https://dl.google.com/linux/direct/google-chrome-stable_current_x86_64.rpm Check the version of Chrome: google-chrome --version 2. Install chromedriverThe command is as follows: Download the chromedriver driver address according to the corresponding chrome version: https://npm.taobao.org/mirrors/chromedriver My version number is: 96.0.4664.45 wget https://npm.taobao.org/mirrors/chromedriver/96.0.4664.45/chromedriver_linux64.zip yum install -y unzip zip unzip chromedriver_linux64.zip # Unzip the zip file mkdir driver #Create a new folder to store the driver chmod 777 driver/chromedriver # This is the permission. I give it 777 here 3. Run the test codeCreate a new test.py file vi test.py Save test.py and run it. Seeing this, my request is successful. SummarizeThis is the end of this article about deploying selenium crawler program under Linux system. For more relevant content about linux selenium crawler program, please search previous articles of 123WORDPRESS.COM or continue to browse the related articles below. I hope you will support 123WORDPRESS.COM in the future! You may also be interested in:
|
<<: The difference between animation and transition
This article shares the specific code for the WeC...
OBS studio is cool, but JavaScript is cooler. Now...
Color is one of the most important elements for a...
Today we will look at why master-slave delay occu...
1. Table structure TABLE person id name 1 you 2 Y...
Scenario: The crawled data generates a data table...
Preface When it comes to database transactions, a...
Important note: Before studying this article, you...
This article is part of a special series on the 2...
Table of contents Written in front Environment de...
The latest Insider version of Visual Studio Code ...
jQuery realizes the effect of theater seat select...
Table of contents Overview Code Implementation Su...
Table of contents 1. Follow the wizard to create ...
Table of contents 1:mysql execution process 1.1: ...