Preface

I needed to deploy a Selenium crawler program to a Linux server for work, and I am sharing the steps here for anyone who is interested.

1. What is selenium?

Selenium is a tool for testing web applications. Selenium tests run directly in the browser, just as a real user would operate it, and crawlers use it to collect data that is loaded dynamically by JavaScript.

2. Usage steps

1. Import library

The code is as follows:

    from selenium.webdriver import Chrome
    from selenium.webdriver.chrome.service import Service
    from selenium.webdriver.chrome.options import Options

    # One Options object holds every setting and is passed to Chrome() in the test code
    chrome_options = Options()
    # Hide the "Chrome is being controlled by automated test software" notice
    chrome_options.add_experimental_option('excludeSwitches', ['enable-automation'])
    chrome_options.add_experimental_option('useAutomationExtension', False)
    chrome_options.add_argument("--headless")  # run Chrome in headless mode
    chrome_options.add_argument('--no-sandbox')
    chrome_options.add_argument('--disable-gpu')
    chrome_options.add_argument('--disable-dev-shm-usage')

2. Test code

The code is as follows:

    s = Service(r"/home/driver/chromedriver")
    driver = Chrome(service=s, options=chrome_options)
    driver.get("https://www.baidu.com")
    print(driver.title)
    driver.quit()  # close the browser and stop the chromedriver process

3. Deployment Procedure

1. Install Chrome

The command is as follows:

    yum install https://dl.google.com/linux/direct/google-chrome-stable_current_x86_64.rpm

Check the version of Chrome:

    google-chrome --version

2. Install chromedriver

Download the chromedriver build that corresponds to your Chrome version from: https://npm.taobao.org/mirrors/chromedriver

My version number is 96.0.4664.45, so the commands are as follows:

    wget https://npm.taobao.org/mirrors/chromedriver/96.0.4664.45/chromedriver_linux64.zip
    yum install -y unzip zip
    unzip chromedriver_linux64.zip   # unzip the zip file
    mkdir driver                     # create a new folder to store the driver
    mv chromedriver driver/          # move the extracted binary into the folder
    chmod 777 driver/chromedriver    # set the permissions; I give it 777 here

The chmod command and the test code both expect the binary inside the driver folder, which is why the extracted file is moved there. The test code uses the path /home/driver/chromedriver, so either run these commands from /home or adjust the path passed to Service.

3. Run the test code

Create a new test.py file:

    vi test.py

Put the import code and the test code from section 2 into test.py, save it, and run it. If the page title is printed, the request was successful.

Summary

This concludes the article on deploying a Selenium crawler program on a Linux system.
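The chromedriver step depends on the driver matching the installed Chrome version. As a rough sketch of how that check could be automated before running the crawler, the snippet below compares the major version numbers reported by the two `--version` commands. The function names (`major_version`, `versions_match`, `check_installed`) are made up for this illustration, and the code assumes both commands print a four-part version number such as 96.0.4664.45, as in the version shown above.

```python
import re
import subprocess


def major_version(version_output):
    """Return the major version from text such as 'Google Chrome 96.0.4664.45'."""
    # Assumption: the output contains a four-part dotted version number.
    match = re.search(r"(\d+)\.\d+\.\d+\.\d+", version_output)
    if match is None:
        raise ValueError("no version number found in %r" % version_output)
    return int(match.group(1))


def versions_match(chrome_output, driver_output):
    """True when Chrome and chromedriver report the same major version."""
    return major_version(chrome_output) == major_version(driver_output)


def check_installed(chrome_cmd="google-chrome",
                    driver_path="/home/driver/chromedriver"):
    """Run both binaries with --version and compare the results.

    The default driver path is the one used in the test code; change it
    if the driver was unpacked somewhere else.
    """
    chrome_out = subprocess.run(
        [chrome_cmd, "--version"], capture_output=True, text=True, check=True
    ).stdout
    driver_out = subprocess.run(
        [driver_path, "--version"], capture_output=True, text=True, check=True
    ).stdout
    return versions_match(chrome_out, driver_out)
```

Calling `check_installed()` on the server before starting the crawler would return False if Chrome has been upgraded by yum while the driver stayed at the old version, which is a common reason for the browser failing to start.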