What is a headless browser?A headless browser is a browser that can run in a graphical interface. I can programmatically control the headless browser to automatically perform various tasks, such as doing tests, taking screenshots of web pages, etc. Why is it called a "headless" browser?The word "headless" comes from the original "headless computer". Wikipedia entry on "headless computer": A headless system is a computer system or device that has been configured to operate without a monitor (i.e., "head"), keyboard, and mouse. Headless systems are usually controlled via a network connection, but some headless system devices also require RS-232 serial connection for device management. Servers are often run in headless mode to reduce operating costs. Why detect headless browsers?Besides the two harmless use cases mentioned previously, headless browsers can be used to automate malicious tasks. The most common forms are web crawlers, disguising traffic, or detecting website vulnerabilities. A very popular headless browser is Phantomjs. Because it is based on the Qt framework, it has many different features compared to our common browsers, so there are many ways to identify it. However, starting with Chrome 59, Google released a headless Google Chrome browser. It is different from Phantomjs. It is developed based on the orthodox Google Chrome, not based on other frameworks, which makes it difficult for the program to distinguish whether it is a normal browser or a headless browser. Below, we will introduce several methods to determine whether the program is running in a normal browser or a headless browser. Detecting headless browsersNote: These methods have only been tested on four machines (2 Linux, 2 Mac), that said, there are certainly many other ways to detect headless browsers. User agentFirst, let's introduce the most common method of determining the type of browser, which is to check the User agent. The User agent value of the Chrome version 59 headless browser on a Linux computer is: “Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (Khtml, like Gecko) HeadlessChrome/59.0.3071.115 Safari/537.36” So, we can detect whether it is a headless Chrome browser like this: if (/HeadlessChrome/.test(window.navigator.userAgent)) { console.log("Chrome headless detected"); } User agent can also be obtained from HTTP headers. However, both of these scenarios are easy to fake. Pluginsnavigator.plugins will return an array containing the plugin information in the current browser. Usually, regular Chrome browsers have some default plugins, such as Chrome PDF viewer or Google Native Client. In contrast, in headless mode, without any plugins, an empty array is returned. if (navigator.plugins.length == 0) { console.log("It may be Chrome headless"); } languageIn Google Chrome, there are two JavaScript properties that can get the current browser language settings: navigator.language and navigator.languages. The first one refers to the language of the browser interface, and the second one returns an array that stores all the secondary languages selected by the browser user. However, in headless mode, navigator.languages returns an empty string. if (navigator.languages == "") { console.log("Chrome headless detected"); } WebGLWebGL provides a set of APIs that can perform 3D rendering in HTML canvas. Through these APIs, we can query the vendor and renderer of the graphics driver. In a regular Google Chrome browser on Linux, we get the renderer and vendor values as: “Google SwiftShader” and “Google Inc.”. And in headless mode, the one we get is "Mesa OffScreen" - which is the name of the rendering technology that doesn't use any window system, and "Brian Paul" - the original programmer of the open source Mesa graphics library. var canvas = document.createElement('canvas'); var gl = canvas.getContext('webgl'); var debugInfo = gl.getExtension('WEBGL_debug_renderer_info'); var vendor = gl.getParameter(debugInfo.UNMASKED_VENDOR_WEBGL); var renderer = gl.getParameter(debugInfo.UNMASKED_RENDERER_WEBGL); if (vendor == "Brian Paul" && renderer == "Mesa OffScreen") { console.log("Chrome headless detected"); } Not all versions of headless browsers have the same values for these two. However, currently in headless browsers the values are "Mesa Offscreen" and "Brian Paul" Browser FeaturesModernizr can detect the current browser's support for various features of HTML and CSS. The only difference I found between regular Chrome and headless Chrome is that in headless mode there is no hairline feature, which is used to detect if hidpi/retina hairlines are supported if (!Modernizr["hairline"]) { console.log("It may be Chrome headless"); } Failed to load imageFinally, the last method I found, and the one that seems to work the best, is to check the height and width of the image that is not loading properly in the browser. In normal Chrome, the size of the image that failed to load successfully is related to the browser's zoom, but it is definitely not zero. In the headless Chrome browser, the width and height of this image are both 0. var body = document.getElementsByTagName("body")[0]; var image = document.createElement("img"); image.src = "http://iloveponeydotcom32188.jg"; image.setAttribute("id", "fakeimage"); body.appendChild(image); image.onerror = function(){ if(image.width == 0 && image.height == 0) { console.log("Chrome headless detected"); } } The above is the details of how to use JavaScript to detect whether the current browser is a headless browser. For more information about JavaScript, please pay attention to other related articles on 123WORDPRESS.COM! You may also be interested in:
|
<<: Write a mysql data backup script using shell
>>: How to set a fixed IP in Linux (tested and effective)
I used the label tag when I was doing something re...
Preface MySQL master-slave replication is the bas...
01. Command Overview md5sum - Calculate and verif...
Around version 0.6, privileged was introduced to ...
1. Use CSS to draw a small pointed-corner chat di...
<br />Previous article: Web Design Tutorial ...
Table of contents Creating Arrays in JavaScript U...
Table of contents Preface call usage accomplish A...
MySQL implements sequence function 1. Create a se...
First, what is box collapse? Elements that should...
mysql between boundary range The range of between...
Product designers face complex and large manufactu...
The difference between replace into and insert in...
Preface: Because many business tables use design ...
This article briefly introduces the process of se...