Analysis of the causes of accidents caused by Unicode signature BOM

Maybe you are using include files here, which is usually done for headers and footers. When I opened the included file, I found that the item "Include Unicode Signature BOM" in the page properties was checked. Then I tell you that the accident was caused by this BOM.

unicode-bom

Today, I encountered another BOM accident when writing a JS script.
I inserted an external JS into the page, and inside it there was this sentence: $.getJSON("/my/newmsg", function(data){alert(data);}); Other browsers could pop up the content normally, but IE did not. I was depressed for nearly an hour. I suspected that this sentence was written incorrectly, that the JSON data format was wrong, and that I had a problem with my character...
Later, I suspected that the encoding was wrong, so I saw the damn BOM checked. As soon as I removed it, the miracle emerged from under the dark cloud.
Although I am lazy and rarely update my blog, I have to come up and record this incident because it is really unexpected. JS can also cause accidents due to BOM – -|

There is a concept of BOM in the Unicode specification.
BOM is the abbreviation of Byte Order Mark, which is a byte order mark. This thing cannot be seen in an ordinary text editor. Can it be said to be a file header? Can it only be seen in a binary editor? That may be the case.
In UCS encoding, there is a character called "ZERO WIDTH NO-BREAK SPACE", and its encoding is FEFF. FFFE is a character that does not exist in UCS, so it should not appear in actual transmission. The UCS specification recommends that we transmit the character "ZERO WIDTH NO-BREAK SPACE" before transmitting the byte stream. In this way, if the receiver receives FEFF, it means that the byte stream is Big-Endian; if it receives FFFE, it means that the byte stream is Little-Endian. Therefore, the characters "ZERO WIDTH NO-BREAK SPACE" are also called BOM.
UTF-8 does not require BOM to indicate byte order, but can use BOM to indicate encoding. The UTF-8 encoding of the characters "ZERO WIDTH NO-BREAK SPACE" is EF BB BF. So if the receiver receives a byte stream starting with EF BB BF, it knows that it is UTF-8 encoded. Windows uses BOM to mark the encoding of text files.

<<: How to use partitioning to optimize MySQL data processing for billions of data

>>: The process of setting up an environment for integration testing using remote Docker

How to uninstall MySQL 5.7.19 under Linux

Analysis of the causes of accidents caused by Unicode signature BOM

How to uninstall MySQL 5.7.19 under Linux

Deploy Nginx+Flask+Mongo application using Docker

Detailed explanation of Frp forced redirection to https configuration under Nginx

Summary of some common configurations and techniques of Nginx

Install JDK8 in rpm mode on CentOS7

The forgotten button tag

Complete steps to quickly configure HugePages under Linux system

Summary of four situations of joint query between two tables in Mysql

Tips for organizing strings in Linux

Recommend

The space is displayed differently in IE, Firefox, and Chrome browsers

Basic knowledge of load balancing and a simple example of load balancing using nginx

Detailed explanation of the use of base tag in HTML

About the solution record of the page unresponsiveness when using window.print() in React

Complete code for implementing the vue backtop component

Timeline implementation method based on ccs3

Solve the 1251 error when establishing a connection between mysql and navicat

How to create a view in MySQL

Summary of the differences between Vue's watch, computed, and methods

Share 10 of the latest web front-end frameworks (translation)

Method to detect whether ip and port are connectable

How to detect if the current browser is a headless browser with JavaScript

Detailed explanation of the principle of Docker image layering

Detailed explanation of the difference between adaptive and responsive analysis in vernacular

Enable sshd operation in docker