Analysis of the causes of accidents caused by Unicode signature BOM

Analysis of the causes of accidents caused by Unicode signature BOM
Maybe you are using include files here, which is usually done for headers and footers. When I opened the included file, I found that the item "Include Unicode Signature BOM" in the page properties was checked. Then I tell you that the accident was caused by this BOM.

unicode-bom

Today, I encountered another BOM accident when writing a JS script.
I inserted an external JS into the page, and inside it there was this sentence: $.getJSON("/my/newmsg", function(data){alert(data);}); Other browsers could pop up the content normally, but IE did not. I was depressed for nearly an hour. I suspected that this sentence was written incorrectly, that the JSON data format was wrong, and that I had a problem with my character...
Later, I suspected that the encoding was wrong, so I saw the damn BOM checked. As soon as I removed it, the miracle emerged from under the dark cloud.
Although I am lazy and rarely update my blog, I have to come up and record this incident because it is really unexpected. JS can also cause accidents due to BOM – -|

There is a concept of BOM in the Unicode specification.
BOM is the abbreviation of Byte Order Mark, which is a byte order mark. This thing cannot be seen in an ordinary text editor. Can it be said to be a file header? Can it only be seen in a binary editor? That may be the case.
In UCS encoding, there is a character called "ZERO WIDTH NO-BREAK SPACE", and its encoding is FEFF. FFFE is a character that does not exist in UCS, so it should not appear in actual transmission. The UCS specification recommends that we transmit the character "ZERO WIDTH NO-BREAK SPACE" before transmitting the byte stream. In this way, if the receiver receives FEFF, it means that the byte stream is Big-Endian; if it receives FFFE, it means that the byte stream is Little-Endian. Therefore, the characters "ZERO WIDTH NO-BREAK SPACE" are also called BOM.
UTF-8 does not require BOM to indicate byte order, but can use BOM to indicate encoding. The UTF-8 encoding of the characters "ZERO WIDTH NO-BREAK SPACE" is EF BB BF. So if the receiver receives a byte stream starting with EF BB BF, it knows that it is UTF-8 encoded. Windows uses BOM to mark the encoding of text files.

<<:  How to use partitioning to optimize MySQL data processing for billions of data

>>:  The process of setting up an environment for integration testing using remote Docker

Recommend

A brief discussion on using virtual lists to optimize tables in el-table

Table of contents Preface Solution Specific imple...

Detailed installation and configuration tutorial of mysql5.7 on CentOS

Install Make sure your user has permission to ins...

Javascript operation mechanism Event Loop

Table of contents 1. Four concepts 1. JavaScript ...

How to quickly build your own server detailed tutorial (Java environment)

1. Purchase of Server 1. I chose Alibaba Cloud...

In-depth explanation of MySQL isolation level and locking mechanism

Table of contents Brief description: 1. Four char...

How to understand the difference between ref toRef and toRefs in Vue3

Table of contents 1. Basics 1.ref 2. toRef 3. toR...

Solution to the MySQL server has gone away error

MySQL server has gone away issue in PHP 1. Backgr...

In-depth analysis of HTML semantics and its related front-end frameworks

About semantics Semantics is the study of the rel...

How to use environment variables in nginx configuration file

Preface Nginx is an HTTP server designed for perf...

Solution to the problem of English letters not wrapping in Firefox

The layout of text has some formatting requiremen...

Implementation of Nginx domain name forwarding

Introduction to Nginx Nginx ("engine x"...

Detailed explanation of vue page state persistence

Table of contents Code: Replenish: Summarize Requ...

Explanation of MySQL index types Normal, Unique and Full Text

MySQL's index types include normal index, uni...

MySQL Index Detailed Explanation

Table of contents 1. Index Basics 1.1 Introductio...