This article uses examples to explain the concept and usage of MySQL index cardinality. Share with you for your reference, the details are as follows: Cardinality (index cardinality) is a very important concept in MySQL index Index cardinality is the number of distinct values a column contains. For example, if a data column contains the values 1, 2, 3, 4, 5, 1, then its cardinality is 5. An index works best when its cardinality is high relative to the number of rows in the table (that is, the column contains many distinct values and few duplicate values). If a column contains many different ages, the index can quickly distinguish the rows. If a column is used to record gender (with only two values, "M" and "F"), then an index would not be very useful. If the values appear with almost equal probability, then searching for either value will yield half the rows. In these cases, it is best not to use an index at all, because when the query optimizer finds that a certain value appears in a high percentage of the data rows in the table, it will generally ignore the index and perform a full table scan. The commonly used percentage cutoff is "30%" Another concept is called index selectivity Index selectivity = index cardinality/total data. The cardinality can be viewed through "show index from table name". Here is a table of my own. The amount of data is not high, just to test the index selectivity select * from articles id Title Name15 Title 0 Big Bear16 Title 1 Big Bear17 Title 2 Big Bear18 Title 3 Big Bear19 Title 4 Big Bear20 Title 5 Big Bear21 Title 6 Big Bear22 Title 7 Big Bear23 Title 8 Two Bears24 Title 9 Two Bears id is the default integer auto-increment primary key Now add the author column as an index and view all indexes of the articles table ALTER TABLE `articles` ADD INDEX (`author`) SHOW INDEX FROM articles table non_unique key_name seq_in_index column_name collation cardinality sub_part packed null index_type articles 0 PRIMARY 1 id A 10 NULL NULL BTREE articles 1 author 1 author A 2 NULL NULL BTREE Explain the meaning of each field
As you can see, the articles table already has two indexes. The id index does not contain duplicate words. The primary key column name is id. The index cardinality is 10 in ascending order. There is no partial index. There is no compression. There is no null. The storage method is btree. The author index contains repeated words. The index name is author. The column name is author. The index is sorted in ascending order. The index cardinality is 2. There is no partial index. There is no compression. There are no nulls. The storage method is btree. According to the index selectivity algorithm, the id index selectivity is 10/10 = 1 and the author index selectivity is 2/10 = 0.2. Let's test it and use explain to see the statement analysis. explain select * from articles where id = 15 id select_type table partitions type possible_keys key key_len ref rows filtered extra 1 SIMPLE articles NULL const PRIMARY PRIMARY 4 const 1 100.00 NULL Let me talk about the explain command again. EXPLAIN shows how MySQL uses indexes to process select statements and join tables. It can help choose better indexes and write more optimized query statements. Explain the meaning of each field 1. id SELECT identifier. This is the SELECT query sequence number. This is not important. The query sequence number is the order in which the SQL statements are executed. 2. select_type select type 2.1. SIMPLE When performing a simple select query that does not require a Union operation or does not contain a subquery, the select_type of the response query statement is simple. No matter how complex the query statement is, there must be only one unit query with select_type simple in the execution plan. 3. table table name 4. Type connection type, there are multiple parameters, first from the best type to the worst type is also the focus of this article 4.1 const , the table has at most one matching row, const is used to compare primary keys or unique indexes. Because only one row of data is matched, it is very fast, which can also be understood as the most optimized index and constant search. 5. possible_keys hints which index will be used to find the row in the table, which is not very important 6. keys indicates the index used by MYSQL query 7. key_len MYSQL index length 8. ref shows which column or constant is used together with key to select rows from the table 9. rows shows the number of rows that MYSQL executes the query. The larger the value, the worse it is, indicating that the index is not used well. 10. Extra This column contains detailed information about how MySQL solves the query. You can see that the id query uses the id index. The query type is the optimal constant query. Then let's try another query, this time using the author index. explain select * from articles where author = "Big Bear" 1 SIMPLE articles NULL ALL author NULL NULL NULL 10 80.00 Using where It can be clearly seen that the author index is used for a simple query. The query type is the worst full table scan. Let's not rush to explain it. Let's use the same statement instead. explain select * from articles where author = "二熊" 1 SIMPLE articles NULL ref author author 1022 const 2 100.00 NULL You can see that the query type this time is ref That is to say, because the number of data rows written by the author Daxiong exceeds 30% of the total data, MySQL believes that full table scanning is faster than using indexes. This is the meaning of the concepts of index cardinality and index selectivity. Therefore, when creating an index, you should pay attention to creating the index on a column with a high index cardinality. Readers who are interested in more MySQL-related content can check out the following topics on this site: "Summary of MySQL Index Operation Skills", "Summary of MySQL Common Functions", "Summary of MySQL Log Operation Skills", "Summary of MySQL Transaction Operation Skills", "Summary of MySQL Stored Procedure Skills" and "Summary of MySQL Database Lock-Related Skills". I hope this article will be helpful to everyone's MySQL database design. You may also be interested in:
|
<<: Why TypeScript's Enum is problematic
>>: Use Shell scripts to batch start and stop Docker services
Table of contents Mixins implementation Hook func...
If you don't have a Linux system, please refe...
Table of contents 1. Introduction: In this case, ...
Phenomenon: Change the div into a circle, ellipse...
This article shares the specific code of JS to im...
Ping www.baidu.com unknown domain name Modify the...
Today a junior student asked a question. The HTML...
It is no exaggeration to say that hyperlinks conne...
This article mainly focuses on the installation a...
HTML forms are used to collect different types of...
Preface The origin is a question 1: If your umask...
This article uses an example to illustrate the pa...
This article mainly introduces the Vue project. O...
There is only one solution, that is to change the...
dl:Definition list Definition List dt:Definition t...