1. Prepare in Advance For your convenience, I create two tables here and add some data to them. A table for fruits, a table for suppliers. Fruits table fruits table f_id | f_name | f_price |
---|
a1 | apple | 5 | a2 | appricot | 2 | b1 | blackberry | 10 | b2 | berry | 8 | c1 | cocount | 9 |
Suppliers table suppliers table s_id | s_name |
---|
101 | Rainbow | 102 | Walmart | 103 | Carrefour | 104 | China Resources Vanguard |
We will use these two tables for demonstration. 2. What is exists The parameter following the exists keyword is an arbitrary subquery. The system operates on the subquery to determine whether it returns rows. If at least one row is returned, the result of exists is true , and the outer query statement will be queried. If the subquery does not return any rows, the result of exists is false , and the outer statement will not be queried. It should be noted that when our subquery is SELECT NULL , MYSQL still considers it True . 

3. What is in When the in keyword is used for a subquery, the inner query statement only returns one data column, and the value of this data column will be provided to the outer query statement for comparison operations. To test the in keyword, I added a s_id column to the fruit table. Fruits table fruits table f_id | f_name | f_price | s_id |
---|
a1 | apple | 5 | 101 | a2 | appricot | 2 | 103 | b1 | blackberry | 10 | 102 | b2 | berry | 8 | 104 | c1 | cocount | 9 | 103 |
| | 
4. exists and in
What is the difference between in and exists? When should we use in and when should we use exists?
Let’s remember the formula first and then talk about the details! " If the outer query table is smaller than the subquery table, use exists. If the outer query table is larger than the subquery table, use in. If the outer and subquery tables are similar, use whichever you prefer. "
I think you have already seen that when the data in the fruits table is very large, it is not suitable to use in, because it will traverse all the data in the fruits table at most once.
For example, if the suppliers table has 10,000 records and the fruits table has 1,000,000 records, then the maximum number of traversals possible is 10,000*1,000,000 times, which is very inefficient.
For example, if the suppliers table has 10,000 records and the fruits table has 100 records, then the maximum possible traversal is 10,000*100 times, which greatly reduces the number of traversals and greatly improves efficiency.
However: the suppliers table has 10,000 records and the fruits table has 100 records, then exists() is still executed 10,000 times. It is better to use in() to traverse 10,000*100 times, because in() traverses in memory, and exists() needs to query the database. We all know that querying the database consumes higher performance, and memory is faster.
Therefore, we only need to remember the formula: "If the outer query table is smaller than the subquery table, use exists; if the outer query table is larger than the subquery table, use in; if the outer and subquery tables are similar, use whichever you like."
5. not exists and not in

Like exists, the id index on suppliers is used, exists() is executed the number of times fruits.length, and the result set of exists() is not cached.

Because not in is essentially equal to != and != ···
, and since != will not use the index, not in will not use the index.
Why doesn’t not in use the index?
We assume that there are 1 million data and s_id has only two values: 0 and 1. To use the index, we need to read the index file first, then perform a binary search to find the corresponding data disk pointer, and then use the corresponding data on the disk based on the read pointer, which affects the result set of 500,000. In this case, it is obvious which is faster than a direct full table scan.
If your s_id field is a unique one, the index will be used.
If you must use an index, you can use force index, but the efficiency will not improve and it will generally be slower.
Use indexes properly. Cardinality is an important indicator. If it is too small, it is the same as not having any indexes at all and it also wastes space.
Therefore, regardless of the size of suppliers and fruits, it is more efficient to use not exists.
This is the end of this article about the detailed explanation of the difference between in and exists in MySQL. For more information about the difference between MySQL in and exists, please search 123WORDPRESS.COM's previous articles or continue to browse the following related articles. I hope everyone will support 123WORDPRESS.COM in the future!
You may also be interested in:- Comparison of the usage of EXISTS and IN in MySQL
- Basic usage of exists, in and any in MySQL
- Introduction to the use and difference between in and exists in MySQL
- Comparative Analysis of IN and Exists in MySQL Statements
- MySQL exists and in detailed explanation and difference
- Summary of the differences between in query and exists query in mySQL
- MYSQL IN and EXISTS optimization examples
- mysql not in, left join, IS NULL, NOT EXISTS efficiency problem record
---|