PrefaceThere are a lot of information and methods on the Internet about database optimization, but the quality of many of them is uneven, some of the summaries are not thorough and the content is redundant. This article will give you a detailed introduction to 26 optimization suggestions. Let’s take a look at them. 1. When querying SQL, try not to use full search select *, but select + specific fields.Counterexample: select * from student; Positive example: select id,name,age from student; reason:
2. Use prepared statements for database operationsreason:
3. Do not use insert statements without a field list Counterexample: insert into values ('a', 'b', 'c'); Positive example: insert into t(a, b, c) values ('a','b','c'); reason:
4. Try to avoid using or to connect conditions in the where clauseExample: Create a new user table with a common index userId. The table structure is as follows: CREATE TABLE `user` ( `id` int(11) NOT NULL AUTO_INCREMENT, `user_id` int(11) NOT NULL, `age` int(11) NOT NULL, `name` varchar(30) NOT NULL, PRIMARY KEY (`id`), KEY `idx_userId` (`userId`) )ENGINE=InnoDB DEFAULT CHARSET=utf8; Query users whose userid is 1 or whose age is 18 years old Counterexample: select id, user_id, age, name from user where userid=1 or age =18 Positive example: # Using union all select id, user_id, age, name from user where userid=1 union all select * from user where age = 18 # Or write two separate SQL statements: select id, user_id, age, name from user where userid=1; select * from user where age = 18 reason:
5. When using where conditions to query, limit the data to be queried to avoid returning redundant rows and avoid implicit conversion of data types.Assuming id is of type int, query the data with id = 1 Counterexample: select id, name from student where id = '1'; Positive example: select id, name from student where id = 1; reason:
6. Perform expression operations or function conversions on fields in the where clause, which will cause the system to abandon the use of indexes and perform a full table scanAssume that the age field of the user table is indexed and query its data Counterexample: select name, age from user where age - 1 = 20; Positive example: select name, age from user where age = 21; reason:
7. Try to avoid using != or <> operators in the where clause, otherwise the engine will give up using the index and perform a full table scan.(Applicable in MySQL) Counterexample: select age,name from user where age <> 18; Positive example: # You can consider writing two separate SQL statements: select age,name from user where age < 18; select age,name from user where age > 18; reason:
8. For query optimization, you should consider creating indexes on the columns involved in where and order by to avoid full table scans as much as possible.Counterexample: select name, age, address from user where address ='深圳' order by age; Positive example: Add an index and then query alter table user add index idx_address_age (address,age) 9. Consider using default values instead of null in where clausesCounterexample: (This will check all data) select user_id, name, age from user where age is not null; Positive example: # Set the table field age to 0 as the default value instead of null select user_id, name, age from user where age > 0; 1 2 reason:
10. If the query result has only one record or only one record is needed (possibly the maximum/minimum value), it is recommended to use limit 1Suppose there is a student table, and we want to find a person named Tom. CREATE TABLE `student` ( `id` int(11) NOT NULL, `name` varchar(50) DEFAULT NULL, `age` int(11) DEFAULT NULL, `date` datetime DEFAULT NULL, `sex` int(1) DEFAULT NULL, PRIMARY KEY (`id`) )ENGINE=InnoDB DEFAULT CHARSET=utf8mb4; Counterexample: select id,name from student where name='Tom ' Positive Example select id,name from employee where name='Tom ' limit 1; reason: After adding limit 1 paging, once a corresponding record is found, the scan will not continue, and the efficiency will be greatly improved. 11. Optimize limit paging statementsWhen we do paging in daily life, we usually use limit to achieve it, but when the offset is particularly large, the query efficiency becomes low. Counterexample: select id, name, age from student limit 10000,10 Positive example: # Solution 1: Return the maximum record (offset) of the last query select id, name from student where id > 10000 limit 10; # Solution 2: order by + index select id, name from student order by id limit 10000,10; # Option 3: Limit the number of pages if the business allows: reason:
12. Try to avoid returning too much data to the client and use limit pagingAssume that the business requirement is that the user requests to view the movie data that he has watched in the past year. Counterexample: # Query all data at once select * from LivingInfo where watchId = useId and watchTime >= Date_sub(now(),Interval 1 Y) Positive example: # Pagination query select * from LivingInfo where watchId = useId and watchTime>= Date_sub(now(),Interval 1 Y) limit offset, pageSize # If it is front-end paging, you can query the first 200 records first, because the average user should not scroll down too many pages select * from LivingInfo where watchId = useId and watchTime>= Date_sub(now(),Interval 1 Y) limit 200 ; 13. Optimize like statementsWhen using fuzzy keyword search using LIKE, LIKE is likely to make the index invalid. Counterexample: SELECT * FROM student WHERE name LIKE '%strive_day'; -- Or use % to wrap SELECT * FROM student WHERE name LIKE '%strive_day%'; Positive example: SELECT * FROM student WHERE name LIKE 'strive_day%'; reason:
14. Try to avoid using MySQL built-in functions on index columnsExample: Query users who have logged in in the last seven days (assuming the loginTime field is indexed) Counterexample: SELECT * FROM system_user user WHERE Date_ADD(user.loginTime,Interval 7 DAY) >= now(); Positive example: SELECT * FROM system_user user WHERE user.loginTime >= Date_ADD(NOW(),INTERVAL - 7 DAY); reason:
15. When using a joint index, pay attention to the order of the index columns, generally follow the leftmost matching principleSuppose there is a joint index (user_id, age), with user_id in the front and age in the back. Counterexample: select user_id, name, age from user where age = 10; Positive example: # In accordance with the leftmost matching principle select user_id, name, age from user where userid = 1 and age = 21; # In accordance with the leftmost matching principle select user_id, name, age from user where userid = 1; reason:
16. Use covering indexes when appropriate.Covering indexes can make your SQL statements not need to return to the table. You can get all the required data by simply accessing the index, which greatly improves query efficiency. Counterexample: # Like fuzzy query, no index selection user_id, name, age from user where user_id like '%123%' # If id is the primary key, then it is a normal index, that is, a covering index. select user_id, name, age from user where userid like '%123%'; 17. Remove redundant and duplicate indexesCounterexample: KEY `idx_userId` (`userId`) KEY `idx_userId_age` (`userId`,`age`) Positive example: KEY `idx_userId_age` (`userId`,`age`) # Delete the index of userId (KEY `idx_userId_age` (`userId`,`age`)) # Because the combined index (A, B) is equivalent to creating both the (A) and (A, B) indexes. reason:
18. Inner join, left join, right join, Inner join is preferred. If it is left join, the result of the left table should be as small as possible.Inner join: When two tables are joined for query, only the result sets that fully match the two tables are retained. When left join performs a join query on two tables, all rows in the left table will be returned, even if there are no matching records in the right table. When right join performs a join query on two tables, all rows in the right table will be returned, even if there are no matching records in the left table. Under the premise that SQL requirements are met, Inner join is preferred. If left join is used, the data results of the left table should be as small as possible, and if conditions permit, they should be placed on the left for processing. Counterexample: select name, age from tab1 t1 left join tab2 t2 on t1.age = t2.age where t1.id = 2; Positive example: select name, age from (select * from tab1 where id = 2) t1 left join tab2 t2 on t1.age = t2.age; reason:
19. If you insert too much data, consider batch insertingCounterexample: for(User u :list) { INSERT into user(name,age) values(name, age)} Positive example: //Insert 500 values at a time, insert into user(name,age) values in batches <foreach collection="list" item="item" index="index" separator=","> (#{item.name},#{item.age}) </foreach> reason:
20. Use the distinct keyword as little as possibleThe distinct keyword is generally used to filter duplicate records to return non-duplicate records. When used when querying one field or a few fields, it brings optimization effects to the query. However, when there are many fields, it will greatly reduce the query efficiency. Counterexample: # Remove duplicate fields SELECT DISTINCT * from user; Positive example: select DISTINCT name from user; reason:
21. Do not have more than 5 table joinsreason:
22. How to optimize update statements when the amount of data is large.When the amount of data is large, it is necessary to avoid modifying or deleting too much data at the same time, which will cause excessive CPU utilization and affect other people's access to the database. Counterexample: # Delete 100,000 or 1 million+ records at a time delete from user where id < 1000000; # Or use a single loop operation, which is inefficient and time-consuming for (User user: list) {delete from user;} Positive example: # Delete in batches, such as 500 each time delete user where id < 500 delete user where id >= 500 and id < 1000; ... delete user where id >= 999500 and id < 1000000; reason:
23. Use exist and in appropriatelyAssume that table A represents the employee table of a company, and table B represents the department table. The SQL statement to query all employees in all departments is: Counterexample: select * from A where deptId in (select deptId from B); This is equivalent to writing:
It can be abstracted into a loop statement like this: List<> resultSet ; for(int i = 0; i < B.length; i ++) { for(int j = 0; j < A.length; j ++) { if(A[i].id == B[j].id) { resultSet.add(A[i]); break; } } } We can also use exists to achieve the same query function select * from A where exists (select 1 from B where A.deptId = B.deptId);
select * from A, loop from table A first Because the understanding of the exists query is to execute the main query first, obtain the data, and then put it into the subquery for condition verification. Based on the verification result (true or false), it is determined whether the data result of the main query can be retained. Similarly, it can be abstracted into such a cycle: List<> resultSet; for(int i = 0; i < A.length; i ++) { for(int j = 0; j < B.length; j ++) { if(A[i].deptId == B[j].deptId) { resultSet.add(A[i]); break; } } } reason:
24. Use numeric fields as much as possible. If the field only contains numerical information, try not to design it as character type.Counterexample: `king_id` varchar(20) NOT NULL COMMENT '123' Positive example: `king_id` int(11) NOT NULL COMMENT '123' reason:
25. Try to replace union with union allIf there are no duplicate records in the search results, it is recommended to replace union with union all. Counterexample: select * from user where userid = 1 union select * from user where age = 20 Positive example: select * from user where userid = 1 union all select * from user where age = 20 reason:
26. If the field type is a string, be sure to enclose it in quotation marks when using where, otherwise the index will be invalid.Counterexample: select * from user where userid = 1; Positive example: select * from user where userid ='1'; reason: The first statement will not use the index if it is not enclosed in single quotes. This is because without single quotes, the string is compared with the number, and their types do not match. MySQL will perform implicit type conversion and convert them to floating-point numbers for comparison. SummarizeThis is the end of this article about MySQL optimization and how to write high-quality SQL statements. For more relevant MySQL optimization SQL statement content, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope everyone will support 123WORDPRESS.COM in the future! You may also be interested in:
|
<<: 9 ways to show and hide CSS elements
>>: Several methods of deploying multiple front-end projects with nginx
What is a descending index? You may be familiar w...
The specific code for sending emoticons in the vu...
background: Sometimes we need to display json dat...
Sysbench is an excellent benchmark tool that can ...
Table of contents Overview What is lazy loading? ...
FTP and SFTP are widely used as file transfer pro...
How to use css variables in JS Use the :export ke...
1. Preliminary preparation (windows7+mysql-8.0.18...
Docker-compose deployment configuration jenkins 1...
Table of contents Semaphore Nginx hot deployment ...
This section provides an overview of some other i...
selinux ( Security-Enhanced Linux) is a Linux ker...
I have summarized 3 methods to deploy multiple fr...
This article uses examples to illustrate the pitf...
This article example shares the specific code of ...