Recently, there have been many database-related operations involved, and the company's existing specifications are not very comprehensive. Based on the relevant specifications of various experts on the Internet, I have compiled some standard usage for my own use. I hope you can correct me. Database environment dev: Development environment Developers can read, write, and modify the table structure. Developers can modify the table structure and the data in it at will, but they need to ensure that it does not affect other development colleagues. test: test environment Development can read and write, and developers can modify the table structure through tools. online: online environment Developers are not allowed to perform database operations directly in the online environment. If operations are required, they must find a DBA to perform the operations and make corresponding records. Stress testing is prohibited. The key issue is that the user permissions corresponding to the MySQL servers in each environment must be clearly divided and recognizable, and able to specifically distinguish business scenarios. Naming conventions Basic naming rules
Why are all libraries, tables, and fields in lowercase? In MySQL, databases and tables correspond to directories and files within those directories. Therefore, the sensitivity of the operating system determines the case sensitivity of database and table names.
Table naming Tables in the same module should use the same prefix as much as possible, and the table names should be as meaningful as possible. All log tables begin with log_ Field naming
Index naming
Constraint Naming
Table design specifications The table engine depends on the actual application scenario. MyISAM is recommended for log and report tables, and InnoDB is recommended for tables related to transactions, audits, and amounts. Unless otherwise specified, the innodb engine is used when creating tables. The default character set is utf8mb4, and the database collation rule is utf8mb4_general_ci. (Since the database definition uses the default, the data table can be redefined, but for the sake of insurance, it is recommended to write Why does the character set not select utf8 and the collation not use utf8_general_ci? MySQL using utf8 encoding cannot save Emoji expressions that use 4 bytes as placeholders. In order to make the backend project fully support Emoji expressions input by the client, upgrading the encoding to utf8mb4 is the best solution. If the characterEncoding of the JDBC connection string is set to utf8 or the emoji data cannot be inserted normally after the above configuration, you need to specify the character set of the connection as utf8mb4 in the code. All tables and fields should use the comment column attribute to describe the true meaning of the table or field. If it is an enumeration value, it is recommended to define the contents used in the field. Unless otherwise specified, the first id field in the table must be the primary key and automatically increment. It is prohibited to use it as a context or condition for data transfer outside of a transaction. Do not use varchar as the primary key statement design. Unless otherwise specified, the table must contain the create_time and modify_time fields, that is, the table must contain fields that record the creation time and modification time. Unless otherwise specified, the table must contain is_del to indicate whether the data has been deleted. In principle, physical deletion of database data is not allowed.
What is the difference between datetime and timestamp? Similarities: The display format of a TIMESTAMP column is the same as that of a DATETIME column. The display width is fixed at 19 characters and the format is YYYY-MM-DD HH:MM:SS. Differences: TIMESTAMP
How to use the automatic assignment property of TIMESTAMP? Use the current time as the default value for ts: ts TIMESTAMP DEFAULT CURRENT_TIMESTAMP. When a row is updated, update the value of ts: ts TIMESTAMP DEFAULT 0 ON UPDATE CURRENT_TIMESTAMP. You can combine 1 and 2: ts TIMESTAMP DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP. How to use INT UNSIGNED to store ip? Use INT UNSIGNED instead of char(15) to store IPv4 addresses, and convert them using the MySQL functions inet_ntoa and inet_aton. There is currently no conversion function for IPv6 addresses, and they need to be stored using DECIMAL or two bigINTs.
INT[M], what does the M value represent? Note that the number after the brackets of the numeric type only indicates the width and has nothing to do with the storage range. Many people think that the value ranges of INT(4) and INT(10) are (-9999 to 9999) and (-9999999999 to 99999999999 respectively. This understanding is wrong. In fact, when the M value in the integer type is used in combination with the ZEROFILL attribute, the column values can be made equal in width. Regardless of the value of M in INT[M], its value range is (-2147483648 to 2147483647 when signed), (0 to 4294967295 when unsigned). The display width does not restrict the range of values that can be held in the column, nor does it restrict the display of values that exceed the specified width of the column. When used in conjunction with the optional extended attribute ZEROFILL, by default supplementary spaces are replaced by zeros. For example, for a column declared as INT(5) ZEROFILL, the value 4 is retrieved as 00004. Note that if you store a value that exceeds the display width in an integer column, you may encounter problems when MySQL generates temporary tables for complex joins, because in these cases MySQL believes that the data will fit in the original column width. If you specify ZEROFILL for a numeric column, MySQL automatically adds the UNSIGNED attribute to the column. Using VARBINARY to store case-sensitive variable-length strings When to use CHAR and when to use VARCHAR? The CHAR and VARCHAR types are similar, but they are stored and retrieved differently. They also differ in terms of maximum length and whether trailing spaces are preserved. The length declared for CHAR and VARCHAR types indicates the maximum number of characters you want to store. For example, CHAR(30) can occupy 30 characters. The length of a CHAR column is fixed to the length declared when the table is created. The length can be any value from 0 to 255. When CHAR values are stored, they are padded on the right with spaces to the specified length. When CHAR values are retrieved, trailing spaces are removed. No case conversion is performed during storage or retrieval. The values in a VARCHAR column are variable-length strings. The length can be specified as a value from 0 to 65,535. (The maximum effective length of a VARCHAR is determined by the maximum row size and the character set used. The overall maximum length is 65,532 bytes). In contrast to CHAR, VARCHAR values are stored with only the required number of characters, plus an additional byte to record the length (two bytes are used if the column's declared length exceeds 255). VARCHAR values are stored without padding. Trailing spaces are preserved when values are saved and retrieved, in compliance with standard SQL. char is suitable for storing the MD5 hash of the user's password, and its length is always the same. For values that change frequently, char is also better than varchar because fixed-length rows are less likely to be fragmented. For very short columns, char is also more efficient than varchar. A char(1) string takes up only one byte for a single-byte character set, but a varchar(1) string takes up two bytes because one byte is used to store the length information. Index design specifications MySQL query speed depends on good index design, so indexes are critical to high performance. A reasonable index will speed up the query (including the speed of UPDATE and DELETE. MySQL will load the page containing the row into memory and then perform the UPDATE or DELETE operation). An unreasonable index will slow down the query. MySQL index search is similar to the pinyin and radical search of the Xinhua Dictionary. When the pinyin and radical indexes do not exist, they can only be searched by turning the pages one by one. When a MySQL query cannot use an index, MySQL will perform a full table scan, which will consume a lot of IO. The purpose of index: deduplication, accelerated positioning, avoidance of sorting, covering index. What is a covering index? In the InnoDB storage engine, the secondary index (non-primary key index) does not directly store the row address, but stores the primary key value. If the user needs to query a data column that is not included in the secondary index, the user needs to first find the primary key value through the secondary index, and then query other data columns through the primary key, so the query needs to be performed twice. The concept of a covering index is that the query can be completed in one index. The efficiency of a covering index will be higher, and the primary key query is a natural covering index. Reasonable creation of indexes and reasonable use of query statements can improve performance when covering indexes are used. For example, SELECT email,uid FROM user_email WHERE uid=xx. If uid is not the primary key, you can add an index as index(uid,email) to improve performance. Basic index specifications
Why can't there be too many indexes in a table? InnoDB's secondary index uses b+tree for storage, so b+tree needs to be adjusted during UPDATE, DELETE, and INSERT. Too many indexes will slow down the update process. Use prefix index for strings. The length of prefix index should not exceed 8 characters. It is recommended to give priority to prefix index. If necessary, add pseudo column and create index. Do not index blob/text fields, and do not index large fields, as this will cause the index to take up too much storage space. What is a prefix index? To put it simply, prefix indexing is to create an index for the first few characters of the text (the specific number of characters is specified when creating the index). This creates a smaller index, so queries are faster. Prefix index can effectively reduce the size of index files and improve indexing speed. But prefix indexes also have their disadvantages: MySQL cannot use prefix indexes in ORDER BY or GROUP BY, nor can they be used as covering indexes. The syntax for creating a prefix index is: ALTER TABLE table_name ADD KEY(column_name(prefix_length)); Primary key criteria
Important SQL must be indexed, and core SQL should be given priority to covering indexes.
The most distinguishing field is placed first
Index Taboos
Limitations of indexes in MYSQL The total length of the MYISAM storage engine index cannot exceed 1000 bytes Statement design specifications Using prepared statements
Avoid implicit conversions This will cause the index to fail Take advantage of prefix indexes
Do not use negative queries, such as not in/like
Avoid using stored procedures, triggers, UDFs, events, etc.
Avoid JOINs with large tables MySQL is best at single-table primary key/secondary index queries Avoid doing math in the database
Reduce the number of interactions with the database
Use paging appropriately Limit the number of pages displayed in pagination. Only the previous page and the next page can be clicked. Delayed association is used. How to use paging correctly? Suppose there is a paging statement like the following: SELECT * FROM table ORDER BY id LIMIT 10000, 10. The way MySQL handles LIMIT OFFSET is to retrieve all the data of OFFSET+LIMIT, then remove OFFSET and return the bottom LIMIT. Therefore, when the OFFSET value is large, MySQL's query performance will be very low. This can be solved by using id > n: The method of using id > n has limitations. The problem of discontinuous ids can be solved by passing in the last id at the same time when turning the page. http://example.com/page.php?last=100 select * from table where id<100 order by id desc limit 10 //Previous page http://example.com/page.php?first=110 select * from table where id>110 order by id desc limit 10 The biggest disadvantage of this method is that if there is an insert/delete operation during browsing, the page will not be updated, and the total number of pages may still be calculated based on the new count(*), which may eventually result in some records being inaccessible. To fix this problem, you can continue to introduce the current page number and whether there have been operations such as insertion/deletion that affect the total number of records since the last page turn and cache them. select * from table where id >= (select id from table order by id limit #offset#, 1)
The EXPLAIN statement (executed in the MySQL client) can obtain information about how MySQL executes SELECT statements. By executing EXPLAIN on the SELECT statement, you can know whether MySQL uses indexes, full table scans, temporary tables, sorting, and other information when executing the SELECT statement. Try to avoid MySQL performing full table scans, using temporary tables, sorting, etc. See the official documentation for details. Use union all instead of union What is the difference between union all and union? The union and union all keywords both merge two result sets into one, but the two are different in terms of usage and efficiency. After the union table is linked, the duplicate records will be filtered out, so after the table is linked, the resulting result set will be sorted, the duplicate records will be deleted, and then the result will be returned. like: select * from test_union1 union select * from test_union2 When this SQL is run, it first retrieves the results of the two tables, then uses the sorting space to sort and delete duplicate records, and finally returns the result set. If the amount of data in the table is large, it may result in disk sorting. Union all simply merges the two results and returns them. In this way, if there is duplicate data in the two returned result sets, the returned result set will contain the duplicate data. In terms of efficiency, union all is much faster than union, so if you can confirm that the two merged result sets do not contain duplicate data, then use union all, as follows: select * from test_union1 union all select * from test_union2
Table Specifications If the data volume of a single table exceeds 5 million or the data capacity exceeds 10G within one to two years, consider splitting the table. It is necessary to consider historical data migration or application self-deletion of historical data in advance. You can use equal and balanced splitting or splitting according to business rules. The data tables to be split must discuss the splitting strategy with the DBA
Code of Conduct
Other specifications It is not recommended to store log data on MySQL. Hbase or OceanBase should be given priority. If storage is required, please ask the DBA to evaluate the use of compressed tables for storage. The above is the details of the super detailed MySQL usage specifications. For more information about MySQL usage specifications, please pay attention to other related articles on 123WORDPRESS.COM! You may also be interested in:
|
<<: OpenLayers realizes the method of aggregate display of point feature layers
>>: Use of Linux xargs command
Preface After this blog post was published, some ...
Servermanager startup connection database error R...
Table of contents 1. Original value and reference...
What is routing? Routing refers to the activity o...
Vue $http get and post request cross-domain probl...
Preface MRR is the abbreviation of Multi-Range Re...
Preface A classmate is investigating MLSQL Stack&...
1. Command Introduction The cal (calendar) comman...
Table of contents MySQL's current_timestamp p...
margin:auto; + position: absolute; up, down, left...
Today I wanted to change the mysql port, but I fo...
Windows: Support NTFS, FAT Linux supports file fo...
This article example shares the specific code of ...
introduction Most people who have used databases ...
1. I purchased a VPS and CentOS system, and found...