When using SQL to extract data, we often encounter duplicate values in the table. For example, if we want to get UV (unique visitors), we need to deduplicate. In MySQL, For example, there is a table task like this: Remark:
We need to find the total number of tasks. Since task_id is not unique, we need to remove duplicates: distinct -- List all unique values of task_id (after deduplication) select distinct task_id from Task; --Total number of tasks select count(distinct task_id) task_num from Task;
group by -- List all unique values of task_id (after deduplication, null is also a value) -- select task_id -- from Task -- group by task_id; --Total number of tasks select count(task_id) task_num from (select task_id from Task group by task_id) tmp; row_number row_number is a window function with the following syntax: -- Use select count(case when rn=1 then task_id else null end) task_num in SQL that supports window functions from (select task_id , row_number() over (partition by task_id order by start_time) rn from Task) tmp; In addition, let's use a table test to explain the use of distinct and group by in deduplication: -- The semicolon below is used to separate rows select distinct user_id from Test; -- returns 1; 2 select distinct user_id, user_type from Test; -- returns 1, 1; 1, 2; 2, 1 select user_id from Test group by user_id; -- returns 1; 2 select user_id, user_type from Test group by user_id, user_type; -- returns 1, 1; 1, 2; 2, 1 select user_id, user_type from Test group by user_id; -- Hive, Oracle, etc. will report an error, but MySQL can be written like this. -- Returns 1, 1 or 1, 2; 2, 1 (two rows in total). Only the fields after group by will be deduplicated, which means the number of records returned at the end is equal to the number of records in the previous SQL statement, that is, 2 records. For fields that are not placed after group by but are placed in select, only one record will be returned (usually the first one, but there should be no pattern). This is the end of this article on the summary of SQL deduplication methods. For more relevant SQL deduplication methods, please search for previous articles on 123WORDPRESS.COM or continue to browse the following related articles. I hope everyone will support 123WORDPRESS.COM in the future! You may also be interested in:
|
<<: Share 10 of the latest web front-end frameworks (translation)
>>: Pure CSS to achieve hover image pop-out pop-up effect example code
1. mpstat command 1.1 Command Format mpstat [ -A ...
When you use HTML div blocks and the middle of th...
introduction Xiao A was writing code, and DBA Xia...
This article example shares the specific code of ...
Table of contents Problem description: Solution 1...
Use of AES encryption Data transmission encryptio...
First, open the virtual machine Open xshell5 to c...
Table of contents 1 Install Docker in Baota Softw...
I'm building Nginx recently, but I can't ...
For example, if I have a Jenkins server in my int...
Preface I am used to writing less/sass, but now I...
Adding the extra_hosts keyword in docker-compose....
Basic three-column layout .container{ display: fl...
<br />Although there are many web page creat...
Today I have a question about configuring MySQL d...