A brief discussion on the use of GROUP BY and HAVING in SQL statements

A brief discussion on the use of GROUP BY and HAVING in SQL statements

Before introducing the GROUP BY and HAVING clauses, we must first talk about a special function in the SQL language: aggregate functions.

For example, SUM, COUNT, MAX, AVG, etc. The fundamental difference between these functions and other functions is that they generally act on multiple records.

SELECT SUM(population) FROM bbc

The SUM here is applied to the population field of all returned records, resulting in the query returning only one result, the total population of all countries.

Having is the screening condition after grouping (group by), and the grouped data is further screened within the group
where is to filter before grouping

By using the GROUP BY clause, you can have functions such as SUM and COUNT act on data that belongs to a group.

When you specify GROUP BY region, a group of data belonging to the same region will only return one row of values.

That is to say, all fields in the table except region can only return a value after being calculated by aggregate functions such as SUM and COUNT.

The HAVING clause allows us to filter the groups of data.

The WHERE clause filters the records before aggregation. That is to say, it works before the GROUP BY clause and the HAVING clause.

The HAVING clause filters the group records after aggregation.

Let us understand the GROUP BY and HAVING clauses through specific examples, and use the bbc table introduced in Section 3.

SQL Example:

1. Display the total population and total area of ​​each region.

SELECT region, SUM(population), SUM(area)
FROM bbc
GROUP BY region

First, use region to divide the returned records into multiple groups. This is the literal meaning of GROUP BY. After the groups are formed, aggregate functions are then used to operate on different fields in each group (one or more records).

2. Display the total population and total area of ​​each region. Only those regions with an area greater than 1,000,000 are displayed.

SELECT region, SUM(population), SUM(area)

FROM bbc8 F4 w2 v( P- f
GROUP BY region
HAVING SUM(area)>1000000#

Here, we cannot use where to filter regions with more than 1,000,000, because such a record does not exist in the table.
On the contrary, the HAVING clause allows us to filter the groups of data after grouping.

3. Query the number of orders for users in the CUSTOMER and ORDER tables

select c.name, count(order_number) as count from orders o,customer c where c.id=o.customer_id group by customer_id;
+--------+-------+
| name | count |
+--------+-------+
| d | 9 |
| cc | 6 |
| Bodhi seeds | 1 |
| cccccc | 2 |
+--------+-------+

Add HAVING filter

select c.name, count(order_number) as count from orders o,customer c where c.id=o.customer_id group by customer_id having count(order_number)>5;
+------+-------+
| name | count |
+------+-------+
| d | 9 |
| cc | 6 |
+------+-------+

4. I will give some more examples

SQL> select * from sc;

 SNO PNO GRADE
---------- ----- ----------
  1 YW 95
  1 SX 98
  1 YY 90
  2 YW 89
  2 SX 91
  2 YY 92
  3 YW 85
  3 SX 88
  3 YY 96
  4 YW 95
  4 SX 89

 SNO PNO GRADE
---------- ----- ----------
  4 YY 88

This table describes the records of the academic performance of 4 students in each subject, including SNO (student number), PNO (course name), and GRADE (grade).

1. Display the course names and scores of students with scores above 90

//This is a simple query and does not use grouping query

SQL> select sno,pno,grade from sc where grade>=90;
 SNO PNO GRADE
---------- ----- ----------
  1 YW 95
  1 SX 98
  1 YY 90
  2 SX 91
  2 YY 92
  3 YY 96
  4 YW 95

7 rows selected.

2. Display the number of subjects in which each student's score is above 90 points

//Display in groups and count according to the where condition SQL> select sno,count(*) from sc where grade>=90 group by sno;
 SNO COUNT(*)
---------- ----------
  1 3
  twenty two
  4 1
  3 1

3. We did not use the having statement here. Next, if we want to select three good students, the condition is that at least two courses are above 90 points to be eligible. List the eligible student numbers and the number of courses above 90 points.

// Display in groups, count according to the where condition, and filter the groups according to the having clause SQL> select sno,count(*) from sc where grade>=90 group by sno having count(*)>=2;
 SNO COUNT(*)
---------- ----------
  1 3
  twenty two

This result is what we want. It lists the student numbers who are eligible for the "Three Good Students" award. Compared with the previous example, we can see that this is a subquery performed after grouping.

4. The school is selecting advanced students. Students with an average score of more than 90 points are eligible, and the Chinese course score must be above 95 points. Please list the eligible students.

//In fact, this query first extracts the student numbers whose Chinese scores are greater than 95, then calculates the average, and then selects the students whose average scores are greater than 90 according to the having statement after grouping. SQL> select sno,avg(grade) from sc where SNO IN (SELECT SNO FROM SC WHERE GRADE>=95 AND PNO='YW') group by sno having avg(grade)>=90;
 SNO AVG(GRADE)
---------- ----------
  1 94.3333333
  4 90.6666667

5. Query the student ID and average score of students whose average score is at least higher than the average score of student ID 3

//Comparisons and subqueries can be performed in the having clause SQL> select sno,avg(grade) from sc
 group by sno
 having avg(grade) > (select avg(grade) from sc where sno=3);

The above is the full content of this article. I hope it will be helpful for everyone’s study. I also hope that everyone will support 123WORDPRESS.COM.

You may also be interested in:
  • mysql group by having example code
  • Research on the problem of using HAVING statement directly without GROUP BY in MySQL
  • Notes on using group by and having together in MySQL
  • In-depth analysis of the differences between order by, group by, and having in MySQL

<<:  A brief analysis of using coredump technology to trace the cause of process crashes in Linux

>>:  Vue project packaging, merging and compression to optimize web page response speed

Recommend

CSS mimics remote control buttons

Note: This demo is tested in the mini program env...

Detailed explanation of lazy loading and preloading of webpack

Table of contents Normal loading Lazy Loading Pre...

What is ssh? How to use? What are the misunderstandings?

Table of contents Preface What is ssh What is ssh...

Some questions about hyperlinks

<br />I am very happy to participate in this...

Introduction to Jenkins and how to deploy Jenkins with Docker

1. Related concepts 1.1 Jenkins Concepts: Jenkins...

CSS Reset style reset implementation example

Introduction: All browsers come with default styl...

Solve the problem that ifconfig and addr cannot see the IP address in Linux

1. Install the Linux system on the virtual machin...

Usage of MySQL time difference functions TIMESTAMPDIFF and DATEDIFF

Usage of time difference functions TIMESTAMPDIFF ...

What command is better for fuzzy searching files in Linux?

1. Introduction This article mainly explains how ...

Detailed explanation of MySQL table name case-insensitive configuration method

By default, MySQL in Linux distinguishes between ...

Details of function nesting and closures in js

Table of contents 1. Scope 2. Function return val...

Record the whole process of MySQL master-slave configuration based on Linux

mysql master-slave configuration 1. Preparation H...

Detailed explanation of primary keys and transactions in MySQL

Table of contents 1. Comments on MySQL primary ke...

TinyEditor is a simple and easy-to-use HTML WYSIWYG editor

A few days ago, I introduced to you a domestic xh...