What is grouped data in maths

A group of \(\text{10}\) learners count the number of playing cards they each have. This is a histogram describing the data they collected:

What is grouped data in maths

Count the number of playing cards in the following range: \(\text{0} \le \text{number of playing cards} \le \text{2}\)

From the graph the answer is: 1

From the histogram, we arrive at our answer by reading the height of the specified interval from the histogram.

What is grouped data in maths

A group of \(\text{20}\) learners count the number of playing cards they each have. This is the data they collect:

\[\begin{array}{c c c c c} 14 & 9 & 11 & 8 & 13 \\ 2 & 3 & 4 & 16 & 17 \\ 9 & 19 & 10 & 14 & 4 \\ 16 & 16 & 11 & 2 & 17 \end{array}\]

Count the number of learners who have from \(\text{12}\) up to \(\text{15}\) playing cards. In other words, how many learners have playing cards in the following range: \(\text{12} \le \text{number of playing cards} \le \text{15}\)? It may be helpful for you to draw a histogram in order to answer the question.

Firstly we sort the table into sequential order, starting with the smallest value.

\[\begin{array}{c c c c c} 2 & 2 & 3 & 4 & 4 \\ 8 & 9 & 9 & 10 & 11 \\ 11 & 13 & 14 & 14 & 16 \\ 16 & 16 & 17 & 17 & 19 \end{array}\]

Secondly, we draw a histogram of the data:

What is grouped data in maths

From the histogram you can see that the number of learners with playing cards in the range: \(\text{12} \le \text{number of playing cards} \le \text{15}\) is 3.

To find the Mode, or modal value, Alex places the numbers in value order then counts how many of each number. The Mode is the number which appears most often (there can be more than one mode):

53, 55, 56, 56, 58, 58, 59, 59, 60, 61, 61, 62, 62, 62, 64, 65, 65, 67, 68, 68, 70

62 appears three times, more often than the other values, so Mode = 62

Grouped Frequency Table

Alex then makes a Grouped Frequency Table:

SecondsFrequency51 - 55256 - 60761 - 65866 - 704

What is grouped data in maths

So 2 runners took between 51 and 55 seconds, 7 took between 56 and 60 seconds, etc

Oh No!

What is grouped data in maths

Suddenly all the original data gets lost (naughty pup!)


Only the Grouped Frequency Table survived ...

... can we help Alex calculate the Mean, Median and Mode from just that table?

The answer is ... no we can't. Not accurately anyway. But, we can make estimates.

Estimating the Mean from Grouped Data

So all we have left is:

SecondsFrequency51 - 55256 - 60761 - 65866 - 704


The groups (51-55, 56-60, etc), also called class intervals, are of width 5

The midpoints are in the middle of each class: 53, 58, 63 and 68

What is grouped data in maths

We can estimate the Mean by using the midpoints.

So, how does this work?

Think about the 7 runners in the group 56 - 60: all we know is that they ran somewhere between 56 and 60 seconds:

  • Maybe all seven of them did 56 seconds,
  • Maybe all seven of them did 60 seconds,
  • But it is more likely that there is a spread of numbers: some at 56, some at 57, etc

So we take an average and assume that all seven of them took 58 seconds.

Let's now make the table using midpoints:

MidpointFrequency532587638684

What is grouped data in maths

Our thinking is: "2 people took 53 sec, 7 people took 58 sec, 8 people took 63 sec and 4 took 68 sec". In other words we imagine the data looks like this:

53, 53, 58, 58, 58, 58, 58, 58, 58, 63, 63, 63, 63, 63, 63, 63, 63, 68, 68, 68, 68

Then we add them all up and divide by 21. The quick way to do it is to multiply each midpoint by each frequency:

Midpoint
xFrequency
fMidpoint × Frequency
fx532106587406638504684272Totals:211288

And then our estimate of the mean time to complete the race is:

Estimated Mean =  128821  = 61.333...

Very close to the exact answer we got earlier.

Estimating the Median from Grouped Data

Let's look at our data again:

SecondsFrequency51 - 55256 - 60761 - 65866 - 704

What is grouped data in maths

The median is the middle value, which in our case is the 11th one, which is in the 61 - 65 group:

We can say "the median group is 61 - 65"

But if we want an estimated Median value we need to look more closely at the 61 - 65 group.

We call it "61 - 65", but it really includes values from 60.5 up to (but not including) 65.5.

Why? Well, the values are in whole seconds, so a real time of 60.5 is measured as 61. Likewise 65.4 is measured as 65.

At 60.5 we already have 9 runners, and by the next boundary at 65.5 we have 17 runners. By drawing a straight line in between we can pick out where the median frequency of n/2 runners is:

What is grouped data in maths

And this handy formula does the calculation:

Estimated Median = L +  (n/2) − BG × w

where:

  • L is the lower class boundary of the group containing the median
  • n is the total number of values
  • B is the cumulative frequency of the groups before the median group
  • G is the frequency of the median group
  • w is the group width

For our example:

  • L = 60.5
  • n = 21
  • B = 2 + 7 = 9
  • G = 8
  • w = 5

Estimated Median= 60.5 + (21/2) − 9 8 × 5

Estimating the Mode from Grouped Data

Again, looking at our data:

SecondsFrequency51 - 55256 - 60761 - 65866 - 704

We can easily find the modal group (the group with the highest frequency), which is 61 - 65

We can say "the modal group is 61 - 65"

But the actual Mode may not even be in that group! Or there may be more than one mode. Without the raw data we don't really know.

But, we can estimate the Mode using the following formula:

Estimated Mode = L +  fm − fm-1(fm − fm-1) + (fm − fm+1) × w

where:

  • L is the lower class boundary of the modal group
  • fm-1 is the frequency of the group before the modal group
  • fm is the frequency of the modal group
  • fm+1 is the frequency of the group after the modal group
  • w is the group width

In this example:

  • L = 60.5
  • fm-1 = 7
  • fm = 8
  • fm+1 = 4
  • w = 5

Estimated Mode= 60.5 +  8 − 7(8 − 7) + (8 − 4) × 5

Our final result is:

  • Estimated Mean: 61.333...
  • Estimated Median: 61.4375
  • Estimated Mode: 61.5

(Compare that with the true Mean, Median and Mode of 61.38..., 61 and 62 that we got at the very start.)

And that is how it is done.

Now let us look at two more examples, and get some more practice along the way!

Baby Carrots Example

What is grouped data in maths

Example: You grew fifty baby carrots using special soil. You dig them up and measure their lengths (to the nearest mm) and group the results:

Length (mm)Frequency150 - 1545155 - 1592160 - 1646165 - 1698170 - 1749175 - 17911180 - 1846185 - 1893

Length (mm)Midpoint
xFrequency
f
fx150 - 1541525760155 - 1591572314160 - 1641626972165 - 16916781336170 - 17417291548175 - 179177111947180 - 18418261092185 - 1891873561Totals:508530

Estimated Mean =  853050  = 170.6 mm

The Median is the mean of the 25th and the 26th length, so is in the 170 - 174 group:

  • L = 169.5 (the lower class boundary of the 170 - 174 group)
  • n = 50
  • B = 5 + 2 + 6 + 8 = 21
  • G = 9
  • w = 5

Estimated Median= 169.5 +  (50/2) − 219 × 5

The Modal group is the one with the highest frequency, which is 175 - 179:

  • L = 174.5 (the lower class boundary of the 175 - 179 group)
  • fm-1 = 9
  • fm = 11
  • fm+1 = 6
  • w = 5

Estimated Mode= 174.5 +  11 − 9(11 − 9) + (11 − 6) × 5

 = 175.9 mm (to 1 decimal)

Age Example

Age is a special case.

When we say "Sarah is 17" she stays "17" up until her eighteenth birthday.
She might be 17 years and 364 days old and still be called "17".

This changes the midpoints and class boundaries.

What is grouped data in maths

Example: The ages of the 112 people who live on a tropical island are grouped as follows:

AgeNumber0 - 92010 - 192120 - 292330 - 391640 - 491150 - 591060 - 69770 - 79380 - 891

A child in the first group 0 - 9 could be almost 10 years old. So the midpoint for this group is 5 not 4.5

What is a grouped data called?

Grouped data is a statistical term used in data analysis. Raw data can be organized by grouping together similar measurements in a table. This frequency table is also called grouped data.

What is grouped data for kids?

grouped data ~ A Maths Dictionary for Kids Quick Reference by Jenny Eather. data that has been ordered and sorted into groups called classes, often displayed in a frequency table. the graph of grouped data is called a histogram.

What does grouped mean in math?

Mean of grouped data is the data set formed by aggregating individual observations of a variable into different groups. Grouped data is data that is grouped together in different categories.

What are the two types of grouped data?

There are two major types of grouping: data binning of a single-dimensional variable, replacing individual numbers by counts in bins; and grouping multi-dimensional variables by some of the dimensions (especially by independent variables), obtaining the distribution of ungrouped dimensions (especially the dependent ...