Frequency distribution (2024)

home / probability and statistics / descriptive statistics / frequency distribution

A frequency distribution is a visual representation (chart, table, list, graph, etc.) of how frequently some event or outcome occurs in a statistical sample.

The table below shows the frequency distribution of people in line at a movie theater categorized by age.


Age rangeFrequency
15-196
20-2415
25-2925
30-3418
35-394
40-442

Frequency distributions can be useful for depicting patterns in a given set of data. For example, the distribution above shows that the most common age of people in line was 25-29. Also, about 83% of people at the theater fell within the age range of 20-34. Knowing information like this helps the theater make more informed decisions based on their customers.

How to construct a frequency distribution

There are a number of types of frequency distributions. The table above is an example of a grouped frequency distribution, which is a frequency distribution with a large range of values such that the data is usually grouped into classes that are larger than one unit in width. A class in this context is a quantitative or qualitative category. For example, in the table above, each age range is a class, so there are 6 classes.

Constructing a grouped frequency distribution involves identifying and organizing classes, then counting the observations/outcomes that fall within the classes. Some general steps for constructing a frequency distribution are listed below:

  • Determine the range of the set of data. The range is the difference between the largest and smallest values in the set.

  • Choose an appropriate number of classes. Different formulas can be used to estimate the ideal number of classes, but these formulas are not a hard rule. When choosing the number of classes, it is most important to choose a number that provides information about the data that we are interested in. Too few classes may not tell us much about how the data is organized while too many classes may not tell us much about any particular class. As a rule of thumb, between 5 and 20 equal interval classes are commonly used. Formulas for estimating the ideal number of classes (C) given the total number of observations (n) include:

    • C = 1 + 3.3log10n
    • C = Frequency distribution (1)

  • Divide the classes into intervals of equal length by using the following formula then taking the ceiling (the least integer greater than the result; e.g. the ceiling of 4.1 is 5) of the result:

    • Frequency distribution (2)

  • Choose the starting point of the classes. It is common to start the classes from the lowest value, though starting from the highest values is also possible. Add the length of the class interval to the starting value to determine the lower value in the subsequent class interval. Subtract 1 from the result to find the upper limit of the previous class. Continue this process for each class.

  • Tally the scores in the appropriate class intervals to determine the frequency distribution.

Example

Construct a grouped frequency distribution with 6 classes using the scores that students in a class obtained on their statistics exam: 45, 48, 52, 55, 62, 63, 66, 70, 70, 72, 73, 76, 77, 77, 80, 81, 84, 85, 85, 88, 90, 91, 95, 97, 98.

The range of scores is:

98 - 45 = 53

The class interval is:

53 / 6 = 8.8

The ceiling of 8.8 is 9, so each class interval has a length of 9.

Choosing 45 as the starting point, the next class interval begins at 54, and the first class interval ends at 53. The remainder of the class intervals are shown in the table below along with the sum of the tallies of scores in each class interval:


ClassFrequency
45-533
54-622
63-714
72-806
81-895
90-985

Class midpoints in a frequency distribution

The class midpoint of a frequency distribution is the average of each class in a frequency distribution. It can provide more information about the distribution of a data set and is also helpful for creating a histogram. The class midpoint can be computed as follows:


Frequency distribution (3)


Thus, the class midpoints for the frequency distribution in the example above are:


ClassFrequencyMidpoint
45-533Frequency distribution (4)
54-622Frequency distribution (5)
63-714Frequency distribution (6)
72-806Frequency distribution (7)
81-895Frequency distribution (8)
90-985Frequency distribution (9)

Frequency polygons

Frequency polygons are a graphical representation of frequency distributions. They are similar to histograms.

Example

Graph the following frequency distribution given data for the time taken for students to complete a test.


Time (minutes)FrequencyMidpoint
2-604
7-1139
12-161214
17-211819
22-263024
27-312029
32-361234
37-411939
42-462144
47-511749
52-56554
57-61059

To graph the frequency distribution, plot the frequency vs. time using the midpoint for the x-value:

Frequency distribution (10)

Frequency distributions can be represented in a number of other ways as well, including bar graphs, histograms, box and whisker plots, and more.


Frequency distribution (2024)

References

Top Articles
Latest Posts
Article information

Author: Dong Thiel

Last Updated:

Views: 6324

Rating: 4.9 / 5 (59 voted)

Reviews: 90% of readers found this page helpful

Author information

Name: Dong Thiel

Birthday: 2001-07-14

Address: 2865 Kasha Unions, West Corrinne, AK 05708-1071

Phone: +3512198379449

Job: Design Planner

Hobby: Graffiti, Foreign language learning, Gambling, Metalworking, Rowing, Sculling, Sewing

Introduction: My name is Dong Thiel, I am a brainy, happy, tasty, lively, splendid, talented, cooperative person who loves writing and wants to share my knowledge and understanding with you.