Tag: assumed mean

Quartiles, Deciles and Percentiles
Quartiles divides a set of data into four equal parts.

A median divides a set of data into two parts each with equal number of items.

The first quartile, mostly referred to as the lower quartile contains 25% of the total data items. Lower quartile can be described as the median of the bottom half.

Second quartile is actually the median of the whole data(50%).

The third quartile is usually referred to as upper quartile and contains 75% of total data items. It can be described as the median of the upper half the data set.

Formula for the getting the first quartile Q1

Where
- L is the lower class boundary of the quartile class.
- n is the total frequency
- c is the cumulative frequency above the quartile class
- i is the class interval
- f is the frequency of the lower quartile class
Formula for the getting the second quartile Q₂

Second quartile Q₂ is actually the median of the data

it is calculated from:

where
- L is the lower class boundary of the median class.
- n is the total frequency
- c is the cumulative frequency above the median class
- i is the class interval
- f is the frequency of the median class
Formula for the getting the third quartile Q₃

where
- L is the lower class boundary of the upper quartile class.
- n is the total frequency
- c is the cumulative frequency above the third quartile class
- i is the class interval of the upper quartile class
- f is the frequency of the upper quartile class
Deciles

Deciles divides a set of data into ten equal parts.

First decile is when n is divided by 10. that is; Decile = n/10

where n is the total frequency for the data

Percentiles

Percentiles divides a set of data into hundred equal parts.

one percentile is given as (1/100)*n

In quartiles, deciles and percentiles, data is arranged in ascending order

Example 5

The table below shows the distribution of heights to the nearest cm of students in a school.

Table of heights of some students

Find (a) the median

(b)(i) lower quartile (ii) upper quartile (iii) 80th percentile.

Solution

(a) The new frequency table for the data is shown here

There are 130 students . Therefore, the median height is the 65^th student. that is; median is 130/2.

The 65^th student falls in the 150-159 class. This class is called the median class.

Using the formula for the median:

(b) (i)

Lower quartile Q1 = L + (n/4 – C)i/f, that is:

ii)

Upper quartile Q3= L + (3n/4)-23)*5/9

(C)

The 80th percentile of the data is given by 80/100)*130=104^th value.

The 104^th student falls in the 160-169 class

80th percentile= L+(80/100n-C)i/f

The complete solution is as below:

Example

Determine the lower quartile and upper quartile for the following set of data

15, 20, 16, 15, 18, 17, 13, 9, 17, 18, 11

solution

arranging in ascending order

9, 11, 13, 15, 15, 16, 17, 17, 18, 18, 20

The median number is 16. On left of 16 there are 5 values and on the right 5 values.

16 is at the center of the data list

9, 11, 13, 15, 15 | 16 | 17, 17, 18, 18, 20

The first half contains: 9, 11, 13, 15, 15

The central value in that lower half is 13 and it is the first quartile of the data

The upper half includes: 17, 17, 18, 18, 20

The central value is 18 and is hence the upper quartile for the data list

Related Topics
- Introduction to statistics
- Mathematics
January 31, 2024
Working with assumed mean
Assumed mean is a certain value that is chosen from the data set such that it can be subtracted from all other values to reduce the size of numbers in the data set.

An assumed mean is usually determined by guessing the number that could be used as the mean among the values in the data set.

It is like picking one of the numbers in the dataset and assuming it is the mean for the data. By looking at the data, we can guess a number close to the mean because mean, as a measure of central tendency, which most likely will be a number near the median of the data.

Take for instance the data set below.

89, 64, 56, 78, 88, 67, 72, 85, 70, 65, 64, 66, 72, 74, 76.

Arranging the data in ascending order we have

56, 64, 64, 65, 66, 67, 70, 72, 72, 74, 76, 78, 85, 88, 89.

Range= 89 – 56 =33

A method I find convenient to find a central data item is 56+(33/2) =56+17=73.

Now because we don’t have 73, I pick 72 as the assumed mean. And I will subtract 72 from each data item as in table below.

Now i get the summation of fd: ∑fd=-1

and mean of d= (∑fd)/(∑f) ≈ -0.06667

mean of x, x̄ = 72 + (-0.0667) = 71.9333

The sum of x has been done by a statistical software. Otherwise it could be time consuming and error prone and energy sucking to try and compute it manually. It has been recorded there for comparison purposes.

in a more general case, if we take an assumed mean A and subtract it from each data item x, then we get data items labelled with d such that d=x-A and the mean for the data expressed as:

Practice question

Using an assumed mean, calculate the mean of each of the following sets of data

(a)

0.655, 0.685, 0.705, 0.665, 0.695, 0.715, 0.375, 0.745, 0.755, 0.765, 0.745, 0.550, 0.450, 0.400, 0.425, 0.325, 0.775, 0.685, 0.695, 0.745

(b)

225, 400, 300, 225, , 525, 325, 600, 225, 325, 400, 525, 575, 625, 250, 650, 350, 475, 550, 575, 275, 375, 475, 475, 675, 400, 375, 275, 250.

Related Topics
January 30, 2024