The measure of central tendency, as discussed in the previous chapter tells us only about the characteristics of a particular series. They do not describe any thing on the observations or data entirely. In other wards, measures of central tendency do not tell any thing about the variations that exist in the data of a particular series. To make the concept, let discuss an example. It was found by using formula of mean that the average depth of a river is 6 feet. One cannot confidently enter into the river because in some places the depth may be 12 feet or it may have 3 feet. Thus this type of interpretation by using the measures of central tendency some times proves to be useless. Hence the measure of central tendency alone to measure the characteristics of a series of observations is not sufficient to draw a valid conclusion. With the central value one must know as to how the data is distributed. Different sets of data may have the same measures of central tendency but differ greatly in terms of variation. For this knowledge of central value is not enough to appreciate the nature of distribution of values. Thus there is the requirement of some additional measures along with the measures of central tendency which will describe the spread of the entire set of values along with the central value. One such measure is popularly called as dispersion or variation. The study of dispersion will enables us to know whether a series is homogeneous (where all the observations remains around the central value) or the observations is heterogeneous (there will be variations in the observations around the central value like 1, 50, 20, 28 etc., where the central value is 33). Hence it can be said that a measure of dispersion describes the spread or scattering of the individual values of a series around its central value.
Experts opine different opinion on why the variations in a distribution are so important to consider? Following are some views on validity of the measure of dispersion:
Following are some definitions defined by different experts on measures of dispersion. L.R. Connor defines measures of dispersion as ‘dispersion is the measure extended to which individual items vary'. Similarly, Brookes and Dick opines it as ‘dispersion or spread is the degree of the scatter or the variation of the variables about a central value'. Robert H. Wessel defines it as ‘measures which indicate the spread of the values are called measures of dispersion'. From all these definition it is clear that dispersion measures more or less describes the spread or scattering of the individual values of a series around its central value.
Dispersion of a series of data can be calculated by using following four widely used methods
Dispersion when measured on basis of the difference between two extreme values selected from a series of data. The two well known measures are
Dispersion when measured on basis of average deviation from some measure of central tendency. The well known measures are
All the tools are discussed in details below one after the other.
The range is the simplest measure of the dispersion. The range is defined as the difference between the highest value and the lowest value of the series. Range as a measure of variation is having limited applicability. It is widely used for weather forecasting by the meteorological departments. It also used in statistical quality control. Range is a good indicator to measure the fluctuations in price change like that of studying the variations in the price of shares and debentures and other related matters. Following is the procedure of calculating range:
Range= value of the highest observation (H) – value of the lowest observation (L)
or Range = H – L
A second measure of dispersion is the inter-quartile range which takes into account the middle half i.e., 50% of the data thus, avoiding the problem of extreme values in the data. Hence it measures approximately how far from the median one must go on either side before it can be include one-half the values of the data set. Inter-quartile range can be calculated by dividing the series of observations into four parts; each part of the series contains 25 percent of the observations. The quartiles are then the highest values in each of these four parts, and the inter-quartile range is the difference between the values of the first and the third quartile. Following are the steps of calculating the inter-quartile range:
Mean/average deviation is the arithmetic mean of the difference of a series computed from any measure of central tendency i.e., either deviation from mean or median or mode. The absolute values of each observation are calculated. Clark and Schekade opine mean deviation or average deviations as the average amount of scatter of the items in a distribution from either the mean or the median, ignoring the signs of the deviations. Thus the average that is taken of scatter is an arithmetic mean, which accounts for the fact that this measure is often called as mean deviation or average deviations.
In case of discrete series, mean deviation can be calculated through following steps
In the same way one can calculate mean deviation from median or mode in case of individual series.
Mean deviation can be calculated in case of discrete series in a little bit different way. Following are some steps to calculate the average mean when the series is discrete.
Similarly, one can calculate the mean deviation or average deviation by taking deviations from median or mode.
Calculations of Mean Deviation in case of continuous series:
Standard deviation or other wise called as root mean square deviation is the most important and widely used measure of variation. It measures the absolute variation of a distribution. It is the right measure that highlights the spread of the observation over and around the mean value. The greater the rate of variation of observations in a series, the greater will be the value of standard deviation. A small value of standard deviation implies a high degree of homogeneity among the observations in the series. If there will be a comparison between two or more standard deviations of two or more series, than it is always advisable to choose that series as ideal one which is having small value of standard deviation. Standard deviation is always measures from the mean or average value of the series. The credit for introducing this concept in the literature goes to Karl Pearson, a famous statistician. It is denoted by the Greek letter (pronounced as sigma)
Standard deviation is calculated in following three different series:
All the above conditions are discussed in detail below.
In case of individual series, the value of standard deviation can be calculated by using two methods.
Following are some steps to be followed for calculating the value of standard deviation.
In practical uses it so happens that while calculating standard deviation by using the arithmetic mean, the mean value may be in some fractions i.e., .25 etc. This creates the real problem in calculating the value of standard deviation. For this purpose, instead of calculating standard deviation by using the above discussed arithmetic mean methods, researchers generally prefer the method of short-cut which is nothing rather calculation of standard deviation by assuming a mean value. Following are some steps that to be followed for calculating standard deviation in case of assumed mean method:
Discrete series are the series which are having some frequencies or repetitions of observations. In case of a discrete series standard deviation is calculated by using following three methods:
Following are the detailed analysis of the above the two methods.
1. When deviations are taken from actual mean: The steps to calculate standard deviation when deviations are calculated from the actual mean are
2. When deviations are taken from assumed mean: The steps to calculate standard deviation when deviations are calculated from the actual mean are
Standard deviation in case of a continuous series can be calculated by using the following steps
As tool of variance, standard deviation is used as a good measure of interpretation of the scatteredness of observation of a series. It is a fact that in a normal distribution approximately 68 per cent of the observations of a series lies less than standard deviation away from the mean, again approximately 95.5 per cent of the items lie less than 2 standard deviation value away from the mean and in the same way 99.7 per cent of the items lie within 3 standard deviations away from the mean. Hence
Advantage of Standard Deviation: Following are some advantages of standard deviation as a measure of dispersion
Disadvantages of standard deviation: The disadvantages are
Another useful statistical tool for measuring dispersion of a series is coefficient of variation. The coefficient of variation is the relative measure of standard deviation which is an absolute measure of dispersion. This tool of dispersion is mostly used in case of comparing the variability two or more series of observation. While comparing, that series for which the value of the coefficient of variation is greater is said to be more variable (i.e., the observations of the series are less consistent, less uniform, less stable or less homogeneous). Hence it is always advisable to choose that series which is having less value of coefficient of variation. The value of coefficient is less implies more consistent, more uniform, more stable and of course more homogeneous. The value of coefficient of variation is always measured by using the value of standard deviation and its relative arithmetic mean. It is denoted as C.V., and is measured by using simple formula as discussed below:
In practical field, researchers generally prefer to use standard deviation as a tool to measure the dispersion than that of coefficient of variance because of a numbers of reasons (researchers are advised to refer any standard statistics book to know more on coefficient of variance and its usefulness).
An illuminating manner of viewing the Gini coefficient is in terms of the Lorenz curve due to Lorenz (1905). It is generally defined on the basis of the Lorenz curve. It is popularly known as the Lorenz ratio. The most common definition of the Gini coefficient is in terms of the Lorenz diagram is the ratio of the area between the Lorenz curve and the line of equality, to the area of the triangle OBD below this line (figure-1). The Gini coefficient varies between the limits of 0 (perfect equality) and 1 (perfect inequality), and the greater the departure of the Lorenz curve from the diagonal, the larger is the value of the Gini coefficient. Various geometrical definitions of Gini coefficient discussed in the literature and useful for different purposes are examined here.
The study of dispersion will enables us to know whether a series is homogeneous (where all the observations remains around the central value) or the observations is heterogeneous (there will be variations in the observations around the central value Hence it can be said that a measure of dispersion describes the spread or scattering of the individual values of a series around its central value. For this there are a numbers of methods to determine the variations as discussed in this chapter. But it is always confusing among the researchers that which method is the best among the different techniques that we have discussed? The answer to this question is very simple and says that no single average can be considered as best for all types of data series. The most important factors are the type of data available and the purpose of investigation. Critiques suggest that if a series is having more extreme values than standard deviation as technique is to be avoided. On the other hand in case of more skewed observations standard deviation may be used but mean deviation needs to be avoided where as if the series is having more gaps between two observations than quartile deviation is not an appropriate measure to be used. Similarly, standard deviation is the best technique for any purpose of data.
1. Age of ten students in a class is considered. Find the mean and standard deviation.
19, 21, 20, 20, 23, 25, 24, 25, 22, 26
Measures of dispersion. (2017, Jun 26).
Retrieved November 21, 2024 , from
https://studydriver.com/measures-of-dispersion/
A professional writer will make a clear, mistake-free paper for you!
Get help with your assignmentPlease check your inbox
Hi!
I'm Amy :)
I can help you save hours on your homework. Let's start by finding a writer.
Find Writer