We were given a data of population of cities of different
States and 2-wheelers bought in those cities for the year 2009, 2010 and 2011.
Using the provided data we were asked to find out the cities having maximum
number of 2-wheelers in a given Region (North, South, East, West and Central).
Above graphs are not normally distributed and therefore there is no uniformity in the data given. We were also told about skewness and different types of normal curves:
1)Right-tailed normal curve
2)Left-tailed normal curve
Firstly we used the Ratio method i.e. the population of
people in a given city to the number of 2-wheelers in the same city. But this
method was not very accurate.
Further in Class we were introduced to the three basic
term of statistics:
1. Mean:
Mean is basically the average
of a set of given values.
For Example: 1,1,2,4,5,6,7,8,8,8
The average of all numbers
given above is (1+1+2+4+5+6+7+8+8+8)/10 =5
2. Median:
The median is the
numerical value separating the higher half of a data from the lower half.
For Example:
1,1,2,4,5,6,7,8,8,8
The median of the above is
(5+6)/2= 5.5
3. Mode:
The mode is the value
that appears most often in a set of data.
For Example: 1,1,2,4,5,6,7,8,8,8
The mode of the above is 8.
We were then asked to find the mean and median of
2-wheelers in the top cities of all the regions using SPSS.
We also used box-plot to further analyse the given data.
Outliner is an observation that is numerically distant
from the rest of the data. In the given box-plot there are two outliners. One in the northern region and the other in
the central region. This indicated that there was a city in both the region
where the number of 2-wheleers was comparatively much higher than the rest of
the cities of the same region.
We also noticed that in the given data the mean was
always greater than the Median and median was a much better way of analysing
the same.
Prachee Kasera
Neha Gupta
Nitesh Beriwal
Raghav Bhatter
Parthojit Sar
No comments:
Post a Comment