As
the story of statistics goes on we are on the learning path of the software
SPSS. In the previous session we learned the basics of histogram and box plot. In
this session we studied Box plot in detail.
A Box Plot
consists of a box and a whisker.
1) A box plot is divided into quartiles. A whisker
denotes 25% of data. The rest 50 % is
covered in the body of box and last 25 % in the down whisker.
2) The length of Box does not denote more data but
denotes the range of distribution of the data.
3) If any data is more than 3 times of the highest
data value it is an outlier.
4) The median denotes that 50 % of data is spread on
both side of median.
5) Median is the best way to depict the distribution
of data as mode is insignificant in data which has variable data values.
Fig 1: Detailed description of Box plot
Fig 2: Relation of normal distribution and box plot
Apart from that we learned how median, mean can be
calculated through SPSS, excel and manually.
This
whole data analysis was studied through real time data of 2 wheelers.
Objective:
The main objective was to find the cities possessing highest 2 wheelers region
wise.
Approach:
The
approach that was followed was ratio determination of 2 wheelers and population
Methodology:
1. For
this cities were sorted according to their states and accordingly the states
were assigned numeric values using automatic approach of assigning value to
states
2. The
cities were then sorted according to states
3. The
next step was to sort states according to 5 region viz north, east ,west south
and central
4. After
completion of this step analysis was carried out to find ratio of no. of 2
wheelers and population.
5. These
ratios were then sorted region wise with all states of a region at one place.
6. Accordingly
a clear cut picture of highest ratio was identified region wise.
Analysis:
The next step was to see the normal histographical
distribution of 2 wheelers in different regions.
Conclusion:
·
The conclusion drawn was that ratio
determination is not the most suitable method for determining the highest 2
wheelers in states as the data was not distributed normally.
·
It is important to devise the right
strategy for interpretation of data.
Reference:
4) Pallavi Bizoara
No comments:
Post a Comment