Wednesday, 3 July 2013

Statistics Summary Day-2

As the story of statistics goes on, we are on the learning path of the software SPSS. In the previous session we learned the basics of the histogram and the box plot. In this session we studied the box plot in detail.
A box plot consists of a box and two whiskers.
1) A box plot divides the data into quartiles. The upper whisker covers roughly the top 25% of the data, the body of the box covers the middle 50%, and the lower whisker covers the bottom 25%.
2) A longer box does not mean more data; the length of the box shows how widely the middle 50% of the data is spread.
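3) A value that lies more than 1.5 box lengths (interquartile ranges) beyond either end of the box is an outlier; SPSS marks values more than 3 box lengths away as extreme values.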
4) The median splits the data in half: 50% of the values lie on each side of it.
5) The median is a good way to describe the centre of the data, whereas the mode is of little use when the values vary so much that few of them repeat.
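The points above can be sketched in a few lines of Python. This is only an illustration: the sample values are made up, the function names are hypothetical, and the quartiles are taken as the median of each half of the sorted data, which is one common convention. SPSS and Excel may use slightly different quartile rules, so their results can differ a little.

```python
from statistics import median

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]            # values below the middle position
    upper = data[half + n % 2:]    # values above the middle position
    return data[0], median(lower), median(data), median(upper), data[-1]

def find_outliers(values, k=1.5):
    """Return the values lying more than k box lengths outside the box."""
    _, q1, _, q3, _ = five_number_summary(values)
    box_length = q3 - q1           # the interquartile range (IQR)
    low_fence = q1 - k * box_length
    high_fence = q3 + k * box_length
    return [v for v in values if v < low_fence or v > high_fence]

# Invented sample data, not taken from the two-wheeler data set.
sample = [12, 15, 17, 18, 20, 21, 23, 24, 90]
print(five_number_summary(sample))
print(find_outliers(sample))         # outliers (k = 1.5)
print(find_outliers(sample, k=3))    # extreme values (k = 3)
```

For this sample the box runs from 16 to 23.5 with the median at 20, and the value 90 falls outside the fences for both k = 1.5 and k = 3.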
Fig 1: Detailed description of Box plot



Fig 2: Relation of normal distribution and box plot

Apart from that, we learned how the mean and the median can be calculated in SPSS, in Excel and by hand.
This whole analysis was carried out on real-world data on two-wheelers.
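As a rough illustration of the calculation by hand, the sketch below spells out the steps for the mean and the median. The function names and the numbers are hypothetical and are not taken from the two-wheeler data.

```python
def manual_mean(values):
    """Add up all the values and divide by how many there are."""
    return sum(values) / len(values)

def manual_median(values):
    """Sort the values and pick the middle one (or average the middle two)."""
    data = sorted(values)
    n = len(data)
    mid = n // 2
    if n % 2 == 1:
        return data[mid]                     # odd count: single middle value
    return (data[mid - 1] + data[mid]) / 2   # even count: mean of the two middle values

# Invented numbers: one very large value pulls the mean up but not the median.
values = [10, 12, 14, 16, 500]
print(manual_mean(values))
print(manual_median(values))
```

Here the mean is 110.4 while the median stays at 14, which shows why the median describes such data better.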
Objective: The main objective was to identify, region by region, the cities with the highest number of two-wheelers.
Approach: The approach followed was to compute the ratio of the number of two-wheelers to the population.
Methodology:
1.      The cities were first grouped by state, and each state was automatically assigned a numeric code.
2.      The cities were then sorted by state.
3.      The states were next grouped into five regions: north, east, west, south and central.
4.      After this step, the ratio of the number of two-wheelers to the population was calculated.
5.      These ratios were then sorted region-wise, with all the states of a region kept together.
6.      This gave a clear picture of the highest ratio in each region.
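The steps above can be sketched as follows. This is a minimal illustration and not the SPSS procedure used in class: the city names, state names and figures are invented, the function names are hypothetical, and numbering the states in alphabetical order is an assumption about how the automatic coding behaves.

```python
# Invented records for illustration only; these are not the real figures.
cities = [
    {"city": "City A", "state": "State 1", "region": "North",
     "two_wheelers": 500_000, "population": 2_000_000},
    {"city": "City B", "state": "State 2", "region": "North",
     "two_wheelers": 300_000, "population": 1_000_000},
    {"city": "City C", "state": "State 3", "region": "South",
     "two_wheelers": 400_000, "population": 1_000_000},
    {"city": "City D", "state": "State 3", "region": "South",
     "two_wheelers": 150_000, "population": 500_000},
]

def assign_state_codes(records):
    """Give each state a numeric code (assumed: alphabetical order from 1)."""
    states = sorted({r["state"] for r in records})
    return {state: code for code, state in enumerate(states, start=1)}

def highest_ratio_by_region(records):
    """Return {region: (city, ratio)} for the city with the highest ratio."""
    best = {}
    for r in records:
        ratio = r["two_wheelers"] / r["population"]
        current = best.get(r["region"])
        if current is None or ratio > current[1]:
            best[r["region"]] = (r["city"], ratio)
    return best

codes = assign_state_codes(cities)
# Sort the cities by state code, as in steps 1 and 2.
cities_by_state = sorted(cities, key=lambda r: codes[r["state"]])
for region, (city, ratio) in highest_ratio_by_region(cities_by_state).items():
    print(region, city, round(ratio, 2))
```

With these invented figures, City B has the highest ratio in the North and City C in the South.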
Analysis:
The next step was to look at histograms of the number of two-wheelers in the different regions and compare them with the normal distribution.


Conclusion:
·         The conclusion drawn was that the ratio is not the most suitable measure for identifying where two-wheelers are highest, because the data was not normally distributed.
·         It is important to devise the right strategy for interpreting the data.

Written By: Nidhi

Team Members: 
1) Nitesh Singh Patel
2) Nitin Boratwar
3) Palak Jain
4) Pallavi Bizoara



