Tuesday, October 23, 2007

Statistics - Rules and Tips!

1. Mean Average = total of quantities / number of quantities
2.
The median is the "middle" number in a group (when arranged in ascending or descending order) consisting of an odd number of numbers, and the average of the two middle numbers if there are an even number of numbers
3.
For a set of consecutive integers, the median is the the average of the first and the last integer
4.
Mode is the most frequently recurring number/numbers among the given set of numbers. It can be more than one
5.
Range is the difference between the largest number and smallest number is a set
6.
Calculation of Standard Deviation (SD):
  1. Find the mean, \scriptstyle\overline{x}, of the values.
  2. For each value xi calculate its deviation (\scriptstyle x_i - \overline{x}) from the mean.
  3. Calculate the squares of these deviations.
  4. Find the mean of the squared deviations. This quantity is the variance σ2.
  5. Take the square root of the variance.
7. Variance is the square of the standard deviation
8. SD does not change when the same constant is added or subtracted to all the members of the set
9.
If mean = maximum value it means that all values are equal and SD is 0
10.
A set of numbers with range of zero means that all of the numbers are the same, hence the dispersion of the numbers from its mean is zero
11.
For data with approximately the same mean, the greater the spread, the greater the SD.
12. SD is the square root of the average of the sum of square of the variation from the mean
13.
The more uneven members are dispersed around their arithmetic average, the more their SD
14.
You only need to know the difference between values and total number of values to compute SD
15.
If we know all the numbers of the list, there is a definite SD, regardless of what it is, we can compute it and get an answer – this is helpful for DS questions
16.
If the range is 0, then the SD must also be 0, because there is no variance
17.
The SD of any list is not dependent on the average, but on the deviation of the numbers from the average. So just by knowing that two lists having different averages doesn't say anything about their standard deviation - different averages can have the same SD
18.
The sum of the deviations of the elements from the mean must be 0
19.
Closer the more values to the MEAN, lower the SD
20. If
Range or SD of a list is 0, then the list will contain all identical elements
21.
Standard Deviation is also useful when comparing the spread of two separate data sets that have approximately the same mean. The data set with the smaller Standard Deviation has a narrower spread of measurements around the mean and therefore usually has comparatively fewer high or low values.
In general, the more widely spread the values are, the larger the
Standard Deviation is.
22. If you multiply all terms by x then SD =x times old SD and mean = x times old mean
23.
For comparing the SD for two sets any information about mean ,median,mode and range are insufficient unless you can determine the individual terms from the given data
24. Symmetric about the mean means that the shape of the distribution on the right and left side of the curve are mirror-images of each other
25.
For a given set of consecutive even numbers.. mean = median
26.
When you have a set of consecutive numbers (integers, evens, odds, multiples), the mean is equal to the median
27. Good links for Normal Distribution:

http://www-stat.stanford.edu/~naras/jsm/NormalDensity/NormalDensity.html

http://davidmlane.com/hyperstat/z_table.html


http://www.integratedlearning.net/gmat/sample.asp