1 Definitions

Key Concepts

Statistic Parameter
Description Describes a sample Describes a population
Calculation Calculated from a sample Calculated from the whole population
Value Variable depending on the sample Fixed

Levels of Measurement

The way we interpret the numbers we assign to our measurements depends upon the level of measurement that is used.

Example

Fahrenheit and centigrade scales of temperatures are interval-level measures. They are not ratio-level because the zero point is arbitrary. For example, in the F scale 32 degrees happens to be the point where water freezes. There is no reason you couldn’t shift everything down by 32 degrees, and have 0 be the point where water freezes. Or, add 68, and have 100 be the freezing point. The zero point is arbitrary. It is not correct to say that, if it is 70 degrees outside, that it is twice as warm as it would be if it were 35 degrees outside.

Such things as age and income, however, have nonarbitrary zero points. If you have zero dollars, that literally means that you have no income. If you are 20 years old, that literally means you have been around for 20 years. Further, \(10,000 is exactly twice as much as \)5,000. If you are 20, you are half as old as someone who is 40.

Measures of Central Tendency

Note: in a normal distribution, the mean, median and mode are all the same.

Measures of Dispersion

We often want to know how much variability, or spread, there is in the numbers. For example, suppose the average income is \(25,000. It could be that most people have incomes ranging from \)24,000 - \(26,000, or the range of values could be from \)1,000 to $100,000.

Ideally:

Candidates:

For a normal distribution (bell curve) about 68% of all values fall within one standard deviation to either side of the mean, and about 95% fall within 2 standard deviations.

Sample variance uses (N - 1) instead of N in the denominator. Estimating the mean eats up 1 degree of freedom - 1 number cannot vary. With large Ns, this is trivial.

Shapes of Distributions

Univariate Statistics