Measures of Central Tendency and Location
Learning Outcome Statement:
calculate, interpret, and evaluate measures of central tendency and location to address an investment problem
Summary:
This LOS focuses on understanding and applying measures of central tendency and location, which are crucial for analyzing data distributions in finance. Central tendency measures, including the mean, median, and mode, describe where data is centered. Measures of location, such as quartiles and percentiles, help identify values at or below which certain proportions of data lie. The content also discusses handling outliers and the implications of using different measures on investment decisions.
Key Concepts:
Arithmetic Mean
The arithmetic mean is calculated by summing all observations and dividing by the number of observations. It is sensitive to outliers.
Median
The median is the middle value in a sorted dataset. It is less affected by outliers compared to the mean.
Mode
The mode is the most frequently occurring value in a dataset. A dataset can be unimodal, bimodal, or have no mode.
Outliers
Outliers are extreme values that can skew the results of statistical measures. They can be handled by ignoring, deleting, or replacing them with other values.
Quantiles
Quantiles are values that divide a dataset into equal-sized, contiguous subsets. Common quantiles include quartiles, quintiles, deciles, and percentiles.
Interquartile Range (IQR)
IQR is the range between the first quartile (25th percentile) and the third quartile (75th percentile) and is used to measure statistical dispersion.
Formulas:
Sample Mean
Calculates the average of a sample.
Variables:
- :
- Value of the i-th observation
- :
- Total number of observations
Interquartile Range
Measures the middle 50% spread of the data, less influenced by outliers.
Variables:
- :
- Third quartile (75th percentile)
- :
- First quartile (25th percentile)