The process of determining the minimum, first quartile (Q1), median (Q2), third quartile (Q3), and maximum values within a dataset is a fundamental statistical procedure. These five values provide a concise and robust synopsis of the distribution’s central tendency, dispersion, and skewness. As an example, consider the dataset: 4, 7, 1, 9, 3, 5, 8, 6, 2. Sorting yields: 1, 2, 3, 4, 5, 6, 7, 8, 9. The minimum is 1, the maximum is 9, the median is 5. Q1 is the median of the lower half (1, 2, 3, 4), which is 2.5. Q3 is the median of the upper half (6, 7, 8, 9), which is 7.5. Thus, the five values are: 1, 2.5, 5, 7.5, 9.
This summary technique is invaluable for exploratory data analysis, offering a rapid understanding of data characteristics without requiring complex statistical calculations. It is resistant to the influence of outliers, making it preferable to measures like the mean and standard deviation in situations where data contains extreme values. Historically, this method has been employed as a simple way to summarise data by hand before computational power was widely available. Today, it is still commonly used to provide the first step in understanding a new dataset and can be visualised using a boxplot which allows a quick comparison of distributions.