Median is the value that splits an ordered dataset in half — 50% of observations lie at or below it and 50% at or above it. Unlike the mean (average), the median resists outliers, which is why government agencies often report medians for wages, home prices, and household income. Knowing when — and how — to use the median helps you describe real-world data more accurately, compare groups fairly, and make better decisions from noisy or skewed distributions.
Key Takeaways
- Definition: The median is the middle value of an ordered dataset; with an even count, it’s the average of the two middle values.
- Why it matters: The median is robust to outliers and skew, making it a preferred summary for incomes, wages, and prices.
- How it differs from the mean: The mean moves with extreme values; the median does not. Use both to understand shape and skew.
- In official statistics: U.S. Census and BLS widely publish medians (e.g., median household income, median wages) for comparability across geographies and time.
- Variants you may need: weighted medians for grouped/weighted data; median-of-medians in algorithms; medians within boxplots to visualize distributions.
What the Median Really Means (and When It’s Better Than the Mean)
The median is a positional measure of central tendency: once you sort the values, the position of the middle observation defines the statistic. Because it depends on rank rather than magnitude, a few extreme values leave the median unchanged. This resistant property makes the median ideal for skewed distributions such as income, home prices, or transaction sizes. For example, in neighborhoods where a few ultra-expensive properties pull the average higher, the median home price better reflects a “typical” sale.
By contrast, the mean aggregates magnitudes. In quality control, finance, or experimental science — where values cluster symmetrically and outliers are rare or already handled — the mean can be more efficient. Analysts therefore report the median alongside the mean and dispersion metrics (standard deviation, interquartile range) to convey both typical level and spread. Boxplots explicitly display the median line with quartiles and outliers to summarize shape at a glance.
Agencies prefer medians for public-facing indicators because they’re interpretable. The U.S. Census describes median household income as the midpoint where half of households earn more and half earn less; this avoids distortion from very high earners. The Bureau of Labor Statistics likewise reports median hourly wages for occupations, which helps job seekers and policymakers compare typical pay without extremes skewing the picture.
Odd n: Median = value at position (n + 1) / 2 in the ordered data
Even n: Median = average of values at positions n/2 and (n/2 + 1)
Order the data first (ascending or descending). For even counts, average the two middle values.
How to Calculate the Median (Step by Step)
1) Gather and clean. Start with the complete set of observations, decide how to handle duplicates and missing values, and ensure each observation represents the same concept (e.g., hourly wage, not a mix of hourly and weekly). Consistency matters more than sample size for a meaningful median.
2) Sort the data. Order values from smallest to largest. Sorting establishes the rank positions that define the median.
3) Identify the middle position. For an odd count, the middle is the (n+1)/2-th observation; for an even count, it’s the average of the n/2-th and (n/2+1)-th observations. This simple positional rule is invariant to outliers and scale changes.
4) Document decisions. If you used class intervals, imputed values, or weights, note your method. For weighted datasets (e.g., household surveys), compute the weighted median — the point where cumulative weight crosses 50% — to reflect representation correctly.
5) Communicate clearly. When reporting medians externally, use precise language like “half of observations lie above and half below,” and provide context (sample, timeframe, and whether values are nominal or inflation-adjusted). This aligns with how Census and BLS present medians in national indicators.
Suppose you record daily transaction sizes ($) for a week: 18, 22, 22, 24, 150, 19, 21. After sorting (18, 19, 21, 22, 22, 24, 150), the median is the 4th value = 22. The mean is (18+19+21+22+22+24+150)/7 = 39.4, which is inflated by a single $150 outlier. The median better represents a “typical” transaction in this skewed set.
Median in Official Statistics (Income, Wages, Prices)
Household income. The U.S. Census defines median household income as the point dividing the income distribution in halves. In September 2025, the Census highlighted recent changes in median incomes across race and ethnicity groups (reported in inflation-adjusted dollars), underscoring why medians are preferred for distributional analysis. Using medians reduces the influence of very high incomes that can distort averages when assessing typical living standards.
Occupational wages. The Bureau of Labor Statistics (OEWS program) publishes median wages by occupation and geography. Medians help job seekers, employers, and policymakers benchmark typical pay levels — more stable than means when a small fraction of very high earners exists in an occupation. BLS documentation explains the distinction between mean and median wages and when each is reported.
Prices and housing. Medians are also used for home sale prices and rent levels to summarize markets with long right tails. Analysts frequently pair median price with interquartile ranges or percentiles to communicate both center and spread without undue sensitivity to a few luxury transactions. The same logic applies to median list prices in online marketplaces and local housing dashboards.
Communication tip. When comparing medians over time, specify whether values are nominal or adjusted for inflation. Agencies often report medians in “constant” (inflation-adjusted) dollars, which better reflect purchasing power and real trends.
Median vs. Mean vs. Mode: Choosing the Right Summary
Mean: Sum divided by count. Efficient for symmetric distributions and vital for many statistical procedures; sensitive to outliers by design.
Median: Rank-based midpoint. Resistant to extremes; ideal for skewed or heavy-tailed data, bounded measures (like time-to-complete), or when outliers reflect rare events rather than central tendency.
Mode: Most frequent value. Useful for categorical data (most common response) or discretized measures (popular SKU price points). Often reported with the median and mean to show shape.
In practice, present multiple summaries: mean for mass balance or aggregation logic, median for typical values in skewed data, and percentiles (P25, P75) to show dispersion. Boxplots package these elements in a compact visual — displaying the median line, interquartile box, and outliers.
Advanced Topics: Weighted Medians, Medians from Grouped Data, and Robust Spread
Weighted median. In survey or transactional data with weights, the median is the smallest value where cumulative weight ≥ 50%. This mirrors how population-representative surveys (e.g., ACS) summarize income and avoids bias from unequal selection probabilities.
Grouped data. If you only have bins (e.g., “$50k–$59,999”), you can approximate the median by identifying the bin where cumulative frequency passes 50% and interpolating within the class. Always disclose that the result is an estimate and depends on bin widths and within-bin distributional assumptions.
Median absolute deviation (MAD). For a robust measure of spread, compute the median of |x − median(x)|. MAD resists outliers and pairs naturally with the median to describe skewed data. Engineers and quality analysts use median+MAD when normality is doubtful. The NIST handbook discusses robust location and scale choices as part of exploratory data analysis, which is particularly useful with non-normal data.
Sampling uncertainty. Medians from samples carry variance. Use bootstrap intervals or large-sample approximations to quantify uncertainty, especially when medians inform policy or compensation decisions. Reporting confidence intervals around medians adds transparency similar to what agencies provide with standard errors in survey tables.
Computational notes. In large streaming systems, selection algorithms like “median-of-medians” find approximate medians in linear time, supporting dashboards and SLAs at scale. While implementation details vary, the statistical definition remains the same — split the mass.
Common Pitfalls (and How to Avoid Them)
Mixing incomparable units. Don’t take the median of hourly wages combined with annual salaries. Convert to a common basis (e.g., hourly or annualized) first so the statistic has a coherent interpretation. BLS wage series, for example, define unit and concept clearly to maintain comparability across occupations and regions.
Ignoring weights. If a few large entities contribute many records, an unweighted median may misrepresent the population. Use weights when your design calls for them (household surveys, stratified samples, customer-level weighting). Census income medians explicitly reflect sampling and weighting; treat your medians similarly when data are not simple random samples.
Confusing mean with median in narratives. Media headlines sometimes say “average income” when the source reported a median. Verify the source’s definition and use correct terminology to avoid misleading audiences. Census notes the midpoint definition explicitly in releases and briefs.
Not reporting dispersion. A single median hides the distribution’s spread and shape. Pair medians with quartiles, percentiles, or MAD to prevent over-interpreting small differences that may fall within natural variability. The NIST handbook emphasizes describing spread alongside center in exploratory analysis.
Frequently Asked Questions
When is the median better than the mean?
Use the median when distributions are skewed (income, home prices), when outliers are present, or when you want a “typical” value that isn’t pulled by extremes. Use the mean for balanced, symmetric data or when totals and averages are operationally required.
What’s the difference between sample and population median?
The population median is defined over the full distribution; the sample median estimates it from observed data. Like any estimator, the sample median has uncertainty — use intervals (e.g., bootstrap) when reporting results that drive decisions.
Why do agencies prefer median income/wage?
Because extreme values can distort the mean, the median better represents the “typical” household or worker. Census and BLS explicitly define and report medians to support fair comparisons across groups and time.
Can I average medians across groups?
Not meaningfully. The median is not additive. If you must summarize across groups, recompute the overall median from microdata or use weighted methods that reflect group sizes and distributions.
How is the median shown in charts?
Box-and-whisker plots draw a line at the median within the interquartile “box,” with whiskers and points for outliers. They convey location and spread succinctly and enable fair group comparisons.
Sources
- Investopedia — Median: What It Is and How to Calculate It.
- NIST Dictionary — “Median” (definition for odd/even cases).
- NIST/SEMATECH e-Handbook — Measures of Location (robust choices).
- U.S. Census Bureau (2025) — Median Household Income story.
- BLS OEWS — FAQ on mean vs. median wages.
- BLS — Occupational Employment & Wage Statistics (program overview).
- Investopedia — Descriptive Statistics (boxplots show medians and quartiles).

