Statistical Concepts, Mean, Median, Mode

1. Mean (Arithmetic Average)

Mean is one of the most commonly used measures of central tendency in statistics. It represents the average value of a dataset and is calculated by adding all observations and dividing the total by the number of observations. Mean provides a single value that summarizes the entire dataset and helps analysts understand the overall trend of the data. It is widely used in business analytics, finance, economics, and research for performance evaluation and comparison. However, the mean can be affected by extremely high or low values (outliers), which may distort the actual representation of data. Despite this limitation, it remains an important statistical tool because of its simplicity and usefulness in analysis.

Formula

Mean = (Sum of All Observations) ÷ (Number of Observations)

Example

A company’s monthly sales (in ₹ lakh) are 20, 25, 30, 35, and 40.

Mean = (20 + 25 + 30 + 35 + 40) ÷ 5 = 150 ÷ 5 = 30

Thus, the average monthly sales are ₹30 lakh.

Characteristics

  • Easy to calculate and understand.
  • Uses all data values in the dataset.
  • Suitable for numerical data.
  • Sensitive to extreme values.
  • Widely used for statistical analysis.

Role

  • Measures the average performance.
  • Supports comparison between datasets.
  • Assists business decision-making.
  • Helps identify overall trends.
  • Forms the basis for advanced statistical analysis.

2. Median

Median is the middle value of a dataset when the observations are arranged in ascending or descending order. It divides the dataset into two equal halves, with 50% of observations above it and 50% below it. Unlike the mean, the median is not affected by extreme values or outliers, making it a reliable measure for skewed datasets. Median is particularly useful when analyzing income distributions, property prices, and other datasets where unusual values may distort the average. It provides a better representation of the central position of data when the distribution is uneven. In Business Analytics, the median helps organizations understand typical performance and customer behavior more accurately.

Formula

  • If the number of observations is odd:

    Median = Middle Value

  • If the number of observations is even:

    Median = (Middle Two Values Sum) ÷ 2

Example

Employee salaries (₹ thousand): 20, 25, 30, 35, 100

Arranged data: 20, 25, 30, 35, 100

Median = 30

Although one employee earns ₹100 thousand, the median remains ₹30 thousand, providing a more realistic representation of the typical salary.

Characteristics

  • Represents the middle value.
  • Not affected by extreme values.
  • Suitable for skewed distributions.
  • Easy to interpret.
  • Divides data into two equal parts.

Role

  • Identifies the central position of data.
  • Provides accurate analysis in skewed datasets.
  • Supports market and customer analysis.
  • Helps evaluate income and price distributions.
  • Reduces the influence of outliers.

3. Mode

Mode is the value that occurs most frequently in a dataset. It indicates the observation with the highest frequency and is the only measure of central tendency that can be used for both numerical and categorical data. A dataset may have one mode (unimodal), two modes (bimodal), multiple modes (multimodal), or no mode if all values occur equally. Mode is especially useful in marketing, retail, and customer analytics because it identifies the most common preference, behavior, or choice. Unlike the mean, the mode is not affected by extreme values. It helps businesses understand popular products, customer preferences, and frequently occurring events.

Example

Product sizes sold in a store:

M, L, M, XL, M, L, M, S

Mode = M

This indicates that Medium (M) is the most frequently purchased size, helping the store maintain appropriate inventory levels.

Characteristics

  • Represents the most frequent value.
  • Applicable to numerical and categorical data.
  • Not influenced by extreme values.
  • Easy to identify.
  • Useful for preference analysis.

Role

  • Identifies the most common occurrence.
  • Supports product and customer analysis.
  • Helps understand consumer preferences.
  • Assists inventory and demand planning.
  • Useful in market research.

Difference Between Mean, Median, and Mode

Aspect Mean Median Mode
Definition Average value Middle value Most frequent value
Calculation Sum ÷ Number of observations Middle observation Highest frequency
Outlier Effect Highly affected Not affected Not affected
Data Type Numerical only Numerical/Ordinal Numerical/Categorical
Purpose Measure average Find central position Identify common value
Complexity Easy Easy Very easy
Best Used For Symmetrical data Skewed data Frequency analysis
Business Use Performance analysis Income analysis Customer preference analysis

Leave a Reply

error: Content is protected !!