Tag: Time Series Analysis
Fishers Ideal Index Number, Meaning, Concept, Interpretation, Steps, Applications, Advantages and Limitations
Fisher’s Index Number, named after the American economist Irving Fisher, is a composite index that combines elements of both the Laspeyres and Paasche indices to provide a more balanced measure of price changes. It is considered a comprehensive measure because it accounts for both base-period and current-period quantities, offering a more accurate reflection of price changes over time. Here’s an in-depth look at Fisher’s Index Number:
Concept of Fisher’s Index Number
Fisher’s Index Number aims to address the limitations of the Laspeyres and Paasche indices, which are two commonly used methods for calculating price indices. The Laspeyres Index uses base-period quantities to weigh prices, while the Paasche Index uses current-period quantities. Fisher’s Index blends these approaches to mitigate their individual biases and provide a more accurate measure of price changes.
Interpretation of Fisher’s Index Number
The interpretation of Fisher’s Index Number is similar to other index numbers.
- If Fisher’s Index = 100
There is no change in prices or quantities compared to the base year.
- If Fisher’s Index > 100
There is an increase in prices or quantities compared to the base year.
- If Fisher’s Index < 100
There is a decrease in prices or quantities compared to the base year.
Example
- Fisher’s Price Index = 125
- Interpretation: Prices have increased by 25% compared to the base year.
- Fisher’s Price Index = 90
- Interpretation: Prices have decreased by 10% compared to the base year.
Calculation
Fisher’s Index Number is calculated as the geometric mean of the Laspeyres Index and the Paasche Index. The formula for Fisher’s Index Number (I_F) is:
I_F= √(L×P)
where:
- L is the Laspeyres Index
- P is the Paasche Index
1. Laspeyres Index
The Laspeyres Index measures the change in price relative to a base period, using base-period quantities for weighting. The formula is:
L = [ ∑(P1×Q0) / ∑(P0×Q0) ]× 100
where:
- P_1 = Price of the item in the current period
- P_0 = Price of the item in the base period
- Q_0 = Quantity of the item in the base period
2. Paasche Index
The Paasche Index measures the change in price relative to a base period, using current-period quantities for weighting. The formula is:
P = [ ∑(P1×Q1) / ∑(P0×Q1) ]× 100
where:
- Q_1 = Quantity of the item in the current period
Steps to Calculate Fisher’s Index
Un-weighted Index Numbers, Properties, Types
Un-weighted index numbers are simple index numbers where all items are assigned equal importance or weight, regardless of their actual significance or contribution. These index numbers measure relative changes in prices or quantities without considering the quantity consumed or produced. The Simple Aggregative Method and Simple Average of Price Relatives are commonly used techniques. Though easy to compute and understand, un-weighted index numbers may not accurately reflect real economic scenarios because they ignore the actual impact of each item. Therefore, they are mainly used for illustrative or preliminary analysis rather than precise economic measurement.
Properties of Un-weighted Index Numbers:
-
Equal Importance to All Items
Un-weighted index numbers treat all items in the dataset with equal importance, regardless of their actual usage, cost, or impact. This means a low-cost or rarely used item influences the index as much as a high-cost or frequently used item. While this simplifies calculations, it can distort the true picture of economic trends. This property limits the accuracy of un-weighted indices in reflecting real-life consumption or production patterns.
-
Simplicity in Calculation
Un-weighted index numbers are easy to compute because they do not require additional data like weights or quantities. Only the prices or quantities from the base and current periods are needed. This simplicity makes them ideal for quick estimates or introductory statistical analysis. However, this ease comes at the cost of precision and relevance, especially when different items have significantly varied importance or impact in the real-world context.
-
Distorted Representativeness
Because they assign equal weight to all items, un-weighted index numbers may give a distorted representation of overall price or quantity changes. For instance, a major change in a high-volume product could be overshadowed by minor changes in several low-impact items. This lack of representativeness means that un-weighted indices can mislead policymakers or businesses if used for serious economic or financial decision-making.
-
Limited Real-World Application
Due to their disregard for item importance, un-weighted index numbers have limited use in actual business or economic analysis. They are mostly used for academic or theoretical purposes, such as teaching basic statistical concepts. In practical scenarios like inflation tracking or market analysis, weighted index numbers are preferred as they offer a more realistic and reliable measure of change based on actual consumption, sales, or production data.
Types of Un-weighted Index Numbers:
- Simple Aggregative Index Number
This method calculates the index by summing the current period prices and dividing them by the sum of base period prices, multiplied by 100. The formula is:
Simple Aggregative Index = (∑P1 / ∑P0) × 100
Where P1 and P0 are current and base period prices. All items are treated equally, regardless of their significance. While easy to compute, it can be misleading if high-priced items disproportionately affect the result. It is suitable for basic analysis but lacks real-world precision.
-
Simple Average of Price Relatives Index
This method calculates the price relative for each item (current price divided by base price × 100) and then takes the arithmetic mean of all these relatives. Formula:
Simple Average of Price Relatives = [∑(P1 / P0×100)] / n
Where is the number of items. This approach ensures each item has equal influence on the final index, regardless of actual importance. It’s more refined than the aggregative method and reduces the impact of extreme values, but still does not reflect real consumption patterns or weights.
Key differences between Variation and Skewness
Variation refers to the differences or fluctuations in data values within a dataset. In business, understanding variation is essential for making informed decisions, as it helps identify patterns, trends, and inconsistencies in processes or outcomes. Variation can be natural (random) or assignable (caused by specific factors). It occurs in areas like production, sales, customer behavior, and financial metrics. By measuring variation using statistical tools (like range, variance, and standard deviation), businesses can improve quality control, forecast demand, and reduce risks. Effective analysis of variation supports better resource allocation and strategic planning in uncertain environments.
Properties of Variation:
-
Non-Negativity
Variation is always non-negative, meaning its value cannot be less than zero. A variation of zero indicates that all data values are identical, showing no spread. This property ensures that variation is a reliable measure of data dispersion. Since squared differences are used in calculations like variance or standard deviation, negative values are mathematically eliminated, reinforcing consistency in representing the extent of data fluctuations.
-
Basis for Dispersion
Variation serves as the foundation for measuring dispersion in data. It quantifies how much individual values deviate from the mean or central value. Higher variation indicates that data points are widely spread out, while lower variation implies closeness to the average. This helps in comparing datasets and assessing consistency, reliability, and control in business processes and decision-making scenarios like quality control or performance monitoring.
-
Dependence on Data Scale
Variation is scale-dependent, meaning its value is influenced by the units of the data. For example, the variation in centimeters will differ from the same data measured in meters. This property makes direct comparisons across datasets difficult unless standardized. In such cases, coefficient of variation is used to eliminate the unit-based effect and allow fair comparison between different data groups or scales.
-
Influence of Extreme Values
Variation is sensitive to outliers or extreme values. A single unusually high or low value can significantly increase the variation, especially in measures like variance and standard deviation. This sensitivity helps in identifying potential anomalies or quality issues in business processes, but it also means that variation must be interpreted carefully, especially in datasets where extreme values may distort the overall view.
-
Used for Comparative Analysis
Variation allows comparison of consistency between two or more datasets. For example, two production machines might produce the same average output, but one may have a higher variation, indicating less reliability. By analyzing variation, managers can choose better-performing systems or predict future outcomes more effectively. It plays a vital role in fields such as finance, marketing, operations, and quality assurance.
Skewness
Skewness is a statistical measure that describes the asymmetry or deviation from symmetry in a distribution of data. When a dataset is perfectly symmetrical, it has zero skewness. If the data tails more towards the right (positive skew), it indicates that a majority of values are concentrated on the lower end. Conversely, a left tail (negative skew) shows values concentrated on the higher end. Skewness helps in understanding the shape of the data distribution, which is important for choosing appropriate statistical methods, interpreting trends, and making informed business decisions based on non-normal or irregular data patterns.
Properties of Skewness:
-
Direction of Asymmetry
Skewness indicates the direction in which data deviates from symmetry. If the skewness is positive, the tail on the right side of the distribution is longer, indicating more lower values. If it’s negative, the left tail is longer, indicating more higher values. This property helps understand how data is spread around the mean.
-
Impact on Mean and Median
In a skewed distribution, the mean, median, and mode are not equal. In positively skewed data, the mean > median > mode. In negatively skewed data, the mean < median < mode. This helps identify the nature of the distribution and is crucial when selecting the right measure of central tendency for analysis.
-
Quantitative Measure
Skewness is measured using formulas like Pearson’s or Bowley’s coefficient of skewness. These give numerical values where zero represents symmetry, positive values indicate right skew, and negative values indicate left skew. This numerical property allows easy comparison between datasets and helps assess how far a distribution deviates from normality.
-
Unitless Value
Skewness is a dimensionless (unitless) number, meaning it is unaffected by the units of the variable being measured. This allows comparisons of skewness between different datasets, regardless of their scales or units. It also makes skewness a standardized measure, helping in interpreting data shapes across various domains and applications.
-
Sensitivity to Outliers
Skewness is highly sensitive to outliers because extreme values in the data can significantly pull the tail, altering the skewness value. A few large or small values can make an otherwise symmetric distribution appear skewed. This property makes skewness useful in detecting outliers and data irregularities during statistical analysis.
Key differences between Variation and Skewness
| Aspect | Variation | Skewness |
|---|---|---|
| Definition | Dispersion | Asymmetry |
| Focus | Spread | Shape |
| Center Relation | Distance from mean | Tilt of mean |
| Symmetry | Not required | Key factor |
| Direction | None | Left/Right |
| Unit | Square units | Unitless |
| Measure Type | Magnitude | Directional |
| Zero Value Meaning | No variation | Symmetrical |
| Examples | Range, Variance | Skewness Coefficient |
| Application | Consistency check | Distribution shape |
| Used In | Quality Control | Data Normality |
| Calculation Tools | Std. Dev., Variance | Pearson’s/Karl’s |
Significance of Measuring Variation, Properties of Good Variation
Significance of Measuring Variation:
-
Improves Decision Making
Measuring variation helps managers understand the reliability and stability of data. By identifying how much values deviate from the average, decision-makers can assess risks and choose better strategies. For instance, in sales forecasting, recognizing variation in customer demand allows for better inventory planning. Quantifying variation also helps differentiate between normal fluctuations and unusual patterns, leading to more data-driven, informed decisions that align with business goals.
-
Enhances Quality Control
In production and service processes, measuring variation is crucial for maintaining consistent quality. It helps identify deviations from standards and detect defects or process inefficiencies. Tools like control charts and standard deviation enable businesses to monitor performance, reduce errors, and maintain customer satisfaction. By minimizing unnecessary variation, companies can achieve higher quality outputs, reduce costs, and ensure compliance with regulatory or industry standards.
-
Enables Process Improvement
Variation measurement is a foundation for continuous improvement initiatives such as Six Sigma or Total Quality Management. It allows organizations to pinpoint sources of inconsistency and implement targeted improvements. By reducing unwanted variation, businesses can make operations more efficient, predictable, and cost-effective. Over time, this leads to streamlined workflows, reduced waste, and enhanced productivity, giving companies a competitive edge in both manufacturing and service sectors.
-
Assists in Risk Management
Understanding variation helps identify uncertainties and potential risks in business processes. By analyzing variation in financial performance, customer behavior, or supply chain reliability, managers can develop strategies to mitigate risks. For example, consistent variation in supplier delivery times may require contingency planning. Measuring variation allows firms to prepare for worst-case scenarios, allocate resources wisely, and build resilience against market volatility or operational disruptions.
Properties of Good Variation:
- Predictability
Good variation exhibits a consistent and predictable pattern over time. This predictability allows businesses to make reliable forecasts and informed decisions. For example, seasonal sales patterns or daily website traffic variations help managers plan inventory, staffing, or marketing strategies effectively. Predictable variation supports stability in processes, enabling smoother operations and better planning for future trends or demand changes.
- Relevance
A good variation is relevant to the business objective or decision-making process. It should provide meaningful insights that help identify opportunities or problems. For instance, analyzing variation in customer preferences can guide product development. Irrelevant variations, on the other hand, may distract decision-makers. Focusing on relevant variations ensures that the analysis is purpose-driven and aligned with organizational goals, helping managers focus on impactful factors.
- Measurability
Good variation must be quantifiable using statistical methods such as mean, standard deviation, or variance. Measurability ensures that the variation can be analyzed, tracked over time, and compared across different datasets. For example, tracking the variation in daily production output helps monitor consistency. Without measurability, it becomes difficult to evaluate performance or identify areas for improvement, limiting the effectiveness of quantitative analysis.
- Consistency
Good variation maintains a consistent pattern under similar conditions. If the variation changes erratically without any identifiable cause, it may indicate underlying problems. Consistency in variation allows businesses to establish control limits and set performance benchmarks. In manufacturing, for example, consistent variation in product quality indicates a stable process, while inconsistent variation may point to equipment or human error.
-
Informative Value
Good variation provides insights that lead to better decision-making. It should reveal underlying trends, root causes, or patterns that support corrective actions or strategy formulation. For instance, variation in customer complaints across regions can highlight service issues. An informative variation goes beyond raw data and contributes to knowledge generation, making it a valuable input in business intelligence and strategic analysis.
- Controllability
Good variation should be capable of being monitored and controlled to a reasonable extent. If a variation can be managed through process improvement, training, or better systems, it becomes useful for continuous improvement. For example, reducing variation in delivery time improves customer satisfaction. Controllability transforms variation into an opportunity for operational excellence and efficiency, aligning with total quality management principles.
Quantitative Analysis for Business Decisions BU B.Com 1st Semester SEP Notes
| Unit 1 [Book] | |
| Introduction, Meaning, Definitions, Features, Objectives, Functions, Importance and Limitations of Statistics | VIEW |
| Important Terminologies in Statistics: Data, Raw Data, Primary Data, Secondary Data, Population, Census, Survey, Sample Survey, Sampling, Parameter, Unit, Variable, Attribute, Frequency, Seriation, Individual, Discrete and Continuous | VIEW |
| Classification of Data | VIEW |
| Requisites of Good Classification of Data | VIEW |
| Types of Classification Quantitative and Qualitative Classification | VIEW |
| Unit 2 [Book] | |
| Types of Presentation of Data Textual Presentation | VIEW |
| Tabular Presentation | VIEW |
| One-way Table | VIEW |
| Important Terminologies: Variable, Quantitative Variable, Qualitative Variable, Discrete Variable, Continuous Variable, Dependent Variable, Independent Variable, Frequency, Class Interval, Tally Bar | VIEW |
| Diagrammatic and Graphical Presentation, Rules for Construction of Diagrams and Graphs | VIEW |
| Types of Diagrams: One Dimensional Simple Bar Diagram, Sub-divided Bar Diagram, Multiple Bar Diagram, Percentage Bar Diagram Two-Dimensional Diagram Pie Chart, Graphs | VIEW |
| Unit 3 [Book] | |
| Meaning and Objectives of Measures of Tendency, Definition of Central Tendency | VIEW |
| Requisites of an Ideal Average | VIEW |
| Types of Averages, Arithmetic Mean, Median, Mode (Direct method only) | VIEW |
| Empirical Relation between Mean, Median and Mode | VIEW |
| Graphical Representation of Median & Mode | VIEW |
| Ogive Curves | VIEW |
| Histogram | VIEW |
| Meaning of Dispersion | VIEW |
| Standard Deviation, Co-efficient of Variation-Problems | VIEW |
| Unit 4 [Book] | |
| Significance of Measuring Variation, Properties of Good Variation | VIEW |
| Methods of Studying Variation-Absolute and Relative Measure of Variation | VIEW |
| Standard Deviation | VIEW |
| Co-efficient of Variation | VIEW |
| Skewness, Introduction | VIEW |
| Differences between Variation and Skewness | VIEW |
| Measures of Skewness | VIEW |
| Karl Pearson’s Co-efficient of Skewness | VIEW |
| Unit 5 [Book] | |
| Introduction, Uses of Index Number | VIEW |
| Classification of Index Numbers | VIEW |
| Methods of Constructing Index Numbers | VIEW |
| Un-weighted Index Numbers | VIEW |
| Simple Aggregative Method, Simple Average Relative Method, Weighted Index Numbers, Weighted Aggregative Index numbers | VIEW |
| Fishers Ideal Index number | VIEW |
| Test of Perfection: Time Reversal Test, Factor Reversal Test | VIEW |
| Weighted Average of Relative Index Numbers | VIEW |
Type-I and Type-II Errors
In statistical hypothesis testing, a type I error is the incorrect rejection of a true null hypothesis (also known as a “false positive” finding), while a type II error is incorrectly retaining a false null hypothesis (also known as a “false negative” finding). More simply stated, a type I error is to falsely infer the existence of something that is not there, while a type II error is to falsely infer the absence of something that is.
A type I error (or error of the first kind) is the incorrect rejection of a true null hypothesis. Usually a type I error leads one to conclude that a supposed effect or relationship exists when in fact it doesn’t. Examples of type I errors include a test that shows a patient to have a disease when in fact the patient does not have the disease, a fire alarm going on indicating a fire when in fact there is no fire, or an experiment indicating that a medical treatment should cure a disease when in fact it does not.
A type II error (or error of the second kind) is the failure to reject a false null hypothesis. Examples of type II errors would be a blood test failing to detect the disease it was designed to detect, in a patient who really has the disease; a fire breaking out and the fire alarm does not ring; or a clinical trial of a medical treatment failing to show that the treatment works when really it does.
When comparing two means, concluding the means were different when in reality they were not different would be a Type I error; concluding the means were not different when in reality they were different would be a Type II error. Various extensions have been suggested as “Type III errors”, though none have wide use.
All statistical hypothesis tests have a probability of making type I and type II errors. For example, all blood tests for a disease will falsely detect the disease in some proportion of people who don’t have it, and will fail to detect the disease in some proportion of people who do have it. A test’s probability of making a type I error is denoted by α. A test’s probability of making a type II error is denoted by β. These error rates are traded off against each other: for any given sample set, the effort to reduce one type of error generally results in increasing the other type of error. For a given test, the only way to reduce both error rates is to increase the sample size, and this may not be feasible.
Type I error
A type I error occurs when the null hypothesis (H0) is true, but is rejected. It is asserting something that is absent, a false hit. A type I error may be likened to a so-called false positive (a result that indicates that a given condition is present when it actually is not present).
In terms of folk tales, an investigator may see the wolf when there is none (“raising a false alarm”). Where the null hypothesis, H0, is: no wolf.
The type I error rate or significance level is the probability of rejecting the null hypothesis given that it is true. It is denoted by the Greek letter α (alpha) and is also called the alpha level. Often, the significance level is set to 0.05 (5%), implying that it is acceptable to have a 5% probability of incorrectly rejecting the null hypothesis.
Type II error
A type II error occurs when the null hypothesis is false, but erroneously fails to be rejected. It is failing to assert what is present, a miss. A type II error may be compared with a so-called false negative (where an actual ‘hit’ was disregarded by the test and seen as a ‘miss’) in a test checking for a single condition with a definitive result of true or false. A Type II error is committed when we fail to believe a true alternative hypothesis.
In terms of folk tales, an investigator may fail to see the wolf when it is present (“failing to raise an alarm”). Again, H0: no wolf.
The rate of the type II error is denoted by the Greek letter β (beta) and related to the power of a test (which equals 1−β).
| Aspect |
Type-I Error (False Positive) |
Type-II Error (False Negative) |
|---|---|---|
| Definition | Rejecting a true null hypothesis. | Failing to reject a false null hypothesis. |
| Symbol | Denoted as α (significance level). | Denoted as β. |
| Outcome | Concluding that there is an effect when there isn’t. | Concluding that there is no effect when there is. |
| Risk | Risk of concluding a false discovery. | Risk of missing a true effect. |
| Example | Concluding a new drug is effective when it isn’t. | Concluding a drug is ineffective when it is. |
| Critical Value | Occurs when the test statistic exceeds the critical value. | Occurs when the test statistic does not exceed the critical value. |
| Relation to Power | As α decreases, the probability of Type-I error decreases. | As β increases, the probability of Type-II error increases. |
| Control | Controlled by choosing the significance level (α). | Controlled by increasing the sample size or improving the test’s power. |
Z-Test, T-Test
T-test
A t-test is a statistical test used to determine if there is a significant difference between the means of two independent groups or samples. It allows researchers to assess whether the observed difference in sample means is likely due to a real difference in population means or just due to random chance.
The t-test is based on the t-distribution, which is a probability distribution that takes into account the sample size and the variability within the samples. The shape of the t-distribution is similar to the normal distribution, but it has fatter tails, which accounts for the greater uncertainty associated with smaller sample sizes.
Assumptions of T-test
The t-test relies on several assumptions to ensure the validity of its results. It is important to understand and meet these assumptions when performing a t-test.
- Independence:
The observations within each sample should be independent of each other. In other words, the values in one sample should not be influenced by or dependent on the values in the other sample.
- Normality:
The populations from which the samples are drawn should follow a normal distribution. While the t-test is fairly robust to departures from normality, it is more accurate when the data approximate a normal distribution. However, if the sample sizes are large enough (typically greater than 30), the t-test can be applied even if the data are not perfectly normally distributed due to the Central Limit Theorem.
- Homogeneity of variances:
The variances of the populations from which the samples are drawn should be approximately equal. This assumption is also referred to as homoscedasticity. Violations of this assumption can affect the accuracy of the t-test results. In cases where the variances are unequal, there are modified versions of the t-test that can be used, such as the Welch’s t-test.
Types of T-test
There are three main types of t-tests:
- Independent samples t-test:
This type of t-test is used when you want to compare the means of two independent groups or samples. For example, you might compare the mean test scores of students who received a particular teaching method (Group A) with the mean test scores of students who received a different teaching method (Group B). The test determines if the observed difference in means is statistically significant.
- Paired samples t-test:
This t-test is used when you want to compare the means of two related or paired samples. For instance, you might measure the blood pressure of individuals before and after a treatment and want to determine if there is a significant difference in blood pressure levels. The paired samples t-test accounts for the correlation between the two measurements within each pair.
- One-sample t-test:
This t-test is used when you want to compare the mean of a single sample to a known or hypothesized population mean. It allows you to assess if the sample mean is significantly different from the population mean. For example, you might want to determine if the average weight of a sample of individuals is significantly different from a specified value.
The t-test also involves specifying a level of significance (e.g., 0.05) to determine the threshold for considering a result statistically significant. If the calculated t-value falls beyond the critical value for the chosen significance level, it suggests a significant difference between the means.
Z-test
A z-test is a statistical test used to determine if there is a significant difference between a sample mean and a known population mean. It allows researchers to assess whether the observed difference in sample mean is statistically significant.
The z-test is based on the standard normal distribution, also known as the z-distribution. Unlike the t-distribution used in the t-test, the z-distribution is a well-defined probability distribution with known properties.
The z-test is typically used when the sample size is large (typically greater than 30) and either the population standard deviation is known or the sample standard deviation can be a good estimate of the population standard deviation.
Steps Involved in Conducting a Z-test
- Formulate hypotheses:
Start by stating the null hypothesis (H0) and alternative hypothesis (Ha) about the population mean. The null hypothesis typically assumes that there is no significant difference between the sample mean and the population mean.
- Calculate the test statistic:
The test statistic for a z-test is calculated as (sample mean – population mean) / (population standard deviation / sqrt(sample size)). This represents how many standard deviations the sample mean is away from the population mean.
- Determine the critical value:
The critical value is a threshold based on the chosen level of significance (e.g., 0.05) that determines whether the observed difference is statistically significant. The critical value is obtained from the z-distribution.
- Compare the test statistic with the critical value:
If the absolute value of the test statistic exceeds the critical value, it suggests a statistically significant difference between the sample mean and the population mean. In this case, the null hypothesis is rejected in favor of the alternative hypothesis.
- Calculate the p-value (optional):
The p-value represents the probability of obtaining a test statistic as extreme as, or more extreme than, the observed value, assuming the null hypothesis is true. If the p-value is smaller than the chosen level of significance, it indicates a statistically significant difference.
Assumptions of Z-test
- Random sample:
The sample should be randomly selected from the population of interest. This means that each member of the population has an equal chance of being included in the sample, ensuring representativeness.
- Independence:
The observations within the sample should be independent of each other. Each data point should not be influenced by or dependent on any other data point in the sample.
- Normal distribution or large sample size:
The z-test assumes that the population from which the sample is drawn follows a normal distribution. Alternatively, the sample size should be large enough (typically greater than 30) for the central limit theorem to apply. The central limit theorem states that the distribution of the sample mean approaches a normal distribution as the sample size increases, regardless of the shape of the population distribution.
- Known population standard deviation:
The z-test assumes that the population standard deviation (or variance) is known. This assumption is necessary for calculating the z-score, which is the test statistic used in the z-test.
Key differences between T-test and Z-test
| Feature | T-Test | Z-Test |
| Purpose | Compare means of two independent or related samples | Compare mean of a sample to a known population mean |
| Distribution | T-Distribution | Standard Normal Distribution (Z-Distribution) |
| Sample Size | Small (typically < 30) | Large (typically > 30) |
| Population SD | Unknown or estimated from the sample | Known or assumed |
| Test Statistic | (Sample mean – Population mean) / (Standard error) | (Sample mean – Population mean) / (Population SD) |
| Assumption | Normality of populations, Independence | Normality (or large sample size), Independence |
| Variances | Assumes potentially unequal variances | Assumes equal variances (homoscedasticity) |
| Degrees of Freedom | (n1 + n2 – 2) for independent samples t-test | n – 1 for one-sample t-test, (n1 + n2 – 2) for others |
| Critical Values | Vary based on degrees of freedom and level of significance. | Fixed critical values based on level of significance |
| Use Cases | Comparing means of two groups, before-after analysis | Comparing a sample mean to a known population mean |