Skip to content
Adaptive

Learn Statistics

Read the notes, then try the practice. It adapts as you go.When you're ready.

Session Length

~13 min

Adaptive Checks

12 questions

Transfer Probes

6

Lesson Notes

Statistics is the science of collecting, organizing, analyzing, interpreting, and presenting data. At its core, descriptive statistics provides tools for summarizing datasets through measures of central tendency such as the mean, median, and mode, as well as measures of variability like standard deviation and variance. These foundational techniques allow researchers, analysts, and decision-makers to distill large volumes of raw data into meaningful summaries, charts, and tables that reveal underlying patterns and trends.

Inferential statistics extends beyond mere description by enabling researchers to draw conclusions about entire populations based on sample data. Through hypothesis testing, confidence intervals, and regression analysis, statisticians can make probabilistic statements about relationships and effects while quantifying the uncertainty inherent in their conclusions. Probability theory serves as the mathematical backbone of inferential statistics, providing the formal framework for reasoning about randomness, likelihood, and the behavior of data under various assumptions such as the normal distribution.

The applications of statistics permeate virtually every field of modern inquiry. In medicine, clinical trials rely on statistical methods to determine whether new treatments are effective. In business, A/B testing and predictive analytics drive product decisions and marketing strategies. Social scientists use survey sampling and regression to study human behavior, while engineers apply statistical process control to maintain manufacturing quality. The rise of big data and machine learning has only amplified the importance of statistical thinking, making it an indispensable skill for anyone working with quantitative information in the 21st century.

You'll be able to:

  • Apply probability distributions to model uncertainty and calculate expected values in practical real-world decision-making scenarios accurately
  • Design and evaluate hypothesis tests using appropriate significance levels, test statistics, and power analysis procedures
  • Analyze regression models to identify relationships between variables and assess predictive accuracy using diagnostic measures
  • Interpret confidence intervals and p-values correctly while identifying and avoiding common statistical reasoning errors and misapplications

One step at a time.

Data visualization and statistical analysis
Making sense of dataPexels

Interactive Exploration

Adjust the controls and watch the concepts respond in real time.

Key Concepts

Mean, Median, and Mode

The three primary measures of central tendency. The mean is the arithmetic average, the median is the middle value when data are ordered, and the mode is the most frequently occurring value. Each measure captures a different aspect of a dataset's center.

Example: For the dataset {2, 3, 3, 7, 10}, the mean is 5, the median is 3, and the mode is 3. In this right-skewed distribution, the median better represents the typical value than the mean.

Standard Deviation

A measure of the spread or dispersion of a dataset relative to its mean. It is calculated as the square root of the variance, which is the average of squared deviations from the mean. A low standard deviation indicates data points cluster near the mean, while a high value indicates greater spread.

Example: If exam scores are {70, 75, 80, 85, 90}, the mean is 80 and the standard deviation is approximately 7.07, indicating most scores fall within about 7 points of the average.

Normal Distribution

A symmetric, bell-shaped probability distribution defined by its mean $\mu$ and standard deviation $\sigma$. It is fundamental to statistics because of the Central Limit Theorem, which states that sample means tend toward a normal distribution regardless of the population's shape. Approximately 68% of data fall within one standard deviation of the mean, 95% within two, and 99.7% within three.

Bell curve showing standard deviation zones

Example: Human heights within a given sex and age group approximate a normal distribution. If the mean male height is 70 inches with a standard deviation of 3 inches, about 95% of men are between 64 and 76 inches tall.

Hypothesis Testing

A formal procedure for using sample data to evaluate claims about a population. The process involves stating a null hypothesis (no effect or no difference) and an alternative hypothesis, calculating a test statistic, and determining whether the evidence is strong enough to reject the null hypothesis at a chosen significance level.

Example: A pharmaceutical company tests whether a new drug lowers blood pressure more than a placebo. The null hypothesis is that there is no difference. If the test yields a p-value of 0.02 at a significance level of 0.05, the null hypothesis is rejected.

P-Value

The probability of obtaining results at least as extreme as the observed results, assuming the null hypothesis is true. A small p-value suggests that the observed data are unlikely under the null hypothesis, providing evidence against it. It does not measure the probability that the null hypothesis is true.

Example: A p-value of 0.03 means there is a 3% chance of observing data this extreme if the null hypothesis were true. At a significance level of 0.05, this would lead to rejecting the null hypothesis.

Confidence Intervals

A range of values, derived from sample data, that is likely to contain the true population parameter with a specified level of confidence. A 95% confidence interval means that if the same sampling procedure were repeated many times, approximately 95% of the constructed intervals would contain the true parameter.

Example: A survey finds that 54% of voters favor a candidate, with a 95% confidence interval of 51% to 57%. This means we are 95% confident the true proportion of all voters who favor the candidate is between 51% and 57%.

Regression Analysis

A set of statistical methods for estimating the relationship between a dependent variable and one or more independent variables. Linear regression fits a straight line to the data, while multiple regression and nonlinear regression handle more complex relationships. It is widely used for prediction and understanding causal factors.

Scatter plot with regression line showing correlation

Example: A real estate analyst uses multiple regression to predict house prices based on square footage, number of bedrooms, and neighborhood. The equation might be: $\text{Price} = 50{,}000 + 150(\text{sqft}) + 10{,}000(\text{bedrooms}) + 30{,}000(\text{neighborhood rating})$.

Correlation

A statistical measure that quantifies the strength and direction of the linear relationship between two variables. The Pearson correlation coefficient ranges from $-1$ (perfect negative correlation) to $+1$ (perfect positive correlation), with 0 indicating no linear relationship. Correlation does not imply causation.

Example: The correlation between hours studied and exam scores might be $r = 0.85$, indicating a strong positive relationship. However, this alone does not prove that studying more causes higher scores, as confounding variables may exist.

More terms are available in the glossary.

Explore your way

Choose a different way to engage with this topic — no grading, just richer thinking.

Explore your way — choose one:

Explore with AI →

Concept Map

See how the key ideas connect. Nodes color in as you practice.

Worked Example

Walk through a solved problem step-by-step. Try predicting each step before revealing it.

Adaptive Practice

This is guided practice, not just a quiz. Hints and pacing adjust in real time.

Small steps add up.

What you get while practicing:

  • Math Lens cues for what to look for and what to ignore.
  • Progressive hints (direction, rule, then apply).
  • Targeted feedback when a common misconception appears.

Teach It Back

The best way to know if you understand something: explain it in your own words.

Keep Practicing

More ways to strengthen what you just learned.

Statistics Adaptive Course - Learn with AI Support | PiqCue