Adaptive

Learn Biostatistics

Read the notes, then try the practice. It adapts as you go.When you're ready.

Session Length

~17 min

Adaptive Checks

15 questions

Transfer Probes

Lesson Notes Key Concepts Concept Map Worked Example Start Adaptive Practice

Lesson Notes

Biostatistics is the branch of statistics that applies mathematical and statistical methods to the design, analysis, and interpretation of data in the biological and health sciences. It provides the quantitative backbone for medical research, epidemiology, public health, genomics, and clinical trials. Without biostatistics, researchers would have no rigorous way to determine whether a new drug actually works, whether a risk factor truly causes disease, or whether a public health intervention is saving lives.

The field emerged in the late nineteenth and early twentieth centuries through the pioneering work of Francis Galton, Karl Pearson, and Ronald Fisher, who developed foundational techniques such as regression analysis, correlation, and the analysis of variance. These methods were originally created to study biological variation and heredity, and they remain central to modern biostatistics. Today the discipline has expanded to encompass survival analysis, longitudinal data modeling, Bayesian inference, and high-dimensional methods for genomics and proteomics data.

Biostatistics plays a critical role in evidence-based medicine and regulatory science. The design and analysis of randomized controlled trials, the gold standard for evaluating medical interventions, is fundamentally a biostatistical enterprise. Government agencies such as the FDA and EMA rely on biostatistical evidence to approve new therapies. In public health, biostatisticians model disease outbreaks, estimate vaccine effectiveness, and analyze health disparities across populations, making the field indispensable to modern healthcare and scientific discovery.

You'll be able to:

Identify appropriate statistical methods for analyzing biological and clinical data including survival and longitudinal studies
Apply hypothesis testing, regression modeling, and power analysis to design rigorous biomedical research studies
Analyze epidemiological data using measures of association, confounding adjustment, and causal inference techniques
Evaluate published biostatistical analyses by assessing assumptions, sample adequacy, and interpretation validity

One step at a time.

Interactive Exploration

Adjust the controls and watch the concepts respond in real time.

Key Concepts

Hypothesis Testing

A formal statistical procedure for deciding whether observed data provide sufficient evidence to reject a null hypothesis. It involves setting a significance level (alpha), computing a test statistic, and comparing the resulting p-value to alpha to draw conclusions.

Example: A clinical researcher tests whether a new blood pressure medication lowers systolic pressure more than a placebo by comparing mean reductions using a two-sample t-test at alpha = 0.05.

P-Value

The probability of observing data as extreme as, or more extreme than, the observed results under the assumption that the null hypothesis is true. A small p-value suggests the observed effect is unlikely due to chance alone.

Example: A study finds that patients on a new drug have a mean recovery time 2 days shorter than the control group with p = 0.003, indicating strong evidence against the null hypothesis of no difference.

Confidence Interval

A range of values, derived from sample data, that is likely to contain the true population parameter with a specified level of confidence (commonly 95%). It communicates both the estimate and the uncertainty around it.

Example: A vaccine trial estimates efficacy at 91% with a 95% confidence interval of 87% to 94%, meaning researchers are 95% confident the true efficacy lies within that range.

Randomized Controlled Trial (RCT)

An experimental study design in which participants are randomly assigned to treatment or control groups to minimize bias and confounding. RCTs are considered the gold standard for establishing causal relationships in clinical research.

Example: A Phase III trial randomly assigns 10,000 participants to receive either a new cancer immunotherapy or standard chemotherapy, then compares five-year survival rates between the two groups.

Survival Analysis

A set of statistical methods for analyzing time-to-event data, where the outcome of interest is the time until an event such as death, disease recurrence, or equipment failure occurs. It handles censored observations where the event has not yet occurred.

Example: Researchers use Kaplan-Meier curves and the log-rank test to compare overall survival times between patients receiving two different chemotherapy regimens.

Logistic Regression

A regression method used when the outcome variable is binary (e.g., disease present or absent). It models the log-odds of the outcome as a linear function of predictor variables and produces odds ratios as measures of association.

Example: An epidemiologist uses logistic regression to estimate the odds ratio of developing type 2 diabetes associated with obesity, adjusting for age, sex, and physical activity level.

Multiple Testing Correction

Statistical adjustments made when performing many simultaneous hypothesis tests to control the overall probability of false positives. Common methods include the Bonferroni correction and the Benjamini-Hochberg procedure for controlling the false discovery rate.

Example: In a genome-wide association study testing 500,000 SNPs, the Bonferroni-corrected significance threshold becomes 0.05 / 500,000 = 1 x 10^-7 instead of the standard 0.05.

Power Analysis

A method used to determine the sample size needed for a study to detect a meaningful effect with a specified probability (statistical power, typically 80% or 90%). It depends on the expected effect size, significance level, and variability in the data.

Example: Before launching a clinical trial, biostatisticians calculate that 400 patients per group are needed to detect a 15% improvement in response rate with 80% power at a two-sided alpha of 0.05.

More terms are available in the glossary.

Explore your way

Choose a different way to engage with this topic — no grading, just richer thinking.

Explore your way — choose one:

Explore with AI →

Concept Map

See how the key ideas connect. Nodes color in as you practice.

Worked Example

Walk through a solved problem step-by-step. Try predicting each step before revealing it.

Adaptive Practice

This is guided practice, not just a quiz. Hints and pacing adjust in real time.

Small steps add up.

What you get while practicing:

Math Lens cues for what to look for and what to ignore.
Progressive hints (direction, rule, then apply).
Targeted feedback when a common misconception appears.

Teach It Back

The best way to know if you understand something: explain it in your own words.

Keep Practicing

More ways to strengthen what you just learned.

Flashcards Mixed Practice Mistake Journal