Biostatistics Cheat Sheet
The core ideas of Biostatistics distilled into a single, scannable reference — perfect for review or quick lookup.
Quick Reference
Hypothesis Testing
A formal statistical procedure for deciding whether observed data provide sufficient evidence to reject a null hypothesis. It involves setting a significance level (alpha), computing a test statistic, and comparing the resulting p-value to alpha to draw conclusions.
P-Value
The probability of observing data as extreme as, or more extreme than, the observed results under the assumption that the null hypothesis is true. A small p-value suggests the observed effect is unlikely due to chance alone.
Confidence Interval
A range of values, derived from sample data, that is likely to contain the true population parameter with a specified level of confidence (commonly 95%). It communicates both the estimate and the uncertainty around it.
Randomized Controlled Trial (RCT)
An experimental study design in which participants are randomly assigned to treatment or control groups to minimize bias and confounding. RCTs are considered the gold standard for establishing causal relationships in clinical research.
Survival Analysis
A set of statistical methods for analyzing time-to-event data, where the outcome of interest is the time until an event such as death, disease recurrence, or equipment failure occurs. It handles censored observations where the event has not yet occurred.
Logistic Regression
A regression method used when the outcome variable is binary (e.g., disease present or absent). It models the log-odds of the outcome as a linear function of predictor variables and produces odds ratios as measures of association.
Multiple Testing Correction
Statistical adjustments made when performing many simultaneous hypothesis tests to control the overall probability of false positives. Common methods include the Bonferroni correction and the Benjamini-Hochberg procedure for controlling the false discovery rate.
Power Analysis
A method used to determine the sample size needed for a study to detect a meaningful effect with a specified probability (statistical power, typically 80% or 90%). It depends on the expected effect size, significance level, and variability in the data.
Confounding Variable
A variable that is associated with both the exposure and the outcome, potentially distorting the observed relationship between them. Failure to account for confounders can lead to biased estimates of effect.
Kaplan-Meier Estimator
A nonparametric statistic used to estimate the survival function from time-to-event data. It accounts for censored observations and produces a step-function survival curve showing the probability of surviving past each observed event time.
Key Terms at a Glance
Get study tips in your inbox
We'll send you evidence-based study strategies and new cheat sheets as they're published.
We'll notify you about updates. No spam, unsubscribe anytime.