
Learn Optimization

Read the notes, then try the practice. It adapts as you go.

Session Length

~17 min

Adaptive Checks

15 questions

Transfer Probes

8

Lesson Notes

Optimization is the mathematical discipline concerned with finding the best possible solution from a set of feasible alternatives, subject to given constraints. At its core, optimization involves defining an objective function that quantifies what you want to maximize or minimize, identifying decision variables that you can control, and specifying constraints that limit the range of allowable solutions. From allocating limited resources in a manufacturing plant to training deep neural networks, optimization provides the formal framework for making the best decisions under given conditions.

The field encompasses a rich taxonomy of problem types and solution methods. Linear programming addresses problems where both the objective function and constraints are linear, and it can be solved efficiently using the simplex method or interior-point algorithms. Nonlinear programming handles more general problems where the objective or constraints involve nonlinear relationships. Convex optimization, a powerful subclass of nonlinear programming, guarantees that any local minimum is also a global minimum, making it tractable for large-scale applications. Beyond these continuous methods, combinatorial and integer programming tackle problems where decision variables must take discrete values, such as scheduling, routing, and assignment problems. Metaheuristic approaches like genetic algorithms, simulated annealing, and particle swarm optimization provide approximate solutions for problems that are too complex for exact methods.

Optimization has become indispensable across virtually every quantitative discipline. In engineering, it drives structural design, control system tuning, and signal processing. In economics and operations research, it underpins resource allocation, portfolio theory, and supply chain management. In machine learning, gradient-based optimization algorithms such as stochastic gradient descent are the engines that train models on vast datasets. The growth of computational power and algorithmic advances have expanded the scale and complexity of problems that can be solved, making optimization one of the most impactful areas of applied mathematics in the modern era.

You'll be able to:

  • Apply gradient descent and convex optimization algorithms to solve unconstrained and constrained minimization problems efficiently
  • Evaluate the convergence properties and computational complexity of optimization methods including Newton, quasi-Newton, and interior-point approaches
  • Analyze the duality theory of linear and nonlinear programs and interpret Karush-Kuhn-Tucker conditions for optimality
  • Design metaheuristic strategies including genetic algorithms and simulated annealing for solving combinatorial optimization problems

One step at a time.

Key Concepts

Objective Function

A mathematical function that quantifies the goal of an optimization problem, expressing the quantity to be maximized (such as profit or efficiency) or minimized (such as cost or error). The optimizer searches for the values of decision variables that yield the best value of this function.

Example: A manufacturer wants to minimize total production cost C = 5x + 8y, where x and y are the quantities of two products. The goal is to find the values of x and y that make C as small as possible while meeting demand.
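The cost objective above is just a function of the decision variables, so it can be written directly in code and evaluated over candidate plans. A minimal sketch (the candidate quantities are illustrative, not from the text):

```python
def cost(x, y):
    # Objective from the example: total production cost C = 5x + 8y.
    return 5 * x + 8 * y

# Evaluate a few hypothetical production plans; lower cost is better.
candidates = [(10, 5), (6, 8), (12, 2)]
best = min(candidates, key=lambda p: cost(*p))
print(best, cost(*best))  # (12, 2) 76
```

In a real problem the optimizer searches the whole feasible set rather than a hand-picked list, but the role of the objective function is the same: it ranks candidate solutions.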

Constraints

Mathematical inequalities or equalities that define the set of feasible solutions in an optimization problem. Constraints represent physical, financial, or logical limitations that the decision variables must satisfy.

Example: A factory has 40 hours of machine time available per week. If product A requires 2 hours and product B requires 4 hours, the constraint is 2x + 4y <= 40, limiting the combinations of A and B that can be produced.

Linear Programming

A method for optimizing a linear objective function subject to linear equality and inequality constraints. It is one of the most widely used optimization techniques due to its computational efficiency and broad applicability.

Example: An airline uses linear programming to assign crews to flights, minimizing total labor cost while ensuring every flight is staffed and each crew member respects rest-time regulations.
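The airline problem involves thousands of variables, but the core geometric fact, that a linear program's optimum lies at a vertex of the feasible polytope, can be sketched on a tiny two-product problem. This is a teaching device using vertex enumeration, not the simplex method, and all coefficients below are made up:

```python
from itertools import combinations

# Hypothetical LP: maximize profit 3x + 5y subject to
#   2x + 4y <= 40  (machine time), x <= 12 (demand cap), x >= 0, y >= 0.
# Each constraint is stored as (a, b, c) meaning a*x + b*y <= c.
constraints = [(2, 4, 40), (1, 0, 12), (-1, 0, 0), (0, -1, 0)]

def intersect(c1, c2):
    # Corner point where two constraint boundary lines meet (2x2 solve).
    (a1, b1, r1), (a2, b2, r2) = c1, c2
    det = a1 * b2 - a2 * b1
    if abs(det) < 1e-12:
        return None  # parallel boundaries: no unique corner
    x = (r1 * b2 - r2 * b1) / det
    y = (a1 * r2 - a2 * r1) / det
    return x, y

def feasible(p):
    return all(a * p[0] + b * p[1] <= c + 1e-9 for a, b, c in constraints)

# Enumerate feasible vertices; the LP optimum must be one of them.
vertices = [v for c1, c2 in combinations(constraints, 2)
            if (v := intersect(c1, c2)) and feasible(v)]
best = max(vertices, key=lambda p: 3 * p[0] + 5 * p[1])
print(best)  # (12.0, 4.0), profit 56
```

Real solvers (simplex, interior-point) exploit this vertex property without enumerating every corner, which is what makes them scale to millions of variables.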

Gradient Descent

An iterative first-order optimization algorithm that finds a local minimum of a differentiable function by repeatedly moving in the direction of the negative gradient (steepest descent). The step size is controlled by a learning rate parameter.

Example: In training a neural network, gradient descent computes the partial derivatives of the loss function with respect to each weight, then updates all weights by a small step in the direction that reduces the loss.
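The update rule is the same in one dimension as it is for millions of neural-network weights. A minimal sketch on a simple function (the learning rate and step count are arbitrary choices):

```python
def grad_descent(grad, x0, lr=0.1, steps=200):
    # Repeatedly step opposite the gradient; lr is the learning rate.
    x = x0
    for _ in range(steps):
        x = x - lr * grad(x)
    return x

# Minimize f(x) = (x - 3)^2, whose gradient is 2*(x - 3).
x_min = grad_descent(lambda x: 2 * (x - 3), x0=0.0)
print(round(x_min, 4))  # 3.0
```

Each step shrinks the distance to the minimizer by a constant factor here; too large a learning rate would overshoot and diverge instead.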

Convex Optimization

A subfield of optimization dealing with problems where the objective function is convex and the feasible set is a convex set. The key property is that any local minimum is guaranteed to be a global minimum, making these problems efficiently solvable.

Example: Portfolio optimization in finance, where the goal is to minimize the variance of portfolio returns (a convex quadratic function) subject to linear constraints on asset weights, is a classic convex optimization problem.
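For just two assets, the minimum-variance portfolio has a closed form: substituting w2 = 1 - w1 into the budget constraint turns the problem into a one-dimensional convex quadratic. The covariance numbers below are made up for illustration:

```python
# Illustrative asset variances and covariance (not real market data).
var1, var2, cov12 = 0.04, 0.09, 0.01

# Minimizer of V(w) = var1*w^2 + var2*(1-w)^2 + 2*cov12*w*(1-w),
# obtained by setting dV/dw = 0 (convexity guarantees a global minimum).
w1 = (var2 - cov12) / (var1 + var2 - 2 * cov12)
w2 = 1 - w1
portfolio_var = var1 * w1**2 + var2 * w2**2 + 2 * cov12 * w1 * w2
print(w1, w2, portfolio_var)
```

Because the objective is convex, this single stationary point is the global minimum; the mixed portfolio's variance is below that of either asset alone.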

Feasible Region

The set of all points (variable values) that satisfy every constraint in an optimization problem simultaneously. The optimal solution must lie within this region. In linear programming, the feasible region is a convex polytope.

Example: If constraints require x >= 0, y >= 0, and x + y <= 10, the feasible region is the triangle in the first quadrant bounded by the axes and the line x + y = 10.
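The triangular region from the example can be written as a membership test, where a point is feasible only if every constraint holds simultaneously:

```python
def in_region(x, y):
    # All three constraints must hold at once: x >= 0, y >= 0, x + y <= 10.
    return x >= 0 and y >= 0 and x + y <= 10

print(in_region(3, 4))    # True: inside the triangle
print(in_region(6, 6))    # False: 6 + 6 = 12 > 10
print(in_region(-1, 2))   # False: violates x >= 0
```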

Global vs. Local Optima

A local optimum is a solution that is better than all nearby solutions, while a global optimum is the best solution over the entire feasible region. Nonconvex problems may have multiple local optima, making the search for the global optimum challenging.

Example: A function like f(x) = x^4 - 2x^2 + x has two valleys separated by a peak. A gradient-based optimizer can get trapped in the shallower valley (a local minimum) instead of reaching the deepest valley, which is the global minimum.
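Running the same gradient descent on f(x) = x^4 - 2x^2 + x from two different starting points shows this basin-of-attraction effect directly (the starting points and step size are chosen for illustration):

```python
def f(x):
    return x**4 - 2 * x**2 + x

def df(x):
    # Derivative of f, used as the descent direction.
    return 4 * x**3 - 4 * x + 1

def descend(x, lr=0.01, steps=5000):
    for _ in range(steps):
        x -= lr * df(x)
    return x

# Identical algorithm, different basins, different answers.
local = descend(1.0)     # settles in the shallow valley near x ~ 0.84
glob = descend(-1.5)     # settles in the deep valley near x ~ -1.11
print(f(local), f(glob))  # f(glob) is lower: the global minimum
```

This is why nonconvex training in practice often uses multiple restarts, momentum, or stochastic noise to escape poor basins.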

Lagrange Multipliers

A mathematical method for finding the extrema of a function subject to equality constraints. It introduces auxiliary variables (multipliers) that measure the sensitivity of the optimal value to changes in the constraint, providing both the optimal solution and shadow prices.

Example: To maximize utility U(x, y) = xy subject to the budget constraint 2x + 3y = 60, you form the Lagrangian L = xy - lambda(2x + 3y - 60) and solve the system of equations to find the optimal consumption bundle.
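The stationarity conditions of that Lagrangian can be solved by hand; a sketch of the algebra carried out in Python:

```python
# Lagrangian from the example: L = x*y - lam*(2x + 3y - 60).
# Stationarity:
#   dL/dx = y - 2*lam = 0  ->  y = 2*lam
#   dL/dy = x - 3*lam = 0  ->  x = 3*lam
# Substituting into the budget constraint:
#   2*(3*lam) + 3*(2*lam) = 12*lam = 60
lam = 60 / 12
x, y = 3 * lam, 2 * lam
print(x, y, x * y)  # 15.0 10.0 150.0
```

The multiplier lam = 5 is the shadow price: the approximate gain in utility per extra unit of budget.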

More terms are available in the glossary.

Explore your way

Choose a different way to engage with this topic: no grading, just richer thinking.

Choose one:

Explore with AI →

Concept Map

See how the key ideas connect. Nodes color in as you practice.

Worked Example

Walk through a solved problem step-by-step. Try predicting each step before revealing it.

Adaptive Practice

This is guided practice, not just a quiz. Hints and pacing adjust in real time.

Small steps add up.

What you get while practicing:

  • Math Lens cues for what to look for and what to ignore.
  • Progressive hints (direction, rule, then apply).
  • Targeted feedback when a common misconception appears.

Teach It Back

The best way to know if you understand something: explain it in your own words.

Keep Practicing

More ways to strengthen what you just learned.

Optimization Adaptive Course - Learn with AI Support | PiqCue