Recurrences and the Master Theorem

When an algorithm solves a problem by calling smaller copies of itself, its running time obeys an equation that refers to itself: the cost on an input of size $n$ is some local work plus the cost of the recursive calls on smaller inputs. Such an equation is a recurrence. Counting loops, as in the previous lesson, no longer suffices; we need techniques to turn a recurrence into a closed $Θ$ -bound. This lesson develops three, in increasing order of power and precision, and closes with the Akra–Bazzi method for the uneven splits the Master Theorem cannot handle.

From a recursive algorithm to a recurrence

$Divide-and-conquer$ is the paradigm CLRS, Skiena, and Erickson all use to introduce recurrences. It has three steps: divide the instance into subproblems, conquer them by recursion, and combine their solutions. Merge sort splits the array in half, sorts each half recursively, and merges the two sorted halves.

Algorithm 1:

\textsc{Merge-Sort}(A, p, r)

— sort

A[p..r]

1
if $p < r$ then
2
$q \gets \floor{(p + r) / 2}$
midpoint
3
call $\textsc{Merge-Sort}(A, p, q)$
sort left half
4
call $\textsc{Merge-Sort}(A, q+1, r)$
sort right half
5
call $\textsc{Merge}(A, p, q, r)$
combine halves
6
return $A$

The $Merge$ subroutine walks the two sorted halves with two pointers, repeatedly copying the smaller front element into the output. It touches each of the $n$ elements a constant number of times, so it costs $Θ (n)$ .

Now read the cost off the structure. On an array of size $n$ :

Divide is computing the midpoint, $Θ (1)$ .
Conquer is two recursive calls, each on $n /2$ elements, costing $2 T (n /2)$ .
Combine is the merge, $Θ (n)$ .

Adding these (and noting that a one-element array is sorted at no cost) gives the recurrence

T (n) = {Θ (1) 2 T (n /2) + Θ (n) if n = 1, if n > 1.

We can write this compactly, and as an inequality (since the combine step costs at most linear), as

T (n) \leq 2 T (n /2) + O (n) ⟹ T (n) = O (n log n),

and the bulk of the work is justifying that implication. This is the equation we must solve. (We freely write $n /2$ rather than $⌊ n /2 ⌋$ and $⌈ n /2 ⌉$ ; the floors and ceilings change the answer by lower-order amounts that the asymptotics absorb. Skiena and CLRS both justify dropping them.¹) Throughout we also assume a constant base case, which lets us ignore the boundary condition when finding the asymptotic order.

Method 1: the recursion tree

The most intuitive method draws the recurrence. Each node is a subproblem labeled with the non-recursive work it does; its children are the subproblems it spawns. Summing all node labels gives $T (n)$ .

For merge sort, the root does $c n$ work and has two children of size $n /2$ . Each of those does $c (n /2)$ work and has two children of size $n /4$ , and so on, until the leaves are size- $1$ subproblems.

Recursion tree for merge sort,

T (n) = 2 T (n /2) + c n

. The right-hand column sums each level:

2^{i}

nodes of size

n / 2^{i}

, each doing

c n / 2^{i}

work, give

2^{i} \cdot c n / 2^{i} = c n

at every level.

Every level sums to $c n$ . The root level is $c n$ ; the next level is $2 \cdot c (n /2) = c n$ ; the level below is $4 \cdot c (n /4) = c n$ . The subproblem sizes shrink by half each level, so the tree has $log_{2} n + 1$ levels (from size $n$ down to size $1$ ), and the bottom level holds the $n$ size- $1$ leaves. Therefore

T (n) = per level c n \times levels (log_{2} n + 1) = c n log_{2} n + c n = Θ (n log n) .

This is the result: merge sort runs in $Θ (n log n)$ time, strictly better than insertion sort's $Θ (n^{2})$ .² The recursion tree also exposes why: the per-level work stays flat at $c n$ while the depth is only logarithmic.

The tree is a derivation, not yet a proof — it asks us to trust the level sums and the level count. When a recurrence is irregular (unequal splits, work that isn't a clean power of $n$ ), the tree still gives a reliable guess, which we then certify with the next method.

recurrence_tree.pypython

import math
from dataclasses import dataclass

@dataclass(frozen=True)
class TreeLevel:
  """
    One level of the recursion tree.\n
    `depth` is 0 at the root; `node_count` is a^depth subproblems;\n
    `subproblem_size` is n / b^depth; `work` is the total non-recursive\n
    work summed across that level's nodes.\n
  """

  depth: int
  node_count: int
  subproblem_size: float
  work: float

@dataclass(frozen=True)
class RecursionTree:
  """
    The fully expanded tree for one input size.\n
    `levels` runs root-to-leaves; `total_work` sums their work; `depth` is\n
    the count of internal (non-leaf) levels, i.e. floor(log_b n); and\n
    `leaf_count` is the number of size-1 subproblems at the bottom.\n
  """

  levels: list[TreeLevel]
  total_work: float
  depth: int
  leaf_count: int

def build_recurrence_tree(
  input_size: int,
  subproblems: int,
  shrink_factor: int,
  combine_coefficient: float = 1.0,
  combine_exponent: float = 1.0,
) -> RecursionTree:
  """
    Expand T(n) = a T(n/b) + f(n) for n = `input_size`, with a =\n
    `subproblems`, b = `shrink_factor`, and a polynomial combine cost\n
    f(m) = `combine_coefficient` * m^`combine_exponent`.\n
    The tree descends until subproblems reach size 1, summing the work of\n
    every node so the total equals the recurrence's value at `input_size`.\n
  """
  # reject inputs the recurrence isn't defined for.
  if input_size < 1:
    raise ValueError("input size n must be >= 1")
  if subproblems < 1:
    raise ValueError("number of subproblems a must be >= 1")
  if shrink_factor < 2:
    raise ValueError("shrink factor b must be an integer >= 2")

  # accumulators for the expanded tree.
  levels: list[TreeLevel] = []
  total_work: float = 0.0

  # walk state: root is one node of size n at depth 0.
  current_size: float = float(input_size)
  node_count: int = 1
  depth: int = 0

  # expand internal levels: stop once subproblems have shrunk to size 1.
  while current_size > 1:
    # this level's nodes each pay f(size); sum it across the level.
    work_per_node: float = combine_coefficient * (current_size**combine_exponent)
    level_work: float = node_count * work_per_node

    levels.append(
      TreeLevel(
        depth=depth,
        node_count=node_count,
        subproblem_size=current_size,
        work=level_work,
      )
    )
    total_work += level_work

    # descend: a-fold more nodes, each b times smaller.
    node_count *= subproblems
    current_size /= shrink_factor
    depth += 1

  # the leaf level: size-1 subproblems each costing a constant (taken as 1).
  leaf_count: int = node_count
  leaf_work: float = float(leaf_count)

  levels.append(
    TreeLevel(
      depth=depth,
      node_count=leaf_count,
      subproblem_size=1.0,
      work=leaf_work,
    )
  )
  total_work += leaf_work

  return RecursionTree(
    levels=levels,
    total_work=total_work,
    depth=depth,
    leaf_count=leaf_count,
  )

def per_level_work(tree: RecursionTree) -> list[float]:
  """
    The work at each level, root first — the column the lesson sums.\n
    For merge sort (a = b = 2, f(n) = n) every internal level equals n;\n
    for a root-heavy recurrence it shrinks geometrically from the root.\n
  """
  return [level.work for level in tree.levels]

def tree_depth(input_size: int, shrink_factor: int) -> int:
  """
    The number of times n can be divided by b before reaching 1,\n
    i.e. floor(log_b n) — the count of internal levels in the tree.\n
  """
  if input_size < 1:
    raise ValueError("input size n must be >= 1")
  if shrink_factor < 2:
    raise ValueError("shrink factor b must be an integer >= 2")
  return int(math.floor(math.log(input_size, shrink_factor)))

Method 2: substitution (guess and verify)

The substitution method is the rigorous one: guess the form of the answer, then prove it by induction on $n$ . It is the only method that always works, and the only one that produces a complete proof.

We verify the guess $T (n) = O (n log n)$ for the merge-sort recurrence $T (n) = 2 T (n /2) + c n$ .

The substitution method: assume the bound on smaller inputs, substitute into the recurrence, and check that the leftover residual term lets the same bound re-emerge for

n

A symmetric argument with the inequality reversed gives $T (n) = Ω (n log n)$ , and together they yield $T (n) = Θ (n log n)$ , confirming the tree.

Two warnings the standard references repeat:

Guess the right form. Substitution verifies a guess; it cannot invent one. Use the recursion tree (or the Master Theorem below) to find the candidate.
Land on the exact bound. The inductive step must end at the same inequality it assumed, with the same constant. Close enough plus a lower-order term is not a proof, as the next example shows.

A failing guess, and why it fails

Watch the method reject a wrong answer. Take the same recurrence, $T (n) = 2 T (n /2) + n$ , and guess $T (n) = O (n)$ ; concretely, try to prove $T (n) \leq c n$ for some constant $c > 0$ . Substitute the hypothesis $T (n /2) \leq c \frac{n}{2}$ :

T (n) = 2 T (n /2) + n \leq 2 (c \frac{n}{2}) + n = c n + n .

It is tempting to declare victory here: $c n + n = O (n)$ , so we are done. That reasoning is circular hand-waving, and CLRS singles it out as the classic substitution error.³ The induction committed to the exact statement $T (n) \leq c n$ with one fixed constant $c$ that works for every $n$ . The step must therefore arrive at $\leq c n$ on the nose, and

c n + n \leq c n ⟺ n \leq 0,

which never holds. No choice of $c$ , however large, absorbs the leftover $+ n$ ; making $c$ bigger inflates both sides equally. The induction is stuck, and it is stuck for a good reason: the claim is false. We already know $T (n) = Θ (n log n)$ , which is not $O (n)$ . The failed algebra is the method working as designed — a wrong guess leaves a residual that cannot be paid for.

Anatomy of a failed guess for

T (n) = 2 T (n /2) + n

. Guessing

T (n) \leq c n

leaves a

+ n

residual that no constant absorbs; the escape is a stronger hypothesis: raise the guess's order, or subtract a lower-order term.

The escape is to strengthen the guess. For this recurrence the honest fix is to raise its order to $T (n) \leq d n log_{2} n$ , reproducing the proof carried out above: the substitution then produces the residual $- (d - c) n$ , which is negative for $d \geq c$ and absorbs the linear term.

Strengthening by subtracting a lower-order term

A subtler failure mode: the guess has the right order and still gets stuck. Consider

T (n) = 2 T (⌊ n /2 ⌋) + 1.

The tree says $Θ (n)$ : the per-level work is $1, 2, 4, \dots$ , a geometric series dominated by its last term, the $n$ leaves. So guess $T (n) \leq c n$ and substitute:

T (n) \leq 2 (c ⌊ n /2 ⌋) + 1 \leq c n + 1.

Off by $+ 1$ — and no constant $c$ kills a leftover that survives every doubling of $c$ , for the same reason as before. Yet the guess's order is correct. The fix, which CLRS presents with this exact recurrence, is counterintuitive: strengthen the claim by subtracting a lower-order term.³ Guess

T (n) \leq c n - d for constants c > 0, d \geq 0.

Substituting the stronger hypothesis on $⌊ n /2 ⌋$ :

T (n) \leq 2 (c ⌊ n /2 ⌋ - d) + 1 \leq c n - 2 d + 1 = (c n - d) - (d - 1) \leq c n - d

whenever $d \geq 1$ . Choosing $d = 1$ (and $c$ large enough to cover the base case) completes the induction. The stronger hypothesis helps rather than hurts because it is assumed on the subproblems too: each of the two recursive calls brings a $- d$ credit, and the two credits pay for the $+ 1$ of local work with one $- d$ to spare. Proving less was impossible; proving more is easy.

substitution_method.pypython

import math
from dataclasses import dataclass
from typing import Callable

# floating residual comparisons tolerate tiny rounding either side of zero.
_TOLERANCE: float = 1e-9

@dataclass(frozen=True)
class StepFailure:
  """
    A single size where the inductive step failed.\n
    `size` is the n that broke; `recurrence_value` is a guess(n/b) + f(n);\n
    `guess_value` is guess(n); the step needs the former <= the latter.\n
  """

  size: float
  recurrence_value: float
  guess_value: float

@dataclass(frozen=True)
class SubstitutionReport:
  """
    The outcome of checking a guessed bound by substitution.\n
    `holds` is True when the inductive step held at every tested size;\n
    `failures` lists the offending sizes otherwise; `max_residual` is the\n
    largest value of (recurrence_value - guess_value) seen, which is <= 0\n
    exactly when the guess survives.\n
  """

  holds: bool
  failures: list[StepFailure]
  max_residual: float

def verify_substitution(
  guess: Callable[[float], float],
  subproblems: int,
  shrink_factor: float,
  combine: Callable[[float], float],
  sizes: range,
) -> SubstitutionReport:
  """
    Check the inductive step of the substitution method for the guessed\n
    upper bound `guess` against T(n) = a T(n/b) + f(n), where a =\n
    `subproblems`, b = `shrink_factor`, and f = `combine`.\n
    For each n in `sizes` it forms the substituted recurrence value\n
    a * guess(n/b) + f(n) and compares it to guess(n); the guess is\n
    certified when that residual stays <= 0 throughout.\n
  """
  # reject parameters outside the recurrence's domain.
  if subproblems < 1:
    raise ValueError("number of subproblems a must be >= 1")
  if shrink_factor <= 1:
    raise ValueError("shrink factor b must be > 1")

  # track the worst residual and the sizes where the step broke.
  failures: list[StepFailure] = []
  max_residual: float = -math.inf

  for size in sizes:
    # substitute the guess into the recurrence and measure the residual.
    recurrence_value: float = (
      subproblems * guess(size / shrink_factor) + combine(float(size))
    )
    guess_value: float = guess(float(size))
    residual: float = recurrence_value - guess_value
    max_residual = max(max_residual, residual)

    # a positive residual means the step broke at this size.
    if residual > _TOLERANCE:
      failures.append(
        StepFailure(
          size=float(size),
          recurrence_value=recurrence_value,
          guess_value=guess_value,
        )
      )

  return SubstitutionReport(
    holds=not failures,
    failures=failures,
    max_residual=max_residual,
  )

def smallest_constant_for_n_log_n(
  combine_coefficient: float, shrink_factor: float = 2.0
) -> float:
  """
    The least constant d for which the guess T(n) <= d*n*log2(n) survives\n
    the inductive step of the merge-sort recurrence T(n) = 2 T(n/2) + c*n.\n
    The lesson's algebra leaves the residual -(d - c)*n, non-positive\n
    exactly when d >= c, so the smallest working d equals the combine\n
    coefficient c. (Generalized: for b subproblems halving, d >= c / log2 b.)\n
  """
  # reject parameters outside the recurrence's domain.
  if combine_coefficient < 0:
    raise ValueError("combine coefficient c must be >= 0")
  if shrink_factor <= 1:
    raise ValueError("shrink factor b must be > 1")

  # residual -(d - c/log2 b)*n vanishes exactly when d hits this value.
  return combine_coefficient / math.log2(shrink_factor)

A second example: counting inversions

A second divide-and-conquer problem makes the point sharply. Its recurrence has the same shape as merge sort but a different combine cost, and the combine cost is the thing you must get right. An inversion of a list $⟨ a_{1}, \dots, a_{n} ⟩$ is a pair $(i, j)$ with $i < j$ but $a_{i} > a_{j}$ ; the number of inversions measures how far from sorted the list is (a sorted list has $0$ , a reversed list has $(2 n)$ ). The task: given $A [1.. n]$ , return $ninv (A)$ .

The brute-force algorithm compares every pair and runs in $Θ (n^{2})$ . To beat it, mimic merge sort: split $A$ in half, recursively count inversions inside each half, then count the cross inversions, the pairs with one element in the left half and one in the right. That gives a recurrence of the merge-sort form,

T (n) \leq 2 T (n /2) + (cost of counting cross inversions) .

The three kinds of inversion partition cleanly along the split. On $A = ⟨ 2, 4, 1, 3 ⟩$ , the inversions within each half are counted by recursion; the cross pairs, a left element greater than a right element, are what the combine step must tally.

Cross inversions on

⟨ 2, 4, 1, 3 ⟩

split into halves

⟨ 2, 4 ⟩

and

⟨ 1, 3 ⟩

. Each red arc is a cross inversion (a left element bigger than a right one):

(2, 1), (4, 1), (4, 3)

. Within-half inversions are handled by recursion; the combine step counts only these crossing pairs.

Counting cross inversions naively, with a double loop over the two halves, costs $Θ (n^{2})$ , so the recurrence becomes $T (n) \leq 2 T (n /2) + b n^{2}$ . Feed that to the recursion tree: the per-level work is now $b n^{2}, 2 \cdot b (n /2)^{2} = \frac{1}{2} b n^{2}, \dots$ , which shrinks geometrically, so the root dominates and the tree sums to $Θ (n^{2})$ . That is no improvement. The split bought us nothing because the combine step is as expensive as the brute force.

Naive counting-inversions tree,

T (n) = 2 T (n /2) + b n^{2}

: unlike merge sort's flat tree, the level sums

b n^{2}, \frac{1}{2} b n^{2}, \frac{1}{4} b n^{2}, \dots

shrink geometrically, so the root dominates and the total is

Θ (n^{2})

The recurrence therefore sets a requirement: the combine step must run in $O (n)$ , not $O (n^{2})$ . If we can count cross inversions in linear time, which one can, by counting them while merging the two sorted halves, the recurrence collapses to $T (n) \leq 2 T (n /2) + O (n)$ , the merge-sort recurrence, and we get $Θ (n log n)$ . The recurrence both predicts the running time and tells you precisely how fast the combine step has to be for divide-and-conquer to pay off.

Method 3: the Master Theorem

Merge sort's recurrence is one instance of a common pattern. The Master Theorem solves every recurrence of the form

T (n) = a T (n / b) + f (n),

where $a \geq 1$ and $b > 1$ are constants and $f (n)$ is the divide-and-combine work. Here $a$ is the number of subproblems, $n / b$ is each subproblem's size, and $f (n)$ is the work done outside the recursion.

The theorem compares $f (n)$ against the watershed function $n^{log_{b} a}$ , the total cost of the leaves, which equals the number of leaves $a^{log_{b} n} = n^{log_{b} a}$ times the constant base-case cost. Which of the two dominates determines the answer.

The Master Theorem weighs the root's combine work

f (n)

against the leaves' total cost, the watershed

n^{log_{b} a}

: each leaf costs

Θ (1)

, and there are

n^{log_{b} a}

of them.

The intuition matches the recursion tree. Compare the work at the root, $f (n)$ , to the work at the leaves, $n^{log_{b} a}$ . In Case 1 the tree is leaf-heavy and the answer is the leaf count. In Case 3 the root work dwarfs everything below it and the answer is $f (n)$ . In Case 2 the work is spread evenly across all $Θ (log n)$ levels, as we saw for merge sort, giving the extra $log n$ factor.

Where the work concentrates in the Master Theorem's three cases. The answers, left to right:

Θ (n^{log_{b} a})

Θ (n^{log_{b} a} log n)

, and

Θ (f (n))

Each panel stacks the per-level work from root (top) to leaves (bottom); the bar width is the work at that level. The case is decided by which end is heavier.

Why the cases hold: three trees

The theorem is a statement about geometric series, and the recursion tree makes the series visible.⁴ Unroll $T (n) = a T (n / b) + f (n)$ : level $i$ of the tree holds $a^{i}$ subproblems of size $n / b^{i}$ , each contributing $f (n / b^{i})$ of non-recursive work, so

level- i sum = a^{i} f (n / b^{i}), i = 0, 1, \dots, log_{b} n - 1,

and the leaf level contributes $Θ (n^{log_{b} a})$ . Summing,

T (n) = i = 0 \sum log_{b} n - 1 a^{i} f (\frac{n}{b ^{i}}) + Θ (n^{log_{b} a}) .

When $f (n) = Θ (n^{d})$ is a polynomial, the level sums take a clean form:

a^{i} (\frac{n}{b ^{i}})^{d} = n^{d} (\frac{a}{b ^{d}})^{i},

a geometric series with ratio $r = a / b^{d}$ . Everything reduces to whether $r$ is above, at, or below $1$ — equivalently, whether $d$ is below, at, or above $log_{b} a$ . Skiena states the theorem in exactly this three-way form.⁵

Case 1, leaves dominate ( $r > 1$ ). Take $T (n) = 4 T (n /2) + c n$ : here $a = 4$ , $b = 2$ , $d = 1$ , so $r = 4/ 2^{1} = 2$ . Reading the tree level by level:

level 0 level 1 level 2 level i : c n : 4 \cdot c \frac{n}{2} = 2 c n : 16 \cdot c \frac{n}{4} = 4 c n : 4^{i} \cdot c \frac{n}{2 ^{i}} = 2^{i} c n,

doubling every level. A growing geometric series is dominated by its last term, so the total is within a constant factor of the bottom:

i = 0 \sum log_{2} n - 1 2^{i} c n = c n (2^{log_{2} n} - 1) = c n (n - 1) = Θ (n^{2}),

which matches the leaf level: $4^{log_{2} n} = n^{log_{2} 4} = n^{2}$ leaves at $Θ (1)$ each. The combine work is irrelevant; the answer is the leaf count, $T (n) = Θ (n^{log_{b} a}) = Θ (n^{2})$ .

Case 1 tree for

T (n) = 4 T (n /2) + c n

. Each node spawns four children of half the size, so the level sums

c n, 2 c n, 4 c n, \dots

double all the way down; the growing geometric series is dominated by its last term, the

n^{log_{2} 4} = n^{2}

leaves. Total:

Θ (n^{2})

Case 2, balanced ( $r = 1$ ). Merge sort, $T (n) = 2 T (n /2) + c n$ : $a = 2$ , $b = 2$ , $d = 1$ , so $r = 2/ 2^{1} = 1$ . This is the first tree we drew. The level sums are

c n, 2 \cdot c \frac{n}{2} = c n, 4 \cdot c \frac{n}{4} = c n, \dots

— constant at $c n$ for all $log_{2} n + 1$ levels. A flat series is just (number of terms) $\times$ (term), so

T (n) = c n \cdot (log_{2} n + 1) = Θ (n log n) = Θ (n^{log_{b} a} log n) .

Neither end of the tree wins; the $log n$ factor is the number of levels, each pulling equal weight.

Case 3, root dominates ( $r < 1$ ). The naive inversion-counting tree, $T (n) = 2 T (n /2) + b n^{2}$ : $a = 2$ , $b = 2$ , $d = 2$ , so $r = 2/ 2^{2} = \frac{1}{2}$ . The level sums

b n^{2}, 2 \cdot b (\frac{n}{2})^{2} = \frac{1}{2} b n^{2}, 4 \cdot b (\frac{n}{4})^{2} = \frac{1}{4} b n^{2}, \dots

halve every level. A shrinking geometric series is dominated by its first term and bounded by a constant multiple of it:

i \geq 0 \sum (\frac{1}{2})^{i} b n^{2} \leq 2 b n^{2},

so $T (n) = Θ (f (n)) = Θ (n^{2})$ . The root alone already costs $b n^{2}$ ; the entire tree below it costs at most as much again.

The ratio test doubles as a sanity check on concrete instances. For $T (n) = 8 T (n /2) + n^{2}$ : $r = 8/ 2^{2} = 2 > 1$ , Case 1, answer $Θ (n^{log_{2} 8}) = Θ (n^{3})$ . For $T (n) = 2 T (n /2) + n$ : $r = 2/2 = 1$ , Case 2, $Θ (n log n)$ . For $T (n) = 2 T (n /2) + n^{2}$ : $r = 2/4 = \frac{1}{2} < 1$ , Case 3, $Θ (n^{2})$ .

Regularity and the gaps between the cases

Two fine-print clauses matter in practice.

The regularity condition. Case 3 additionally demands $a f (n / b) \leq k f (n)$ for some constant $k < 1$ : the combine work one level down must be a constant factor smaller, which is precisely what makes the level sums a shrinking geometric series. For any polynomial $f$ that satisfies Case 3's growth bound the condition holds automatically — as in Example 4 below, where $a f (n / b) = \frac{1}{2} f (n)$ . It can fail only for contrived oscillating functions that are periodically tiny one level down; CLRS relegates such $f$ to the exercises.⁴ If regularity fails, the theorem does not apply and you must sum the tree by hand.

The gaps. The three cases do not cover every $f$ .⁴ Case 1 needs $f$ polynomially smaller than the watershed (smaller by a factor $n^{ϵ}$ ), and Case 3 polynomially larger; a merely logarithmic separation falls into the crack between the cases. The standard example:

T (n) = 2 T (n /2) + n log n .

The watershed is $n^{log_{2} 2} = n$ , and $f (n) = n log n$ is bigger than $n$ but not bigger by any $n^{ϵ}$ — for every $ϵ > 0$ , $log n = o (n^{ϵ})$ . Case 2 fails since $n log n \neq = Θ (n)$ ; Case 3 fails since $n log n \neq = Ω (n^{1 + ϵ})$ . The basic Master Theorem simply does not apply. The recursion tree still works: level $i$ sums to $2^{i} \cdot \frac{n}{2 ^{i}} log \frac{n}{2 ^{i}} = n (log n - i)$ , so

T (n) = i = 0 \sum log_{2} n - 1 n (log_{2} n - i) = n \cdot Θ (log^{2} n) = Θ (n log^{2} n)

— the sum $log n + (log n - 1) + \dots + 1$ is arithmetic, totaling $Θ (log^{2} n)$ . So the answer picks up a squared log, which none of the three cases predicts. (CLRS's chapter notes discuss extended versions that handle $f (n) = n^{log_{b} a} log^{k} n$ ; for this course, fall back to the tree is the reliable rule.)

Worked examples

Example 1, merge sort. $T (n) = 2 T (n /2) + Θ (n)$ . Here $a = 2$ , $b = 2$ , so $n^{log_{b} a} = n^{log_{2} 2} = n^{1} = n$ . And $f (n) = Θ (n) = Θ (n^{log_{b} a})$ , which is Case 2. Therefore

T (n) = Θ (n log n),

recovering exactly what the tree and substitution gave.

Example 2, binary search. $T (n) = T (n /2) + Θ (1)$ : one subproblem of half size, constant work to pick the side. Here $a = 1$ , $b = 2$ , so $n^{log_{2} 1} = n^{0} = 1$ . Then $f (n) = Θ (1) = Θ (n^{log_{b} a})$ , Case 2 again, and

T (n) = Θ (log n) .

Example 3, leaf-dominated. $T (n) = 4 T (n /2) + n$ . Now $a = 4$ , $b = 2$ , so $n^{log_{2} 4} = n^{2}$ . The combine work $f (n) = n = O (n^{2 - ϵ})$ (take $ϵ = 1$ ) is polynomially smaller than the watershed, which is Case 1, so

T (n) = Θ (n^{2}) .

The recursion has so many leaves ( $n^{2}$ of them) that they dominate the modest linear work per level.

Example 4, root-dominated. $T (n) = 2 T (n /2) + n^{2}$ . Here $a = 2$ , $b = 2$ , watershed $n^{log_{2} 2} = n$ . The combine work $f (n) = n^{2} = Ω (n^{1 + ϵ})$ is polynomially larger, a Case 3 candidate. Check regularity: $a f (n / b) = 2 (n /2)^{2} = \frac{1}{2} n^{2} = \frac{1}{2} f (n) \leq k f (n)$ with $k = \frac{1}{2} < 1$ . Regularity holds, so

T (n) = Θ (n^{2}) .

The root's quadratic work swamps the tree beneath it.

Unequal splits and Akra–Bazzi

The Master Theorem requires every subproblem to have the same size $n / b$ . Divide-and-conquer algorithms do not always split evenly: a partition step can split $n$ elements into a third and two-thirds, giving

T (n) = T (n /3) + T (2 n /3) + c n .

No single $b$ fits, so the theorem does not apply. The recursion tree still works. Each node of size $m$ does $c m$ work and splits into children of sizes $m /3$ and $2 m /3$ — which together are all of $m$ again. So every level where no branch has bottomed out sums to exactly $c n$ ; once leaves start dropping out, levels sum to at most $c n$ .

Recursion tree for

T (n) = T (n /3) + T (2 n /3) + c n

. The two children of a size-

m

node have sizes summing to

m

, so every full level again sums to

c n

. The shallowest branch (all thirds) dies at depth

log_{3} n

, the deepest (all two-thirds) at depth

log_{3/2} n

; both are

Θ (log n)

, so

T (n) = Θ (n log n)

The tree's depth is no longer uniform. The leftmost branch divides by $3$ each step and reaches size $1$ at depth $log_{3} n$ ; the rightmost divides by only $3/2$ and survives until depth $log_{3/2} n$ . Both depths are $Θ (log n)$ — logarithms to different constant bases differ by a constant factor — so

c n \cdot log_{3} n \leq T (n) \leq c n \cdot log_{3/2} n ⟹ T (n) = Θ (n log n),

and substitution certifies the guess in the usual way. Erickson works this recurrence as the standard example of a tree the Master Theorem cannot handle.⁶

For a general tool, the Akra–Bazzi method solves the whole family

T (n) = i = 1 \sum k a_{i} T (n / b_{i}) + f (n), a_{i} > 0, b_{i} > 1,

with different-sized subproblems and reasonable $f$ . Stated without proof: find the unique exponent $p$ with $\sum_{i = 1}^{k} a_{i} / b_{i}^{p} = 1$ ; then

T (n) = Θ n^{p} 1 + 1 \int n \frac{f ( u )}{u ^{p + 1}} d u .

For $T (n) = T (n /3) + T (2 n /3) + n$ the balance equation is $(\frac{1}{3})^{p} + (\frac{2}{3})^{p} = 1$ , satisfied by $p = 1$ (a third plus two-thirds is one). The integral is $\int_{1}^{n} \frac{u}{u ^{2}} d u = ln n$ , so $T (n) = Θ (n (1 + ln n)) = Θ (n log n)$ , agreeing with the tree. The method also handles floors, ceilings, and small perturbations of the subproblem sizes, which is why its answer can be trusted for the real $T (⌊ n /3 ⌋)$ -style recurrences that code produces. CLRS's chapter notes present Akra–Bazzi as the standard generalization of the Master Theorem;⁷ at this course's level, the balance-equation-plus-integral recipe is all you need, with the tree as a cross-check.

Choosing a method

The methods are complementary, and Erickson in particular urges fluency with all of them:⁸

Recursion tree: fastest for building intuition and guessing the answer; shows where the work concentrates, and handles uneven splits.
Master Theorem: fastest for getting the answer when the recurrence fits the $a T (n / b) + f (n)$ template; no derivation needed, but it has gaps.
Akra–Bazzi: the heavier tool for unequal subproblem sizes, such as $T (n) = T (n /3) + T (2 n /3) + n$ ; solve the balance equation, evaluate one integral.
Substitution: the rigorous method that always works and produces a proof; use it to certify a guess, or when the others do not apply.

In practice: sketch the tree to guess, apply the Master Theorem if it fits, and reach for substitution whenever you need a guarantee rather than a hunch.

Recurrences of other shapes

The recurrences here shrink $n$ by a constant factor, the divide-and-conquer signature. Linear recurrences with constant coefficients, like $T (n) = T (n - 1) + T (n - 2)$ (Fibonacci), instead yield to their characteristic equation, whose roots give the closed form — Fibonacci's dominant root is the golden ratio, so $F_{n} = Θ (ϕ^{n})$ .⁹ And the Akra–Bazzi method generalizes to the Akra–Bazzi–Leighton form, which admits lower-order perturbations inside each recursive call, putting the floor/ceiling hand-waving on rigorous footing.¹⁰ For anything that fits none of these, the recursion tree plus a substitution proof never stops applying.

Takeaways

A recursive algorithm induces a recurrence: $T (n)$ = local work + cost of recursive calls on smaller inputs. Merge sort gives $T (n) = 2 T (n /2) + Θ (n)$ .
The recursion tree sums the per-node work; for merge sort every level costs $Θ (n)$ across $Θ (log n)$ levels, giving $Θ (n log n)$ .
Substitution guesses the form and proves it by induction; it is the only always-applicable, fully rigorous method. The step must land on the exact bound with the same constant — $\leq c n + n$ , which is $O (n)$ is not a proof. Strengthen the hypothesis (raise the order, or subtract a lower-order term as in $T (n) \leq c n - d$ ) if a residual blocks the step.
The combine cost drives the answer. Counting inversions has the merge-sort shape $2 T (n /2) + (combine)$ , but a naive $Θ (n^{2})$ combine gives $Θ (n^{2})$ overall, for no gain. Only a linear combine recovers $Θ (n log n)$ .
The Master Theorem solves $T (n) = a T (n / b) + f (n)$ by comparing $f (n)$ to the watershed $n^{log_{b} a}$ : leaves win (Case 1), they tie (Case 2, extra $log n$ ), or the root wins (Case 3, needs regularity). Behind each case is a geometric series of level sums $a^{i} f (n / b^{i})$ ; for $f (n) = Θ (n^{d})$ the ratio $a / b^{d}$ against $1$ decides the case in one division.
The cases have gaps; when $f$ is only non-polynomially separated from the watershed, as in $T (n) = 2 T (n /2) + n log n$ (which sums to $Θ (n log^{2} n)$ ), fall back to the tree or substitution.
Unequal splits like $T (n) = T (n /3) + T (2 n /3) + n$ escape the Master Theorem but not the tree: full levels still sum to $c n$ over $Θ (log n)$ depth, giving $Θ (n log n)$ . Akra–Bazzi generalizes: solve $\sum a_{i} / b_{i}^{p} = 1$ for $p$ , then integrate $f$ .

Skiena, §2.7–2.10 — Logarithms, Recurrences, Divide-and-Conquer: justification for dropping floors and ceilings in recurrences since they perturb the answer by lower-order amounts. ↩
CLRS, Ch. 4 — Divide-and-Conquer: the recursion-tree derivation that merge sort runs in $Θ (n log n)$ time. ↩
CLRS, Ch. 4 — Divide-and-Conquer: the substitution method's pitfalls — the $\leq c n + n$ , hence $O (n)$ fallacy of not proving the exact inductive form, and the subtract-a-lower-order-term fix for $T (n) = 2 T (⌊ n /2 ⌋) + 1$ . ↩ ↩²
CLRS, Ch. 4 — Divide-and-Conquer: the Master Theorem for $T (n) = a T (n / b) + f (n)$ , the recursion-tree proof over the level sums $a^{i} f (n / b^{i})$ , the regularity condition, and the gaps where the theorem does not apply. ↩ ↩² ↩³
Skiena, §2.10 — Divide-and-Conquer Recurrences: the Master Theorem stated by comparing $f (n) = n^{d}$ against $n^{log_{b} a}$ , i.e. the ratio $a / b^{d}$ against $1$ . ↩
Erickson, Algorithms, Ch. 1 and the appendix on solving recurrences: level-by-level analysis of the uneven-split tree $T (n) = T (n /3) + T (2 n /3) + n$ . ↩
CLRS, Ch. 4 chapter notes — the Akra–Bazzi method for divide-and-conquer recurrences with unequal subproblem sizes: the balance equation $\sum a_{i} / b_{i}^{p} = 1$ and the integral form of the solution. ↩
Erickson, Algorithms, Ch. 1–2 — Recursion; Backtracking & Divide-and-Conquer: the case for fluency with recursion trees, substitution, and the Master Theorem as complementary methods. ↩
CLRS, Ch. 4 problems and Appendix — linear recurrences and the characteristic-equation method; the Fibonacci recurrence $F_{n} = F_{n - 1} + F_{n - 2}$ has closed form $Θ (ϕ^{n})$ with $ϕ = (1 + 5) /2$ . ↩
Leighton, T. (1996). Notes on better master theorems for divide-and-conquer recurrences. — the Akra–Bazzi–Leighton generalization admitting lower-order perturbations (floors/ceilings) inside each subproblem. ↩

From a recursive algorithm to a recurrence

Method 1: the recursion tree

Method 2: substitution (guess and verify)

A failing guess, and why it fails

Strengthening by subtracting a lower-order term

A second example: counting inversions

Method 3: the Master Theorem

Why the cases hold: three trees

Regularity and the gaps between the cases

Worked examples

Unequal splits and Akra–Bazzi

Choosing a method

Recurrences of other shapes

Takeaways

Footnotes