Knapsack & Subset Problems

A thief breaks into a warehouse carrying a knapsack that holds at most $W$ units of weight. Item $i$ weighs $w_{i}$ and is worth $v_{i}$. Each item is taken whole or left behind, with no fractions and no duplicates. Which subset maximizes the value carried out without exceeding the capacity? This is the 0/1 knapsack problem. Despite the toy framing, it underlies resource allocation, budgeting, and cutting-stock problems, and it is a canonical NP-hard optimization problem whose dynamic program illustrates what "polynomial time" really means.¹

We will arrive at knapsack through its simpler decision cousin, subset-sum, which strips away the values and asks a plain yes/no question. The include/exclude recurrence is identical, the running-time subtlety is identical, and subset-sum is the cleanest place to see both.

Subset-sum: the decision core

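Given a list $L = ⟨ a_{1}, \dots, a_{n} ⟩$ of positive integers and a target $t$, does some sublist of $L$ sum to exactly $t$? The brute-force space is the $2^{n}$ sublists. To get a recurrence we shrink the instance one element at a time; the second dimension comes from making the target itself a parameter of the subproblem.

Let $A (i, u)$ be true exactly when some sublist of the first $i$ elements sums to $u$. The answer we want is $A (n, t)$. Now look at $a_{i}$, the last element we are allowed to use, and make the include/exclude decision that defines every 0/1 dynamic program: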

Exclude $a_{i}$ . Then some sublist of the first $i - 1$ elements must already hit $u$ on its own: $A (i - 1, u)$ .
Include $a_{i}$ . Then it contributes $a_{i}$ toward the target, and the first $i - 1$ elements must cover the shortfall: $A (i - 1, u - a_{i})$ .

The element can be used in either way, so the subproblem is true if either branch succeeds, a logical $\lor$ . The base cases specify the boundary: a budget of $0$ is always reachable by the empty set; a positive budget is unreachable with no elements; a negative budget is impossible:

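$$
A (i, u) =
\begin{cases}
\text{true} & \text{if } u = 0, \\
\text{false} & \text{if } u < 0, \\
\text{false} & \text{if } i = 0 \text{ and } u > 0, \\
A (i - 1, u) \lor A (i - 1, u - a_{i}) & \text{if } i > 0 \text{ and } u > 0.
\end{cases}
$$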

Every entry depends only on two entries of the previous row ( $i - 1$ ): the one directly above (exclude) and the one above and $a_{i}$ columns to the left (include). So we fill the table row by row in increasing $i$ , sweeping the budget $u = 0, \dots, t$ within each row.
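The recurrence can be transcribed almost literally as a memoized recursion. The following is a minimal sketch in Python, assuming the elements are positive integers; the name `subset_sum_rec` and the use of `functools.lru_cache` for memoization are illustrative choices, not something the text prescribes.

```python
from functools import lru_cache
from typing import Sequence


def subset_sum_rec(values: Sequence[int], target: int) -> bool:
    """Return A(n, target): does some sublist of `values` sum to `target`?"""

    @lru_cache(maxsize=None)
    def reachable(i: int, u: int) -> bool:
        # A(i, u): can the first i elements hit the budget u exactly?
        if u == 0:
            return True      # the empty set reaches 0
        if u < 0 or i == 0:
            return False     # overshoot, or a positive budget with no elements
        a_i = values[i - 1]  # the text indexes elements from 1
        # Exclude a_i, or include it and cover the shortfall.
        return reachable(i - 1, u) or reachable(i - 1, u - a_i)

    return reachable(len(values), target)
```

Memoization keeps the number of distinct calls within the $(n + 1) (t + 1)$ table size, so this has the same asymptotic cost as filling the table bottom-up.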

The subset-sum table, filled

Take $L = ⟨ 1, 3, 4, 2 ⟩$ and target $t = 6$ . The table holds the boolean $A (i, u)$ : row $0$ is $true$ only in column $0$ (the empty set sums to $0$ ), and each later row ORs the exclude cell directly above with the include cell $a_{i}$ columns to its left. The shaded cell $A (4, 6)$ is the answer; the two arrows show its include/exclude dependency.

Boolean subset-sum table with the answer cell and its include and exclude predecessors.

The answer is $A (4, 6) = \text{true}$, witnessed by $\{4, 2\}$ or $\{1, 3, 2\}$. Reading its two dashed predecessors: exclude $a_{4}$ reads $A (3, 6) = \text{false}$ (no subset of $⟨ 1, 3, 4 ⟩$ hits $6$), while include $a_{4} = 2$ reads $A (3, 6 - 2) = A (3, 4) = \text{true}$; the $\lor$ makes the cell true through the include branch.
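To check the cells quoted above, the table for this instance can be filled mechanically. This is a small sketch under the same assumptions as the recurrence (positive integers); `subset_sum_table` is a hypothetical helper name.

```python
from typing import List, Sequence


def subset_sum_table(values: Sequence[int], target: int) -> List[List[bool]]:
    """Fill A[i][u] for i = 0..n and u = 0..target, row by row."""
    n = len(values)
    table = [[False] * (target + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        table[i][0] = True  # the empty set reaches 0
    for i in range(1, n + 1):
        a_i = values[i - 1]
        for u in range(1, target + 1):
            exclude = table[i - 1][u]
            include = a_i <= u and table[i - 1][u - a_i]
            table[i][u] = exclude or include
    return table


table = subset_sum_table([1, 3, 4, 2], 6)
for row in table:
    print("".join("T" if cell else "." for cell in row))
```

The five printed rows are `T......`, `TT.....`, `TT.TT..`, `TT.TTT.` and `TTTTTTT`: row $3$ is false in column $6$ and true in column $4$, and the last row is true in column $6$.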

Subset-sum in pseudocode

Algorithm 1. \textsc{Subset-Sum}(L[1..n], t): does some sublist sum to $t$?

    $A[0..n][0..t] \gets \text{false}$
    $A[0..n][0] \gets \text{true}$    // empty set reaches $0$
    for $i \gets 1$ to $n$ do
        for $u \gets 1$ to $t$ do
            if $a_i > u$ then
                $A[i][u] \gets A[i-1][u]$    // $a_i$ too big: exclude
            else
                $A[i][u] \gets A[i-1][u] \lor A[i-1][u - a_i]$    // exclude or include
    return $A[n][t]$

This fills $(n + 1) (t + 1)$ boolean cells in $Θ (1)$ apiece, for $Θ (n t)$ time and $Θ (n t)$ space, and the space drops to $Θ (t)$ because each row reads only the one above it. We return to the $Θ (n t)$ bound below; it is less benign than it looks. First, 0/1 knapsack is the same recurrence with values added.
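The $Θ (t)$ space bound can be sketched with a single boolean row that is updated in place. The code below is one possible version, assuming positive integer elements; the function name is illustrative. The budget is swept from high to low so that each update still reads the previous row's value.

```python
from typing import Sequence


def subset_sum(values: Sequence[int], target: int) -> bool:
    """Decide subset-sum with one row of target + 1 booleans."""
    reachable = [False] * (target + 1)
    reachable[0] = True  # the empty set reaches 0
    for a in values:
        # Descending sweep: reachable[u - a] has not yet been updated for
        # this element, so each element is used at most once.
        for u in range(target, a - 1, -1):
            if reachable[u - a]:
                reachable[u] = True
    return reachable[target]
```

Elements larger than the target produce an empty inner loop, which is the "too big: exclude" case of the recurrence.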

The problem and why greed fails

Knapsack is subset-sum with two upgrades: each item carries a separate value $v_{i}$ as well as a weight $w_{i}$ , and the capacity $W$ is an upper bound rather than an exact target, so we maximize value instead of answering yes/no. The include/exclude decision is unchanged: the $\lor$ of subset-sum becomes a $max$ , and the boolean cell becomes a value. (Set $v_{i} = w_{i} = a_{i}$ and the answer $K (n, t) = t$ recovers subset-sum exactly, a reduction we make precise below.)

The natural greedy strategy, taking items in decreasing order of value-per-weight ratio $v_{i} / w_{i}$, fails because a locally efficient item can block a globally better combination; the 0/1 problem lacks the structure that lets greedy choices succeed on matroids. We need to weigh whole subsets against each other, and that calls for dynamic programming.

Why greedy fails for 0/1 knapsack ($W = 10$). Best ratio first grabs $A$ ($v / w = 1.67$) and then nothing fits: value $10$. Taking $B$ and $C$ instead fills capacity exactly for value $18$.

The high-ratio item $A$ is locally efficient yet blocks the $B + C$ pair that packs the knapsack with no waste.
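The blocking effect is easy to reproduce in code. The sketch below compares a best-ratio-first greedy pass with exhaustive enumeration on a hypothetical three-item instance (not the figure's data): one high-ratio item of weight $6$ and two weight-$5$ items that together fill a capacity of $10$. Items are `(weight, value)` pairs, and both function names are illustrative.

```python
from itertools import combinations
from typing import List, Tuple

Item = Tuple[int, int]  # (weight, value), an assumed representation


def greedy_by_ratio(items: List[Item], capacity: int) -> int:
    """Take whole items in decreasing value/weight order while they fit."""
    total, remaining = 0, capacity
    for weight, value in sorted(items, key=lambda it: it[1] / it[0], reverse=True):
        if weight <= remaining:
            total += value
            remaining -= weight
    return total


def best_by_enumeration(items: List[Item], capacity: int) -> int:
    """Try all 2^n subsets; only sensible for tiny instances."""
    best = 0
    for size in range(len(items) + 1):
        for subset in combinations(items, size):
            if sum(w for w, _ in subset) <= capacity:
                best = max(best, sum(v for _, v in subset))
    return best


# Hypothetical instance: the ratio-5 item blocks the two ratio-4 items.
demo_items = [(6, 30), (5, 20), (5, 20)]
print(greedy_by_ratio(demo_items, 10), best_by_enumeration(demo_items, 10))
```

On this instance greedy returns $30$ while the optimum is $40$.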

The subproblem and recurrence

The brute-force space is the $2^{n}$ subsets. To get a recurrence we need a subproblem definition that shrinks the instance one decision at a time. The second dimension comes from tracking not only which items remain available but how much capacity is left.

Let $K (i, w)$ be the maximum value achievable using only the first $i$ items within a weight budget of $w$. The answer is $K (n, W)$. Now consider item $i$, the last one we are allowed to use, and make the same include/exclude decision as in subset-sum; this is the 0/1 in the name.

Exclude item $i$ . Then the best we can do is whatever the first $i - 1$ items achieve within the same budget: $K (i - 1, w)$ .

Include item $i$ . This is only possible if it fits, $w_{i} \leq w$ . We collect its value $v_{i}$ and spend $w_{i}$ of the budget, leaving $w - w_{i}$ for the first $i - 1$ items: $v_{i} + K (i - 1, w - w_{i})$ .

We take the better of the two, and if item $i$ does not fit, only the first option is available:²

$$
K (i, w) =
\begin{cases}
0 & \text{if } i = 0 \text{ or } w = 0, \\
K (i - 1, w) & \text{if } w_{i} > w, \\
\max (K (i - 1, w),\ v_{i} + K (i - 1, w - w_{i})) & \text{if } w_{i} \leq w.
\end{cases}
$$

The base case says: with no items, or no capacity, the value is $0$ . Each entry depends only on entries in the previous row ( $i - 1$ ), so we fill the table row by row in increasing $i$ , and within each row over all budgets $w = 0, \dots, W$ .
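As with subset-sum, the recurrence can be written down directly as a memoized recursion. This is a minimal sketch assuming positive integer weights; `knapsack_rec` and the parallel `weights` / `values` lists are illustrative names.

```python
from functools import lru_cache
from typing import Sequence


def knapsack_rec(weights: Sequence[int], values: Sequence[int], capacity: int) -> int:
    """Return K(n, capacity) by evaluating the recurrence top-down."""

    @lru_cache(maxsize=None)
    def best(i: int, w: int) -> int:
        # K(i, w): best value from the first i items within budget w.
        if i == 0 or w == 0:
            return 0
        w_i, v_i = weights[i - 1], values[i - 1]  # the text indexes from 1
        skip = best(i - 1, w)
        if w_i > w:
            return skip                           # item i does not fit
        return max(skip, v_i + best(i - 1, w - w_i))

    return best(len(weights), capacity)
```

For example, `knapsack_rec([1, 2, 3, 5], [1, 6, 10, 16], 5)` evaluates to `16`.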

The DP table, filled

Take capacity $W = 5$ and four items: $1 : (w = 1, v = 1)$ , $2 : (w = 2, v = 6)$ , $3 : (w = 3, v = 10)$ , $4 : (w = 5, v = 16)$ . The table holds $K (i, w)$ ; row $0$ is all zeros (no items), and each later row applies the recurrence across budgets $0$ through $5$ .

Filled 0/1 knapsack value table with include and exclude arrows into the answer.

The shaded answer is $K (4, 5) = 16$ . The arrows trace the highlighted entry $K (3, 5)$ : item $3$ weighs $3 \leq 5$ , so we compare excluding it ( $K (2, 5) = 7$ , the cell directly above) against including it ( $v_{3} + K (2, 5 - 3) = 10 + K (2, 2) = 10 + 6 = 16$ , reaching $w_{3} = 3$ columns to the left); the include branch wins at $16$ . One row down, $K (4, 5)$ compares excluding item $4$ ( $K (3, 5) = 16$ ) against including it ( $16 + K (3, 0) = 16$ ), a tie at $16$ .

To see the whole table appear rather than just its answer, walk the rows in order. Row $0$ is the boundary: with no items every budget yields value $0$ . Each later row copies the row above (the exclude branch) and then, wherever item $i$ fits, overwrites the cell with the larger of that copy and $v_{i} + K (i - 1, w - w_{i})$ .

Row $1$ , item $(w = 1, v = 1)$ . It fits from $w = 1$ onward, and once it fits it is always worth taking, so $K (1, 0) = 0$ and $K (1, w) = 1$ for $w \geq 1$ .
Row $2$ , item $(w = 2, v = 6)$ . At $w = 2$ we compare exclude $K (1, 2) = 1$ against include $6 + K (1, 0) = 6$ ; include wins, $K (2, 2) = 6$ . At $w = 3$ , include gives $6 + K (1, 1) = 7$ , beating the copied $1$ . From $w = 3$ up the row reads $7$ , since one unit of budget past the pair adds nothing new.
Row $3$ , item $(w = 3, v = 10)$ . At $w = 3$ , include gives $10 + K (2, 0) = 10 > K (2, 3) = 7$ . At $w = 4$ , include gives $10 + K (2, 1) = 11$ . At $w = 5$ , include gives $10 + K (2, 2) = 10 + 6 = 16$ , the entry the arrows highlighted.
Row $4$ , item $(w = 5, v = 16)$ . It fits only at $w = 5$ , where include gives $16 + K (3, 0) = 16$ , tying the copied $K (3, 5) = 16$ . The recurrence keeps the earlier winner on a tie, so the answer $K (4, 5) = 16$ is achieved without item $4$ .

A cell is never touched again once written: every dependency points one row up, so the sweep in increasing $i$ always finds its inputs already final.
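The row-by-row walk can be reproduced with a few lines of Python. The sketch below fills the whole table for the instance above and prints each row; `knapsack_table` is a hypothetical name, and positive integer weights are assumed.

```python
from typing import List, Sequence


def knapsack_table(weights: Sequence[int], values: Sequence[int], capacity: int) -> List[List[int]]:
    """Fill K[i][b] for i = 0..n and b = 0..capacity."""
    n = len(weights)
    table = [[0] * (capacity + 1) for _ in range(n + 1)]  # row 0: no items
    for i in range(1, n + 1):
        w_i, v_i = weights[i - 1], values[i - 1]
        for b in range(capacity + 1):
            table[i][b] = table[i - 1][b]                  # exclude: copy from above
            if w_i <= b:                                   # include only if it fits
                table[i][b] = max(table[i][b], v_i + table[i - 1][b - w_i])
    return table


rows = knapsack_table([1, 2, 3, 5], [1, 6, 10, 16], 5)
for row in rows:
    print(row)
```

The printed rows are `[0, 0, 0, 0, 0, 0]`, `[0, 1, 1, 1, 1, 1]`, `[0, 1, 6, 7, 7, 7]`, `[0, 1, 6, 10, 11, 16]` and `[0, 1, 6, 10, 11, 16]`, matching the walk-through.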

The algorithm

Algorithm 2. \textsc{Knapsack-01}(w, v, n, W): maximum value within capacity $W$

    for $b \gets 0$ to $W$ do
        $K[0][b] \gets 0$    // no items: value $0$
    for $i \gets 1$ to $n$ do
        for $b \gets 0$ to $W$ do
            $K[i][b] \gets K[i-1][b]$    // skip item $i$
            if $w[i] \le b$ then
                $take \gets v[i] + K[i-1][b - w[i]]$
                $K[i][b] \gets \max(K[i][b],\ take)$    // take item $i$
    return $K[n][W]$

Recovering the chosen items. As with every DP, the table holds the value; the subset is recovered by walking backward from $K [n] [W]$, starting with budget $b = W$. At row $i$, if $K [i] [b] \neq K [i - 1] [b]$ then item $i$ was taken: record it and drop the budget to $b - w_{i}$; otherwise it was skipped. Continue down to row $0$.

Algorithm 3. \textsc{Knapsack-Items}(K, w, n, W): recover the optimal subset

    $S \gets \emptyset$
    $b \gets W$
    for $i \gets n$ downto $1$ do
        if $K[i][b] \neq K[i-1][b]$ then
            $S \gets S \cup \{i\}$    // item $i$ taken
            $b \gets b - w[i]$
    return $S$

On the filled table above ( $W = 5$ ), the walk starts at $K [4] [5]$ and reads off one decision per row, dropping the budget by $w_{i}$ whenever an item is taken:

Backward reconstruction from $K [4] [5] = 16$. At each row, $K [i] [b] = K [i - 1] [b]$ means item $i$ was skipped (budget unchanged); a jump means it was taken (budget drops by $w_{i}$). Items $3$ and $2$ are recovered, weight $3 + 2 = 5$, value $10 + 6 = 16$.

The same walk drawn on the grid is a staircase: a vertical step means the item was skipped (budget held), and a diagonal drop down and to the left means it was taken (budget falls by $w_{i}$ ). The path starts at the answer $K (4, 5)$ and lands at the origin $K (0, 0)$ ; the diagonal drops mark exactly the chosen items.

The traceback path on the value grid. From the answer $K (4, 5) = 16$, a vertical move (skip) keeps the budget; a diagonal move down-left (take) drops it by $w_{i}$. The two diagonal drops recover items $3$ and $2$; the path ends at $K (0, 0)$.
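The backward walk is short enough to sketch in full. The code below fills the table and then traces it from the answer cell; the helper names `fill_table` and `recover_items` are illustrative, and items are reported by their 1-based index as in the text.

```python
from typing import List, Sequence


def fill_table(weights: Sequence[int], values: Sequence[int], capacity: int) -> List[List[int]]:
    """Bottom-up K[i][b] table, as in the value recurrence."""
    n = len(weights)
    table = [[0] * (capacity + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for b in range(capacity + 1):
            table[i][b] = table[i - 1][b]
            if weights[i - 1] <= b:
                take = values[i - 1] + table[i - 1][b - weights[i - 1]]
                table[i][b] = max(table[i][b], take)
    return table


def recover_items(table: List[List[int]], weights: Sequence[int], capacity: int) -> List[int]:
    """Walk from K[n][capacity] up to row 0, collecting the taken items."""
    chosen: List[int] = []
    b = capacity
    for i in range(len(weights), 0, -1):
        if table[i][b] != table[i - 1][b]:  # value changed: item i was taken
            chosen.append(i)
            b -= weights[i - 1]
    return chosen


weights, values, capacity = [1, 2, 3, 5], [1, 6, 10, 16], 5
chosen = recover_items(fill_table(weights, values, capacity), weights, capacity)
print(chosen)  # [3, 2]
```

Because ties are resolved in favor of skipping, item $4$ is not reported even though taking it alone would also reach value $16$.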

knapsack.py

```python
from typing import NamedTuple, Sequence


class Item(NamedTuple):
    """One knapsack item: the value gained and the weight it costs."""

    value: int
    weight: int


def knapsack(items: Sequence[Item], capacity: int) -> int:
    """Best total value of a subset of `items` whose weights fit within
    `capacity`. Weights are non-negative integers.
    """
    # best_value[c] is the most value reachable with a weight budget of c.
    best_value: list[int] = [0] * (capacity + 1)

    for item in items:
        # Sweep capacities high-to-low so the item is taken at most once.
        for remaining in range(capacity, item.weight - 1, -1):
            candidate = best_value[remaining - item.weight] + item.value
            best_value[remaining] = max(best_value[remaining], candidate)

    return best_value[capacity]
```

Running time and the pseudo-polynomial trap

The table has $(n + 1) (W + 1)$ entries, each filled in $Θ (1)$, so \textsc{Knapsack-01} runs in

$$Θ (nW)$$

time and space. (Space drops to $Θ (W)$ if only the value is needed: each row reads only the previous one, so a single array suffices, provided $b$ is scanned from high to low.)

The descending sweep is the correctness argument for the 1-D version: when we apply an item, $K [b - w_{i}]$ must still hold the value from before this item was available, so we must reach $b$ before we overwrite $b - w_{i}$ — that is, go high to low.

0/1 knapsack on one rolling array, applying item $2$ ($w = 2, v = 6$) with the budget swept $b = 5 \to 2$ (descending). Each update $K [b] \leftarrow \max (K [b], 6 + K [b - 2])$ reads $K [b - 2]$ before it is overwritten, so item $2$ is used at most once.

Sweeping the other way, low to high, would let $K [b - w_{i}]$ already include item $2$ , silently packing it twice — the reuse that the unbounded knapsack requires, and that the ascending sweep there deliberately permits.

The failure is easy to trace on the same array. Sweeping upward, $K [2]$ becomes $6$ first. Then at $b = 4$ the update reads $K [4 - 2] = K [2]$ , which already holds item $2$ ; adding $v_{2}$ again yields $12$ , as if two copies of a weight- $2$ item were packed into a budget of $4$ . The descending sweep reads $K [2]$ while it still holds its pre-item value, so no cell is ever charged item $2$ twice.

The ascending sweep double-counts item $2$ ($w = 2, v = 6$). Updating $b = 2$ first makes $K [2] = 6$; the later update at $b = 4$ then reads that fresh $K [2]$ and adds $v_{2}$ again, giving $12$: item $2$ packed twice into a budget of $4$.
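Both sweep directions can be compared on the same starting row. The sketch below applies item $2$ ($w = 2, v = 6$) to the row left by item $1$, once descending and once ascending; `apply_item` is a hypothetical helper written for this comparison.

```python
from typing import List


def apply_item(best: List[int], weight: int, value: int, descending: bool) -> List[int]:
    """Apply one item to a rolling array and return the updated copy."""
    updated = list(best)
    top = len(updated) - 1
    budgets = range(top, weight - 1, -1) if descending else range(weight, top + 1)
    for b in budgets:
        updated[b] = max(updated[b], value + updated[b - weight])
    return updated


row_after_item_1 = [0, 1, 1, 1, 1, 1]
print(apply_item(row_after_item_1, 2, 6, descending=True))   # [0, 1, 6, 7, 7, 7]
print(apply_item(row_after_item_1, 2, 6, descending=False))  # [0, 1, 6, 7, 12, 13]
```

The descending result is row $2$ of the 0/1 table; the ascending result shows the $12$ at budget $4$ and a $13$ at budget $5$, both of which use item $2$ twice.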

That $Θ (nW)$ looks polynomial, but the accounting deserves scrutiny.

Is this really polynomial time? It is tempting to call $Θ (n t)$ (or $Θ (nW)$) polynomial: the input is a list of $n + 1$ numbers, so surely its size is $Θ (n)$, and $n t$ looks polynomial in $n$. That accounting is wrong. We have to count the input's size in bits, not in numbers; this is the same bit-length accounting that underlies asymptotic analysis. If each of the $n + 1$ integers is written with $b$ bits, the whole input occupies $(n + 1) b$ bits.

Now rewrite the running time against $b$. Since $t$ can be as large as $2^{b} - 1$, in the worst case

$$Θ (n t) = Θ (n \cdot 2^{b}),$$

which is exponential in $b$, the number of bits of a single input integer. Doubling the bits used to write the target squares $t$, and with it the $t$ factor of the running time. An algorithm whose time is polynomial in the numeric value of an input but exponential in its encoded length is called pseudo-polynomial: it counts as polynomial only if we pretend an integer $s$ has size $∣ s ∣$ rather than its true size $Θ (log ∣ s ∣)$.³ Such running times are central to coping with hardness.

No algorithm polynomial in the bit length is known, so the $Θ (n t)$ DP remains the standard exact method when the numbers are small. It is fast and practical for a target in the thousands, say, and uselessly slow when $t$ is a $200$-bit integer, even though both instances have the same handful of elements. The lesson generalizes: any DP whose table is indexed by a numeric quantity (a target sum, a capacity) inherits this pseudo-polynomial character. The same pattern recurs in coin change and the unbounded knapsack.
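A few lines of arithmetic make the gap concrete. The sketch below assumes every one of the $n + 1$ input integers is written with the same number of bits and that the target takes the largest such value; both function names are illustrative.

```python
def input_bits(n: int, bits_per_number: int) -> int:
    """Honest input size: n elements plus the target, each in the given bits."""
    return (n + 1) * bits_per_number


def table_cells(n: int, target: int) -> int:
    """Number of cells the subset-sum DP fills: (n + 1) * (target + 1)."""
    return (n + 1) * (target + 1)


n = 20
for bits in (10, 20, 40):
    largest_target = 2**bits - 1
    print(bits, input_bits(n, bits), table_cells(n, largest_target))
```

Going from $10$ to $20$ bits per number doubles the input size from $210$ to $420$ bits, while the table grows from $21 \cdot 2^{10}$ to $21 \cdot 2^{20}$ cells, a factor of $1024$.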

Subset-sum as a special case

We opened with subset-sum and built knapsack on top of it; the dependence also runs the other way, since a knapsack solver answers subset-sum, and making that precise shows that subset-sum is a special case of knapsack. Take any subset-sum instance $⟨ a_{1}, \dots, a_{n} ⟩$ with target $t$, and feed it to 0/1 knapsack with each item's value equal to its weight, $v_{i} = w_{i} = a_{i}$, and capacity $W = t$. Because value equals weight, the most value you can pack into a capacity-$t$ knapsack is at most $t$, and it equals $t$ exactly when some subset of weights sums to $t$ without waste. So

$$K (n, t) = t \iff \text{some sublist of } L \text{ sums to } t,$$

and one call to \textsc{Knapsack-01} answers subset-sum. The two recurrences have the same shape with the $\lor$ of the boolean table promoted to a $\max$ over values, which is why everything transfers: $Θ (n t)$ time, pseudo-polynomial, and NP-complete as a decision problem (NP-hard as an optimization problem) in general.
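The reduction amounts to one function call. The following sketch pairs a one-array 0/1 knapsack with a wrapper that sets value equal to weight; the names `knapsack_value` and `subset_sum_via_knapsack` are illustrative, and positive integer inputs are assumed.

```python
from typing import Sequence


def knapsack_value(weights: Sequence[int], values: Sequence[int], capacity: int) -> int:
    """0/1 knapsack optimum on a single rolling array."""
    best = [0] * (capacity + 1)
    for w, v in zip(weights, values):
        for b in range(capacity, w - 1, -1):  # descending: each item used once
            best[b] = max(best[b], v + best[b - w])
    return best[capacity]


def subset_sum_via_knapsack(numbers: Sequence[int], target: int) -> bool:
    """Some sublist sums to `target` iff the value-equals-weight optimum is `target`."""
    return knapsack_value(numbers, numbers, target) == target
```

For the earlier example, `subset_sum_via_knapsack([1, 3, 4, 2], 6)` is `True`, while `subset_sum_via_knapsack([1, 3, 4], 6)` is `False` because the best packing reaches only $5$.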

Where greed does work: fractional knapsack

Change one rule, letting items be split so that we may take any fraction $0 \leq x_{i} \leq 1$ of item $i$, and the problem, now called fractional knapsack, becomes easy: it is solved by the very greedy strategy that failed the 0/1 version. Sort items by value-per-weight ratio $v_{i} / w_{i}$ and take them greedily, highest ratio first, slicing the last item to fill the knapsack exactly.

Algorithm 4. \textsc{Fractional-Knapsack}(w, v, n, W): greedy, fractions allowed

    sort items so that $v[1]/w[1] \ge v[2]/w[2] \ge \cdots \ge v[n]/w[n]$
    $value \gets 0$ ; $b \gets W$    // remaining capacity
    for $i \gets 1$ to $n$ do
        if $w[i] \le b$ then
            $value \gets value + v[i]$    // take all of item $i$
            $b \gets b - w[i]$
        else
            $value \gets value + v[i] \cdot (b / w[i])$    // fraction fills exactly
            return $value$
    return $value$

This runs in $Θ (n log n)$ ; the sort dominates.
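The greedy procedure can be sketched in Python as follows. Exact rational arithmetic via `fractions.Fraction` is a choice made here to avoid floating-point noise, not something the pseudocode requires; positive integer weights are assumed and the function name is illustrative.

```python
from fractions import Fraction
from typing import Sequence


def fractional_knapsack(weights: Sequence[int], values: Sequence[int], capacity: int) -> Fraction:
    """Greedy optimum when any fraction of an item may be taken."""
    # Visit items by decreasing value-per-weight ratio.
    order = sorted(
        range(len(weights)),
        key=lambda i: Fraction(values[i], weights[i]),
        reverse=True,
    )
    total = Fraction(0)
    remaining = capacity
    for i in order:
        if weights[i] <= remaining:
            total += values[i]            # take all of item i
            remaining -= weights[i]
        else:
            # Take the fraction that fills the knapsack exactly, then stop.
            total += Fraction(values[i] * remaining, weights[i])
            break
    return total
```

On the four-item instance used for the 0/1 table ($W = 5$), the greedy pass takes item $3$ whole and two fifths of item $4$, for `Fraction(82, 5)`, that is $16.4$, slightly above the 0/1 optimum of $16$.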

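The contrast between the two variants is the point: allowing fractions restores the greedy-choice property that indivisibility destroys, so a single rule change separates an NP-hard problem from a $Θ (n log n)$ one.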

Approximation and the two ways to be pseudo-polynomial

Knapsack's $Θ (nW)$ table is pseudo-polynomial — polynomial in the numeric capacity $W$ , exponential in its bit length. The same limitation makes possible a strong approximation result: knapsack admits a fully polynomial-time approximation scheme (FPTAS). Given any $ε > 0$ , one can find a packing within a $(1 - ε)$ factor of optimal in time $O (n^{3} / ε)$ — polynomial in both $n$ and $1/ ε$ — by dynamic-programming on value instead of weight and then rounding the values: divide every $v_{i}$ by a scaling factor $K = ε v_{max} / n$ and round, which shrinks the value-indexed table to polynomial size while losing at most an $ε$ fraction of the objective (Ibarra and Kim, 1975; the treatment in Vazirani's Approximation Algorithms is the standard reference). Few NP-hard problems can be approximated this well; knapsack can precisely because it is only weakly NP-hard, i.e. hard only when the numbers are large.
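The value-rounding idea can be sketched compactly. The code below is an illustrative version rather than the cited authors' exact algorithm: it scales values by $ε v_{max} / n$, rounds down, runs a minimum-weight DP indexed by scaled value, and reports the true value of the subset it finds. Items are `(weight, value)` pairs with positive integer entries, and the name `knapsack_fptas` is hypothetical.

```python
import math
from typing import Sequence, Tuple


def knapsack_fptas(items: Sequence[Tuple[int, int]], capacity: int, eps: float) -> int:
    """Value of a packing worth at least (1 - eps) times the optimum."""
    # Items that cannot fit on their own are irrelevant and would distort v_max.
    usable = [(w, v) for w, v in items if w <= capacity]
    if not usable:
        return 0

    n = len(usable)
    v_max = max(v for _, v in usable)
    scale = eps * v_max / n                     # the scaling factor from the text
    scaled = [math.floor(v / scale) for _, v in usable]
    total = sum(scaled)                         # at most about n * (n / eps)

    # min_weight[s]: least weight of a subset whose scaled values sum to exactly s.
    # true_value[s]: the unscaled value of that same subset.
    min_weight = [math.inf] * (total + 1)
    true_value = [0] * (total + 1)
    min_weight[0] = 0

    for (w, v), s in zip(usable, scaled):
        for target in range(total, s - 1, -1):  # descending: each item used once
            candidate = min_weight[target - s] + w
            if candidate < min_weight[target]:
                min_weight[target] = candidate
                true_value[target] = true_value[target - s] + v

    best_scaled = max(s for s in range(total + 1) if min_weight[s] <= capacity)
    return true_value[best_scaled]
```

The table has at most about $n^{2} / ε$ columns and is swept once per item, which is where the $O (n^{3} / ε)$ bound comes from. Rounding down loses less than one unit of scaled value per item, so the total loss is at most $n$ times the scaling factor, that is $ε v_{max}$, which is no more than an $ε$ fraction of the optimum.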

The complementary DP, keyed on value rather than weight ( $D [v] =$ minimum weight achieving value exactly $v$ ), runs in $Θ (n \cdot V)$ where $V$ is the total value, and is the one the FPTAS rounds; whichever of $W$ and $V$ is smaller gives the better bound. Two related results extend this. The meet-in-the-middle technique (Horowitz and Sahni, 1974) solves subset-sum in $O (2^{n /2})$ time by splitting the items in half, enumerating each half's subset sums, and merging — far better than $2^{n}$ when $W$ is astronomically large and the $Θ (nW)$ table is useless. And strong NP-hardness marks the boundary of the FPTAS approach: problems like bin packing and 3-partition remain hard even with small numbers, so no pseudo-polynomial algorithm exists for them unless P $=$ NP. The FPTAS exploits precisely that large-number weakness of Knapsack.⁴
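Meet-in-the-middle is short enough to sketch as well. The version below enumerates the subset sums of each half, sorts one side, and binary-searches it for the complement of each sum on the other side; the sort and searches add a factor of $n$ to the $2^{n /2}$ enumeration. The function names are illustrative.

```python
from bisect import bisect_left
from typing import List, Sequence


def all_subset_sums(values: Sequence[int]) -> List[int]:
    """Sums of all 2^k subsets of `values` (k = len(values))."""
    sums = [0]
    for x in values:
        sums += [s + x for s in sums]  # each old sum, with and without x
    return sums


def subset_sum_mitm(values: Sequence[int], target: int) -> bool:
    """Decide subset-sum by splitting the list in half."""
    half = len(values) // 2
    left = all_subset_sums(values[:half])
    right = sorted(all_subset_sums(values[half:]))
    for s in left:
        need = target - s
        j = bisect_left(right, need)   # first position with right[j] >= need
        if j < len(right) and right[j] == need:
            return True
    return False
```

Unlike the table-based DP, the cost here does not depend on the magnitude of the numbers, so a call such as `subset_sum_mitm([10**30, 5, 7], 10**30 + 7)` is immediate.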

Takeaways

Subset-sum and 0/1 knapsack share one structure: a two-dimensional subproblem ( $A (i, u)$ / $K (i, w)$ ) indexed by items available and budget remaining, because how much budget is left is part of the state.
The recurrence is the include/exclude choice on the last element: a $\lor$ of two previous-row cells for subset-sum, a $max$ for knapsack. Each cell depends on the one directly above (exclude) and the one $a_{i}$ / $w_{i}$ columns to the left (include); the optimal subset is recovered by walking backward.
$Θ (n t)$ / $Θ (nW)$ is pseudo-polynomial: polynomial in the numeric value of the target, but with an honest size of $(n + 1) b$ bits it is $Θ (n \cdot 2^{b})$ , exponential in the bit length $b$ of one integer.
A truly polynomial $poly (n, b)$ algorithm is unlikely: subset-sum is NP-complete, so one exists only if $P = NP$.
Fractional knapsack flips to a $Θ (n log n)$ greedy algorithm: allowing fractions restores the greedy-choice property that indivisibility destroys, so one rule change crosses the line from NP-hard to easy.

¹ Skiena, §10, Dynamic Programming: 0/1 knapsack as a canonical NP-hard resource-allocation problem solved by DP.
² CLRS, Ch. 15, Dynamic Programming: the 0/1 knapsack value recurrence $K (i, w)$ taking the max of exclude and include.
³ Erickson, Ch. 3, Dynamic Programming: the pseudo-polynomial $Θ (n t)$ running time, exponential in the input's bit length.
⁴ Ibarra & Kim (1975) for the knapsack FPTAS by value-rounding (see Vazirani, Approximation Algorithms, Ch. 8); Horowitz & Sahni (1974) for the $O (2^{n /2})$ meet-in-the-middle subset-sum. Bin packing and 3-partition are strongly NP-hard, so they admit no pseudo-polynomial algorithm unless P $=$ NP.
