4.5.4 Low density knapsacks

One of the more efficient algorithms for solving the general subset sum problem is the meet-in-the-middle algorithm.
For knapsacks having low density there is still another method namely to reduce the subset sum problem to the problem of finding a vector with small length.

Definition
Given a knapsack vector A = (a₁, ..., a_n). Let a_max be the largest element in A: a_max = max_i a_i, i = 1 ... n. Then the density d_A of A is defined by

d_A = n / log₂(a_max)

In Cryptosystems, based on the subset sum problem, the density of a knapsack is almost always less than one. If not so, several subsets may exist summing up to the same s and hence no clear decryption may be performed.

Definition
A vector v in Rⁿ is an orderd tuple with elements v_i in R:

v = (v₁, v₂, ..., v_n), v_i in R, i = 1 ... n

Definition
Let v = (v₁, v₂, ..., v_n) be a vector in Rⁿ. The euclidian length of v is

||v|| = (v₁² + ... + v_n²)^1/2

Definition
Suppose {v_i}_i=1...m is a set of m vectors in Rⁿ with m <= n. Then {v_i} is linearly dependant if and only if there are k_i in R with

k₁v₁ + k₂v₂ + ... + k_mv_m = 0, not all k_i = 0

Otherwise the set {v_i}_i=1...m is called linearly independant.

Definition
Let {b_i}_i=1...n be a set of n linearly independant vectors in Rⁿ. Then {b_i}_i=1...n forms a base for Rⁿ. All vectors v in Rⁿ may be written as linear combinations of {b_i}_i=1...n:

v = k₁b₁ + k₂b₂ + ... +k_nb_n, k_i in R, i = 1 ... n.

Definition
Given m linear independant vectors {b_i}_i=1...m, m <= n. Then the lattice L, spanned by {b_i}_i=1...m, is the set of all integral linear combinations of {b_i}, that is

L = {v | v = z₁b₁ + z₂b₂ + ... + z_mb_m, with z_i in Z}

The set {b_i}_i=1...m forms a base of L. A lattice may have several bases.

Closely related to a lattice is the problem of finding the shortest nonzero vector of this lattice (SVP: shortest vector problem).
It is an open question whether or not this problem is NP-hard using the euclidean norm. Using the supremum norm, it is shown, that the problem is NP-hard. The supremum norm || ||₈ of a vector v is defined by

||v||₈ = max |v_i|_i=1...n

As stated above a lattice may have several bases. A "good" base consists of vectors with relatively small length. Such a base is called a reduced base. The theory of lattice-base-reduction deals with the problem of finding reduced bases, given a lattice.

Algorithms transforming a given base B into a reduced base B', are called lattice reduction algorithms. A popular one is the algorithm invented by Lenstra, Lenstra and Lovász, L³ for short. A L³-reduced base has ,besides others, the property that its first vector, b'₁ has a length of at most an exponential factor greater than the smallest nonzero vector of the lattice, spanned by the vectors of B:

||b'₁|| <= 2^(n-1)/2||v||, v in L.

This paragraph shows as to solve a subset sum problem with a knapsack having low density. Because cryptographic knapsacks almost always have a low density, this method applies to them as well.

Given a public key B = (b₁,b₂, ..., b_n) a cipher c = pB = b₁p₁ + ... + b_np_n
Wanted the plaintext p

y₁ =	p₁ + 0 + ... + 0 - 0.5 = p₁ - 0.5
y₂ =	0 + p₂ + 0 + ... + 0 - 0.5 = p₂ - 0.5
...
y_i =	0 + ... + 0 + p_i + 0 + ... +0 - 0.5 = p_i - 0.5
...
y_n =	0 + ... + 0 + p_n - 0.5 = p_n - 0.5
y_n+1 =	t(b₁p₁ + b₂p₂+ ... + b_np_n) - tc = 0

M =
	1	0	0	...	0	tb₁
	0	1	0	...	0	tb₂
	0	0	1	...	0	tb₃
	...	...	...	...	...	...
	0	0	0	...	1	tb_n
	0.5	0.5	0.5	...	0.5	tc