III Quantum Computation - Some quantum algorithms

3Some quantum algorithms

III Quantum Computation

3.3 Shor’s algorithm

All that was a warm up for Shor’s algorithm. This is a quantum algorithm that

factorizes numbers in p olynomial time. The crux of the algorithm will be a

modified version of the quantum Fourier transform.

The precise statement of the problem is as follows — given an integer

with

log N

digits, we want to find a factor 1

< K < N

. Shor’s algorithm will

achieve this with constant probability (1

− ε

) in

(

) time. The b est known

classical algorithm is e

O(n

1/3

(log n)

2/3

)

To do this, we will use the periodicity algorithm. However, there is one

subtlety involved. Instead of working in

Z/nZ

, we need to work in

. Since

computers cannot work with infinitely many numbers, we will have to truncate

it somehow. Since we have no idea what the period of our function will be, we

must truncate it randomly, and we need to make sure we can control the error

introduced by the truncation.

We shall now begin. Given an

, we first choose some 1

< a < N

uniformly

randomly, and compute

hcf

(

a, N

). If it is not equal to 1, then we are finished.

Otherwise, by Euler’s theorem, there is a least power

such that

≡

mod N

. The number

is called the order of

mod

. It follows that the

function

Z → Z/N Z

given by

(

) =

mod N

has period

, and is injective

in each period.

Note that

(

) can be efficiently computed in

poly

(

log k

) time, by repeated

squaring. Also note that classically, it is hard to find

, even though

has a

simple formula!

It was known to Legendre in 1800 that knowing

means we can factor

Suppose we can find r, and further suppose r is even. Then we have

− 1 ≡ (a

r/2

+ 1)(a

r/2

− 1) ≡ 0 (mod N).

exactly divides the product. By minimality of

, we know

does not

divide

r/2

−

1. So if

does not divide

r/2

+ 1 as well, then

hcf

(

N, a

r/2

are non-trivial factors of N .

For this to work, we needed two assumptions –

is even, and

r/2

6≡ −

(

mod N

). Fortunately, there is a theorem in number theory that says if

odd and not a prime power, and

is chosen uniformly at random, then the

probability that these two things happen is at least

. In fact, it is

≥

−

m−1

where m is the number of prime factors of N.

So if we repeat this

times, the probability that they all fail to give a factor

is less than

. So this can be as small as we wish.

What about the other possibilities? If

is even, then we would have noticed

by looking at the last digit, and we can just write down 2. If

for

c, ` >

then there is a classical p olynomial time algorithm that outputs

, which is a

factor. So these are the easy cases.

Everything we’ve done so far is classical! The quantum part comes in when

we want to compute

. We know that

(

) =

is periodic on

, which is an

infinite domain. So we cannot just apply our periodicity algorithm.

By number theory, we know that

is at most

. But other than that, we

have no idea what

actually is, nor do we know of any multiple of

. So we

cannot apply the periodicity argument directly. Instead, we pick a big number

, and work on the domain

{

, ··· ,

−

}

. How do we

choose

? The idea is that we want 0

, ··· ,

−

1 to contain

full periods,

plus some extra “corrupt” noise b, so

= Br + b,

with 0

≤ b < r

. Since we want to separate out the periodicity information from

the corrupt noise, we will want

to be relatively small, compared to

. We

know the size of

is bounded by

, hence by

. So we need 2

to be “much

larger” than

. It turns out picking 2

> N

is enough, and we will pick

be the smallest number such that this holds.

We now study the effect of corruption on the periodicity algorithm. We again

make the state

|fi =

√

|xi|f (x)i.

and measure the value of f. We then get

|peri =

√

A−1

k=0

+ kri,

where

+ 1, depending on whether

≤ b

or not. As before, we

apply QFT

to obtain

QFT

|peri =

−1

c=0

f(c) |ci.

When we did this before, with an exact period, most of the

(

) is zero. However,

this time things are a bit more messy. As before, we have

f(c) =

√

[1 + α + ··· + α

A−1

], α = e

2πicr/2

The important question is, when we measure this, which

’s will we see with

“good probability”? With exact periodicity, we knew that

is an exact

integer. So

(

) = 0 except when

is a multiple of

. Intuitively, we can think

of this as interference, and we had totally destructive and totally constructive

interference respectively.

In the inexact case, we will get constructive interference for those

such that

the phase

is c lose to 1. These are the

’s with

nearest to integers

, and

the powers up to

A−1

don’t spread too far around the unit circle. So we avoid

cancellations.

So we look at those special

’s having this particular prop erty. As

increases

from 0 to 2

−

1, the angle

increments by

each time from 0 up to

. So

we have c

’s for each k = 0, 1, ··· , r − 1 such that



− k



In other words, we have



− k



So the c

are the integers nearest to the multiples of 2

/r.

(

), the

’s corresponding to the

’s have the smallest phases, i.e. nearest

to the positive real axis. We write

= k + ξ,

where

k ∈ Z, |ξ| <

Then we have

= exp



2πi



= exp (eπi(k + ξ)n) = exp(2πiξn)

Now for

n < A

, we know that

ξn| < π

, and thus 1

, α, α

, ··· , α

A−1

all lie in

the lower half plane or upper half plane.

Doing all the algebra needed, we find that if

QFT |peri

is measured, then for

any c

as above, we have

Prob(c

) >

where

γ =

≈ 0.4.

Recall that in the exact periodicity case, the points

hit the integers exactly,

and instead of γ we had 1. The distribution of the c’s then look like:

With inexact periods, we obtain something like

Now how do we get r from a c

? We know



−



m+1

We claim that there is at most 1 fraction

with denominator

< N

such that

this inequality holds. So this inequality do es uniquely determine k/r.

Indeed, suppose

and

both work. Then we have



−



r − r

However, we also have



−



≤



−



−



So it follows that we must have

We introduce the notion of a “good”

value, which is when

is coprime to

r. The probability of getting a good c

is again

O(1/ log log r) > O(1/ log log N).

Note that this is the same rate as the case of exact periodicity, since we have

only lost a constant factor of

! If we did have such a

, then now

is uniquely

determined.

However, there is still the problem of finding

from a good

value. At this

point, this is just classical number theory.

We can certainly try all

with

< r

< N

and find the closest one to

, but there are

(

) fractions to try, but we want a

(

poly

(

log N

))

algorithm. Indeed, if we were to do it this way, we might as well try all numbers

less than N and see if they divide N . And this is just O(N )!

The answer comes from the nice theory of continued fractions. Any rational

number

< 1 has a continued fraction expansion

+ ···

Indeed to do this, we simply write

where we divide

to get

, and then put

. We then keep

going on with

. Since the numbers

, t

keep getting smaller, it follows that

this process will eventually terminate.

Since it is very annoying to type these continued fractions in L

X, we often

write the continued fraction as

= [a

, a

, ··· , a

We define the kth convergent of

to be

= [a

, a

, ··· , a

There are some magic results from number theory that gives us a simple recur-

rence relation for the convergents.

Lemma. For a

, a

, ··· , a

any positive reals, we set

= 0 q

= 1

= 1 q

= a

We then define

= a

k−1

+ p

k−2

= a

k−1

+ q

k−2

Then we have

(i) We have

, ··· , a

] =

(ii) We also have

k−1

− p

k−1

= (−1)

In particular, p

and q

are coprime.

From a bit more number theory, we find that

Fact.

s < t

are

-bit integers, then the continued fraction has length

(

and all convergents

can be computed in O(m

) time.

More importantly, we have the following result:

Fact. Let 0 < x < 1 be rational, and suppose

is rational with



x −



Then

is a convergent of the continued fraction of x.

Then by this theorem, for a goo d

, we know

must be a conve rgent of

So we compute all convergents find a (unique) one whose denominator is less

than

and is within

. This gives us the value of

, and we are done.

In fact, this last classical part is the slowest part of the algorithm.

Example.

Suppose we want to factor

= 39. Suppose the random

we chose

= 7

39, which is coporime to

. Let

be the period of

(

) = 7

mod

39.

We notice

1024 = 2

< N

= 1621 < 2

= 2048.

So we pick m = 11. Suppose the measurement of QFT

|peri yeilds c = 853.

By the theory, this has a constant probability (approximately 0

4) to satisfy



853

−



m+1

We also have a probability of

/ log log r

) to have

and

coprime. In this

case, c is indeed “good”. So there is a unique

satisfying



853

2048

−



So to find

, we do the continued fraction expansion of

853

2048

. We have

853

2048

853

2 +

342

853

2 +

853

342

2 +

169

342

= ··· = [2, 2, 2, 42, 4].

We can then compute the convergents

[2] =

[2, 2] =

[2, 2, 2] =

[2, 2, 2, 42] =

212

509

[2, 2, 2, 42, 4] =

853

2048

Of all these numbers, only

is within

853

2048

and whose denominator is

less than N = 39.

If we do not assume k and r are coprime, then the possible

are

If we assume that

are coprime, then r = 12. Indeed, we can try that

≡ 1 (mod 39).

So we now know that

39 | (7

+ 1)(7

− 1).

We now hope/expect with probability

exactly that it goes partly into each

factor. We can compute

+ 1 = 117650 ≡ 26 (mod 39)

− 1 = 117648 ≡ 24 (mod 39)

We can then compute

hcf(26, 39) = 13, hcf(24, 39) = 3 (mod 39).

We see that 3 and 13 are factors of 39.