Skip to main content
IBM Quantum Platform

Quantum information

We're now prepared to move on to quantum information in the setting of multiple systems. Much like in the previous lesson on single systems, the mathematical description of quantum information for multiple systems is quite similar to the probabilistic case and makes use of similar concepts and techniques.


Quantum states

Multiple systems can be viewed collectively as single, compound systems. We've already observed this in the probabilistic setting, and the quantum setting is analogous. Quantum states of multiple systems are therefore represented by column vectors having complex number entries and Euclidean norm equal to 1,1, just like quantum states of single systems. In the multiple system case, the entries of these vectors are placed in correspondence with the Cartesian product of the classical state sets associated with each of the individual systems, because that's the classical state set of the compound system.

For instance, if X\mathsf{X} and Y\mathsf{Y} are qubits, then the classical state set of the pair of qubits (X,Y),(\mathsf{X},\mathsf{Y}), viewed collectively as a single system, is the Cartesian product {0,1}×{0,1}.\{0,1\}\times\{0,1\}. By representing pairs of binary values as binary strings of length two, we associate this Cartesian product set with the set {00,01,10,11}.\{00,01,10,11\}. The following vectors are therefore all examples of quantum state vectors of the pair (X,Y):(\mathsf{X},\mathsf{Y}):

12001601+i610+1611,35004511,and01. \frac{1}{\sqrt{2}} \vert 00 \rangle - \frac{1}{\sqrt{6}} \vert 01\rangle + \frac{i}{\sqrt{6}} \vert 10\rangle + \frac{1}{\sqrt{6}} \vert 11\rangle, \quad \frac{3}{5} \vert 00\rangle - \frac{4}{5} \vert 11\rangle, \quad \text{and} \quad \vert 01 \rangle.

There are variations on how quantum state vectors of multiple systems are expressed, and we can choose whichever variation suits our preferences. Here are some examples for the first quantum state vector above.

  1. We may use the fact that ab=ab\vert ab\rangle = \vert a\rangle \vert b\rangle (for any classical states aa and bb) to instead write

    12001601+i610+1611.\frac{1}{\sqrt{2}} \vert 0\rangle\vert 0 \rangle - \frac{1}{\sqrt{6}} \vert 0\rangle\vert 1\rangle + \frac{i}{\sqrt{6}} \vert 1\rangle\vert 0\rangle + \frac{1}{\sqrt{6}} \vert 1\rangle\vert 1\rangle.
  2. We may choose to write the tensor product symbol explicitly like this:

    12001601+i610+1611.\frac{1}{\sqrt{2}} \vert 0\rangle\otimes\vert 0 \rangle - \frac{1}{\sqrt{6}} \vert 0\rangle\otimes\vert 1\rangle + \frac{i}{\sqrt{6}} \vert 1\rangle\otimes\vert 0\rangle + \frac{1}{\sqrt{6}} \vert 1\rangle\otimes\vert 1\rangle.
  3. We may subscript the kets to indicate how they correspond to the systems being considered, like this:

    120X0Y160X1Y+i61X0Y+161X1Y.\frac{1}{\sqrt{2}} \vert 0\rangle_{\mathsf{X}}\vert 0 \rangle_{\mathsf{Y}} - \frac{1}{\sqrt{6}} \vert 0\rangle_{\mathsf{X}}\vert 1\rangle_{\mathsf{Y}} + \frac{i}{\sqrt{6}} \vert 1\rangle_{\mathsf{X}}\vert 0\rangle_{\mathsf{Y}} + \frac{1}{\sqrt{6}} \vert 1\rangle_{\mathsf{X}}\vert 1\rangle_{\mathsf{Y}}.

Of course, we may also write quantum state vectors explicitly as column vectors:

(1216i616). \begin{pmatrix} \frac{1}{\sqrt{2}}\\[2mm] - \frac{1}{\sqrt{6}}\\[2mm] \frac{i}{\sqrt{6}}\\[2mm] \frac{1}{\sqrt{6}} \end{pmatrix}.

Depending upon the context in which it appears, one of these variations may be preferred — but they are all equivalent in the sense that they describe the same vector.

Tensor products of quantum state vectors

Similar to what we have for probability vectors, tensor products of quantum state vectors are also quantum state vectors — and again they represent independence among systems.

In greater detail, and beginning with the case of two systems, suppose that ϕ\vert \phi \rangle is a quantum state vector of a system X\mathsf{X} and ψ\vert \psi \rangle is a quantum state vector of a system Y.\mathsf{Y}. The tensor product ϕψ,\vert \phi \rangle \otimes \vert \psi \rangle, which may alternatively be written as ϕψ\vert \phi \rangle \vert \psi \rangle or as ϕψ,\vert \phi \otimes \psi \rangle, is then a quantum state vector of the joint system (X,Y).(\mathsf{X},\mathsf{Y}). Again we refer to a state of this form as a being a product state.

Intuitively speaking, when a pair of systems (X,Y)(\mathsf{X},\mathsf{Y}) is in a product state ϕψ,\vert \phi \rangle \otimes \vert \psi \rangle, we may interpret this as meaning that X\mathsf{X} is in the quantum state ϕ,\vert \phi \rangle, Y\mathsf{Y} is in the quantum state ψ,\vert \psi \rangle, and the states of the two systems have nothing to do with one another.

The fact that the tensor product vector ϕψ\vert \phi \rangle \otimes \vert \psi \rangle is indeed a quantum state vector is consistent with the Euclidean norm being multiplicative with respect to tensor products:

ϕψ=(a,b)Σ×Γabϕψ2=aΣbΓaϕbψ2=(aΣaϕ2)(bΓbψ2)=ϕψ.\begin{aligned} \bigl\| \vert \phi \rangle \otimes \vert \psi \rangle \bigr\| & = \sqrt{ \sum_{(a,b)\in\Sigma\times\Gamma} \bigl\vert\langle ab \vert \phi\otimes\psi \rangle \bigr\vert^2 }\\[1mm] & = \sqrt{ \sum_{a\in\Sigma} \sum_{b\in\Gamma} \bigl\vert\langle a \vert \phi \rangle \langle b \vert \psi \rangle \bigr\vert^2 }\\[1mm] & = \sqrt{ \biggl(\sum_{a\in\Sigma} \bigl\vert \langle a \vert \phi \rangle \bigr\vert^2 \biggr) \biggl(\sum_{b\in\Gamma} \bigl\vert \langle b \vert \psi \rangle \bigr\vert^2 \biggr) }\\[1mm] & = \bigl\| \vert \phi \rangle \bigr\| \bigl\| \vert \psi \rangle \bigr\|. \end{aligned}

Because ϕ\vert \phi \rangle and ψ\vert \psi \rangle are quantum state vectors, we have ϕ=1\|\vert \phi \rangle\| = 1 and ψ=1,\|\vert \psi \rangle\| = 1, and therefore ϕψ=1,\|\vert \phi \rangle \otimes \vert \psi \rangle\| = 1, so ϕψ\vert \phi \rangle \otimes \vert \psi \rangle is also a quantum state vector.

This generalizes to more than two systems. If ψ0,,ψn1\vert \psi_0 \rangle,\ldots,\vert \psi_{n-1} \rangle are quantum state vectors of systems X0,,Xn1,\mathsf{X}_0,\ldots,\mathsf{X}_{n-1}, then ψn1ψ0\vert \psi_{n-1} \rangle\otimes\cdots\otimes \vert \psi_0 \rangle is a quantum state vector representing a product state of the joint system (Xn1,,X0).(\mathsf{X}_{n-1},\ldots,\mathsf{X}_0). Again, we know that this is a quantum state vector because

ψn1ψ0=ψn1ψ0=1n=1. \bigl\| \vert \psi_{n-1} \rangle\otimes\cdots\otimes \vert \psi_0 \rangle \bigr\| = \bigl\|\vert \psi_{n-1} \rangle\bigl\| \cdots \bigl\|\vert \psi_0 \rangle \bigr\| = 1^n = 1.

Entangled states

Not all quantum state vectors of multiple systems are product states. For example, the quantum state vector

1200+1211(1) \frac{1}{\sqrt{2}} \vert 00\rangle + \frac{1}{\sqrt{2}} \vert 11\rangle \tag{1}

of two qubits is not a product state. To reason this, we may follow exactly the same argument that we used in the previous section for a probabilistic state. That is, if (1)(1) were a product state, there would exist quantum state vectors ϕ\vert\phi\rangle and ψ\vert\psi\rangle for which

ϕψ=1200+1211. \vert\phi\rangle\otimes\vert\psi\rangle = \frac{1}{\sqrt{2}} \vert 00\rangle + \frac{1}{\sqrt{2}} \vert 11\rangle.

But then it would necessarily be the case that

0ϕ1ψ=01ϕψ=0 \langle 0 \vert \phi\rangle \langle 1 \vert \psi\rangle = \langle 01 \vert \phi\otimes\psi\rangle = 0

implying that 0ϕ=0\langle 0 \vert \phi\rangle = 0 or 1ψ=0\langle 1 \vert \psi\rangle = 0 (or both). That contradicts the fact that

0ϕ0ψ=00ϕψ=12 \langle 0 \vert \phi\rangle \langle 0 \vert \psi\rangle = \langle 00 \vert \phi\otimes\psi\rangle = \frac{1}{\sqrt{2}}

and

1ϕ1ψ=11ϕψ=12 \langle 1 \vert \phi\rangle \langle 1 \vert \psi\rangle = \langle 11 \vert \phi\otimes\psi\rangle = \frac{1}{\sqrt{2}}

are both nonzero. Thus, the quantum state vector (1)(1) represents a correlation between two systems, and specifically we say that the systems are entangled.

Notice that the specific value 1/21/\sqrt{2} is not important to this argument — all that is important is that this value is nonzero. Thus, for instance, the quantum state

3500+4511 \frac{3}{5} \vert 00\rangle + \frac{4}{5} \vert 11\rangle

is also not a product state, by the same argument.

Entanglement is a quintessential feature of quantum information that will be discussed in greater detail in a later lesson. Entanglement can be complicated, particularly for the sorts of noisy quantum states that can be described by density matrices (which are discussed in the General formulation of quantum information course, which is the third course in the Understanding Quantum Information and Computation series). For quantum state vectors, however, entanglement is equivalent to correlation: any quantum state vector that is not a product state represents an entangled state.

In contrast, the quantum state vector

1200+i2011210i211 \frac{1}{2} \vert 00\rangle + \frac{i}{2} \vert 01\rangle - \frac{1}{2} \vert 10\rangle - \frac{i}{2} \vert 11\rangle

is an example of a product state.

1200+i2011210i211=(120121)(120+i21) \frac{1}{2} \vert 00\rangle + \frac{i}{2} \vert 01\rangle - \frac{1}{2} \vert 10\rangle - \frac{i}{2} \vert 11\rangle = \biggl( \frac{1}{\sqrt{2}}\vert 0\rangle - \frac{1}{\sqrt{2}}\vert 1\rangle \biggr) \otimes \biggl( \frac{1}{\sqrt{2}}\vert 0\rangle + \frac{i}{\sqrt{2}}\vert 1\rangle \biggr)

Hence, this state is not entangled.

Bell states

We'll now take a look as some important examples of multiple-qubit quantum states, beginning with the Bell states. These are the following four two-qubit states:

ϕ+=1200+1211ϕ=12001211ψ+=1201+1210ψ=12011210\begin{aligned} \vert \phi^+ \rangle & = \frac{1}{\sqrt{2}} \vert 00 \rangle + \frac{1}{\sqrt{2}} \vert 11 \rangle \\[3mm] \vert \phi^- \rangle & = \frac{1}{\sqrt{2}} \vert 00 \rangle - \frac{1}{\sqrt{2}} \vert 11 \rangle \\[3mm] \vert \psi^+ \rangle & = \frac{1}{\sqrt{2}} \vert 01 \rangle + \frac{1}{\sqrt{2}} \vert 10 \rangle \\[3mm] \vert \psi^- \rangle & = \frac{1}{\sqrt{2}} \vert 01 \rangle - \frac{1}{\sqrt{2}} \vert 10 \rangle \end{aligned}

The Bell states are so-named in honor of John Stewart Bell (1928—1990) was a physicist who made important contributions to the foundations of quantum theory. Notice that the same argument that establishes that ϕ+\vert\phi^+\rangle is not a product state reveals that none of the other Bell states are product states either: all four of the Bell states represent entanglement between two qubits.

The collection of all four Bell states

{ϕ+,ϕ,ψ+,ψ} \bigl\{\vert \phi^+ \rangle, \vert \phi^- \rangle, \vert \psi^+ \rangle, \vert \psi^- \rangle\bigr\}

is known as the Bell basis. True to its name, this is a basis; any quantum state vector of two qubits, or indeed any complex vector at all having entries corresponding to the four classical states of two bits, can be expressed as a linear combination of the four Bell states. For example,

00=12ϕ++12ϕ. \vert 0 0 \rangle = \frac{1}{\sqrt{2}} \vert \phi^+\rangle + \frac{1}{\sqrt{2}} \vert \phi^-\rangle.

GHZ and W states

Next we will consider two interesting examples of states of three qubits. The first example is the GHZ state (so named in honor of Daniel Greenberger, Michael Horne, and Anton Zeilinger, who first studied some of its properties):

12000+12111. \frac{1}{\sqrt{2}} \vert 000\rangle + \frac{1}{\sqrt{2}} \vert 111\rangle.

The second example is the so-called W state:

13001+13010+13100. \frac{1}{\sqrt{3}} \vert 001\rangle + \frac{1}{\sqrt{3}} \vert 010\rangle + \frac{1}{\sqrt{3}} \vert 100\rangle.

Neither of these states is a product state, meaning that they cannot be written as a tensor product of three qubit quantum state vectors. We'll examine both of these states later when we discuss partial measurements of quantum states of multiple systems.

Additional examples

The examples of quantum states of multiple systems we've seen so far are states of two or three qubits, but we can also consider quantum states of multiple systems having different classical state sets.

For example, here's a quantum state of three systems, X,\mathsf{X}, Y,\mathsf{Y}, and Z,\mathsf{Z}, where the classical state set of X\mathsf{X} is the binary alphabet (so X\mathsf{X} is a qubit) and the classical state set of Y\mathsf{Y} and Z\mathsf{Z} is {,,,}:\{\clubsuit,\diamondsuit,\heartsuit,\spadesuit\}:

120+121120. \frac{1}{2} \vert 0 \rangle \vert \heartsuit\rangle \vert \heartsuit \rangle + \frac{1}{2} \vert 1 \rangle \vert \spadesuit\rangle \vert \heartsuit \rangle - \frac{1}{\sqrt{2}} \vert 0 \rangle \vert \heartsuit\rangle \vert \diamondsuit \rangle.

And here's an example of a quantum state of three systems, X,\mathsf{X}, Y,\mathsf{Y}, and Z,\mathsf{Z}, that all share the same classical state set {0,1,2}:\{0,1,2\}:

012021+120102+2012106. \frac{ \vert 012 \rangle - \vert 021 \rangle + \vert 120 \rangle - \vert 102 \rangle + \vert 201 \rangle - \vert 210 \rangle }{\sqrt{6}}.

Systems having the classical state set {0,1,2}\{0,1,2\} are often called trits or (assuming that they can be in a quantum state) qutrits. The term qudit refers to a system having classical state set {0,,d1}\{0,\ldots,d-1\} for an arbitrary choice of d.d.


Measurements of quantum states

Standard basis measurements of quantum states of single systems were discussed in the previous lesson: if a system having classical state set Σ\Sigma is in a quantum state represented by the vector ψ,\vert \psi \rangle, and that system is measured (with respect to a standard basis measurement), then each classical state aΣa\in\Sigma appears with probability aψ2.\vert \langle a \vert \psi \rangle\vert^2. This tells us what happens when we have a quantum state of multiple systems and choose to measure the entire compound system, which is equivalent to measuring all of the systems.

To state this precisely, let us suppose that X0,,Xn1\mathsf{X}_0,\ldots,\mathsf{X}_{n-1} are systems having classical state sets Σ0,,Σn1,\Sigma_0,\ldots,\Sigma_{n-1}, respectively. We may then view (Xn1,,X0)(\mathsf{X}_{n-1},\ldots,\mathsf{X}_0) collectively as a single system whose classical state set is the Cartesian product Σn1××Σ0.\Sigma_{n-1}\times\cdots\times\Sigma_0. If a quantum state of this system is represented by the quantum state vector ψ,\vert\psi\rangle, and all of the systems are measured, then each possible outcome (an1,,a0)Σn1××Σ0(a_{n-1},\ldots,a_0)\in\Sigma_{n-1}\times\cdots\times\Sigma_0 appears with probability an1a0ψ2.\vert\langle a_{n-1}\cdots a_0\vert \psi\rangle\vert^2.

For example, if systems X\mathsf{X} and Y\mathsf{Y} are jointly in the quantum state

3504i51,\frac{3}{5} \vert 0\rangle \vert \heartsuit \rangle - \frac{4i}{5} \vert 1\rangle \vert \spadesuit \rangle,

then measuring both systems with standard basis measurements yields the outcome (0,)(0,\heartsuit) with probability 9/259/25 and the outcome (1,)(1,\spadesuit) with probability 16/25.16/25.

Partial measurements

Now let us consider the situation in which we have multiple systems in some quantum state, and we measure a proper subset of the systems. As before, we will begin with two systems X\mathsf{X} and Y\mathsf{Y} having classical state sets Σ\Sigma and Γ,\Gamma, respectively.

In general, a quantum state vector of (X,Y)(\mathsf{X},\mathsf{Y}) takes the form

ψ=(a,b)Σ×Γαabab, \vert \psi \rangle = \sum_{(a,b)\in\Sigma\times\Gamma} \alpha_{ab} \vert ab\rangle,

where {αab:(a,b)Σ×Γ}\{\alpha_{ab} : (a,b)\in\Sigma\times\Gamma\} is a collection of complex numbers satisfying

(a,b)Σ×Γαab2=1, \sum_{(a,b)\in\Sigma\times\Gamma} \vert \alpha_{ab} \vert^2 = 1,

which is equivalent to ψ\vert \psi \rangle being a unit vector.

We already know, from the discussion above, that if both X\mathsf{X} and Y\mathsf{Y} are measured, then each possible outcome (a,b)Σ×Γ(a,b)\in\Sigma\times\Gamma appears with probability

abψ2=αab2. \bigl\vert \langle ab \vert \psi \rangle \bigr\vert^2 = \vert\alpha_{ab}\vert^2.

If we suppose instead that just the first system X\mathsf{X} is measured, the probability for each outcome aΣa\in\Sigma to appear must therefore be equal to

bΓabψ2=bΓαab2. \sum_{b\in\Gamma} \bigl\vert \langle ab \vert \psi \rangle \bigr\vert^{2} = \sum_{b\in\Gamma} \vert\alpha_{ab}\vert^2.

This is consistent with what we already saw in the probabilistic setting, as well as our current understanding of physics: the probability for each outcome to appear when X\mathsf{X} is measured can't possibly depend on whether or not Y\mathsf{Y} was also measured, as that would allow for faster-than-light communication.

Having obtained a particular outcome aΣa\in\Sigma of a standard basis measurement of X,\mathsf{X}, we naturally expect that the quantum state of X\mathsf{X} changes so that it is equal to a,\vert a\rangle, just like we had for single systems. But what happens to the quantum state of Y\mathsf{Y}?

To answer this question, we can first express the vector ψ\vert\psi\rangle as

ψ=aΣaϕa, \vert\psi\rangle = \sum_{a\in\Sigma} \vert a \rangle \otimes \vert \phi_a \rangle,

where

ϕa=bΓαabb \vert \phi_a \rangle = \sum_{b\in\Gamma} \alpha_{ab} \vert b\rangle

for each aΣ.a\in\Sigma. Here we're following the same methodology as in the probabilistic case, of isolating the standard basis states of the system being measured. The probability for the standard basis measurement of X\mathsf{X} to give each outcome aa is as follows:

bΓαab2=ϕa2. \sum_{b\in\Gamma} \vert\alpha_{ab}\vert^2 = \bigl\| \vert \phi_a \rangle \bigr\|^2.

And, as a result of the standard basis measurement of X\mathsf{X} giving the outcome a,a, the quantum state of the pair (X,Y)(\mathsf{X},\mathsf{Y}) together becomes

aϕaϕa. \vert a \rangle \otimes \frac{\vert \phi_a \rangle}{\|\vert \phi_a \rangle\|}.

That is, the state "collapses" like in the single-system case, but only as far as is required for the state to be consistent with the measurement of X\mathsf{X} having produced the outcome a.a.

Informally speaking, aϕa\vert a \rangle \otimes \vert \phi_a\rangle represents the component of ψ\vert \psi\rangle that is consistent with the a measurement of X\mathsf{X} resulting in the outcome a.a. We then normalize this vector — by dividing it by its Euclidean norm, which is equal to ϕa\|\vert\phi_a\rangle\| — to obtain a valid quantum state vector having Euclidean norm equal to 1.1. This normalization step is analogous to what we did in the probabilistic setting when we divided vectors by the sum of their entries to obtain a probability vector.

As an example, consider the state of two qubits (X,Y)(\mathsf{X},\mathsf{Y}) from the beginning of the section:

ψ=12001601+i610+1611. \vert \psi \rangle = \frac{1}{\sqrt{2}} \vert 00 \rangle - \frac{1}{\sqrt{6}} \vert 01 \rangle + \frac{i}{\sqrt{6}} \vert 10 \rangle + \frac{1}{\sqrt{6}} \vert 11 \rangle.

To understand what happens when the first system X\mathsf{X} is measured, we begin by writing

ψ=0(120161)+1(i60+161). \vert \psi \rangle = \vert 0 \rangle \otimes \biggl( \frac{1}{\sqrt{2}} \vert 0 \rangle - \frac{1}{\sqrt{6}} \vert 1 \rangle \biggr) + \vert 1 \rangle \otimes \biggl( \frac{i}{\sqrt{6}} \vert 0 \rangle + \frac{1}{\sqrt{6}} \vert 1 \rangle \biggr).

We now see, based on the description above, that the probability for the measurement to result in the outcome 00 is

1201612=12+16=23, \biggl\|\frac{1}{\sqrt{2}} \vert 0 \rangle -\frac{1}{\sqrt{6}} \vert 1 \rangle\biggr\|^2 = \frac{1}{2} + \frac{1}{6} = \frac{2}{3},

in which case the state of (X,Y)(\mathsf{X},\mathsf{Y}) becomes

012016123=0(320121); \vert 0\rangle \otimes \frac{\frac{1}{\sqrt{2}} \vert 0 \rangle -\frac{1}{\sqrt{6}} \vert 1 \rangle}{\sqrt{\frac{2}{3}}} = \vert 0\rangle \otimes \Biggl( \frac{\sqrt{3}}{2} \vert 0 \rangle - \frac{1}{2} \vert 1\rangle\Biggr);

and the probability for the measurement to result in the outcome 11 is

i60+1612=16+16=13, \biggl\|\frac{i}{\sqrt{6}} \vert 0 \rangle + \frac{1}{\sqrt{6}} \vert 1 \rangle\biggr\|^2 = \frac{1}{6} + \frac{1}{6} = \frac{1}{3},

in which case the state of (X,Y)(\mathsf{X},\mathsf{Y}) becomes

1i60+16113=1(i20+121). \vert 1\rangle \otimes \frac{\frac{i}{\sqrt{6}} \vert 0 \rangle +\frac{1}{\sqrt{6}} \vert 1 \rangle}{\sqrt{\frac{1}{3}}} = \vert 1\rangle \otimes \Biggl( \frac{i}{\sqrt{2}} \vert 0 \rangle +\frac{1}{\sqrt{2}} \vert 1\rangle\Biggr).

The same technique, used in a symmetric way, describes what happens if the second system Y\mathsf{Y} is measured rather than the first. This time we rewrite the vector ψ\vert \psi \rangle as

ψ=(120+i61)0+(160+161)1. \vert \psi \rangle = \biggl( \frac{1}{\sqrt{2}} \vert 0 \rangle + \frac{i}{\sqrt{6}} \vert 1 \rangle \biggr) \otimes \vert 0\rangle + \biggl( -\frac{1}{\sqrt{6}} \vert 0 \rangle +\frac{1}{\sqrt{6}} \vert 1\rangle \biggr) \otimes \vert 1\rangle.

The probability that the measurement of Y\mathsf{Y} gives the outcome 00 is

120+i612=12+16=23,\biggl\| \frac{1}{\sqrt{2}} \vert 0 \rangle + \frac{i}{\sqrt{6}} \vert 1 \rangle \biggr\|^2 = \frac{1}{2} + \frac{1}{6} = \frac{2}{3},

in which case the state of (X,Y)(\mathsf{X},\mathsf{Y}) becomes

120+i61230=(320+i21)0; \frac{\frac{1}{\sqrt{2}} \vert 0 \rangle + \frac{i}{\sqrt{6}} \vert 1 \rangle}{\sqrt{\frac{2}{3}}} \otimes \vert 0 \rangle = \biggl(\frac{\sqrt{3}}{2} \vert 0 \rangle + \frac{i}{2} \vert 1 \rangle\biggr) \otimes\vert 0 \rangle;

and the probability that the measurement outcome is 11 is

160+1612=16+16=13, \biggl\| -\frac{1}{\sqrt{6}} \vert 0 \rangle +\frac{1}{\sqrt{6}} \vert 1\rangle \biggr\|^2 = \frac{1}{6} + \frac{1}{6} = \frac{1}{3},

in which case the state of (X,Y)(\mathsf{X},\mathsf{Y}) becomes

160+161131=(120+121)1.\frac{ -\frac{1}{\sqrt{6}} \vert 0 \rangle +\frac{1}{\sqrt{6}} \vert 1\rangle }{\frac{1}{\sqrt{3}}} \otimes \vert 1\rangle = \biggl(-\frac{1}{\sqrt{2}} \vert 0\rangle + \frac{1}{\sqrt{2}} \vert 1\rangle\biggr) \otimes \vert 1\rangle.

Remark on reduced quantum states

The previous example shows a limitation of the simplified description of quantum information, which is that it does not offer us a way to describe the reduced (or marginal) quantum state of just one of two systems (or of a proper subset of any number of systems) like in the probabilistic case.

Specifically, for a probabilistic state of two systems (X,Y)(\mathsf{X},\mathsf{Y}) described by a probability vector

(a,b)Σ×Γpabab, \sum_{(a,b)\in\Sigma\times\Gamma} p_{ab} \vert ab\rangle,

we can write the reduced or marginal probabilistic state of X\mathsf{X} alone as

aΣ(bΓpab)a=(a,b)Σ×Γpaba. \sum_{a\in\Sigma} \biggl( \sum_{b\in\Gamma} p_{ab}\biggr) \vert a\rangle = \sum_{(a,b)\in\Sigma\times\Gamma} p_{ab} \vert a\rangle.

For quantum state vectors, there isn't an analogous way to do this. In particular, for a quantum state vector

ψ=(a,b)Σ×Γαabab, \vert \psi \rangle = \sum_{(a,b)\in\Sigma\times\Gamma} \alpha_{ab} \vert ab\rangle,

the vector

(a,b)Σ×Γαaba \sum_{(a,b)\in\Sigma\times\Gamma} \alpha_{ab} \vert a\rangle

is not a quantum state vector in general, and does not properly represent the concept of a reduced or marginal state.

What we may do instead is turn to the notion of a density matrix, which is discussed in the General formulation of quantum information course. Density matrices provide us with a meaningful way to define reduced quantum states that is analogous to the probabilistic setting.

Partial measurements for three or more systems

Partial measurements for three or more systems, where some proper subset of the systems are measured, can be reduced to the case of two systems by dividing the systems into two collections, those that are measured and those that are not. Here is a specific example that illustrates how this can be done. It demonstrates specifically how subscripting kets by the names of the systems they represent can be useful — in this case because it gives us a simple way to describe permutations of the systems.

For this example, we'll consider a quantum state of a 5-tuple of systems (X4,,X0),(\mathsf{X}_4,\ldots,\mathsf{X}_0), where all five of these systems share the same classical state set {,,,}:\{\clubsuit,\diamondsuit,\heartsuit,\spadesuit\}:

17+27+17i2717.\begin{gathered} \sqrt{\frac{1}{7}} \vert\heartsuit\rangle \vert\clubsuit\rangle \vert\diamondsuit\rangle \vert\spadesuit\rangle \vert\spadesuit\rangle + \sqrt{\frac{2}{7}} \vert\diamondsuit\rangle \vert\clubsuit\rangle \vert\diamondsuit\rangle \vert\spadesuit\rangle \vert\clubsuit\rangle + \sqrt{\frac{1}{7}} \vert\spadesuit\rangle \vert\spadesuit\rangle \vert\clubsuit\rangle \vert\diamondsuit\rangle \vert\clubsuit\rangle \\ -i \sqrt{\frac{2}{7}} \vert\heartsuit\rangle \vert\clubsuit\rangle \vert\diamondsuit\rangle \vert\heartsuit\rangle \vert\heartsuit\rangle - \sqrt{\frac{1}{7}} \vert\spadesuit\rangle \vert\heartsuit\rangle \vert\clubsuit\rangle \vert\spadesuit\rangle \vert\clubsuit\rangle. \end{gathered}

We'll consider the situation in which the first and third systems are measured, and the remaining systems are left alone.

Conceptually speaking, there's no fundamental difference between this situation and one in which one of two systems is measured. Unfortunately, because the measured systems are interspersed with the unmeasured systems, we face a hurdle in writing down the expressions needed to perform these calculations.

One way to proceed, as suggested above, is to subscript the kets to indicate which systems they refer to. This gives us a way to keep track of the systems as we permute the ordering of the kets, which makes the mathematics simpler.

First, the quantum state vector above can alternatively be written as

1743210+2743210+1743210i27432101743210.\begin{gathered} \sqrt{\frac{1}{7}} \vert\heartsuit\rangle_4 \vert\clubsuit\rangle_3 \vert\diamondsuit\rangle_2 \vert\spadesuit\rangle_1 \vert\spadesuit\rangle_0 + \sqrt{\frac{2}{7}} \vert\diamondsuit\rangle_4 \vert\clubsuit\rangle_3 \vert\diamondsuit\rangle_2 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0\\ + \sqrt{\frac{1}{7}} \vert\spadesuit\rangle_4 \vert\spadesuit\rangle_3 \vert\clubsuit\rangle_2 \vert\diamondsuit\rangle_1 \vert\clubsuit\rangle_0 -i \sqrt{\frac{2}{7}} \vert\heartsuit\rangle_4 \vert\clubsuit\rangle_3 \vert\diamondsuit\rangle_2 \vert\heartsuit\rangle_1 \vert\heartsuit\rangle_0\\ - \sqrt{\frac{1}{7}} \vert\spadesuit\rangle_4 \vert\heartsuit\rangle_3 \vert\clubsuit\rangle_2 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0. \end{gathered}

Nothing has changed, except that each ket now has a subscript indicating which system it corresponds to. Here we've used the subscripts 0,,4,0,\ldots,4, but the names of the systems themselves could also be used (in a situation where we have system names such as X,\mathsf{X}, Y,\mathsf{Y}, and Z,\mathsf{Z}, for instance).

We can now re-order the kets and collect terms as follows:

1742310+2742310+1742310i27423101742310=42(17310i27310)+42(27310)+42(1731017310).\begin{aligned} & \sqrt{\frac{1}{7}} \vert\heartsuit\rangle_4 \vert\diamondsuit\rangle_2 \vert\clubsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\spadesuit\rangle_0 + \sqrt{\frac{2}{7}} \vert\diamondsuit\rangle_4 \vert\diamondsuit\rangle_2 \vert\clubsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0\\ & \quad + \sqrt{\frac{1}{7}} \vert\spadesuit\rangle_4 \vert\clubsuit\rangle_2 \vert\spadesuit\rangle_3 \vert\diamondsuit\rangle_1 \vert\clubsuit\rangle_0 -i \sqrt{\frac{2}{7}} \vert\heartsuit\rangle_4 \vert\diamondsuit\rangle_2 \vert\clubsuit\rangle_3 \vert\heartsuit\rangle_1 \vert\heartsuit\rangle_0\\ & \quad -\sqrt{\frac{1}{7}} \vert\spadesuit\rangle_4 \vert\clubsuit\rangle_2 \vert\heartsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0\\[2mm] & \hspace{1.5cm} = \vert\heartsuit\rangle_4 \vert\diamondsuit\rangle_2 \biggl( \sqrt{\frac{1}{7}} \vert\clubsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\spadesuit\rangle_0 -i \sqrt{\frac{2}{7}} \vert\clubsuit\rangle_3 \vert\heartsuit\rangle_1 \vert\heartsuit\rangle_0 \biggr)\\ & \hspace{1.5cm} \quad + \vert\diamondsuit\rangle_4 \vert\diamondsuit\rangle_2 \biggl( \sqrt{\frac{2}{7}} \vert\clubsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0 \biggr)\\ & \hspace{1.5cm} \quad + \vert\spadesuit\rangle_4 \vert\clubsuit\rangle_2 \biggl( \sqrt{\frac{1}{7}} \vert\spadesuit\rangle_3 \vert\diamondsuit\rangle_1 \vert\clubsuit\rangle_0 - \sqrt{\frac{1}{7}} \vert\heartsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0\biggr). \end{aligned}

The tensor products are still implicit, even when parentheses are used, as in this example.

To be clear about permuting the kets, tensor products are not commutative: if ϕ\vert \phi\rangle and π\vert \pi \rangle are vectors, then, in general, ϕπ\vert \phi\rangle\otimes\vert \pi \rangle is different from πϕ,\vert \pi\rangle\otimes\vert \phi \rangle, and likewise for tensor products of three or more vectors. For instance, \vert\heartsuit\rangle \vert\clubsuit\rangle \vert\diamondsuit\rangle \vert\spadesuit\rangle \vert\spadesuit\rangle is a different vector than .\vert\heartsuit\rangle \vert\diamondsuit\rangle \vert\clubsuit\rangle \vert\spadesuit\rangle \vert\spadesuit\rangle. Re-ordering the kets as we have just done should not be interpreted as suggesting otherwise.

Rather, for the sake of performing calculations, we're simply making a decision that it's more convenient to collect the systems together as (X4,X2,X3,X1,X0)(\mathsf{X}_4,\mathsf{X}_2,\mathsf{X}_3,\mathsf{X}_1,\mathsf{X}_0) rather than (X4,X3,X2,X1,X0).(\mathsf{X}_4,\mathsf{X}_3,\mathsf{X}_2,\mathsf{X}_1,\mathsf{X}_0). The subscripts on the kets serve to keep this all straight, and we're free to revert back to the original ordering later if we wish to do that.

We now see that, if the systems X4\mathsf{X}_4 and X2\mathsf{X}_2 are measured, the (nonzero) probabilities of the different outcomes are as follow:

  • The measurement outcome (,)(\heartsuit,\diamondsuit) occurs with probability
17310i273102=17+27=37\biggl\| \sqrt{\frac{1}{7}} \vert\clubsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\spadesuit\rangle_0 -i \sqrt{\frac{2}{7}} \vert\clubsuit\rangle_3 \vert\heartsuit\rangle_1 \vert\heartsuit\rangle_0 \biggr\|^2 = \frac{1}{7} + \frac{2}{7} = \frac{3}{7}
  • The measurement outcome (,)(\diamondsuit,\diamondsuit) occurs with probability
273102=27\biggl\| \sqrt{\frac{2}{7}} \vert\clubsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0 \biggr\|^2 = \frac{2}{7}
  • The measurement outcome (,)(\spadesuit,\clubsuit) occurs with probability
17310173102=17+17=27.\biggl\| \sqrt{\frac{1}{7}} \vert\spadesuit\rangle_3 \vert\diamondsuit\rangle_1 \vert\clubsuit\rangle_0 - \sqrt{\frac{1}{7}} \vert\heartsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\clubsuit\rangle_0 \biggr\|^2 = \frac{1}{7} + \frac{1}{7} = \frac{2}{7}.

If the measurement outcome is (,),(\heartsuit,\diamondsuit), for instance, the resulting state of our five systems becomes

4217310i2731037=1343210i2343210.\begin{aligned} & \vert \heartsuit\rangle_4 \vert \diamondsuit \rangle_2 \otimes \frac{ \sqrt{\frac{1}{7}} \vert\clubsuit\rangle_3 \vert\spadesuit\rangle_1 \vert\spadesuit\rangle_0 - i \sqrt{\frac{2}{7}} \vert\clubsuit\rangle_3 \vert\heartsuit\rangle_1 \vert\heartsuit\rangle_0} {\sqrt{\frac{3}{7}}}\\ & \qquad = \sqrt{\frac{1}{3}} \vert \heartsuit\rangle_4 \vert\clubsuit\rangle_3 \vert \diamondsuit \rangle_2\vert\spadesuit\rangle_1 \vert\spadesuit\rangle_0 -i \sqrt{\frac{2}{3}} \vert \heartsuit\rangle_4 \vert\clubsuit\rangle_3 \vert \diamondsuit \rangle_2\vert\heartsuit\rangle_1 \vert\heartsuit\rangle_0. \end{aligned}

Here, for the final answer, we've reverted back to our original ordering of the systems, just to illustrate that we can do this. For the other possible measurement outcomes, the state can be determined in a similar way.

Finally, here are two examples promised earlier, beginning with the GHZ state

12000+12111.\frac{1}{\sqrt{2}} \vert 000\rangle + \frac{1}{\sqrt{2}} \vert 111\rangle.

If just the first system is measured, we obtain the outcome 00 with probability 1/2,1/2, in which case the state of the three qubits becomes 000;\vert 000\rangle; and we also obtain the outcome 11 with probability 1/2,1/2, in which case the state of the three qubits becomes 111.\vert 111\rangle.

For a W state, on the other hand, assuming again that just the first system is measured, we begin by writing this state like this:

13001+13010+13100=0(1301+1310)+1(1300).\begin{aligned} & \frac{1}{\sqrt{3}} \vert 001\rangle + \frac{1}{\sqrt{3}} \vert 010\rangle + \frac{1}{\sqrt{3}} \vert 100\rangle \\ & \qquad = \vert 0 \rangle \biggl( \frac{1}{\sqrt{3}} \vert 01\rangle + \frac{1}{\sqrt{3}} \vert 10\rangle\biggr) + \vert 1 \rangle \biggl(\frac{1}{\sqrt{3}}\vert 00\rangle\biggr). \end{aligned}

The probability that a measurement of the first qubit results in the outcome 0 is therefore equal to

1301+13102=23,\biggl\| \frac{1}{\sqrt{3}} \vert 01\rangle + \frac{1}{\sqrt{3}} \vert 10\rangle \biggr\|^2 = \frac{2}{3},

and conditioned upon the measurement producing this outcome, the quantum state of the three qubits becomes

01301+131023=0(1201+1210)=0ψ+.\vert 0\rangle\otimes \frac{ \frac{1}{\sqrt{3}} \vert 01\rangle + \frac{1}{\sqrt{3}} \vert 10\rangle }{ \sqrt{\frac{2}{3}} } = \vert 0\rangle \biggl(\frac{1}{\sqrt{2}} \vert 01\rangle + \frac{1}{\sqrt{2}} \vert 10\rangle \biggr) = \vert 0\rangle\vert \psi^+\rangle.

The probability that the measurement outcome is 1 is 1/3,1/3, in which case the state of the three qubits becomes 100.\vert 100\rangle.

The W state is symmetric, in the sense that it does not change if we permute the qubits. We therefore obtain a similar description for measuring the second or third qubit rather than the first.


Unitary operations

In principle, any unitary matrix whose rows and columns correspond to the classical states of a system represents a valid quantum operation on that system. This, of course, remains true for compound systems, whose classical state sets happen to be Cartesian products of the classical state sets of the individual systems.

Focusing in on two systems, if X\mathsf{X} is a system having classical state set Σ,\Sigma, and Y\mathsf{Y} is a system having classical state set Γ,\Gamma, then the classical state set of the joint system (X,Y)(\mathsf{X},\mathsf{Y}) is Σ×Γ.\Sigma\times\Gamma. Therefore, quantum operations on this joint system are represented by unitary matrices whose rows and columns are placed in correspondence with the set Σ×Γ.\Sigma\times\Gamma. The ordering of the rows and columns of these matrices is the same as the ordering used for quantum state vectors of the system (X,Y).(\mathsf{X},\mathsf{Y}).

For example, let us suppose that Σ={1,2,3}\Sigma = \{1,2,3\} and Γ={0,1},\Gamma = \{0,1\}, and recall that the standard convention for ordering the elements of the Cartesian product {1,2,3}×{0,1}\{1,2,3\}\times\{0,1\} is this:

(1,0),  (1,1),  (2,0),  (2,1),  (3,0),  (3,1).(1,0),\;(1,1),\;(2,0),\;(2,1),\;(3,0),\; (3,1).

Here's an example of a unitary matrix representing an operation on (X,Y):(\mathsf{X},\mathsf{Y}):

U=(121212001212i21200i212121200120001212012i21200i200012120).U = \begin{pmatrix} \frac{1}{2} & \frac{1}{2} & \frac{1}{2} & 0 & 0 & \frac{1}{2} \\[2mm] \frac{1}{2} & \frac{i}{2} & -\frac{1}{2} & 0 & 0 & -\frac{i}{2} \\[2mm] \frac{1}{2} & -\frac{1}{2} & \frac{1}{2} & 0 & 0 & -\frac{1}{2} \\[2mm] 0 & 0 & 0 & \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0\\[2mm] \frac{1}{2} & -\frac{i}{2} & -\frac{1}{2} & 0 & 0 & \frac{i}{2} \\[2mm] 0 & 0 & 0 & -\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0 \end{pmatrix}.

This unitary matrix isn't special, it's just an example. To check that UU is unitary, it suffices to compute and check that UU=I,U^{\dagger} U = \mathbb{I}, for instance. Alternatively, we can check that the rows (or the columns) are orthonormal, which is made simpler in this case given the particular form of the matrix U.U.

The action of UU on the standard basis vector 1,1,\vert 1, 1 \rangle, for instance, is

U1,1=121,0+i21,1122,0i23,0,U \vert 1, 1\rangle = \frac{1}{2} \vert 1, 0 \rangle + \frac{i}{2} \vert 1, 1 \rangle - \frac{1}{2} \vert 2, 0 \rangle - \frac{i}{2} \vert 3, 0\rangle,

which we can see by examining the second column of U,U, considering our ordering of the set {1,2,3}×{0,1}.\{1,2,3\}\times\{0,1\}.

As with any matrix, it is possible to express UU using Dirac notation, which would require 20 terms for the 20 nonzero entries of U.U. If we did write down all of these terms, however, rather than writing a 6×66\times 6 matrix, it would be messy and the patterns that are evident from the matrix expression would not likely be as clear. Simply put, Dirac notation is not always the best choice.

Unitary operations on three or more systems work in a similar way, with the unitary matrices having rows and columns corresponding to the Cartesian product of the classical state sets of the systems. We've already seen one example in this lesson: the three-qubit operation

k=07(k+1)mod8k,\sum_{k = 0}^{7} \vert (k+1) \bmod 8 \rangle \langle k \vert,

where numbers in bras and kets mean their 33-bit binary encodings. In addition to being a deterministic operation, this is also a unitary operation. Operations that are both deterministic and unitary are called reversible operations. The conjugate transpose of this matrix can be written like this:

k=07k(k+1)mod8=k=07(k1)mod8k.\sum_{k = 0}^{7} \vert k \rangle \langle (k+1) \bmod 8 \vert = \sum_{k = 0}^{7} \vert (k-1) \bmod 8 \rangle \langle k \vert.

This represents the reverse, or in mathematical terms the inverse, of the original operation — which is what we expect from the conjugate transpose of a unitary matrix. We'll see other examples of unitary operations on multiple systems as the lesson continues.

Unitary operations performed independently on individual systems

When unitary operations are performed independently on a collection of individual systems, the combined action of these independent operations is described by the tensor product of the unitary matrices that represent them. That is, if X0,,Xn1\mathsf{X}_{0},\ldots,\mathsf{X}_{n-1} are quantum systems, U0,,Un1U_0,\ldots, U_{n-1} are unitary matrices representing operations on these systems, and the operations are performed independently on the systems, the combined action on (Xn1,,X0)(\mathsf{X}_{n-1},\ldots,\mathsf{X}_0) is represented by the matrix Un1U0.U_{n-1}\otimes\cdots\otimes U_0. Once again, we find that the probabilistic and quantum settings are analogous in this regard.

One would naturally expect, from reading the previous paragraph, that the tensor product of any collection of unitary matrices is unitary. Indeed this is true, and we can verify it as follows.

Notice first that the conjugate transpose operation satisfies

(Mn1M0)=Mn1M0 (M_{n-1} \otimes \cdots \otimes M_0)^{\dagger} = M_{n-1}^{\dagger} \otimes \cdots \otimes M_0^{\dagger}

for any chosen matrices M0,,Mn1.M_0,\ldots,M_{n-1}. This can be checked by going back to the definition of the tensor product and of the conjugate transpose, and checking that each entry of the two sides of the equation are in agreement. This means that

(Un1U0)(Un1U0)=(Un1U0)(Un1U0). (U_{n-1} \otimes \cdots \otimes U_0)^{\dagger} (U_{n-1}\otimes\cdots\otimes U_0) = (U_{n-1}^{\dagger} \otimes \cdots \otimes U_0^{\dagger}) (U_{n-1}\otimes\cdots\otimes U_0).

Because the tensor product of matrices is multiplicative, we find that

(Un1U0)(Un1U0)=(Un1Un1)(U0U0)=In1I0. (U_{n-1}^{\dagger} \otimes \cdots \otimes U_0^{\dagger}) (U_{n-1}\otimes\cdots\otimes U_0) = (U_{n-1}^{\dagger} U_{n-1}) \otimes \cdots \otimes (U_0^{\dagger} U_0) = \mathbb{I}_{n-1} \otimes \cdots \otimes \mathbb{I}_0.

Here we have written I0,,In1\mathbb{I}_0,\ldots,\mathbb{I}_{n-1} to refer to the matrices representing the identity operation on the systems X0,,Xn1,\mathsf{X}_0,\ldots,\mathsf{X}_{n-1}, which is to say that these are identity matrices whose sizes agree with the number of classical states of X0,,Xn1.\mathsf{X}_0,\ldots,\mathsf{X}_{n-1}.

Finally, the tensor product In1I0\mathbb{I}_{n-1} \otimes \cdots \otimes \mathbb{I}_0 is equal to the identity matrix for which we have a number of rows and columns that agrees with the product of the number of rows and columns of the matrices In1,,I0.\mathbb{I}_{n-1},\ldots,\mathbb{I}_0. This larger identity matrix represents the identity operation on the joint system (Xn1,,X0).(\mathsf{X}_{n-1},\ldots,\mathsf{X}_0).

In summary, we have the following sequence of equalities:

(Un1U0)(Un1U0)=(Un1U0)(Un1U0)=(Un1Un1)(U0U0)=In1I0=I.\begin{aligned} & (U_{n-1} \otimes \cdots \otimes U_0)^{\dagger} (U_{n-1}\otimes\cdots\otimes U_0) \\ & \quad = (U_{n-1}^{\dagger} \otimes \cdots \otimes U_0^{\dagger}) (U_{n-1}\otimes\cdots\otimes U_0) \\ & \quad = (U_{n-1}^{\dagger} U_{n-1}) \otimes \cdots \otimes (U_0^{\dagger} U_0)\\ & \quad = \mathbb{I}_{n-1} \otimes \cdots \otimes \mathbb{I}_0\\ & \quad = \mathbb{I}. \end{aligned}

We therefore conclude that Un1U0U_{n-1} \otimes \cdots \otimes U_0 is unitary.

An important situation that often arises is one in which a unitary operation is applied to just one system — or a proper subset of systems — within a larger joint system. For instance, suppose that X\mathsf{X} and Y\mathsf{Y} are systems that we can view together as forming a single, compound system (X,Y),(\mathsf{X},\mathsf{Y}), and we perform an operation just on the system X.\mathsf{X}. To be precise, let us suppose that UU is a unitary matrix representing an operation on X,\mathsf{X}, so that its rows and columns have been placed in correspondence with the classical states of X.\mathsf{X}.

To say that we perform the operation represented by UU just on the system X\mathsf{X} implies that we do nothing to Y,\mathsf{Y}, meaning that we independently perform UU on X\mathsf{X} and the identity operation on Y.\mathsf{Y}. That is, "doing nothing" to Y\mathsf{Y} is equivalent to performing the identity operation on Y,\mathsf{Y}, which is represented by the identity matrix IY.\mathbb{I}_\mathsf{Y}. (Here, by the way, the subscript Y\mathsf{Y} tells us that IY\mathbb{I}_\mathsf{Y} refers to the identity matrix having a number of rows and columns in agreement with the classical state set of Y.\mathsf{Y}.) The operation on (X,Y)(\mathsf{X},\mathsf{Y}) that is obtained when we perform UU on X\mathsf{X} and do nothing to Y\mathsf{Y} is therefore represented by the unitary matrix

UIY. U \otimes \mathbb{I}_{\mathsf{Y}}.

For example, if X\mathsf{X} and Y\mathsf{Y} are qubits, performing a Hadamard operation on X\mathsf{X} and doing nothing to Y\mathsf{Y} is equivalent to performing the operation

HIY=(12121212)(1001)=(120120012012120120012012) H \otimes \mathbb{I}_{\mathsf{Y}} = \begin{pmatrix} \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}}\\[2mm] \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}} \end{pmatrix} \otimes \begin{pmatrix} 1 & 0\\ 0 & 1 \end{pmatrix} = \begin{pmatrix} \frac{1}{\sqrt{2}} & 0 & \frac{1}{\sqrt{2}} & 0\\[2mm] 0 & \frac{1}{\sqrt{2}} & 0 & \frac{1}{\sqrt{2}}\\[2mm] \frac{1}{\sqrt{2}} & 0 & -\frac{1}{\sqrt{2}} & 0\\[2mm] 0 & \frac{1}{\sqrt{2}} & 0 & -\frac{1}{\sqrt{2}} \end{pmatrix}

on the joint system (X,Y).(\mathsf{X},\mathsf{Y}).

Along similar lines, if an operation represented by a unitary matrix UU is applied to Y\mathsf{Y} and nothing is done to X,\mathsf{X}, the resulting operation on (X,Y)(\mathsf{X},\mathsf{Y}) is represented by the unitary matrix

IXU. \mathbb{I}_{\mathsf{X}} \otimes U.

For example, if we again consider the situation in which both X\mathsf{X} and Y\mathsf{Y} are qubits and UU is a Hadamard operation, the resulting operation on (X,Y)(\mathsf{X},\mathsf{Y}) is represented by the matrix

(1001)(12121212)=(121200121200001212001212). \begin{pmatrix} 1 & 0\\ 0 & 1 \end{pmatrix} \otimes \begin{pmatrix} \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}}\\[2mm] \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}} \end{pmatrix} = \begin{pmatrix} \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & 0 & 0\\[2mm] \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}} & 0 & 0\\[2mm] 0 & 0 & \frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}}\\[2mm] 0 & 0 & \frac{1}{\sqrt{2}} & -\frac{1}{\sqrt{2}} \end{pmatrix}.

Not every unitary operation on a collection of systems can be written as a tensor product of unitary operations like this, just as not every quantum state vector of these systems is a product state. For example, neither the swap operation nor the controlled-NOT operation on two qubits, which are described below, can be expressed as a tensor product of unitary operations.

The swap operation

To conclude the lesson, let's take a look at two classes of examples of unitary operations on multiple systems, beginning with the swap operation.

Suppose that X\mathsf{X} and Y\mathsf{Y} are systems that share the same classical state set Σ.\Sigma. The swap operation on the pair (X,Y)(\mathsf{X},\mathsf{Y}) is the operation that exchanges the contents of the two systems, but otherwise leaves the systems alone — so that X\mathsf{X} remains on the left and Y\mathsf{Y} remains on the right. We'll denote this operation as SWAP,\operatorname{SWAP}, and it operates like this for every choice of classical states a,bΣ:a,b\in\Sigma:

SWAPab=ba.\operatorname{SWAP} \vert a \rangle \vert b \rangle = \vert b \rangle \vert a \rangle.

One way to write the matrix associated with this operation using the Dirac notation is as follows:

SWAP=c,dΣcddc.\mathrm{SWAP} = \sum_{c,d\in\Sigma} \vert c \rangle \langle d \vert \otimes \vert d \rangle \langle c \vert.

It may not be immediately clear that this matrix represents SWAP,\operatorname{SWAP}, but we can check it satisfies the condition SWAPab=ba\operatorname{SWAP} \vert a \rangle \vert b \rangle = \vert b \rangle \vert a \rangle for every choice of classical states a,bΣ.a,b\in\Sigma. As a simple example, when X\mathsf{X} and Y\mathsf{Y} are qubits, we find that

SWAP=(1000001001000001). \operatorname{SWAP} = \begin{pmatrix} 1 & 0 & 0 & 0\\ 0 & 0 & 1 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & 0 & 1 \end{pmatrix}.

Controlled-unitary operations

Now let us suppose that Q\mathsf{Q} is a qubit and R\mathsf{R} is an arbitrary system, having whatever classical state set we wish. For every unitary operation UU acting on the system R,\mathsf{R}, a controlled-UU operation is a unitary operation on the pair (Q,R)(\mathsf{Q},\mathsf{R}) defined as follows:

CU=00IR+11U.CU = \vert 0\rangle \langle 0\vert \otimes \mathbb{I}_{\mathsf{R}} + \vert 1\rangle \langle 1\vert \otimes U.

For example, if R\mathsf{R} is also a qubit, and we consider the Pauli XX operation on R,\mathrm{R}, then a controlled-XX operation is given by

CX=00IR+11X=(1000010000010010). CX = \vert 0\rangle \langle 0\vert \otimes \mathbb{I}_{\mathsf{R}} + \vert 1\rangle \langle 1\vert \otimes X = \begin{pmatrix} 1 & 0 & 0 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & 0 & 1\\ 0 & 0 & 1 & 0 \end{pmatrix}.

We already encountered this operation in the context of classical information and probabilistic operations earlier in the lesson. Replacing the Pauli XX operation on R\mathsf{R} with a ZZ operation gives this operation:

CZ=00IR+11Z=(1000010000100001). CZ = \vert 0\rangle \langle 0\vert \otimes \mathbb{I}_{\mathsf{R}} + \vert 1\rangle \langle 1\vert \otimes Z = \begin{pmatrix} 1 & 0 & 0 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & 1 & 0\\ 0 & 0 & 0 & -1 \end{pmatrix}.

If instead we take R\mathsf{R} to be two qubits, and we take UU to be the swap operation between these two qubits, we obtain this operation:

CSWAP=(1000000001000000001000000001000000001000000000100000010000000001). \operatorname{CSWAP} = \begin{pmatrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 \end{pmatrix}.

This operation is also known as a Fredkin operation, or more commonly, a Fredkin gate. Its action on standard basis states can be described as follows:

CSWAP0bc=0bcCSWAP1bc=1cb \begin{aligned} \operatorname{CSWAP} \vert 0 b c \rangle & = \vert 0 b c \rangle \\[1mm] \operatorname{CSWAP} \vert 1 b c \rangle & = \vert 1 c b \rangle \end{aligned}

Finally, a controlled-controlled-NOT operation, which we may denote as CCX,CCX, is called a Toffoli operation or Toffoli gate. Its matrix representation looks like this:

CCX=(1000000001000000001000000001000000001000000001000000000100000010). CCX = \begin{pmatrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0\\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1\\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 \end{pmatrix}.

We may alternatively express it using the Dirac notation as follows:

CCX=(0000+0101+1010)I+1111X. CCX = \bigl( \vert 00 \rangle \langle 00 \vert + \vert 01 \rangle \langle 01 \vert + \vert 10 \rangle \langle 10 \vert \bigr) \otimes \mathbb{I} + \vert 11 \rangle \langle 11 \vert \otimes X.
Was this page helpful?
Report a bug or request content on GitHub.