Mathematical formulations of measurements
The lesson begins with two equivalent mathematical descriptions of measurements:
- General measurements can be described by collections of matrices, one for each measurement outcome, in a way that generalizes the description of projective measurements.
- General measurements can be described as channels whose outputs are always classical states (represented by diagonal density matrices).
We'll restrict our attention to measurements having finitely many possible outcomes. Although it is possible to define measurements with infinitely many possible outcomes, they're much less typically encountered in the context of computation and information processing, and they also require some additional mathematics (namely measure theory) to be properly formalized.
Our initial focus will be on so-called destructive measurements, where the output of the measurement is a classical measurement outcome alone — with no specification of the post-measurement quantum state of whatever system was measured. Intuitively speaking, we can imagine that such a measurement destroys the quantum system itself, or that the system is immediately discarded once the measurement is made. Later in the lesson we'll broaden our view and consider non-destructive measurements, where there's both a classical measurement outcome and a post-measurement quantum state of the measured system.
Measurements as collections of matrices
Suppose is a system that is to be measured, and assume for simplicity that the classical state set of is for some positive integer so that density matrices representing quantum states of are matrices. We won't actually have much need to refer to the classical states of but it will be convenient to refer to the number of classical states of We'll also assume that the possible outcomes of the measurement are the integers for some positive integer
Note that we're just using these names to keep things simple; it's straightforward to generalize everything that follows to other finite sets of classical states and measurement outcomes, renaming them as desired.
Projective measurements
Recall that a projective measurement is described by a collection of projection matrices that sum to the identity matrix. In symbols,
describes a projective measurement of if each is an projection matrix and the following condition is met.
When such a measurement is performed on a system while it's in a state described by some quantum state vector each outcome is obtained with probability equal to We also have that the post-measurement state of is obtained by normalizing the vector but we're ignoring the post-measurement state for now.
If the state of is described by a density matrix rather than a quantum state vector then we can alternatively express the probability to obtain the outcome as
If is a pure state, then the two expressions are equal:
Here we're using the cyclic property of the trace for the second equality, and for the third equality we're using the fact that each is a projection matrix, and therefore satisfies
In general, if is a convex combination
of pure states, then the expression coincides with the average probability for the outcome owing to the fact that this expression is linear in
General measurements
A mathematical description for general measurements is obtained by relaxing the definition of projective measurements. Specifically, we allow the matrices in the collection describing the measurement to be arbitrary positive semidefinite matrices rather than projections. (Projections are always positive semidefinite; they can alternatively be defined as positive semidefinite matrices whose eigenvalues are all either 0 or 1.)
In particular, a general measurement of a system having outcomes is specified by a collection of positive semidefinite matrices whose rows and columns correspond to the classical states of and that meet the condition
If the system is measured while it is in a state described by the density matrix then each outcome appears with probability
As we must naturally demand, the vector of outcome probabilities
of a general measurement always forms a probability vector, for any choice of a density matrix The following two observations establish that this is the case.
-
Each value must be nonnegative, owing to the fact that the trace of the product of any two positive semidefinite matrices is always nonnegative:
One way to argue this fact is to use spectral decompositions of and together with the cyclic property of the trace to express the trace of the product as a sum of nonnegative real numbers, which must therefore be nonnegative.
-
The condition together with the linearity of the trace ensures that the probabilities sum to
Example 1: any projective measurement
Projections are always positive semidefinite, so every projective measurement is an example of a general measurement.
For example, a standard basis measurement of a qubit can be represented by where
Measuring a qubit in the state results in outcome probabilities as follows.
Example 2: a non-projective qubit measurement
Suppose is a qubit, and define two matrices as follows.
These are both positive semidefinite matrices: they're Hermitian, and in both cases the eigenvalues happen to be which are both positive. We also have that and therefore describes a measurement.
If the state of is described by a density matrix and we perform this measurement, then the probability of obtaining the outcome is and the probability of obtaining the outcome is For instance, if then the probabilities for the two outcomes and are as follows.
Example 3: tetrahedral measurement
Define four single-qubit quantum state vectors as follows.
These four states are sometimes known as tetrahedral states because they're vertices of a regular tetrahedron inscribed within the Bloch sphere.

The Cartesian coordinates of these four states on the Bloch sphere are
which can be verified by expressing the density matrices representations of these states as linear combinations of Pauli matrices.
These four states are perfectly spread out on the Bloch sphere, each one equidistant from the other three and with the angles between any two of them always being the same.
Now let us define a measurement of a qubit by setting as follows for each
We can verify that this is a valid measurement as follows.
- Each is evidently positive semidefinite, being a pure state divided by one-half. That is, each one is a Hermitian matrix and has one eigenvalue equal to and all other eigenvalues zero.
- The sum of these matrices is the identity matrix: The expressions of these matrices as linear combinations of Pauli matrices makes this straightforward to verify.
Measurements as channels
A second way to describe measurements in mathematical terms is as channels.
Classical information can be viewed as a special case of quantum information, insofar as we can identify probabilistic states with diagonal density matrices. So, in operational terms, we can think about measurements as being channels whose inputs are matrices describing states of whatever system is being measured and whose outputs are diagonal density matrices describing the resulting distribution of measurement outcomes.
We'll see shortly that any channel having this property can always be written in a simple, canonical form that ties directly to the description of measurements as collections of positive semidefinite matrices. Conversely, given an arbitrary measurement as a collection of matrices, there's always a valid channel having the diagonal output property that describes the given measurement as suggested in the previous paragraph. Putting these observations together, we find that the two descriptions of general measurements are equivalent.
Before proceeding further, let's be more precise about the measurement, how we're viewing it as a channel, and what assumptions we're making about it.
As before, we'll suppose that is the system to be measured, and that the possible outcomes of the measurement are the integers for some positive integer We let be the system that stores measurement outcomes, so its classical state set is and we represent the measurement as a channel named from to Our assumption is that is classical — which is to say that no matter what state we start with for the state of we obtain is represented by a diagonal density matrix.
We can express in mathematical terms that the output of is always diagonal in the following way. First define the completely dephasing channel on
This channel is analogous to the completely dephasing qubit channel from the previous lesson. As a linear mapping, it zeros out all of the off-diagonal entries of an input matrix and leaves the diagonal alone.
And now, a simple way to express that a given density matrix is diagonal is by the equation In words, zeroing out all of the off-diagonal entries of a density matrix has no effect if and only if the off-diagonal entries were all zero to begin with. The channel therefore satisfies our assumption — that is classical — if and only if
for every density matrix representing a state of
Equivalence of the formulations
Channels to matrices
Suppose that we have a channel from to with the property that
for every density matrix This may alternatively be expressed as follows.
Like all channels, we can express in Kraus form for some way of choosing Kraus matrices
This provides us with an alternative expression for the diagonal entries of
for
Thus, for these same matrices we can express the channel as follows.
This expression is consistent with our description of general measurements in terms of matrices, as we see each measurement outcome appearing with probability
Now let's observe that the two properties required of the collection of matrices to describe a general measurement are indeed satisfied. The first property is that they're all positive semidefinite matrices. One way to see this is to observe that, for every vector having entries in correspondence with the classical state of we have
The second property is that if we sum these matrices we get the identity matrix.
The last equality follows from the fact that is a channel, so its Kraus matrices must satisfy this condition.
Matrices to channels
Now let's verify that for any collection of positive semidefinite matrices satisfying the mapping defined by
is indeed a valid channel from to
One way to do this is to compute the Choi representation of this mapping.
The transpose of each is introduced for the third equality because
This allows for the expressions and to appear, which simplify to the identity matrix upon summing over and respectively.
By the assumption that are positive semidefinite, so too are In particular, transposing a Hermitian matrix results in another Hermitian matrix, and the eigenvalues of any square matrix and its transpose always agree. It follows that is positive semidefinite. Tracing out the output system (which is the system on the right) yields
and so we conclude that is a channel.
Partial measurements
Suppose that we have multiple systems that are collectively in a quantum state, and a general measurement is performed on one of the systems. This results in one of the measurement outcomes, selected at random according to probabilities determined by the measurement and the state of the system prior to the measurement. The resulting state of the remaining systems will then, in general, depend on which measurement outcome was obtained.
Let's examine how this works for a pair of systems when the system is measured. (We're naming the system on the right because we'll take to be a system representing the classical output of the measurement when we view it as a channel.) We can then easily generalize to the situation in which the systems are swapped as well as to three or more systems.
Suppose the state of prior to the measurement is described by a density matrix which we can write as follows.
In this expression we're assuming the classical states of are
We'll assume that the measurement itself is described by the collection of matrices This measurement may alternatively be described as a channel from to where is a new system having classical state set Specifically, the action of this channel can be expressed as follows.
Outcome probabilities
We're considering a measurement of the system so the probabilities with which different measurement outcomes are obtained can depend only on the reduced state of In particular, the probability for each outcome to appear can be expressed in three equivalent ways.
The first expression naturally represents the probability to obtain the outcome based on what we already know about measurements of a single system. To get the second expression we're simply using the definition
To get the third expression requires more thought — and learners are encouraged to convince themselves that it is true. Here's a hint: The equivalence between the second and third expressions does not depend on being a density matrix or on each being positive semidefinite. Try showing it first for tensor products of the form and then conclude that it must be true in general by linearity.
While the equivalence of the first and third expressions in the previous equation may not be immediate, it does make sense. Starting from a measurement on we're effectively defining a measurement of where we simply throw away and measure Like all measurements, this new measurement can be described by a collection of matrices, and it's not surprising that this measurement is described by the collection
States conditioned on measurement outcomes
If we want to determine not only the probabilities for the different outcomes but also the resulting state of conditioned on each measurement outcome, we can look to the channel description of the measurement. In particular, let's examine the state we get when we apply to and do nothing to
Note that this is a density matrix by virtue of the fact that is a channel, so each matrix is necessarily positive semidefinite.
One final step transforms this expression into one that reveals what we're looking for.
This is an example of a classical-quantum state,
like we saw in the Density matrices lesson. For each measurement outcome we have with probability
that is in the classical state and is in the state
That is, this is the density matrix we obtain by normalizing
by dividing it by its trace. (Formally speaking, the state is only defined when the probability is nonzero; when this state is irrelevant, for it refers to a discrete event that occurs with probability zero.)
Naturally, the outcome probabilities are consistent with our previous observations.
In summary, this is what happens when the measurement is performed on when is in the state
- Each outcome appears with probability
- Conditioned on obtaining the outcome the state of is then represented by the density matrix shown in the equation which is obtained by normalizing
Generalization
We can adapt this description to other situations, such as when the ordering of the systems is reversed or when there are three or more systems. Conceptually it is straightforward, although it can become cumbersome to write down the formulas.
In general, if we have systems the state of the compound system is and the measurement is performed on , the following happens.
-
Each outcome appears with probability
-
Conditioned on obtaining the outcome the state of is then represented by the following density matrix.