Pauli operations and observables
Pauli matrices play a central role in the stabilizer formalism. We'll begin the lesson with a discussion of Pauli matrices, including some of their basic algebraic properties, and we'll also discuss how Pauli matrices (and tensor products of Pauli matrices) can describe measurements.
Pauli operation basics
Here are the Pauli matrices, including the identity matrix and the three non-identity Pauli matrices.
Properties of Pauli matrices
All four of the Pauli matrices are both unitary and Hermitian. We used the names and to refer to the non-identity Pauli matrices earlier in the series, but it is conventional to instead use the capital letters and in the context of error correction. This convention was followed in the previous lesson, and we'll continue to do this for the remaining lessons.
Different non-identity Pauli matrices anti-commute with one another.
These anti-commutation relations are simple and easy to verify by performing the multiplications, but they're critically important, in the stabilizer formalism and elsewhere. As we will see, the minus signs that emerge when the ordering between two different non-identity Pauli matrices is reversed in a matrix product correspond precisely to the detection of errors in the stabilizer formalism.
We also have the multiplication rules listed here.
That is, each Pauli matrix is its own inverse (which is always true for any matrix that is both unitary and Hermitian), and multiplying two different non-identity Pauli matrices together is always times the remaining non-identity Pauli matrix. In particular, up to a phase factor, is equivalent to which explains our focus on and errors and apparent lack of interest in errors in quantum error correction; represents a bit-flip, represents a phase-flip, and so (up to a global phase factor) represents both of those errors occurring simultaneously on the same qubit.
Pauli operations on multiple qubits
The four Pauli matrices all represent operations (which could be errors) on a single qubit — and by tensoring them together we obtain operations on multiple qubits. As a point of terminology, when we refer to an n-qubit Pauli operation, we mean a tensor product of any Pauli matrices, such as the examples shown here, for which
Often, the term Pauli operation refers to a tensor product of Pauli matrices along with a phase factor, or sometimes just certain phase factors such as and There are good reasons to allow for phase factors like this from a mathematical viewpoint — but, to keep things as simple as possible, we'll use the term Pauli operation in this course to refer to a tensor product of Pauli matrices without the possibility of a phase factor different than 1.
The weight of an -qubit Pauli operation is the number of non-identity Pauli matrices in the tensor product. For instance, the first example above has weight the second has weight and the third has weight Intuitively speaking, the weight of an -qubit Pauli operation is the number of qubits on which it acts non-trivially. It's typical that quantum error correcting codes are designed so that they can detect and correct errors represented by Pauli operations so long as their weight isn't too high.
Pauli operations as generators
It's sometimes useful to consider collections of Pauli operations as generators of sets (more specifically, groups) of operations, in an algebraic sense that you may recognize if you're familiar with group theory. If you're not familiar with group theory, that's OK — it's not essential for the lesson. A familiarity with the basics of group theory is, however, strongly recommended for those interested in exploring quantum error correction in greater depth.
Suppose that are -qubit Pauli operations. When we refer to the set generated by we mean the set of all matrices that can be obtained by multiplying these matrices together, in any combination and in any order we choose, taking each one as many times as we like. The notation used to refer to this set is
For example, the set generated by the three non-identity Pauli matrices is as follows.
This can be reasoned through the multiplication rules listed earlier. There are 16 different matrices in this set, which is commonly called the Pauli group.
For a second example, if we remove we obtain half of the Pauli group.
Here's one final example (for now), where this time we have
In this case we obtain just four elements, owing to the fact that and commute:
Pauli observables
Pauli matrices, and -qubit Pauli operations more generally, are unitary, and therefore they describe unitary operations on qubits. But they're also Hermitian matrices, and for this reason they describe measurements, as will now be explained.
Hermitian matrix observables
Consider first an arbitrary Hermitian matrix When we refer to as an observable, we're associating with a certain uniquely defined projective measurement. In words, the possible outcomes are the distinct eigenvalues of and the projections that define the measurement are the ones that project onto the spaces spanned by the corresponding eigenvectors of So, the outcomes for such a measurement happen to be real numbers — but because matrices have only finitely many eigenvalues, there will only be finitely many different measurement outcomes for a given choice of
In greater detail, by the spectral theorem, it is possible to write
for distinct real number eigenvalues and projections satisfying
Such an expression of a matrix is unique up to the ordering of the eigenvalues. Another way to say this is that, if we insist that the eigenvalues are ordered in decreasing value then there's only one way to write in the form above.
Based on this expression, the measurement we associate with the observable is the projective measurement described by the projections and the eigenvalues are understood to be the measurement outcomes corresponding to these projections.
Measurements from Pauli operations
Let's see what measurements of the sort just described look like for Pauli operations, starting with the three non-identity Pauli matrices. These matrices have spectral decompositions as follows.
The measurements defined by and viewed as observables, are therefore the projective measurements defined by the following sets of projections, respectively.
In all three cases, the two possible measurement outcomes are the eigenvalues and Such measurements are commonly called -measurements, -measurements, and -measurements. We encountered these measurements in the "General measurements" lesson of "General formulation of quantum information," where they arose in the context of quantum state tomography.
Of course, a -measurement is essentially just a standard basis measurement and an measurement is a measurement with respect to the plus/minus basis of a qubit — but, as these measurements are described here, we're taking the eigenvalues and to be the actual measurement outcomes.
The same prescription can be followed for Pauli operations on qubits, though it must be stressed that there will still be just two possible outcomes for the measurements described in this way: and which are the only possible eigenvalues of Pauli operations. The two corresponding projections will therefore have rank higher than one in this case. More precisely, for every non-identity -qubit Pauli operation, the dimensional state space always splits into two subspaces of eigenvectors having equal dimension, so the two projections that define the associated measurement will both have rank
The measurement described by an -qubit Pauli operation, considered as an observable, is therefore not the same thing as a measurement with respect to an orthonormal basis of eigenvectors of that operation, nor is it the same thing as independently measuring each of the corresponding Pauli matrices independently, as observables, on qubits. Both of those alternatives would necessitate possible measurement outcomes, but here we have just the two possible outcomes and
For example, consider the 2-qubit Pauli operation as an observable. We can effectively take the tensor product of the spectral decompositions to obtain one for the tensor product.
That is, we have for
so these are the two projections that define the measurement. If, for instance, we were to measure a Bell state nondestructively using this measurement, then we would be certain to obtain the outcome and the state would be unchanged as a result of the measurement. In particular, the state would not collapse to or
Nondestructive implementation through phase estimation
For any -qubit Pauli operation, we can perform the measurement associated with that observable nondestructively using phase estimation.
Here's a circuit based on phase estimation that works for any Pauli matrix where the measurement is being performed on the top qubit. The outcomes and of the standard basis measurement in the circuit correspond to the eigenvalues and just like we usually have for phase estimation with one control qubit. (Note that the control qubit is on the bottom in this diagram, whereas in the "Phase estimation and factoring" lesson of "Fundamentals of quantum algorithms" the control qubits were drawn on the top.)
A similar method works for Pauli operations on multiple qubits. For example, the following circuit diagram illustrates a nondestructive measurement of the -qubit Pauli observable for any choice of
This approach generalizes to -qubit Pauli observables, for any in the natural way. Of course, we only need to include controlled-unitary gates for non-identity tensor factors of Pauli observables when implementing such measurements; controlled-identity gates are simply identity gates and can therefore be omitted. This means that lower weight Pauli observables require smaller circuits to be implemented through this approach.
Notice that, irrespective of these phase-estimation circuits have just a single control qubit, which is consistent with the fact that there are just two possible measurement outcomes for these measurements. Using more control qubits wouldn't reveal additional information because these measurements are already perfect using a single control qubit. (One way to see this is directly from the general procedure for phase estimation: the assumption renders any additional control qubits beyond the first pointless.)
Here's a specific example, of a nondestructive implementation of a measurement, which is relevant to the description of the 3-bit repetition code as a stabilizer code that we'll see shortly.
In this case, and for tensor products of more than two observables more generally, the circuit can be simplified.
Thus, this measurement is equivalent to nondestructively measuring the parity (or XOR) of the standard basis states of two qubits.