Quantum channel basics
In mathematical terms, channels are linear mappings from density matrices to density matrices that satisfy certain requirements. Throughout this lesson we'll use uppercase Greek letters, including and as well as some other letters in specific cases, to refer to channels.
Every channel has an input system and an output system, and we'll typically use the name to refer to the input system and to refer to the output system. It's common that the output system of a channel is the same as the input system, and in this case we can use the same letter to refer to both.
Channels are linear mappings
Channels are described by linear mappings, just like probabilistic operations in the standard formulation of classical information and unitary operations in the simplified formulation of quantum information.
If a channel is performed on an input system whose state is described by a density matrix then the output system of the channel is described by the density matrix In the situation in which the output system of is also we can simply view that the channel represents a change in the state of from to When the output system of is a different system, rather than it should be understood that is a new system that is created by the process of applying the channel, and that the input system, is no longer available once the channel is applied — as if the channel itself transformed into leaving it in the state
The assumption that channels are described by linear mappings can be viewed as being an axiom — or in other words, a basic postulate of the theory rather than something that is proved. We can, however, see the need for channels to act linearly on convex combinations of density matrix inputs in order for them to be consistent with probability theory and what we've already learned about density matrices.
To be more specific, suppose that we have a channel and we apply it to a system when it's in one of the two states represented by the density matrices and If we apply the channel to we obtain the density matrix and if we apply it to we obtain the density matrix Thus, if we randomly choose the input state of to be with probability and with probability we'll obtain the output state with probability and with probability which we represent by a weighted average of density matrices as
On the other hand, we could think about the input state of the channel as being represented by the weighted average in which case the output is It's the same state regardless of how we choose to think about it, so we must have
Whenever we have a mapping that satisfies this condition for every choice of density matrices and and scalars there's always a unique way to extend that mapping to every matrix input (i.e., not just density matrix inputs) so that it's linear.
Channels transform density matrices into density matrices
Naturally, in addition to being linear mappings, channels must also transform density matrices into density matrices. If a channel is applied to an input system while this system is in a state represented by a density matrix then we obtain a system whose state is represented by which must be a valid density matrix in order for us to interpret it as a state.
It is critically important, though, that we consider a more general situation, where a channel transforms a system into a system in the presence of an additional system to which nothing happens. That is, if we start with the pair of systems in a state described by some density matrix, and then apply just to transforming it into we must obtain a density matrix describing a state of the pair
We can describe in mathematical terms how a channel having an input system and an output system transforms a state of the pair into a state of when nothing is done to To keep things simple, we'll assume that the classical state set of is This allows us to write an arbitrary density matrix representing a state of in the following form.
On the right-hand side of this equation we have a block matrix, which we can think of as a matrix of matrices except that the inner parentheses are removed. This leaves us with an ordinary matrix that can alternatively be described using Dirac notation as we have in the middle expression. Each matrix has rows and columns corresponding to the classical states of and these matrices can be determined by a simple formula.
Note that these are not density matrices in general — it's only when they're arranged together to form that we obtain a density matrix.
The following equation describes the state of that is obtained when is applied to
Notice that, in order to evaluate this expression for a given choice of and we must understand how works as a linear mapping on non-density matrix inputs, as each generally won't be a density matrix on its own. The equation is consistent with the expression in which denotes the identity channel on the system This presumes that we've extended the notion of a tensor product to linear mappings from matrices to matrices, which is straightforward — but it isn't really essential to the lesson and won't be explained further.
Reiterating a statement made above, in order for a linear mapping to be a valid channel it must be the case that, for every choice for and every density matrix of the pair we always obtain a density matrix when is applied to In mathematical terms, the properties a mapping must possess to be a channel are that it must be trace-preserving — so that the matrix we obtain by applying the channel has trace equal to one — as well as completely positive — so that the resulting matrix is positive semidefinite. These are both important properties that can be considered and studied separately, but it isn't critical for the sake of this lesson to consider these properties in isolation.
There are, in fact, linear mappings that always output a density matrix when given a density matrix as input, but fail to map density matrices to density matrices for compound systems, so we do eliminate some linear mappings from the class of channels in this way. (The linear mapping given by matrix transposition is the simplest example.)
We have an analogous formula to one above in the case that the two systems and are swapped, so that is applied to the system on the left rather than the right.
This assumes that is a state of rather than This time the block matrix description doesn't work because the matrices don't fall into consecutive rows and columns in but it's the same underlying mathematical structure.
Any linear mapping that satisfies the requirement that it always transforms density matrices into density matrices, even when it's applied to just one part of a compound systems, represents a valid channel. So, in an abstract sense, the notion of a channel is determined by the notion of a density matrix, together with the assumption that channels act linearly. In this regard, channels are analogous to unitary operations in the simplified formulation of quantum information, which are precisely the linear mappings that always transform quantum state vectors to quantum state vectors for a given system; as well as to probabilistic operations (represented by stochastic matrices) in the standard formulation of classical information, which are precisely the linear mappings that always transform probability vectors into probability vectors.
Unitary operations as channels
Suppose is a system and is a unitary matrix representing an operation on The channel that describes this operation on density matrices is defined as follows for every density matrix representing a quantum state of
This action, where we multiply by on the left and on the right, is commonly referred to as conjugation by the matrix
This description is consistent with the fact that the density matrix that represents a given quantum state vector is In particular, if the unitary operation is performed on then the output state is represented by the vector and so the density matrix describing this state is equal to
Once we know that, as a channel, the operation has the action on pure states, we can conclude by linearity that it must work as is specified by the equation above for any density matrix
The particular channel we obtain when we take is the identity channel which we can also give a subscript (such as as we've already encountered) when we wish to indicate explicitly what system this channel acts on. Its output is always equal to its input: This might not seem like an interesting channel, but it's actually a very important one — and it's fitting that this is our first example. The identity channel is the perfect channel in some contexts, representing an ideal memory or a perfect, noiseless transmission of information from a sender to a receiver.
Every channel defined by a unitary operation in this way is indeed a valid channel: conjugation by a matrix gives us a linear map; and if is a density matrix of a system and is unitary, then the result, which we can express as
is also a density matrix. Specifically, this matrix must be positive semidefinite, for if then
for and it must have unit trace by the cyclic property of the trace.
Convex combinations of channels
Suppose we have two channels, and that share the same input system and the same output system. For any real number we could decide to apply with probability and with probability which gives us a new channel that can be written as Explicitly, the way that this channel acts on a given density matrix is specified by the following simple equation.
More generally, if we have channels and a probability vector then we can average these channels together to obtain a new channel.
This is a convex combination of channels, and we always obtain a valid channel through this process. A simple way to say this in mathematical terms is that, for a given choice of an input and output system, the set of all channels is a convex set.
As an example, we could choose to apply one of a collection of unitary operations to a certain system. We obtain what's known as a mixed unitary channel, which is a channel that can be expressed in the following form.
Mixed unitary channels for which all of the unitary operations are Pauli matrices (or tensor products of Pauli matrices) are called Pauli channels, and are commonly encountered in quantum computing.
Examples of qubit channels
Now we'll take a look at a few specific examples of channels that aren't unitary. For all of these examples, the input and output systems are both single qubits, which is to say that these are examples of qubit channels.
The qubit reset channel
This channel does something very simple: it resets a qubit to the state. As a linear mapping this channel can be expressed as follows for every qubit density matrix
Although the trace of every density matrix is equal to writing the channel in this way makes it clear that it's a linear mapping that could be applied to any matrix, not just a density matrix. As we already observed, we need to understand how channels work as linear mappings on non-density matrix inputs to describe what happens when they're applied to just one part of a compound system.
For example, suppose that and are qubits and together the pair is in the Bell state As a density matrix, this state is given by
Using Dirac notation we can alternatively express this state as follows.
By applying the qubit reset channel to and doing nothing to we obtain the following state.
It might be tempting to say that resetting has had an effect on causing it to become completely mixed — but in some sense it's actually the opposite. Before was reset, the reduced state of was the completely mixed state, and that doesn't change as a result of resetting
The completely dephasing channel
Here's an example of a qubit channel called described by its action on matrices:
In words, zeros out the off-diagonal entries of a matrix. This example can be generalized to arbitrary systems, as opposed to qubits: for whatever density matrix is input, the channel zeros out all of the off-diagonal entries and leaves the diagonal alone.
This channel is called the completely dephasing channel, and it can be thought of as representing an extreme form of the process known as decoherence — which essentially ruins quantum superpositions and turns them into classical probabilistic states.
Another way to think about this channel is that it describes a standard basis measurement on a qubit, where an input qubit is measured and then discarded, and where the output is a density matrix describing the measurement outcome. Alternatively, but equivalently, we can imagine that the measurement outcome is discarded, leaving the qubit in its post-measurement state.
Let us again consider an e-bit, and see what happens when is applied to just one of the two qubits. Specifically, we have qubits and for which is in the state and this time let's apply the channel to the second qubit. Here's the state we obtain.
Alternatively we can express this equation using block matrices.
We can also consider a qubit channel that only slightly dephases a qubit, as opposed to completely dephasing it, which is a less extreme form of decoherence than what is represented by the completely dephasing channel. In particular, suppose that is a small but nonzero real number. We can define a channel
which transforms a given qubit density matrix like this:
That is, nothing happens with probability and with probability the qubit dephases. In terms of matrices, this action can be expressed as follows, where the diagonal entries are left alone and the off-diagonal entries are multiplied by
The completely depolarizing channel
Here's another example of a qubit channel called
Here denotes the identity matrix. In words, for any density matrix input the channel outputs the completely mixed state. It doesn't get any noisier than this! This channel is called the completely depolarizing channel, and like the completely dephasing channel it can be generalized to arbitrary systems in place of qubits.
We can also consider a less extreme variant of this channel where depolarizing happens with probability similar to what we saw for the dephasing channel.