*[This post was originally “Part 0”, but it’s been moved. Other parts in this series: 1,2,3, 4,5,6,7.]*

In an ideal world, the formalism that you use to describe a physical system is in a one-to-one correspondence with the physically distinct configurations of the system. But sometimes it can be useful to introduce additional descriptions, in which case it is very important to understand the unphysical over-counting (e.g., gauge freedom). A scalar potential is a very convenient way of representing the vector force field, , but any constant shift in the potential, , yields forces and dynamics that are indistinguishable, and hence the value of the potential on an absolute scale is unphysical.

One often hears that a quantum experiment measures an *observable*, but this is wrong, or very misleading, because it vastly over-counts the physically distinct sorts of measurements that are possible. It is much more precise to say that a given apparatus, with a given setting, simultaneously measures all observables with the same eigenvectors. More compactly, **an apparatus measures an orthogonal basis – not an observable**.

^{ a }You can probably start to see this by just noting that there’s no actual, physical difference between measuring and ; the apparatus that would perform the two measurements are identical.

In the rest of this post, I’ll lay things out very explicitly. I’m going to show how simply acknowledging that a measurement is carried out by a physical apparatus is enough to infer

- that the set of possible eigenstate outcomes (the basis) is all that physically matters,
- that the basis must be orthogonal, and consequently that,
- it’s just as sensible to talk about the measurement of non-Hermitian
*normal*operators as traditional observables (Hermitian operators).

I’ll mostly be following Zurek^{ b }, who first pointed out (2). For simplicity we’ll assume a finite-dimensional Hilbert space. None of this requires you to adopt a many-worlds interpretation or anything; feel free to just stick with Copenhagen and pull the Heisenberg cut up a bit higher so the apparatus is contained within the quantum description.

#### Toy measurement model

Consider what a physical measuring apparatus actually does when it measures a system . From some “ready” state , initially unentangled with the system, the apparatus interacts unitarily such that different possible states of the system are recorded in distinct conditional out-states of the apparatus. These out-states will correspond, at the least, to different macroscopic configurations of the apparatus’s readout system (the “pointer”), e.g., the macroscopic arrangements of atoms in a screen interpreted as “up” rather than “down”.

Let us first assume the apparatus can make a non-disturbing measurement. Then for each , the unitary describing the measurement process must act in this manner:

(1)

A defining characteristic of unitaries is that they preserve the inner product between vectors, so

(2)

Since , the requirement that the measuring device evolves into distinct states, , for different outcomes immediately implies that , i.e., that the set of system states being distinguished must be orthogonal.

Now, let’s relax the assumption that the measurement is non-disturbing. Instead, we will appeal to the key characteristic of a measuring apparatus — that it must *amplify*. More precisely, the apparatus must contain many parts in which the outcome is recorded distinctly. For simplicity, let us simply define for to be the minimal degrees of freedom which are put into a distinct state conditional on the outcome of the measurement, and let be the (messy) rest of the apparatus, which will generally become entangled with the system. Then for each we must have

(3)

where is an arbitrary, possibly entangled joint state of . We again have not assumed a priori that the or the are orthogonal, just that they are distinct states. Nonetheless, unitary evolution preserves the inner product between states, so

(4)

Then, regardless of the value of , we must have that unless for almost all . Since a functioning amplifier must produce many distinct copies (records) of the amplified information, we conclude that the system states we are distinguishing, , are orthogonal.

Note that we have not lost the generality of our argument by assuming that the various components of the apparatus end up in pure states, unentangled with the rest of the apparatus and system. Our only requirement is that, for something to be a proper amplifier, one can choose *some* tensor structure in which this is so, and that’s always possible even if the natural, intuitive parts of the system in which the copies of the information are stored (e.g., the atoms of the macroscopic pointer readout) are in mixed states (so long as the mixed states are distinct). See Zurek for details.

#### Implications

So it’s clear from what a measuring apparatus actually *does* that there is no physical difference between measuring two observables with the same eigenvectors, for the same reason that, even classically, there’s no physical difference between measuring in centimeters and inches; it’s just the labeling on your ruler. The only thing that is meaningful is the orthogonal basis defining the measurement process. All that talking about observables adds to this is *naming* the eigenstates.^{ c }

In fact, it makes as much sense to measure a normal operator as a Hermitian one. Recall that normal and Hermitian operators are defined by the conditions and , respectively. (Obviously, Hermitian operators are a subset of normal operators.) Equivalently, we can say that normal operators are defined by the fact that they are non-singular and have orthogonal eigen*vectors*, while Hermitian operators must additionally have real eigen*values*. It’s perfectly sensible to say, when we are determining the amplitude and phase of a macroscopic electromagnetic field, that we are measuring a single normal operator whose eigenvalues are complex. And if we wanted to be ornery, we could point out that there’s really nothing objectionable about measuring an operator

(5)

where and are elements of some (possibly finite) field which is neither the reals nor the complex numbers. In all these cases, the only thing that matter is the set of states .

(Note that there are still plenty of places in quantum mechanics where the Hermiticity of an operator is critical, such as the Hamiltonian. But then the meaningfulness of the reality of the eigenvalues is connected to the fact that the Hamiltonian is not just something that can be measured, but is used to generate time translation, in which case the eigenvalues are “doing work”.)

#### Blame

Why are the above simple observations not known by undergraduates, or even by professors? I tentatively blame the axiomatic approaches to quantum mechanics as put forth by the titans like Dirac and von Neumann, or at least their typical presentation to other physicist. In particular, when you take

- The expectation value of an observable for a system in a state is given by .

as an irreducible axiom of *the universe*, you obscure a great deal. This seems to be grounded in early formulations of Copenhagen, where the measurement operation was a definitive event, linking the quantum description with observed classical variables at a time and place. (This is to be contrasted with modern Copenhagen approaches where arbitrarily large objects can in principle be given quantum descriptions and the Heisenberg cut is fluid…as long as it is placed *somewhere*.^{ d })

Of course, it’s clear that von Neumann made deep, deep insights about the *completeness* of the quantum description and the problems with hidden variables^{ e }, and that this was achieved by linking what could actually be discovered about a system to complete sets of observables (maximal sets of commuting Hermitian operator). Nonetheless, there is a danger in taking these mathematical objects too seriously, and not taking seriously enough the fundamentally quantum nature of an apparatus.

### Footnotes

(↵ returns to text)

- We can also allow for the measured observable to be degenerate, in which case the apparatus simultaneously measures all observables with the same degenerate eigenspaces. To be abstract, you could say it measures a commuting subalgebra, with the nondegenerate case corresponding to the subalgebra having maximum dimensionality (i.e., the same number of dimensions as the Hilbert space). Commuting subalgebras with maximum dimension are in one-to-one correspondence with orthonormal bases, modulo multiplying the vectors by pure phases.↵
- Wojciech H. Zurek, Phys. Rev. A
**76**, 052110 (2007), [arXiv:quant-ph/0703160]; Phys. Rev. A**87**, 052111 (2013) [arXiv:1212.3245].↵ - Of course there is a physical difference between measuring and , since the latter would imply an apparatus that moves into the exact same conditional out-state if the system starts in an either eigenstate or .↵
- Heisenberg: “The dividing line between the system to be observed and the measuring apparatus is immediately defined by the nature of the problem but it obviously signifies no discontinuity of the physical process. For this reason there must, within limits, exist complete freedom in choosing the position of the dividing line.” See Schlosshauer and Camilleri (2011).↵
- This work strongly contributed to Bell’s theorem. There is disagreement as to whether von Neumann’s proof against hidden variables was foolish or whether von Neumann understood the limitations of his conclusions.↵

Your email address will not be published. Required fields are marked with a *.