by Jim Baggott
There are different kinds of radioactivity. One kind, which was called beta-radioactivity by New Zealand physicist Ernest Rutherford in 1899, involves the transformation of a neutron in a nucleus into a proton, accompanied by the ejection of a high-speed electron (also known as a ‘beta particle’). This is a natural form of alchemy: changing the number of protons in the nucleus changes its chemical identity. In fact, caesium-137 decays by emission of a high-speed electron to produce barium-137, which contains 56 protons and 81 neutrons. This implies that the neutron is an unstable, composite particle, and so not really elementary at all. Left to its own devices, an isolated neutron will decay spontaneously in about 15 minutes.
The origin of beta-radioactivity was a bit of a puzzle. What was an electron supposed to be doing inside the nucleus? But it posed an even bigger problem. It could be expected that in this process of radioactive decay, energy and momentum should be conserved, just as they are conserved in every chemical and physical change that has ever been studied. But a careful accounting of the energies and momenta involved in beta-radioactivity showed that the numbers just didn’t add up. The theoretical energy and momentum released by the transformation of a neutron inside the nucleus could not all be accounted for by the energy and momentum of the ejected electron.
In 1930, Wolfgang Pauli reluctantly suggested that the energy and momentum ‘missing’ in the reaction were being carried away by an as yet unobserved, light, electrically neutral particle which interacts with virtually nothing. It came to be called a neutrino (Italian for ‘small neutral one’). At the time it was judged that it would be impossible to detect, but it was first discovered experimentally in 1956.
The strong nuclear force was judged to be about a hundred times stronger than electromagnetism. But the force governing beta-radioactive decay was found to be much weaker, about ten billionths of the strength of the electromagnetic force. It was also clear that this weak force acted on nuclear particles — protons and neutrons — although electrons, too, were somehow involved. There was no choice but to conclude that this was evidence for yet another force of nature, which became known as the weak nuclear force.
Instead of two fundamental forces there were now four: the strong and weak nuclear forces, electromagnetism and gravity. Constructing a quantum theory that could accommodate three of these forces (the exception being gravity) took another forty years. It is called the standard model of particle physics, and it is one of the most successful theories of physics ever devised.
Spinning electrons and anti-matter
Two versions of quantum theory were developed in the 1920s and were applied principally to the study of the properties and behaviour of electrons. These were matrix mechanics, devised in 1925 by Heisenberg, and wave mechanics, devised in late 1925/early 1926 by Austrian physicist Erwin Schrödinger. Although these were rival theories, Schrödinger himself demonstrated that they are, in fact, equivalent; one and the same theory expressed in two different mathematical ‘languages’.
These early quantum theories were further developed and elaborated, and cast into other mathematical forms that have since proved to be useful (or, at least, they avoid some of the metaphysical baggage that the early theories tended to carry). Quantum theory is still very much a theory of contemporary physics.
But the equations of quantum theory describe individual quantum particles or systems. They describe how the energies associated with various motions of the particles or systems are ‘quantized’, able to admit or remove energy only in discrete lumps, or quanta. Adding quanta therefore involves increasing the energy, promoting the particles or systems to higher-energy quantum states. Taking quanta away likewise involves removing energy, often in the form of emitted photons, demoting the particles or systems to lower-energy states.
To describe correctly such ‘dynamics’ of quantum systems, the theory must in principle conform to the stringent requirements of Einstein’s special theory of relativity. This ensures that the laws of nature that we can observe or measure are guaranteed to be the same independently of how fast the observer or measuring device might happen to be moving in relation to the object under study.* It also ensures that the speed of light retains its privileged status as an ultimate speed that cannot be exceeded.*
Einstein himself identified one important consequence of the special theory. This was a deep connection between energy and mass reflected in the world’s most famous scientific equation, E = mc2, or energy equals mass multiplied by the speed of light squared. This is interpreted to mean that mass is a form of energy, and from energy can spring mass.
Now this might already cause us to pause for a moment’s quiet reflection on what it may mean for our interpretation of mass. I confess that, as a young student, the notion that mass represents a vast reservoir of energy somehow made it seem even more tangible, and ‘solid’. It didn’t shake my naïve conviction that mass must surely be an intrinsic property or primary quality of material substance.
In essence, producing a so-called ‘relativistic’ theory — one that meets the requirements of the special theory of relativity — is all about ensuring that the theory treats time as a kind of fourth dimension, on an equal footing with the three dimensions of space. In late 1927, Paul Dirac extended the early version of quantum theory and made it conform to the special theory of relativity.
The resulting theory predicted that the electron should possess a spin quantum number s equal to ½ and two different spin orientations, labelled spin-up and spin-down.2 This was something of a revelation. That electrons possess an intrinsic spin angular momentum had been shown experimentally some years previously, but neither matrix nor wave mechanics had predicted this behaviour.
Dirac was a mathematician first, a physicist second. In devising his theoretical equation he had followed his mathematical instincts, and when these seemed to suggest some relatively unphysical conclusions, he had nevertheless stood firm.3 The theory produced twice as many solutions as he thought he had needed. Two of these solutions correspond to the spin-up and spin-down orientations of the electron, each carrying positive energy. But there were another two solutions of ‘negative’ energy. Although these solutions were hard to understand, they couldn’t be ignored.
In 1931, Dirac finally conceded that the two negative-energy solutions actually correspond to positive-energy spin-up and spin-down orientations of a positively charged electron. He had discovered the existence of antimatter, a previously unsuspected form of material substance. There had been no hints of antimatter in any experiment performed to that time and no good reason to suspect that it existed. It was completely ‘off the wall’.
Yet within a year, the positive electron had been discovered in experiments on cosmic rays. It was named the positron. Dirac’s mathematical instincts were proved right.
Quantum fields and second quantization
Dirac’s theory is actually a kind of quantum field theory. A field in physics is defined in terms of the magnitude of some physical property distributed over every point in space and time. Sprinkle iron filings on a sheet of paper held above a bar magnet. The iron filings organize themselves along the ‘lines of force’ of the magnetic field, reflecting the strength of the field and its direction, stretching from north to south poles. The field exists in the ‘empty’ space around the outside of the bar of magnetic material.
The nature of the field depends on the nature of the property being measured. The field may be scalar, meaning that it has magnitude but acts in no particular direction in space — it points but it doesn’t push or pull. It may be vector, with both magnitude and direction, like a magnetic field or a Newtonian gravitational field. Finally, it may be tensor, a more complicated version of a vector field for situations in which the geometry of the field requires more parameters than would be required for three ‘Cartesian’ x, y and z directions.
The field described by Dirac’s theory is in fact none of the above. It is a spinor field. Spinors were discovered by mathem
atician Elié Cartan in 1913. They are vectors, but not in the sense of vectors in ‘ordinary’ space, and they cannot be constructed from ordinary vectors. They are, of course, spin vectors, related to the spin orientations of the electron. To make his theory conform to the requirements of special relativity, Dirac needed a spinor field with four components — two for the electron and two for the positron.
It might be best to think of the field used in Dirac’s theory as an ‘electron field’, the quantum field for which the corresponding quantum particle is the electron. The particle is, in essence, a fundamental field quantum, a basic fluctuation or disturbance of the field.
In contrast to conventional quantum theory, when we add or remove quanta in a quantum field theory, we’re adding or removing particles to or from the field itself. We add energy to the field in the form of particles, rather than adding it to individual quantum particles. This is sometimes known as ‘second quantization’.
The strange theory of light and matter
Dirac’s theory represented a major advance, but as a field theory it seemed rather obscure. It wasn’t obvious from the structure of his equation that it met the demands of special relativity (though it did); neither was it clear how the theory should be related to much better understood classical field theories, such as Maxwell’s theory of electromagnetism.
The attentions of some physicists were drawn instead to an alternative approach. Why not start with a well-understood classical wave field, and then find a way to impose quantum conditions on it? The obvious place to start was the field described by Maxwell’s equations. If it could be done, the resulting quantum field theory would describe the interactions between an electron field (whose quanta are electrons) and the electromagnetic field (whose quanta are photons).
It began to dawn on physicists working on early versions of quantum field theory that they had figured out a very different way to understand how forces between particles actually work. Let’s imagine that two electrons are ‘bounced’ off each other (the technical term is ‘scattered’). We can suppose that as the two electrons approach each other in space and in time, they feel the mutually repulsive force generated by their negative charges.
But how? We might speculate that each moving electron generates an electromagnetic field and the mutual repulsion is felt in the space where these two fields overlap, much like we feel the repulsion between the north poles of two bar magnets in the space between them as we try to push them together. But in the quantum domain, fields are also associated with particles, and interacting fields with interacting particles. In 1932, German physicist Hans Bethe and Italian Enrico Fermi suggested that this experience of force is the result of the exchange of a photon between the two electrons.
As the two electrons come closer together, they reach some critical distance and exchange a photon. In this way the particles experience the electromagnetic force. The exchanged photon carries momentum from one electron to the other, thereby changing the momentum of both. The result is recoil, with both electrons changing speed and direction and moving apart.
The exchanged photon is a ‘virtual’ photon, because it is transmitted directly between the two electrons and we don’t actually see it pass from one to the other. In fact, there’s no telling in which direction the photon actually goes. In diagrams drawn to represent the interaction, the passage of a virtual photon is simply illustrated using a squiggly line, with no direction indicated between the electrons.
Here was another revelation. The photon was no longer simply the quantum particle of light. It had become the ‘carrier’ of the electromagnetic force.
Heisenberg and Pauli had tried to develop a formal quantum field theory of electromagnetism a few years before, in 1929. But this theory was plagued with problems. The worst of these was associated with the ‘self-energy’ of the electron. When an electric charge moves through space, it generates an electromagnetic field. The ‘self-energy’ of the electron results when an electron interacts with its own self-generated electromagnetic field. This interaction caused the equations to ‘blow up’, producing physically unrealistic results. Some terms in the equation mushroomed to infinity.
A solution to this problem would be found only in 1947, when physicists were able to return to academic science after spending the war years working on the world’s first atomic weapons.
Suppose the self-energy of the electron appears (using E = mc2) as an additional contribution to the electron’s mass. The mass that we observe in experiments would then include this additional contribution. It would be equal to an intrinsic or ‘bare’ mass plus an ‘electromagnetic’ mass arising from the electron’s interaction with its own electromagnetic field.
The ‘bare’ mass is a purely theoretical quantity. It is the mass that the electron would possess if it could ever be isolated from its own electromagnetic field. The mass that we have to deal with is the observed mass, so the theory has to be rewritten in terms of this. In other words, the theory has to be ‘renormalized’.
Suppose further that we want to apply quantum field theory to the calculation of the energies of the quantum states of an electron in a hydrogen atom. The theory predicts an infinite self-energy associated with the electron interacting with its own electromagnetic field. We identify this as an infinite contribution to the electron mass. But the equations for a freely moving electron also contain the same infinite mass contribution. So, what if we now subtract the expression for a free electron from the expression for the electron in a hydrogen atom? Would the two infinite terms cancel to give a finite answer?
It sounds as though subtracting infinity from infinity should lead only to nonsense, but it worked. A formal theory of quantum electrodynamics (QED), essentially a quantum version of Maxwell’s theory, was eventually devised by rival American physicists Richard Feynman and Julian Schwinger, and independently by Japanese theorist Sin-Itiro Tomonaga. Although the approaches adopted by Schwinger and Tomonaga were similar, Feynman’s was distinctly different, relying on pictorial representations of the different kinds of possible interactions which came to be called ‘Feynman diagrams’. English physicist Freeman Dyson subsequently demonstrated that their different approaches were entirely equivalent.
Not everyone was comfortable with the apparent sleight of hand involved in renormalization. Dyson asked Dirac: ‘Well, Professor Dirac, what do you think of these new developments in quantum electrodynamics?’ Dirac, the mathematical purist, was not enamoured: ‘I might have thought that the new ideas were correct if they had not been so ugly.’4
Ugly or not, there was no denying the power of the resulting theory. The g-factor for the electron, a physical constant governing the strength of the interaction of an electron with an external magnetic field, is predicted by QED to have the value 2.00231930476. The comparable experimental value is 2.00231930482. ‘This accuracy’, wrote Feynman, ‘is equivalent to measuring the distance from Los Angeles to New York, a distance of over 3,000 miles, to within the width of a human hair.’5
The particle zoo
Dirac once speculated that it might one day be possible to describe all matter in terms of just one kind of elementary particle, some kind of ultimate quantum of ‘stuff’. Alas, as experimental physicists set to work with cosmic rays and early particle accelerators in the 1930s and 40s, Dirac’s dream was shattered. Far from discovering an underlying simplicity in nature, they discovered instead a bewildering complexity.
American physicist Carl Anderson had discovered the positron in 1932. Four years later he discovered another particle, a heavier version of the electron, with a mass about 200 times that of an ordinary electron. This simply did not fit with any preconceptions of how the elementary building blocks of nature should be organized. Galician-born American physicist Isidor Rabi demanded to know: ‘Who ordered that?’6 The new particle was given several names, but is today called the muon.
In 1947, another new particle was discovered in cosmic rays by British physicist Cecil Powell
. This was found to have a slightly larger mass than the muon; 273 times that of the electron. It came in positive, negative and, subsequently, neutral varieties. This was called the pion. As techniques for detecting particles became more sophisticated, the floodgates opened. The pion was quickly followed by the positive and negative kaon and the neutral lambda particle.
What was going on? One physicist expressed the prevailing sense of frustration: ‘… the finder of a new elementary particle used to be rewarded by a Nobel Prize, but such a discovery now ought to be punished by a $10,000 fine’.7 Names proliferated. Responding to a question from one young physicist, Fermi remarked: ‘Young man, if I could remember the names of these particles, I would have been a botanist.’8
In an attempt to give some sense of order to this ‘zoo’ of particles, physicists introduced a new taxonomy. They defined two principal classes: hadrons (from the Greek hadros, meaning thick or heavy) and leptons (from the Greek leptos, meaning small).
The class of hadrons includes a subclass of baryons (from the Greek barys, also meaning heavy). These are heavier particles which experience the strong nuclear force. The proton and neutron are baryons. It also includes the subclass of mesons (from the Greek mésos, meaning ‘middle’). These particles experience the strong force but are of intermediate mass. Examples include pions and kaons.
The class of leptons includes the electron, muon and neutrino. These are light particles which do not experience the strong nuclear force.
The baryons and the leptons are matter particles. They are also all fermions, characterized by half-integral spins, which obey Pauli’s exclusion principle.