A Stubbornly Persistent Illusion

by Stephen Hawking

The most satisfactory situation is evidently to be found in cases where the new fundamental hypotheses are suggested by the world of experience itself. The hypothesis of the non-existence of perpetual motion as a basis for thermodynamics affords such an example of a fundamental hypothesis suggested by experience; the same thing holds for the principle of inertia of Galileo. In the same category, moreover, we find the fundamental hypotheses of the theory of relativity, which theory has led to an unexpected expansion and broadening of the field theory, and to the superseding of the foundations of classical mechanics.

The successes of the Maxwell-Lorentz theory have given great confidence in the validity of the electromagnetic equations for empty space and hence, in particular, in the statement that light travels “in space” with a certain constant speed c. Is this law of the invariability of light velocity in relation to any desired inertial system valid? If it were not, then one specific inertial system or, more accurately, one specific state of motion (of a body of reference), would be distinguished from all others. In opposition to this idea, however, stand all the mechanical and electromagnetic-optical facts of our experience.

For these reasons it was necessary to raise to the degree of a principle the validity of the law of constancy of light velocity for all inertial systems. From this, it follows that the spatial coordinates x1, x2, x3, and the time x4, must be transformed according to the “Lorentz-transformation” which is characterized by invariance of the expression

$$ds^2 = dx_1^2 + dx_2^2 + dx_3^2 - dx_4^2$$

(if the unit of time is chosen in such a manner that the speed of light c = 1).
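As an illustrative check (an editorial sketch, not part of the original argument), the invariance of x1² + x2² + x3² − x4² under a Lorentz transformation can be verified numerically. The function names boost_x and interval, the choice of a boost along x1, and the sample event are assumptions made for the example; units are such that c = 1.

```python
import math

def boost_x(event, v):
    """Lorentz boost along x1 with speed v, in units where c = 1 (so |v| < 1)."""
    x1, x2, x3, x4 = event
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    # x1 and the time x4 mix; the transverse coordinates are unchanged
    return (gamma * (x1 - v * x4), x2, x3, gamma * (x4 - v * x1))

def interval(event):
    """The quadratic expression x1^2 + x2^2 + x3^2 - x4^2 for one event."""
    x1, x2, x3, x4 = event
    return x1**2 + x2**2 + x3**2 - x4**2

event = (3.0, 1.0, -2.0, 5.0)   # hypothetical sample event
boosted = boost_x(event, 0.6)

print("before boost:", interval(event))
print("after boost: ", interval(boosted))
```

The two printed values agree up to rounding, although the individual coordinates, including the time x4, have changed. This is the sense in which time is placed on a nearly equal footing with the spatial coordinates.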

By this procedure time lost its absolute character, and was included with the “spatial” coordinates as of algebraically (nearly) similar character. The absolute character of time, and particularly of simultaneity, was destroyed, and the four-dimensional description was introduced as the only adequate one.

In order to account, also, for the equivalence of all inertial systems with regard to all the phenomena of nature, it is necessary to postulate invariance of all systems of physical equations which express general laws, with regard to the Lorentzian transformation. The elaboration of this requirement forms the content of the special theory of relativity.

This theory is compatible with the equations of Maxwell; but, it is incompatible with the basis of classical mechanics. It is true that the equations of motion of the material point can be modified (and with them the expressions for momentum and kinetic energy of the material point) in such a manner as to satisfy the theory; but, the concept of the force of interaction, and with it the concept of potential energy of a system, lose their basis, because these concepts rest upon the idea of absolute instantaneousness. The field, as determined by differential equations, takes the place of the force.

Since the foregoing theory allows interaction only by fields, it requires a field theory of gravitation. Indeed, it is not difficult to formulate such a theory in which, as in Newton’s theory, the gravitational fields can be reduced to a scalar which is the solution of a partial differential equation. However, the experimental facts expressed in Newton’s theory of gravitation lead in another direction, that of the general theory of relativity.

Classical mechanics contains one point which is unsatisfactory in that, in the fundamentals, the same mass constant is met twice over in two different roles, namely as “inertial mass” in the law of motion, and as “gravitational mass” in the law of gravitation. As a result of this, the acceleration of a body in a pure gravitational field is independent of its material; or, in a coordinate system of uniform acceleration (accelerated in relation to an “inertial system”) the motions take place as they would in a homogeneous gravitational field (in relation to a “motionless” system of coordinates). If one assumes that the equivalence of these two cases is complete, then one attains an adaptation of our theoretical thinking to the fact that the gravitational and inertial masses are identical.

From this it follows that there is no longer any reason for favoring, as a fundamental principle, the “inertial systems”; and we must admit as equivalent in their own right also non-linear transformations of the coordinates (x1, x2, x3, x4). If we make such a transformation of a system of coordinates of the special theory of relativity, then the metric

$$ds^2 = dx_1^2 + dx_2^2 + dx_3^2 - dx_4^2$$

goes over to a general (Riemannian) metric of the form

$$ds^2 = g_{\mu\nu}\, dx_\mu\, dx_\nu \quad \text{(summed over } \mu \text{ and } \nu\text{)}$$

where the gμν, symmetrical in μ and ν, are certain functions of x1 . . . x4 which describe both the metric property and the gravitational field in relation to the new system of coordinates.
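The passage from the special metric to the general one can be made explicit in a short worked step. This is a sketch under stated assumptions: the symbol η for the constant coefficients diag(1, 1, 1, −1) and the primes marking the new coordinates are notation introduced here, and indices are written as subscripts with summation over repeated indices.

```latex
ds^2 = \eta_{\alpha\beta}\, dx_\alpha\, dx_\beta ,
\qquad
dx_\alpha = \frac{\partial x_\alpha}{\partial x'_\mu}\, dx'_\mu
\quad\Longrightarrow\quad
ds^2 = g_{\mu\nu}\, dx'_\mu\, dx'_\nu ,
\qquad
g_{\mu\nu} = \eta_{\alpha\beta}\,
  \frac{\partial x_\alpha}{\partial x'_\mu}\,
  \frac{\partial x_\beta}{\partial x'_\nu}
```

Because η is symmetric, so is g. For a non-linear transformation the partial derivatives vary from point to point, so the gμν become functions of the coordinates.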

The foregoing improvement in the interpretation of the mechanical basis must, however, be paid for in that—as becomes evident on closer scrutiny—the new coordinates could no longer be interpreted, as results of measurements by rigid bodies and clocks, as they could in the original system (an inertial system with vanishing gravitational field).

The passage to the general theory of relativity is realized by the assumption that such a representation of the field properties of space already mentioned, by functions gμν (that is to say by a Riemann metric), is also justified in the general case in which there is no system of coordinates in relation to which the metric takes the simple quasi-Euclidian form of the special theory of relativity.

Now the coordinates, by themselves, no longer express metric relations, but only the “neighborliness” of the things described, whose coordinates differ but little from one another. All transformations of the coordinates have to be admitted so long as these transformations are free from singularities. Only such equations as are covariant in relation to arbitrary transformations in this sense have meaning as expressions of general laws of nature (postulate of general covariancy).

The first aim of the general theory of relativity was a preliminary statement which, by giving up the requirement of constituting a closed thing in itself, could be connected in as simple a manner as possible with the “facts directly observed.” Newton’s gravitational theory gave an example, by restricting itself to the pure mechanics of gravitation. This preliminary statement may be characterized as follows:

(1) The concept of the material point and of its mass is retained. A law of motion is given for it, this law of motion being the translation of the law of inertia into the language of the general theory of relativity. This law is a system of total differential equations, the system characteristic of the geodetic line.

(2) In place of Newton’s law of interaction by gravitation, we shall find the system of the simplest generally covariant differential equations which can be set up for the gμν-tensor. It is formed by equating to zero the once contracted Riemannian curvature tensor (Rμν = 0).
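For orientation, the two statements above are usually written as follows. The notation (upper and lower indices, the Christoffel symbols Γ, and the line element s as curve parameter) is the now-standard one and is an editorial assumption, not taken from the original text.

```latex
% (1) Law of motion: the equation of the geodetic (geodesic) line
\frac{d^2 x^\mu}{ds^2}
  + \Gamma^{\mu}_{\alpha\beta}\,
    \frac{dx^\alpha}{ds}\,\frac{dx^\beta}{ds} = 0 ,
\qquad
\Gamma^{\mu}_{\alpha\beta}
  = \tfrac{1}{2}\, g^{\mu\sigma}
    \left( \partial_\alpha g_{\sigma\beta}
         + \partial_\beta  g_{\sigma\alpha}
         - \partial_\sigma g_{\alpha\beta} \right)

% (2) Field law in empty space: the once contracted curvature tensor vanishes
R_{\mu\nu} = 0
```

Where the gμν are constant, the Γ vanish and (1) reduces to uniform motion in a straight line, which is the law of inertia.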

This formulation permits the treatment of the problem of the planets. More accurately speaking, it allows the treatment of the problem of motion of material points of practically negligible mass in the gravitational field produced by a material point which itself is supposed to have no motion (central symmetry). It does not take into account the reaction of the “moved” material points on the gravitational field, nor does it consider how the central mass produces this gravitational field.

Analogy with classical mechanics shows that the following is a way to complete the theory. One sets up as field equation

$$R_{ik} - \tfrac{1}{2}\, g_{ik} R = -T_{ik}$$

where R represents the scalar of Riemannian curvature, Tik the energy tensor of the matter in a phenomenological representation. The left side of the equation is chosen in such a manner that its divergence disappears identically. The resulting disappearance of the divergence of the right side produces the “equations of motion” of matter, in the form of partial differential equations for the case where Tik introduces, for the description of the matter, only four further functions independent of each other (for instance, density, pressure, and velocity components, where there is between the latter an identity, and between pressure and density an equation of condition).
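The logic of this paragraph can be summarized in one line. The sketch assumes the field equation in the form R_ik − ½ g_ik R = −T_ik (the sign and units of T_ik being conventions) and writes ∇ for the covariant derivative, a symbol not used in the original.

```latex
\nabla^{k}\!\left( R_{ik} - \tfrac{1}{2}\, g_{ik} R \right) \equiv 0
\quad \text{(contracted Bianchi identity)}
\qquad\Longrightarrow\qquad
\nabla^{k} T_{ik} = 0
```

The left-hand identity holds for any metric, so the field equation can only be satisfied if the four conditions on T_ik hold as well. These four conditions play the role of the equations of motion of matter.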

By this formulation one reduces the whole mechanics of gravitation to the solution of a single system of covariant partial differential equations. The theory avoids all internal discrepancies which we have charged against the basis of classical mechanics. It is sufficient, as far as we know, for the representation of the observed facts of celestial mechanics. But it is similar to a building, one wing of which is made of fine marble (left part of the equation), but the other wing of which is built of low grade wood (right side of equation). The phenomenological representation of matter is, in fact, only a crude substitute for a representation which would correspond to all known properties of matter.

There is no difficulty in connecting Maxwell’s theory of the electromagnetic field with the theory of the gravitational field so long as one restricts himself to space, free of ponderable matter and free of electric density. All that is necessary is to put on the right hand side of the above equation for Tik, the energy tensor of the electromagnetic field in empty space and to associate with the so modified system of equations the Maxwell field equation for empty space, written in general covariant form. Under these conditions there will exist, between all these equations, a sufficient number of the differential identities to guarantee their consistency. We may add that this necessary formal property of the total system of equations leaves arbitrary the choice of the sign of the member Tik, a fact which was later shown to be important.

The desire to have, for the foundations of the theory, the greatest possible unity has resulted in several attempts to include the gravitational field and the electromagnetic field in one formal but homogeneous picture. Here we must mention particularly the five-dimensional theory of Kaluza and Klein. Having considered this possibility very carefully I feel that it is more desirable to accept the lack of internal uniformity of the original theory, because I do not consider that the totality of the hypothetical basis of the five-dimensional theory contains less of an arbitrary nature than does the original theory. The same statement may be made for the projective variety of the theory, which has been elaborated with great care, in particular, by v. Dantzig and by Pauli.

The foregoing considerations concern, exclusively, the theory of the field, free of matter. How are we to proceed from this point in order to obtain a complete theory of atomically constructed matter? In such a theory, singularities must certainly be excluded, since without such exclusion the differential equations do not completely determine the total field. Here, in the field theory of general relativity, we meet the same problem of a theoretical field-representation of matter as was met originally in connection with the pure Maxwell theory.

Here again the attempt to construct particles out of the field theory, leads apparently to singularities. Here also the endeavor has been made to overcome this defect by the introduction of new field variables and by elaborating and extending the system of field equations. Recently, however, I discovered, in collaboration with Dr. Rosen, that the above mentioned simplest combination of the field equations of gravitation and electricity produces centrally symmetrical solutions which can be represented as free of singularity (the well known centrally symmetrical solutions of Schwarzschild for the pure gravitational field, and those of Reissner for the electric field with consideration of its gravitational action). We shall refer to this shortly in the paragraph next but one. In this way it seems possible to get for matter and its interactions a pure field theory free of additional hypotheses, one moreover whose test by submission to facts of experience does not result in difficulties other than purely mathematical ones, which difficulties, however, are very serious.

5. QUANTUM THEORY AND THE FUNDAMENTALS OF PHYSICS

The theoretical physicists of our generation are expecting the erection of a new theoretical basis for physics which would make use of fundamental concepts greatly different from those of the field theory considered up to now. The reason is that it has been found necessary to use—for the mathematical representation of the so-called quantum phenomena—new sorts of methods of consideration.

While the failure of classical mechanics, as revealed by the theory of relativity, is connected with the finite speed of light (its avoidance of being ∞), it was discovered at the beginning of our century that there were other kinds of inconsistencies between deductions from mechanics and experimental facts, which inconsistencies are connected with the finite magnitude (the avoidance of being zero) of Planck’s constant h. In particular, while molecular mechanics requires that both heat content and (monochromatic) radiation density of solid bodies should decrease in proportion to the decreasing absolute temperature, experience has shown that they decrease much more rapidly than the absolute temperature. For a theoretical explanation of this behavior it was necessary to assume that the energy of a mechanical system cannot assume any sort of value, but only certain discrete values whose mathematical expressions were always dependent upon Planck’s constant h. Moreover, this conception was essential for the theory of the atom (Bohr’s theory). For the transitions of these states into one another—with or without emission or absorption of radiation—no causal laws could be given, but only statistical ones; and, a similar conclusion holds for the radioactive decomposition of atoms, which decomposition was carefully investigated about the same time. For more than two decades physicists tried vainly to find a uniform interpretation of this “quantum character” of systems and phenomena. Such an attempt was successful about ten years ago, through the agency of two entirely different theoretical methods of attack. We owe one of these to Heisenberg and Dirac, and the other to de Broglie and Schrödinger. The mathematical equivalence of the two methods was soon recognized by Schrödinger. 
I shall try here to sketch the line of thought of de Broglie and Schrödinger, which lies closer to the physicist’s method of thinking, and shall accompany the description with certain general considerations.
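The low-temperature discrepancy described above can be made concrete with Planck's expression for the mean energy of an oscillator of frequency ν, hν / (exp(hν/kT) − 1), compared with the classical value kT. This is an illustrative sketch only: the formula is the standard one from Planck's radiation theory rather than something stated in this essay, and the function name and the dimensionless variable x = hν/kT are choices made here.

```python
import math

def quantum_to_classical_ratio(x):
    """Ratio of Planck's mean oscillator energy to the classical value kT.

    x = h*nu / (k*T) is dimensionless; large x corresponds to low temperature.
    """
    # expm1(x) = exp(x) - 1, evaluated accurately even for small x
    return x / math.expm1(x)

for x in (0.01, 1.0, 5.0, 10.0):
    print(f"h*nu/kT = {x:5.2f}   ratio = {quantum_to_classical_ratio(x):.6f}")
```

For small x (high temperature) the ratio is close to 1, which is the classical result. For large x (low temperature) it falls off exponentially, so the energy decreases much faster than in proportion to T.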

The question is first: How can one assign a discrete succession of energy values Hσ to a system specified in the sense of classical mechanics (the energy function is a given function of the coordinates qr and the corresponding momenta pr)? Planck’s constant h relates the frequency Hσ/h to the energy values Hσ. It is therefore sufficient to give to the system a succession of discrete frequency values. This reminds us of the fact that in acoustics, a series of discrete frequency values is coordinated to a linear partial differential equation (if boundary values are given), namely the sinusoidal periodic solutions. In corresponding manner, Schrödinger set himself the task of coordinating a partial differential equation for a scalar function ψ to the given energy function ε(qr, pr), where the qr and the time t are independent variables. In this he succeeded (for a complex function ψ) in such a manner that the theoretical values of the energy Hσ, as required by the statistical theory, actually resulted in a satisfactory manner from the periodic solutions of the equation.
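The acoustic analogy can be illustrated numerically. The sketch below is a toy under stated assumptions, not Schrödinger's actual construction: it takes the simplest wave equation ψ'' = −k²ψ on a string of length 1 fixed at both ends, and searches for the values of k at which the boundary condition ψ(1) = 0 can be met. All function names, step counts, and search brackets are hypothetical choices.

```python
import math

def end_value(k, length=1.0, steps=2000):
    """Integrate psi'' = -k^2 psi from psi(0) = 0, psi'(0) = 1; return psi(length)."""
    h = length / steps
    psi, dpsi = 0.0, 1.0
    for _ in range(steps):
        # Leapfrog (velocity-Verlet) step for the second-order equation
        dpsi -= 0.5 * h * k * k * psi
        psi += h * dpsi
        dpsi -= 0.5 * h * k * k * psi
    return psi

def find_allowed_k(lo, hi, iterations=60):
    """Bisection for a k in [lo, hi] at which psi(length) changes sign."""
    f_lo = end_value(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = end_value(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

# Search for the n-th allowed value between (n - 1/2)*pi and (n + 1/2)*pi
allowed = [find_allowed_k((n - 0.5) * math.pi, (n + 0.5) * math.pi)
           for n in (1, 2, 3)]
for n, k in enumerate(allowed, start=1):
    print(f"n = {n}: k = {k:.5f}   k / pi = {k / math.pi:.5f}")
```

Only values of k close to π, 2π, 3π satisfy both boundary conditions, so the admissible frequencies form a discrete series. In Schrödinger's scheme the energies Hσ are discrete for the same kind of reason, with h converting frequency into energy.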

To be sure, it did not happen to be possible to associate a definite movement, in the sense of mechanics of material points, with a definite solution ψ(qr, t) of the Schrödinger equation. This means that the ψ function does not determine, at any rate exactly, the history of the qr as functions of the time t. According to Born, however, an interpretation of the physical meaning of the ψ functions was shown to be possible in the following manner: ψψ* (the square of the absolute value of the complex function ψ, where ψ* is its complex conjugate) is the probability density at the point under consideration in the configuration-space of the qr, at the time t. It is therefore possible to characterize the content of the Schrödinger equation in a manner, easy to be understood, but not quite accurate, as follows: it determines how the probability density of a statistical ensemble of systems varies in the configuration-space with the time. Briefly: the Schrödinger equation determines the alteration of the function ψ of the qr with the time.
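Born's interpretation can be tried out on a simple case. The sketch below is illustrative only: the Gaussian wave packet, its parameters sigma and k, and the one-dimensional grid are assumptions chosen for the example, not anything specified in the text. It forms the squared absolute value of a complex ψ, normalizes it, and reads off the probability of finding the coordinate q in a given region.

```python
import cmath

def wave_packet(q, sigma=1.0, k=2.0):
    """Unnormalized complex Gaussian wave packet (an illustrative choice)."""
    return cmath.exp(-q * q / (4.0 * sigma * sigma)) * cmath.exp(1j * k * q)

dq = 0.01
grid = [-10.0 + i * dq for i in range(2001)]

# Squared absolute value of psi on the grid, normalized to total probability 1
density = [abs(wave_packet(q)) ** 2 for q in grid]
norm = sum(density) * dq
density = [d / norm for d in density]

total = sum(density) * dq
inside = sum(d for q, d in zip(grid, density) if abs(q) <= 1.0) * dq

print(f"total probability = {total:.6f}")
print(f"P(|q| <= sigma)   = {inside:.3f}")
```

The phase factor exp(ikq) drops out of the density entirely, and the probability of finding q within one sigma of the center comes out at about 0.68, as for any Gaussian distribution. The function ψ thus fixes probabilities for the qr, not their actual values.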

It must be mentioned that the result of this theory contains, as limiting values, the result of the particle mechanics if the wavelength encountered during the solution of the Schrödinger problem is everywhere so small that the potential energy varies by a practically infinitely small amount for a change of one wavelength in the configuration-space. Under these conditions the following can in fact be shown: We choose a region G0 in the configuration-space which, although large (in every dimension) in relation to the wavelength, is small in relation to the practical dimensions of the configuration-space. Under these conditions it is possible to choose a function ψ for an initial time t0 in such a manner that it vanishes outside of the region G0, and behaves, according to the Schrödinger equation, in such a manner that it retains this property, approximately at least, also for a later time, but with the region G0 having passed at that time t into another region G. In this manner one can, with a certain degree of approximation, speak of the motion of the region G as a whole, and one can approximate this motion by the motion of a point in the configuration-space. This motion then coincides with the motion which is required by the equations of classical mechanics.

Experiments on interference made with particle rays have given a brilliant proof that the wave character of phenomena of motion as assumed by the theory does, really, correspond to the facts. In addition to this, the theory succeeded, easily, in demonstrating the statistical laws of the transition of a system from one quantum condition to another under the action of external forces, which, from the standpoint of classical mechanics, appears as a miracle. The external forces were here represented by small additions of the potential energy as functions of the time. Now, while in classical mechanics, such additions can produce only correspondingly small alterations of the system, in the quantum mechanics they produce alterations of any magnitude however large, but with correspondingly small probability, a consequence in perfect harmony with experience. Even an understanding of the laws of radioactive decomposition, at least in their broad lines, was provided by the theory.

Probably never before has a theory been evolved which has given a key to the interpretation and calculation of such a heterogeneous group of phenomena of experience as has the quantum theory. In spite of this, however, I believe that the theory is apt to beguile us into error in our search for a uniform basis for physics, because, in my belief, it is an incomplete representation of real things, although it is the only one which can be built out of the fundamental concepts of force and material points (quantum corrections to classical mechanics). The incompleteness of the representation is the outcome of the statistical nature (incompleteness) of the laws. I will now justify this opinion.
