The Theory of Relativity: and Other Essays

Home > Science > The Theory of Relativity: and Other Essays > Page 4
The Theory of Relativity: and Other Essays Page 4

by Albert Einstein


  Here H. A. Lorentz found an escape which showed, at the same time, the way to an electrodynamic theory of bodies in motion, a theory which was more or less free of arbitrary assumption. His theory was built on the following fundamental hypothesis:

  Everywhere (including the interior of ponderable bodies) the seat of the field is the empty space. The participation of matter in electromagnetic phenomena has its origin only in the fact that the elementary particles of matter carry unalterable electric charges, and, on this account are subject on the one hand to the actions of ponderomotive forces and on the other hand possess the property of generating a field. The elementary particles obey Newton’s law of motion for the material point.

  This is the basis on which H. A. Lorentz obtained his synthesis of Newton’s mechanics and Maxwell’s field theory. The weakness of this theory lies in the fact that it tried to determine the phenomena by a combination of partial differential equations (Maxwell’s field equations for empty space) and total differential equations (equations of motion of points), which procedure was obviously unnatural. The unsatisfactory part of the theory showed up externally by the necessity of assuming finite dimensions for the particles in order to prevent the electromagnetic field existing at their surfaces from becoming infinitely great. The theory failed moreover to give any explanation concerning the tremendous forces which hold the electric charges on the individual particles. H. A. Lorentz accepted these weaknesses of his theory, which were well known to him, in order to explain the phenomena correctly at least as regards their general lines.

  Furthermore, there was one consideration which reached beyond the frame of Lorentz’s theory. In the environment of an electrically charged body there is a magnetic field which furnishes an (apparent) contribution to its inertia. Should it not be possible to explain the total inertia of the particles electromagnetically? It is clear that this problem could be worked out satisfactorily only if the particles could be interpreted as regular solutions of the electromagnetic partial differential equations. The Maxwell equations in their original form do not, however, allow such a description of particles, because their corresponding solutions contain a singularity. Theoretical physicists have tried for a long time, therefore, to reach the goal by a modification of Maxwell’s equations. These attempts have, however, not been crowned with success. Thus it happened that the goal of erecting a pure electromagnetic field theory of matter remained unattained for the time being, although in principle no objection could be raised against the possibility of reaching such a goal. The thing which deterred one in any further attempt in this direction was the lack of any systematic method leading to the solution. What appears certain to me, however, is that, in the foundations of any consistent field theory, there shall not be, in addition to the concept of field, any concept concerning particles. The whole theory must be based solely on partial differential equations and their singularity-free solutions.

  § 4.

  The Theory of Relativity

  There is no inductive method which could lead to the fundamental concepts of physics. Failure to understand this fact constituted the basic philosophical error of so many investigators of the nineteenth century. It was probably the reason why the molecular theory, and Maxwell’s theory were able to establish themselves only at a relatively late date. Logical thinking is necessarily deductive; it is based upon hypothetical concepts and axioms. How can we hope to choose the latter in such a manner as to justify us in expecting success as a consequence?

  The most satisfactory situation is evidently to be found in cases where the new fundamental hypotheses are suggested by the world of experience itself. The hypothesis of the non-existence of perpetual motion as a basis for thermodynamics affords such an example of a fundamental hypothesis suggested by experience; the same thing holds for the principle of inertia of Galileo. In the same category, moreover, we find the fundamental hypotheses of the theory of relativity, which theory has led to an unexpected expansion and broadening of the field theory, and to the superseding of the foundations of classical mechanics.

  The successes of the Maxwell-Lorentz theory have given great confidence in the validity of the electromagnetic equations for empty space and hence, in particular, to the statement that light travels “in space” with a certain constant speed c. Is this law of the invariability of light velocity in relation to any desired inertial system valid? If it were not, then one specific inertial system or more accurately, one specific state of motion (of a body of reference), would be distinguished from all others. In opposition to this idea, however, stand all the mechanical and electromagnetic-optical facts of our experience.

  For these reasons it was necessary to raise to the degree of a principle, the validity of the law of constancy of light velocity for all inertial systems. From this, it follows that the spacial coordinates X1, X2, X3, and the time X4, must be transformed according to the “Lorentz-transformation” which is characterized by invariance of the expression

  ds2 = dx12 + dx22 +dx32 − dx42

  (if the unit of time is chosen in such a manner that the speed of light c = 1).

  By this procedure time lost its absolute character, and was included with the “spacial” coordinates as of algebraically (nearly) similar character. The absolute character of time and particularly of simultaneity were destroyed, and the four dimensional description became introduced as the only adequate one.

  In order to account, also, for the equivalence of all inertial systems with regard to all the phenomena of nature, it is necessary to postulate invariance of all systems of physical equations which express general laws, with regard to the Lorentzian transformation. The elaboration of this requirement forms the content of the special theory of relativity.

  This theory is compatible with the equations of Maxwell; but, it is incompatible with the basis of classical mechanics. It is true that the equations of motion of the material point can be modified (and with them the expressions for momentum and kinetic energy of the material point) in such a manner as to satisfy the theory; but, the concept of the force of interaction, and with it the concept of potential energy of a system, lose their basis, because these concepts rest upon the idea of absolute instantaneousness. The field, as determined by differential equations, takes the place of the force.

  Since the foregoing theory allows interaction only by fields, it requires a field theory of gravitation. Indeed, it is not difficult to formulate such a theory in which, as in Newton’s theory, the gravitational fields can be reduced to a scalar which is the solution of a partial differential equation. However, the experimental facts expressed in Newton’s theory of gravitation lead in another direction, that of the general theory of relativity.

  Classical mechanics contains one point which is unsatisfactory in that, in the fundamentals, the same mass constant is met twice over in two different rôles, namely as “inertial mass” in the law of motion, and as “gravitational mass” in the law of gravitation. As a result of this, the acceleration of a body in a pure gravitational field is independent of its material; or, in a coordinate system of uniform acceleration (accelerated in relation to an “inertial system”) the motions take place as they would in a homogeneous gravitational field (in relation to a “motionless” system of coordinates). If one assumes that the equivalence of these two cases is complete, then one attains an adaptation of our theoretical thinking to the fact that the gravitational and inertial masses are identical.

  From this it follows that there is no longer any reason for favoring, as a fundamental principle, the “inertial systems”; and, we must admit as equivalent in their own right, also non-linear transformations of the coordinates (x1, x2, x3, x4). If we make such a transformation of a system of coordinates of the special theory of relativity, then the metric

  ds2 = dx12 + dx22 + dx32 − dx42

  goes over to a general (Riemannian) metric of Bane

  ds2 = gμν dxμ dxν (Summed over μ and ν)

  where the gμν symmetrical
in μ and ν, are certain functions of x1 … x4 which describe both the metric property, and the gravitational field in relation to the new system of coordinates.

  The foregoing improvement in the interpretation of the mechanical basis must, however, be paid for in that—as becomes evident on closer scrutiny—the new coordinates could no longer be interpreted, as results of measurements by rigid bodies and clocks, as they could in the original system (an inertial system with vanishing gravitational field).

  The passage to the general theory of relativity is realized by the assumption that such a representation of the field properties of space already mentioned, by functions gμν, (that is to say by a Riemann metric), is also justified in the general case in which there is no system of coordinates in relation to which the metric takes the simple quasi-Euclidian form of the special theory of relativity.

  Now the coordinates, by themselves, no longer express metric relations, but only the “neighborliness” of the things described, whose coordinates differ but little from one another. All transformations of the coordinates have to be admitted so long as these transformations are free from singularities. Only such equations as are covariant in relation to arbitrary transformations in this sense have meaning as expressions of general law of nature (postulate of general covariancy).

  The first aim of the general theory of relativity was a preliminary statement which, by giving up the requirement of constituting a closed thing in itself, could be connected in as simple a manner as possible with the “facts directly observed.” Newton’s gravitational theory gave an example, by restricting itself to the pure mechanics of gravitation. This preliminary statement may be characterized as follows:

  (1) The concept of the material point and of its mass is retained. A law of motion is given for it, this law of motion being the translation of the law of inertia into the language of the general theory of relativity. This law is a system of total differential equations, the system characteristic of the geodetic line.

  (2) In place of Newton's law of interaction by gravitation, we shall find the system of the simplest generally covariant differential equations which can be set up for the gμν-tensor. It is formed by equating to zero the once contracted Riemannian curvature tensor (Rμν = 0).

  This formulation permits the treatment of the problem of the planets. More accurately speaking, it allows the treatment of the problem of motion of material points of practically negligible mass in the gravitational field produced by a material point which itself is supposed to have no motion (central symmetry). It does not take into account the reaction of the “moved” material points on the gravitational field, nor does it consider how the central mass produces this gravitational field.

  Analogy with classical mechanics shows that the following is a way to complete the theory. One sets up as field equation

  where R represents the scalar of Riemannian curvature, Tik the energy tensor of the matter in a phenomenological representation. The left side of the equation is chosen in such a manner that its divergence disappears identically. The resulting disappearance of the divergence of the right side produces the “equations of motion” of matter, in the form of partial differential equations for the case where Tik introduces, for the description of the matter, only four further functions independent of each other (for instance, density, pressure, and velocity components, where there is between the latter an identity, and between pressure and density an equation of condition).

  By this formulation one reduces the whole mechanics of gravitation to the solution of a single system of covariant partial differential equations. The theory avoids all internal discrepancies which we have charged against the basis of classical mechanics. It is sufficient—as far as we know—for the representation of the observed facts of celestial mechanics. But, it is similar to a building, one wing of which is made of fine marble (left part of the equation), but the other wing of which is built of low grade wood (right side of equation). The phenomenological representation of matter is, in fact, only a crude substitute for a representation which would correspond to all known properties of matter.

  There is no difficulty in connecting Maxwell's theory of the electromagnetic field with the theory of the gravitational field so long as one restricts himself to space, free of ponderable matter and free of electric density. All that is necessary is to put on the right hand side of the above equation for Tik, the energy tensor of the electromagnetic field in empty space and to associate with the so modified system of equations the Maxwell field equation for empty space, written in general covariant form. Under these conditions there will exist, between all these equations, a sufficient number of the differential identities to guarantee their consistency. We may add that this necessary formal property of the total system of equations leaves arbitrary the choice of the sign of the member Tik, a fact which was later shown to be important.

  The desire to have, for the foundations of the theory, the greatest possible unity has resulted in several attempts to include the gravitational field and the electromagnetic field in one formal but homogeneous picture. Here we must mention particularly the five-dimensional theory of Kaluza and Klein. Having considered this possibility very carefully I feel that it is more desirable to accept the lack of internal uniformity of the original theory, because I do not consider that the totality of the hypothetical basis of the five-dimensional theory contains less of an arbitrary nature than does the original theory. The same statement may be made for the projective variety of the theory, which has been elaborated with great care, in particular, by v. Dantzig and by Pauli.

  The foregoing considerations concern, exclusively, the theory of the field, free of matter. How are we to proceed from this point in order to obtain a complete theory of atomically constructed matter? In such a theory, singularities must certainly be excluded, since without such exclusion the differential equations do not completely determine the total field. Here, in the field theory of general relativity, we meet the same problem of a theoretical field-representation of matter as was met originally in connection with the pure Maxwell theory.

  Here again the attempt to construct particles out of the field theory, leads apparently to singularities. Here also the endeavor has been made to overcome this defect by the introduction of new field variables and by elaborating and extending the system of field equations. Recently, however, I discovered, in collaboration with Dr. Rosen, that the above mentioned simplest combination of the field equations of gravitation and electricity produces centrally symmetrical solutions which can be represented as free of singularity (the well known centrally symmetrical solutions of Schwarzschild for the pure gravitational field, and those of Reissner for the electric field with consideration of its gravitational action). We shall refer to this shortly in the paragraph next but one. In this way it seems possible to get for matter and its interactions a pure field theory free of additional hypotheses, one moreover whose test by submission to facts of experience does not result in difficulties other than purely mathematical ones, which difficulties, however, are very serious.

  § 5.

  Quantum Theory and the Fundamentals of Physics

  The theoretical physicists of our generation are expecting the erection of a new theoretical basis for physics which would make use of fundamental concepts greatly different from those of the field theory considered up to now. The reason is that it has been found necessary to use—for the mathematical representation of the so-called quantum phenomena—new sorts of methods of consideration.

  While the failure of classical mechanics, as revealed by the theory of relativity, is connected with the finite speed of light (its avoidance of being ∞), it was discovered at the beginning of our century that there were other kinds of inconsistencies between deductions from mechanics and experimental facts, which inconsistencies are connected with the finite magnitude (the avoidance of being zero) of Planck's constant h. In particular, while molecular mechanics requires that both, heat content and (monochromatic)
radiation density, of solid bodies should decrease in proportion to the decreasing absolute temperature, experience has shown that they decrease much more rapidly than the absolute temperature. For a theoretical explanation of this behavior it was necessary to assume that the energy of a mechanical system cannot assume any sort of value, but only certain discrete values whose mathematical expressions were always dependent upon Planck's constant h. Moreover, this conception was essential for the theory of the atom (Bohr's theory). For the transitions of these states into one another—with or without emission or absorption of radiation—no causal laws could be given, but only statistical ones; and, a similar conclusion holds for the radioactive decomposition of atoms, which decomposition was carefully investigated about the same time. For more than two decades physicists tried vainly to find a uniform interpretation of this “quantum character” of systems and phenomena. Such an attempt was successful about ten years ago, through the agency of two entirely different theoretical methods of attack. We owe one of these to Heisenberg and Dirac, and the other to de Broglie and Schrödinger. The mathematical equivalence of the two methods was soon recognized by Schrödinger. I shall try here to sketch the line of thought of de Broglie and Schrödinger, which lies closer to the physicist's method of thinking, and shall accompany the description with certain general considerations.

  The question is first: How can one assign a discrete succession of energy value Hσ to a system specified in the sense of classical mechanics (the energy function is a given function of the coordinates qr and the corresponding momenta pr)? Planck's constant h relates the frequency to the energy values It is therefore sufficient to give to the system a succession of discrete frequency values. This reminds us of the fact that in acoustics, a series of discrete frequency values is coordinated to a linear partial differential equation (if boundary values are given) namely the sinusoidal periodic solutions. In corresponding manner, Schrödinger set himself the task of coordinating a partial differential equation for a scalar function to the given energy function where the qr and the time t are independent variables. In this he succeeded (for a complex function ) in such a manner that the theoretical values of the energy as required by the statistical theory, actually resulted in a satisfactory manner from the periodic solution of the equation.

 

‹ Prev