A Stubbornly Persistent Illusion

Home > Science > A Stubbornly Persistent Illusion > Page 14
A Stubbornly Persistent Illusion Page 14

by Stephen Hawking


  This stipulation contains a further physical hypothesis, the validity of which will hardly be doubted without empirical evidence to the contrary. It has been assumed that all these clocks go at the same rate if they are of identical construction. Stated more exactly: When two clocks arranged at rest in different places of a reference-body are set in such a manner that a particular position of the pointers of the one clock is simultaneous (in the above sense) with the same position of the pointers of the other clock, then identical “settings” are always simultaneous (in the sense of the above definition).

  1We suppose further, that, when three events A, B and C occur in different places in such a manner that A is simultaneous with B, and B is simultaneous with C (simultaneous in the sense of the above definition), then the criterion for the simultaneity of the pair of events A, C is also satisfied. This assumption is a physical hypothesis about the law of propagation of light; it must certainly be fulfilled if we are to maintain the law of the constancy of the velocity of light in vacuo.

  NINE

  THE RELATIVITY OF SIMULTANEITY

  Up to now our considerations have been referred to a particular body of reference, which we have styled a “railway embankment.” We suppose a very long train travelling along the rails with the constant velocity v and in the direction indicated in Fig. 1. People travelling in this

  FIG. 1.

  train will with advantage use the train as a rigid reference-body (coordinate system); they regard all events in reference to the train. Then every event which takes place along the line also takes place at a particular point of the train. Also the definition of simultaneity can be given relative to the train in exactly the same way as with respect to the embankment. As a natural consequence, however, the following question arises:

  Are two events (e.g. the two strokes of lightning A and B) which are simultaneous with reference to the railway embankment also simultaneous relatively to the train? We shall show directly that the answer must be in the negative.

  When we say that the lightning strokes A and B are simultaneous with respect to the embankment, we mean: the rays of light emitted at the places A and B, where the lightning occurs, meet each other at the mid-point M of the length A → B of the embankment. But the events A and B also correspond to positions A and B on the train. Let M′ be the mid-point of the distance A → B on the travelling train. Just when the flashes1 of lightning occur, this point M′ naturally coincides with the point M, but it moves towards the right in the diagram with the velocity v of the train. If an observer sitting in the position M′ in the train did not possess this velocity, then he would remain permanently at M, and the light rays emitted by the flashes of lightning A and B would reach him simultaneously, i.e. they would meet just where he is situated. Now in reality (considered with reference to the railway embankment) he is hastening towards the beam of light coming from B, whilst he is riding on ahead of the beam of light coming from A. Hence the observer will see the beam of light emitted from B earlier than he will see that emitted from A. Observers who take the railway train as their reference-body must therefore come to the conclusion that the lightning flash B took place earlier than the lightning flash A. We thus arrive at the important result:

  Events which are simultaneous with reference to the embankment are not simultaneous with respect to the train, and vice versa (relativity of simultaneity). Every reference-body (co-ordinate system) has its own particular time; unless we are told the reference-body to which the statement of time refers, there is no meaning in a statement of the time of an event.

  Now before the advent of the theory of relativity it had always tacitly been assumed in physics that the statement of time had an absolute significance, i.e. that it is independent of the state of motion of the body of reference. But we have just seen that this assumption is incompatible with the most natural definition of simultaneity; if we discard this assumption, then the conflict between the law of the propagation of light in vacuo and the principle of relativity (developed in Section 7) disappears.

  We were led to that conflict by the considerations of Section 6, which are now no longer tenable. In that section we concluded that the man in the carriage, who traverses the distance w per second relative to the carriage, traverses the same distance also with respect to the embankment in each second of time. But, according to the foregoing considerations, the time required by a particular occurrence with respect to the carriage must not be considered equal to the duration of the same occurrence as judged from the embankment (as reference-body). Hence it cannot be contended that the man in walking travels the distance w relative to the railway line in a time which is equal to one second as judged from the embankment.

  Moreover, the considerations of Section 6 are based on yet a second assumption, which, in the light of a strict consideration, appears to be arbitrary, although it was always tacitly made even before the introduction of the theory of relativity.

  TEN

  ON THE RELATIVITY OF THE CONCEPTION OF DISTANCE

  Let us consider two particular points on the train1 travelling along the embankment with the velocity v, and inquire as to their distance apart. We already know that it is necessary to have a body of reference for the measurement of a distance, with respect to which body the distance can be measured up. It is the simplest plan to use the train itself as reference-body (co-ordinate system). An observer in the train measures the interval by marking off his measuring-rod in a straight line (e.g. along the floor of the carriage) as many times as is necessary to take him from the one marked point to the other. Then the number which tells us how often the rod has to be laid down is the required distance.

  It is a different matter when the distance has to be judged from the railway line. Here the following method suggests itself. If we call A and B the two points on the train whose distance apart is required, then both of these points are moving with the velocity v along the embankment. In the first place we require to determine the points A and B of the embankment which are just being passed by the two points A′ and B′ at a particular time t—judged from the embankment. These points A and B of the embankment can be determined by applying the definition of time given in Section 8. The distance between these points A and B is then measured by repeated application of the measuring-rod along the embankment.

  A priori it is by no means certain that this last measurement will supply us with the same result as the first. Thus the length of the train as measured from the embankment may be different from that obtained by measuring in the train itself. This circumstance leads us to a second objection which must be raised against the apparently obvious consideration of Section 6. Namely, if the man in the carriage covers the distance w in a unit of time—measured from the train— then this distance—as measured from the embankment—is not necessarily also equal to w.

  1E.g. the middle of the first and of the twentieth carriage.

  ELEVEN

  THE LORENTZ TRANSFORMATION

  The results of the last three sections show that the apparent incompatibility of the law of propagation of light with the principle of relativity (Section 7) has been derived by means of a consideration which borrowed two unjustifiable hypotheses from classical mechanics; these are as follows:

  (1) The time-interval (time) between two events is independent of the condition of motion of the body of reference.

  (2) The space-interval (distance) between two points of a rigid body is independent of the condition of motion of the body of reference.

  If we drop these hypotheses, then the dilemma of Section 7 disappears, because the theorem of the addition of velocities derived in Section 6 becomes invalid. The possibility presents itself that the law of the propagation of light in vacuo may be compatible with the principle of relativity, and the question arises: How have we to modify the considerations of Section 6 in order to remove the apparent disagreement between these two fundamental results of experience? This question leads to a general one. In the discussi
on of Section 6 we have to do with places and times relative both to the train and to the embankment. How are we to find the place and time of an event in relation to the train, when we know the place and time of the event with respect to the railway embankment? Is there a thinkable answer to this question of such a nature that the law of transmission of light in vacuo does not contradict the principle of relativity? In other words: Can we conceive of a relation between place and time of the individual events relative to both reference-bodies, such that every ray of light possesses the velocity of transmission c relative to the embankment and relative to the train? This question leads to a quite definite positive answer, and to a perfectly definite transformation law for the space-time magnitudes of an event when changing over from one body of reference to another.

  Before we deal with this, we shall introduce the following incidental consideration. Up to the present we have only considered events taking place along the embankment, which had mathematically to assume the function of a straight line. In the manner indicated in Section 2 we can imagine this reference-body supplemented laterally and in a vertical direction by means of a framework of rods, so that an event which takes place anywhere can be localised with reference to this framework. Similarly, we can imagine the train travelling with the velocity v to be continued across the whole of space, so that every event, no matter how far off it may be, could also be localised with respect to the second framework. Without committing any fundamental error, we can disregard the fact that in reality these frameworks would continually interfere with each other, owing to the impenetrability of solid bodies. In every such framework we imagine three surfaces perpendicular to each other marked out, and designated as “co-ordinate planes” (“co-ordinate system”). A co-ordinate system K then corresponds to the embankment, and a co-ordinate system K to the train. An event, wherever it may have taken place, would be fixed in space with respect to K by the three perpendiculars x, y, z on the co-ordinate planes, and with regard to time by a time-value t. Relative to K′, the same event would be fixed in respect of space and time by corresponding values x′, y′, z′, t′, which of course are not identical with x, y, z t. It has already been set forth in detail how these magnitudes are to be regarded as results of physical measurements.

  FIG. 2.

  Obviously our problem can be exactly formulated in the following manner. What are the values x′, Y′, z′, t′, of an event with respect to K′, when the magnitudes x, y, z, t, of the same event with respect to K are given? The relations must be so chosen that the law of the transmission of light in vacuo is satisfied for one and the same ray of light (and of course for every ray) with respect to K and K′. For the relative orientation in space of the co-ordinate systems indicated in the diagram (Fig. 2), this problem is solved by means of the equations:

  This system of equations is known as the “Lorentz transformation.”1

  If in place of the law of transmission of light we had taken as our basis the tacit assumptions of the older mechanics as to the absolute character of times and lengths, then instead of the above we should have obtained the following equations:

  This system of equations is often termed the “Galilei transformation.” The Galilei transformation can be obtained from the Lorentz transformation by substituting an infinitely large value for the velocity of light c in the latter transformation.

  Aided by the following illustration, we can readily see that, in accordance with the Lorentz transformation, the law of the transmission of light in vacuo is satisfied both for the reference-body K and for the reference-body K′. A light-signal is sent along the positive x-axis, and this light-stimulus advances in accordance with the equation.

  x = ct,

  i.e. with the velocity c. According to the equations of the Lorentz transformation, this simple relation between x and t involves a relation between x′ and t′. In point of fact, if we substitute for x the value ct in the first and fourth equations of the Lorentz transformation, we obtain:

  from which, by division, the expression

  x′ = ct′

  immediately follows. If referred to the system K′, the propagation of light takes place according to this equation. We thus see that the velocity of transmission relative to the reference-body K′ is also equal to c. The same result is obtained for rays of light advancing in any other direction whatsoever. Of course this is not surprising, since the equations of the Lorentz transformation were derived conformably to this point of view.

  1A simple derivation of the Lorentz transformation is given in Appendix 1.

  TWELVE

  THE BEHAVIOUR OF MEASURING-RODS AND CLOCKS IN MOTION

  I place a metre-rod in the x′-axis of K′ in such a manner that one end (the beginning) coincides with the point x′ = 0, whilst the other end (the end of the rod) coincides with the point x′ = 1. What is the length of the metre-rod relatively to the system K? In order to learn this, we need only ask where the beginning of the rod and the end of the rod lie with respect to K at a particular time t of the system K. By means of the first equation of the Lorentz transformation the values of these two points at the time t = 0 can be shown to be

  the distance between the points being . But the metre-rod is moving with the velocity v relative to K. It therefore follows that the length of a rigid metre-rod moving in the direction of its length with a velocity v is of a metre. The rigid rod is thus shorter when in motion than when at rest, and the more quickly it is moving, the shorter is the rod. For the velocity v = c we should have and for still greater velocities the square-root becomes imaginary. From this we conclude that in the theory of relativity the velocity c plays the part of a limiting velocity, which can neither be reached nor exceeded by any real body.

  Of course this feature of the velocity c as a limiting velocity also clearly follows from the equations of the Lorentz transformation, for these become meaningless if we choose values of v greater than c.

  If, on the contrary, we had considered a metre-rod at rest in the x-axis with respect to K then we should have found that the length of the rod as judged from K′ would have been this is quite in accordance with the principle of relativity which forms the basis of our considerations.

  A priori it is quite clear that we must be able to learn something about the physical behaviour of measuring-rods and clocks from the equations of transformation, for the magnitudes x, y, z,t, are nothing more nor less than the results of measurements obtainable by means of measuring-rods and clocks. If we had based our considerations on the Galileian transformation we should not have obtained a contraction of the rod as a consequence of its motion.

  Let us now consider a seconds-clock which is permanently situated at the origin (x′ = 0 ) of K′. t′ = 0 and t′ = 1 are two successive ticks of this clock. The first and fourth equations of the Lorentz transformation give for these two ticks:

  and

  As judged from K, the clock is moving with the velocity v; as judged from this reference-body, the time which elapses between two strokes of the clock is not one second, but seconds, i.e. a somewhat larger time. As a consequence of its motion the clock goes more slowly than when at rest. Here also the velocity c plays the part of an unattainable limiting velocity.

  THIRTEEN

  THEOREM OF THE ADDITION OF THE VELOCITIES. THE EXPERIMENT OF FIZEAU

  Now in practice we can move clocks and measuring-rods only with velocities that are small compared with the velocity of light; hence we shall hardly be able to compare the results of the previous section directly with the reality. But, on the other hand, these results must strike you as being very singular, and for that reason I shall now draw another conclusion from the theory, one which can easily be derived from the foregoing considerations, and which has been most elegantly confirmed by experiment.

  In Section 6 we derived the theorem of the addition of velocities in one direction in the form which also results from the hypotheses of classical mechanics. This theorem can also be deduced readily fr
om the Galilei transformation (Section 11). In place of the man walking inside the carriage, we introduce a point moving relatively to the coordinate system K′ in accordance with the equation

  By means of the first and fourth equations of the Galilei transformation we can express x′ and t′ in terms of x and t, and we then obtain

  This equation expresses nothing else than the law of motion of the point with reference to the system K (of the man with reference to the embankment). We denote this velocity by the symbol W, and we then obtain, as in Section 6,

  But we can carry out this consideration just as well on the basis of the theory of relativity. In the equation

  we must then express x′ and t′ in terms of x and t, making use of the first and fourth equations of the Lorentz transformation. Instead of the equation (A) we then obtain the equation

  which corresponds to the theorem of addition for velocities in one direction according to the theory of relativity. The question now arises as to which of these two theorems is the better in accord with experience. On this point we are enlightened by a most important experiment which the brilliant physicist Fizeau performed more than half a century ago, and which has been repeated since then by some of the best experimental physicists, so that there can be no doubt about its result. The experiment is concerned with the following question. Light travels in a motionless liquid with a particular velocity w. How quickly does it travel in the direction of the arrow in the tube T (see the accompanying diagram, Fig. 3) when the liquid above mentioned is flowing through the tube with a velocity v?

 

‹ Prev