It Began with Babbage

Home > Other > It Began with Babbage > Page 8
It Began with Babbage Page 8

by Dasgupta, Subrata


  where P is a polynomial with integer coefficients, known as “Diophantine equations.”

  Hilbert asked whether one could devise a procedure by which it can be decided in a finite number of steps whether a given Diophantine equation is solvable in rational integers (that is, zero, and positive and negative integers, ±1, ±2, and so forth). This is called a decision problem or, to use its impressive German term, Entscheidungsproblem. As for a “procedure” that can be performed in a “finite number of steps,” what Hilbert was asking was whether there is (using present-centered language) an algorithm to decide whether a given Diophantine equation has a solution.

  Now, the Entscheidungsproblem bears a connection with Gödel, for, although Gödel had shown that certain mathematical propositions are undecidable, the Entscheidungsproblem asks whether there is an algorithm that can decide whether any proposition is or is not decidable.

  This is where Alan Turing enters this story.

  III

  As Lord Byron’s estranged daughter and Babbage’s interpreter, as a woman uncommonly gifted in mathematics, and as one who more than anyone else (save for Babbage) can claim to have understood something of what it is to program a computer a century before her time, it is not surprising that Augustus Ada, Countess of Lovelace, has drawn a measure of popular attention. However, among all the characters who people the history of computer science, no one has excited more attention of the nonmathematical, nonscientific world than Alan Mathison Turing (1912–1954).

  Apart from biographies, two plays about him were written, produced, and performed in London and New York (one even garnering three Tony Award nominations). A television film about his life was shown by the BBC. A novel has been written with Turing as its main character. A musical has been made. In Manchester, England, where he lived and taught in the university’s mathematics department in the final years of a short life, and where he died, a bridge and a road were named after him. Statues of the man have been installed in several places in England. Computer laboratories and auditoria and entire buildings have been named after him in universities in Europe, South America, and the United Kingdom. The world’s premier prize in computer science is called the Turing Award. And, in 2012, the centenary of his birth, commemorative events, both scientific and cultural, were held in many parts of the world. One wonders what he would have made of this posthumous adulation.

  No doubt, a significant part of the popular fascination with Turing stems from the tragic aspect of his later life. A homosexual in England in a time when homosexuality was illegal, under the same law as in Oscar Wilde’s time a half-century before, Turing was prosecuted for “gross indecency” in England.8 His death in June 1954 by cyanide poisoning was officially attributed to suicide. His personal eccentricities, as recollected by people who knew him, added to the legend surrounding him. But, all of this would not have meant a thing if he had been an “ordinary” man. This, he most certainly was not. His place in the history of computer science lies in his remarkable contributions to at least three different aspects of this story. Here, we consider the first and, arguably, his most profound contribution.

  At the time Gödel published his massively consequential results, Turing was an undergraduate in King’s College, Cambridge, reading for a mathematics degree or “tripos” (in Cambridge jargon).9 As an undergraduate, Turing read Bertrand Russell’s Introduction to Mathematical Philosophy (1919) and was introduced to the problems of the foundations of mathematics.10 He read a paper on mathematical philosophy to the Moral Science Club (“moral science” being the term Cambridge used to denote philosophy). At King’s College, he came to know, among the college fellows, philosopher of science Richard Braithwaite (1900–1990), and attended a course of lectures on the methodology of science by astrophysicist Arthur Stanley Eddington (1882–1944).11 Working on his own and responding to Eddington’s lectures, some of which touched on probability and statistics, Turing “discovered” a key result in probability theory called the Central Limit Theorem, only to learn that this theorem had actually been published in 1922.12

  He graduated in 1934 and his results in the tripos were good enough for King’s College to offer him a research studentship. He began writing a dissertation that, if accepted, would give him a fellowship in King’s. Dissertation completed and submitted to the college for their consideration, Turing attended a course of lectures offered by Max Newman (1897–1984), a fellow of St. John’s College in Cambridge and distinguished for his contribution to topology, a major branch of mathematics that deals with the properties of spatial objects like curves and surfaces that remain unchanged under such transformations as deformation, twisting, and stretching. Newman will have a later influence on Turing, but in 1935, this particular encounter proved to be fateful. Newman lectured on the foundations of mathematics and the problems Hilbert had laid out in 1928. And he discussed Gödel’s incompleteness theorem.

  Still, Gödel had not dealt with the third of Hilbert’s problems: Entscheidungsproblem. Is there a procedure for determining whether a mathematical proposition was provable or unprovable?

  That same year, at age 22, on the strength of his dissertation and backed by such senior and illustrious King’s College fellows as economists John Maynard Keynes (1883–1946) and Arthur Pigou (1877–1959), Turing was elected a fellow of King’s College.

  IV

  Entscheidungsproblem: a mechanical process to “do” mathematics. What does it mean to call a process mechanical? Does it entail a machine? A mathematical machine?

  Babbage’s Analytical Engine was a mathematical machine. Was this the kind of thing Hilbert had in mind when he formulated his Entscheidungsproblem? Did he at all know of Babbage’s engine? When Turing became interested in Hilbert’s third question, did he know about Babbage? According to his biographer, he did indeed know of Babbage’s work, but probably not until the 1940s.13 At any rate, so his biographer tells us, sometime during the early summer of 1935, in the village of Grantchester, just outside Cambridge—a place immortalized by the poet Rupert Brooke (1887–1915), once a fellow himself of King’s College, in his poem “The Old Vicarage, Grantchester” (1912)—lying in the meadow watching the Cambridge sky, as in Brooke’s poem, Turing began to think of a machine to deal with the Entscheidungsproblem.14

  However, the mathematical machine about which Turing began thinking that summer in 1935 was a far cry from the world of gears, levers, and Jacquard cards. Babbage’s engine was a mathematical machine in that it solved mathematical problems of a certain kind. Turing’s idea of a mathematical machine was, to begin with, far larger in scope regarding what such a machine could do. But also, it was itself a mathematical object. Turing’s vision of a machine belonged not to the world of physical artifacts that obeyed the laws of physics, it belonged to the world of squiggles or, as we will see, squiggles that represented things—hence, it was a world of symbols and symbol processing. It could process symbols but it was itself also a symbolic object; it belonged to the abstract world. Turing envisioned a purely abstract artifact.

  In May 1936, he submitted a long paper bearing the forbidding title “On Computable Numbers with an Application to the Entscheidungsproblem” to the London Mathematical Society. The paper was accepted and published in the society’s Proceedings later that year.15 Notice the title of the paper. It is as if “computable numbers” was the main issue “with an application to” the Entscheidungsproblem as a secondary concern.

  As Nobel laureate biologist and scientific essayist Sir Peter Medawar (1915–1987) memorably wrote, a scientific paper never tells the story of how science actually takes shape in a person’s mind. A scientific paper rarely represents the actual creative thought process that produced the results described in the paper. All the messiness that enters into thinking, all the false trails, blind alleys, muddle-headedness, short-sightedness, and limited rationality that scientists (or any problem solver) experience during actual thinking are cleaned up or shoved under the carpet. What is communicated
is a sanitized “portrait of the scientist as a rational being.” It is for this reason that Medawar claimed that the scientific paper is “a fraud.”16

  So, also, we should not take the title of Turing’s paper and the organization of its contents as a reflection of his actual thought process. Rather, we should accept it as an image of how he probably rationalized his thinking after the fact, an image of how he wanted to present his thoughts to readers.

  He began with the concept of computable numbers. What are these? They are “real numbers whose expressions as a decimal are calculable by finite means.”17 More simply, “a number is computable if its decimal can be written down by a machine.”18 A real number is a quantity that has a decimal expansion, such as 0.25 and 3.14159. Notice, in Turing’s definition, the clause “by finite means.” The calculation of a computable number must eventually come to a stop; it cannot go on forever. In the case of a number such as π = 3.141592653 …, although it is an infinite decimal, it is a computable number in that one can compute the first n decimal digits in a finite number of steps. At the heart of the matter, for Turing, was to determine how a machine could perform the task that we—intuitively and naturally—call computing.

  In 1936, the year his paper “On Computable Numbers” was published, the word computer still meant a human being who computes. And Turing began by sketching out how a (human) “computer” goes about the task of computing.

  It entails reading and writing symbols on paper. But, rather than a two-dimensional paper, imagine a long tape, a “one-dimensional” affair divided into squares (much as a child’s arithmetic book in Turing’s time was divided into squares). Each square can hold a single symbol.

  The computer, being human, has certain obvious perceptual and cognitive traits that affect her behavior. How she behaves at any particular time is dependent on the symbols she is observing and her “state of mind.” Let us suppose that there is a limit to the number of symbols the computer can observe at any moment—there is a bound, in other words, to her perceptual range. If she wants to observe more symbols, she must take successive observations. We also suppose that, although the number of “states of mind” can be large, there is a finite limit to that number.

  We also imagine (Turing wrote) that the computer can only perform very simple or “atomic” operations, and that a computational task is decomposable into a sequence of these atomic operations.

  So what are these atomic operations? They must include operations that (a) change a symbol on an observed square on the tape; (b) move the computer’s “eye” from an observed square to another, within a certain window of squares bounding the observed square; and (c) change the computer’s state of mind. The change of state of mind can accompany either change of symbol on the observed square or change of the observed square.

  Turing then drew an analogy between his human computer and a machine that does the work of the person. To each state of the human’s mind there is a machine state (so the number of machine states is finite). The machine can scan the tape and “read” its squares. In any one move, the machine can change a symbol on the square being read, or move from one square to another on its left or right, and undergo a change of state.19

  Turing then claimed that the atomic operations—scanning a symbol in a square, moving to the right or left of the scanned square, writing a symbol or erasing it, changing the state—are all those that are involved in performing a computation.20 He also stipulated that, at any point in time, the motion of this machine is determined completely by the machine state along with the symbol then being scanned.

  This is what Turing named a computing machine. Suppose, now, that such a machine is given a tape with some “input” set of symbols (or no symbols at all, a blank tape) and, placed in an “initial” configuration, is set into motion. Then, any sequence of symbols printed by the machine onto the tape is said to be computed by the machine.

  It might be convenient to translate Turing’s description into present-centered language:

  A computing machine consists of a tape that is unbounded in length. Each square of the tape can hold one of a vocabulary of symbols, Si, Sj,…. At any point in time, a read/write head is positioned on one square of the tape; call it the current square. The corresponding symbol is the current input symbol. The machine can be in one of a finite number of states, Qk, Ql, …. The state of the machine at any given time is its present state. Depending on the current input symbol and the present state, the read/write head can print a symbol on the current square (possibly overwriting the prior symbol) or erase the current symbol, move one square to the right or to the left, and place the machine into the next state. This next state then becomes the present state of the machine.

  FIGURE 4.1 A “Parity-Detecting” Turing Machine in the Initial State.

  Turing called his machine, simply, a computing machine. In present-centered language, it is called, in his honor, a Turing machine.

  Consider, as an example, a very simple Turing machine (Figure 4.1). Its task is to “read” an input string of 0s and 1s written on the tape, and print a 1 on the tape if the number of 1s in the string is odd—0, otherwise—and come to a halt (in mathematical jargon, this machine is a parity detector). A special symbol X on the tape indicates the end of the input string. The machine replaces the input string with 0s and replaces the X with a 1 or 0, depending on the parity.

  This machine needs three states. Qo corresponds to an odd number of 1s detected in the input string at any point of the machine’s operation. Qe corresponds to an even number of 1s detected up to any point of the machine’s operation. The third state is H, the halting state. The machine begins its operation with the read/write head positioned on the square holding the first binary digit of the input string.

  Consider, for instance, that the tape has the following input string. The initial position of the read/write head is indicated by the digit in bold type.

  … 00 … 1011011X00 … 0…

  The behavior of this Turing machine is defined by what, in present-centered language, is called a state table, which looks like Table 4.1.

  The first row of this table can be read as follows: Given current state Qe and input symbol 0, the next state will be Qe, an output symbol 0 is written onto that square, and the read/write head moves one square to the right. The last row tells us that given the current state Qo and input symbol X, the next state will be the halt state H, output 1 will be written on the tape, and there will be no further motion of the read/write head. The other rows of the state table can be interpreted similarly.

  Tracing the machine’s operation on the example input string, we see that it goes through the following sequence of states and tape configurations. Again, the position of the read/write head at the start of each step in the sequence of operations is indicated by the digit in bold type.

  TABLE 4.1 State Table for the “Parity-Detecting” Machine

  Qe: 1 011011 X

  Qo: 0 0 11011 X

  Qo: 00 1 1011 X

  Qe: 000 1 011 X

  Qo: 0000 0 11 X

  Qo: 00000 1 1 X

  Qe: 000000 1 X

  Qo: 0000000 X

  H: 0000000 1

  Figure 4.2 shows the state of the tape and the position of the read/write head after the leftmost symbol on the tape has been read. The 1 is overwritten by a 0, and the read/write head moves one square to the right.

  FIGURE 4.2 The “Parity-Detecting” Turing Machine after First Move.

  There is, then, a distinct Turing machine for each distinct computational task. Each Turing machine so constructed is a special-purpose machine. Each such special-purpose machine specifies the initial contents of the tape, the desired content of the tape, the initial position of the read/write head, and a state table that defines the possible behavior of the machine. A Turing machine can be constructed to add two integers n, m represented by n 1s followed by a blank followed by m 1s, leaving the result of n + m 1s on the tape. Another machine can be constructed to
perform the multiplication of two numbers n, m leaving the result on the tape. Yet another machine could be built that, say, given an input string of a’s b’s, c’s would produce its mirror image (a palindrome). For example, from input string aaabbaacc the output would be ccaabbaaa, and so on.

  The parity detection machine described here is not a device for performing arithmetic. It is, rather, a pattern recognition machine. Indeed, the parity detector is a decision machine, for it can decide whether the number of 1s in an input string has odd or even parity. Turing’s computing machine, then, is not just a calculating engine à la Babbage or Ludgate. It is a symbol processing machine. More precisely, it is a “squiggle” processing machine in that, for an input string of squiggles on the tape, it will generate an output string of squiggles. That the squiggles represent something else—that they are actually symbols—is determined by the designer of the machine. Sometimes, the strings of squiggles do not represent anything, as in the palindrome producer.

  In more mathematical terms, a Turing machine’s output will be a function of its input. It’s value is, say, F(X), where X is the input or “argument” to the function F. The state table determines how F is to be computed. Indeed, we might say that a Turing machine’s state table is that function.21 If a Turing machine can be constructed to compute F(X), then that function is said to be Turing computable.22

  V

  A particular Turing machine is (to use another present-centered term) programmed to do what it does. The state table describes this program. As already noted, each Turing machine is a special-purpose computing device in the sense that the Difference Engine was a special-purpose device; it computes a particular function F(X). It does not have the universality dreamed of by Babbage when he was designing the Analytical Engine, but Turing quickly rectified the situation. As he tells us, one can invent a single machine U that can compute any computable function. If U is provided with a tape that begins with a symbolic description of some other computing machine M, then U will perform the same computation as M.23 He called this machine U a “universal computing machine.” Later, it came to be known as a universal Turing machine.24 From a present-centered point of view, it can be described in the following way.

 

‹ Prev