Write a 750-1500 word pr´ecis1 of The unreasonable effectiveness of mathematics in the physical sciences, by Eugene Wigner (a PDF copy of the essay is available in the Resources section of the Math 109 Piazza page). Your pr´ecis must be typed, preferably in LATEX, the industry standard typesetting language in mathematics and the sciences.
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, VOL. XIII, 001-14 (1960)
The Unreasonable Effectiveness of Mat hematics in the Natural Sciences
Richard Courant Lecture i n Mathematical Sciences delivered at New York University, May 1 1 , 1959
E U G E N E P. W I G N E R Princeton University
“and it i s probable t h a t there i s some secret here w h i c h r e m a i n s t o be discovered.” ( C . S . P e i r c e )
There is a story about two friends, who were classmates in high school, talking about their jobs. One of them became a statistician and was working on population trends. He showed a reprint to his former classmate, The reprint started, as usual, with the Gaussian distribution and the statistician explained t o his former classmate the meaning of the symbols for the actual population, for the average population, and so on. His classmate was a bit incredulous and was not quite sure whether the statistician was pulling his leg. “How can you know that?” was his query. “And what is this symbol i e r e ? ” “Oh,” said the statistician, “this is n.” “What is t h a t ? ” “The ratio of the circumference of the circle t o its diameter.” “Well, now you are pushing your joke too far,” said the classmate, “surely the pop- ulation has nothing to do with the circumference of the circle.”
Naturally, we are inclined to smile about the simplicity of the classmate’s approach. Nevertheless, when I heard this story, I had to admit to an eerie feeling because, surely, the reaction of the classmate betrayed only plain common sense. I was even more confused when, not many days later, someone came t o me and expressed his bewilderment1 with the fact that we make a rather narrow selection when choosing the data on which we test our theories. “How do we know that, if we made a theory which focusses its attention on phenomena we disregard and disregards some of the phe- nomena now commanding our attention, that we could not build another theory which has little in common with the present one but which, never- theless, explains just as many phenomena as the present theory.” It has to be admitted that we have not definite evidence that there is no such theory.
The preceding two stories illustrate the two main points which are the
‘The remark to be quoted was made b y F. Werner when he was a student in Princeton.
2 E. P. WIGNER
subjects of the present discourse. The first point is that mathematical con- concepts turn up in entirely unexpected connections. Moreover, they often permit an unexpectedly close and accurate description of the phenomena in these connections. Secondly, just because of this circumstance, and because we do not understand the reasons of their usefulness, we cannot know whether a theory formulated in terms of mathematical concepts is uniquely appropriate. We are in a position similar to that of a man who was provided with a bunch of keys and who, having to open several doors in succession, always hit on the right key on the first or second trial. He became skeptical concerning the uniqueness of the coordination between keys and doors.
Most of what will be said on these questions will not be new; it has probably occurred to most scientists in one form or another. My principal aim is t o illuminate i t from several sides. The first point is that the enormous usefulness of mathematics in the natural sciences is something bordering on the mysterious and that there is no rational explanation for it. Second, it is just this uncanny usefulness of mathematical concepts that raises the ques- tion of the uniqueness of our physical theories I n order to establish the first point, that mathematics plays an unreasonably important role in physics, it will be useful to say a few words on the question “What is mathematics?”, then, “What is physics?”, then, how mathematics enters physical theories, and last, why the success of mathematics in its role in physics appears so baffling. Much less will be said on the second point: the uniqueness of the theories of physics. A proper answer to this question would require elaborate experimental and theoretical work which has not been undertaken to date.
Somebody once said that philosophy is the misuse of a terminology which was invented just for this purpose.2 In the same vein, I would say that mathematics is the science of skillful operations with concepts and rules invented just for this purpose. The principal emphasis is on the invention of concepts. Mathematics would soon run out of interesting theorems if these had to be formulated in terms of the concepts which already appear in the axioms. Furthermore, whereas it is unquestion- ably true that the concepts of elementary mathematics and particularly elementary geometry were formulated to describe entities which are directly suggested by the actual world, the same does not seem to be true of the more advanced concepts, in particular the concepts which play such an important role in physics. Thus, the rules for operations with pairs of numbers are obviously designed to give the same results as the operations with fractions which we first learned without reference to “pairs of numbers”. The rules
What is Mathematics?
lThis statement is quoted here from W. Dubislav’s Die Philosophie der Mathematik in dev Gegenwavi. Junker und Dunnhaupt Verlag, Berlin, 1932, p. 1 .
M A T H E M A T I C S A N D T H E N A T U R A L S C I E N C E S 3
for the operations with sequences, that is with irrational numbers, still belong to the category of rules which were determined so as t o reproduce rules for the operations with quantities which were already known to us. Most more advanced mathematical concepts, such as complex numbers, algebras, linear operators, Bore1 sets- and this list could be continued almost indefinitely-were so devised that they are apt subjects on which the mathematician can demonstrate his ingenuity and sense of formal beauty. I n fact, the definition of these concepts, with a realization that interesting and ingenious considerations could be applied to them, is the first demon- stration of the ingeniousness of the mathematician who defines them. The depth of thought which goes into the formation of the mathematical concepts is later justified by the skill with which these concepts are used. The great mathematician fully, almost ruthlessly, exploits the domain of permissible reasoning and skirts the impermissible. That his recklessness does not lead him into a morass of contradictions is a miracle in itself: certainly it is hard to believe that our reasoning power was brought, by Darwin’s process of natural selection, to the perfection which it seems to possess. However, this is not our present subject. The principal point which will have to be recalled later is that the mathematician could formulate only a handful of interesting theorems without defining concepts beyond those contained in the axioms and that the concepts outside those contained in the axioms are defined with a view of permitting ingenious logical opera- tions which appeal t o our aesthetic sense both as operations and also in their results of great generality and ~ i m p l i c i t y . ~
The complex numbers provide a particularly striking example for the foregoing. Certainly, nothing in our experience suggests the introduction of these quantities. Indeed, if a mathematician is asked to justify his interest in complex numbers, he will point, with some indignation, to the many beautiful theorems in the theory of equations, of power series and of analytic functions in general, which owe their origin to the introduction of complex numbers. The mathematician is not willing to give up his interest in these most beautiful accomplishments of his g e n i u ~ . ~
What is Physics? The physicist is interested in discovering the laws of inanimate nature. I n order to understand this statement, it is necessary t o analyze the concept “law of nature”.
aM. Polanyi, in his Personal Ktzowledge, University of Chicago Press, 1958 says: “All these difficulties are but consequences of our refusal to see that mathematics cannot be defined without acknowledging its most obvious feature: namely, that it is interesting,” (page 188).
‘The reader may be interested, in this connection, in Hilbert’s rather testy remarks about intuitionism which “seeks to break up and to disfigure mathematics”, Abh. Math. Sem. Univ. Hamburg, Vol. 157, 1922, or Gesammelte Werke, Springer, Berlin, 1935, page 188.
4 E. P. WIGNER
The world around us is of baffling complexity and the most obvious fact about it is that we cannot predict the future. Although the joke attributes only t o the optimist the view that the future is uncertain, the optimist is right in this case: the future is unpredictable. It is, as Schro- dinger has remarked, a miracle t h a t in spite of the baffling complexity of the world, certain regularities in the events could be discovered [l]. One such regularity, discovered by Galileo, is that two rocks, dropped a t the same time from the same height, reach the ground a t the same time. The laws of nature are concerned with such regularities. Galileo’s regularity is a prototype of a large class of regularities. It is a surprising regularity for three reasons.
The first reason that it is surprising is that it is true not only in Pisa, and in Galileo’s time, it is true everywhere on the Earth, was always true, and will always be true. This property of the regularity is a recognized invariance property and, as I had occasion t o point out some time ago , without invariance principles similar t o those implied in the preceding generalization of Galileo’s observation, physics would not be possible. The second surprising feature is that the regularity which we are discussing is independent of so many conditions which could have a n effect on it. It is valid no matter whether it rains or not, whether the experiment is carried out in a room or from the Leaning Tower, no matter whether the person who drops the rocks is a man or a woman. It is valid even if the two rocks are dropped, simultaneously and from the same height, by two different people. There are, obviously, innumerable other conditions which are all immaterial from the point of view of the validity of Galileo’s regularity. The irrelevancy of so many circumstances which could play a role in the phenomenon observed, has also been called an invariance . However, this invariance is of a different character than the preceding one since it cannot be formulated as a general principle. The exploration of the conditions which do, and which do not, influence a phenomenon is part of the early experimental exploration of a field. It is the skill and ingenuity of the experimenter which shows him phenomena which depend on a rel- atively narrow set of relatively easily realizable and reproducible con- d i t i o n ~ . ~ I n the present case, Galileo’s restriction of his observations t o relatively heavy bodies was the most important step in this regard. Again, it is true that if there were no phenomena which are independent of all but a manageably small set of conditions, physics would be impossible.
The preceding two points, though highly significant from the point of
SSee, in this connection, the graphic essay of &I. Deutsch, Daedalus, Vol. 87, 1958, page 86 A. Shimony has called my attention to a similar passage in C S. Peirce’s Essays zn the Phzlosophy of Sczence, The Liberal Arts Press, New York, 1957 (page 237).
MATHEMATICS A N D THE NATURAL SCIENCES 5
view of the philosopher, are not the ones which surprised GaIileo most, nor do they contain a specific law of nature. The law of nature is contained in the statement that the length of time which it takes for a heavy object to fall from a given height is independent of the size, material and shape of the body which drops. I n the framework of Newton’s second “law”, this amounts to the statement that the gravitational force which acts on the falling body is proportional to its mass but independent of the size, material and shape of the body which falls.
The preceding discussion is intended t o remind, first, that it is not a t all natural that “laws of nature” exist, much less that man is able to discover them.6 The present writer had occasion, some time ago, to call attention to the succession of layers of “laws of nature”, each layer con- taining more general and more encompassing laws than the previous one and its discovery constituting a deeper penetration into the structure of the universe than the layers recognized before . However, the point which is most significant in the present context is that all these laws of nature contain, in even their remotest consequences, only a small part of our knowledge of the inanimate world. All the laws of nature are conditional statements which permit a prediction of some future events on the basis of the knowledge of the present, except that some aspects of the present state of the world, in practice the overwhelming majority of the determinants of the present state of the world, are irrelevant from the point of view of the prediction. The irrelevancy is meant in the sense of the second point in the discussion of Galileo’s theorem.’
As regards the present state of the world, such as the existence of the earth on which we live and on which Galileo’s experiments were performed, the existence of the sun and of all our surroundings, the laws of nature are entirely silent. It is in consonance with this, first, that the laws of nature can be used to predict future events only under exceptional circumstances -when all the relevant determinants of the present state of the world are known. It is also in consonance with this that the construction of machines, the functioning of which he can foresee, constitutes the most spectacular accomplishment of the physicist. In these machines, the physicist creates a situation in which all the relevant coordinates are known so that the be- havior of the machine can be predicted. Radars and nuclear reactors are examples of such machines.
OE. Schrodinger, in his What is Lije, Cambridge University Press, 1915, says that this second miracle may well be beyond human understanding, (page 31).
‘The writer feels sure that i t is unnecessary to mention that Galileo’s theorem, as given in the text, does not exhaust the content of Galileo’s observations in connection with the laws of freely falling bodies.
6 E. P. WIGNER
The principal purpose of the preceding discussion is to point out that the laws of nature are all conditional statements and they relate only to a very small part of our knowledge of the world. Thus, classical mechanics, which is the best known prototype of a physical theory, gives the second derivatives of the positional coordinates of all bodies, on the basis of the knowledge of the positions, etc., of these bodies. It gives no information on the existence, the present positions, or velocities of these bodies. It should be mentioned, for the sake of accuracy, that we have learned about thirty years ago that even the conditional statements cannot be entirely precise: that the conditional statements are probability laws which enable us only to place intelligent bets on future properties of the inanimate world, based on the knowledge of the present state. They do not allow us to make categorical statements, not even categorical statements conditional on the present state of the world. The probabilistic nature of the “laws of nature” manifests itself in the case of machines also, and can be verified, at least in the case of nuclear reactors, if one runs them at very low power. However, the additional limitation of the scope of the laws of natures which follows from their probabilistic nature, will play no role in the rest of the discussion.
The Role of Mathematics i n Physical Theories. Having refreshed our minds as t o the essence of mathematics and physics, we should be in a better position to review the role of mathematics in physical theories.
Naturally, we do use mathematics in everyday physics to evaluate the results of the laws of nature, to apply the conditional statements t o the particular conditions which happen to prevail or happen to interest us. I n order that this be possible, the laws of nature must already be formulated in mathematical language. However, the role of evaluating the conse- quences of already established theories is not the most important role of math- ematics in physics. Mathematics, or, rather, applied mathematics, is not so much the master of the situation in this function: it is merely serving as a tool.
Mathematics does play, however, also a more sovereign role in physics. This was already implied in the statement, made when discussing the role of applied mathematics, that the laws of nature must be already formulated in the language of mathematics to be an object for the use of applied math- ematics. The statement that the laws of nature are written in the language of mathematics was properly made three hundred years ago9; it is now more true than ever before. In order to show the importance which mathematical concepts possess in the formulation of the laws of physics, let us recall, as an example, the axioms of quantum mechanics as formulated, explicitly,
%ee, for instance, E. Schrodinger, reference [l]. OIt is attributed to Gnlileo.
MATHEMATICS A N D THE NATURAL SCIENCES 7
by the great mathematician, von Neumann, or, implicitly, by the great physicist, Dirac [4, 51. There are two basic concepts in quantum mechanics: states and observables. The states are vectors in Hilbert space, the observ- ables self-adjoint operators on these vectors. The possible values of the observations are the characteristic values of the operators-but we had better stop here lest we engage in a listing of the mathematical concepts developed in the theory of linear operators.
It is true, of course, that physics chooses certain mathematical concepts for the formulation of the laws of nature, and surely only a fraction of all mathematical concepts is used in physics. It is true also that the concepts which were chosen were not selected arbitrarily from a listing of mathematical terms but were developed, in many if not most cases, independently by the physicist and recognized then as having been conceived before by the mathematician. It is not true, however, as is so often stated, that this had t o happen because mathematics uses the simplest possible concepts and these were bound to occur in any formalism. As we saw before, the concepts of mathematics are not chosen for their conceptual simplicity-even se- quences of pairs of numbers are far from being the simplest concepts-but for their amenability to clever manipulations and to striking, brilliant arguments. Let us not forget that the Hilbert space of quantum mechanics is the complex Hilbert space, with a Hermitean scalar product. Surely t o the unpreoccupied mind, complex numbers are far from natural or simple and they cannot be suggested by physical observations. Furthermore, the use of complex numbers is in this case not a calculational trick of applied mathematics but comes close to being a necessity in the formulation of the laws of quantum mechanics. Finally, it now begins to appear that not only numbers but so-called analytic functions are destined to play a decisive role in the formulation of quantum theory. I am referring t o the rapidly developing theory of dispersion relations.
I t is difficult to avoid the impression that a miracle confronts us here, quite comparable in its striking nature to the miracle that the human mind can string a thousand arguments together without getting itself into contra- dictions or to the two miracles of the existence of laws of nature and of the human mind’s capacity t o divine them. The observation which comes closest t o an explanation for the mathematical concepts’ cropping up in physics which I know is Einstein’s statement that the only physical theories which we are willing to accept are the beautiful ones. It stands to argue that the concepts of mathematics, which invite the exercise of so much wit, have the quality of beauty. However, Einstein’s observation can a t best explain prop- erties of theories which we are willing to believe and has no reference to the in- trinsic accuracy of the theory. We shall, therefore, turn to thislatter question.
8 E . P. WIGNER
I s the Success of Physical Theories T r u l y SzGrPrising? A possible ex- planation of the physicist’s use of mathematics to formulate his laws of nature is that he is a somewhat irresponsible person. As a result, when he finds a connection between two quantities which resembles a connection well-known from mathematics, he will jump a t the conclusion that the connection is that discussed in mathematics simply because he does not know of any other similar connection. It is not the intention of the present dis- cussion to refute the charge that the physicist is a somewhat irresponsible person. Perhaps he is. However, it is important to point out that the math- ematical formulation of the physicist’s often crude experience leads in a n uncanny number of cases to an amazingly accurate description of a large class of phenomena. This shows that the mathematical language has more to commend it than being the only language which we can speak; it shows that it is, in a very real sense, the correct language. Let us consider a few examples.
The first example is the oft quoted one of planetary motion. The laws of falling bodies became rather well established as a result of experiments carried out principally in Italy. These experiments could not be very accurate in the sense in which we understand accuracy today partly because of the effect of air resistance and partly because of the impossibility, a t that time, to measure short time intervals. Nevertheless, it is not surprising that as a result of their studies, the Italian natural scientists acquired a familiarity with the ways in which objects travel through the atmosphere. It was Newton who then brought the law of freely falling objects into relation with the motion of the moon, noted t h a t the parabola of the thrown rock’s path on the earth, and the circle of the moon’s path in the sky, are particular cases of the same mathematical object of an ellipse and postulated the universal law of gravitation, on the basis of a single, and a t that time very approximate, numerical coincidence. Philosophically, the law of gravitation as formulated by Newton was repugnant to his time and t o himself. Empirically, i t was based on very scanty observations. The mathematical language in which it was formulated contained the con- cept of a second derivative and those of us who have tried to draw an osculating circle to a curve know t h a t the second derivative is not a very immediate concept. The law of gravity which Newton reluctantly estab- lished and which he could verify with an accuracy of about 4 % has proved to be accurate t o less than a ten thousandth of a per cent and became so closely associated with the idea of absolute accuracy that only recently did physicists become again bold enough to inquire into the limitations of its accuracy.1° Certainly, the example of Newton’s law, quoted over
%ee, for instance, R. H. Dicke, American Scientist, Vol. 25, 1959.
M A T H E M A T I C S A N D THE N A T U R A L S C I E N C E S 9
and over again, must be mentioned first as a monumental example of a law, formulated in terms which appear simple to the mathematician, which has proved accurate beyond all reasonable expectation. Let us just re- capitulate our thesis on this example: first, the law, particularly since a second derivative appears in it, is simple only to the mathematician, not to common sense or to non-mathematically-minded freshmen ; second, it is a conditional law of very limited scope. I t explains nothing about the earth which attracts Galileo’s rocks, or about the circular form of the moon’s orbit, or about the planets of the sun. The explanation of these initial conditions is left to the geologist and the astronomer, and they have a hard time with them.
The second example is that of ordinary, elementary quantum mechanics. This originated when Max Born noticed that some rules of computation, given by Heisenberg, were formally identical with the rules of computation with matrices, established a Iong time before by mathematicians. Born, Jordan and Heisenberg then proposed to replace by matrices the position and momentum variables of the equations of classical mechanics [ S ] . They applied the rules of matrix mechanics to a few highly idealized problems and the results were quite satisfactory. However, there was, at that time, no rational evidence that their matrix mechanics would prove correct under more realistic conditions. Indeed, they say “if the mechanics as here propos- ed should already be correct in its essential traits”. As a matter of fact, the first application of their mechanics to a realistic problem, that of the hydro- gen atom, was given several months later, by Pauli. This application gave results in agreement with experience. This was satisfactory but still under- standable because Heisenberg’s rules of calculation were abstracted from problems which included the old theory of the hydrogen atom. The miracle occurred only when matrix mechanics, or a mathematically equivalent theory, was applied to problems for which Heisenberg’s calculating rules were meaningless. Heisenberg’s rules presupposed that the classical equa- tions of motion had solutions with certain periodicity properties; and the equations of motion of the two electrons of the helium atom, or of the even greater number of electrons of heavier atoms, simply do not have these properties, so that Heisenberg’s rules cannot be applied t o these cases. Nevertheless, the calculation of the lowest energy level of helium, as carried out a few months ago by Kinoshita a t Cornell and by Bazley a t the Bureau of Standards, agree with the experimental data within the accuracy of the observations, which is one part in ten millions. Surely in this case we “got something out” of the equations that we did not put in.
The same is true of t h e qualitative characteristics of the “complex spectra”, that is the spectra of heavier atoms. I wish to recall a conver-
10 E. P. WIGNER
sation with Jordan who told me, when the qualitative features of the spectra were derived, that a disagreement of the rules derived from quantum mechanical theory, and the rules established by empirical research, would have provided the last opportunity to make a change in the framework of matrix mechanics. I n other words, Jordan felt that we would have been, at least temporarily, helpless had an unexpected disagreement occurred in the theory of the helium atom. This was, at that time, developed by Kellner and by Hilleraas. The mathematical formalism was too clear and unchangeable so that, had the miracle of helium which was mentioned before not occurred, a true crisis would have arisen. Surely, physics would have overcome that crisis in one way or another. It is true, on the other hand, that physics as we know it today would not be possible without a constant recurrence of miracles similar to the one of the helium atom which is perhaps the most striking miracle that has occurred in the course of the development of elementary quantum mechanics, but by far not the only one. In fact, the number of analogous miracles is limited, in our view, only by our willingness t o go after more similar ones. Quantum mechanics had, never- theless, many almost equally striking successes which gave us the firm con- viction that it is, what we call, correct.
The last example is that of quantum electrodynamics, or the theory of the Lamb shift. Whereas Newton’s theory of gravitation still had obvious connections with experience, experience entered the formulation of matrix mechanics only in the refined or sublimated form of Heisenberg’s prescrip- tions. The quantum theory of the Lamb shift, as conceived by Bethe and established by Schwinger, is a purely mathematical theory aiid the only direct contribution of experiment was to show the existence of a measurable effect. The agreement with calculation is better than one part in a thousand.
The preceding three examples, which could be multiplied almost in- definitely, should illustrate the appropriateness and accuracy of the math- ematical formulation of the laws of nature in terms of concepts chosen for their manipulability, the “laws of nature” being of almost fantastic accuracy but of strictly limited scope. I propose t o refer to the observation which these examples illustrate as the empirical law of …