A Brief History Of Time (By Stephen Hawking)

ENTRY

The lack of an absolute standard of rest meant that one could not determine whether two events that took place at different times occurred in the same position in space. For example, suppose our Ping-Pong ball on the train bounces straight up and down, hitting the table twice on the same spot one second apart. To someone on the track, the two bounces would seem to take place about forty meters apart, because the train would have traveled that far down the track between the bounces. The nonexistence of absolute rest therefore meant that one could not give an event an absolute position in space, as Aristotle had believed. The positions of events and the distances between them would be different for a person on the train and one on the track, and there would be no reason to prefer one person’s position to the other’s.

Maxwell’s theory predicted that radio or light waves should travel at a certain fixed speed. But Newton’s theory had got rid of the idea of absolute rest, so if light was supposed to travel at a fixed speed, one would have to say what that fixed speed was to be measured relative to.

It was therefore suggested that there was a substance called the “ether” that was present everywhere, even in “empty” space.

As an object approaches the speed of light, its mass rises ever more quickly, so it takes more and more energy to speed it up further. It can in fact never reach the speed of light, because by then its mass would have become infinite, and by the equivalence of mass and energy, it would have taken an infinite amount of energy to get it there. For this reason, any normal object is forever confined by relativity to move at speeds slower than the speed of light. Only light, or other waves that have no intrinsic mass, can move at the speed of light.

An equally remarkable consequence of relativity is the way it has revolutionized our ideas of space and time. In Newton’s theory, if a pulse of light is sent from one place to another, different observers would agree on the time that the journey took (since time is absolute), but will not always agree on how far the light traveled (since space is not absolute). Since the speed of the light is just the distance it has traveled divided by the time it has taken, different observers would measure different speeds for the light. In relativity, on the other hand, all observers must agree on how fast light travels. They still, however, do not agree on the distance the light has traveled, so they must therefore now also disagree over the time it has taken. (The time taken is the distance the light has traveled – which the observers do not agree on – divided by the light’s speed – which they do agree on.) In other words, the theory of relativity put an end to the idea of absolute time!
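
To make the argument concrete (a one-line restatement, not from the text): each observer computes the speed of the light pulse as distance over time,

$$c = \frac{d}{t},$$

so if two observers measure different distances $d_1 \neq d_2$ for the same journey but must obtain the same speed $c$, they are forced to measure different times $t_1 = d_1/c \neq d_2/c = t_2$.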

Our modern picture of the universe dates back to only 1924, when the American astronomer Edwin Hubble demonstrated that ours was not the only galaxy. There were in fact many others, with vast tracts of empty space between them. In order to prove this, he needed to determine the distances to these other galaxies, which are so far away that, unlike nearby stars, they really do appear fixed. Hubble was forced, therefore, to use indirect methods to measure the distances. Now, the apparent brightness of a star depends on two factors: how much light it radiates (its luminosity), and how far it is from us. For nearby stars, we can measure their apparent brightness and their distance, and so we can work out their luminosity. Conversely, if we knew the luminosity of stars in other galaxies, we could work out their distance by measuring their apparent brightness. Hubble noted that certain types of stars always have the same luminosity when they are near enough for us to measure; therefore, he argued, if we found such stars in another galaxy, we could assume that they had the same luminosity – and so calculate the distance to that galaxy. If we could do this for a number of stars in the same galaxy, and our calculations always gave the same distance, we could be fairly confident of our estimate.
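
The reasoning here rests on the inverse-square law for light (a standard relation, stated here for clarity rather than taken from the text): the apparent brightness $b$ of a star of luminosity $L$ at distance $d$ is

$$b = \frac{L}{4\pi d^{2}}, \qquad\text{so}\qquad d = \sqrt{\frac{L}{4\pi b}}.$$

Once the luminosity of a recognizable type of star is known from nearby examples, measuring its apparent brightness in another galaxy gives the distance to that galaxy.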

We live in a galaxy that is about one hundred thousand light-years across and is slowly rotating; the stars in its spiral arms orbit around its center about once every several hundred million years. Our sun is just an ordinary, average-sized, yellow star, near the inner edge of one of the spiral arms. We have certainly come a long way since Aristotle and Ptolemy, when they thought that the earth was the center of the universe!

Stars are so far away that they appear to us to be just pinpoints of light. We cannot see their size or shape. So how can we tell different types of stars apart? For the vast majority of stars, there is only one characteristic feature that we can observe – the color of their light. Newton discovered that if light from the sun passes through a triangular-shaped piece of glass, called a prism, it breaks up into its component colors (its spectrum) as in a rainbow. By focusing a telescope on an individual star or galaxy, one can similarly observe the spectrum of the light from that star or galaxy. Different stars have different spectra, but the relative brightness of the different colors is always exactly what one would expect to find in the light emitted by an object that is glowing red hot. (In fact, the light emitted by any opaque object that is glowing red hot has a characteristic spectrum that depends only on its temperature – a thermal spectrum. This means that we can tell a star’s temperature from the spectrum of its light.) Moreover, we find that certain very specific colors are missing from stars’ spectra, and these missing colors may vary from star to star. Since we know that each chemical element absorbs a characteristic set of very specific colors, by matching these to those that are missing from a star’s spectrum, we can determine exactly which elements are present in the star’s atmosphere.

In the 1920s, when astronomers began to look at the spectra of stars in other galaxies, they found something most peculiar: there were the same characteristic sets of missing colors as for stars in our own galaxy, but they were all shifted by the same relative amount toward the red end of the spectrum. To understand the implications of this, we must first understand the Doppler effect. As we have seen, visible light consists of fluctuations, or waves, in the electromagnetic field. In the years following his proof of the existence of other galaxies, Hubble spent his time cataloging their distances and observing their spectra. At that time most people expected the galaxies to be moving around quite randomly, and so expected to find as many blue-shifted spectra as red-shifted ones. It was quite a surprise, therefore, to find that most galaxies appear red-shifted: nearly all were moving away from us! More surprising still was the finding that Hubble published in 1929: even the size of a galaxy’s redshift is not random, but is directly proportional to the galaxy’s distance from us. Or, in other words, the farther a galaxy is, the faster it is moving away! And that meant that the universe could not be static, as everyone previously had thought, but is in fact expanding; the distance between the different galaxies is changing all the time.
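
Hubble’s finding is usually written as a simple proportionality, now called Hubble’s law (the modern notation, not used in the text):

$$v = H_0\, d,$$

where $v$ is the speed at which a galaxy recedes (inferred from its redshift), $d$ is its distance from us, and $H_0$ is the constant of proportionality known as the Hubble constant.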

Even Einstein, when he formulated the general theory of relativity in 1915, was so sure that the universe had to be static that he modified his theory to make this possible, introducing a so-called cosmological constant into his equations. Einstein introduced a new “antigravity” force, which, unlike other forces, did not come from any particular source but was built into the very fabric of space-time. He claimed that space-time had an inbuilt tendency to expand, and this could be made to balance exactly the attraction of all the matter in the universe, so that a static universe would result. Only one man, it seems, was willing to take general relativity at face value, and while Einstein and other physicists were looking for ways of avoiding general relativity’s prediction of a non-static universe, the Russian physicist and mathematician Alexander Friedmann instead set about explaining it.

Friedmann made two very simple assumptions about the universe: that the universe looks identical in whichever direction we look, and that this would also be true if we were observing the universe from anywhere else. From these two ideas alone, Friedmann showed that we should not expect the universe to be static. In 1922, several years before Edwin Hubble’s discovery, Friedmann predicted exactly what Hubble found!

In 1965 two American physicists at the Bell Telephone Laboratories in New Jersey, Arno Penzias and Robert Wilson, were testing a very sensitive microwave detector. (Microwaves are just like light waves, but with a wavelength of around a centimeter.) Penzias and Wilson were worried when they found that their detector was picking up more noise than it ought to. The noise did not appear to be coming from any particular direction. First they discovered bird droppings in their detector and checked for other possible malfunctions, but soon ruled these out. They knew that any noise from within the atmosphere would be stronger when the detector was not pointing straight up than when it was, because light rays travel through much more atmosphere when received from near the horizon than when received from directly overhead. The extra noise was the same whichever direction the detector was pointed, so it must come from outside the atmosphere. It was also the same day and night and throughout the year, even though the earth was rotating on its axis and orbiting around the sun. This showed that the radiation must come from beyond the Solar System, and even from beyond the galaxy, as otherwise it would vary as the movement of the earth pointed the detector in different directions.

Penzias and Wilson had unwittingly stumbled across a remarkably accurate confirmation of Friedmann’s first assumption. However, because the universe is not exactly the same in every direction, but only on average on a large scale, the microwaves cannot be exactly the same in every direction either. There have to be slight variations between different directions. These were first detected in 1992 by the Cosmic Background Explorer satellite, or COBE, at a level of about one part in a hundred thousand.

At roughly the same time as Penzias and Wilson were investigating noise in their detector, two American physicists at nearby Princeton University, Bob Dicke and Jim Peebles, were also taking an interest in microwaves. They were working on a suggestion, made by George Gamow (once a student of Alexander Friedmann), that the early universe should have been very hot and dense, glowing white hot. Dicke and Peebles argued that we should still be able to see the glow of the early universe, because light from very distant parts of it would only just be reaching us now. However, the expansion of the universe meant that this light should be so greatly red-shifted that it would appear to us now as microwave radiation. Dicke and Peebles were preparing to look for this radiation when Penzias and Wilson heard about their work and realized that they had already found it. For this, Penzias and Wilson were awarded the Nobel Prize in 1978 (which seems a bit hard on Dicke and Peebles, not to mention Gamow!).

But which Friedmann model describes our universe? Will the universe eventually stop expanding and start contracting, or will it expand forever? To answer this question we need to know the present rate of expansion of the universe and its present average density. If the density is less than a certain critical value, determined by the rate of expansion, the gravitational attraction will be too weak to halt the expansion. If the density is greater than the critical value, gravity will stop the expansion at some time in the future and cause the universe to recollapse.
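
The critical value referred to here can be written down explicitly from the Friedmann equations (a standard result, quoted for reference rather than derived in the text):

$$\rho_{\mathrm{crit}} = \frac{3H^{2}}{8\pi G},$$

where $H$ is the present rate of expansion and $G$ is Newton’s gravitational constant. A universe denser than this will eventually recollapse; a less dense one will expand forever.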

We can determine the present rate of expansion by measuring the velocities at which other galaxies are moving away from us, using the Doppler effect. This can be done very accurately. However, the distances to the galaxies are not very well known because we can only measure them indirectly. So all we know is that the universe is expanding by between 5 percent and 10 percent every thousand million years. However, our uncertainty about the present average density of the universe is even greater. If we add up the masses of all the stars that we can see in our galaxy and other galaxies, the total is less than one hundredth of the amount required to halt the expansion of the universe, even for the lowest estimate of the rate of expansion. Our galaxy and other galaxies, however, must contain a large amount of “dark matter” that we cannot see directly, but which we know must be there because of the influence of its gravitational attraction on the orbits of stars in the galaxies. Moreover, most galaxies are found in clusters, and we can similarly infer the presence of yet more dark matter in between the galaxies in these clusters by its effect on the motion of the galaxies. When we add up all this dark matter, we still get only about one tenth of the amount required to halt the expansion. However, we cannot exclude the possibility that there might be some other form of matter, distributed almost uniformly throughout the universe, that we have not yet detected and that might still raise the average density of the universe up to the critical value needed to halt the expansion. The present evidence therefore suggests that the universe will probably expand forever, but all we can really be sure of is that even if the universe is going to recollapse, it won’t do so for at least another ten thousand million years, since it has already been expanding for at least that long. This should not unduly worry us.
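
As a rough check on the quoted range (an illustrative calculation, not from the text), an expansion of 5 to 10 percent every thousand million years corresponds to a Hubble constant of very roughly 50 to 100 kilometers per second per megaparsec:

```python
# Convert "5-10 percent expansion per thousand million years" into the
# conventional units for the Hubble constant (illustrative estimate only).

YEAR_S = 3.156e7    # seconds in one year
MPC_KM = 3.086e19   # kilometers in one megaparsec

for fraction in (0.05, 0.10):
    H_per_year = fraction / 1e9            # fractional expansion per year
    H_per_second = H_per_year / YEAR_S     # fractional expansion per second
    H_km_s_mpc = H_per_second * MPC_KM     # km/s per megaparsec
    print(f"{fraction:.0%} per 10^9 yr  ->  H ~ {H_km_s_mpc:.0f} km/s/Mpc")
```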

The work of Lifshitz and Khalatnikov was valuable because it showed that the universe could have had a singularity, a big bang, if the general theory of relativity was correct. However, it did not resolve the crucial question: Does general relativity predict that our universe should have had a big bang, a beginning of time?

The answer to this came from a completely different approach introduced by a British mathematician and physicist, Roger Penrose, in 1965. Using the way light cones behave in general relativity, together with the fact that gravity is always attractive, he showed that a star collapsing under its own gravity is trapped in a region whose surface eventually shrinks to zero size. And, since the surface of the region shrinks to zero, so too must its volume. All the matter in the star will be compressed into a region of zero volume, so the density of matter and the curvature of space-time become infinite. In other words, one has a singularity contained within a region of space-time known as a black hole.

In 1965 I read about Penrose’s theorem that any body undergoing gravitational collapse must eventually form a singularity. I soon realized that if one reversed the direction of time in Penrose’s theorem, so that the collapse became an expansion, the conditions of his theorem would still hold, provided the universe were roughly like a Friedmann model on large scales at the present time. Penrose’s theorem had shown that any collapsing star must end in a singularity; the time-reversed argument showed that any Friedmann-like expanding universe must have begun with a singularity. For technical reasons, Penrose’s theorem required that the universe be infinite in space. So I could, in fact, use it to prove that there should be a singularity only if the universe was expanding fast enough to avoid collapsing again (since only those Friedmann models were infinite in space).

During the next few years I developed new mathematical techniques to remove this and other technical conditions from the theorems that proved that singularities must occur. The final result was a joint paper by Penrose and myself in 1970, which at last proved that there must have been a big bang singularity provided only that general relativity is correct and the universe contains as much matter as we observe.

We have seen in this chapter how, in less than half a century, man’s view of the universe formed over millennia has been transformed. Hubble’s discovery that the universe was expanding, and the realization of the insignificance of our own planet in the vastness of the universe, were just the starting point. As experimental and theoretical evidence mounted, it became more and more clear that the universe must have had a beginning in time, until in 1970 this was finally proved by Penrose and myself, on the basis of Einstein’s general theory of relativity. That proof showed that general relativity is only an incomplete theory: it cannot tell us how the universe started off, because it predicts that all physical theories, including itself, break down at the beginning of the universe. However, general relativity claims to be only a partial theory, so what the singularity theorems really show is that there must have been a time in the very early universe when the universe was so small that one could no longer ignore the small-scale effects of the other great partial theory of the twentieth century, quantum mechanics. At the start of the 1970s, then, we were forced to turn our search for an understanding of the universe from our theory of the extraordinarily vast to our theory of the extraordinarily tiny. That theory, quantum mechanics, will be described next, before we turn to the efforts to combine the two partial theories into a single quantum theory of gravity.

The doctrine of scientific determinism was strongly resisted by many people, who felt that it infringed God’s freedom to intervene in the world, but it remained the standard assumption of science until the early years of this century. One of the first indications that this belief would have to be abandoned came when calculations by the British scientists Lord Rayleigh and Sir James Jeans suggested that a hot object, or body, such as a star, must radiate energy at an infinite rate. According to the laws we believed at the time, a hot body ought to give off electromagnetic waves (such as radio waves, visible light, or X rays) equally at all frequencies. For example, a hot body should radiate the same amount of energy in waves with frequencies between one and two million million waves a second as in waves with frequencies between two and three million million waves a second. Now since the number of waves a second is unlimited, this would mean that the total energy radiated would be infinite. In order to avoid this obviously ridiculous result, the German scientist Max Planck suggested in 1900 that light, X rays, and other waves could not be emitted at an arbitrary rate, but only in certain packets that he called quanta. Moreover, each quantum had a certain amount of energy that was greater the higher the frequency of the waves, so at a high enough frequency the emission of a single quantum would require more energy than was available. Thus the radiation at high frequencies would be reduced, and so the rate at which the body lost energy would be finite.
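
Planck’s hypothesis can be summarized in a single relation (standard notation, not given in the text): the energy of one quantum is proportional to the frequency of the wave,

$$E = h\nu,$$

where $h$ is Planck’s constant. At high enough frequencies a single quantum already costs more energy than the hot body has available, so the runaway emission predicted by Rayleigh and Jeans is cut off.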

The quantum hypothesis explained the observed rate of emission of radiation from hot bodies very well, but its implications for determinism were not realized until 1926, when another German scientist, Werner Heisenberg, formulated his famous uncertainty principle. In order to predict the future position and velocity of a particle, one has to be able to measure its present position and velocity accurately. The obvious way to do this is to shine light on the particle. Some of the waves of light will be scattered by the particle and this will indicate its position. However, one will not be able to determine the position of the particle more accurately than the distance between the wave crests of light, so one needs to use light of a short wavelength in order to measure the position of the particle precisely. Now, by Planck’s quantum hypothesis, one cannot use an arbitrarily small amount of light; one has to use at least one quantum. This quantum will disturb the particle and change its velocity in a way that cannot be predicted. Moreover, the more accurately one measures the position, the shorter the wavelength of the light that one needs and hence the higher the energy of a single quantum. So the velocity of the particle will be disturbed by a larger amount. In other words, the more accurately you try to measure the position of the particle, the less accurately you can measure its speed, and vice versa. Heisenberg showed that the uncertainty in the position of the particle times the uncertainty in its velocity times the mass of the particle can never be smaller than a certain quantity, which is known as Planck’s constant. Moreover, this limit does not depend on the way in which one tries to measure the position or velocity of the particle, or on the type of particle: Heisenberg’s uncertainty principle is a fundamental, inescapable property of the world.
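
In modern notation (not used in the text) the principle is usually written

$$\Delta x \,\Delta p \;\geq\; \frac{\hbar}{2}, \qquad \Delta p = m\,\Delta v,$$

where $\Delta x$ and $\Delta v$ are the uncertainties in position and velocity, $m$ is the particle’s mass, and $\hbar$ is Planck’s constant divided by $2\pi$. However the measurement is arranged, the product of the two uncertainties cannot be pushed below this bound.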

This approach led Heisenberg, Erwin Schrodinger, and Paul Dirac in the 1920s to reformulate mechanics into a new theory called quantum mechanics, based on the uncertainty principle. In this theory particles no longer had separate, well-defined positions and velocities that could not be observed. Instead, they had a quantum state, which was a combination of position and velocity.

Consider a partition with two narrow parallel slits in it. On one side of the partition one places a source of light of a particular color (that is, of a particular wavelength). Most of the light will hit the partition, but a small amount will go through the slits. Now suppose one places a screen on the far side of the partition from the light. Any point on the screen will receive waves from the two slits. However, in general, the distance the light has to travel from the source to the screen via the two slits will be different. This will mean that the waves from the slits will not be in phase with each other when they arrive at the screen: in some places the waves will cancel each other out, and in others they will reinforce each other. The result is a characteristic pattern of light and dark fringes.
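
The positions of the fringes follow from the path difference between the two slits (a standard result, added here for clarity): for slits a distance $a$ apart and light of wavelength $\lambda$, the waves reinforce where the path difference is a whole number of wavelengths and cancel where it is an odd number of half-wavelengths,

$$a\sin\theta = n\lambda \ \ \text{(bright fringe)}, \qquad a\sin\theta = \left(n+\tfrac{1}{2}\right)\lambda \ \ \text{(dark fringe)},$$

where $\theta$ is the angle from the slits to the point on the screen and $n$ is an integer.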

The remarkable thing is that one gets exactly the same kind of fringes if one replaces the source of light by a source of particles such as electrons with a definite speed (this means that the corresponding waves have a definite length). It seems more peculiar because if one only has one slit, one does not get any fringes, just a uniform distribution of electrons across the screen. One might therefore think that opening another slit would just increase the number of electrons hitting each point of the screen, but, because of interference, it actually decreases it in some places. If electrons are sent through the slits one at a time, one would expect each to pass through one slit or the other, and so behave just as if the slit it passed through were the only one there – giving a uniform distribution on the screen. In reality, however, even when the electrons are sent one at a time, the fringes still appear. Each electron, therefore, must be passing through both slits at the same time!

Einstein’s general theory of relativity seems to govern the large-scale structure of the universe. It is what is called a classical theory; that is, it does not take account of the uncertainty principle of quantum mechanics, as it should for consistency with other theories. The reason that this does not lead to any discrepancy with observation is that all the gravitational fields that we normally experience are very weak. However, the singularity theorems discussed earlier indicate that the gravitational field should get very strong in at least two situations, black holes and the big bang. In such strong fields the effects of quantum mechanics should be important. Thus, in a sense, classical general relativity, by predicting points of infinite density, predicts its own downfall, just as classical (that is, nonquantum) mechanics predicted its downfall by suggesting that atoms should collapse to infinite density. We do not yet have a complete consistent theory that unifies general relativity and quantum mechanics, but we do know a number of the features it should have. The consequences that these would have for black holes and the big bang will be described in later chapters. For the moment, however, we shall turn to the recent attempts to bring together our understanding of the other forces of nature into a single, unified quantum theory.

All the known particles in the universe can be divided into two groups: particles of spin ½, which make up the matter in the universe, and particles of spin 0, 1, and 2, which, as we shall see, give rise to forces between the matter particles.

The matter particles obey what is called Pauli’s exclusion principle. This was discovered in 1925 by an Austrian physicist, Wolfgang Pauli – for which he received the Nobel Prize in 1945. He was the archetypal theoretical physicist: it was said of him that even his presence in the same town would make experiments go wrong! Pauli’s exclusion principle says that two similar particles cannot exist in the same state; that is, they cannot have both the same position and the same velocity, within the limits given by the uncertainty principle. The exclusion principle is crucial because it explains why matter particles do not collapse to a state of very high density under the influence of the forces produced by the particles of spin 0, 1, and 2: if the matter particles have very nearly the same positions, they must have different velocities, which means that they will not stay in the same position for long. If the world had been created without the exclusion principle, quarks would not form separate, well-defined protons and neutrons. Nor would these, together with electrons, form separate, well-defined atoms. They would all collapse to form a roughly uniform, dense “soup.”

A proper understanding of the electron and other spin-½ particles did not come until 1928, when a theory was proposed by Paul Dirac, who later was elected to the Lucasian Professorship of Mathematics at Cambridge (the same professorship that Newton had once held and that I now hold). Dirac’s theory was the first of its kind that was consistent with both quantum mechanics and the special theory of relativity. It explained mathematically why the electron had spin-½; that is, why it didn’t look the same if you turned it through only one complete revolution, but did if you turned it through two revolutions. It also predicted that the electron should have a partner: an anti-electron, or positron. The discovery of the positron in 1932 confirmed Dirac’s theory and led to his being awarded the Nobel Prize for physics in 1933. We now know that every particle has an antiparticle, with which it can annihilate. (In the case of the force-carrying particles, the antiparticles are the same as the particles themselves.) There could be whole antiworlds and antipeople made out of antiparticles. However, if you meet your antiself, don’t shake hands! You would both vanish in a great flash of light.

In quantum mechanics, the forces or interactions between matter particles are all supposed to be carried by particles of integer spin – 0, 1, or 2. What happens is that a matter particle, such as an electron or a quark, emits a force-carrying particle. The recoil from this emission changes the velocity of the matter particle. The force-carrying particle then collides with another matter particle and is absorbed. This collision changes the velocity of the second particle, just as if there had been a force between the two matter particles. It is an important property of the force-carrying particles that they do not obey the exclusion principle. This means that there is no limit to the number that can be exchanged, and so they can give rise to a strong force. However, if the force-carrying particles have a high mass, it will be difficult to produce and exchange them over a large distance. So the forces that they carry will have only a short range. On the other hand, if the force-carrying particles have no mass of their own, the forces will be long range. The force-carrying particles exchanged between matter particles are said to be virtual particles because, unlike “real” particles, they cannot be directly detected by a particle detector. We know they exist, however, because they do have a measurable effect: they give rise to forces between matter particles. Particles of spin 0, 1, or 2 do also exist in some circumstances as real particles, when they can be directly detected. They then appear to us as what a classical physicist would call waves, such as waves of light or gravitational waves. They may sometimes be emitted when matter particles interact with each other by exchanging virtual force-carrying particles. (For example, the electric repulsive force between two electrons is due to the exchange of virtual photons, which can never be directly detected; but if one electron moves past another, real photons may be given off, which we detect as light waves.)

FORCES

Force-carrying particles can be grouped into four categories according to the strength of the force that they carry and the particles with which they interact. It should be emphasized that this division into four classes is man-made; it is convenient for the construction of partial theories, but it may not correspond to anything deeper. Ultimately, most physicists hope to find a unified theory that will explain all four forces as different aspects of a single force. Indeed, many would say this is the prime goal of physics today. Recently, successful attempts have been made to unify three of the four categories of force – and I shall describe these in this chapter. The question of the unification of the remaining category, gravity, we shall leave till later.

The first category is the gravitational force. This force is universal, that is, every particle feels the force of gravity, according to its mass or energy. Gravity is the weakest of the four forces by a long way; it is so weak that we would not notice it at all were it not for two special properties that it has: it can act over large distances, and it is always attractive.

This means that the very weak gravitational forces between the individual particles in two large bodies, such as the earth and the sun, can all add up to produce a significant force. The other three forces are either short range, or are sometimes attractive and sometimes repulsive, so they tend to cancel out. In the quantum mechanical way of looking at the gravitational field, the force between two matter particles is pictured as being carried by a particle of spin 2 called the graviton. This has no mass of its own, so the force that it carries is long range. The gravitational force between the sun and the earth is ascribed to the exchange of gravitons between the particles that make up these two bodies. Although the exchanged particles are virtual, they certainly do produce a measurable effect – they make the earth orbit the sun! Real gravitons make up what classical physicists would call gravitational waves, which are very weak – and so difficult to detect that they have not yet been observed.

The next category is the electromagnetic force, which interacts with electrically charged particles like electrons and quarks, but not with uncharged particles such as gravitons. It is much stronger than the gravitational force: the electromagnetic force between two electrons is about a million million million million million million million (1 with forty-two zeros after it) times bigger than the gravitational force. However, there are two kinds of electric charge, positive and negative. The force between two positive charges is repulsive, as is the force between two negative charges, but the force is attractive between a positive and a negative charge. A large body, such as the earth or the sun, contains nearly equal numbers of positive and negative charges. Thus the attractive and repulsive forces between the individual particles nearly cancel each other out, and there is very little net electromagnetic force. However, on the small scales of atoms and molecules, electromagnetic forces dominate. The electromagnetic attraction between negatively charged electrons and positively charged protons in the nucleus causes the electrons to orbit the nucleus of the atom, just as gravitational attraction causes the earth to orbit the sun. The electromagnetic attraction is pictured as being caused by the exchange of large numbers of virtual massless particles of spin 1, called photons. Again, the photons that are exchanged are virtual particles. However, when an electron changes from one allowed orbit to another one nearer to the nucleus, energy is released and a real photon is emitted – which can be observed as visible light by the human eye, if it has the right wavelength, or by a photon detector such as photographic film. Equally, if a real photon collides with an atom, it may move an electron from an orbit nearer the nucleus to one farther away. This uses up the energy of the photon, so it is absorbed.
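
The factor of “1 with forty-two zeros” can be checked with a short calculation (illustrative, not from the text). Because both forces obey an inverse-square law, the separation of the two electrons cancels out of the ratio:

```python
# Ratio of the electric to the gravitational force between two electrons.
# The separation cancels, since both forces fall off as the inverse square.

e   = 1.602e-19   # electron charge, coulombs
m_e = 9.109e-31   # electron mass, kilograms
k   = 8.988e9     # Coulomb constant, N m^2 / C^2
G   = 6.674e-11   # gravitational constant, N m^2 / kg^2

ratio = (k * e**2) / (G * m_e**2)
print(f"electric / gravitational ~ {ratio:.1e}")   # roughly 4e42
```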

The third category is called the weak nuclear force, which is responsible for radioactivity and which acts on all matter particles of spin-½, but not on particles of spin 0, 1, or 2, such as photons and gravitons. The weak nuclear force was not well understood until 1967, when Abdus Salam at Imperial College, London, and Steven Weinberg at Harvard both proposed theories that unified this interaction with the electromagnetic force, just as Maxwell had unified electricity and magnetism about a hundred years earlier. They suggested that in addition to the photon, there were three other spin-1 particles, known collectively as massive vector bosons, that carried the weak force. These were called W+ (pronounced W plus), W- (pronounced W minus), and Zº (pronounced Z naught), and each had a mass of around 100 GeV (GeV stands for gigaelectron-volt, or one thousand million electron volts). The Weinberg-Salam theory exhibits a property known as spontaneous symmetry breaking. This means that what appear to be a number of completely different particles at low energies are in fact found to be all the same type of particle, only in different states. At high energies all these particles behave similarly. The effect is rather like the behavior of a roulette ball on a roulette wheel. At high energies (when the wheel is spun quickly) the ball behaves in essentially only one way – it rolls round and round. But as the wheel slows, the energy of the ball decreases, and eventually the ball drops into one of the thirty-seven slots in the wheel. In other words, at low energies there are thirty-seven different states in which the ball can exist. If, for some reason, we could only observe the ball at low energies, we would then think that there were thirty-seven different types of ball!

In the Weinberg-Salam theory, at energies much greater than 100 GeV, the three new particles and the photon would all behave in a similar manner. But at the lower particle energies that occur in most normal situations, this symmetry between the particles would be broken. W+, W-, and Zº would acquire large masses, making the forces they carry have a very short range. At the time that Salam and Weinberg proposed their theory, few people believed them, and particle accelerators were not powerful enough to reach the energies of 100 GeV required to produce real W+, W-, or Zº particles. However, over the next ten years or so, the other predictions of the theory at lower energies agreed so well with experiment that, in 1979, Salam and Weinberg were awarded the Nobel Prize for physics.

The fourth category is the strong nuclear force, which holds the quarks together in the proton and neutron, and holds the protons and neutrons together in the nucleus of an atom. It is believed that this force is carried by another spin-1 particle, called the gluon, which interacts only with itself and with the quarks. The strong nuclear force has a curious property called confinement: it always binds particles together into combinations that have no color. One cannot have a single quark on its own because it would have a color (red, green, or blue). Instead, a red quark has to be joined to a green and a blue quark by a “string” of gluons (red + green + blue = white). Such a triplet constitutes a proton or a neutron. Another possibility is a pair consisting of a quark and an antiquark (red + antired, or green + antigreen, or blue + antiblue = white). Such combinations make up the particles known as mesons, which are unstable because the quark and antiquark can annihilate each other, producing electrons and other particles. Similarly, confinement prevents one having a single gluon on its own, because gluons also have color. Instead, one has to have a collection of gluons whose colors add up to white. Such a collection forms an unstable particle called a glueball.

The fact that confinement prevents one from observing an isolated quark or gluon might seem to make the whole notion of quarks and gluons as particles somewhat metaphysical. However, there is another property of the strong nuclear force, called asymptotic freedom, that makes the concept of quarks and gluons well defined. At normal energies, the strong nuclear force is indeed strong, and it binds the quarks tightly together. However, experiments with large particle accelerators indicate that at high energies the strong force becomes much weaker, and the quarks and gluons behave almost like free particles.

To understand how a black hole might be formed, we first need an understanding of the life cycle of a star. A star is formed when a large amount of gas (mostly hydrogen) starts to collapse in on itself due to its gravitational attraction. As it contracts, the atoms of the gas collide with each other more and more frequently and at greater and greater speeds – the gas heats up. Eventually, the gas will be so hot that when the hydrogen atoms collide they no longer bounce off each other, but instead coalesce to form helium. The heat released in this reaction, which is like a controlled hydrogen bomb explosion, is what makes the star shine. This additional heat also increases the pressure of the gas until it is sufficient to balance the gravitational attraction, and the gas stops contracting. It is a bit like a balloon – there is a balance between the pressure of the air inside, which is trying to make the balloon expand, and the tension in the rubber, which is trying to make the balloon smaller. Stars will remain stable like this for a long time, with heat from the nuclear reactions balancing the gravitational attraction. Eventually, however, the star will run out of its hydrogen and other nuclear fuels. Paradoxically, the more fuel a star starts off with, the sooner it runs out. This is because the more massive the star is, the hotter it needs to be to balance its gravitational attraction. And the hotter it is, the faster it will use up its fuel. Our sun has probably got enough fuel for another five thousand million years or so, but more massive stars can use up their fuel in as little as one hundred million years, much less than the age of the universe. When a star runs out of fuel, it starts to cool off and so contracts. What might happen to it then was first understood only at the end of the 1920s.

In 1928 an Indian graduate student, Subrahmanyan Chandrasekhar, set sail for England to study at Cambridge with the British astronomer Sir Arthur Eddington, an expert on general relativity. (According to some accounts, a journalist told Eddington in the early 1920s that he had heard there were only three people in the world who understood general relativity. Eddington paused, then replied, “I am trying to think who the third person is.”) During his voyage from India, Chandrasekhar worked out how big a star could be and still support itself against its own gravity after it had used up all its fuel. The idea was this: when the star becomes small, the matter particles get very near each other, and so according to the Pauli exclusion principle, they must have very different velocities. This makes them move away from each other and so tends to make the star expand. A star can therefore maintain itself at a constant radius by a balance between the attraction of gravity and the repulsion that arises from the exclusion principle, just as earlier in its life gravity was balanced by the heat.

Chandrasekhar realized, however, that there is a limit to the repulsion that the exclusion principle can provide. The theory of relativity limits the maximum difference in the velocities of the matter particles in the star to the speed of light. This meant that when the star got sufficiently dense, the repulsion caused by the exclusion principle would be less than the attraction of gravity. Chandrasekhar calculated that a cold star of more than about one and a half times the mass of the sun would not be able to support itself against its own gravity. (This mass is now known as the Chandrasekhar limit.) A similar discovery was made about the same time by the Russian scientist Lev Davidovich Landau.
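
An order-of-magnitude version of this limit can be built from fundamental constants alone (a standard estimate, not derived in the text):

$$M_{\mathrm{Ch}} \;\sim\; \left(\frac{\hbar c}{G}\right)^{3/2} \frac{1}{m_{\mathrm{H}}^{2}} \;\approx\; 2\,M_{\odot},$$

where $m_{\mathrm{H}}$ is the mass of a hydrogen atom and $M_{\odot}$ the mass of the sun; the detailed calculation brings the figure down to roughly the one and a half solar masses quoted above.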

This had serious implications for the ultimate fate of massive stars. If a star’s mass is less than the Chandrasekhar limit, it can eventually stop contracting and settle down to a possible final state as a “white dwarf” with a radius of a few thousand miles and a density of hundreds of tons per cubic inch. A white dwarf is supported by the exclusion principle repulsion between the electrons in its matter. We observe a large number of these white dwarf stars. One of the first to be discovered is a star that is orbiting around Sirius, the brightest star in the night sky.

Landau pointed out that there was another possible final state for a star, also with a limiting mass of about one or two times the mass of the sun but much smaller even than a white dwarf. These stars would be supported by the exclusion principle repulsion between neutrons and protons, rather than between electrons. They were therefore called neutron stars. They would have a radius of only ten miles or so and a density of hundreds of millions of tons per cubic inch. At the time they were first predicted, there was no way that neutron stars could be observed. They were not actually detected until much later.

There are some solutions of the equations of general relativity in which it is possible for our astronaut to see a naked singularity: he may be able to avoid hitting the singularity and instead fall through a “wormhole” and come out in another region of the universe. This would offer great possibilities for travel in space and time, but unfortunately it seems that these solutions may all be highly unstable; the least disturbance, such as the presence of an astronaut, may change them so that the astronaut could not see the singularity until he hit it and his time came to an end. In other words, the singularity would always lie in his future and never in his past. The strong version of the cosmic censorship hypothesis states that in a realistic solution, the singularities would always lie either entirely in the future (like the singularities of gravitational collapse) or entirely in the past (like the big bang). I strongly believe in cosmic censorship so I bet Kip Thorne and John Preskill of Caltech that it would always hold. I lost the bet on a technicality because examples were produced of solutions with a singularity that was visible from a long way away. So I had to pay up, which according to the terms of the bet meant I had to clothe their nakedness. But I can claim a moral victory. The naked singularities were unstable: the least disturbance would cause them either to disappear or to be hidden behind an event horizon. So they would not occur in realistic situations.

General relativity predicts that heavy objects that are moving will cause the emission of gravitational waves, ripples in the curvature of space that travel at the speed of light. These are similar to light waves, which are ripples of the electromagnetic field, but they are much harder to detect. They can be observed by the very slight change in separation they produce between neighboring freely moving objects. A number of detectors are being built in the United States, Europe, and Japan that will measure displacements of one part in a thousand million million million (1 with twenty-one zeros after it), or less than the nucleus of an atom over a distance of ten miles.
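
The quoted sensitivity can be turned into an absolute length (an illustrative calculation, not from the text): a fractional change of one part in $10^{21}$ over a ten-mile baseline is indeed far smaller than an atomic nucleus:

```python
# Displacement corresponding to a strain of 1e-21 over ten miles (illustrative).

strain   = 1e-21           # fractional change in separation
baseline = 10 * 1609.34    # ten miles, in meters

displacement = strain * baseline
print(f"displacement ~ {displacement:.1e} m")   # ~1.6e-17 m, far below a
                                                # nuclear diameter (~1e-14 m)
```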

Like light, gravitational waves carry energy away from the objects that emit them. One would therefore expect a system of massive objects to settle down eventually to a stationary state, because the energy in any movement would be carried away by the emission of gravitational waves. (It is rather like dropping a cork into water: at first it bobs up and down a great deal, but as the ripples carry away its energy, it eventually settles down to a stationary state.) For example, the movement of the earth in its orbit around the sun produces gravitational waves. The effect of the energy loss will be to change the orbit of the earth so that gradually it gets nearer and nearer to the sun, eventually collides with it, and settles down to a stationary state. The rate of energy loss in the case of the earth and the sun is very low – about enough to run a small electric heater. This means it will take about a thousand million million million million years for the earth to run into the sun, so there’s no immediate cause for worry! The change in the orbit of the earth is too slow to be observed, but this same effect has been observed over the past few years occurring in the system called PSR 1913 + 16 (PSR stands for “pulsar,” a special type of neutron star that emits regular pulses of radio waves). This system contains two neutron stars orbiting each other, and the energy they are losing by the emission of gravitational waves is causing them to spiral in toward each other. This confirmation of general relativity won J. H. Taylor and R. A. Hulse the Nobel Prize in 1993. It will take about three hundred million years for them to collide. Just before they do, they will be orbiting so fast that they will emit enough gravitational waves for detectors like LIGO to pick up.

During the gravitational collapse of a star to form a black hole, the movements would be much more rapid, so the rate at which energy is carried away would be much higher. It would therefore not be too long before it settled down to a stationary state.

There was, however, a different interpretation of Israel’s result, which was advocated by Roger Penrose and John Wheeler in particular. They argued that the rapid movements involved in a star’s collapse would mean that the gravitational waves it gave off would make it ever more spherical, and by the time it had settled down to a stationary state, it would be precisely spherical. According to this view, any non-rotating star, however complicated its shape and internal structure, would end up after gravitational collapse as a perfectly spherical black hole, whose size would depend only on its mass. Further calculations supported this view, and it soon came to be adopted generally.

Israel’s result dealt with the case of black holes formed from non-rotating bodies only. In 1963, Roy Kerr, a New Zealander, found a set of solutions of the equations of general relativity that described rotating black holes. These “Kerr” black holes rotate at a constant rate, their size and shape depending only on their mass and rate of rotation. If the rotation is zero, the black hole is perfectly round and the solution is identical to the Schwarzschild solution. If the rotation is non-zero, the black hole bulges outward near its equator (just as the earth or the sun bulge due to their rotation), and the faster it rotates, the more it bulges. So, to extend Israel’s result to include rotating bodies, it was conjectured that any rotating body that collapsed to form a black hole would eventually settle down to a stationary state described by the Kerr solution. In 1970 a colleague and fellow research student of mine at Cambridge, Brandon Carter, took the first step toward proving this conjecture. He showed that, provided a stationary rotating black hole had an axis of symmetry, like a spinning top, its size and shape would depend only on its mass and rate of rotation. Then, in 1971, I proved that any stationary rotating black hole would indeed have such an axis of symmetry. Finally, in 1973, David Robinson at King’s College, London, used Carter’s and my results to show that the conjecture had been correct: such a black hole had indeed to be the Kerr solution. So after gravitational collapse a black hole must settle down into a state in which it could be rotating, but not pulsating. Moreover, its size and shape would depend only on its mass and rate of rotation, and not on the nature of the body that had collapsed to form it. This result became known by the maxim: “A black hole has no hair.” The “no hair” theorem is of great practical importance, because it greatly restricts the possible types of black holes. One can therefore make detailed models of objects that might contain black holes and compare the predictions of the models with observations. It also means that a very large amount of information about the body that has collapsed must be lost when a black hole is formed, because afterward all we can possibly measure about the body is its mass and rate of rotation.

Black holes are one of only a fairly small number of cases in the history of science in which a theory was developed in great detail as a mathematical model before there was any evidence from observations that it was correct. Indeed, this used to be the main argument of opponents of black holes: how could one believe in objects for which the only evidence was calculations based on the dubious theory of general relativity? In 1963, however, Maarten Schmidt, an astronomer at the Palomar Observatory in California, measured the red shift of a faint starlike object in the direction of the source of radio waves called 3C273 (that is, source number 273 in the third Cambridge catalog of radio sources). He found it was too large to be caused by a gravitational field: if it had been a gravitational red shift, the object would have to be so massive and so near to us that it would disturb the orbits of planets in the Solar System. This suggested that the red shift was instead caused by the expansion of the universe, which, in turn, meant that the object was a very long distance away. And to be visible at such a great distance, the object must be very bright, and must, in other words, be emitting a huge amount of energy. The only mechanism that people could think of that would produce such large quantities of energy seemed to be the gravitational collapse not just of a star but of a whole central region of a galaxy. A number of other similar “quasi-stellar objects,” or quasars, have been discovered, all with large red shifts. But they are all too far away and therefore too difficult to observe to provide conclusive evidence of black holes.

How could we hope to detect a black hole, as by its very definition it does not emit any light? It might seem a bit like looking for a black cat in a coal cellar. Fortunately, there is a way. As John Michell pointed out in his pioneering paper in 1783, a black hole still exerts a gravitational force on nearby objects. Astronomers have observed many systems in which two stars orbit around each other, attracted toward each other by gravity. They also observe systems in which there is only one visible star that is orbiting around some unseen companion. One cannot, of course, immediately conclude that the companion is a black hole: it might merely be a star that is too faint to be seen. However, some of these systems, like the one called Cygnus X-1, are also strong sources of X-rays.

We also now have evidence for several other black holes in systems like Cygnus X-1 in our galaxy and in two neighboring galaxies called the Magellanic Clouds. The number of black holes, however, is almost certainly very much higher; in the long history of the universe, many stars must have burned all their nuclear fuel and have had to collapse.

The number of black holes may well be greater even than the number of visible stars, which totals about a hundred thousand million in our galaxy alone. The extra gravitational attraction of such a large number of black holes could explain why our galaxy rotates at the rate it does: the mass of the visible stars is insufficient to account for this.

We also have some evidence that there is a much larger black hole, with a mass of about a hundred thousand times that of the sun, at the center of our galaxy. Stars in the galaxy that come too near this black hole will be torn apart by the difference in the gravitational forces on their near and far sides. Their remains, and gas that is thrown off other stars, will fall toward the black hole. As in the case of Cygnus X-1, the gas will spiral inward and will heat up, though not as much as in that case. It will not get hot enough to emit X rays, but it could account for the very compact source of radio waves and infrared rays that is observed at the galactic center.

It is thought that similar but even larger black holes, with masses of about a hundred million times the mass of the sun, occur at the centers of quasars. For example, observations with the Hubble telescope of the galaxy known as M87 reveal that it contains a disk of gas 130 light-years across rotating about a central object two thousand million times the mass of the sun. This can only be a black hole. Matter falling into such a supermassive black hole would provide the only source of power great enough to explain the enormous amounts of energy that these objects are emitting. As the matter spirals into the black hole, it would make the black hole rotate in the same direction, causing it to develop a magnetic field rather like that of the earth. Very high-energy particles would be generated near the black hole by the in-falling matter. The magnetic field would be so strong that it could focus these particles into jets ejected outward along the axis of rotation of the black hole, that is, in the directions of its north and south poles. Such jets are indeed observed in a number of galaxies and quasars. One can also consider the possibility that there might be black holes with masses much less than that of the sun. Such black holes could not be formed by gravitational collapse, because their masses are below the Chandrasekhar mass limit: stars of this low mass can support themselves against the force of gravity even when they have exhausted their nuclear fuel. Low-mass black holes could form only if matter was compressed to enormous densities by very large external pressures. Such conditions could occur in a very big hydrogen bomb: the physicist John Wheeler once calculated that if one took all the heavy water in all the oceans of the world, one could build a hydrogen bomb that would compress matter at the center so much that a black hole would be created. (Of course, there would be no one left to observe it!) A more practical possibility is that such low-mass black holes might have been formed in the high temperatures and pressures of the very early universe. Black holes would have been formed only if the early universe had not been perfectly smooth and uniform, because only a small region that was denser than average could be compressed in this way to form a black hole. But we know that there must have been some irregularities, because otherwise the matter in the universe would still be perfectly uniformly distributed at the present epoch, instead of being clumped together in stars and galaxies.

Whether the irregularities required to account for stars and galaxies would have led to the formation of a significant number of “primordial” black holes clearly depends on the details of the conditions in the early universe. So if we could determine how many primordial black holes there are now, we would learn a lot about the very early stages of the universe. Primordial black holes with masses more than a thousand million tons (the mass of a large mountain) could be detected only by their gravitational influence on other, visible matter or on the expansion of the universe. However, as we shall learn in the next chapter, black holes are not really black after all: they glow like a hot body, and the smaller they are, the more they glow. So, paradoxically, smaller black holes might actually turn out to be easier to detect than large ones!

However, if one has a black hole around there seems to be a rather easier way of violating the second law: just throw some matter with a lot of entropy, such as a box of gas, down the black hole. The total entropy of matter outside the black hole would go down. One could, of course, still say that the total entropy, including the entropy inside the black hole, has not gone down – but since there is no way to look inside the black hole, we cannot see how much entropy the matter inside it has. It would be nice, then, if there was some feature of the black hole by which observers outside the black hole could tell its entropy, and which would increase whenever matter carrying entropy fell into the black hole. Following the discovery, described above, that the area of the event horizon increased whenever matter fell into a black hole, a research student at Princeton named Jacob Bekenstein suggested that the area of the event horizon was a measure of the entropy of the black hole. As matter carrying entropy fell into a black hole, the area of its event horizon would go up, so that the sum of the entropy of matter outside black holes and the area of the horizons would never go down.

This suggestion seemed to prevent the second law of thermodynamics from being violated in most situations. However, there was one fatal flaw. If a black hole has entropy, then it ought also to have a temperature. But a body with a particular temperature must emit radiation at a certain rate. It is a matter of common experience that if one heats up a poker in a fire it glows red hot and emits radiation, but bodies at lower temperatures emit radiation too; one just does not normally notice it because the amount is fairly small. This radiation is required in order to prevent violation of the second law. So black holes ought to emit radiation. But by their very definition, black holes are objects that are not supposed to emit anything. It therefore seemed that the area of the event horizon of a black hole could not be regarded as its entropy. In 1972 I wrote a paper with Brandon Carter and an American colleague, Jim Bardeen, in which we pointed out that although there were many similarities between entropy and the area of the event horizon, there was this apparently fatal difficulty.

To my surprise and annoyance, I found that even non-rotating black holes should apparently create and emit particles at a steady rate. At first I thought that this emission indicated that one of the approximations I had used was not valid. I was afraid that if Bekenstein found out about it, he would use it as a further argument to support his ideas about the entropy of black holes, which I still did not like. However, the more I thought about it, the more it seemed that the approximations really ought to hold. But what finally convinced me that the emission was real was that the spectrum of the emitted particles was exactly that which would be emitted by a hot body, and that the black hole was emitting particles at exactly the correct rate to prevent violations of the second law. Since then the calculations have been repeated in a number of different forms by other people. They all confirm that a black hole ought to emit particles and radiation as if it were a hot body with a temperature that depends only on the black hole’s mass: the higher the mass, the lower the temperature.

How is it possible that a black hole appears to emit particles when we know that nothing can escape from within its event horizon? The answer, quantum theory tells us, is that the particles do not come from within the black hole, but from the “empty” space just outside the black hole’s event horizon! We can understand this in the following way: what we think of as “empty” space cannot be completely empty because that would mean that all the fields, such as the gravitational and electromagnetic fields, would have to be exactly zero. However, the value of a field and its rate of change with time are like the position and velocity of a particle: the uncertainty principle implies that the more accurately one knows one of these quantities, the less accurately one can know the other. So in empty space the field cannot be fixed at exactly zero, because then it would have both a precise value (zero) and a precise rate of change (also zero). There must be a certain minimum amount of uncertainty, or quantum fluctuations, in the value of the field.

One can think of these fluctuations as pairs of particles of light or gravity that appear together at some time, move apart, and then come together again and annihilate each other. These particles are virtual particles like the particles that carry the gravitational force of the sun: unlike real particles, they cannot be observed directly with a particle detector. However, their indirect effects, such as small changes in the energy of electron orbits in atoms, can be measured and agree with the theoretical predictions to a remarkable degree of accuracy. The uncertainty principle also predicts that there will be similar virtual pairs of matter particles, such as electrons or quarks. In this case, however, one member of the pair will be a particle and the other an antiparticle (the antiparticles of light and gravity are the same as the particles).

Moreover, the lower the mass of the black hole, the higher its temperature. So as the black hole loses mass, its temperature and rate of emission increase, so it loses mass more quickly. What happens when the mass of the black hole eventually becomes extremely small is not quite clear, but the most reasonable guess is that it would disappear completely in a tremendous final burst of emission, equivalent to the explosion of millions of H-bombs.

A black hole with a mass a few times that of the sun would have a temperature of only one ten millionth of a degree above absolute zero. This is much less than the temperature of the microwave radiation that fills the universe (about 2.7º above absolute zero), so such black holes would emit even less than they absorb. If the universe is destined to go on expanding forever, the temperature of the microwave radiation will eventually decrease to less than that of such a black hole, which will then begin to lose mass. But, even then, its temperature would be so low that it would take about a million million million million million million million million million million million years (1 with sixty-six zeros after it) to evaporate completely. This is much longer than the age of the universe, which is only about ten or twenty thousand million years (1 or 2 with ten zeros after it). On the other hand, as mentioned in Chapter 6, there might be primordial black holes with a very much smaller mass that were made by the collapse of irregularities in the very early stages of the universe. Such black holes would have a much higher temperature and would be emitting radiation at a much greater rate. A primordial black hole with an initial mass of a thousand million tons would have a lifetime roughly equal to the age of the universe. Primordial black holes with initial masses less than this figure would already have completely evaporated, but those with slightly greater masses would still be emitting radiation in the form of X rays and gamma rays. These X rays and gamma rays are like waves of light, but with a much shorter wavelength. Such holes hardly deserve the epithet black: they really are white hot and are emitting energy at a rate of about ten thousand megawatts.
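As a rough numerical check (not a calculation from the text itself), the standard Hawking-temperature formula and a simple photon-only evaporation estimate reproduce these orders of magnitude; the specific masses used below are illustrative assumptions:

    # Order-of-magnitude check of the figures quoted above, using the standard
    # Hawking temperature T = hbar*c^3 / (8*pi*G*M*k_B) and the simple
    # photon-only evaporation estimate t ~ 5120*pi*G^2*M^3 / (hbar*c^4).
    import math

    hbar = 1.055e-34   # J s
    c    = 2.998e8     # m/s
    G    = 6.674e-11   # m^3 kg^-1 s^-2
    k_B  = 1.381e-23   # J/K
    year = 3.156e7     # s

    def hawking_temperature(M):
        return hbar * c**3 / (8 * math.pi * G * M * k_B)

    def photon_only_lifetime(M):
        return 5120 * math.pi * G**2 * M**3 / (hbar * c**4)

    M_sun = 1.989e30        # kg
    M_primordial = 1e12     # kg, roughly "a thousand million tons"

    print(hawking_temperature(M_sun))          # ~6e-8 K: about a ten-millionth of a degree
    print(photon_only_lifetime(M_sun) / year)  # ~2e67 years: the ballpark of "1 with sixty-six zeros"
    print(hawking_temperature(M_primordial))   # ~1e11 K: such a small hole really is white hot

The second formula counts only photons; a hole as small as a thousand million tons would also shed neutrinos, electrons, and other particles, which is why its actual lifetime comes out at roughly the age of the universe rather than the longer figure this crude estimate alone would give.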

In order to observe a primordial black hole one would have to detect several gamma ray quanta coming from the same direction within a reasonable space of time, such as a week. Otherwise, they might simply be part of the background. But Planck’s quantum principle tells us that each gamma ray quantum has a very high energy, because gamma rays have a very high frequency, so it would not take many quanta to radiate even ten thousand megawatts. And to observe these few coming from the distance of Pluto would require a larger gamma ray detector than any that have been constructed so far. Moreover, the detector would have to be in space, because gamma rays cannot penetrate the atmosphere.
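As a back-of-the-envelope illustration of this argument (the 100 MeV photon energy and the rough distance of Pluto are assumed round numbers, not figures from the text), one can see how sparse the arriving quanta would be:

    # How thinly the gamma rays from a 10,000-megawatt source are spread by the
    # time they have travelled out to the distance of Pluto's orbit.
    import math

    power          = 1e10                # W ("ten thousand megawatts")
    photon_energy  = 100e6 * 1.602e-19   # J, assuming ~100 MeV per gamma ray quantum
    pluto_distance = 6e12                # m, rough radius of Pluto's orbit (assumed)
    week           = 7 * 24 * 3600       # s

    photons_per_second = power / photon_energy
    flux = photons_per_second / (4 * math.pi * pluto_distance**2)   # photons per m^2 per s

    print(photons_per_second)   # ~6e20 quanta emitted per second in total
    print(flux * week)          # ~1 photon per square metre per week arriving here

On these assumed numbers, collecting even a handful of quanta within a week would call for a detector several square metres in area, which is the sense in which the text says a larger gamma ray detector than any yet constructed would be needed.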

The existence of radiation from black holes seems to imply that gravitational collapse is not as final and irreversible as we once thought. If an astronaut falls into a black hole, the black hole’s mass will increase, but eventually the energy equivalent of that extra mass will be returned to the universe in the form of radiation. Thus, in a sense, the astronaut will be “recycled.” It would be a poor sort of immortality, however, because any personal concept of time for the astronaut would almost certainly come to an end as he was torn apart inside the black hole! Even the types of particles that were eventually emitted by the black hole would in general be different from those that made up the astronaut: the only feature of the astronaut that would survive would be his mass or energy.

The approximations I used to derive the emission from black holes should work well when the black hole has a mass greater than a fraction of a gram. However, they will break down at the end of the black hole’s life when its mass gets very small. The most likely outcome seems to be that the black hole will just disappear, at least from our region of the universe, taking with it the astronaut and any singularity there might be inside it, if indeed there is one. This was the first indication that quantum mechanics might remove the singularities that were predicted by general relativity. However, the methods that I and other people were using in 1974 were not able to answer questions such as whether singularities would occur in quantum gravity. From 1975 onward I therefore started to develop a more powerful approach to quantum gravity based on Richard Feynman’s idea of a sum over histories.

At the big bang itself the universe is thought to have had zero size, and so to have been infinitely hot. But as the universe expanded, the temperature of the radiation decreased. One second after the big bang, it would have fallen to about ten thousand million degrees. This is about a thousand times the temperature at the center of the sun, but temperatures as high as this are reached in H-bomb explosions. At this time the universe would have contained mostly photons, electrons, and neutrinos (extremely light particles that are affected only by the weak force and gravity) and their antiparticles, together with some protons and neutrons. As the universe continued to expand and the temperature to drop, the rate at which electron/antielectron pairs were being produced in collisions would have fallen below the rate at which they were being destroyed by annihilation. So most of the electrons and antielectrons would have annihilated with each other to produce more photons, leaving only a few electrons left over. The neutrinos and antineutrinos, however, would not have annihilated with each other, because these particles interact with themselves and with other particles only very weakly. So they should still be around today. If we could observe them, it would provide a good test of this picture of a very hot early stage of the universe. Unfortunately, their energies nowadays would be too low for us to observe them directly. However, if neutrinos are not massless, but have a small mass of their own, as suggested by some recent experiments, we might be able to detect them indirectly: they could be a form of “dark matter,” like that mentioned earlier, with sufficient gravitational attraction to stop the expansion of the universe and cause it to collapse again.

About one hundred seconds after the big bang, the temperature would have fallen to one thousand million degrees, the temperature inside the hottest stars. At this temperature protons and neutrons would no longer have sufficient energy to escape the attraction of the strong nuclear force, and would have started to combine together to produce the nuclei of atoms of deuterium (heavy hydrogen), which contain one proton and one neutron. The deuterium nuclei would then have combined with more protons and neutrons to make helium nuclei, which contain two protons and two neutrons, and also small amounts of a couple of heavier elements, lithium and beryllium. One can calculate that in the hot big bang model about a quarter of the protons and neutrons would have been converted into helium nuclei, along with a small amount of heavy hydrogen and other elements. The remaining neutrons would have decayed into protons, which are the nuclei of ordinary hydrogen atoms.

In this paper they made the remarkable prediction that radiation (in the form of photons) from the very hot early stages of the universe should still be around today, but with its temperature reduced to only a few degrees above absolute zero (–273ºC). It was this radiation that Penzias and Wilson found in 1965. At the time that Alpher, Bethe, and Gamow wrote their paper, not much was known about the nuclear reactions of protons and neutrons. Predictions made for the proportions of various elements in the early universe were therefore rather inaccurate, but these calculations have been repeated in the light of better knowledge and now agree very well with what we observe. It is, moreover, very difficult to explain in any other way why there should be so much helium in the universe. We are therefore fairly confident that we have the right picture, at least back to about one second after the big bang.

Within only a few hours of the big bang, the production of helium and other elements would have stopped. And after that, for the next million years or so, the universe would have just continued expanding, without anything much happening. Eventually, once the temperature had dropped to a few thousand degrees, and electrons and nuclei no longer had enough energy to overcome the electromagnetic attraction between them, they would have started combining to form atoms. The universe as a whole would have continued expanding and cooling, but in regions that were slightly denser than average, the expansion would have been slowed down by the extra gravitational attraction. This would eventually stop expansion in some regions and cause them to start to recollapse. As they were collapsing, the gravitational pull of matter outside these regions might start them rotating slightly. As the collapsing region got smaller, it would spin faster – just as skaters spinning on ice spin faster as they draw in their arms. Eventually, when the region got small enough, it would be spinning fast enough to balance the attraction of gravity, and in this way disklike rotating galaxies were born. Other regions, which did not happen to pick up a rotation, would become oval-shaped objects called elliptical galaxies. In these, the region would stop collapsing because individual parts of the galaxy would be orbiting stably round its center, but the galaxy would have no overall rotation.
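The spin-up of a shrinking region follows from conservation of angular momentum; a toy calculation (with made-up numbers, purely to show the scaling) might look like this:

    # Angular momentum I*omega stays fixed as an isolated region contracts, and
    # for a roughly uniform blob the moment of inertia scales as M*r^2, so the
    # spin rate grows as 1/r^2 -- the skater pulling in her arms.
    def spin_after_contraction(omega_initial, r_initial, r_final):
        return omega_initial * (r_initial / r_final) ** 2

    print(spin_after_contraction(1.0, 1.0, 0.5))   # 4.0: half the radius, four times the spin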

As time went on, the hydrogen and helium gas in the galaxies would break up into smaller clouds that would collapse under their own gravity. As these contracted, and the atoms within them collided with one another, the temperature of the gas would increase, until eventually it became hot enough to start nuclear fusion reactions. These would convert the hydrogen into more helium, and the heat given off would raise the pressure, and so stop the clouds from contracting any further. They would remain stable in this state for a long time as stars like our sun, burning hydrogen into helium and radiating the resulting energy as heat and light. More massive stars would need to be hotter to balance their stronger gravitational attraction, making the nuclear fusion reactions proceed so much more rapidly that they would use up their hydrogen in as little as a hundred million years. They would then contract slightly, and as they heated up further, would start to convert helium into heavier elements like carbon or oxygen. This, however, would not release much more energy, so a crisis would occur, as was described in the chapter on black holes. What happens next is not completely clear, but it seems likely that the central regions of the star would collapse to a very dense state, such as a neutron star or black hole. The outer regions of the star may sometimes get blown off in a tremendous explosion called a supernova, which would outshine all the other stars in its galaxy. Some of the heavier elements produced near the end of the star’s life would be flung back into the gas in the galaxy, and would provide some of the raw material for the next generation of stars. Our own sun contains about 2 percent of these heavier elements, because it is a second- or third-generation star, formed some five thousand million years ago out of a cloud of rotating gas containing the debris of earlier supernovas. Most of the gas in that cloud went to form the sun or got blown away, but a small amount of the heavier elements collected together to form the bodies that now orbit the sun as planets like the earth.

The earth was initially very hot and without an atmosphere. In the course of time it cooled and acquired an atmosphere from the emission of gases from the rocks. This early atmosphere was not one in which we could have survived. It contained no oxygen, but a lot of other gases that are poisonous to us, such as hydrogen sulfide (the gas that gives rotten eggs their smell). There are, however, other primitive forms of life that can flourish under such conditions. It is thought that they developed in the oceans, possibly as a result of chance combinations of atoms into large structures, called macromolecules, which were capable of assembling other atoms in the ocean into similar structures. They would thus have reproduced themselves and multiplied. In some cases there would be errors in the reproduction. Mostly these errors would have been such that the new macromolecule could not reproduce itself and eventually would have been destroyed. However, a few of the errors would have produced new macromolecules that were even better at reproducing themselves. They would have therefore had an advantage and would have tended to replace the original macromolecules. In this way a process of evolution was started that led to the development of more and more complicated, self-reproducing organisms. The first primitive forms of life consumed various materials, including hydrogen sulfide, and released oxygen. This gradually changed the atmosphere to the composition that it has today, and allowed the development of higher forms of life such as fish, reptiles, mammals, and ultimately the human race.

Why did the universe start out with so nearly the critical rate of expansion that separates models that re-collapse from those that go on expanding forever, that even now, ten thousand million years later, it is still expanding at nearly the critical rate? If the rate of expansion one second after the big bang had been smaller by even one part in a hundred thousand million million, the universe would have re-collapsed before it ever reached its present size.

Despite the fact that the universe is so uniform and homogeneous on a large scale, it contains local irregularities, such as stars and galaxies. These are thought to have developed from small differences in the density of the early universe from one region to another. What was the origin of these density fluctuations?

The general theory of relativity, on its own, cannot explain these features or answer these questions because of its prediction that the universe started off with infinite density at the big bang singularity. At the singularity, general relativity and all other physical laws would break down: one couldn’t predict what would come out of the singularity.

As explained before, this means that one might as well cut the big bang, and any events before it, out of the theory, because they can have no effect on what we observe. Space-time would have a boundary – a beginning at the big bang. Science seems to have uncovered a set of laws that, within the limits set by the uncertainty principle, tell us how the universe will develop with time, if we know its state at any one time. These laws may have originally been decreed by God, but it appears that he has since left the universe to evolve according to them and does not now intervene in it. But how did he choose the initial state or configuration of the universe? What were the “boundary conditions” at the beginning of time?

One example of the use of the weak anthropic principle is to “explain” why the big bang occurred about ten thousand million years ago – it takes about that long for intelligent beings to evolve. As explained above, an early generation of stars first had to form. These stars converted some of the original hydrogen and helium into elements like carbon and oxygen, out of which we are made. The stars then exploded as supernovas, and their debris went to form other stars and planets, among them those of our Solar System, which is about five thousand million years old. The first one or two thousand million years of the earth’s existence were too hot for the development of anything complicated. The remaining three thousand million years or so have been taken up by the slow process of biological evolution, which has led from the simplest organisms to beings who are capable of measuring time back to the big bang.

The laws of science, as we know them at present, contain many fundamental numbers, like the size of the electric charge of the electron and the ratio of the masses of the proton and the electron. We cannot, at the moment at least, predict the values of these numbers from theory – we have to find them by observation. It may be that one day we shall discover a complete unified theory that predicts them all, but it is also possible that some or all of them vary from universe to universe or within a single universe. The remarkable fact is that the values of these numbers seem to have been very finely adjusted to make possible the development of life. For example, if the electric charge of the electron had been only slightly different, stars either would have been unable to burn hydrogen and helium, or else they would not have exploded. Of course, there might be other forms of intelligent life, not dreamed of even by writers of science fiction, that did not require the light of a star like the sun or the heavier chemical elements that are made in stars and are flung back into space when the stars explode. Nevertheless, it seems clear that there are relatively few ranges of values for the numbers that would allow the development of any form of intelligent life. Most sets of values would give rise to universes that, although they might be very beautiful, would contain no one able to wonder at that beauty. One can take this either as evidence of a divine purpose in Creation and the choice of the laws of science or as support for the strong anthropic principle.

Guth suggested that the universe started out from the big bang in a very hot, but rather chaotic, state. These high temperatures would have meant that the particles in the universe would be moving very fast and would have high energies. As we discussed earlier, one would expect that at such high temperatures the strong and weak nuclear forces and the electromagnetic force would all be unified into a single force. As the universe expanded, it would cool, and particle energies would go down. Eventually there would be what is called a phase transition and the symmetry between the forces would be broken: the strong force would become different from the weak and electromagnetic forces. One common example of a phase transition is the freezing of water when you cool it down. Liquid water is symmetrical, the same at every point and in every direction. However, when ice crystals form, they will have definite positions and will be lined up in some direction. This breaks water’s symmetry.

In the case of water, if one is careful, one can “supercool” it: that is, one can reduce the temperature below the freezing point (0ºC) without ice forming. Guth suggested that the universe might behave in a similar way: the temperature might drop below the critical value without the symmetry between the forces being broken. If this happened, the universe would be in an unstable state, with more energy than if the symmetry had been broken. This special extra energy can be shown to have an antigravitational effect: it would have acted just like the cosmological constant that Einstein introduced into general relativity when he was trying to construct a static model of the universe. Since the universe would already be expanding just as in the hot big bang model, the repulsive effect of this cosmological constant would therefore have made the universe expand at an ever-increasing rate. Even in regions where there were more matter particles than average, the gravitational attraction of the matter would have been outweighed by the repulsion of the effective cosmological constant. Thus these regions would also expand in an accelerating inflationary manner. As they expanded and the matter particles got farther apart, one would be left with an expanding universe that contained hardly any particles and was still in the supercooled state. Any irregularities in the universe would simply have been smoothed out by the expansion, as the wrinkles in a balloon are smoothed away when you blow it up. Thus the present smooth and uniform state of the universe could have evolved from many different non-uniform initial states.

The point is that the theory of relativity says that there is no unique measure of time that all observers will agree on. Rather, each observer has his or her own measure of time. If it is possible for a rocket traveling below the speed of light to get from event A (say, the final of the 100-meter race of the Olympic Games in 202) to event B (say, the opening of the 100,004th meeting of the Congress of Alpha Centauri), then all observers will agree that event A happened before event B according to their times. Suppose, however, that the spaceship would have to travel faster than light to carry the news of the race to the Congress. Then observers moving at different speeds can disagree about whether event A occurred before B or vice versa. According to the time of an observer who is at rest with respect to the earth, it may be that the Congress opened after the race. Thus this observer would think that a spaceship could get from A to B in time if only it could ignore the speed-of-light speed limit. However, to an observer at Alpha Centauri moving away from the earth at nearly the speed of light, it would appear that event B, the opening of the Congress, would occur before event A, the 100-meter race. The theory of relativity says that the laws of physics appear the same to observers moving at different speeds.
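A small worked example makes the reversal explicit (the numbers are chosen for convenience and are not taken from the text): suppose that, in the earth’s frame, event B happens one year after event A but four light-years away, so that only a faster-than-light signal could connect them. For an observer moving at half the speed of light, the Lorentz transformation gives

    % Time separation in the moving observer's frame (illustrative numbers):
    \Delta t' = \gamma\left(\Delta t - \frac{v\,\Delta x}{c^{2}}\right),
    \qquad \gamma = \frac{1}{\sqrt{1 - v^{2}/c^{2}}}
    % With \Delta t = 1 year, \Delta x = 4 light-years, v = 0.5c:
    \Delta t' = \frac{1 - 0.5 \times 4}{\sqrt{1 - 0.25}} \approx -1.15\ \text{years}

The negative sign means that, for this observer, the Congress opens before the race is run; had the separation been small enough for light to cover in the time available, the sign could never flip, which is why the ordering of causally connectable events is the same for everyone.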

In 1935, Einstein and Nathan Rosen wrote a paper in which they showed that general relativity allowed what they called “bridges,” but which are now known as wormholes. The Einstein-Rosen bridges didn’t last long enough for a spaceship to get through: the ship would run into a singularity as the wormhole pinched off. However, it has been suggested that it might be possible for an advanced civilization to keep a wormhole open. To do this, or to warp space-time in any other way so as to permit time travel, one can show that one needs a region of space-time with negative curvature, like the surface of a saddle. Ordinary matter, which has a positive energy density, gives space-time a positive curvature, like the surface of a sphere. So what one needs, in order to warp space-time in a way that will allow travel into the past, is matter with negative energy density.

Energy is a bit like money: if you have a positive balance, you can distribute it in various ways, but according to the classical laws that were believed at the beginning of the century, you weren’t allowed to be overdrawn. So these classical laws would have ruled out any possibility of time travel. However, as has been described in earlier chapters, the classical laws were superseded by quantum laws based on the uncertainty principle. The quantum laws are more liberal and allow you to be overdrawn on one or two accounts provided the total balance is positive. In other words, quantum theory allows the energy density to be negative in some places, provided that this is made up for by positive energy densities in other places, so that the total energy remains positive.

An example of how quantum theory can allow negative energy densities is provided by what is called the Casimir effect. As we saw in Chapter 7, even what we think of as “empty” space is filled with pairs of virtual particles and antiparticles that appear together, move apart, and come back together and annihilate each other. Now, suppose one has two parallel metal plates a short distance apart. The plates will act like mirrors for the virtual photons or particles of light. In fact they will form a cavity between them, a bit like an organ pipe that will resonate only at certain notes. This means that virtual photons can occur in the space between the plates only if their wavelengths (the distance between the crest of one wave and the next) fit a whole number of times into the gap between the plates. If the width of a cavity is a whole number of wavelengths plus a fraction of a wavelength, then after some reflections backward and forward between the plates, the crests of one wave will coincide with the troughs of another and the waves will cancel out.

Because the virtual photons between the plates can have only the resonant wavelengths, there will be slightly fewer of them than in the region outside the plates where virtual photons can have any wavelength. Thus there will be slightly fewer virtual photons hitting the inside surfaces of the plates than the outside surfaces. One would therefore expect a force on the plates, pushing them toward each other. This force has actually been detected and has the predicted value. Thus we have experimental evidence that virtual particles exist and have real effects.
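For concreteness, here is a small sketch of the two ingredients of this argument: the discrete set of wavelengths that fit between the plates, and the standard formula for the resulting attractive pressure between ideal plates. The one-micrometre separation is an assumed illustrative value, not one given in the text.

    # Standing waves between plates a distance d apart have wavelengths 2*d/n;
    # excluding all the other modes leaves a small attractive pressure of
    # pi^2 * hbar * c / (240 * d^4) between ideal plates.
    import math

    hbar = 1.055e-34   # J s
    c    = 2.998e8     # m/s
    d    = 1e-6        # m, assumed plate separation of one micrometre

    allowed_wavelengths = [2 * d / n for n in range(1, 6)]
    casimir_pressure = math.pi**2 * hbar * c / (240 * d**4)   # Pa

    print(allowed_wavelengths)   # 2.0e-6, 1.0e-6, 6.7e-7, ... metres
    print(casimir_pressure)      # ~1.3e-3 Pa: tiny, but it has been measured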

In 1928, physicist and Nobel Prize winner Max Born told a group of visitors to Göttingen University, “Physics, as we know it, will be over in six months.” His confidence was based on the recent discovery by Dirac of the equation that governed the electron. It was thought that a similar equation would govern the proton, which was the only other particle known at the time, and that would be the end of theoretical physics. However, the discovery of the neutron and of nuclear forces knocked that one on the head too. Having said this, I still believe there are grounds for cautious optimism that we may now be near the end of the search for the ultimate laws of nature.

In previous chapters I have described general relativity, the partial theory of gravity, and the partial theories that govern the weak, the strong, and the electromagnetic forces. The last three may be combined in so-called grand unified theories, or GUTs, which are not very satisfactory because they do not include gravity and because they contain a number of quantities, like the relative masses of different particles, that cannot be predicted from the theory but have to be chosen to fit observations. The main difficulty in finding a theory that unifies gravity with the other forces is that general relativity is a “classical” theory; that is, it does not incorporate the uncertainty principle of quantum mechanics. On the other hand, the other partial theories depend on quantum mechanics in an essential way. A necessary first step, therefore, is to combine general relativity with the uncertainty principle. As we have seen, this can produce some remarkable consequences, such as black holes not being black, and the universe not having any singularities but being completely self-contained and without a boundary. The trouble is, as explained in Chapter 7, that the uncertainty principle means that even “empty” space is filled with pairs of virtual particles and antiparticles. These pairs would have an infinite amount of energy and, therefore, by Einstein’s famous equation E = mc², they would have an infinite amount of mass. Their gravitational attraction would thus curve up the universe to infinitely small size.

String theory has a curious history. It was originally invented in the late 1960s in an attempt to find a theory to describe the strong force. The idea was that particles like the proton and the neutron could be regarded as waves on a string. The strong forces between the particles would correspond to pieces of string that went between other bits of string, as in a spider’s web. For this theory to give the observed value of the strong force between particles, the strings had to be like rubber bands with a pull of about ten tons.

String theories, however, have a bigger problem: they seem to be consistent only if space-time has either ten or twenty-six dimensions, instead of the usual four! Of course, extra space-time dimensions are a commonplace of science fiction; indeed, they provide an ideal way of overcoming the normal restriction of general relativity that one cannot travel faster than light or back in time (see Chapter 10). The idea is to take a shortcut through the extra dimensions.

Even if we do discover a complete unified theory, it would not mean that we would be able to predict events in general, for two reasons. The first is the limitation that the uncertainty principle of quantum mechanics sets on our powers of prediction. There is nothing we can do to get around that. In practice, however, this first limitation is less restrictive than the second one. It arises from the fact that we could not solve the equations of the theory exactly, except in very simple situations. (We cannot even solve exactly for the motion of three bodies in Newton’s theory of gravity, and the difficulty increases with the number of bodies and the complexity of the theory.) We already know the laws that govern the behavior of matter under all but the most extreme conditions. In particular, we know the basic laws that underlie all of chemistry and biology. Yet we have certainly not reduced these subjects to the status of solved problems: we have, as yet, had little success in predicting human behavior from mathematical equations! So even if we do find a complete set of basic laws, there will still be in the years ahead the intellectually challenging task of developing better approximation methods, so that we can make useful predictions of the probable outcomes in complicated and realistic situations. A complete, consistent, unified theory is only the first step: our goal is a complete understanding of the events around us, and of our own existence.

ISAAC NEWTON

Isaac Newton was not a pleasant man. His relations with other academics were notorious, with most of his later life spent embroiled in heated disputes. Following publication of Principia Mathematica – surely the most influential book ever written in physics – Newton had risen rapidly into public prominence. He was appointed president of the Royal Society and became the first scientist ever to be knighted.

Newton soon clashed with the Astronomer Royal, John Flamsteed, who had earlier provided Newton with much-needed data for Principia, but was now withholding information that Newton wanted. Newton would not take no for an answer: he had himself appointed to the governing body of the Royal Observatory and then tried to force immediate publication of the data. Eventually he arranged for Flamsteed’s work to be seized and prepared for publication by Flamsteed’s mortal enemy, Edmond Halley. But Flamsteed took the case to court and, in the nick of time, won a court order preventing distribution of the stolen work. Newton was incensed and sought his revenge by systematically deleting all references to Flamsteed in later editions of Principia. A more serious dispute arose with the German philosopher Gottfried Leibniz. Both Leibniz and Newton had independently developed a branch of mathematics called calculus, which underlies most of modern physics.

Although we now know that Newton discovered calculus years before Leibniz, he published his work much later. A major row ensued over who had been first, with scientists vigorously defending both contenders. It is remarkable, however, that most of the articles appearing in defense of Newton were originally written by his own hand – and only published in the name of friends! As the row grew, Leibniz made the mistake of appealing to the Royal Society to resolve the dispute. Newton, as president, appointed an “impartial” committee to investigate, coincidentally consisting entirely of Newton’s friends! But that was not all: Newton then wrote the committee’s report himself and had the Royal Society publish it, officially accusing Leibniz of plagiarism. Still unsatisfied, he then wrote an anonymous review of the report in the Royal Society’s own periodical. Following the death of Leibniz, Newton is reported to have declared that he had taken great satisfaction in “breaking Leibniz’s heart.”

During the period of these two disputes, Newton had already left Cambridge and academe. He had been active in anti-Catholic politics at Cambridge, and later in Parliament, and was rewarded eventually with the lucrative post of Warden of the Royal Mint. Here he used his talents for deviousness and vitriol in a more socially acceptable way, successfully conducting a major campaign against counterfeiting, even sending several men to their death on the gallows.

Why don’t we notice all these extra dimensions, if they are really there? Why do we see only three space dimensions and one time dimension? The suggestion is that the other dimensions are curved up into a space of very small size, something like a million million million million millionth of an inch. This is so small that we just don’t notice it: we see only one time dimension and three space dimensions, in which space-time is fairly flat. It is like the surface of a straw. If you look at it closely, you see it is two-dimensional (the position of a point on the straw is described by two numbers, the length along the straw and the distance round the circular direction).

But if you look at it from a distance, you don’t see the thickness of the straw and it looks one-dimensional (the position of a point is specified only by the length along the straw). So it is with space-time: on a very small scale it is ten-dimensional and highly curved, but on bigger scales you don’t see the curvature or the extra dimensions. If this picture is correct, it spells bad news for would-be space travelers: the extra dimensions would be far too small to allow a spaceship through. However, it raises another major problem. Why should some, but not all, of the dimensions be curled up into a small ball? Presumably, in the very early universe all the dimensions would have been very curved. Why did one time dimension and three space dimensions flatten out, while the other dimensions remain tightly curled up?

One possible answer is the anthropic principle. Two space dimensions do not seem to be enough to allow for the development of complicated beings like us. For example, two-dimensional animals living on a one-dimensional earth would have to climb over each other in order to get past each other. If a two-dimensional creature ate something it could not digest completely, it would have to bring up the remains the same way it swallowed them, because if there were a passage right through its body, it would divide the creature into two separate halves: our two-dimensional being would fall apart (Figure 11:8). Similarly, it is difficult to see how there could be any circulation of the blood in a two-dimensional creature.