Home | Blog index | Previous | Next | About | Privacy policy |
By Chris Austin. 30 November 2020.
An earlier version of this post was published on another website on 1 January 2013.
This is the fifth in a series of posts about the foundation of our understanding of the way the physical world works, which I'm calling Dirac-Feynman-Berezin sums. The previous post in the series is Action For Fields.
As in the previous posts, I'll show you some formulae and things like that along the way, but I'll try to explain what all the parts mean as we go along, if we've not met them already, so you don't need to know about that sort of thing in advance.
One of the clues that led to the discovery of Dirac-Feynman-Berezin sums came from the attempted application to electromagnetic radiation of discoveries about heat and temperature. We looked at those discoveries about heat and temperature in the second post in the series, and in the third post, we looked at how James Clerk Maxwell, just after the middle of the nineteenth century, was able to identify light as waves of oscillating electric and magnetic fields, and to calculate the speed of light from measurements of electrical and magnetic effects. Dirac-Feynman-Berezin sums for a physical system depend on a property of the system called its action, and in the first post in the series, we derived Sir Isaac Newton's second law of motion from Pierre-Louis de Maupertuis's principle of stationary action. In the fourth post, we calculated the conserved energy of a system of electromagnetic fields and electrically charged particles, by deriving Maxwell's equations for the electromagnetic fields, and the forces exerted on the charged particles by the fields, from de Maupertuis's principle for the action of the system.
Today I would like to put all these pieces together, and show you how they led to a seriously wrong conclusion about the properties of electromagnetic radiation in a hot oven. In the next post in the series, Dirac-Feynman sums, we'll look at how the problem was resolved by the discoveries that led to Dirac-Feynman-Berezin sums, which started with the identification of a new fundamental constant of nature by Max Planck, in 1899.
Let's now consider the electromagnetic fields in a box-shaped oven whose sides are aligned with the Cartesian coordinate directions, and whose internal dimensions in the Cartesian coordinate directions are , , and . We'll assume that the internal faces of the oven walls are perfectly reflecting, and that the oven is empty apart from the electromagnetic fields. We found in the third post in the series, here, that for any vector , any angle , and any vector perpendicular to , which means that , a solution of the equations for the electromagnetic field in a vacuum, with no electric charges or electric currents present, is given by:
where is the voltage field and is the vector potential field, and the electric field strength and the magnetic induction field are expressed in terms of and by the formulae we found in the third post in the series, here. We found in the third post in the series, here, that is equal to the speed of light metres per second in a vacuum, so I'll now write instead of .
From calculations similar to the ones in the third post in the series, here, which confirmed that a wave of the above form satisfies the gauge condition on and in the third post in the series, here, and Maxwell's equations in a vacuum in terms of and assuming that gauge condition, as in the third post in the series, here, with no electric charges or electric currents present, we find that an arbitrary sum of waves of the above form also satisfies those equations. We'll assume that the electromagnetic fields inside the oven consist of a sum of waves of the above form, so that inside the oven.
Maxwell's equation summarizing Faraday's measurements involving time-dependent magnetic fields, as in the third post in the series, here, shows that the components of tangential to an oven wall must be continuous at the oven wall, for if a component of tangential to the oven wall changed discontinously at the oven wall, the component of in the perpendicular direction along the oven wall would have to be infinite at the oven wall. The assumption that the oven walls are perfectly reflecting means that and are 0 inside the material of the oven walls, and we'll assume that and are also 0 inside the material of the oven walls. So from the formula for in terms of and , as in the third post in the series, here, the tangential components of must be continuous at the oven wall, and are thus 0 at the oven wall.
The requirement that the tangential components of must be 0 at each wall of the oven is called a boundary condition, and restricts the possible wave vectors of the electromagnetic waves inside the oven. We'll assume that the interior faces of the oven walls perpendicular to the Cartesian coordinate direction are at and , for . A single wave of the above form with a nonzero polarization vector and a nonzero wave vector cannot satisfy the boundary condition on any wall of the oven by itself, so if there is a wave present with particular values of , , and , there must also be other waves present with different values of , , or , such that the sum of these waves satisfies the boundary conditions.
Considering first the boundary at , if there is a wave present with
with nonzero and , then there must also be other waves present with different values of , , or , such that the sum of over these waves is 0 for all values of , and all values of in the range , and all values of in the range . Any waves present relevant to satisfying the boundary condition at will have the same values of , , and , and we can also require them to have the same value of , since from the first post in the series, here, the value of is unaltered by adding a whole number times to . Thus since , the only other relevant value of is . Thus to satisfy the boundary condition at , any wave present as above must be paired with another wave with the opposite value of , such that the sum of the two waves has the form:
since the 2 and 3 components of this are 0 at , and the polarization vector of the second wave is perpendicular to the wave vector of the second wave.
Considering, next, the boundary condition at , this must also be satisfied by the above sum of two waves, since any relevant waves have the same values of , , and , and can be chosen to have the same value of . Thus we require , , and to be such that the 2 and 3 components of the above sum of two waves are also 0 at , for all values of , and all values of in the range , and all values of in the range . To determine the values of , , and that satisfy this requirement, it is helpful to know about a formula that expresses in terms of , , , and , for arbitrary angles and .
From the definition of and , as in the first post in the series, here, are the Cartesian coordinates, in the 2-dimensional plane of Euclidean geometry, of a point that is moving along a circle of radius 1 centred at the point with Cartesian coordinates , such that the angle between the straight line from to and the straight line from to is , and is at . We'll now use the straight line from to as the first coordinate direction of a second set of Cartesian coordinates also centred at , such that the coordinate directions of the second set of Cartesian coordinates rotate into the original set as tends to 0. From the definition of Cartesian coordinates, as in the first post in the series, here, the second coordinate direction of the second set of Cartesian coordinates must be perpendicular to the first coordinate direction of the second set, so by the discussion in the third post in the series, here, it satisfies , whose solution of length 1 is . This is required to become as tends to 0, so the required solution is . A point whose coordinates are with respect to the first set of coordinates has coordinates with respect to the second set of coordinates, so we have:
Thus we have:
and also:
for all angles and all angles .
The above sum of two waves is therefore equal to:
where I have defined the Greek letter to be . We observe from the definition of and , as in the first post in the series, here, that:
and:
for all , so the above sum of two waves is:
Thus the requirement that the 2 and 3 components of the sum of the two waves are 0 at , for all values of , and all values of in the range , and all values of in the range , or equivalently, at , for all values of , means that either , or . If , then and means that , while if , then from the definition of , as in the first post in the series, here, we have , for some whole number .
Considering the boundary conditions at the oven walls perpendicular to the 2 and 3 coordinate directions in the same way, we therefore find that if there is a wave present as above with nonzero and such that , then it must be part of a sum of waves of the form:
and the components of must be of the form , for some whole numbers , . Doing the sums over the signs , , and using the formulae above, we find:
From the definition of and , as in the first post in the series, here, we observe that , for all . Thus from the formula above, we have:
for all and all . Thus if we display the -dependence of the above sum of waves by representing it as , we have:
Thus for arbitrary , the above sum of waves is equal to a sum of waves of the above form with , plus a sum of waves of the above form with .
From the formulae in the third post in the series, here, the electric field strength and the magnetic induction field for the above sum of waves are:
When the oven is hot, we can expect that it will contain electromagnetic radiation in these possible modes of oscillation. Let's now consider a sum of the above possible modes of electromagnetic radiation in the oven for all the possible values of , as above, and the two independent values 0 and of . There is an independent polarization vector for each pair of a possible value of and one of the two independent values 0 and of , and I'll write this as , where are whole numbers such that , for . The symbol means, "greater than or equal to." The polarization vector satisfies . The 1 component of the electric field strength is now:
where means , and the other components of and the components of are now analogous sums of the components above.
From the gauge-invariant formula for the Hamiltonian for a collection of electrically charged point particles moving slowly compared to the speed of light in a vacuum, plus electric and magnetic fields, which we found in the previous post, here, the energy of the electromagnetic fields in the oven involves the integrals over the volume of the oven of the squares of the components of and . To calculate the contribution of , where is equal to a sum over the independent modes as above, it is convenient to use independent dummy indexes and for each of the two factors of , so that can be written as:
where stand for the remaining factors, and , for .
Considering a term in the above sum with specific values of , , , and , the integral over the volume of the oven factorizes into a product of integrals of the form or , there being one such factor for each of the three possible values of . From the result we found above, and the observations above, we have:
for all and all . Thus we have:
We now observe that if is a fixed number, and is a quantity that depends smoothly on a quantity , so in the terminology of the previous post, here, is a smooth function of , then:
So from the result we found in the first post in the series, here, we have:
Thus from the result we found in the first post in the series, here, that the integral of the rate of change of a quantity is equal to the net change of that quantity, we have:
if is a whole number , while for we have . Thus from the result above, we find that for whole numbers and :
And similarly:
Thus after doing the integral over the volume of the oven, the contribution from to the gauge-invariant formula for the total energy, , as in the previous post, here, is:
where now represents the volume of the oven, and . The expression is 1 if and 0 otherwise, in accordance with the definition of the Kronecker delta, in the first post in the series, here. And in the same way, we find that when is expressed as a sum over , and , analogous to the expression for above, the contributions from to with give 0 after doing the integral over the volume of the oven.
We'll focus now on the contributions of modes with for all . From the formula above and the analogous formulae for the contributions from and , the sum of the contributions from , , and , for the modes with for all , is:
And from the formulae for the components of , above, the sum of the contributions from , , and , to , as in the previous post, here, for the modes with for all , is:
since, for example, , from the definition of the antisymmetric tensor , in the third post in the series, here. From the formula we found in the third post in the series, here, with the indexes and rewritten as and , and the dummy index rewritten as , and the property of the Kronecker delta we observed in the third post in the series, here, the above expression is equal to:
since .
We found in the third post in the series, here, that , the speed of light in a vacuum, is equal to , so the factor in the contribution from is equal to the factor in the contribution from . Thus for each , our observation above implies that the total contribution from terms where and , or vice versa, is proportional to:
where at the second step I used the observation that for all , which follows from the definition of , as in the first post in the series, here.
Thus since for all , the total contribution of the modes with for all to the energy of the electromagnetic radiation in the oven is:
where I used the formula for above.
We now observe that the arguments that led to the Boltzmann distribution, as in the second post in the series, here, for the most likely number of objects of a given type in a given position and momentum bin, when the range of possible positions and momenta of the microscopic objects in a system in thermal equilibrium at absolute temperature is divided up into tiny bins of equal size, can be adapted to the electromagnetic radiation subject to Maxwell's equations in a hot oven, for radiation of wavelengths very small compared to the dimensions , , and of the oven, in the following way.
We'll consider radiation modes whose wavelengths are sufficiently small compared to the dimensions of the oven that we can group the modes, described by their wave-number vector and or , into "types" of similar and equal , such that the number of modes of each type is large compared to 1, and the relative differences , , and , between the wave-number vectors and of any two modes of the same type are small compared to 1. For example for infra-red radiation of wavelength about metres in an oven of size about a metre cubed, we can assume that , , and are all at least about , and divide up the range of each into ranges to ; to ; and so on, and say that two wave-number vectors and are of the same type if is in the same range as , for , and 3. Then the number of wave-number vectors of each type is , and for and of the same type, is , for , and 3. This is consistent with the definition of the type of an object that I gave in the second post in the series, here, in the course of the derivation of the Boltzmann distribution.
Looking back at the derivation of the Boltzmann distribution, in the second post in the series, starting here, we observe that it depended on the assumption that the number of objects of each type is very large compared to 1, and that requirement will be satisfied for the radiation modes in the oven if we regard the modes of wave-number vector such that for , and 3 as "objects", and classify them into types as I just described.
The derivation of the Boltzmann distribution also depended on the conserved total energy being equal to the sum of the energies of the individual objects, so that the range of possible positions and momenta of an object could be divided into very small bins, such that the conserved total energy is to a very good approximation equal to a sum , as in the second post in the series, here, where is the number of objects of type in bin , and is the energy of an object of type at the centre of bin . However the derivation did not depend on any particular details of how depends on the type of object or the position and momentum of an object at the centre of bin , and it did not depend on the numbers associated with an object, that determine the contribution of that object to the total energy , being the position and momentum coordinates of the object, as opposed to some other quantities associated to the object that determine its contribution to .
Thus from the formula above for the contribution of the modes with for all to the energy of the electromagnetic radiation in the oven, which shows that the energy is the sum of a contribution from each mode, the argument works equally well if we regard the modes of wave-number vector such that for , and 3 as objects, and classify them into types as I described above, where the variable quantities associated with each object, that determine the contribution of that object to , are the components of the polarization vector of that mode. Two of the three components of can vary independently for each mode, due to the restriction that , as above.
The derivation of the Boltzmann distribution depended on the range of possible values of the position and momentum coordinates of an object being divided into bins of equal size, but although I used the same bins, for convenience, for each different type of object, the derivation did not depend on that, and neither the derivation nor the result, as in the second post in the series, here, depended on the actual sizes of the bins, other than through the requirements that they be small enough that the energies of all the objects of type in bin are equal to to a good accuracy, and that they be large enough, or that the total number of objects of type be large enough, that the total number of objects in each bin, or at least, in the bins with the largest numbers of objects of type , should be large compared to 1. The derivation of the formula , in the second post in the series, starting here, was carried out separately for each different type of object , and the value of for each different type of object, as in the formula in the second post in the series, here, or in the example of the ideal gas, which we studied in the second post in the series, starting here, automatically compensates for any change of the bin size for each different type of object.
From the formula above, the contribution of a mode the energy of the electromagnetic radiation in the oven is proportional to the sum of the squares of the components of the polarization vector of the mode, so from the analogy to the kinetic energy of a particle, which is proportional to the sum of the squares of the components of the momentum vector of the particle, I shall assume that it is the range of possible polarization vectors of a mode that should be divided into equal size bins. The modes are grouped into types of approximately equal wave-number vector and equal as above, and for a type whose wave-number is , we'll choose two vectors and of length 1 that are perpendicular to and perpendicular to each other. We'll represent the components of the polarization vector in the directions and by and , so from the formula in the third post in the series, here:
for . The mutually perpendicular vectors , , and are the vectors of length 1 in the coordinate directions of an alternative system of Cartesian coordinates, so by Pythagoras:
since .
Thus from the formula above, the energy of a mode of type with independent polarization vector components and is:
So in a similar manner to the calculation for an ideal gas, in the second post in the series, starting here, we find from the result in the second post in the series, here, that the most likely number of modes of type in a polarization bin with edge sizes and is:
where is the total number of modes of type . From the result above, and the results we obtained in the second post in the series, here, and here, we find:
so:
In the same way as for the ideal gas example, in the second post in the series, here, we'll assume now that this most likely number of modes of type in each polarization bin is the actual number of modes in each polarization bin. So the total energy of the modes of type is:
From the result in the second post in the series, here, with , , this is equal to:
And from the calculation in the second post in the series, starting here, this is equal to:
Thus in thermal equilibrium at absolute temperature , each of the modes of type has energy . Since this is independent of and , we thus find that for every mode for which all three components of are sufficiently large, the energy of that mode, in thermal equilibrium at absolute temperature , is . From the discussion above, this applies, in particular, for electromagnetic radiation of infra-red and all shorter wavelengths in an oven of size about a metre cubed or larger, for almost all directions of the wave-number vector .
Thus since there is no upper limit to the size of the components of the wave-number vector , we have arrived at a result in contradiction with everyday experience: if the absolute temperature of the oven is greater than 0, then the energy of the electromagnetic radiation in the oven is infinite, because there are an infinite number of modes, each of which has energy .
At the end of the nineteenth century, the actual energy of the electromagnetic radiation in different frequency ranges in hot ovens was measured by Otto Lummer, Ferdinand Kurlbaum, and Heinrich Rubens. From the formulae for the electric and magnetic fields in the waves, above, the angular frequency , where is the Greek letter omega, which is times the number of cycles per unit time, is , where the wavevector is , from above, and the speed of light is , from the third post in the series, here. Each wave-number vector corresponds to 2 modes, one for each of the two values 0 and of , so the number of modes per unit volume in the space of wavevectors is . Each component of is , so from the value we found in the third post in the series, here, for the area of a sphere of radius 1, the total number of modes whose wavevector magnitude lies in the range from to is:
Thus the total number of modes whose angular frequency lies in the range from to is , so since each of these modes has energy according to the result above, the total energy of electromagnetic radiation in the oven in modes whose angular frequency lies in the range from to , according to the result above, is:
This is called the Rayleigh-Jeans law, after Lord Rayleigh and Sir James Jeans.
Max Planck discovered in 1900 that the measurements of Lummer, Kurlbaum, and Rubens are instead represented accurately by a formula:
where , which is pronounced "h bar", is a fundamental constant of nature that was previously unknown, whose value is:
and is Napier's number, as in the second post in the series, here.
From the formula in the second post in the series, here, the factor tends to as tends to 0, and is approximately equal to for all such that , so the Rayleigh-Jeans law, as above, is approximately valid for all such that , and becomes accurately valid when is substantially smaller than . However at such that , the energy in modes whose angular frequency lies in the range from to stops growing with as predicted by the Rayleigh-Jeans law, and at larger it decreases very rapidly with increasing , in complete contradiction with the Rayleigh-Jeans law.
If we define , then the Planck law, above, becomes , and the Rayleigh-Jeans law, above, becomes . This graph shows the Planck factor, , which is in agreement with observation, plotted in blue, and the Rayleigh-Jeans factor, , plotted in red. We see that the Rayleigh-Jeans factor is already too large by about a factor of 2 at , and only agrees well with the correct Planck factor for less than about 0.4.
The quantity is known as Planck's constant. The dimensions of are energy times time, which are the dimensions of de Maupertuis's action. Planck suggested that energy could be transferred between the oven walls and the electromagnetic radiation of frequency in the oven only in whole number multiples of a basic amount . This was the start of the discoveries that led to Dirac-Feynman-Berezin sums, and which made possible, among other things, the design and construction of the electronic device on which you are reading this blog post.
In the next post in this series, Dirac-Feynman sums, we'll take a first look at a special case of Dirac-Feynman-Berezin sums, which I'll call Dirac-Feynman sums. We'll see that Dirac-Feynman sums lead to de Maupertuis's principle, Newton's laws, and Maxwell's equations, when the energies involved are large compared to Planck's basic amount , but lead to essential corrections when the energies involved are comparable in size to , or smaller than . In particular, Dirac-Feynman sums correct the Rayleigh-Jeans law to the Planck law, when the Boltzmann energy is comparable to or smaller than the Planck energy .
The software on this website is licensed for use under the Free Software Foundation General Public License.
Page last updated 4 May 2023. Copyright (c) Chris Austin 1997 - 2023. Privacy policy