Jump to content

Elementary Principles in Statistical Mechanics/Chapter X

From Wikisource
1540755Elementary Principles in Statistical MechanicsChapter X. On a distribution in phase called microcanonical in which all the systems have the same energy.Josiah Willard Gibbs

CHAPTER X.

ON A DISTRIBUTION IN PHASE CALLED MICROCANONICAL IN WHICH ALL THE SYSTEMS HAVE THE SAME ENERGY.

An important case of statistical equilibrium is that in which all systems of the ensemble have the same energy. We may arrive at the notion of a distribution which will satisfy the necessary conditions by the following process. We may suppose that an ensemble is distributed with a uniform density-in-phase between two limiting values of the energy, and , and with density zero outside of those limits. Such an ensemble is evidently in statistical equilibrium according to the criterion in Chapter IV, since the density-in-phase may be regarded as a function of the energy. By diminishing the difference of and , we may diminish the differences of energy in the ensemble. The limit of this process gives us a permanent distribution in which the energy is constant.

We should arrive at the same result, if we should make the density any function of the energy between the limits and , and zero outside of those limits. Thus, the limiting distribution obtained from the part of a canonical ensemble between two limits of energy, when the difference of the limiting energies is indefinitely diminished, is independent of the modulus, being determined entirely by the energy, and is identical with the limiting distribution obtained from a uniform density between limits of energy approaching the same value.

We shall call the limiting distribution at which we arrive by this process microcanonical.

We shall find however, in certain cases, that for certain values of the energy, viz., for those for which is infinite, this process fails to define a limiting distribution in any such distinct sense as for other values of the energy. The difficulty is not in the process, but in the nature of the case, being entirely analogous to that which we meet when we try to find a canonical distribution in cases when becomes infinite. We have not regarded such cases as affording true examples of the canonical distribution, and we shall not regard the cases in which is infinite as affording true examples of the microcanonical distribution. We shall in fact find as we go on that in such cases our most important formulae become illusory.

The use of formulae relating to a canonical ensemble which contain instead of , as in the preceding chapters, amounts to the consideration of the ensemble as divided into an infinity of microcanonical elements.

From a certain point of view, the microcanonical distribution may seem more simple than the canonical, and it has perhaps been more studied, and been regarded as more closely related to the fundamental notions of thermodynamics. To this last point we shall return in a subsequent chapter. It is sufficient here to remark that analytically the canonical distribution is much more manageable than the microcanonical.

We may sometimes avoid difficulties which the microcanonical distribution presents by regarding it as the result of the following process, which involves conceptions less simple but more amenable to analytical treatment. We may suppose an ensemble distributed with a density proportional to

where and are constants, and then diminish indefinitely the value of the constant . Here the density is nowhere zero until we come to the limit, but at the limit it is zero for all energies except . We thus avoid the analytical complication of discontinuities in the value of the density, which require the use of integrals with inconvenient limits.

In a microcanonical ensemble of systems the energy () is constant, but the kinetic energy () and the potential energy () vary in the different systems, subject of course to the condition

(373)
Our first inquiries will relate to the division of energy into these two parts, and to the average values of functions of and .

We shall use the notation to denote an average value in a microcanonical ensemble of energy . An average value in a canonical ensemble of modulus , which has hitherto been denoted by , we shall in this chapter denote by , to distinguish more clearly the two kinds of averages.

The extension-in-phase within any limits which can be given in terms of and may be expressed in the notations of the preceding chapter by the double integral

taken within those limits. If an ensemble of systems is distributed within those limits with a uniform density-in-phase, the average value in the ensemble of any function () of the kinetic and potential energies will be expressed by the quotient of integrals
Since , and when is constant, the expression may be written
To get the average value of in an ensemble distributed microcanonically with the energy , we must make the integrations cover the extension-in-phase between the energies and . This gives
But by (299) the value of the integral in the denominator is . We have therefore
(374)
where and are connected by equation (373), and , if given as function of , or of and , becomes in virtue of the same equation a function of alone.

We shall assume that has a finite value. If , it is evident from equation (305) that is an increasing function of , and therefore cannot be infinite for one value of without being infinite for all greater values of , which would make infinite.[1] When , therefore, if we assume that is finite, we only exclude such cases as we found necessary to exclude in the study of the canonical distribution. But when , cases may occur in which the canonical distribution is perfectly applicable, but in which the formulae for the microcanonical distribution become illusory, for particular values of , on account of the infinite value of . Such failing cases of the microcanonical distribution for particular values of the energy will not prevent us from regarding the canonical ensemble as consisting of an infinity of microcanonical ensembles.[2]

From the last equation, with (298), we get

(375)
But by equations (288) and (289)
(376)
Therefore
(377)

Again, with the aid of equation (301), we get

(378)
if . Therefore, by (289),
(379)

These results are interesting on account of the relations of the functions and to the notion of temperature in thermodynamics,—a subject to which we shall return hereafter. They are particular cases of a general relation easily deduced from equations (306), (374), (288) and (289). We have

The equation may be written
We have therefore
(380)
if . For example, when is even, we may make , which gives, with (307),
(381)

Since any canonical ensemble of systems may be regarded as composed of microcanonical ensembles, if any quantities and have the same average values in every microcanonical ensemble, they will have the same values in every canonical ensemble. To bring equation (380) formally under this rule, we may observe that the first member being a function of is a constant value in a microcanonical ensemble, and therefore identical with its average value. We get thus the general equation

(382)
if .[3] The equations
(383)
(384)
may be regarded as particular cases of the general equation. The last equation is subject to the condition that .

The last two equations give for a canonical ensemble, if ,

(385)
The corresponding equations for a microcanonical ensemble give, if ,
(386)
which shows that approaches the value unity when is very great.

If a system consists of two parts, having separate energies, we may obtain equations similar in form to the preceding, which relate to the system as thus divided.[4] We shall distinguish quantities relating to the parts by letters with suffixes, the same letters without suffixes relating to the whole system. The extension-in-phase of the whole system within any given limits of the energies may be represented by the double integral

taken within those limits, as appears at once from the definitions of Chapter VIII. In an ensemble distributed with uniform density within those limits, and zero density outside, the average value of any function of and is given by the quotient
which may also be written[5]
If we make the limits of integration and , we get the average value of in an ensemble in which the whole system is microcanonically distributed in phase, viz.,
(387)
where and are connected by the equation
(388)
and , if given as function of , or of and , becomes in virtue of the same equation a function of alone.[6] Thus
(389)
(390)
This requires a similar relation for canonical averages
(391)
Again
(392)
But if , vanishes for ,[7] and
(393)
Hence, if , and ,
(394)
and
(395)

We have compared certain functions of the energy of the whole system with average values of similar functions of the kinetic energy of the whole system, and with average values of similar functions of the whole energy of a part of the system. We may also compare the same functions with average values of the kinetic energy of a part of the system.

We shall express the total, kinetic, and potential energies of the whole system by , , and , and the kinetic energies of the parts by , and . These kinetic energies are necessarily separate: we need not make any supposition concerning potential energies. The extension-in-phase within any limits which can be expressed in terms of , , may be represented in the notations of Chapter VIII by the triple integral

taken within those limits. And if an ensemble of systems is distributed with a uniform density within those limits, the average value of any function of , , will be expressed by the quotient
or
To get the average value of for a microcanonical distribution, we must make the limits and . The denominator in this case becomes , and we have
(396)
where , , and are connected by the equation
Accordingly
(397)
and we may write
(398)
and
(399)

Again, if ,

(400)
Hence, if , and ,
(401)
(402)

We cannot apply the methods employed in the preceding pages to the microcanonical averages of the (generalized) forces , , etc., exerted by a system on external bodies, since these quantities are not functions of the energies, either kinetic or potential, of the whole or any part of the system. We may however use the method described on page 116.

Let us imagine an ensemble of systems distributed in phase according to the index of probability

where is any constant which is a possible value of the energy, except only the least value which is consistent with the values of the external coördinates, and and are other constants. We have therefore
(403)
or
(404)
or again
(405)
From (404) we have
(406)
where denotes the average value of in those systems of the ensemble which have any same energy . (This is the same thing as the average value of in a microcanonical ensemble of energy .) The validity of the transformation is evident, if we consider separately the part of each integral which lies between two infinitesimally differing limits of energy. Integrating by parts, we get
(407)
Differentiating (405), we get
(408)
where denotes the least value of consistent with the external coördinates. The last term in this equation represents the part of which is due to the variation of the lower limit of the integral. It is evident that the expression in the brackets will vanish at the upper limit. At the lower limit, at which , and has the least value consistent with the external coördinates, the average sign on is superfluous, as there is but one value of which is represented by . Exceptions may indeed occur for particular values of the external coördinates, at which receive a finite increment, and the formula becomes illusory. Such particular values we may for the moment leave out of account. The last term of (408) is therefore equal to the first term of the second member of (407). (We may observe that both vanish when on account of the factor .)

We have therefore from these equations

or
(409)
That is: the average value in the ensemble of the quantity represented by the principal parenthesis is zero. This must be true for any value of . If we diminish , the average value of the parenthesis at the limit when vanishes becomes identical with the value for . But this may be any value of the energy, except the least possible. We have therefore
(410)
unless it be for the least value of the energy consistent with the external coördinates, or for particular values of the external coördinates. But the value of any term of this equation as determined for particular values of the energy and of the external coördinates is not distinguishable from its value as determined for values of the energy and external coördinates indefinitely near those particular values. The equation therefore holds without limitation. Multiplying by , we get
(411)
The integral of this equation is
(412)
where is a function of the external coördinates. We have an equation of this form for each of the external coördinates. This gives, with (266), for the complete value of the differential of
(413)
or
(414)
To determine the values of the functions , , etc., let us suppose , , etc. to vary arbitrarily, while varies so as always to have the least value consistent with the values of the external coördinates. This will make , and . If , we shall have also , which will give
(415)
The result is the same for any value of . For in the variations considered the kinetic energy will be constantly zero, and the potential energy will have the least value consistent with the external coördinates. The condition of the least possible potential energy may limit the ensemble at each instant to a single configuration, or it may not do so; but in any case the values of , , etc. will be the same at each instant for all the systems of the ensemble,[8] and the equation
will hold for the variations considered. Hence the functions , , etc. vanish in any case, and we have the equation
(416)
or
(417)
or again
(418)
It will be observed that the two last equations have the form of the fundamental differential equations of thermodynamics, corresponding to temperature and to entropy. We have already observed properties of suggestive of an analogy with temperature.[9] The significance of these facts will be discussed in another chapter.

The two last equations might be written more simply

and still have the form analogous to the thermodynamic equations, but has nothing like the analogies with temperature which we have observed in .

  1. See equation (322).
  2. An example of the failing case of the microcanonical distribution is afforded by a material point, under the influence of gravity, and constrained to remain in a vertical circle. The failing case occurs when the energy is just sufficient to carry the material point to the highest point of the circle. It will be observed that the difficulty is inherent in the nature of the case, and is quite independent of the mathematical formulae. The nature of the difficulty is at once apparent if we try to distribute a finite number of material points with this particular value of the energy as nearly as possible in statistical equilibrium, or if we ask: What is the probability that a point taken at random from an ensemble in statistical equilibrium with this value of the energy will be found in any specified part of the circle?
  3. See equation (292).
  4. If this condition is rigorously fulfilled, the parts will have no influence on each other, and the ensemble formed by distributing the whole microcanonically is too arbitrary a conception to have a real interest. The principal interest of the equations which we shall obtain will be in cases in which the condition is approximately fulfilled. But for the purposes of a theoretical discussion, it is of course convenient to make such a condition absolute. Compare Chapter IV, pp. 35 ff., where a similar condition is considered in connection with canonical ensembles.
  5. Where the analytical transformations are identical in form with those on the preceding pages, it does not appear necessary to give all the steps with the same detail.
  6. In the applications of the equation (387), we cannot obtain all the results corresponding to those which we have obtained from equation (374), because is a known function of , while must be treated as an arbitrary function of , or nearly so.
  7. See Chapter VIII, equations (306) and (316).
  8. This statement, as mentioned before, may have exceptions for particular values of the external coördinates. This will not invalidate the reasoning, which has to do with varying values of the external coördinates.
  9. See Chapter IX, page 111; also this chapter, page 119.