Jump to content

Elementary Principles in Statistical Mechanics/Chapter IX

From Wikisource

Transcriber's note: The original text used an older notation for the factorial operation, similar in appearance to (see also Mathworld). This notation is difficult to transcribe faithfully in Wikisource, and so the factorials in this chapter have been convered to the modern notation, .

1543578Elementary Principles in Statistical MechanicsChapter IX. The function ϕ and the canonical distribution.Josiah Willard Gibbs

CHAPTER IX.

THE FUNCTION ϕ AND THE CANONICAL DISTRIBUTION.

In this chapter we shall return to the consideration of the canonical distribution, in order to investigate those properties which are especially related to the function of the energy which we have denoted by .

If we denote by , as usual, the total number of systems in the ensemble,

will represent the number having energies between the limits and . The expression
(317)
represents what may be called the density-in-energy. This vanishes for , for otherwise the necessary equation
(318)
could not be fulfilled. For the same reason the density-in-energy will vanish for , if that is a possible value of the energy. Generally, however, the least possible value of the energy will be a finite value, for which, if , will vanish,[1] and therefore the density-in-energy. Now the density-in-energy is necessarily positive, and since it vanishes for extreme values of the energy if , it must have a maximum in such cases, in which the energy may be said to have its most common or most probable value, and which is determined by the equation
(319)

This value of is also, when , its average value in the ensemble. For we have identically, by integration by parts,

(320)
If , the expression in the brackets, which multiplied by would represent the density-in-energy, vanishes at the limits, and we have by (269) and (318)
(321)
It appears, therefore, that for systems of more than two degrees of freedom, the average value of in an ensemble canonically distributed is identical with the value of the same differential coefficient as calculated for the most common energy in the ensemble, both values being reciprocals of the modulus.

Hitherto, in our consideration of the quantities , , , , , , we have regarded the external coördinates as constant. It is evident, however, from their definitions that and are in general functions of the external coördinates and the energy (), that and are in general functions of the external coördinates and the potential energy (). and we have found to be functions of the kinetic energy () alone. In the equation

(322)
by which may be determined, and the external coördinates (contained implicitly in ) are constant in the integration. The equation shows that is a function of these constants. If their values are varied, we shall have by differentiation, if ,
(323)
(Since vanishes with , when , there are no terms due to the variations of the limits.) Hence by (269)
(324)
or, since
(325)
(326)
Comparing this with (112), we get
(327)
The first of these equations might be written[2]
(328)
but must not be confounded with the equation
(329)
which is derived immediately from the identity
(330)

Moreover, if we eliminate from (326) by the equation

(331)
obtained by differentiating (325), we get
(332)
or by (321),
(333)
Except for the signs of average, the second member of this equation is the same as that of the identity
(334)
For the more precise comparison of these equations, we may suppose that the energy in the last equation is some definite and fairly representative energy in the ensemble. For this purpose we might choose the average energy. It will perhaps be more convenient to choose the most common energy, which we shall denote by . The same suffix will be applied to functions of the energy determined for this value. Our identity then becomes
(335)
It has been shown that
(336)
when . Moreover, since the external coördinates have constant values throughout the ensemble, the values of , , etc. vary in the ensemble only on account of the variations of the energy (), which, as we have seen, may be regarded as sensibly constant throughout the ensemble, when is very great. In this case, therefore, we may regard the average values
as practically equivalent to the values relating to the most common energy
In this case also is practically equivalent to . We have therefore, for very large values of ,
(337)
approximately. That is, except for an additive constant, may be regarded as practically equivalent to , when the number of degrees of freedom of the system is very great. It is not meant by this that the variable part of is numerically of a lower order of magnitude than unity. For when is very great, and are very great, and we can only conclude that the variable part of is insignificant compared with the variable part of or of , taken separately.

Now we have already noticed a certain correspondence between the quantities and and those which in thermodynamics are called temperature and entropy. The property just demonstrated, with those expressed by equation (336), therefore suggests that the quantities and may also correspond to the thermodynamic notions of entropy and temperature. We leave the discussion of this point to a subsequent chapter, and only mention it here to justify the somewhat detailed investigation of the relations of these quantities.

We may get a clearer view of the limiting form of the relations when the number of degrees of freedom is indefinitely increased, if we expand the function in a series arranged according to ascending powers of . This expansion may be written

(338)
Adding the identical equation
we get by (336)
(339)
Substituting this value in
which expresses the probability that the energy of an unspecified system of the ensemble lies between the limits and , we get
(340)
When the number of degrees of freedom is very great, and in consequence very small, we may neglect the higher powers and write[3]
(341)

This shows that for a very great number of degrees of freedom the probability of deviations of energy from the most probable value () approaches the form expressed by the 'law of errors.' With this approximate law, we get

(342)
(343)
whence
(344)
Now it has been proved in Chapter VII that
We have therefore
(345)
approximately. The order of magnitude of is therefore that of . This magnitude is mainly constant. The order of magnitude of is that of unity. The order of magnitude of , and therefore of , is that of .[4]

Equation (338) gives for the first approximation

(346)
(347)
(348)
The members of the last equation have the order of magnitude of . Equation (338) gives also for the first approximation
whence
(349)
(350)
This is of the order of magnitude of .[5]

It should be observed that the approximate distribution of the ensemble in energy according to the 'law of errors' is not dependent on the particular form of the function of the energy which we have assumed for the index of probability (). In any case, we must have

(351)
where is necessarily positive. This requires that it shall vanish for , and also for , if this is a possible value. It has been shown in the last chapter that if has a (finite) least possible value (which is the usual case) and , will vanish for that least value of . In general therefore will have a maximum, which determines the most probable value of the energy. If we denote this value by and distinguish the corresponding values of the functions of the energy by the same suffix, we shall have
(352)
The probability that an unspecified system of the ensemble falls within any given limits of energy ( and ) is represented by
If we expand and in ascending powers of , without going beyond the squares, the probability that the energy falls within the given limits takes the form of the 'law of errors'—
(353)
This gives
(354)
and
(355)
We shall have a close approximation in general when the quantities equated in (355) are very small, i. e., when
(356)
is very great. Now when is very great, is of the same order of magnitude, and the condition that (356) shall be very great does not restrict very much the nature of the function .

We may obtain other properties pertaining to average values in a canonical ensemble by the method used for the average of . Let be any function of the energy, either alone or with and the external coördinates. The average value of in the ensemble is determined by the equation

(357)
Now we have identically
(358)
Therefore, by the preceding equation
[6](359)

If we set , (a value which need not be excluded,) the second member of this equation vanishes, as shown on page 101, if , and we get

(360)
as before. It is evident from the same considerations that the second member of (359) will always vanish if , unless becomes infinite at one of the limits, in which case a more careful examination of the value of the expression will be necessary. To facilitate the discussion of such cases, it will be convenient to introduce a certain limitation in regard to the nature of the system considered. We have necessarily supposed, in all our treatment of systems canonically distributed, that the system considered was such as to be capable of the canonical distribution with the given value of the modulus. We shall now suppose that the system is such as to be capable of a canonical distribution with any (finite)[7] modulus. Let us see what cases we exclude by this last limitation.

The impossibility of a canonical distribution occurs when the equation

(361)
fails to determine a finite value for . Evidently the equation cannot make an infinite positive quantity, the impossibility therefore occurs when the equation makes . Now we get easily from (191)
If the canonical distribution is possible for any values of , we can apply this equation so long as the canonical distribution is possible. The equation shows that as is increased (without becoming infinite) cannot become infinite unless simultaneously becomes infinite, and that as is decreased (without becoming zero) cannot become infinite unless simultaneously becomes an infinite negative quantity. The corresponding cases in thermodynamics would be bodies which could absorb or give out an infinite amount of heat without passing certain limits of temperature, when no external work is done in the positive or negative sense. Such infinite values present no analytical difficulties, and do not contradict the general laws of mechanics or of thermodynamics, but they are quite foreign to our ordinary experience of nature. In excluding such cases (which are certainly not entirely devoid of interest) we do not exclude any which are analogous to any actual cases in thermodynamics.

We assume then that for any finite value of the second member of (361) has a finite value.

When this condition is fulfilled, the second member of (359) will vanish for . For, if we set ,

where denotes the value of for the modulus . Since the last member of this formula vanishes for , the less value represented by the first member must also vanish for the same value of . Therefore the second member of (359), which differs only by a constant factor, vanishes at the upper limit. The case of the lower limit remains to be considered. Now
The second member of this formula evidently vanishes for the value of , which gives , whether this be finite or negative infinity. Therefore, the second member of (359) vanishes at the lower limit also, and we have
or
(362)
This equation, which is subject to no restriction in regard to the value of , suggests a connection or analogy between the function of the energy of a system which is represented by and the notion of temperature in thermodynamics. We shall return to this subject in Chapter XIV.

If , the second member of (359) may easily be shown to vanish for any of the following values of viz.: , , , , where denotes any positive number. It will also vanish, when , for , and when for . When the second member of (359) vanishes, and , we may write

(363)
We thus obtain the following equations:

If ,

(364)
(365)
or
(366)
[8](367)
(368)
If ,
[9](369)
If ,
or
(370)
whence
(371)
Giving the values 1, 2, 3, etc., we have
as already obtained. Also
(372)

If is a continuous increasing function of , commencing with , the average value in a canonical ensemble of any function of , either alone or with the modulus and the external coördinates, is given by equation (275), which is identical with (357) except that , , and have the suffix . The equation may be transformed so as to give an equation identical with (359) except for the suffixes. If we add the same suffixes to equation (361), the finite value of its members will determine the possibility of the canonical distribution.

From these data, it is easy to derive equations similar to (360), (362)-(372), except that the conditions of their validity must be differently stated. The equation

requires only the condition already mentioned with respect to . This equation corresponds to (362), which is subject to no restriction with respect to the value of . We may observe, however, that will always satisfy a condition similar to that mentioned with respect to .

If satisfies the condition mentioned, and a similar condition, i. e., if is a continuous increasing function of , commencing with the value , equations will hold similar to those given for the case when , viz., similar to (360), (364)-(368). Especially important is

If , (or ), all satisfy similar conditions, we shall have an equation similar to (369), which was subject to the condition . And if also satisfies a similar condition, we shall have an equation similar to (372), for which the condition was . Finally, if and successive differential coefficients satisfy conditions of the kind mentioned, we shall have equations like (370) and (371) for which the condition was .

These conditions take the place of those given above relating to . In fact, we might give conditions relating to the differential coefficients of , similar to those given relating to the differential coefficients of , instead of the conditions relating to , for the validity of equations (360), (363)-(372). This would somewhat extend the application of the equations.


  1. See page 96.
  2. See equations (321) and (104). Suffixes are here added to the differential coefficients, to make the meaning perfectly distinct, although the same quantities may be written elsewhere without the suffixes, when it is believed that there is no danger of misapprehension. The suffixes indicate the quantities which are constant in the differentiation, the single letter standing for all the letters , , etc., or all except the one which is explicitly varied.
  3. If a higher degree of accuracy is desired than is afforded by this formula, it may be multiplied by the series obtained from
    by the ordinary formula for the expansion in series of an exponential function. There would be no especial analytical difficulty in taking account of a moderate number of terms of such a series, which would commence
  4. Compare (289), (314).
  5. We shall find hereafter that the equation
    is exact for any value of greater than 2, and that the equation
    is exact for any value of greater than 4.
  6. A more general equation, which is not limited to ensembles canonically distributed, is
    where denotes, as usual, the index of probability of phase.
  7. The term finite applied to the modulus is intended to exclude the value zero as well as infinity.
  8. This equation may also be obtained from equations (252) and (321). Compare also equation (349) which was derived by an approximative method.
  9. Compare equation (350), obtained by an approximative method.