CHAPTER IX.
THE FUNCTION ϕ AND THE CANONICAL DISTRIBUTION.
In this chapter we shall return to the consideration of the canonical distribution, in order to investigate those properties which are especially related to the function of the energy which we have denoted by .
If we denote by , as usual, the total number of systems in the ensemble,
|
|
will represent the number having energies between the limits
and
. The expression
|
(317)
|
represents what may be called the
density-in-energy. This vanishes for
, for otherwise the necessary equation
|
(318)
|
could not be fulfilled. For the same reason the density-in-energy will vanish for
, if that is a possible value of the energy. Generally, however, the least possible value of the energy will be a finite value, for which, if
,
will vanish,
[1] and therefore the density-in-energy. Now the density-in-energy is necessarily positive, and since it vanishes for extreme values of the energy if
, it must have a maximum in such cases, in which the energy may be said to have
its most common or most probable value, and which is determined by the equation
|
(319)
|
This value of is also, when , its average value in the ensemble. For we have identically, by integration by parts,
|
(320)
|
If
, the expression in the brackets, which multiplied by
would represent the density-in-energy, vanishes at the limits, and we have by (269) and (318)
|
(321)
|
It appears, therefore, that for systems of more than two degrees of freedom, the average value of
in an ensemble canonically distributed is identical with the value of the same differential coefficient as calculated for the most common energy in the ensemble, both values being reciprocals of the modulus.
Hitherto, in our consideration of the quantities , , , , , , we have regarded the external coördinates as constant. It is evident, however, from their definitions that and are in general functions of the external coördinates and the energy (), that and are in general functions of the external coördinates and the potential energy (). and we have found to be functions of the kinetic energy () alone. In the equation
|
(322)
|
by which
may be determined,
and the external coördinates (contained implicitly in
) are constant in the integration. The equation shows that
is a function of these constants.
If their values are varied, we shall have by differentiation, if
,
|
(323)
|
(Since
vanishes with
, when
, there are no terms due to the variations of the limits.) Hence by (269)
|
(324)
|
or, since
|
(325)
|
|
(326)
|
Comparing this with (112), we get
|
(327)
|
The first of these equations might be written
[2]
|
(328)
|
but must not be confounded with the equation
|
(329)
|
which is derived immediately from the identity
|
(330)
|
Moreover, if we eliminate from (326) by the equation
|
(331)
|
obtained by differentiating (325), we get
|
(332)
|
or by (321),
|
(333)
|
Except for the signs of average, the second member of this equation is the same as that of the identity
|
(334)
|
For the more precise comparison of these equations, we may suppose that the energy in the last equation is some definite and fairly representative energy in the ensemble. For this purpose we might choose the average energy. It will perhaps be more convenient to choose the most common energy, which we shall denote by
. The same suffix will be applied to functions of the energy determined for this value. Our identity then becomes
|
(335)
|
It has been shown that
|
(336)
|
when
. Moreover, since the external coördinates have constant values throughout the ensemble, the values of
,
, etc. vary in the ensemble only on account of the variations of the energy (
), which, as we have seen, may be regarded as sensibly constant throughout the ensemble, when
is very great. In this case, therefore, we may regard the average values
|
|
as practically equivalent to the values relating to the most common energy
|
|
In this case also
is practically equivalent to
. We have therefore, for very large values of
,
|
(337)
|
approximately. That is, except for an additive constant,
may be regarded as practically equivalent to
, when the number of degrees of freedom of the system is very great. It is not meant by this that the variable part of
is numerically of a lower order of magnitude than unity. For when
is very great,
and
are very great, and we can only conclude that the variable part of
is insignificant compared with the variable part of
or of
, taken separately.
Now we have already noticed a certain correspondence between the quantities and and those which in thermodynamics are called temperature and entropy. The property just demonstrated, with those expressed by equation (336), therefore suggests that the quantities and may also correspond to the thermodynamic notions of entropy and temperature. We leave the discussion of this point to a subsequent chapter, and only mention it here to justify the somewhat detailed investigation of the relations of these quantities.
We may get a clearer view of the limiting form of the relations when the number of degrees of freedom is indefinitely increased, if we expand the function in a series arranged according to ascending powers of . This expansion may be written
|
(338)
|
Adding the identical equation
|
|
we get by (336)
|
(339)
|
Substituting this value in
|
|
which expresses the probability that the energy of an unspecified system of the ensemble lies between the limits
and
, we get
|
(340)
|
When the number of degrees of freedom is very great, and
in consequence very small, we may neglect the higher powers and write
[3]
|
(341)
|
This shows that for a very great number of degrees of freedom the probability of deviations of energy from the most probable value () approaches the form expressed by the 'law of errors.' With this approximate law, we get
|
(342)
|
|
(343)
|
whence
|
(344)
|
Now it has been proved in Chapter VII that
|
|
We have therefore
|
(345)
|
approximately. The order of magnitude of
is therefore that of
. This magnitude is mainly constant. The order of magnitude of
is that of unity. The order of magnitude of
, and therefore of
, is that of
.
[4]
Equation (338) gives for the first approximation
|
(346)
|
|
(347)
|
|
(348)
|
The members of the last equation have the order of magnitude of
. Equation (338) gives also for the first approximation
|
|
whence
|
(349)
|
|
(350)
|
This is of the order of magnitude of
.
[5]
It should be observed that the approximate distribution of the ensemble in energy according to the 'law of errors' is not dependent on the particular form of the function of the energy which we have assumed for the index of probability (). In any case, we must have
|
(351)
|
where
is necessarily positive. This requires that it shall vanish for
, and also for
, if this is a possible value. It has been shown in the last chapter that if
has a (finite) least possible value (which is the usual case) and
,
will vanish for that least value of
. In general therefore
will have a maximum, which determines the most probable value of the energy. If we denote this value by
and distinguish the corresponding values of the functions of the energy by the same suffix, we shall have
|
(352)
|
The probability that an unspecified system of the ensemble
falls within any given limits of energy (
and
) is represented by
|
|
If we expand
and
in ascending powers of
, without going beyond the squares, the probability that the energy falls within the given limits takes the form of the 'law of errors'—
|
(353)
|
This gives
|
(354)
|
and
|
(355)
|
We shall have a close approximation in general when the quantities equated in (355) are very small,
i. e., when
|
(356)
|
is very great. Now when
is very great,
is of the same order of magnitude, and the condition that (356) shall be very great does not restrict very much the nature of the function
.
We may obtain other properties pertaining to average values in a canonical ensemble by the method used for the average of . Let be any function of the energy, either alone or with and the external coördinates. The average value of in the ensemble is determined by the equation
|
(357)
|
Now we have identically
|
(358)
|
Therefore, by the preceding equation
|
[6](359)
|
If we set , (a value which need not be excluded,) the second member of this equation vanishes, as shown on page 101, if , and we get
|
(360)
|
as before. It is evident from the same considerations that the second member of (359) will always vanish if
, unless
becomes infinite at one of the limits, in which case a more careful examination of the value of the expression will be necessary. To facilitate the discussion of such cases, it will be convenient to introduce a certain limitation in regard to the nature of the system considered. We have necessarily supposed, in all our treatment of systems canonically distributed, that the system considered was such as to be capable of the canonical distribution with the given value of the modulus. We shall now suppose that the system is such as to be capable of a canonical distribution with any (finite)
[7] modulus. Let us see what cases we exclude by this last limitation.
The impossibility of a canonical distribution occurs when
the equation
|
(361)
|
fails to determine a finite value for
. Evidently the equation cannot make
an infinite positive quantity, the impossibility therefore occurs when the equation makes
. Now we get easily from (191)
|
|
If the canonical distribution is possible for any values of
, we can apply this equation so long as the canonical distribution is possible. The equation shows that as
is increased (without becoming infinite)
cannot become infinite unless
simultaneously becomes infinite, and that as
is decreased (without becoming zero)
cannot become infinite unless simultaneously
becomes an infinite negative quantity. The corresponding cases in thermodynamics would be bodies which could absorb or give out an infinite amount of heat without passing certain limits of temperature, when no external work is done in the positive or negative sense. Such infinite values present no analytical difficulties, and do not contradict the general laws of mechanics or of thermodynamics, but they are quite foreign to our ordinary experience of nature. In excluding such cases (which are certainly not entirely devoid of interest) we do not exclude any which are analogous to any actual cases in thermodynamics.
We assume then that for any finite value of the second member of (361) has a finite value.
When this condition is fulfilled, the second member of (359) will vanish for . For, if we set ,
|
|
where
denotes the value of
for the modulus
. Since the last member of this formula vanishes for
, the less value represented by the first member must also vanish for the same value of
. Therefore the second member of (359), which differs only by a constant factor, vanishes at the upper limit. The case of the lower limit remains to be considered. Now
|
|
The second member of this formula evidently vanishes for the value of
, which gives
, whether this be finite or negative infinity. Therefore, the second member of (359) vanishes at the lower limit also, and we have
|
|
or
|
(362)
|
This equation, which is subject to no restriction in regard to the value of
, suggests a connection or analogy between the function of the energy of a system which is represented by
and the notion of temperature in thermodynamics. We shall return to this subject in Chapter XIV.
If , the second member of (359) may easily be shown to vanish for any of the following values of viz.: , , , , where denotes any positive number. It will also vanish, when , for , and when for . When the second member of (359) vanishes, and , we may write
|
(363)
|
We thus obtain the following equations:
If ,
|
(364)
|
|
(365)
|
or
|
(366)
|
|
[8](367)
|
|
(368)
|
If
,
|
[9](369)
|
If
,
|
|
or
|
(370)
|
whence
|
(371)
|
Giving
the values 1, 2, 3, etc., we have
|
|
|
|
as already obtained. Also
|
(372)
|
If is a continuous increasing function of , commencing with , the average value in a canonical ensemble of any function of , either alone or with the modulus and the external coördinates, is given by equation (275), which is identical with (357) except that , , and have the suffix . The equation may be transformed so as to give an equation identical with (359) except for the suffixes. If we add the same suffixes to equation (361), the finite value of its members will determine the possibility of the canonical distribution.
From these data, it is easy to derive equations similar to (360), (362)-(372), except that the conditions of their validity must be differently stated. The equation
|
|
requires only the condition already mentioned with respect to
. This equation corresponds to (362), which is subject to no restriction with respect to the value of
. We may observe, however, that
will always satisfy a condition similar to that mentioned with respect to
.
If satisfies the condition mentioned, and a similar condition, i. e., if is a continuous increasing function of , commencing with the value , equations will hold similar to those given for the case when , viz., similar to (360), (364)-(368). Especially important is
|
|
If
,
(or
),
all satisfy similar conditions, we shall have an equation similar to (369), which was subject to the condition
. And if
also satisfies a similar condition, we shall have an equation similar to (372), for which the condition was
. Finally, if
and
successive differential coefficients satisfy conditions of the kind mentioned, we shall have equations like (370) and (371) for which the condition was
.
These conditions take the place of those given above relating to . In fact, we might give conditions relating to the differential coefficients of , similar to those given relating to the differential coefficients of , instead of the conditions relating to , for the validity of equations (360), (363)-(372). This would somewhat extend the application of the equations.