To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
In any area of physical science there are stages in its historical development at which it becomes possible and desirable to pull its theory together into a more coherent and unified whole. Grand examples from physics spring immediately to mind – Maxwell's electro-magnetic theory, Dirac's synthesis of wave-mechanics and matrix mechanics, the Salam–Weinberg electro-weak theory. Yet this process of unification takes place on every scale. One example of immediate concern to us in this book here is afforded by the emergence of the thermodynamics of irreversible processes, where the coming together of separate strands of theory is made plainly visible in the short book by Denbigh (1951). In connection with imperfections in crystal lattices the Pocono Manor Symposium held in 1950 (Shockley et al. 1952) might be said to mark another such stage of synthesis. At any rate a great growth in the understanding of the properties of crystal imperfections occurred in the years following and this carried along a corresponding achievement in understanding atomic transport and diffusion phenomena in atomic terms. Around 1980, in the course of a small Workshop held at the International Centre for Theoretical Physics, Trieste we concluded from several observations that such a stage of synthesis could soon be reached in the study of atomic transport processes in solids. First of all, despite the great growth in detailed knowledge in the area, the theory, or rather theories, which were used to relate point defects to measurable quantities (e.g. transport coefficients of various sorts) were recognizably still largely in the moulds established considerably earlier in the 1950s and 1960s, notwithstanding the advances in purely numerical techniques such as molecular dynamics and Monte Carlo simulations.
In the previous chapter we considered the evaluation of the macroscopic transport coefficients for models of substances displaying small degrees of disorder. In this chapter we turn to the evaluation of the effects of atomic movements upon the relaxation processes arising in nuclear magnetic resonance, as defined in §1.7 in Chapter 1. Our reason for devoting a whole chapter to this subject is that nuclear magnetic relaxation is the most widely applicable nuclear technique for studying atomic migration in solids after the direct application of radioactive tracers. Furthermore, it is still extending in various new directions, most notably for our purposes by the use of nuclear methods of alignment and detection with unstable nuclei, especially β-emitters (see e.g. Ackermann, Heitjans and Stöckmann (1983), Heitjans (1986) and Heitjans, Faber and Schirmer (1991)).
Except for self-diffusion coefficients determined by pulsed field gradient methods, the task which this subject presents to theory is rather different from that tackled in the previous chapter and it presents different mathematical problems, some of which derive from the need to deal with a two-particle correlation function (cf. (1.7.4)), while others derive from the nature of the physical interaction between the particles. For the most part the interaction we are concerned with is the nuclear magnetic dipole–dipole interaction, since this is always present (so long as the nuclear spin I > 0, i.e. so long as n.m.r. is possible). We shall also give some attention to the interaction between internal electric-field gradients and nuclear electric-quadrupole moments (I > 1/2). The dependence of these interactions upon relative position and orientation introduces mathematical complications, …
In this and following chapters we shall consider those theories which are based on microscopic models of atomic movements and which allow us to obtain expressions for the macroscopic phenomenological coefficients introduced in the preceding two chapters. The principle on which all these particular models are founded has already been introduced in §§2.3 and 3.11 and is that the motion of atoms (or ions) of the solid may be divided into (i) thermal vibrations about defined lattice sites and (ii) displacements or ‘jumps’ from one such site to another, the mean time of stay of an atom on any one site being many times both the lattice vibration period and the time of flight between sites. The justification of this principle is provided widely in solid state physics. In general terms it derives from the strong cohesion of solids and the associated strong binding of atoms to their assigned lattice sites. This leads to well-defined phonon spectra such as have now been observed, especially by neutron scattering methods, in a wide range of solids. Theories of solid structures also show that there are generally large energy barriers standing in the way of the displacement of atoms from one site to another, even in the presence of imperfections in the structure, with the consequence that such displacements must be thermally activated and will occur relatively infrequently compared to the period of vibrational motions. Thus even the highest diffusion rates (say D ∼ 10−9m2 s−1) correspond to a mean time of stay τ ∼ 10−11 s, i.e. 100 times longer than the period of the lattice vibrations. It is more difficult to obtain information about the time …
In the last two chapters we have studied kinetic theories of relaxation and diffusion as specific representations of the general master equation (6.2.1). Such analyses allowed us to obtain insight into a number of aspects of these processes, especially in dilute systems, e.g. the L-coefficients, the way correlations in atomic movements enter into diffusion coefficients, the relation of diffusion to relaxation rates and so on. In this chapter and the next we turn to another well-established body of theory, namely the theory of random walks. This too can be presented in the context of the general analysis of Chapter 6 and additional insights obtained.
The basic model or system which is analysed in the mathematical theory of discrete random walks is that of a particle (or ‘walker’) which moves in a series of random jumps or ‘steps’ from one lattice site to another. It can be used to represent physical systems such as interstitial atoms (e.g. C in α-Fe) or point defects moving through crystal lattices under the influence of thermal activation, as long as the concentrations of these species are low enough that their movements do not interfere with one another. The mathematical theory of such random walks has received considerable attention and is well recorded in many books and articles (recent examples include Barber and Ninham, 1970 and Haus and Kehr, 1987). For this reason it would be superfluous (and impractical) to go over all the same ground again here. Nevertheless there are various results which are useful in the theory of atomic transport either directly (e.g. in the evaluation of transport coefficients accordings to eqns.
In the previous chapter we surveyed the ideas of point defects which form the physical basis for the description of diffusion and other atomic transport phenomena. In the present chapter we shall describe the application of equilibrium statistical mechanics to these physical models under conditions of thermodynamic equilibrium. This will allow us to obtain the contributions of thermally created point defects to thermodynamic quantities, e.g. lattice expansions, specific heat, etc. But there is a wider interest in such calculations, as will be fully apparent in later chapters. For, although the processes of atomic transport involve systems not in overall thermodynamic equilibrium, the theory of these processes nevertheless assumes the existence of local thermodynamic equilibrium; i.e. of equilibrium in regions small compared to the size of the entire macroscopic specimen but large enough to allow a thermodynamic description locally. There are two aspects to this, one concerns the chemical potentials μi and the other the transport coefficients, Lij (cf. §1.4). Statistical thermodynamics allows us to calculate the chemical potentials of the various components, i, of any particular model system as functions of the intensive thermodynamic variables. Gradients of these chemical potentials then give the thermodynamic forces Xi which generate the fluxes of atoms, defects, etc. under non-equilibrium conditions. For the transport coefficients we shall need the equilibrium concentrations of point defects and certain related quantities. Statistical thermodynamics again will provide these for particular models as a function of thermodynamic variables (e.g. P, T) without regard to the mechanism by which this equilibrium is attained (although, of course, it is implicit that the density of defect sinks and sources is adequate).
The processes of the migration of atoms through solids enter into a great range of other phenomena of concern to solid state physics and chemistry, metallurgy and materials science. The nature of the concern varies from field to field, but in all cases the mobility of atoms manifests itself in many ways and contributes to many other phenomena. The study of this mobility and its physical and chemical manifestations is thus a fundamental part of solid state science, and one which now has a substantial history. Like much else in this science it is for the most part concerned with crystalline solids, although it also includes much which pertains also to glasses and polymers. Unfortunately, it is not possible to treat crystalline and non-crystalline solids together in any depth. The present work, which is concerned with the fundamentals of these atomic transport processes, is thus limited to crystalline solids, although occasionally we can ‘look over the fence’ and see implications for non-crystalline substances. Even with the study of crystalline solids there is a further important sub-division to be made, and that is between atomic movements in good crystal, where the arrangement of atoms is essentially as one expects from the crystal-lattice structure, and in regions where this arrangement is disturbed, as at surfaces, at the boundaries between adjoining grains or crystallites, along dislocation lines, etc. The atomic arrangements in these regions are not by any means without structure, but the structures in question are diverse and far from easy to determine experimentally. Since the basis of much of what we have to say is the assumption that there is a known and regular …
In the preceding chapter we reviewed a number of properties of crystalline solids which demonstrate the movement and migration of atoms through these solids. The atomic theory of these properties is the subject of this book, and in this chapter we review the relevant basic mechanisms. Fundamental to these are questions of structure, both the ideal lattice structure of the perfect crystal, as one would determine it by X-ray or neutron diffraction, and those local modifications of this crystalline arrangement of atoms, called imperfections or defects, which facilitate this movement of atoms through the body of the crystal. Mostly we shall be dealing with systems in which the fraction of atoms contained in imperfect regions of the crystal is small, i.e. we are dealing with nearly perfect crystals. Notable exceptions, which we shall consider to varying degrees in this and later chapters, include order–disorder alloys, fast ion conductors and concentrated solutions of hydrogen in metals.
There are two important and interesting general facts about these crystal imperfections. This first is that there is not an arbitrary number of distinct types, but just a few. This is true for topological reasons even if we represent the solid as a continuum, when we would classify the elementary imperfections as point defects, dislocations (linear) and surfaces (two-dimensional). In a crystal lattice there are others, such as stacking faults and grain boundaries, for which there are no analogues in a continuum, but the total number of distinct types remains small. The second general fact is that the same classification of imperfections is useful irrespective of the type of bonding between the atoms of the solid.
In this chapter we show how to use some aspects of the random-walk theory to describe the macroscopic diffusion of solute atoms (and isotopic atoms) caused by point defects. In doing so we have to handle solute movements and defect movements together, rather than the random walk of just one type of ‘walker’ or ‘particle’, as in the preceding chapter. This presents a more complex problem; nevertheless the approach we shall describe has provided a major part of the subject for many years. It was started by Bardeen and Herring (1952) who drew attention to the fact that the movement of solute atoms by the action of point defects was such that the directions of successive jumps of the solute atom were necessarily correlated with one another for the reasons already presented in Chapters 2 and 5. In Chapter 8 we showed how these correlated movements are handled in kinetic theories. However, it is the calculation of these correlation effects by random-walk theory which has engaged the attention of theoreticians to a considerable extent. Although the consequence of these effects for isotopic self-diffusion may be to introduce only a simple, numerical factor into the expression for the diffusion coefficient, the consequences for solute diffusion can be qualitatively significant, particularly when the motion of the defect is substantially affected in the vicinity of the solute. Under these same conditions qualitatively significant correlations can also appear in the self-diffusion of the solvent. The calculation of these correlation effects is thus an important part of this chapter.
In the previous chapter we showed how the thermally activated movements of defects of low symmetry (e.g. solute–defect pairs) may give rise to dielectric and anelastic relaxation processes. To do so we employed a particular example of the master equation (already introduced in Chapter 6) to represent the movement of these defects among the possible configurations and orientations open to them. These systems were, by assumption, spatially uniform, but it is natural to seek to extend such a theoretical approach to spatially non-uniform systems in which necessarily there will be diffusion processes taking place. Such extensions are the subject of this chapter. We call them kinetic theories because in them we are concerned with the changes in time of the distributions of the solute atoms, defects, solute–defect pairs, etc. in space and among available configurations. We shall use such approaches to find expressions for the diffusion coefficients and the more general transport coefficients of non-equilibrium thermodynamics for dilute alloys and solid solutions. Both interstitial and substitutional solid solutions are considered. In doing so we shall give quantitative expression to the correlation effects anticipated in atomic migration coefficients when defect mechanisms are active (cf. §§2.5.3 and 5.5). The treatments are mostly limited to isothermal systems, although they can be extended to systems in a thermal gradient.
Kinetic theories come in a variety of forms, depending on the characteristics of the systems of interest and on the physical quantities to be represented (e.g. transport coefficients, quasi-elastic scattering functions, Mössbauer cross-sections, etc.). As elsewhere, the aim is to achieve a useful level of generality and this is achieved for systems dilute in both defects and solute atoms.
The development in the previous chapter has necessarily been fairly abstract. In particular, although it is clear that the flux equations, when taken in conjunction with the corresponding continuity equations, in principle allow us to calculate the course of particular diffusion processes, we have not been able to give many indications of the physical characteristics of the L-coefficients appearing in these flux equations. This is, of course, to be expected at this stage since much of what follows in later chapters is about the theory of these quantities themselves. For the moment therefore they remain phenomenological coefficients, functions of thermodynamic variables (T, concentrations and stress) and governed by Onsager's theorem. It is helpful therefore at this stage to look at the relations between these L-coefficients and more familiar phenomenological coefficients adequate in special situations and which have been studied experimentally, e.g. chemical and isotopic diffusion coefficients, electrical ionic mobilities and ionic conductivity. Throughout we shall keep our assumptions about the nature of the systems being considered fairly broad. In particular, we shall not go out of our way to deal explicitly with dilute solid solutions, even though these are enormously important in fundamental studies. The more detailed results which are obtained for such systems are considered later after we have presented the atomic theory of the L-coefficients and evaluated them for particular models.
We begin by considering one of the simplest applications of the formalism, namely that to interstitial solid solutions in metals (§5.2). The simplicity of these systems allows us to describe several distinct phenomena with a minimum of mathematical manipulation.
Although the overall chemistry of a polymerization might be obvious in terms of monomers used and polymer produced, this knowledge does not ensure that the details of the chemistry are understood. Polymerizations, especially chain polymerizations, can have a number of intermediate steps that lead to the final result. These intermediate steps or mechanism need to be known if the polymerization itself is to be fully understood. The kinetics associated with polymerization are the principal manifestation of the mechanism underlying the overall chemical process. If a specific polymerization mechanism is proposed then the kinetic equations that ensue from the mechanism should lead to relations for the progress through the polymerization of observables such as the rate of monomer consumption and molecular weight build-up that are in agreement with experiment. If it is desired to have control over the polymerization process in the sense of being able to manipulate results such as molecular weight and its distribution, rate of polymerization, or composition in copolymerizations, then it is highly desirable to have a successful mechanistic and kinetic formulation of the process that expresses the effects of the possible variables.
Step polymerization
The description of the kinetics of step polymerization is greatly simplified if it is realized that the reactivities of the as yet unreacted end groups (carboxyl, amine, etc.) are not sensitive to the length of the chain to which they are attached.
Since the achievement of the physical properties characteristic of high polymers depends critically on molecular weight, it is most important to be able to define and measure this quantity. In this chapter definitions are taken up and basic concepts illustrated, largely by the example of step polymerization. The companion conclusions with respect to chain polymerization are necessarily deferred until the treatment of polymerization kinetics is undertaken (in Chapter 4). The subject of branching and crosslinking is also introduced. In the next chapter (Chapter 3) the various methods for measuring molecular weight are considered.
Before proceeding, it is worth giving a brief overview of how certain key material properties depend on molecular weight. In doing so, no effects of molecular weight distribution are considered, only some broad generalizations that are based on average molecular weight are taken up. These properties naturally group themselves into three categories and the effect of molecular weight on each of them is different. The first category is what might be considered typical thermophysical properties such as melting point, heat capacity and, especially important, elastic properties. All of these have the characteristic that they ‘saturate’ with molecular weight. That is, they become independent of molecular weight as the latter increases. Figure 2.1 shows the melting points of alkane or paraffin crystals as chain length increases toward their high molecular weight counterpart, polyethylene.
In Chapter 6 it was pointed out that comparison of the mean unperturbed solution dimensions, i.e., the characteristic ratio, derived from experiment and from calculations based on conformational models could form an important testing ground for the structural models. It was concluded, however (Section 6.1.4), that in order for realistic comparisons to be made, the computation of characteristic ratio would have to allow for interactions between nearby bonds. The steric interference between a pair of adjacent gauche bonds of opposite sense in a three-state chain was given as an important example of such interactions between nearby bonds. It is possible to formulate a treatment of chains where adjacent bond pairs interact. This is accomplished by means of a representation of the statistical mechanical partition function of a one-dimensional chain using the methods of matrix algebra (Kramers and Wannier, 1941) and adapted to the polymer chain conformation problem (Birshtein and Ptitsyn, 1959, 1966; Birshtein, 1959; Nagai, 1959; Lifson, 1959). This development and some applications to real chains are described in this chapter.
The partition function and scalar averages for a one dimensional chain with interacting bonds
In Section 6.3.1 the classical configurational partition function was written in equation (6.44) as an integration over all chain configurations. In the present development, attention will be focused on a discrete number of local bond states, those judged as important in the rotational isomeric state context (see Chapter 5, especially Section 5.1.1).
If one end of a slip of Caoutchouc be fastened to a rod of metal or wood, and a weight be fixed to the other extremity, in order to keep it in a vertical position; the thong will be found to become shorter with heat and longer with cold.
Gough, 1805
A material that can be deformed quickly to several hundred per cent strain, recovers rapidly and completely upon removal of the stress and is capable of having the process repeated numerous times is described as rubbery or elastomeric. The possibility of this behavior is due to the flexible long chain nature of polymer molecules and presents a type of response to mechanical deformation that is fundamentally quite different from the response given by rigid materials such as metals, ceramics and glassy or semi-crystalline polymers. This behavior is often called ‘entropy elasticity’ in contrast to the ‘energy elasticity’ of more familiar materials. The resistance to deformation is due largely to an entropy decrease rather than an energy increase. Entropy elasticity demonstrates itself in easily observed thermodynamic behavior such as the contraction of a stretched rubber band with increasing temperature as described by Gough (1805) in the quotation above. In this chapter, the contrast of energy vs entropy elasticity in thermodynamic behavior is developed. Then the molecular theory of rubber elasticity is discussed and its applicability to real elastomers is considered. Rubber elasticity is intimately connected with the presence of a network.