Cluster Expansions for Thermodynamics and Kinetics of Multicomponent Alloys

Journal of Phase Equilibria and Diffusion, Nov 2015

Cluster expansions have proven a very useful tool to model thermodynamics and kinetics of substitutional alloys in metallic, ionic, and even covalently bonded systems. Cluster expansions are usually obtained with the structure inversion method in which the energies, or other relevant property, of a set of structures are used to obtain expansion coefficients. The expansion coefficients are multipliers of correlation functions which pertain to clusters of sites on the parent lattice. There are significant practical issues associated with obtaining a cluster expansion, such as selecting which structures and especially which correlation functions are required for an adequate description of the energy. While these issues are significant for binary alloys, they become much more daunting when dealing with multicomponent alloys. Moreover, oftentimes interest is not limited to the energetics of the thermodynamic equilibrium state, but the evolution of quenched alloys with time is just as important. The treatment of diffusion within the context of cluster expansions is then another challenge. The article describes a formal method for utilizing cluster expansions for transition states as occur during vacancy mediated diffusion in substitutional alloys. The methods are illustrated with some applications to the prediction of initial coherent precipitates in Al-Cu and Al-Mg-Si alloys.

A PDF file should load here. If you do not see its contents the file may be temporarily unavailable at the journal website or you do not have a PDF plug-in installed and enabled in your browser.

Alternatively, you can download the file locally and open with any standalone PDF reader:

Cluster Expansions for Thermodynamics and Kinetics of Multicomponent Alloys

Journal of Phase Equilibria and Diffusion C l u s t e r E x p a n s i o n s f o r T h e r m o d y n a m i c s a n d K i n e t i c s o f M u l t i c o m p o n e n t A l l o y s Xi Zhang 0 Marcel H.F. Sluiter 0 0 Xi Zhang and Marcel H.F. Sluiter, Department of Materials Science and Engineering, Delft University of Technology , Mekelweg 2, 2628 CD Delft , The Netherlands. Contact Cluster expansions have proven a very useful tool to model thermodynamics and kinetics of substitutional alloys in metallic, ionic, and even covalently bonded systems. Cluster expansions are usually obtained with the structure inversion method in which the energies, or other relevant property, of a set of structures are used to obtain expansion coefficients. The expansion coefficients are multipliers of correlation functions which pertain to clusters of sites on the parent lattice. There are significant practical issues associated with obtaining a cluster expansion, such as selecting which structures and especially which correlation functions are required for an adequate description of the energy. While these issues are significant for binary alloys, they become much more daunting when dealing with multicomponent alloys. Moreover, oftentimes interest is not limited to the energetics of the thermodynamic equilibrium state, but the evolution of quenched alloys with time is just as important. The treatment of diffusion within the context of cluster expansions is then another challenge. The article describes a formal method for utilizing cluster expansions for transition states as occur during vacancy mediated diffusion in substitutional alloys. The methods are illustrated with some applications to the prediction of initial coherent precipitates in Al-Cu and Al-Mg-Si alloys. aluminum alloys; cluster expansion; diffusion modeling; kinetics; multicomponent alloys; thermodynamics 1. Introduction Cluster expansions (CEs) have become a ubiquitous tool in computational alloy theory[ 1–21 ] and even in seemingly completely unrelated fields such and electronic band gap engineering and protein sequencing.[ 22–25 ] It is used to express thermodynamic and other properties as function of the configurational order of alloys. Most of the applications focus on binary alloys. However, the most interesting and the most realistic alloys are generally multi-component alloys. Although there are several CE studies on ternary alloys,[ 26–32 ] usually a presupposed cluster, such as the irregular tetrahedron in the case of bcc lattices, is selected while ignoring convergence issues. On the other hand, more 2. Theory 2.1 Cluster Probabilities, Sum Rules, and Correlation Functions Here, without loss of generality, we will assume that the expanded property is the energy E. The CE assumes that the energy can be written as a rapidly converging sum over cluster contributions where contributions from larger clusters become negligible. A practical example of this idea is common in organic chemistry where the formation enthalpy of a molecule can be expressed approximately as a sum of contributions from nearest neighbor pairs (read bonds) only. In the case of ethane one would count one C-C pair and six C-H pairs and estimate that the formation energy with respect to the isolated atoms is given as the sum of the single C-C bond energy and the six C-H bond energies. Contributions from larger clusters such as triplets usually provide minor corrections only. We will now return to the case of crystalline bulk alloys and limit ourselves to the case where every atom can be uniquely identified with one and only one atomic position. That is, the alloy becomes perfectly periodic with a relatively small unit cell when all atoms were to become indistinguishable. There is a well-defined ‘‘fixed’’ underlying grid of atomic sites. In such a case clusters can be uniquely identified without any ambiguity on how to classify any group of atomic sites. There would then never be any doubt whether a given pair is a 1st or a 2nd nearest neighbor pair. Another aspect of a ‘‘fixed underlying grid is that the configuration of the alloy would be completely specified by the atomic occupation for every site, as in the case of e.g. the Ising model. In analogy with the Ising model we could assign the occupation of every site with a site occupation variable, riðPÞ ¼ where P designates a particular atomic species, the parentheses remind us it does not refer to exponentiation.[ 40,41,32 ] To be completely general we will consider a vacancy also as an atomic species. If there are N atomic species in the alloy, only N 1 need to be specified because of a sum rule: riðjPQÞ ¼ riðPÞrjðQÞ: It signifies that every site is occupied by one and only one atom. The occupation of an arbitrary pair cluster consisting of sites i and j with occupations P and Q can be described in terms of site occupation variables Here again, sum rules apply for every individual pair P¼1 Q¼1 N N X X riðjPQÞ ¼ 1 8ij and P¼1 N X riðjPQÞ ¼ rjðQÞ: Therefore, it is necessary to only specify the N 1 occupations of sites i and j and to only specify the pair occupations that contain exclusively the N 1 atomic species in order to fully designate the specific atomic occupancy of the ij pair cluster. It is not necessary to specify the occupation of a point or pair cluster that contains one or more atoms of type N . This generalizes to clusters of multiple sites: a three body cluster consisting of sites i, j, k with occupations P, Q, R can be described in terms site occupation variables riðjPkQRÞ ¼ riðPÞrjðQÞrðkRÞ: PQR where an N type atom occurs The value of any rijk follows from sum rules and occupations of contained points, contained pairs, and triangle involving N 1 species exclusively. The specific occupation of any particular cluster is generally not of interest in an infinitely large crystal. Then, the cluster probabilities are much more informative. Cluster probabilities are averages over equivalent clusters, where equivalence derives from the symmetry of the underlying grid of atomic sites. Angle brackets are used to designate cluster probabilities. DriPjQE with sites i and j nearest neighbors, thus indicates the probability that any nearest neighbor pair in the infinite crystal is occupied by a PQ pair of atoms. Obviously, this probability shall always be in the interval [ 0,1 ]. The previously mentioned sum rules also apply to cluster probabilities so that probabilities exclusively pertaining to the N 1 atomic species only are needed for a full description of probabilities. In other words, the cluster probabilities pertaining to N 1 atomic species present a complete set of independent basis functions of the alloy configuration. Therefore, unlike the complete set of cluster probabilities involving all N atomic species, the subset of the cluster probabilities pertaining to the N 1 atomic species is a suitable basis for expanding properties that depend on the alloy configuration. This subset is a nonunique choice as a definition of correlation functions. Nonunique for several reasons; site occupations expressed as 0 or 1 instead of e.g. the Ising definition of +1 and 1; and because in our definition any one of the N atomic species can be declared redundant. In the following we proceed with the assumption that the Nth species has been declared redundant.[ 42 ] The correlation function expansion of the energy per atom can now be expressed as E½s ¼ X Jamahra½s i: a E½s is the energy per atom of the structure s, a is short-hand notation for the index of a particular correlation function hrai ¼ DriðjP..Q. k... RÞE, Ja is the effective cluster interaction (ECI) pertaining to correlation a per occurrence, and ma is the multiplicity, or the number of clusters a per atom. It is to be noted that considering crystal structures with interstitials may benefit from defining multiplicities and structural energies with reference to a lattice point of the disordered structure. However, here we shall avoid such complications and assume we are dealing with simple underlying lattices with just one atom per lattice point in the disordered state as for common fcc- and bcc-based alloys. Often, it is handier to combine ma and Ja into one term, J~a ¼ ma Ja, the ECI per atom. In the case of a binary AB alloy the probabilities of the pure A clusters only are required for an expansion of E[s]. So for every cluster there is one and only one correlation function. It is now apparent why the actually correct term ‘‘correlation function expansion’’ is usually replaced with the much more common ‘‘cluster expansion’’ as initially binary alloys only were considered. Expansions based on cluster probabilities can be much more efficient than those based on the Ising convention—even in the case of binary alloys. When the majority type atom is considered as the redundant species, there will be very few clusters consisting of minority atoms only. In fact, in a large simulation cell, say for a Monte Carlo simulation, one can maintain a list of the minority atoms to count quickly how many of those pure minority atom clusters are present. In contrast, using the Ising definition, the particular sign of every instance of a cluster must be evaluated before the value of the corresponding correlation function is known. The more dilute an alloy, the greater the advantage of the current definition.[ 42 ] 2.2 Inheritance of Expansion Coefficients Another advantage of the current description is that it is quite trivial to build up an N component CE using, and retaining exactly, the CEs of N 1 subsystems comprised of N-1 components. In other words, when modeling a ternary alloy ABC one can reuse the ECIs of the AC and BC binaries. The reason for this is that the correlation functions as defined here form an independent basis. In the case of an ABC ternary it is quite simple to rationalize the preservation of ECIs from the AC and BC binaries: E½s ¼ The RHS contains three sums over correlation functions deriving from clusters a. The 1st of the three sums refers to correlation functions pertaining to AC alloys only because there is no B present. Likewise the 2nd term refers to BC alloys only because there are zero A atoms. The last term only contains correlations that pertain to both A and B atoms. It follows that for a ternary alloy ABC the pure A ECIs can be copied from the AC binary and the pure B ECIs can be copied from the B-C binary provided that the C species is chosen as the redundant species. This can be formulated as: J~aAn=ABðCÞ ¼ JaAn=AðCÞ and J~aBn=ABðCÞ ¼ ~ ~ ð Þ refers to the ECI JaBn=BðCÞ, where the notation aPQ=AB C of cluster type a (here pair only) with decoration PQ in the alloy ABCwith the species C as eliminated species, and where the subscript n indicates that it is valid for all n-body pure A and pure B clusters. ECIs from the AB binary are not as trivially preserved exactly. If the CE is limited to pairwise terms only, i.e. clusters a contain two sites at most, it requires a minor algebraic operation only. The ABpair interactions of the A-B-C ternary can be obtained simply from J~aAB=ABðCÞ ¼ J~aAA=AðCÞ þ J~aBB=BðCÞ J~aAA=AðBÞ. The reason that it is more complicated to derive a similar equation when the CE contains 3-site and larger clusters, is that the 3-site and larger cluster interactions of mixed AB type all contain energy terms that include all three species simultaneously. Of course, such energy terms cannot originate from a description of any binary. It follows that in the case of pair clusters, and it then does not matter up to which neighbor shell the pairs are considered, all three atomic species cannot be present in one cluster. By induction it then follows that for a quaternary alloy ABCD with D as the eliminated species the ABC ternary ECIs can be exactly preserved provided that the ECIs are limited to clusters of 3 sites or fewer. For a quinary alloy ABCDE all subsystem quaternary CE coefficients can be preserved if no clusters larger than 4-sites are considered. Generally, ECIs pertaining to 4-site clusters are already quite small, and generally it is not necessary to go to 5-site or larger clusters. The implication is that CEs of many-component alloys are completed determined by their subsystems. Another important consequence is that once CEs for all quaternary systems are known, no new CEs need to be performed for quinary or higher component systems because the ECIs are already known from their quaternary subsystems. This is quite a remarkable conclusion given that generally it is believed that determining CEs for many component alloys is impossibly complicated due to the combinatorial explosion of possible combinations of correlation functions that such a CE entails. It should also be pointed out that the inheritance of ECIs to higher order alloy systems is analogous to the extrapolation methods well-known to CALPHAD practitioners.[ 43 ] To fully take advantage of this inheritance of ECIs from subsystems, it is of course necessary that consistently the same set of underlying clusters is used for all the subsystems. Then a new question arises, one may determine the optimal set of clusters for a CE in some specific binary, but how can one find the optimal set of clusters for a large group of binaries? 2.3 Completeness Another question that arises is how a change of basis, say for re-using the ECIs of the ABCD quaternary in the ABCDE quinary, affects the selection of correlation functions to be used in the CE. This question has been addressed already earlier where it was concluded that a completeness criterion must be met[ 42 ]: When a certain cluster is included in the CE all its subclusters must be included as well.[ 44,45 ] In multicomponent alloys this completeness criterion must be generalized for a CE to remain invariant under transformation of the occupation variable. For a multicomponent alloy the completeness criterion can be shown to require that if a certain correlation function is included in a CE, (a) all correlations associated with that cluster must be included, and (b) all correlations associated with the subclusters must be included also. The multicomponent completeness criterion can be derived in a completely analogous way as was done for a binary.[42] 2.4 Pools of Correlation Functions In the following sections we will therefore focus on the issue of selecting the optimal set of clusters for a series of binaries. We will limit ourselves to fcc Al-base alloys. The set of cluster will be selected from a certain ‘‘pool’’. This pool is defined by two criteria: (a) only clusters with up to and including 4 sites are considered, and (b) in such clusters no two sites shall be further apart than the 8th nearest neighbor. We designate this pool as N4R8. The justification of these criteria is as follows: the enthalpy of mixing is generally represented as a polynomial in the composition. The highest power in this polynomial is an indication of the number of sites in the largest cluster that significantly contributes to the alloy energy. E.g. a parabolic enthalpy of mixing, symmetric around equiatomic composition, indicates that only 2 body (i.e. pairwise) interactions are needed and three- four- and larger body interaction terms can be neglected. Extensive experience in fitting binary phase diagrams has shown that usually a sub-regular (i.e. a third order polynomial) or more rarely, a fourth order polynomial in the composition is completely adequate to represent the enthalpy of mixing accurately. Therefore, ECIs with 4 sites (N4) should generally be adequate. With the exception of long-period superstructures, such as in Cu-Pd, all experimentally observed fcc superstructures can be stabilized by pairwise interactions with ranges up to the 8th nearest neighbor (R8). The experimentally observed Al3Zr phase (StrukturBericht notation:D022) is stabilized by the 8th nearest neighbor (2 0 0 afcc, see Fig. 1). The N4R8 pool contains 1 point cluster, 8 pair clusters, 50 three body, and 427 four body clusters for a total of 486 clusters. In addition we must count the so-called empty cluster with zero sites. The ‘‘cluster interaction’’ corresponding to the empty cluster represents a constant off-set in the energy. Including this empty cluster there are 487 clusters in the N4R8 pool. The versatility of the N4R8 set of clusters is nicely illustrated by the accurate representation of the GP morphology. A particularly challenging issue for any CE without explicit treatment for long-ranged elastic interactions is the prediction of GP zone shapes.[ 46 ] In Al-Cu with about 2 a/o Cu, GP zones are known to form as pure Cu on {100} planes separated by 2 or 3 atomic planes of pure Al. Initially, it is believed that there are 3 intermediary Al planes, but absorption of vacancies and concomitant relaxation rapidly reduces this to 2 atomic Al layers.[ 47–49 ] Describing the Cu-Al-Al-Al sandwich morphology requires a highly optimized CE with a LOOCV of just 1 meV/atom, with about 200 fitted structural energies and 80 ECIs. see Fig. 2. 2.5 On the Impossibility of Finding the Best Expansion Finding the best CE for a certain alloy, using a large set of structural energies, and using a pool of clusters with N members is a daunting task. The total number of combinations PðN Þ that can be generated by selecting arbitrarily a set of clusters with anywhere from 1 to N members can be calculated as follows: PðN Þ ¼ N ð 1Þ!ð1Þ! þ ðN N ! 2Þ!ð2Þ! þ þ ðN N ! N Þ!ðN Þ! ¼ N X i¼1 N ! N i ¼ 2N 1: The value of PðN Þ is easily understood because each cluster either is included, or is not included, which gives 2N , but we explicitly excluded the possibility of a completely empty combination so that one has to be subtracted. When N ¼ 487 the value of P is approximately 4 10146, and this is considering a single binary only! It is not feasible to evaluate all possible CEs and select the best performing one exactly. One has to settle for a ‘‘good’’ CE, rather than expect to find the ‘‘best’’ CE. How well a CE performs can be judged with some fitness criterion. Common fitness criteria are (a) the root mean square fitting error for the known structural energies, sometimes modified by other criteria such as whether or not correct ground states are produced by the CE[ 50 ]; (b) a measure of ‘‘predictive ability’’ of a CE,[ 14,50,51 ] such as the leave-one-out crossvalidation score[ 44 ] or leave-many-out cross-validation score.[ 3 ] Several methods for finding ‘‘good’’ CE have been proposed. Various statistical methodologies have been discussed in e.g. Ref 52,44,53, and 54. Here we mention 4 methods. (I) ‘‘aufbau’’ where a given CE is expanded with a single new cluster, one at a time.[ 55 ] All not yet included clusters are considered. Among the set of not yet included clusters the one that gives the best improvement in the fitness criterion is selected. Now, a new CE has been generated and the process is repeated. Repetitions stop either when the maximal number of clusters in the CE is reached, or when the fitness criterion improves too little, or deteriorates. (II) Singular value decomposition.[ 56 ] When there are not precisely N structural energies available, the hra½s i matrix is not square and only a pseudo-inverse can be defined. Usually, the number of structural energies is less than N , so that there is an under-determined set of equations to be solved. Oftentimes the hra½s i matrix is ill-conditioned for various reasons which to some extent can be mitigated by adding very small random numbers. Generally, this methods works best when the set of equations is overdetermined, i.e. when the number of structural energies is larger than N . This is in practice rarely the case. When this method succeeds in generating a CE it is optimal in the sense that it produces a small fitting error. The SVD method can be modified in order to better utilize insight in underlying physical properties, see Ref 57. (III) Genetic algorithm.[ 11 ] A population of distinct CEs is allowed to ‘mate’ (i.e. exchange expansion terms) and the resulting offspring is then culled on the basis of the fitness criterion. This procedure of is repeated for a number of generations, till the fitness criterion no longer improves. (IV) Enumeration in combination with completeness and mandatory clusters.[ 55 ] Imposition of the completeness criterion drastically reduces the number of acceptable CEs. If furthermore it is imposed that certain clusters, such as the empty cluster, the single point and nearest neighbor pair clusters are always required in a valid CE the number of CEs can be reduced further. A complete CE is fully characterized by its maximal clusters only. When only CEs are considered with a limited number of maximal clusters their total number remains relatively small. In the case of binary fcc alloys with the N4R8 cluster pool and imposition of completeness and mandatory inclusion of the empty, point, nearest and next nearest pair clusters, and considering only CEs with 4 or fewer maximal clusters, there remain just 2 018 401 138ð 2 109Þ CEs only. While this number is still very large, it can be evaluated in a number of weeks to months on a single CPU core. This 4th method is used below. It should be remarked that enumeration with a limited number of maximal clusters biases CEs with completeness towards the clusters with larger numbers of sites. Larger clusters are favored because they contain several subclusters and thus yield a greater improvement in the ‘‘fitting criterion’’ than smaller clusters. With a handicap or weighting function this bias can be ameliorated to some extent. To illustrate that a single set of clusters can yield acceptable CEs across a range of alloys, we considered all major alloying elements in Al-base alloys. Leave-one-out cross-validation scores (LOOCV) have been determined for CEs of 9 binary Al alloys, Al-Cu, Al-Fe, Al-Li, Al-Mg, AlMn, Al-Si, Al-Ti, Al-Zn, Al-Zr. The CEs are complete as discussed in section 2.3. The CEs are based on the N4R8 pool with up to 4 maximal clusters. For each binary a set of 100 ab initio computed structural energies was used for fitting. Enumeration gave 2 018 401 138 CEs for each binary. To find the optimal CE across all 9 binaries we computed the product of all 9 LOOCVs for a given set of maximal clusters. The resulting set of clusters is shown in Fig. 3. Including the empty cluster, the maximal clusters shown in Fig. 3 have 20 subclusters, thus there are 24 ECIs in our optimal CE for Al-alloys, Al-optimal for brevity. For a ternary alloys the number of ECIs in Al-optimal is 146 out of the more than 7000 that occur within the N4R8 pool for ternaries. Al-optimal gives small LOOCV values for binaries (Al-Mg: 1.3 meV/atom, Al-Si: 2.8 meV/atom) by design, but for the Al-Mg-Si ternary also, the LOOCV has a small value of just 2.2 meV/atom. Of course, as mentioned in section 2.2 the interactions of the Al-Mg and Al-Si binaries were inherited in describing the ternary. It should be remarked that with 100 structural energies per binary, while extracting only 24 ECIs, the CE is very overdetermined and as a consequence the LOOCV takes values that are very close to the root of the mean square error of the fit. In addition to using 100 structural energies for each of the three constituent binaries, an additional 100 structural energies pertaining to truly ternary compounds were used. The Al-Mg-Si CE was used to predict plausible initial coherent, i.e. fcc-based, stages of precipitation and resulting structures are displayed in Fig. 4. 2.6 Cluster Expansions for Vacancy-Mediated Diffusion in Substitutional Alloys When vacancies trade places with neighboring atoms in an alloy, a certain activation barrier must be overcome, see Fig. 5. When the vacancy is considered as an additional atomic species, it is apparent that the energies of state 1 (E1) and state 2 (E2) can be described by a multinary CE. However, for the activated state between states 1 and 2 problems arise. In the activated state the atom that trades places with the vacancy is no longer uniquely associated with a single lattice site. This is a problem because the cluster expansion is built upon the concept of there being a lattice gas where every site is associated with one, and only one, atomic species. A second problem is that the energy barrier is not purely a ‘‘state function’’ because the height of the barrier depends on the direction of the swap, the transition 1 ! 2 has a different barrier than transition 1 2. This last problem is elegantly solved by van der Ven et al. by defining the kinetically resolved activation barrier,[ 21 ] defined as EKRA ¼ Etr 1 2 ðE1 þ E2Þ; where Etr is the highest energy along the lowest energy path connecting the states 1 and 2. When EKRA, E1 and E2 are known it is trivial to extract the energy barriers for transitions 1 ! 2 and 1 2. The first problem, the lattice gas violation, can also be resolved by introducing a new atomic species for the vacancy-swapping atom pair, ‘‘swapping pair’’ for brevity, as displayed in Fig. 6. When a single vacancy is present only in an N-component alloy (here N = 3: black species, donut species, and vacancy), replacing the swapping pair by two ‘‘atoms’’ of a new atomic species, results in N 1 new N component systems. Thus the complexity of the problem is not significantly increased. When the value of EKRA is computed in actual alloys, it is found to vary strongly with atomic occupancy right around the swapping pair. In Al-Cu alloys the largest and smallest values might easily differ by a factor 3. To illustrate the significant local neighborhood dependence of the KRA energies, some Al-Cu supercells are shown with the corresponding values of EKRA in Fig. 7. As EKRA is a strong function of the local environment, a local CE appears a sensible approach. A local CE should depend only on atomic occupancy right near the swapping pair and this expansion should not describe the average energetics of the alloy, which is already covered in the CE for states 1 and 2. This is easily achieved by selecting only correlation functions that encompass the swapping pair. We emphasize that the whole pair should be included because otherwise correlation functions may not be properly distinguished, as illustrated in Fig. 8. The reason for this is that the introduction of the swapping pair has eliminated many operations from the symmetry group of the underlying disordered crystal structure. Only operations that preserve the swapping pair can be retained. As the local CE retains only correlations that contain the swapping pair, it is apparent that such a local CE is NOT complete in a strict sense, because subclusters that contain only one or no sites of the swapping pair are excluded. Of course, one can retain the completeness property with regard to the local environment of the swapping pair. The local CE can be expressed as EKRA ¼ X a ra[b Ja[b; where a represents a cluster decoration corresponding to the empty cluster, or a point cluster, etc. but not part of the swapping pair, b represents the swapping pair, ra[b is a counter for the number of cluster decorations of type a [ b, and J is the corresponding ECI. ra[b has a maximum value la[b that is determined by its symmetry, e.g. for the decorations encircled by the oval in Fig. 8 it is 4 (left side) and 2 (right side). 2.7 Positive Definite Local Cluster Expansions Another aspect of the EKRA local CE is that it should generally yield positive values for all local neighborhoods. Unlike configurational energies which can be both positive and negative, here only positive energies are desired. In the local CE occurrences of cluster decorations are counted. As these occurrences can be counted using positive numbers only, it is tempting to assume that a strictly positive expansion can be obtained by requiring all ECI nonnegative. However, this is a severe restriction that makes it generally too difficult to obtain a good CE. A less severe restriction can be designed. The interaction Jb for the empty cluster should be positive. The sum of all negative valued interactions, times their maximal multipliers la[b, plus the Jb should thus be positive, Jb þ 21 P la[b Ja[b a Ja[b Þ > 0, where a in the sum indicates that the empty cluster is excluded. This less restrictive condition is easily implemented in the aufbau and enumeration methods. The most effective method we have found however, is another method; Immediately screening every CE for a large number of pre-selected local environments and requiring that for all these test environments the EKRA exceeds zero or some small positive value. 3. Conclusion It has been shown that the definition of the correlation functions is a crucial aspect of cluster expansions. Particularly for multicomponent alloys, the current definition of the correlation functions allows inheritance, i.e. effective cluster interactions from constituent systems can be directly re-used. This assures that a good description of the alloy energetics in constituent systems is carried forth in more complex alloys. It also great facilitates determining cluster expansions in alloys with many components because it was shown that if N-body terms in a CE suffice for describing the energetics of an alloy, then alloys with N + 1 components do not require any interactions beyond those present in the N + 1 constituent systems with N components. As generally CEs with 4-body terms are capable of describing alloy energetics to within the meV/ atom range, it follows that quinary alloys already, can be fully described using the energetics of the constituent quaternaries only. This observation is also an indication why there are so very few multicomponent superstructures in metallic alloys. Another aspect of the current formulation of correlation coefficients is the great efficiency in evaluating the energy in dilute binaries and multicomponent alloys generally. Efficient algorithms for determining CEs were presented. These methods were illustrated by determining a set of clusters that yields good CEs for multiple Al-based alloys. Finally, the peculiarities of strictly local CEs, e.g. for KRA energy barriers for vacancy mediated diffusion in substitutional alloys, were discussed. A method to impose that such local CEs yield only positive energy barriers was presented. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. 1. M. Asta , D. de Fontaine, M. van Schilfgaarde , M. Sluiter , and M. Methfessel , First-Principles Phase-Stability Study of fcc Alloys in the Ti-Al System , Phys. Rev. B , 1992 , 46 , p 5055 - 5072 2. M. Asta , S.M. Foiles , and A.A. Quong , First-Principles Calculations of Bulk and Interfacial Thermodynamic Properties for fcc-Based Al-Sc Alloys , Phys. Rev. B , 1998 , 57 , p 11265 - 11275 3. V. Blum and A. Zunger , Structural Complexity in Binary bcc Ground States: The Case of bcc Mo-Ta , Phys. Rev. B , 2004 , 69 , p 020103 : 4 4. J.W.D. Connolly and A.R. Williams , Density-Functional Theory Applied to Phase Transformations in Transition-Metal Alloys , Phys. Rev. B , 1983 , 27 , p 5169 - 5172 5. S. Curtarolo , W. Setyawan , G.L.W. Hart , M. Jahnatek , R.V. Chepulskii , R.H. Taylor , S. Wang, J. Xue , K. Yang , O. Levy , M. Mehl , H.T. Stokes , D.O. Demchenko , and D. Morgan, AFLOW: An Automatic Framework for High-Throughput Materials Discovery, Comput . Mater. Sci., 2012 , 58 , p 218 - 226 6. D. de Fontaine, Solid State Physics , Vol 47 , H. Ehrenreich and D. Turnbull , Ed., Academic Press, New York, 1994 , p 84 7. A. D´ ıaz-Ortiz, H. Dosch , and R. Drautz , Cluster Expansions in Multicomponent Systems: Precise Expansions from Noisy Databases, J. Phys. , 2007 , 19 , p 406206 : 19 8. R. Drautz and M. Fa¨hnle, Spin-Cluster Expansion: Parametrization of the General Adiabatic Magnetic Energy Surface with Ab Initio Accuracy , Phys. Rev. B , 2004 , 69 , p 104404 : 6 9. G.D. Garbulsky and G. Ceder , Linear-Programming Method for Obtaining Effective Cluster Interactions in Alloys from Total-Energy Calculations: Application to the fcc Pd-V System , Phys. Rev. B , 1995 , 51 , p 67 - 72 10. G. Ghosh, A. van de Walle, and M. Asta , First-Principles Calculations of the Structural and Thermodynamic Properties of bcc, fcc and hcp Solid Solutions in the Al-TM (TM = Ti, Zr and Hf) Systems: A Comparison of Cluster Expansion and Supercell Methods, Acta Mater ., 2008 , 56 , p 3202 - 3221 11. G.L.W. Hart , V. Blum , M.J. Walorski , and A. Zunger , Evolutionary Approach for Determining First-Principles Hamiltonians, Nat . Mater., 2005 , 4 , p 391 - 394 12. D. Lerch , O. Wieckhorst , G.L.W. Hart , R.W. Forcade , and S. M u¨ller, UNCLE: A Code for Constructing Cluster Expansions for Arbitrary Lattices with Minimal User-Input, Model . Simulat. Mater. Sci. Eng ., 2009 , 17 , p 055003 : 19 13. J.Z. Liu , A. Van De Walle , G. Ghosh, and M. Asta , Structure, Energetics, and Mechanical Stability of Fe-Cu bcc Alloys from First-Principles Calculations , Phys. Rev. B , 2005 , 72 , p 144109 : 16 14. Z.W. Lu , S.-H. Wei , A. Zunger , S. Frota-Pessoa , and L.G. Ferreira , First-Principles Statistical Mechanics of Structural Stability of Intermetallic Compounds , Phys. Rev. B , 1991 , 44 , p 512 - 544 15. A.A. Mbaye , L.G. Ferreira , and A. Zunger , First Principles Calculations of Semiconductors Alloy Phase Diagrams, Phys. Rev. Lett. , 1987 , 58 , p 49 - 52 16. D. Morgan , A. van de Walle, G. Ceder, J.D. Althoff , and D. de Fontaine, Vibrational Thermodynamics: Coupling of Chemical Order and Size Effects , Model. Simul. Mater Sci Eng. , 2000 , 8 , p 295 - 309 17. J.M. Sanchez , F. Ducastelle , and D. Gratias , Generalized Cluster Description of Multicomponent Systems , Physica A , 1984 , 128 , p 334 - 350 18. M. Sluiter , D. de Fontaine, X. Guo , R. Podloucky , and A.J. Freeman , First Principles Calculation of Phase Equilibria in the Aluminum-Lithium System , Phys. Rev. B , 1990 , 42 , p 10460 - 10476 19. M.H.F. Sluiter and Y. Kawazoe , Cluster Expansion Method for Adsorption: Application to Hydrogen Chemisorption on Graphene , Phys. Rev. B , 2003 , 68 , p 085410 : 7 20. A. van der Ven , M.K. Aydinol , G. Ceder, G. Kresse, and J. Hafner , First-Principles Investigation of Phase Stability in LixCoO2 , Phys. Rev. B , 1998 , 58 , p 2975 - 2987 21. A. van der Ven , G. Ceder, M. Asta , and P.D. Tepesch , FirstPrinciples Theory of Ionic Diffusion with Nondilute Carriers, Phys. Rev. B , 2001 , 64 , p 184307 : 17 22. G. Grigoryan , F. Zhou , S.R. Lustig , G. Ceder , D. Morgan , and A.E. Keating , Ultra-Fast Evaluation of Protein Energies Directly from Sequence , PLOS Comput. Biol ., 2006 , 2 , p e63 23. A. Kutana , E.S. Penev , and B.I. Yakobson , Engineering Electronic Properties of Layered Transition-Metal Dichalcogenide Compounds Through Alloying, Nanoscale, 2014 , 6 , p 5820 - 5825 24. R. Magri and Alex. Zunger, Real-Space Description of Semiconducting Band Gaps in Substitutional Systems , Phys. Rev. B , 1991 , 44 , p 8672 - 8684 25. F. Zhou , G. Grigoryan, S.R. Lustig , A.E. Keating , G. Ceder, and D. Morgan , Coarse-Graining Protein Energetics in Sequence Variables, Phys. Rev. Lett. , 2005 , 95 , p 148103 : 4 26. P.R. Alonso and G.H. Rubiolo , Relative Stability of bcc Structures in Ternary Alloys with Ti50Al25Mo25 Composition, Phys . Rev. B , 2000 , 62 , p 237 : 6 27. S. Bein , C. Colinet , and M. Durand-Charre , CVM Calculation of the Ternary System Co-Cu- Fe , J. Alloy Compd ., 2000 , 313 , p 133 - 143 28. V. Chaumat , F. Moret , and C. Colinet , Study of Phase Equilibria in the Nb-Ti-Al System Theoretical Study: CVM Calculation of the Phase Diagram of bcc Nb-Ti- Al , J. Phase Equilib ., 1999 , 20 , p 389 - 398 29. R. Kikuchi , Ternary Phase Diagrams Calculation-I: General Theory, Acta Metall ., 1977 , 25 , p 195 - 205 30. I. Ohnuma, C.G. Sch o¨n , R. Kainuma , G. Inden, and K. Ishida , Ordering and Phase Separation in the bcc Phase of the Fe-AlTi System, Acta Mater ., 1998 , 46 , p 2083 - 2094 31. C.G. Sch o¨n, Order-Disorder Transformations and Diffusion in Ternary bcc Alloys: The Role of Second-Order Boundaries , J. Phase Equil ., 2001 , 22 , p 287 - 290 32. M. Sluiter and Y. Kawazoe , Site Preference of Ternary Additions in Ni3Al, Phys. Rev. B , 1995 , 51 , p 4062 - 4073 33. T. Hoshino, N. Fujima , and M. Asato , Ab Initio Study for Magnetism in Ni2MnAl full-Heusler Alloy: A Cluster Expansion Approach for Total Energy , J. Alloy. Compd. , 2010 , 504 , p S534 - S537 34. F. Lechermann , M. Fa¨hnle, and J.M. Sanchez , First-Principles Investigation of the Ni-Fe- Al System , Intermetallics , 2005 , 13 , p 1096 - 1109 35. J.S. Wro´bel, D. Nguyen-Manh, M. Yu . Lavrentiev, M. Muzyk , and S.L. Dudarev , Phase Stability of Ternary fcc and bcc FeCr-Ni Alloys , Phys. Rev. B , 2015 , 91 , p 024108 : 31 36. K. Yuge , First-Principles Study of Phase Equilibria in Cu-PtRh Disordered Alloys , J. Phys. , 2009 , 21 , p 415401 : 7 37. C. Wolverton and D. de Fontaine, Cluster Expansions of Alloy Energetics in Ternary Intermetallics, Phys. Rev. B , 1994 , 49 , p 8627 - 8642 38. A. van der Ven and G. Ceder, First Principles Calculation of the Interdiffusion Coefficient in Binary Alloys , Phys. Rev. Lett. , 2005 , 94 , p 045901 : 4 39. A. van der Ven and G. Ceder, Vacancies in Ordered and Disordered Binary Alloys Treated with the Cluster Expansion , Phys. Rev. B , 2005 , 71 , p 054102 : 7 40. F. Ducastelle and F. Gautier , Generalized Perturbation Theory in Disordered Transitional Alloys: Applications to the Calculation of Ordering Energies , J. Phys . F, 1976 , 6 , p 2039 - 2062 41. F. Gautier , F. Ducastelle , and J. Giner , Ordering and Segregation Processes in Transition Metal Alloys in Relation to Their Electronic Structures, Philos . Mag., 1975 , 31 , p 1373 - 1390 42. M.H.F. Sluiter and Y. Kawazoe , Invariance of Truncated Cluster Expansions for First-Principles Alloy Thermodynamics , Phys. Rev. B , 2005 , 71 , p 212201 : 2 43. H. Lukas , S.G. Fries , B. Sundman , Computational Thermodynamics: The Calphad Method, (Cambridge Univ. Press, Cambridge, 2007 ) ISBN: 9780521868112 44. A. van de Walle and G. Ceder, Automating First-Principles Phase Diagram Calculations , J. Phase Equil ., 2002 , 23 , p 348 - 359 45. N.A. Zarkevich and D.D. Johnson , Reliable First-Principles Alloy Thermodynamics via Truncated Cluster Expansions, Phys. Rev. Lett. , 2004 , 92 , p 255702 : 4 46. C.M. Wolverton , First- Principles Prediction of Equilibrium Precipitate Shapes in Al-Cu Alloys , Philos. Mag. Lett., 1999 , 79 , p 683 - 690 47. A. Bigot , F. Danoix , P. Auger , D. Blavette , and A. Menand , 3D Reconstruction and Analysis of GP Zones in Al-1.7Cu (at%): A Tomographic Atom Probe Investigation , Appl. Surf. Sci., 1996 , 94 ( 95 ), p 261 - 266 48. T. Sato and A. Kamio , High Resolution Electron Microscopy of Phase Decomposition Microstructures in Aluminium-Based Alloys, Mater. Sci. Eng ., 1991 , A146, p 161 - 180 49. M.J. Starink and A.M. Zahra , Mechanisms of Combined GP Zone and h' Precipitation in an Al-Cu Alloy , J. Mater. Sci. Lett. , 1997 , 16 , p 1613 - 1615 50. M. Sluiter and P. Turchi , Electronic Theory of Phase Stability in Substitutional Alloys: The Generalized Perturbation Method Versus the Connolly-Williams Method , Phys. Rev. B , 1989 , 40 , p 11215 - 11228 51. M. Sluiter , Y. Watanabe , D. de Fontaine, and Y. Kawazoe , A FirstPrinciples Calculation of the Pressure Dependence of Phase Equilibria in the Al-Li System , Phys. Rev. B , 1996 , 53 , p 6137 - 6149 52. R. Drautz and A. D´ ıaz-Ortiz, Obtaining Cluster Expansion Coefficients in Ab Initio Thermodynamics of Multicomponent Lattice-Gas Systems , Phys. Rev. B , 2006 , 73 , p 224207 : 5 53. A. van de Walle, Multicomponent Multisublattice Alloys, Nonconfigurational Entropy and Other Additions to the Alloy Theoretic Automated Toolkit , Calphad, 2009 , 33 , p 266 - 278 54. A. van de Walle, Methods for First-Principles Alloy Thermodynamics , JOM , 2013 , 65 , p 1523 - 1532 55. M.H.F. Sluiter , C. Colinet , and A. Pasturel , Ab Initio Calculation of the Phase Stability in Au-Pd and Ag-Pt Systems , Phys. Rev. B , 2006 , 73 , p 174204 : 17 56. M. Sluiter , K. Esfarjani , and Y. Kawazoe , Site Occupation Reversal in the Fe-Cr sigma Phase , Phys. Rev. Lett. , 1995 , 75 , p 3142 - 3145 57. E. Cockayne and A. van de Walle, Building Effective Models from Sparse but Precise Data: Application to an Alloy Cluster Expansion Model , Phys. Rev. B , 2010 , 81 , p 012104 : 4

This is a preview of a remote PDF:

Xi Zhang, Marcel H. F. Sluiter. Cluster Expansions for Thermodynamics and Kinetics of Multicomponent Alloys, Journal of Phase Equilibria and Diffusion, 2015, 44-52, DOI: 10.1007/s11669-015-0427-x