MULTIBREED EVALUATION -
THEORY AND APPLICATION[1]
M. A. Elzo
Animal Science Department
University of Florida,
Gainesville 32611
Introduction
The vast majority of the
beef produced in the United States comes from crossbred animals. If all animals involved in beef cattle
production systems in the United States were included in a gigantic genetic
evaluation scheme, they would constitute a complex multibreed population. Even if we considered only partial sections
of the United States multibreed population, these are likely to be reasonably
complete multibreed subpopulations. In
other words, multibreed populations in the United States are the rule rather
than the exception. If multibreed
populations are so abundant, then why are there no multibreed genetic
evaluation systems to evaluate animals, especially sires, for additive,
nonadditive, and total multibreed expected progeny differences? What are the problems that need to be solved
to produce national multibreed genetic evaluations?
This presentation discusses
some theoretical and applied aspects of multibreed genetic evaluation and
estimation of covariance component procedures used at the University of Florida
that may be useful for the development of national multibreed genetic
evaluation systems. Problems that need
to be solved to generate national, and perhaps international, multibreed
genetic evaluations are subsequently discussed in a general context.
Multibreed Populations
Multibreed populations are
those composed of straightbred and crossbred animals that interbreed. In other words, crossbred animals are not
just the end product of some straightbred × straightbred, straightbred ×
crossbred, and crossbred × crossbred matings, but they also actively participate
as parents of the next generation if chosen by some selection process. For the purposes of this discussion,
multibreed population will be classified as either complete or incomplete.
Complete multibreed
populations are those whose mating pattern follow a diallel design, i.e., they
have the same breed groups of sires and dams, and sires are mated across breed
groups of dams. The Angus-Brahman Herd
of the University of Florida has a complete multibreed design; Table 1 shows
the number of sires used per breed group of sire × breed group of dam in this
herd. In Table 1, BGS (BGD) = breed
group of sire (dam), A = Angus, B = Brahman, and Br = Brangus.
Incomplete multibreed
populations are those that have different numbers and (or) kinds of mating
groups of sires and dams. In addition,
sires of some mating groups may be preferentially mated to certain breed groups
of dams. The mating design of the
Sanmartinero-Brahman Multibreed Herd of La Libertad, Colombia, was incomplete;
Table 2 contains the numbers of bulls per breed group of sire × breed group of
dam combination; BGS (BGD) = breed group of sire (dam), S = Sanmartinero, and B
= Brahman.
The structure of the overall
multibreed population and the various multibreed subpopulations (e.g.,
Angus-Brangus-Brahman, Simmental-Canadian Simmental-Simbrah-Brahman) in North
America, because of the practice of grading up to a single breed and (or) their
goal of producing composite breeds, is likely to be more complete to the design
shown in Table 2. Multibreed Expected
Progeny Differences (MEPD) were computed for the Angus-Brahman Herd (Elzo and
Wakeman, 1998; Elzo et al., 1998b) and for the Sanmartinero-Brahman Herd (Elzo
et al., 1999). Considering the much
larger data sets available for the various multibreed subpopulations in North
America, it seems reasonable to assume that at least from a number of data
viewpoint, such evaluations are feasible.
The main issue here is likely to be connectedness among multibreed
contemporary groups. Cornell University
has had a Multibreed Genetic Evaluation program for the Simmental-Canadian
Simmental-Simbrah population that accounts for group nonadditive genetic
effects since 1997 (Klei and Quaas, 1995; Pollak and Quaas, 1998). Thus, at least in this multibreed population,
connectedness among multibreed contemporary groups was not a problem. This aspect needs to be investigated in all
multibreed subpopulations in the United States.
Genetic and Environmental
Effects
The type of genetic and
environmental effects present in multibreed populations can be classified in
almost exactly the same form as it is done in genetic analyses within
populations. The sole exception is the
existence of interbreed genetic effects.
Multibreed populations have additive and nonadditive, direct and
maternal, intrabreed and interbreed genetic effects.
Figure 1 shows a simplified
flowchart of genetic effects considered in a multibreed model. If all interbreed genetic effects were
omitted this chart will reflect an intrabreed analysis. A similar chart can be constructed for environmental
effects. However, environmental effects
associated with specific genetic effects are of less interest, thus they are
usually lumped together in less specific residual terms. The larger the number of base breeds in a
multibreed population the larger the need for this type of simplifying
assumptions. For example, there could
be intra and interbreed residual terms, or, a different residual term per breed
group of calf could be defined.
Whichever assumption is made relative to residual effects, it will be a
compromise between a desire for accuracy and computational feasibility.
Modeling Strategies
Additive and nonadditive
genetic effects in a multibreed prediction model can be viewed from a subclass
(i.e., 1 and 0 in the design matrix) or a regression viewpoint. Explaining a genetic effect in terms of
regression is done to simplify computations, but regression predictions are
usually approximations to the complete (subclass) effect. On the other hand, prediction of genetic
effects not represented in the data (e.g., future crossbred matings) can be
easily obtained. The author has
considered both strategies when modeling multibreed additive and nonadditive
genetic effects.
The subclass approach to
multibreed additive genetic effects is similar to the intrabreed situation,
except that the matrix of covariances among animals depends on at least two
sets of additive intrabreed and, if parents are crossbred, on one set of
additive interbreed genetic covariances.
The regression approach for additive genetic effects aim at predicting
all intra and interbreed genetic effects present in every animal. The usefulness of this approach in a
national animal evaluation seems unclear.
The subclass approach to
multibreed nonadditive genetic effects, where these effects are predicted
within sire × breed group of dam subclasses, can quickly become an extremely
large problem if breed groups of dams are defined in terms of their expected
breed compositions. One way to simplify
the problem is to redefine breed groups of dams in terms of ranges rather than
individual breed expectations. However,
the problem could still be too large from a practical standpoint. Another approach is to use regression to
approximate nonadditive genetic effects within sire × breed group of dam
subclasses (Elzo, 1983). If the number
of factors used to predict nonadditive genetic effects is small (e.g.,
intralocus interbreed), it can yield substantial computational savings. If, on the other hand, the number of
nonadditive prediction factors is large, computations can be even more involved
than using a subclass approach based on ranges of breed groups of dams. One of the main advantages of the regression
approach is that it permits the prediction of nonadditive genetic effects for
future matings. Thus, even if sires are
initially tested with a fraction of the future spectrum of possible matings,
predictions of nonadditive genetic effects for all of them can be easily made.
Multibreed Linear Model
Figure 2 shows, as an
example, a generic multibreed sire-maternal grandsire linear model. An animal model would substitute calf for
sire and dam for maternal grandsire.
The capital letters in parenthesis in Figure 2 indicate additive (A),
nonadditive (N), direct (D), and maternal (M) genetic effects. Fixed effects are similar to intrabreed
models except for exclusion (or inclusion)
of breed group effects. For example, multibreed contemporary groups
would follow the usual Beef Improvement Federation recommendations (BIF, 1996)
excluding breed group of calf. In fact,
calves of several breed groups must be reared together to create a
multibreed comparison group. Similarly,
age of dam would consider dam breed composition in some form, in addition of
sex of calf. Additive and nonadditive
genetic groups have similar meaning to that of intrabreed genetic models, and
can follow a subclass or a regression approach. Similarly, random additive genetic effects can have a subclass or
regression form. Preferably all
additive and nonadditive genetic effects should be deviated from a single base
(e.g., a base breed). The active
options of the current version of the MREMLEM program are regression additive
and nonadditive groups, subclass additive random effects, and regression
nonadditive random effects.
The type of multibreed
expected progeny differences (MEPD) produced by the multibreed model in Figure
2 are: 1) additive direct (AD), 2) additive maternal (AM), 3) nonadditive
direct (ND), 4) nonadditive maternal (NM), 5) total direct (TD = AD + ND), and
6) total maternal (TM = AM + NM). A
regression approach yields predictions of maximum values of genetic
effects. Thus, if regression was used
to predict nonadditive regression effects (as in MREMLEM), nonadditive genetic
predictions for specific crossbred matings need to be computed by weighing the
predicted values obtained from the solutions to the mixed model equations by
their probability of occurrence in these matings.
Estimation of Multibreed
Covariances
The largest problem to be
solved in multibreed genetic evaluation systems by far is that of estimating
the large number of multibreed covariances needed. However, because additive, nonadditive, and environmental
multibreed covariances are linear combinations of a much smaller set of base
additive, nonadditive, and environmental covariances this covariance estimation
problem is drastically reduced.
Computations can be further decreased if during a prediction or a
covariance estimation run, only multibreed covariances corresponding to breed
combinations present in the evaluation are computed.
Figure 3 shows the
computational procedure used by the MREMLEM program to estimate covariance
components. The computational procedure
uses a Generalized Expectation-Maximization algorithm (GEM, Dempster et al.,
1977) to first compute the elements of the Cholesky Decomposition of the base
additive, nonadditive, and environmental covariance matrices, then the base
covariance matrices are computed by multiplication by their transposes, and
finally multibreed covariance matrices are computed as linear combinations of
base covariance matrices. Subsequently,
genetic parameters (heritabilities, interactibilities, correlations) are
computed for a predetermined number of breed groups.
The MREMLEM program has been
used to estimate additive and nonadditive genetic, and environmental base and
multibreed covariances in several small experimental multibreed data sets (Elzo
and Wakeman, 1998; Elzo et al., 1998a,b, 1999). The main problem encountered when estimating covariances in these
data sets was the presence of multicollinearity when solving the set of
equations in the maximization step of the GEM algorithm. This multicollinearity problem seemed to get
worse when interbreed additive covariances (segregation covariances in the
terminology of Wright (1968), Lande (1981), and Lo et al., 1993) needed to be
computed. Adding a small number to the
diagonal of these equations (Elzo, 1996) helped somewhat. Partial steps (Jennrich and Schluchter,
1986) in the maximization step of the GEM algorithm were usually needed to
increase the value of the log-likelihood of the complete data. Whether much larger number of data will
decrease multicollinearity needs to be investigated. However, because multibreed covariances are linear combinations
of base covariances, if these base covariances are too similar, then
some degree of multicollinearity in the set of equations of the maximization
step seems inevitable. Adding a
computational rule of equality of estimated base covariances (or base
covariance matrices) in the multibreed GEM algorithm could be an alternative to
obtain approximate estimates.
Analyses of Multibreed Data
Sets
Three multibreed herds have
been analyzed using multibreed genetic evaluation procedures: 1) the Angus-Brahman
herd of the University of Florida for preweaning growth traits (Elzo and
Wakeman, 1998), and carcass traits (Elzo et al., 1998b), 2) the
Romosinuano-Brahman herd of Turipaná, Colombia for pre and postweaning growth
traits (Elzo et al., 1998a), and 3) the Sanmartinero-Brahman herd of La
Libertad, Colombia for pre and postweaning growth traits (Elzo et al.,
1999). The preweaning growth traits
were birth and weaning weights, and the postweaning growth trait was weight
gain between weaning and 16 mo of age.
The carcass traits were carcass weight, area of the longissimus muscle,
fat over the longissimus muscle, kidney-pelvic-heart fat, marbling score, and
Warner-Bratzler shear force. The goal
in all these studies was the characterization of these multibreed herds for the
analyzed traits. Thus, base and
multibreed additive, nonadditive, environmental, and phenotypic covariance
components and genetic parameters were estimated, and additive, nonadditive,
and total genetic values were predicted.
The multibreed model used in
these analyses was a sire-maternal grandsire model (Figure 2) and covariances
and genetic parameters were estimated as shown in Figure 3. As examples of the parameters obtained in
these analyses, Tables 3 contains estimates of heritabilities and
interactibilities (ratios of intralocus interbreed nonadditive to phenotypic
variances) for preweaning growth traits, and Table 4, estimates of the same
parameters for carcass traits, in the Angus-Brahman multibreed herd of the
University of Florida. The most salient
aspects of these tables is that heritability estimates were similar in Angus
and Brahman (except for Warner-Bratzler shear force), interactibility values
were somewhat lower (but not zero), except for fat over the longissimus dorsi
(steers slaughtered at similar fat endpoints).
A similar pattern of heritability-interactibility estimates for growth
traits was obtained in the Romosinuano-Brahman and Sanmartinero-Brahman
multibreed herds.

To illustrate predictions
results for growth traits, Figure 4 shows direct additive, nonadditive, and
total MEPD in the Angus-Brahman herd (left column) and the Sanmartinero-Brahman
herd (right column), and Figure 5 shows the same arrangement for maternal
additive, nonadditive, and total MEPD in these two herds. The meaning of the numbers in the abcisa of
the Angus-Brahman graphs is: 1 = Angus, 2 = ¾A ¼B, 3 = ½A ½B, 4 = ¼A ¾B, 5 =
Brahman, and 6 = Brangus. The letters
in the abcisa of the Sanmartinero-Brahman graphs mean: S = Sanmartinero, X = ½S
½B, C = Brahman. Sires were ordered by
date of entry into the stud within breed group of sire; thus, they show
an unweighted trend in sire usage over time by breed group in these herds.
The main points in these
graphs are: 1) the samples of Angus and Sanmartinero sires behaved in similar
fashion relative to the samples of Brahman sires in each herd for direct
additive MEPD, but not for maternal additive MEPD, where Sanmartinero sires
were clearly superior to Brahman sires for maternal additive MEPD for weaning
weight, 2) there was nonadditive direct and maternal variation among sires of
all breed groups for nonadditive MEPD, 3) total MEPD tended to increase toward
Brahman sires for direct and maternal genetic weaning weight effects in the
Angus-Brahman herd, but only for direct genetic effects in the
Sanmartinero-Brahman herd; the total maternal MEPD of Sanmartinero sires was
larger than those of Brahman sires.
Although some of the sires used in the Colombian herds were the same as
those used in the Angus-Brahman herd, a joint analysis has not been done yet,
thus only indirect comments, as the ones above, are possible at this time.
Figures 6 and 7 illustrate
multibreed predictions for carcass traits in the Angus-Brahman herd of the
University of Florida. Direct additive,
nonadditive, and total MEPD for carcass weight and longissimus muscle area are
shown in Figure 6, whereas Figure 7 contains the same MEPD for marbling score
and Warner-Bratzler shear force. Recall
that steers were slaughtered at approximately the same fat endpoint. These graphs reaffirm visually the existence
of intralocus interbreed nonadditive genetic variability in the Angus-Brahman
multibreed herd, as well as show in detail differences in direct additive MEPD
among sires of the six breed groups.
For example, Angus sires had the smallest additive and total MEPD for
carcass weight, among the largest ones for area of the longissimus muscle, the
largest ones for marbling score, and the lowest ones for Warner-Bratzler shear
force. According to these graphs,
intralocus interbreed nonadditive MEPD would increase most sire's total MEPD
values for carcass weight and longissimus muscle area, but it would decrease
most sire's total MEPD for marbling score and Warner-Bratzler shear force.
The ranking of sires by
additive and nonadditive MEPD for all growth and carcass traits was
substantially different (correlation between .04 and .41). The correlation between additive and total
MEPD yielded values between .81 and 1.00 among the growth and carcass traits
analyzed in these herds. Thus,
accounting for intralocus interbreed nonadditive genetic effects will be more
useful to help identify the best sires based on total MEPD for some traits
(e.g., weight traits) than for others (e.g., kidney-pelvic-heart fat).
Estimates of covariance
components and genetic predictions indicated that there was substantial
variability for intralocus interbreed nonadditive genetic effects in the three
analyzed multibreed herds. Thus, selection
of animals for both additive and nonadditive genetic effects is feasible in
these herds. Furthermore, when sires
were ordered by their additive, nonadditive, and total MEPD, sires from all
breed groups could be found in the top, middle, and bottom tiers. A conservative selection approach would be
to first select animals for their additive MEPD, and then select the best ones
for their total MEPD among those in the preselected group. This would ensure additive genetic progress
and increased combining ability of the selected sires in the next generation.
National Multibreed
Evaluations
The main problem to be
solved to have a unified system of national multibreed genetic evaluations is
to agree on the definition of the national beef cattle multibreed
population(s). Elzo (1995)
discussed three possible alternatives, two of which would produce national
multibreed genetic evaluations. The
three definitions were: 1) a single multibreed population including all
straightbred and crossbred beef cattle in the country, 2) several overlapping
multibreed subpopulations, each composed by several breeds and all crossbred
groups of these breeds, and 3) several extended breeds which included
straightbred sires and their mates of various breed compositions. The first two definitions will yield
national multibreed genetic evaluations.
Furthermore, the inclusion of international data in USA intrabreed
genetic evaluations (e.g., Canadian Simmental, Uruguayan Hereford) has opened
the door for continental, and perhaps world genetic evaluations in the future.
The simplest situation would
be to have a single national multibreed population. Although this may not be hitherto possible, it should be
seriously considered for the future.
The second definition is more realistic at this time. The data set used in the Cornell Multibreed
Evaluation which includes Simmental, Canadian Simmental, and Simbrah, is the
closest to one to meet the criteria of this definition. It would be complete if all Brahman data
were included as well. Other multibreed
subpopulations could be Angus-Brangus-Brahman, Hereford-Braford-Brahman,
Limousin-Bramousin-Brahman, etc. Since
all these multibreed subpopulations have Brahman in common, connectedness among
them would be created by using reference Brahman sires in (some) of their
contemporary groups.
Some breed associations are
currently accepting straightbred and crossbred records from other breeds (E. J.
Pollak, personal communication).
Consequently, ties among overlapping multibreed populations will become
stronger over time, thus facilitating the generation of national multibreed
additive, nonadditive, and total MEPD.
A second problem is
methodological. Multibreed procedures
are computationally more demanding than intrabreed ones. Given the complexity of the national
multibreed data sets in terms of the numbers of breeds and crossbred groups,
efficient linear and nonlinear multibreed computational procedures need to be
developed.
A third problem is
publication of information. The large number
of MEPD (additive, nonadditive, total)
generated per animal in a multibreed evaluation makes it unfeasible to
completely publish them on paper. Thus,
electronic publication should be considered the primary source. Furthermore, publication of nonadditive and
total MEPD for every potential crossbred mating of an animal is clearly
inadvisable. Perhaps a mating program
should be part of the electronic service, which would provide a list of
potential mates for a dam with their additive, nonadditive, and total
MEPD. These mates could be chosen
within a single breed or among several breeds specified by the user. In short, with electronic distribution of
MEPD, additional services to facilitate and expand their usability will need to
be considered.
The problems outlined above
are simply a chapter in the ongoing evolution of national genetic evaluation
methodology. Solutions to these
problems will require an even larger level of communication and cooperation
among producers, breed associations, and university researchers.
References
BIF. 1996.
Guidelines for uniform beef improvement programs (7th
Ed.). Beef Improvement Federation, Kansas State Univ., Kansas, p 1-161.
Dempster, A. P., N. M.
Laird, and D. B. Rubin. 1977. Maximum
Likelihood from Incomplete Data via the EM Algorithm. J. Royal Stat. Soc., Ser. B 38:1-38.
Elzo, M. A. 1983. Multibreed evaluation within and across
countries. Ph. D. Dissertation, Univ.
of California, Davis.
Elzo, M. A. 1996. Unconstrained procedures for
the estimation of positive definite covariance matrices using Restricted
Maximum Likelihood procedures in multibreed populations. J. Anim. Sci. 74:317-328.
Elzo, M. A. 1995. Considerations for the genetic evaluation of
straightbred and crossbred bulls in large multibreed populations. Proc. WCC-100
Symp. Development and Implementation of Statistical Analyses and Computer
Strategies for National Cattle Evaluation, p 1-21.
Elzo, M. A., and D. L.
Wakeman. 1998. Covariance components and prediction for
additive and nonadditive preweaning growth genetic effects in an Angus-Brahman
multibreed herd. J.
Anim. Sci. 76:1290-1302.
Elzo, M. A., C. Manrique, G. Ossa, and O. Acosta. 1998a. Additive and nonadditive genetic variability
for growth traits in the Turipaná Romosinuano-Zebu multibreed herd. J. Anim. Sci. 76: 1539-1549.
Elzo, M. A., G. Martínez, F. González, and H. Huertas. 1999.
Variability and genetic predictions for beef traits in the
Sanmartinero-Brahman multibreed herd of La Libertad. Proc. Inter. Sem. Genet. Characterization Sanmartinero Criollo
Cattle, p 54-81.
Elzo, M. A., R. L. West, D.
D. Johnson, and D. L. Wakeman.
1998b. Genetic variation and
prediction of additive and nonadditive genetic effects for six carcass traits
in an Angus-Brahman multibreed herd. J. Anim. Sci. 76: 1810-1823.
Jennrich, R. I., and M. D.
Schluchter. 1986. Unbalanced repeated-measures models with
structured covariance matrices.
Biometrics 42:805-820.
Klei, L., and R. L.
Quaas. 1995. Multiple Breed - EPD: The
Cornell approach to the Simmental data.
Proc. WCC-100 Symp. Development and Implementation of Statistical
Analyses and Computer Strategies for National Cattle Evaluation, p 41-49.
Lande, R. 1981.
The minimum number of genes contributing to quantitative variation
between and within populations.
Genetics 99:541-553.
Lo, L. L., R. L. Fernando,
and M. Grossman. 1993. Covariance between relatives in multibreed
populations: Additive model. Theor.
Appl. Genet. 87:423-430.
Pollak, E. J., and R. L.
Quaas. 1998. Multibreed genetic evaluations of beef cattle. Proc. 6th World Congr. Appl.
Livest. Prod. 23:81-88.
Wright, S. 1968.
Evolution and the Genetics of Populations. Vol. 1. Genetics and
Biometrical Foundations. University of Chicago Press.



Figure 4. Direct Additive, Nonadditive, and Total
Multibreed Expected Progeny Differences for Weaning Weight in the Angus-Brahman
Herd of the University of Florida (left column) and in the Sanmartinero-Brahman
Herd of La Libertad, Colombia (right column).



Figure 5. Maternal Additive, Nonadditive, and
Total Multibreed Expected Progeny Differences for Weaning Weight in the
Angus-Brahman Herd of the University of Florida (left column) and in the
Sanmartinero-Brahman Herd of La Libertad, Colombia (right column).



Figure 6. Direct Additive, Nonadditive, and Total
Multibreed Expected Progeny Differences for Carcass Weight and Longissimus
Muscle Area in the Angus-Brahman Multibreed Herd of the University of Florida.



Figure 7. Direct Additive, Nonadditive, and Total
Multibreed Expected Progeny Differences for Marbling Score and Warner-Bratzler
Shear Force in the Angus-Brahman Multibreed Herd of the University of Florida.