Routine Computation and Visualization of Coefficients of Parentage Using the International Crop Information System

Graham McLaren[*] Ian DeLacy[**] and Jose Crossa[***]

The Coefficient of Parentage (COP), also known as the Coefficient of Kinship measures the genetic relationship between strains of germplasm according to the proportion of identical alleles that they share by descent through their pedigrees. COP values are therefore important measures of genetic diversity. Two times the COP value is known as the Coefficient of Relationship and is related to the additive genetic correlation between the strains. This makes them a powerful statistic for analyzing breeding strategies.

In organisms where it is difficult to replicate genotypes, such as mammals, information from relatives is sometimes the only source of information on breeding values. Sewell Wright (1922) proposed the Coefficient of Relationship as a way to quantify the relatedness and to derive the Inbreeding Coefficient for livestock. COP values computed from pedigree records have been used extensively in livestock improvement since mixed model theory and software was developed in the 70s (Mrode, 1996).

COP values have played a lesser role in crop breeding for several possible reasons: pedigree records have been less extensive and recorded in more diverse ways, it is easy to replicate genotypes and increase heritability of interesting traits by experimental design and replication, and software for computing COP values and using them has, until recently, not been readily available to crop scientists. Many studies have, however, shown the utility and power of COP values for crop improvement.

Cox et al. (1985) demonstrated the correlation between COP values and similarity indices based on isozyme markers in Soybean (Glycine max (L) Merr.). Murphy et al. (1986) and Cox et al. (1986) studied the population structure and field diversity of red winter wheat (Triticum aestivum L.) cultivars based on COP values.

Dilday (1990) used pedigree analysis and COP values to analyze the diversity of rice (Oryza satva L.) germplasm developed and released by region in the U.S. Souza et al. (1994) used COP values to analyze spring wheat diversity in the intensive cropping systems of Mexico and Pakistan, concluding that cultivar improvement programs did not erode genetic diversity and that patterns, both temporal and spatial, of cultivar use by farmers were important in determining genetic diversity in these areas.

The relationship between COP values and heterosis is an interesting and useful phenomenon. Lefort-Buson et al. (1986) studied this relationship in rapeseed (Brassica napus) and Cowen and Frey (1987) studied the relationship between genealogical distance and breeding behavior in oats (Avena sativa L.). Significant heterosis was observed for matings of more distantly related parents. Bernardo (2002) demonstrates the utility of COP values for predicting single cross performance in hybrid breeding programs.

Sneller (1994) used analysis of COP values to indicate that efforts to increase diversity amongst U.S. soybean cultivars were having little effect and suggested crossing lines with low COP values as a way of exploiting what genetic diversity does exist between regions.

More direct applications of COP values for modeling additive and additive x additive components of genetic variance as well as concomitant components of genotype by environmental variation in crop breeding evaluation data are now being proposed (Crossa et al., 2006, Oakey et al., 2006 and Burgueño et al., 2007). These techniques, however, require the routine calculation and inversion of COP matrices for breeding populations and then their exploitation in mixed linear models to dynamically weight contributions of relatives for estimating breeding values with high precision.

Several computer programs have been developed for computation of COP values and even for the direct computation of the inverse of the COP matrices for animal breeding (Henderson, 1976). Most of these programs do not cater for close relationships which occur in crop strains due to inbreeding by self pollination. Also, the algorithms for direct computation of inverse COP matrices do not deal with selfing. One program focused on crop breeding is described in Cox et al. (1985), and another which allows selfing generations in Tinker et al. (1993). Sneller (1994) developed a SAS macro based on these programs but this does not deal with related sister lines. A major drawback of these programs is the necessity of preparing detailed input files of pedigrees and inbreeding status.

The major constraint to the routine use of COP values in plant breeding is now being removed by the use of information systems such as the International Crop Information System (ICIS, McLaren et al., 2005) which allow the curation of historical pedigrees and automatically capture new pedigree information for a crop breeding project. The BROWSE program of ICIS extracts the required pedigree information, counting selfing generations and identifying common ancestors of sister lines. It produces COP matrices and facilitates Pattern Analysis for display of genetic relationships among strains and between sets of strains. It also inverts the COP matrix for use in Mixed Model analysis programs to model components of genetic and genetic by environmental variation.

In this paper we describe the data model for the management of genealogical information in ICIS and the algorithm used for the computation of COP values by the BROWSE application of ICIS. We describe Pattern Analysis for arrays of COP values leading to several graphic visualizations which can be interpreted in terms of important genetic relationships. We illustrate these displays using examples of COP matrices for rice and wheat.

MATERIALS AND METHODS

The COP between two strains of germplasm, denoted f, is the probability that a randomly selected allele from one strain is identical by descent to a randomly selected allele at the same locus from the other strain (Falconer, 1981 p. 81). A strain is a sample of seeds, either germplasm from a genebank or other collection, or seeds collected from a breeding process. It is not synonymous with genotype since it may be a mixture of individuals with different genotpes, and the notion of ‘randomly selected allele’ corresponds with the idea of observing the allele at a particular locus in a randomly selected gamete from a randomly selected individual in the sample.

Since all individuals of a species are related via evolution, measures of relatedness need to start from some base population and are always relative to this population. In pedigree analysis this base population is taken to be the set of terminal ancestors of the pedigrees which are assumed to be unrelated. Alleles may appear identical in individuals of the base population, and these, and their descendants through the pedigrees are said to be identical by state (IBS). However, two alleles of a locus are identical by descent (IBD) if they have originated by replication of a single allele in the base population (Lynch and Walsh, 1998, p132). COP values measure the proportion of alleles which are IBD and this gives rise to the basic relationship between COP values for strains in one generation and those in a previous one:

fPQ = ½ (fPC + fPD) [1]

where strains P and Q are to be mated in the current generation and Q has resulted from a cross between strains C and D in the previous generation (Falconer, 1981 p.81).

Inbreeding is the process by which loci become homozygous with IBD alleles through inter-mating of related individuals. The coefficient of inbreeding (COI) is the probability that two alleles at any locus in a strain are IBD. Hence, the COP between two strains is the COI of offspring from a cross between those strains. This was the relationship exploited by Wright (1922) in calculation COI for livestock.

In species capable of self-fertilization, inbreeding derives from two sources, the relatedness of parents, measured by their COP and the increase in homozygosity due to selfing which decreases the proportion of heterozygous loci by a factor of one half in each generation. If HZ is the probability of homozygous loci in strain Z due to n generations of selfing, then HZ = 1-(1/2)n. Combining the two sources of inbreeding we have that the COI, FZ, of a strain Z resulting from a cross between strains C and D followed by n generations of selfing is:

FZ = (1- HZ) fCD + HZ

= 1 – (1- fCD)(1/2)n. [2]

Hence, the COI of a strain Z which is the direct result of a cross (n=0) is the COP of its parents: FZ = fCD. The other relationship between the COP and COI is found by considering the COP, fZZ, of a strain Z with itself (Falconer, 1981 p.82):

fZZ = ½ (1 + FZ). [3]

Relationship of COP to additive genetic variance

The breeding value of a strain is the average effect of its alleles on the phenotype of its progeny. More practically, if a strain is crossed with strains taken at random from a population, then its breeding value with respect to this population is twice the mean deviation of the progeny of these matings from the population mean (Falconer, 1981 p. 106). These theoretical and practical definitions of breeding value coincide in the absence of epistasis but otherwise differ slightly (Falconer, 1981 p. 143).

Breeding values are important for crop improvement to determine the value of a strain as a potential parent. The variance of breeding values for a particular trait of all individuals in the population is the additive genetic variance of the population for that trait, σ2A, and it is a key determinant of the heritability, h, of the trait and the rate of response to selection for that trait in the population, R. In fact:

R = iσ2AP = ihσA [4]

where i is the intensity of selection(extreme fraction of the parental generation retained), σP is the phenotypic variance of the trait and h is the heritability, h = σAP.

The estimation of breeding values and genetic variance for breeding populations is therefore of considerable interest. Breeding populations usually comprise related strains so that the breeding value of one strain is correlated with that of another. In fact under conditions of random mating and independent inheritance of unselected loci, the Coefficient of Relationship between strains (2xCOP) is the additive genetic correlation so that the additive genetic covariance of strains P and Q is 2fPQσ2A (Lynch and Walsh, 1998, p 144). Similarly, the additive by additive component of epistatic genetic variance between strains P and Q is (2fPQ)2σ2AA where σ2AA is the additive by additive component of epistatic genetic variance for the trait in the population. The matrix of elements 2fPQ for all st pairs of strains in a population is called the relationship matrix (Falconer, 1981 p. 142) or the numerator relationship matrix (Mrode, 1996, p24).

Examples from Rice and Wheat

To illustrate the computation and visualization of COP matrices two examples are computed. In the first example COP values between rice cultivars derived from lines bred by the International Rice Research Institute and released by the Philippine seed authorities, Table 1, are computed and subjected to pattern analysis. In the second example, COP values for 19 historic cultivars of wheat released in Mexico, Table 2, and their 87 founder ancestors are computed and analysed to study their additive genetic relationships and the contribution of founders to cultivars.

RESULTS

Management of Pedigree Information

The development of freely available crop information systems capable of routine and uniform maintenance of pedigree information linked all the way back to information on genetic resources and landrace cultivars provides the missing step in the regular exploitation of COP values in crop breeding programs.

The International Crop Information System (ICIS, McLaren et al, 2005) is one such community-developed system generalizing the International Wheat Information System (IWIS) developed at CIMMYT during the 1980s (Fox and Skovmand, 1996). The Genealogy Management System of ICIS records pedigree, nomenclature and chronology of breeding lines and genetic resources and updates these records automatically during germplasm maintenance or breeding operations.

Pedigree information is stored independently of nomenclature making it uniform and amenable to computer manipulation. Strains are uniquely identified by germplasm identifiers (GID) and relate to source strains or progenitors depending on method of production. Methods are divided into three classes, maintenance methods (such as regeneration) designed to maintain genetic composition of a strain, generative methods (such as crossing) designed to increase genetic diversity and derivative methods (such as selection) designed to reduce and refine genetic diversity. The data model for genealogy information in ICIS is shown in Figure 1.

Strains produced by generative methods are linked to one or more progenitor strains: one progenitor in the case of mutants, two for bi-parental crossing schemes and more for dispersed mating schemes such as reciprocal recurrent selection. Strains produced by maintenance or derivative methods are linked to exactly one source strain and one group strain. The source strain is the immediately preceding generation and the group strain is the most recent generative generation in the pedigree of a strain. The source, or source and group may be unknown.

Information on nomenclature, origin and chronology are linked to the GID. Also users can define any attributes to be recorded and linked to GID. For example breeding objectives or conditions or intellectual property status of a strain could be stiored.

Computation of COP values by BROWSE

The BROWSE application of ICIS is a program which interacts with an ICIS database implementation for any crop to facilitate management and analysis of pedigree information. The computation of the COP between two strains in BROWSE commences with the extraction of the complete pedigree trees for two lines into a table with one row for each ancestor indicating: (1) its identity, (2) whether it is maintenance, derivative or generative germplasm, (3) the rows in the table for the source and group strains of the current ancestor if it is a derivative or maintenance strain or (4) the number of its progenitors, and (5) the row for the first progenitor if it is generative (further progenitors occur on subsequent rows of the table). As each ancestor is extracted, the table is checked to see whether that strain has already been tabulated as an ancestor of the first strain or from a different branch of the pedigree of the current strain. If so, pointers are set to the existing row and extraction of the branch is terminated.

The second step is to determine the order of the rows (ancestors) in the pedigree. This order is the largest number of generative steps from the current ancestor to a terminal ancestor via any of its progenitors. Terminal ancestors are those without progenitor information and in complete pedigrees should comprise landrace cultivars or wild relatives. Ancestors produced by derivative or maintenance methods have the same order as their group strains.

The third step is to use the recursive algorithm of equation 1 to accumulate COP values from terminal ancestors together with the relationships of equations 2 and 3 to incorporate inbreeding effects. An example is illustrated in Figure 2.

Contributions from Generative Processes

If P and Q are strains derived from different generative processes with known progenitors as in Figure 2, then Equation 1 is used. Although it is symmetrical in P and Q so that the parents of either could be used to obtain the right hand expansion, it is implemented in BROWSE by expanding the strain with the highest order. This ensures that when the terminal ancestors are reached, the computations involve COP values between the same strain, or unrelated strains or crosses between unrelated strains, all of which are easily calculated. If any parent is unknown, then COPs involving that parent are taken as zero.

Equation1 is generalized in BROWSE to allow generative processes involving any number of progenitors: If Q has progenitors Q1, Q2 … Qm then fPQ = (1/m) ∑ fPQi. This allows, for example, Q to be produced by a reciprocal recurrent selection program involving random mating between m strains. The current implementation assumes equal contribution from the m strains; but facilities exist in ICIS to record unequal contributions and the BROWSE computation could be easily generalized to obtain a differently weighted average.

Contributions from Inbreeding

If P and Q are derived from different generative strains, such as different crosses, the gametes of P and Q are essentially a random sample of the gametes from their group strains. Hence, the COP between P and Q is unaffected by inbreeding of P or Q and so it is ignored.

To compute the COP of a derivative strain with itself, say Z in Figure 2, the number of derivative generations from Z to its generative group, A ´ B, is counted and the inbreeding coefficient, FZ, of Z is determined by Equation 2 and converted to a COP using Equation 3.

BROWSE uses a parameter BTYPE to implement some assumptions in the case of incomplete pedigree records. The user should set BTYPE = 1 for self-pollinating species and 0 otherwise. If the progenitors of a strain are unknown, then FZ is set to BTYPE. This occurs most frequently when Z is derived from a landrace or traditional cultivar and corresponds to an assumption of full inbreeding for self-pollinating crops and no inbreeding for others. Similarly, if Z traces back to a single progenitor, such as a mutant strain, then FZ = BTYPE.

If there is a break in the records of source strains back to the group strain, as with strain B in Figure 2, then the strain is assumed to be an F4 derived breeding line so that FB = 1 – (1/2)4 = 15/16.

For strains which are sister lines derived from the same group, such as R and Q in Figure 2, the effect of inbreeding depends on the inbreeding coefficient of the most recent common ancestor, Z. If Z is the group strain of R and Q, then there is no effect of inbreeding. If incomplete pedigree records prevent the determination of a common ancestor, then R and Q are assumed to have diverged at the group strain. Otherwise, the gametes of R and Q can be regarded as a random sample of the possible gametes of Z and inbreeding since strain Z does not affect the COP. The inbreeding coefficient of Z and hence the COP between R and Q are computed as above with the same assumptions for strains with incomplete pedigrees.

COP matrices for all pairs of strains in a germplasm list

COP values are often required for all pairs of strains in a germplasm list. The matrix of these values indexed by strain is obviously symmetric with COPs for all strains with themselves on the diagonal. BROWSE implements this use-case by using the list processing facilities of ICIS. It extracts ancestors for all strains in the list into a single table as with pairs of strains. The program has a limit of 20,000 distinct ancestors for a single germplasm list. Since breeding populations are often highly related, this is not usually a constraint. The relatedness also implies that the same component ancestral COPs would be computed many times in calculating COPs for all pairs from the list. This is avoided by storing intermediate COP values for re-use in a sparse matrix with dimensions equal to the number of distinct ancestral strains. The COP algorithm checks this store whenever computations are required and writes to it whenever they have had to be executed.

Clearly, only one half of the COP matrix need be computed, and BROWSE outputs the lower triangular part, by rows as a list to the screen and to a text file. BROWSE also outputs this lower triangular matrix by rows in sections of ten columns at a time to the text file.

The inverse COP matrix is often required for mixed linear model analysis. BROWSE computes the eigen structure of the COP matrix pursuant to its inversion, and prints the eigen values and the first four eigen vectors of the COP matrix before inversion. The eigen vectors can be used to visualize and analyze the pedigree structure of the germplasm list.

Eigen values smaller than 1.0E-9 are taken to indicate singularities in the COP matrix (which occur, for example if the same strain occurs more than once in the list). The inverse COP matrix is computed from the eigen structure according to the formula:

[5]

where li are the eigen values and ui the eigen vectors of A.

If A is singular, a G-Inverse A- is computed by omitting terms in the sum for which li < 1.0E-9.

The lower triangular part of the inverse (or G-inverse) is printed to the text output file by rows in sections of ten columns, and as a list by rows. The list contains row number, column number, matrix value, row-GID, column-GID for each cell of the lower triangular part of the matrix. This can readily be extracted and used in statistical software such as ASREML (Gilmour et al., 2006).

Visualization of COP matrices

The eigen structure of the of the COP matrix used to obtain the inverse in Equation 5 can be used to construct ordination plots showing approximations to the additive genetic relationship between strains. The plotted points (√λ1ui1, √λ2ui2) in two dimensions or (√λ1ui1, √λ2ui2, √λ3ui3) in three dimensions display the rank 2 or rank 3 approximation to the COP matrix. The quality of the approximation can be gauged from the proportion of the generalized variance (sum of all the eigen values) accounted for by the first two or three eigen values. The approximation to fij shown in the plots is the inner product between the plotted vectors: ^f ij= (λ1ui1uj1+ λ2ui2uj2) or ^f ij= (λ1ui1uj1+ λ2ui2uj2+ λ3ui3uj3) and this is just √(^f ii ^f jj) Cos(θij) where θij is the angle subtended by the points for strains i and j at the origin. Hence if the diagonal elements of the COP matrix are constant, as in the case where the strains are all inbred lines then Cos(θij) is approximately proportional to the additive genetic correlation between strains i and j.

The COP matrix is a proximity matrix – the larger the values the closer the entities. Gower’s formula, dij = fij/√(fiifjj), can be used to transform it into a dissimilarity matrix and then cluster analysis by such methods as Ward’s agglomerative hierarchical clustering can be used to determine groups of closely related strains. The joint interpretation of the ordination plots and the cluster analysis is pattern analysis and is a powerful tool for summarizing high dimensional distance relationships.

The set of COP values between a strain which is a breeding line or cultivar and its terminal or founder strains which are often landrace cultivars of unknown parentage is called a Mendelgram. They give the proportion of alleles at unselected loci in the lines which are IBD from the founder and hence can be used to show the contribution of early germplasm to modern cultivars. These COP values for n lines with m distinct founders can be arranged in a Mendelgram Matrix, M, which contains information on the patterns of contribution of founders to lines. It can be subjected to Pattern Analysis to summariza and visualize these relationships. The n x m matrix M is factored by Singular Value Decomposition into three components, UΛV, where the columns of the n x p matrix U contain the left eigen vectors of M, and the rows of the p x m matrix V contain the right eigen vectors of M and the p x p diagonal matrix Λ contains the p non zero singular values of M, λk for k =1, 2, … p (p <= min(m,n)) in decreasing order of magnitude. The first two eigne vectors of each mode, lines and founders, are used to form a biplot. For the lines, spokes are plotted from the origin to the points (√λ1ui1, √λ2ui2), i=1, 2 .. n and the points (√λ1v1j, √λ2v2j) j=1, 2 … m. To the extent that the rank-two approximation ^mij= (λ1ui1v1j+ λ2ui2v2j) is close to M (as measured by the proportion of the generalized variance accounted for by the first two singular values), this is a picture of the Mendelgram Matrix. The projection of the founder point j onto the line spoke i is: ^mij/√( λ1ui12 + λ2ui22) so that the order of projections onto the spoke for any line gives the order of magnitudes of the contributions for the founters to that line. The greater the angle between the spokes for two lines, the more different will be this pattern of contributions.

The Euclidean distance between all pairs of lines (or founders) can be computed from the inner product of the rows (or columns) of the Mendelgram Matrix. These distance matrices can be subjected to cluster analysis to identify groups of lines receiving similar contributions from founders and groups of founders contributing similar proportions of alleles to lines. These groups can be interpreted with the biplot to give a Pattern Analysis of the Mendelgram Matrix.

COP Analysis for the rice cultivars

The coordinates (√λ1u1i, √λ2u2i) for each cultivar, i, in Table 1 are plotted in Fig 1. The rank-two approximation to the COP matrix accounts for 32% of the generalized variance (sum of the eigen values). To the extent that this is a good approximation, this is a picture of the additive genetic relationships between the cultivars. IR28, IR29 and IR34 at the top of Figure 1 are highly related being sister lines from the same cross IR 2061. IR8 sits squarely in the center of the group and the vertical axis (Vector 2) measures genetic distance from IR8.

COP analysis of the wheat cultivars

Figure 2 shows a similar plot for the 19 wheat cultivars released in Mexico. In this case, however, the first three axes have been plotted to capture 48% of the generalized variance.

DISCUSSION

The BROWSE application and the ICIS database system make the routine computation of COP matrices possible for crop improvement programs.

The COP matrices or their inverses can be used for structural analysis of breeding nurseries, cross prediction strategies and for increasing the precision of estimates of breeding values. With database tools like ICIS and BROWSE linked to statistical software which is now available, this can be a routine part of any breeding program.

The ICIS software, including BROWSE, is freely available and published for use or further development through ICIS projects on CropForge at www.cropforge.org under the GNU open source license.

REFERENCES

Bernardo, R. 2002. Breeding for Quantitative Traits in Plants. Stemma Press. Woodbury, Minnesota, USA.

Burgueño, J., J. Crossa, P.L. Cornelius, R. Trethowan, G. McLaren and A. Krishnamachari. Modeling Additive ´ Environment and Additive ´ Additive ´ Environment Using Genetic Covariances of Relatives of wheat Genotypes. Crop Sci. (In Press).

Cowen, N.M. and K.J. Frey. 1978. Relationship Between Genealogical Distance and Breeding Behaviour in Oats (Avena Sativa L.). Euphytica 36:413-424.

Cox, T.S., Y.T. Kiang, M.B. Gorman, and D.M. Rodgers. 1985. Relationship Between Coefficient of Parentage and Genetic Similarity Indices in the soybean. Crop Sci. 25:529-532.

Crossa, R, J. Burgueño, P.L. Cornelius, G. McLaren, R. Trethowan, and A. Krishnamachari. 2006. Modeling Genotype ´ Environment Interaction Using Additive Genetic Covariance of Relatives for Predicting Breeding Values of Wheat Genotypes. Crop Sci. 46:1722-1733.

Dilday, R.H. 1990. Contribution of Ancestral Lines in the Development of New Cultivars of Rice. Crop Sci. 30:905-911.

Falconer, D.S. 1981. Introduction to Quantitative Genetics. Longman Group Ltd. Second Edition.

Fox, P.N. and B. Skovmand. The International Crop Information System (ICIS) – Connects Genebank to Breeder to Farmer’s Field. CAB International 1996. Plant Adaptation and Crop Improvement (eds. M. Cooper and G.L. Hammer).

Gilmour, A.R., Gogel, B.J., Cullis, B.R. and Thompson, R. 2006. ASReml User Guide Release 2.0. Department of Primary Industries, NSW, Australia

Hanson, W.D. 1994. Distance Statistics and Interpretation of Southern States Regional Soybean Tests. Crop Sci. 34:1498-1504.

Henderson, C.R. A simple methof for computing the inverse of a numerator relationship matrix used in predicting of breeding values. Biometrics 32: 69-83.

Lefort-Buson, M., Y. Dattee, and B. Guillot-Lemoine. 1986. Heterosis and Genetic Distance in Rapeseed (Brassica napus L.): use of Kinship Coefficient. Genome 29:11-18.

Lynch, M. And Walsh, B. 1998. Genetics and analysis of quantitative traits. Sinauer Associates, Massachusetts, U.S.A.

McLaren, C.G., R. Bruskiewich, A.M. Portugal, and A. B. Cosico. 2005. The International Rice Information system. A Platform for Meta-Analysis of Rice Crop Data. Plant Physiology 139:637-642.

Mrode, R.A. Linear Models for the Prediction of Animal Breeding Values. CAB International 1996.

Murphy, J.P., T.S. Cox, and D.M. Rodgers. 1986. Cluster Analysis of Red Winter Wheat Cultivars Based Upon Coefficients of Parentage. Crop Sci. 26:672-.

Sneller, C.H. 1994. Pedigree Analysis of Elite Soybean Lines. Crop Sci. 34:1515-1522.

Sneller, C.H. 1994. SAS Program for Calculating Coefficient of Parentage. Crop Sci. 34:1679-1680.

Souza, E., P.N. Fox, D. Byerlee, and B. Skovmand. 1994. Spring Wheat Diversity in Irrigated Areas of Two Developing Countries. Crop Sci. 34:774-783.

Tinker, N.A. and Mather, D.E. 1993. KIN:Software for computing kinship coefficients. J. Hered. 84:238.

Wright, S. 1922. Coefficients of inbreeding and relationship. Am. Naturalist 56:330-338



CLUSTER

ENTRYCD

SOURCE

DESIGNATION

4

IR 29

IRRI

IR 2061-464-4-14-1

4

IR 34

IRRI

IR 2061-213-2-17

4

IR 28

IRRI

IR 2061-214-3-8-2

4

RC 28

IRRI

IR 56381-139-2-2, PSB RC 28, AGNO

4

IR 64

IRRI

IR 18348-36-3-3

4

IR 54

IRRI

IR 5853-162-1-2-3

4

IR 52

IRRI

IR 5853-118-5

4

IR 68

IRRI

IR 28224-3-2-3-2

4

IR 65

IRRI

IR 21015-196-3-1-3

 

 

 

 

1

IR 22

IRRI

IR 579-160-2

1

IR 08

IRRI

IR 8-288-3

1

IR 05

IRRI

IR 5-47-2

1

IR 46

IRRI

IR 2058-78-1-3-2

1

RC 01

IRRI

IR 10147-113-5-1-1-5, PSB RC 1,MAKILING

1

RC 18

IRRI

IR 51672-62-2-1-1-2-3, PSB RC 18, ALA

1

IR 48

IRRI

IR 4570-83-3-3

1

IR 43

IRRI

IR 1529-430-3

1

IR 24

IRRI

IR 661-1-140-3-117,IR 661-1-140-3

1

IR 45

IRRI

IR 2035-242-1

1

RC 26 H

IRRI

IR 64616 H, PSB RC 26 H, MAGAT

1

IR 74

IRRI

IR 32453-20-3-2-2

1

IR 70

IRRI

IR 28228-12-3-1-1-2

1

IR 44

IRRI

IR 2863-38-1-2

1

RC 64

IRRI

IR 59552-21-3-2-2, PSB RC 64, KABACAN

1

RC 02

IRRI

IR 32809-26-3-3, PSB RC 2,NAHALIN

 

 

 

 

3

RC 10

IRRI

IR 50404-57-2-2-3, PSB RC 10, PAGSANJAN

3

IR 66

IRRI

IR 32307-107-3-2-2

3

RC 32

 

C 3563-B-5-1, PSB RC 32,JARO

3

RC 52

IRRI

IR 59682-132-1-1-2, PSB RC 52, GANDARA

3

IR 50

IRRI

IR 9224-117-2-3-3-2

3

RC 30

IRRI

IR 58099-41-2-3, PSB RC 30, AGUS

3

IR 72

IRRI

IR 35366-90-3-2-1-2

3

RC 20

IRRI

IR 57301-195-3-3, PSB RC 20, CHICO

3

RC 04

IRRI

IR 41985-111-3-2-2, PSB RC 4, MOLAWIN

3

IR 42

IRRI

IR 2071-586-5-6-3

3

IR 36

IRRI

IR 2071-625-1-252

3

IR 58

IRRI

IR 9752-71-3-2

3

IR 62

IRRI

IR 13525-43-2-3-1-3-2

3

IR 60

IRRI

IR 13429-299-2-1-3

3

IR 56

IRRI

IR 13429-109-2-2-1

 

 

 

 

2

IR 30

IRRI

IR 2153-159-1-4

2

IR 20

IRRI

IR 532-E576,IR 532 E 576

2

IR 26

IRRI

IR 1541-102-7

2

IR 38

IRRI

IR 2070-423-2-5-6

2

IR 40

IRRI

IR 2070-414-3-9

2

IR 32

IRRI

IR 2070-747-6-3-2

Table 2. Wheat cultivars released in Mexico derived from CIMMYT breeding lines.

ENTRYID

ENTRYCD

SOURCE

DESIGNATION

1

 PI62

MEXICO

PITIC 62

2

 7C66

MEXICO

SIETE CERROS T 66

3

 SO

MEXICO

SONALIKA

4

 YE70

MEXICO

YECORA F 70

5

 JU73

MEXICO

JUPATECO F 73

6

 TO73

MEXICO

TORIM F 73

7

 CO75

MEXICO

COCORAQUE F 75

8

 NA76

 

NACOZARI F 76

9

 PA76

MEXICO

PAVON F 76

10

 CI79

MEXICO

CIANO T 79

11

 GE81

MEXICO

GENARO T 81

12

 SE82

MEXICO

SERI M 82

13

 OP85

MEXICO

OPATA M 85

14

 OA86

MEXICO

OASIS F 86

15

 AN88

MEXICO

ANGOSTURA F 88

16

 AT

MEXICO

ATTILA

17

 BC88

MEXICO

BACANORA T 88

18

 BV92

MEXICO

BAVIACORA M 92

19

 WE

 

WEAVER


 


NO

DESIG

SOURCE

GID

 

NO

DESIG

SOURCE

GID

1

CID 4508

CIMMYT

1

 

44

CID 202

CIMMYT

878

2

POLYSSU

 

3

 

45

AGUILERA

CIMMYT

882

3

TURCO

 

4

 

46

CID 4302

CIMMYT

893

4

RIETI

ITALY

8

 

47

GEHUN

 

922

5

WILZEN

 

9

 

48

CID 138896

CIMMYT

937

6

CID 8

CIMMYT

11

 

49

CID 4921

CIMMYT

960

7

AKAGOMUGHI

 

18

 

50

CID 712

CIMMYT

1211

8

KARLIK-112

 

24

 

51

CID 713

CIMMYT

1213

9

KRYMKI

 

25

 

52

DU TOIT

SOUTH AFRICA

1396

10

MEDITERRANEAN

 

26

 

53

HYBRID ENGLISH

 

1402

11

BARLETA

 

27

 

54

LC.CI05878

CIMMYT

1422

12

AMERICANO 44D

 

28

 

55

CID 2617

CIMMYT

1561

13

CID 11

CIMMYT

29

 

56

INALLETTABLE TODARO 38

 

1612

14

CID 12

CIMMYT

30

 

57

LOST NATION

 

1746

15

CID 13

CIMMYT

31

 

58

NEW PUSA 773

 

2035

16

CID 14

CIMMYT

32

 

59

UTTAR PRADESH

 

2036

17

KENIA RF 324

 

39

 

60

CID 33802

CIMMYT

2045

18

CID 33

CIMMYT

49

 

61

CID 5903

CIMMYT

2076

19

CID 2931

CIMMYT

60

 

62

CENT.PETKUSER

 

2078

20

IUMILLO

 

65

 

63

GENTIL ROSSO

 

2099

21

CID 24

INDIA

67

 

64

SAUMUR DE MARS

 

2219

22

CID 25

CIMMYT

68

 

65

CID 520

CIMMYT

2411

23

RED EGYPTIAN(PI-170925)

CIMMYT

69

 

66

TRIGO MEXICANO

CIMMYT

2481

24

CID 29

CIMMYT

70

 

67

Z2-50

 

2502

25

CID 30

CIMMYT

71

 

68

NATORAL

CIMMYT

2521

26

DARUMA

 

72

 

69

CID 33808

CIMMYT

3202

27

CID 5798

CIMMYT

100

 

70

PB TYPE 9

 

3210

28

DIEHL

 

137

 

71

8B

 

3211

29

CID 5905

CIMMYT

142

 

72

CID 33792

CIMMYT

4084

30

CID 23330

CIMMYT

213

 

73

CID 5792

CIMMYT

5000

31

EGYPT NA101

 

219

 

74

CID 1630

CIMMYT

5081

32

CID 122749

CIMMYT

452

 

75

KENYA 2

 

5112

33

CID 122748

CIMMYT

454

 

76

CID 1812

CIMMYT

5460

34

NOE

 

458

 

77

B4946.A.4.18.2.1Y

CIMMYT

6380

35

CID 33805

CIMMYT

460

 

78

NEW PUSA 876

 

8378

36

HICKLING'S PROLIFIC

 

462

 

79

CID 3830

CIMMYT

8707

37

CHIDDAM BLANC

 

468

 

80

CID 4318

CIMMYT

9299

38

CID 122745

CIMMYT

480

 

81

CID 4397

CIMMYT

9394

39

IDUNA

SWEDEN

504

 

82

BLE-CARRE-DE-SICILE

 

235235

40

CID 1551

CIMMYT

508

 

83

RED-STRAW

ENGLAND

250534

41

SUPREZA

CIMMYT

579

 

84

RICHELLE-DE-NAPLES

ITALY

250782

42

CID 4202

CIMMYT

657

 

85

PUGH'S RUSTPROOF

CALIFORNIA

4982321

43

CID 3254

CIMMYT

863

 

86

HALYCHANKA

UKRAINE

4982372

 

 

 

 

 

87

CI 122362

AUSTRALIA

4982916


Figure 1. Data model for the ICIS Genealogy Management System


Figure 2. Example computation of a COP value


Figure 1.








[*] Crop Research Informatics Laboratory, International Rice Research Institute, DAPO Box 7777, Manila, Philippines.

[**] School of Land Crop and Food Scieneces, University of Queensland, Brisbane, Qld. 4072, Australia.

[***] Crop Research Informatics Laboratory, International Maize and Wheat Improvement Center (CIMMYT), El Batan, Texcoco, Mexico CP56130.