|Fitting and validating the genomic evaluation model to Polish Holstein-Friesian cattle.|
|Jump to Full Text|
|PMID: 21553085 Owner: NLM Status: MEDLINE|
|The aim of the study was to fit the genomic evaluation model to Polish Holstein-Friesian dairy cattle. A training data set for the estimation of additive effects of single nucleotide polymorphisms (SNPs) consisted of 1227 Polish Holstein-Friesian bulls. Genotypes were obtained by the use of Illumina BovineSNP50 Genotyping BeadChip. Altogether 29 traits were considered: milk-, fat- and protein- yields, somatic cell score, four female fertility traits, and 21 traits describing conformation. The prediction of direct genomic values was based on a mixed model containing deregressed national proofs as a dependent variable and random SNP effects as independent variables. The correlations between direct genomic values and conventional estimated breeding values estimated for the whole data set were overall very high and varied between 0.98 for production traits and 0.78 for non return rates for cows. For the validation data set of 232 bulls the corresponding correlations were 0.38 for milk-, 0.37 for protein-, and 0.32 for fat yields, while the correlations between genomic enhanced breeding values and conventional estimated breeding values for the four traits were: 0.43, 0.44, 0.31, and 0.35. This model was able to pass the interbull validation criteria for genomic selection, which indicates that it is realistic to implement genomic selection in Polish Holstein-Friesian cattle.|
|Joanna Szyda; Andrzej Zarnecki; Tomasz Suchocki; Stanisław Kamiński|
Related Documents :
|18550045 - Patient oriented and robust automatic liver segmentation for pre-evaluation of liver tr...
19647765 - Mathematical models for describing the shape of the in vitro unstretched human crystall...
17354845 - Multilevel segmentation and integrated bayesian model classification with an applicatio...
21078575 - High capacity color barcodes: per channel data encoding via orientation modulation in e...
9609815 - Quantitative methanol-burning lung model for validating gas-exchange measurements over ...
23026145 - Queens defense by workers in the highly polygynous ant crematogaster pygmaea (hymenopte...
|Type: Journal Article; Research Support, Non-U.S. Gov't Date: 2011-05-07|
|Title: Journal of applied genetics Volume: 52 ISSN: 2190-3883 ISO Abbreviation: J. Appl. Genet. Publication Date: 2011 Aug|
|Created Date: 2011-07-11 Completed Date: 2011-11-21 Revised Date: 2013-06-30|
Medline Journal Info:
|Nlm Unique ID: 9514582 Medline TA: J Appl Genet Country: England|
|Languages: eng Pagination: 363-6 Citation Subset: IM|
|Department of Animal Genetics, Wrocław University of Environmental and Life Sciences, Kożuchowska 7, Wrocław, Poland. email@example.com|
|APA/MLA Format Download EndNote Download BibTex|
Cattle / genetics*
Fertility / genetics
Milk / secretion
Polymorphism, Single Nucleotide
Quantitative Trait, Heritable*
Validation Studies as Topic
Journal ID (nlm-ta): J Appl Genet
Publisher: Springer-Verlag, Berlin/Heidelberg
© The Author(s) 2011
Received Day: 30 Month: 9 Year: 2010
Revision Received Day: 5 Month: 4 Year: 2011
Accepted Day: 6 Month: 4 Year: 2011
Electronic publication date: Day: 7 Month: 5 Year: 2011
pmc-release publication date: Day: 7 Month: 5 Year: 2011
Print publication date: Month: 8 Year: 2011
Volume: 52 Issue: 3
First Page: 363 Last Page: 366
PubMed Id: 21553085
Publisher Id: 47
|Fitting and validating the genomic evaluation model to Polish Holstein-Friesian cattle|
Address: +48-71-3205846 +48-71-3205758 firstname.lastname@example.org
1Department of Animal Genetics, Wrocław University of Environmental and Life Sciences, Kożuchowska 7, 51–631 Wrocław, Poland
2Institute of Animal Breeding and Genetics, National Research Institute of Animal Production, Krakowska 1, 32–083 Kraków, Poland
3Department of Animal Genetics, University of Warmia and Mazury, Oczapowskiego 5, 10–718 Olsztyn, Poland
Recently many countries have incorporated the genomic information, in a form of thousands of single nucleotide polymorphism (SNP) genotypes originating from a microarray technology, into their genetic evaluation systems (Hayes et al. 2009, VanRaden, 2008). It has become evident that the genomic information is now an important part of a routine evaluation of genetic merit in dairy cattle (Liu, 2010). In this paper we describe the results of fitting and validating the genomic selection model to the population of Polish Holstein-Friesian dairy cattle.
The data set used as a training data set for the estimation of additive effects of SNPs consisted of 1227 Polish Holstein-Friesian bulls. The selection of bulls for genotyping was based on two major criteria: on the accuracy of their conventionally estimated breeding values and on the representativeness, in terms of genetic merit, of the selected bulls for the population of all dairy bulls active in Poland. The first criterion was quantified through the number of the effective daughter contribution (EDC) associated with the estimated breeding value (EBV) for milk yield of each bull. Traits were represented by EBVs, which were deregressed using the method of Jairath et al. (1998) based on the national proofs corresponding to the release from February 2010. Altogether 29 traits were considered, comprising three production traits and a somatic cell score - originating from a random regression test day model as well as four female fertility traits and 21 traits describing type and conformation - originating from an animal model. The traits are listed in online resource 1. Genotypes were generated by the use of Illumina BovineSNP50 Genotyping BeadChip, which consists of 54 001 SNPs. The applied SNP selection criteria comprised polymorphism, expressed by the minor allele frequency (MAF), with the minimum MAF of 0.01, and technical quality of a SNP, expressed by the minimum call rate of 90% within the analyzed sample of bulls. Average call rate obtained for our data was high and amounted to 99.66% and 99.75% for all SNPs and for selected SNPs, respectively. For DGV estimation 46 267 SNPs were selected, yielding 56 502 470 bull-SNP genotypes in total for milk yield. For the other traits the total number of bull-SNP genotypes was lower since not all of the genotyped bulls had EBVs available.
The following mixed model was used to estimate the additive effects of the selected Nsnp = 46 267 SNPs for up to Na = 1227 bulls with genotypes: [
DGV is defined as the sum of additive effects of SNPs estimated from the above model: [
Descriptive statistics regarding the analyzed traits were summarized in online resource 1, which shows that for each of the analyzed traits DGV had similar, but somewhat lower standard deviations than EBV, which was expected since EBV were used as a dependent variable in the SNP effect estimation model. For the training data set estimated correlations between EVB and DGV, were very high and varied between 0.98 for milk yield, 0.78 and 0.81 for non return rates at 56 days for cows and heifers, respectively - traits with the lowest heritability of 0.02.
The highest positive correlations between SNP estimates were observed for interval from calving to first insemination and days open (0.89), size and stature (0.80), as well as between milk and protein yields (0.76), the negative correlations were highest between overall feet and leg score and real leg set (−0.35), between body depth and udder depth (−0.26) and between rear leg rear view and rear leg set (−0.21). Most of the values (except the correlation between body depth and udder depth) well correspond with the estimates obtained for the Polish Holstein-Friesian breed based on conventional, multivariate models (Żarnecki et al., 2003). Manhattan plots of SNP effect estimates for milk and fat yields along the genome were presented in online resource 2. In order to enable comparison of SNP effects, their estimates were transformed to a standard normal distribution and were presented as absolute values. The highest SNP estimate for milk yield amounted to 3.67 kg, for fat yield 0.20 kg, and 0.0002 day for non return rate at 56 days of heifers. The main goal of genetic evaluation is not to identify particular loci with considerable effects on a trait, but to assess the sum of all possible additive effects across the genome. However, from the geneticists' perspective, a closer examination of effects if particular SNPs and their links to bovine genomic features are of great interest. Estimates of the effect of SNP on milk and fat yield on BTA14 in a proximity of DGAT1 - a gene having very strong effect on both traits (Grisart et al., 2002) were shown on online resource 3. Our result confirmed that DGAT1 locus has a large effect on milk and fat yields and provides empirical evidence of the validity of SNP effect estimation procedure.
In order to formally validate the genomic selection model the procedure recommended by Interbull (Mäntysaari et al. 2010) was followed. For this purpose the original, training data set was partitioned into an estimation data set consisting of older bulls and a validation data set consisting of younger bulls. The validation data set consisted of 232 bulls, while the remaining 984 bulls were used for the estimation of SNP effects. Validation was done for milk, fat and protein yields. The linear regression coefficients for regression of dEBV on PA and GEBV for the three traits were summarised in online resource 4. In general, models involving PA had much lower slopes than models using GEBV as an independent variable, indicating that the latter models had better predictive ability (Fig. 1). The best prediction, indicated by the slope of 0.96 which is closest to the expected value of 1.00, was estimated for regression of dEBV on GEBV for milk yield, and the worst, with a slope of 0.26 was obtained for regression of dEBV on PA for fat yield. The correlations with EBV (Table 1) were lowest for PA (from 0.14 to 0.26), middle for DGV (from 0.32 to 0.38), and generally the highest when both sources of information were combined into GEBV (from 0.31 to 43). One exception was fat yield, for which the highest correlation was obtained using DGV.
Many simulated as well as real data sets have been analysed in order to compare predictive ability of various models used for the estimation of SNP effects (Clark et al. 2010; Konstantinov and Hayes 2010; Mrode et al. 2010; Shepherd et al. 2010). Summarising the results of those studies one can conclude that no marked differences in predictive abilities can be observed between models. Instead factors related to the trait genetic background (heritability, number of loci with large effects) as well as the structure of the training data set play a key role in determining correlations between the predicted and true genetic merits (Calus, 2010). Results obtained in our study clearly show that a much better accuracy of prediction for selection candidates can be achieved by using a combined information from SNP genotypes (through DGV) and parental EBVs (through PA) instead of the conventional approach based entirely on the EBVs of ancestors.
In our study a low reliability of DGV was obtained for the young selection candidates. It is much lower than values reported for production traits by Hayes et al. (2009), Lund and Su (2009), and VanRaden et al. (2009), which vary between 0.45 and 0.73. The main reason for low values obtained in our study was, as indicated by Hayes et al. (2009) and Habier et al. (2010), a relatively small training data set and corresponding low genetic relatedness between the training and the selection candidate data sets (only 59% of bulls from the validation data set had sires in a training data set). Still, the obtained accuracy of DGV and GEBV was much higher than the accuracy of PA. Moreover, based on the results for protein yield, the predictive ability of the genomic model described here was positively validated by the International Bull Evaluation Service (Interbull and International Bull Evaluation 2010) in August 2010. Consequently, the model presented in this study has been recognised within European Union states by the Directorate of Animal Health and Welfare of the European Commission as a valid procedure for genomic evaluation.
Below is the link to the electronic supplementary material.Click here for additional data file (13353_2011_47_MOESM1_ESM.pdf)
Click here for additional data file (13353_2011_47_MOESM2_ESM.pdf)
Click here for additional data file (13353_2011_47_MOESM3_ESM.pdf)
Click here for additional data file (13353_2011_47_MOESM4_ESM.pdf)
The project was carried out within the framework of MASinBULL consortium which is supported financially by the Animal Breeding and Insemination Center in Bydgoszcz, Poland.
Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
|Calus MPL. Genomic breeding value prediction: methods and proceduresAnimalYear: 2010415716410.1017/S1751731109991352|
|Clark SA, Hickey JM, van der Werf JHJ (2010) How Would Different Models of Genetic Variation Affect Genomic Selection? Proceedings of the 9th WCGALP, Leipzig, Germany|
|Grisart B,Coppieters W,Farnir F,Karim L,Ford C,Berzi P,Cambisano N,Mni M,Reid S,Simon P,Spelman R,Georges M,Snell R. Positional candidate cloning of a QTL in dairy cattle: identification of a missense mutation in the bovine DGAT1 gene with major effect on milk yield and compositionGenome ResYear: 20021222223110.1101/gr.22420211827942|
|Hayes BJ,Bowman PJ,Chamberlain AJ,Goddard ME. Genomic selection in dairy cattle: progress and challengesJ Dairy SciYear: 20099243344310.3168/jds.2008-164619164653|
|Habier D,Tetens J,Seefried FR,Lichtner P,Thaller G. The impact of genetic relationship information on genomic breeding values in German Holstein cattleGenet Sel EvolYear: 201042510.1186/1297-9686-42-520170500|
|Henderson CR (1984) Applications of Linear Models in Animal Breeding, University of Guelph|
|Interbull, International Bull Evaluation Service (2010) http://www.interbull.org|
|Jairath L,Dekkers JCM,Schaeffer LR,Liu Z,Burnside EB,Kolstad B. Genetic Evaluation for Herd Life in CanadaJ Dairy SciYear: 19988155056210.3168/jds.S0022-0302(98)75607-39532510|
|Konstantinov KV, Hayes BJ (2010) Comparison of BLUP and Reproducing kernel Hilbert spaces methods for genomic prediction of breeding values in Australian Holstein Friesian cattle. Proceedings of the 9th WCGALP, Leipzig, Germany|
|Legarra A,Misztal I. Technical Note: Computing Strategies in Genome-Wide SelectionJ Dairy SciYear: 20089136036610.3168/jds.2007-040318096959|
|Liu Z (2010) Dairy cattle genetic evaluation enhanced with genomic information. Proceedings of the 9th WCGALP, Leipzig, Germany|
|Lund MS,Su G. Genomic selection in the Nordic countriesInterbull BullYear: 2009392942|
|Mäntysaari E,Liu Z,VanRaden P. Interbull Validation Test for Genomic EvaluationsYear: 2010BulletinInterbull|
|Mrode R, Coffey MP, Strandén I, Meuwissen THE, van Kaam JBCHM, Kearney JF, Berry DP (2010) A Comparison Of Various Methods For The Computation Of Genomic Breeding Values Of Dairy Bulls Using Software At Genomicselection.net. Proceedings of the 9th WCGALP, Leipzig, Germany|
|Shepherd R. Meuwissen THE, Woolliams J (2010) A Fast EM Algorithm For Genomic Selection. Proceedings of the 9th WCGALP, Leipzig, Germany|
|Strandén I,Garrick DJ. Technical note: Derivation of equivalent computing algorithms for genomic predictions and reliabilities of animal meritJ Dairy SciYear: 2009922971297510.3168/jds.2008-192919448030|
|VanRaden PM. Efficient methods to compute genomic predictionsJ Dairy SciYear: 2008914414442310.3168/jds.2007-098018946147|
|VanRaden PM,Tassell CP,Wiggans GR,Sonstegard TS,Schnabel RD,Taylor JF,Schenkel FS. Invited Review: Reliability of genomic predictions for North American Holstein bullsJ Dairy SciYear: 200992162410.3168/jds.2008-151419109259|
|Żarnecki A,Morek-Kopeć M,Jagusiak W. Genetic parameters of linearly scored conformation traits of Polish Black-and-White cowsJ Anim Feed SciYear: 200312689696|
[Figure ID: Fig1]
Predictive ability for PA and GEBV expressed as a linear regression for 232 bulls from the validation data set
Pearson correlation coefficients between EBV from 2010 and PA/DGV/GEBV together with the reliability of DGV and GEBV, calculated based on daughter information from 2004 for the validation data set. Nv is the number of bulls in the validation data set
|Correlation with EBV2010||Reliability|
Keywords: Keywords Dairy cattle, Genomic selection, Model validation, Single nucleotide polymorphism.
Previous Document: A case of bacteremia caused by Hafnia paralvei.
Next Document: Clinical results of radionuclide therapy of neuroendocrine tumours with 90Y-DOTATATE and tandem 90Y/...