1,720,996 research outputs found
Analysis of different genotyping and selection strategies in laying hen breeding programs
Abstract Background Genomic selection has become an integral component of modern animal breeding programs, having the potential to improve the efficiency of layer breeding programs both by obtaining higher prediction accuracies and reducing the generation interval, particularly for males, who cannot be phenotyped for sex-limited traits such as laying performance. In the current study, we investigate different strategies to reduce the generation interval either for both sexes or only for the male side of the breeding scheme based on stochastic simulation using the software MoBPS. Additionally, prediction accuracies based on varying proportions of genotyping and phenotype- and pedigree-based selection as well as genomic breeding values are compared. Results Selection of hens based on estimated breeding values, either pedigree-based or genomic, increased genetic gain compared to selection based on phenotypes only. The use of two time-shifted subpopulations with exchange of males between subpopulations to reduce the generation interval on the male side led to significantly higher genetic gains. Reducing the generation interval for both males and females was only efficient when population sizes were maintained, which result in doubling of the number of females to genotype and phenotype within the same time frame compared to the scenarios with the longer generation intervals. Although substantially higher gains were obtained by in particular pedigree-based selection of females and a reduction of generation intervals this led to substantially greater rates of inbreeding per year. The use of a genomic relationship matrix in breeding value estimation instead of a pedigree-based relationship matrix not only increased genetic gains but also reduced inbreeding rates. The use of optimum contribution selection led to basically the same genetic gains as without it but reduced inbreeding rates. However, overall differences obtained with optimal contribution selection were small compared to differences caused by the other effects that were considered. Conclusions The reduction of the generation interval on the male side by the use of genomic estimated breeding values was highly beneficial. Reduction of the generation interval on the female side was only beneficial when a high proportion of hens was genotyped and housing capacities were increased. On the female side of a layer breeding program, selection based on pedigree-based estimated breeding values was inferior to phenotypic selection, as it resulted in a substantial increase in inbreeding rates.Open-Access-Publikationsfonds 202
Optimization of recurrent rapid cycle breeding in maize for sustained long-term genetic improvement via stochastic simulations
Abstract In recent years, the turnover of germplasm in plant breeding has substantially increased as the use of genomic information allows for earlier selection and the integration of controlled growing environments reduces the time to reach a particular growing stage. However, high generation turnover and intensive selection of lines before own yield trials are performed come at the risk of a drastic reduction of genetic diversity and lower prediction accuracies. To this end, we investigate strategies to cope with these challenges in a maize rapid cycle breeding scheme using stochastic simulations employing the software MoBPS. We find that genetic gains soon reach a plateau when only the original breeding material is phenotyped. Updating the training data set via additional phenotyping of crosses or doubled haploid lines ensures long-term progress with a gain of 6.80 / 6.95 genetic standard deviations for the performance as a cross / DH after 30 cycles of breeding compared to 3.40 / 4.28 without additional phenotyping. Introducing genetic material from outside the breeding pool to introduce novel genetic diversity led to a further increase to 9.34 / 7.89 genetic standard deviations. In particular, for the management of genetic diversity, further modifications of breeding program design are analyzed to optimize the number of selected lines per cycle and to account for the relatedness of F2 plants in the selection using the software AlphaMate. Balancing short-term genetic gains with long-term diversity preservation is crucial for sustainable breeding. MoBPS provides a tool for quantifying these effects and provides solutions specific to the respective breeding program.Open-Access-Publikationsfonds 202
Using Local Convolutional Neural Networks for Genomic Prediction
The prediction of breeding values and phenotypes is of central importance for both livestock and crop breeding. In this study, we analyze the use of artificial neural networks (ANN) and, in particular, local convolutional neural networks (LCNN) for genomic prediction, as a region-specific filter corresponds much better with our prior genetic knowledge on the genetic architecture of traits than traditional convolutional neural networks. Model performances are evaluated on a simulated maize data panel (n = 10,000; p = 34,595) and real Arabidopsis data (n = 2,039; p = 180,000) for a variety of traits based on their predictive ability. The baseline LCNN, containing one local convolutional layer (kernel size: 10) and two fully connected layers with 64 nodes each, is outperforming commonly proposed ANNs (multi layer perceptrons and convolutional neural networks) for basically all considered traits. For traits with high heritability and large training population as present in the simulated data, LCNN are even outperforming state-of-the-art methods like genomic best linear unbiased prediction (GBLUP), Bayesian models and extended GBLUP, indicated by an increase in predictive ability of up to 24%. However, for small training populations, these state-of-the-art methods outperform all considered ANNs. Nevertheless, the LCNN still outperforms all other considered ANNs by around 10%. Minor improvements to the tested baseline network architecture of the LCNN were obtained by increasing the kernel size and of reducing the stride, whereas the number of subsequent fully connected layers and their node sizes had neglectable impact. Although gains in predictive ability were obtained for large scale data sets by using LCNNs, the practical use of ANNs comes with additional problems, such as the need of genotyping all considered individuals, the lack of estimation of heritability and reliability. Furthermore, breeding values are additive by design, whereas ANN-based estimates are not. However, ANNs also comes with new opportunities, as networks can easily be extended to account for additional inputs (omics, weather etc.) and outputs (multi-trait models), and computing time increases linearly with the number of individuals. With advances in high-throughput phenotyping and cheaper genotyping, ANNs can become a valid alternative for genomic prediction
Development and validation of a horse reference panel for genotype imputation
BACKGROUND: Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. RESULTS: Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. CONCLUSIONS: The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00740-8
MoBPS - Modular Breeding Program Simulator
The R-package MoBPS provides a computationally efficient and flexible framework to simulate complex breeding programs and compare their economic and genetic impact. Simulations are performed on the base of individuals. MoBPS utilizes a highly efficient implementation with bit-wise data storage and matrix multiplications from the associated R-package miraculix allowing to handle large scale populations. Individual haplotypes are not stored but instead automatically derived based on points of recombination and mutations. The modular structure of MoBPS allows to combine rather coarse simulations, as needed to generate founder populations, with a very detailed modeling of todays’ complex breeding programs, making use of all available biotechnologies. MoBPS provides pre-implemented functions for common breeding practices such as optimum genetic contributions and single-step GBLUP but also allows the user to replace certain steps with personalized and/or self-written solutions
Imputation of low-density marker chip data in plant breeding : Evaluation of methods based on sugar beet
Low-density genotyping followed by imputation reduces genotyping costs while still providing high-density marker information. An increased marker density has the potential to improve the outcome of all applications that are based on genomic data. This study investigates techniques for 1k to 20k genomic marker imputation for plant breeding programs with sugar beet (Beta vulgaris L. ssp. vulgaris) as an example crop, where these are realistic marker numbers for modern breeding applications. The generally accepted ‘gold standard’ for imputation, Beagle 5.1, was compared with the recently developed software AlphaPlantImpute2 which is designed specifically for plant breeding. For Beagle 5.1 and AlphaPlantImpute2, the imputation strategy as well as the imputation parameters were optimized in this study. We found that the imputation accuracy of Beagle could be tremendously improved (0.22 to 0.67) by tuning parameters, mainly by lowering the values for the parameter for the effective population size and increasing the number of iterations performed. Separating the phasing and imputation steps also improved accuracies when optimized parameters were used (0.67 to 0.82). We also found that the imputation accuracy of Beagle decreased when more low-density lines were included for imputation. AlphaPlantImpute2 produced very high accuracies without optimization (0.89) and was generally less responsive to optimization. Overall, AlphaPlantImpute2 performed relatively better for imputation whereas Beagle was better for phasing. Combining both tools yielded the highest accuracies
Simulation Study on the Integration of Health Traits in Horse Breeding Programs
Osteochondrosis dissecans (OCD) is a degenerative disease of the cartilage leading to osseous fragments in the joints. It is important in horse breeding both from an animal welfare and an economic perspective. To study adequate breeding strategies to reduce OCD prevalence, a lifelike simulation of the breeding program of German Warmblood horses was performed with the R package MoBPS. We simulated complex breeding schemes of riding horses with different selection steps and realistic age structure, mimicking the German situation. As an example, osseous fragments in fetlock and hock joints were considered. Different scenarios, either using threshold selection, index selection or genomic index selection, respectively, were compared regarding their impact on health and performance traits. A rigorous threshold selection as well as the integration of OCD in a selection index at the stage of stallion licensing and chosen frequency of use in breeding cases on a selection index that includes breeding values for OCD traits performed best on a comparable level. Simply integrating OCD in this breeding value was less effective in terms of OCD reduction. Scenarios with a higher reduction of OCD also showed a slightly reduced improvement in the riding horse performance traits
Optimization Strategies to Adapt Sheep Breeding Programs to Pasture-Based Production Environments: A Simulation Study
Strong differences between the selection (indoor fattening) and production environment (pasture fattening) are expected to reduce genetic gain due to possible genotype-by-environment interactions (G × E). To investigate how to adapt a sheep breeding program to a pasture-based production environment, different scenarios were simulated for the German Merino sheep population using the R package Modular Breeding Program Simulator (MoBPS). All relevant selection steps and a multivariate pedigree-based BLUP breeding value estimation were included. The reference scenario included progeny testing at stations to evaluate the fattening performance and carcass traits. It was compared to alternative scenarios varying in the progeny testing scheme for fattening traits (station and/or field). The total merit index (TMI) set pasture-based lamb fattening as a breeding goal, i.e., field fattening traits were weighted. Regarding the TMI, the scenario with progeny testing both in the field and on station led to a significant increase in genetic gain compared with the reference scenario. Regarding fattening traits, genetic gain was significantly increased in the alternative scenarios in which field progeny testing was performed. In the presence of G × E, the study showed that the selection environment should match the production environment (pasture) to avoid losses in genetic gain. As most breeding goals also contain traits not recordable in field testing, the combination of both field and station testing is required to maximize genetic gain
Optimization of breeding program design through stochastic simulation with evolutionary algorithms
The effective planning and allocation of resources in modern breeding programs is a complex task. Breeding program design and operational management have a major impact on the success of a breeding program and changing parameters such as the number of selected/phenotyped/genotyped individuals in the breeding program will impact genetic gain, genetic diversity, and costs. As a result, careful assessment and balancing of design parameters is crucial, taking into account the trade-offs between different breeding goals and associated costs. In a previous study, we optimized the resource allocation strategy in a dairy cattle breeding scheme via the combination of stochastic simulations and kernel regression, aiming to maximize a target function containing genetic gain and the inbreeding rate under a given budget. However, the high number of simulations required when using the proposed kernel regression method to optimize a breeding program with many parameters weakens the effectiveness of such a method. In this work, we are proposing an optimization framework that builds on the concepts of kernel regression but additionally makes use of an evolutionary algorithm to allow for a more effective and general optimization. The key idea is to consider a set of potential parameter settings of the breeding program, evaluate their performance based on stochastic simulations, and use these outputs to derive new parameter settings to test in an iterative procedure. The evolutionary algorithm was implemented in a Snakemake workflow management system to allow for efficient scaling on large distributed computing platforms. The algorithm achieved stabilization around the same optimum with a massively reduced number of simulations. Thereby, the incorporation of class variables and accounting for a higher number of parameters in the optimization framework leads to substantially reduced computing time and better scaling for the desired optimization of a breeding program.Open-Access-Publikationsfonds 202
- …
