TY - JOUR
T1 - The value of expanding the training population to improve genomic selection models in tetraploid potato
AU - Sverrisdóttir, Elsa
AU - Sundmark, Ea Høegh Riis
AU - Johnsen, Heidi Øllegaard
AU - Kirk, Hanne Grethe
AU - Asp, Torben
AU - Janss, Luc
AU - Bryan, Glenn
AU - Nielsen, Kåre Lehmann
N1 - The research is part of Centre for Genomic Selection in Animals and Plants (GenSAP) and is funded by The Danish Council of Strategic Research (Research grant # 12-132452). The MASPOT population used in this study was provided from the MASPOT project (2012–2017), funded by The Danish Council for Strategic Research (Research grant # 11-116190).
PY - 2018/8/6
Y1 - 2018/8/6
N2 - Genomic selection (GS) is becoming increasingly applicable to crops as the genotyping costs continue to decrease, which makes it an attractive alternative to traditional selective breeding based on observed phenotypes. With genome-wide molecular markers, selection based on predictions from genotypes can be made in the absence of direct phenotyping. The reliability of predictions depends strongly on the number of individuals used for training the predictive algorithms, particularly in a highly genetically diverse organism such as potatoes; however, the relationship between the individuals also has an enormous impact on prediction accuracy. Here we have studied genomic prediction in three different panels of potato cultivars, varying in size, design, and phenotypic profile. We have developed genomic prediction models for two important agronomic traits of potato, dry matter content and chipping quality. We used genotyping-by-sequencing to genotype 1,146 individuals and generated genomic prediction models from 167,637 markers to calculate genomic estimated breeding values with genomic best linear unbiased prediction. Cross-validated prediction correlations of 0.75–0.83 and 0.39–0.79 were obtained for dry matter content and chipping quality, respectively, when combining the three populations. These prediction accuracies were similar to those obtained when predicting performance within each panel. In contrast, but not unexpectedly, predictions across populations were generally lower, 0.37–0.71 and 0.28–0.48 for dry matter content and chipping quality, respectively. These predictions are not limited by the number of markers included, since similar prediction accuracies could be obtained when using merely 7,800 markers (<5%). Our results suggest that predictions across breeding populations in tetraploid potato are presently unreliable, but that individual prediction models within populations can be combined in an additive fashion to obtain high quality prediction models relevant for several breeding populations.
AB - Genomic selection (GS) is becoming increasingly applicable to crops as the genotyping costs continue to decrease, which makes it an attractive alternative to traditional selective breeding based on observed phenotypes. With genome-wide molecular markers, selection based on predictions from genotypes can be made in the absence of direct phenotyping. The reliability of predictions depends strongly on the number of individuals used for training the predictive algorithms, particularly in a highly genetically diverse organism such as potatoes; however, the relationship between the individuals also has an enormous impact on prediction accuracy. Here we have studied genomic prediction in three different panels of potato cultivars, varying in size, design, and phenotypic profile. We have developed genomic prediction models for two important agronomic traits of potato, dry matter content and chipping quality. We used genotyping-by-sequencing to genotype 1,146 individuals and generated genomic prediction models from 167,637 markers to calculate genomic estimated breeding values with genomic best linear unbiased prediction. Cross-validated prediction correlations of 0.75–0.83 and 0.39–0.79 were obtained for dry matter content and chipping quality, respectively, when combining the three populations. These prediction accuracies were similar to those obtained when predicting performance within each panel. In contrast, but not unexpectedly, predictions across populations were generally lower, 0.37–0.71 and 0.28–0.48 for dry matter content and chipping quality, respectively. These predictions are not limited by the number of markers included, since similar prediction accuracies could be obtained when using merely 7,800 markers (<5%). Our results suggest that predictions across breeding populations in tetraploid potato are presently unreliable, but that individual prediction models within populations can be combined in an additive fashion to obtain high quality prediction models relevant for several breeding populations.
KW - Chipping quality
KW - Dry matter
KW - Genomic prediction
KW - Genomic selection
KW - Potato breeding
KW - Solanum tuberosum
UR - http://www.scopus.com/inward/record.url?scp=85051858445&partnerID=8YFLogxK
U2 - 10.3389/fpls.2018.01118
DO - 10.3389/fpls.2018.01118
M3 - Article
C2 - 30131817
AN - SCOPUS:85051858445
SN - 1664-462X
VL - 9
SP - 1
EP - 14
JO - Frontiers in Plant Science
JF - Frontiers in Plant Science
M1 - 1118
ER -