Identification of the myostatin locus (MSTN) as having a ... - UC Davis

effect on optimum racing distance in the Thoroughbred horse in ... distance (7 furlongs and under compared to 8–10 furlongs) identified a locus on ECA...

2 downloads 598 Views 230KB Size
doi:10.1111/j.1365-2052.2010.02126.x

Identification of the myostatin locus (MSTN) as having a major effect on optimum racing distance in the Thoroughbred horse in the USA M. M. Binns, D. A. Boehler and D. H. Lambert Equine Analysis, PO Box 4570, Midway, KY 40347, USA

Summary

One hundred and eighty-nine Thoroughbred horses that had won Graded Stakes races in North America were genotyped with the Illumina Equine SNP50 bead chip. Association tests using PLINK to determine whether any SNPs were associated with optimum racing distance (7 furlongs and under compared to 8–10 furlongs) identified a locus on ECA18 that was statistically significant ()log 10 EMP2 = 1.63) at the genome-wide level following permutation analysis (10 000 permutations). Bioinformatic analysis revealed that the two ECA18 SNPs with the highest statistical significance spanned the MSTN (myostatin) locus. Mutations in myostatin in several mammalian species have been associated with increased muscling, with a preferential increase in fast glycolytic type IIB fibres, which would increase power potential. Thoroughbred horses that race over sprint distances, which are 5–7 furlongs, are often characterized by impressive hind quarter musculature, strongly suggesting that the association observed between the ECA18 SNPs and optimum race distance is mediated through MSTN. Keywords association test, horse, myostatin, racing distance, SNP array, Thoroughbred.

Introduction The Thoroughbred horse breed was established in England in the early 1700s based on crosses between stallions of Arabian origin and a poorly defined, possibly indigenous, group of mares. The founder population was small; all contemporary males trace back to one of three stallions, the Godolphin Arabian, the Byerley Turk and the Darley Arabian, whilst on the female side about seventy foundation mares have been identified (Willett 1970). A stud book for Thoroughbred horses was initiated in 1791, and pedigree records for the breed, which now numbers about 500 000 horses, are maintained by Thoroughbred registries worldwide. Initially, the new breed raced over extended distances; match races in which two horses competed against each other over 4–6 miles were common, and horses frequently had to run multiple heats during a day to determine the winner. TodayÕs English Classic races, the St. Leger (run over 1¾ miles), the Derby and Oaks (run over one and a half miles) and the 1000 and 2000 Guineas (run over Address for correspondence M. Binns, Equine Analysis, PO Box 4570, Midway, KY 40347, USA. E-mail: [email protected] Accepted for publication 20 September 2010

154

1 mile) were established in 1776, 1780,1779, 1809 and 1814, respectively. The decreasing distances over time suggest that there was already a trend towards racing over shorter distances by 1800, and the fashion has continued to the point that, in North America, races at one and a half miles and over are now frequently referred to as ÔmarathonsÕ and are relatively uncommon. In North America, whilst the classic Triple Crown races, The Kentucky Derby, The Preakness and the Belmont are run over one and a quarter 13=16 and one and a half miles, respectively, there are important races at 7 furlongs and under, and there has been an emphasis on selecting for speed through the fast pace at which most US races are run when compared to European races. In general, contemporary Thoroughbred horses are classified by optimum distance into sprint, middle distance and stayers, and distinct pedigrees specific for each distance have been developed by breeders. The biochemistry of this distance specialization has remained obscure. An examination of the muscle fibre types present in sprinters and stayers and the response of muscle to different training regimes has been hampered by the difficulty of reliably sampling the same muscle location in different individuals (Snow & Guy 1980; Sewell et al. 1994). Mysostatin (MSTN) is a member of the transforming growth factor b family that is expressed in skeletal muscle

Ó 2010 The Authors, Journal compilation Ó 2010 Stichting International Foundation for Animal Genetics, 41 (Suppl. 2), 154–158

Identification of MSTN and acts as a negative regulator of muscle mass. Several natural mutations in MSTN have been identified in cattle (Grobet et al. 1997, 1998; Kambadur et al. 1997, McPherron & Lee 1997; Marchitelli et al. 2003) sheep (Clop et al. 2006), mice (Nishi et al. 2002) and recently human (Schuelke et al. 2004), and these result in a double-muscling phenotype. In addition, a myostatin mutation in whippet dogs was shown to be associated with increased muscling and athletic ability, although no equivalent mutation was found in the closely related greyhound breed (Lee 2007; Mosher et al.2007). This study seeks to investigate whether there are any specific loci in the equine genome associated with distance specialization in the Thoroughbred horse using a whole genome association approach.

Methods DNA and genotyping DNA was prepared from blood and hair samples using standard protocols from 189 Thoroughbred horses that had won Graded Stakes races in North America. The samples represent a random selection of horses that won Graded Stakes races, and no full siblings were included in the sample group. DNA was quantitated with a Quant-iT PicoGreen dsDNA assay kit (Invitrogen) using a Nanodrop ND3300 fluorospectrometer following the Nanodrop protocol ÔPicoGreen assay for dsDNAÕ.

SNP genotyping and data analysis SNP genotyping was carried out with the Equine SNP50 bead chip (Illumina) using standard protocols at the Genotyping Shared Resource facility, Mayo Clinic, Rochester, MN. Quality control of the genotyping data was assessed in several ways. Duplicate samples (n = 5) revealed a concordance in genotype calls >99.9%. Using PLINK (Purcell et al. 2007), Mendelian inheritance errors in a trio set of sire/ dam/offspring were <0.01%. Data were subjected to the SEXCHECK option to ensure that sample fidelity had been maintained, and the MC1R mutation that produces chestnut coat colour in horses was mapped as a control to its expected location on ECA3 using the SNP genotype data and the coat colour phenotypes of the samples tested. Horses were stratified into three distance classes: 7 furlongs and under, 8–10 furlongs and over 10 furlongs, based on their individual optimum race performances obtained from Equibase. That is, the distance of the horseÕs best performance in Graded Stakes races was recorded, and if an individual won the same class of Graded Stakes race at more than one distance, the longer distance was recorded. Association test statistics and permutations to assess statistical significance were determined using the ASSOC and MPERM functions of PLINK.

Results One hundred and eighty-nine horses that had won Graded Stakes races were classified into those that had won races at seven or less furlongs (n = 125), those that had won at eight to ten furlongs (n = 44) and those that had won races at more than 10 furlongs (n = 20). Whole genome SNP genotypes were determined using the Illumina Equine SNP50 bead chip system, which comprises 54 602 SNPs uniformly distributed across the equine genome. The ASSOC option was run to compare the genotypes of the 7 furlongs and under group to the 8–10 furlong group. The results are presented in Fig. 1a, where it can be seen that the SNPs with the most significant P values (smallest P value = 2.77E-07) are located on ECA18. To confirm the significance of this association, a permutation analysis with 10 000 permutations was carried out using the MPERM option in PLINK. The results of the permutation analysis, reported as max (T) empirical P values (EMP2) are presented in Fig. 1b. Genome-wide significance for the horse is estimated to be )log 10 EMP2 = 1.3. Two markers on ECA18, BIEC2-417274 (located at 65 868 604 bp) and BIEC2-417495 (located at 67 186 093 bp) reach genomewide significance with )log 10 EMP2 values of 1.3829 and 1.6345, respectively. The location of these markers on ECA18, together with other SNPs in the immediate region that show association but do not reach genome-wide significance, is shown in Fig. 1c. It should be noted from Fig. 1b that a second locus on ECA17, whilst not reaching genome-wide significance ()log 10 EMP2 = 0.9179), is high enough to suggest that a second locus may make a contribution to optimum racing distance. A second analysis compared the 7 furlongs and under group of horses (n = 125) to the group that won at over 10 furlongs (n = 20). Whilst the same region on ECA18 was again identified, there was insufficient power in this analysis to reach genome-wide significance with the highest )log 10 EMP2 = 0.261, because of the limited number of samples in the longer-distance category. The genotype and allele frequencies for BIEC2-417495 (the SNP with the highest statistical significance) were estimated from a set of 464 Thoroughbred horses that had not been selected on the basis of distance. The A allele that is associated with a longer optimum distance was present at a frequency of 0.473, whilst the G allele associated with sprinting was present at a frequency of 0.527. The genotype frequencies for the AA:AG:GG genotypes in this population were 0.189:0.567:0.244, respectively.

Discussion The association analysis comparing Thoroughbred horses that won Graded Stakes races at a distance of 7 furlongs or less with those that won Graded Stakes races over a distance of 8–10 furlongs identified a single locus on ECA18 that

Ó 2010 The Authors, Journal compilation Ó 2010 Stichting International Foundation for Animal Genetics, 41 (Suppl. 2), 154–158

155

156

Binns et al. (a)

(b)

(c)

Figure 1 The myostatin (MSTN) locus shows a significant association with optimum winning distance. (a) Results of genome-wide association test for optimum distance, 7 furlongs vs. 8–10 furlongs. The x axis shows each individual equine chromosome, where 1–31 are ECA1–ECA31 and 32 is the X chromosome. The y axis shows the )log 10 P value. (b) Results of permutation analysis (10 000 permutations) showing a genome-wide significant signal on ECA18. The x axis shows the individual chromosomes as above, and the y axis shows the )log 10 EMP2 value. (c) Results of the association test for each ECA18 SNP plotted along the length of the chromosome to locate the locus associated with optimum distance. The x axis shows the base pairs along ECA18, and the y axis shows the )log 10 P value. A strong signal is observed at approximately 67 000 000 base pairs.

achieved genome-wide significance following permutation analysis. Whilst a clear association peak involving a large number of SNPs towards the distal end of ECA18 is evident in

Fig. 1c, only two SNPs, BIEC2-417274 (located at 65 868 604 bp) and BIEC2-417495 (located at 67 186 093 bp), reach genome-wide significance with )log 10 EMP2 values of 1.3829 and 1.6345, respectively. When the genes in this

Ó 2010 The Authors, Journal compilation Ó 2010 Stichting International Foundation for Animal Genetics, 41 (Suppl. 2), 154–158

Identification of MSTN region of ECA18 were examined using the Ensembl genome browser, it was noted that the MSTN (myostatin) gene is located between the two significant SNPs at 66 490 208– 66 495 180 bp. Traditionally, the Thoroughbred horse industry has regarded horses that race over less than seven furlongs, between seven and twelve furlongs, and longer than twelve furlongs as different, referring to them as sprinters, middledistance and stayers, respectively. The pedigrees of horses racing over these different distances tend to be different and breeders have frequently tried to produce middle-distance horses by breeding sprinters to stayers. The horses in the different distance groups also often possess distinct body morphologies, with the sprinters being heavily muscled, whilst the middle-distance and staying horses have a lighter musculature. This phenotypic difference suggests that myostatin might play a role in the distance preference of Thoroughbred horses. The increased muscling seen with myostatin mutations does not affect all muscle fibre types evenly, and there is a preferential increase in fast glycolytic type IIB fibres (Deveaux et al.2001; Hennebry et al. 2009), which is consistent with improved sprinting ability. The identification of SNP markers linked to the myostatin loci that are associated with athletic ability in the Thoroughbred horse represents the first gene influencing athletic ability identified in horses. Whilst this paper was in preparation, Hill et al. (2010) published research showing that a polymorphism in the myostatin gene is strongly associated with best race distance in elite Thoroughbred horses. In their study, myostatin was selected as a candidate gene, and whilst the intronic mutation used is in the myostatin gene, they state that the results do not preclude the functional variant being in a neighbouring gene. They state that there are no other plausible candidates within 2 Mb upstream or downstream of the myostatin gene, although it is unclear whether the functional mutation could be further away than this. In addition, they do not appear to have controlled for the possibility of population stratification between the sprinting and staying populations by testing association with other genes unlikely to be involved in muscle physiology. In this study, following a whole genome-wide association analysis strategy, we identify a single locus on ECA18 that is significant at the genome-wide level following permutation analysis. In our analysis, population stratification between sprinters and middle-distance horses is internally controlled for, and excluded, by the remaining 54 600 SNPs uniformly distributed across the genome. No other region of the genome showed a genome-wide statistically significant difference, indicating that no population stratification exists. The results clearly have implications for the Thoroughbred racing and breeding industries, although it is important to stress that the markers identified do not in any way define the likely racing ÔclassÕ of the individual, merely over

what distance the horse is likely to be most effective. It would be wrong to directly equate the preference for sprint distances with speed, and myostatin does not represent Ôthe speed geneÕ. There will be many horses that, whilst homozygous for the sprint genotype, are poor racehorses and would be beaten over sprint distances by better racehorses with one of the other two genotypes.

Acknowledgements We thank Michael and Diane Blowen, Old Friends, Kentucky (http://www.oldfriendsequine.org/) for help with samples.

Conflicts of interest The authors have not declared any conflicts of interest.

References Clop A., Marcq F., Takeda H. et al. (2006) A mutation creating a potential illegitimate microRNA target site in the myostatin gene affects muscularity in sheep. Nature Genetics 38, 813–8. Deveaux V., Cassar-Malek I. & Picard B. (2001) Comparison of contractile characteristics of muscle from Holstein and doublemuscled Belgian Blue foetuses. Comparative Biochemistry and Physiology. Part A, Molecular & Integrative Physiology 131, 21–9. Grobet L., Martin L.J., Poncelet D. et al. (1997) A deletion in the bovine myostatin gene causes the double-muscled phenotype in cattle. Nature Genetics 17, 71–4. Grobet L., Poncelet D., Royo L.J., Brouwers B., Pirottin D., Michaux C., Me´nissier F., Zanotti M., Dunner S. & Georges M. (1998) Molecular definition of an allelic series of mutations disrupting the myostatin function and causing double-muscling in cattle. Mammalian Genome 9, 210–3. Hennebry A., Berry C., Siriett V., OÕCallaghan P., Chau L., Watson T., Sharma M. & Kambadur R. (2009) Myostatin regulates fibertype composition of skeletal muscle by regulating MEF2 and MyoD gene expression. American Journal of Physiology. Cell Physiology 296, C525–34. Epub. Hill E.W., Gu J., Eivers S.S., Fonseca R.G., McGivney B.A., Govindarajan P., Orr N., Katz L.M. & MacHugh D. (2010) A sequence polymorphism in MSTN predicts sprinting ability and racing stamina in thoroughbred horses. PLoS ONE 5, e8645. Kambadur R., Sharma M., Smith T.P. & Bass J.J. (1997) Mutations in myostatin (GDF8) in double-muscled Belgian Blue and Piedmontese cattle. Genome Research 7, 910–6. Lee S.J. (2007) Sprinting without myostatin: a genetic determinant of athletic prowess. Trends in Genetics 23, 475–7. Marchitelli C., Savarese M.C., Crisa` A., Nardone A., Marsan P.A. & Valentini A. (2003) Double muscling in Marchigiana beef breed is caused by a stop codon in the third exon of myostatin gene. Mammalian Genome 14, 392–5. McPherron A.C. & Lee S.J. (1997) Double muscling in cattle due to mutations in the myostatin gene. Proceedings of the National Academy of Sciences of the United States of America 94, 12457–61. Mosher D.S., Quignon P., Bustamante C.D., Sutter N.B., Mellersh C.S., Parker H.G. & Ostrander E.A. (2007) A mutation in the

Ó 2010 The Authors, Journal compilation Ó 2010 Stichting International Foundation for Animal Genetics, 41 (Suppl. 2), 154–158

157

158

Binns et al. myostatin gene increases muscle mass and enhances racing performance in heterozygote dogs. PLoS Genetics 3, e79. Nishi M., Yasue A., Nishimatu S., Nohno T., Yamaoka T., Itakura M., Moriyama K., Ohuchi H. & Noji S. (2002) A missense mutant myostatin causes hyperplasia without hypertrophy in the mouse muscle. Biochemical and Biophysical Research Communications 293, 247–51. Purcell S., Neale B., Todd-Brown K. et al. (2007) PLINK: a toolset for whole-genome association and population-based linkage analysis. American Journal of Human Genetics, 81, 559–75. Schuelke M., Wagner K.R., Stolz L.E., Hu¨bner C., Riebel T., Ko¨men W., Braun T., Tobin J.F. & Lee S.J. (2004) Myostatin mutation

associated with gross muscle hypertrophy in a child. The New England Journal of Medicine 350, 2682–8. Sewell D.A., Harris R.C. & Marlin D.J. (1994) Skeletal muscle characteristics in 2 year-old race-trained thoroughbred horses. Comparative Biochemistry and Physiology. Comparative Physiology 108, 87–96. Snow D.H. & Guy P.S. (1980) Muscle fibre type composition of a number of limb muscles in different types of horse. Research in Veterinary Science 28, 137–44. Willett P. (1970) The Thoroughbred. Weidenfeld and Nicholson, London, England.

Ó 2010 The Authors, Journal compilation Ó 2010 Stichting International Foundation for Animal Genetics, 41 (Suppl. 2), 154–158