Abstract
Preserving genetic diversity is pivotal for enhancing genetic improvement and facilitating adaptive responses to selection. This study focuses on identifying key genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs), while exploring the genomic evolutionary connectedness among seven Iranian horses representing five indigenous breeds: Caspian, Turkemen, DareShuri, Kurdish, and Asil. Using whole-genome resequencing, we generated 2.7 Gb of sequence data, with raw reads ranging from 1.2 Gb for Caspian horses to 0.38 Gb for Turkoman horses. Post-filtering, approximately 1.9 Gb of reads remained, with ~ 1.5 Gb successfully mapped to the horse reference genome (EquCab3.0), achieving mapping rates between 76.4% (Caspian) and 98.35% (Turkoman). We identified 2,909,816 SNPs in Caspian horses, constituting around 0.1% of the genome. Notably, 71% of these SNPs were situated in intergenic regions, while 8.5 and 6.8% were located upstream and downstream, respectively. A comparative analysis of SNPs between Iranian and non-Iranian horse breeds showed that Caspian horses had the lowest number of shared SNPs with Turkoman horses. Instead, they showed a closer genetic relationship with DareShuri, Quarter, Arabian, Standardbred, and Asil breeds. Hierarchical clustering highlighted Caspian horses as a distinct cluster, underscoring their distinctive genomic signature. Caspian horses exhibit a unique genetic profile marked by an enrichment of private mutations in neurological genes, influencing sensory perception and awareness. This distinct genetic makeup shapes mating preferences and signifies a separate evolutionary trajectory. Additionally, significant non-synonymous single nucleotide polymorphisms (nsSNPs) in reproductive genes offer intervention opportunities for managing Caspian horses. These findings reveal the population genetic structure of Iranian horse breeds, contributing to the advancement of knowledge in areas such as conservation, performance traits, climate adaptation, reproduction, and resistance to diseases in equine science.
Similar content being viewed by others
Data availability
The datasets generated during and/or analyzed during the current study are not publicly available due to privacy policy of Agricultural Biotechnology Research Institute of Iran but are available from the corresponding author on reasonable request.
Change history
09 April 2024
The original online version of this article was revised: ORCID ID of the author Mehrshad Zeinalabedini has been corrected.
References
Ablondi M, Viklund Å, Lindgren G, Eriksson S, Mikko S (2019) Signatures of selection in the genome of Swedish warmblood horses selected for sport performance. BMC Genom 20:1–12. https://doi.org/10.1186/s12864-019-6079-1
Ablondi M, Dadousis C, Vasini M, Eriksson S, Mikko S, Sabbioni A (2020) Genetic diversity and signatures of selection in a native italian horse breed based on SNP data. Animals 10:1–15. https://doi.org/10.3390/ani10061005
Abyzov A, Urban AE, Snyder M, Gerstein M (2011) CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res 21:974–984. https://doi.org/10.1101/gr.114876.110
Al Abri MA, Brooks SA, Al-Saqri N, Alkharousi K, Johnson EH, Alqaisi O, Al-Rawahi A, Al Marzooqi W (2021) Investigating the population structure and genetic diversity of Arabian horses in Oman using SNP markers. Anim Genet 52:304–310. https://doi.org/10.1111/AGE.13056
Alipour H, Sharbatoghli M, Yazdi PE, Shahverdi A, Daneshzadeh MT, Afshani M, Mirian SJ, Hamidi H, Mohammadi AR, Valojerdi MR (2013) Pregnancy in the Caspian miniature horse using frozen semen cryopreserved with the EquiPRO CryoGuard freeze medium and customized freezing protocols. J Equine Vet Sci 33:266–271. https://doi.org/10.1016/j.jevs.2012.07.005
Amirinia C, Seyedabadi H, Banabazi MH, Kamali MA (2007) Bottleneck study and genetic structure of Iranian Caspian horse population using microsatellites. Pakistan J Biol Sci 10:1540–1543. https://doi.org/10.3923/pjbs.2007.1540.1543
Boeva V, Popova T, Bleakley K, Chiche P, Cappo J, Schleiermacher G, Janoueix-Lerosey I, Delattre O, Barillot E (2012) Control-FREEC: a tool for assessing copy number and allelic content using next-generation sequencing data. Bioinformatics 28:423–425. https://doi.org/10.1093/BIOINFORMATICS/BTR670
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. https://doi.org/10.1093/bioinformatics/btu170
Chen K, Wallis JW, McLellan MD, Larson DE, Kalicki JM, Pohl CS, McGrath SD, Wendl MC, Zhang Q, Locke DP, Shi X, Fulton RS, Ley TJ, Wilson RK, Ding L, Mardis ER (2009) BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods 6:677–681. https://doi.org/10.1038/NMETH.1363
Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM (2012) A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6:80–92. https://doi.org/10.4161/fly.19695
Criscione A, Mastrangelo S, D’Alessandro E, Tumino S, Di Gerlando R, Zumbo A, Marletta D, Bordonaro S (2022) Genome-wide survey on three local horse populations with a focus on runs of homozygosity pattern. J Anim Breed Genet 139:540–555. https://doi.org/10.1111/jbg.12680
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R (2011) The variant call format and VCFtools. Bioinformatics 27:2156–2158. https://doi.org/10.1093/bioinformatics/btr330
Doan R, Cohen ND, Sawyer J, Ghaffari N, Johnson CD, Dindot SV (2012) Whole-genome sequencing and genetic variant analysis of a quarter Horse mare. BMC Genom. https://doi.org/10.1186/1471-2164-13-78
Durward-Akhurst SA, Schaefer RJ, Grantham B, Carey WK, Mickelson JR, McCue ME (2021) Genetic variation and the distribution of variant types in the horse. Front Genet 12:1–11. https://doi.org/10.3389/fgene.2021.758366
Esdaile E, Avila F, Bellone RR (2022) Analysis of genetic diversity in the American Standardbred horse utilizing short tandem repeats and single nucleotide polymorphisms. J Hered 113:238–247. https://doi.org/10.1093/jhered/esab070
Fages A, Hanghøj K, Khan N, Gaunitz C, Seguin-Orlando A, Leonardi M, McCrory Constantz C, Gamba C, Al-Rasheid KAS, Albizuri S, Alfarhan AH, Allentoft M, Alquraishi S, Anthony D, Baimukhanov N, Barrett JH, Bayarsaikhan J, Benecke N, Bernáldez-Sánchez E, Berrocal-Rangel L, Biglari F, Boessenkool S, Boldgiv B, Brem G, Brown D, Burger J, Crubézy E, Daugnora L, Davoudi H et al (2019) Tracking five millennia of horse management with extensive ancient genome time series. Cell 177:1419-1435.e31. https://doi.org/10.1016/j.cell.2019.03.049
Feng CWA, Spiller C, Merriner DJ, O’Bryan MK, Bowles J, Koopman P (2017) SOX30 is required for male fertility in mice. Sci Rep 7:1–12. https://doi.org/10.1038/s41598-017-17854-5
Garrison E, Marth G (2012) Haplotype-based variant detection from short-read sequencing. arXiv 1207.3907, 1–9
Ghavi Hossein-Zadeh N, Ghorbani A (2018) Modeling the growth curves for body weight and some biometric traits in Caspian horses (Equus ferus caballus) using non-linear mixed models. Mamm Biol 93:5–12. https://doi.org/10.1016/j.mambio.2018.07.004
Handsaker RE, Van Doren V, Berman JR, Genovese G, Kashin S, Boettger LM, Mccarroll SA (2015) Large multiallelic copy number variations in humans. Nat Genet 47:296–303. https://doi.org/10.1038/ng.3200
Hatami-Monazah H, Pandit RV (1979) A cytogenetic study of the Caspian pony. J Reprod Fertil 57:331–333. https://doi.org/10.1530/jrf.0.0570331
Hendricks BL (2007) International encyclopedia of horse breeds. University of Oklahoma Press, Oklahoma
Höglund J (2009) The extinction vortex, is genetic variation related to extinction? Evol Conserv Genet. https://doi.org/10.1093/ACPROF:OSO/9780199214211.003.0001
Hosseini M, Shahrbabak HM, Zandi MB, Fallahi MH (2016) A morphometric survey among three Iranian horse breeds with multivariate analysis. Media Peternak 39:155–160. https://doi.org/10.5398/medpet.2016.39.3.155
Howe KL, Achuthan P, Allen James, Allen Jamie, Alvarez-Jarreta J, Ridwan Amode M, Armean IM, Azov AG, Bennett R, Bhai J, Billis K, Boddu S, Charkhchi M, Cummins C et al (2021) Ensembl 2021. Nucleic Acids Res 49:D884–D891. https://doi.org/10.1093/nar/gkaa942
Iqbal K, Chitwood JL, Meyers-Brown GA, Roser JF, Ross PJ (2014) RNA-Seq transcriptome profiling of equine inner cell mass and trophectoderm. Biol Reprod 90:1–9
Ivanković A, Bittante G, Konjačić M, Ugarković NK, Pećina M, Ramljak J (2021) Evaluation of the conservation status of the croatian posavina horse breed based on pedigree and microsatellite data. Animals 11:1–16. https://doi.org/10.3390/ani11072130
Jun JH, Cho YS, Hu H, Kim HM, Jho S, Gadhvi P, Park KM, Lim J, Paek WK, Han K, Manica A, Edwards JS, Bhak J (2014) Whole genome sequence and analysis of the marwari horse breed and its genetic origin. BMC Genom 15:1–10. https://doi.org/10.1186/1471-2164-15-S9-S4
Klecel W, Martyniuk E (2021) From the eurasian steppes to the roman circuses: a review of early development of horse breeding and management. Animals. https://doi.org/10.3390/ani11071859
Lawson JM, Shilton CA, Lindsay-mcgee V, Psifidi A, Wathes DC, Mestre A.M. De (2024) Does inbreeding contribute to pregnancy loss in thoroughbred horses ? Equine Vet J. https://doi.org/10.1111/evj.14057
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. https://doi.org/10.1093/bioinformatics/btp324
Librado P, Khan N, Fages A, Kusliy MA, Suchan T, Tonasso-Calvière L, Schiavinato S, Alioglu D, Fromentier A, Perdereau A, Aury JM, Gaunitz C, Chauvey L, Seguin-Orlando A, Der Sarkissian C, Southon J, Shapiro B, Tishkin AA, Kovalev AA, Alquraishi S, Alfarhan AH, Al-Rasheid KAS, Seregély T, Klassen L, Iversen R, Bignon-Lau O, Bodu P, Olive M, Castel JC, Boudadi-Maligne M, Alvarez N, Germonpré M, Moskal-del Hoyo M, Wilczyński J, Pospuła S, Lasota-Kuś A, Tunia K, Nowak M, Rannamäe E, Saarma U, Boeskorov G, Lōugas L, Kyselý R, Peške L, Bălășescu A, Dumitrașcu V, Dobrescu R, Gerber D, Kiss V, Szécsényi-Nagy A, Mende BG, Gallina Z, Somogyi K, Kulcsár G, Gál E, Bendrey R, Allentoft ME, Sirbu G, Dergachev V, Shephard H, Tomadini N, Grouard S, Kasparov A, Basilyan AE, Anisimov MA, Nikolskiy PA, Pavlova EY, Pitulko V, Brem G, Wallner B, Schwall C, Keller M, Kitagawa K, Bessudnov AN, Bessudnov A, Taylor W, Magail J, Gantulga JO, Bayarsaikhan J, Erdenebaatar D, Tabaldiev K, Mijiddorj E, Boldgiv B, Tsagaan T, Pruvost M, Olsen S, Makarewicz CA, Valenzuela Lamas S, Albizuri Canadell S, Nieto Espinet A, Iborra MP, Lira Garrido J, Rodríguez González E, Celestino S, Olària C, Arsuaga JL, Kotova N, Pryor A, Crabtree P, Zhumatayev R, Toleubaev A, Morgunova NL, Kuznetsova T, Lordkipanize D, Marzullo M, Prato O, Bagnasco Gianni G, Tecchiati U, Clavel B, Lepetz S, Davoudi H, Mashkour M, Berezina NY, Stockhammer PW, Krause J, Haak W, Morales-Muñiz A, Benecke N, Hofreiter M, Ludwig A, Graphodatsky AS, Peters J, Kiryushin KY, Iderkhangai TO, Bokovenko NA, Vasiliev SK, Seregin NN, Chugunov KV, Plasteeva NA, Baryshnikov GF, Petrova E, Sablin M, Ananyevskaya E, Logvin A, Shevnina I, Logvin V, Kalieva S, Loman V, Kukushkin I, Merz I, Merz V, Sakenov S, Varfolomeyev V, Usmanova E, Zaibert V, Arbuckle B, Belinskiy AB, Kalmykov A, Reinhold S, Hansen S, Yudin AI, Vybornov AA, Epimakhov A, Berezina NS, Roslyakova N, Kosintsev PA, Kuznetsov PF, Anthony D, Kroonen GJ, Kristiansen K, Wincker P, Outram A, Orlando L (2021) The origins and spread of domestic horses from the Western Eurasian steppes. Nature 598:634–640. https://doi.org/10.1038/s41586-021-04018-9
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA (2010) The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. https://doi.org/10.1101/gr.107524.110
Mousavi SF, Razmkabir M, Rostamzadeh J, Seyedabadi H-R, Naboulsi R, Petersen JL, Lindgren G (2023) Genetic diversity and signatures of selection in four indigenous horse breeds of Iran. Heredity (Edinb). https://doi.org/10.1038/s41437-023-00624-7
Orlando L, Aurélien G, Zhang G, Froese D, Albrechtsen A, Stiller M, Schubert M, Cappellini E, Petersen B, Moltke I, Johnson PLF, Fumagalli M (2013) Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature 4:74–78
Petersen JL, Mickelson JR, Cothran EG, Andersson LS, Axelsson J, Bailey E, Bannasch D, Binns MM, Borges AS, Brama P, da Câmara Machado A, Distl O, Felicetti M, Fox-Clipsham L, Graves KT, Guérin G, Haase B, Hasegawa T, Hemmann K, Hill EW, Leeb T, Lindgren G, Lohi H, Lopes MS, McGivney BA, Mikko S, Orr N, Penedo MCT, Piercy RJ, Raekallio M, Rieder S, Røed KH, Silvestrelli M, Swinburne J, Tozaki T, Vaudin M, Wade MC, McCue, ME, (2013) Genetic diversity in the modern horse illustrated from genome-wide SNP data. PLoS ONE. https://doi.org/10.1371/journal.pone.0054997
Petersen JL, Mickelson JR, Rendahl AK, Valberg SJ, Andersson LS, Axelsson J, Bailey E, Bannasch D, Binns MM, Borges AS, Brama P, da Câmara Machado A, Capomaccio S, Cappelli K, Cothran EG, Distl O, Fox-Clipsham L, Graves KT, Guérin G, Haase B, Hasegawa T, Hemmann K, Hill EW, Leeb T, Lindgren G, Lohi H, Lopes MS, McGivney BA, Mikko S, Orr N, Penedo MCT, Piercy RJ, Raekallio M, Rieder S, Røed KH, Swinburne J, Tozaki T, Vaudin M, Wade CM, McCue ME (2013) Genome-wide analysis reveals selection for important traits in domestic horse breeds. PLoS Genet. https://doi.org/10.1371/journal.pgen.1003211
Pluta M, Bańka K, Cieśla A, Rogala Ł (2021) The state of breeding and use of Caspian horses in Europe and around the world. Acta Sci Pol Zootech 19:79–84. https://doi.org/10.21005/asp.2020.19.3.10
Rausch T, Zichner T, Schlattl A, Stütz AM, Benes V, Korbel JO (2012) DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. https://doi.org/10.1093/BIOINFORMATICS/BTS378
Rørvang MV, Nielsen BL, McLean AN (2020) Sensory abilities of horses and their importance for equitation science. Front Vet Sci 7:1–17. https://doi.org/10.3389/fvets.2020.00633
Sadeghi R, Moradi-Shahrbabak M, Ashtiani SRM, Schlamp F, Cosgrove EJ, Antczak DF (2019) Genetic diversity of Persian arabian horses and their relationship to other native iranian horse breeds. J Hered 110:173–182. https://doi.org/10.1093/jhered/esy061
Salek Ardestani S, Zandi MB, Vahedi SM, Janssens S (2022) Population structure and genomic footprints of selection in five major Iranian horse breeds. Anim Genet 53:627–639. https://doi.org/10.1111/AGE.13243
Seyedabadi HR, Amirinia S, Bana BMH, Emrani H (2006) Parentage verification of Iranian Caspian horse using microsatellites markers. Iran J Biotechnol 4:260–264
Shasavarani H, Rahimi-Mianji G (2010) Analysis of genetic diversity and estimation of inbreeding coefficient within Caspian horse population using microsatellite markers. African J Biotechnol 9:293–299
Talebi R, Ahmadi A, Afraz F (2018) Analysis of protein-protein interaction network based on transcriptome profiling of ovine granulosa cells identifies candidate genes in cyclic recruitment of ovarian follicles. J Anim Sci Technol. https://doi.org/10.1186/s40781-018-0171-y
Talebi R, Ahmadi A, Afraz F (2018) Construction of protein–protein interaction network based on transcriptome profiling of ovine granulosa cells during the sheep’s anestrus phase. Front Biol (Beijing). https://doi.org/10.1007/s11515-018-1499-x
Talebi R, Ahmadi A, Afraz F, Sarry J, Plisson-Petit F, Genêt C, Fabre S (2018) Transcriptome analysis of ovine granulosa cells reveals differences between small antral follicles collected during the follicular and luteal phases. Theriogenology. https://doi.org/10.1016/j.theriogenology.2017.11.027
Talebi R, Szmatoła T, Mészáros G, Qanbari S (2020) Runs of homozygosity in modern chicken revealed by sequence data. G3 Genes Genomes Genet 10:4615–4623. https://doi.org/10.1534/g3.120.401860
Talebi R, Seighalani R, Qanbari S (2021) A handmade DNA extraction kit using laundry powder; insights on simplicity, cost-efficiency, rapidity, safety and the quality of purified DNA. Anim Biotechnol 32:388–394. https://doi.org/10.1080/10495398.2019.1684933
Wang K, Li M, Hadley D, Liu R, Glessner J, Grant SFA, Hakonarson H, Bucan M (2007) PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res 17:1665–1674. https://doi.org/10.1101/GR.6861907
Ye K, Schulz MH, Long Q, Apweiler R, Ning Z (2009) Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads. Bioinformatics 25:2865–2871. https://doi.org/10.1093/BIOINFORMATICS/BTP394
Zandi MB, Salek Ardestani S, Vahedi SM, Mahboudi H, Mahboudi F, Meskoob A (2022) Detection of common copy number of variants underlying selection pressure in middle Eastern horse breeds using whole-genome sequence data. J Hered 113:421–430. https://doi.org/10.1093/JHERED/ESAC027
Zhang Daoqin, Xie D, Lin X, Ma L, Chen J, Zhang Daoqi, Wang Y, Duo S, Feng Y, Zheng C, Jiang B, Ning Y, Han C (2018) The transcription factor SOX30 is a key regulator of mouse spermiogenesis. Development. https://doi.org/10.1242/dev.164723
Acknowledgements
The authors would like to acknowledge the initial support provided by Dr. Hamid Kohram, former professor at the Department of Animal Science, College of Agriculture and Natural Resources, University of Tehran, Iran. In February 2020, Dr. Kohram passed away due to complications from COVID-19.
Funding
This work was supported by a PhD grant from “College of Agriculture and Natural Resources, University of Tehran, Iran” and “Agricultural Biotechnology Research Institute, Iran”.
Author information
Authors and Affiliations
Contributions
BA, MZ and GHS conceived and designed the study. BA and MZ performed the bioinformatics and analyzed the data. BA, MZ, RT, MRG, and MFV wrote the manuscript. MZ, RT, MM, MRG, MFV, MKN, and TS edited the manuscript. All authors approved the manuscript before submission.
Corresponding authors
Ethics declarations
Competing interest
Authors declare that they have no conflicts of interest and financial or personal relationships that could inappropriately influence or bias the article's content.
Ethical approval
All activities and procedures involving the animals were approved by the Animal Care and Use committee and was done according to the local guidelines in the Agricultural Biotechnology Research Institute of Iran (ABRII).
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Arefnejad, B., Zeinalabedini, M., Talebi, R. et al. Unveiling the population genetic structure of Iranian horses breeds by whole-genome resequencing analysis. Mamm Genome 35, 201–227 (2024). https://doi.org/10.1007/s00335-024-10035-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00335-024-10035-6