Skip to main content
Log in

Family-based association mapping in crop species

  • Review
  • Published:
Theoretical and Applied Genetics Aims and scope Submit manuscript

Abstract

Identification of allelic variants associated with complex traits provides molecular genetic information associated with variability upon which both artificial and natural selections are based. Family-based association mapping (FBAM) takes advantage of linkage disequilibrium among segregating progeny within crosses and among parents to provide greater power than association mapping and greater resolution than linkage mapping. Herein, we discuss the potential adaption of human family-based association tests and quantitative transmission disequilibrium tests for use in crop species. The rapid technological advancement of next generation sequencing will enable sequencing of all parents in a planned crossing design, with subsequent imputation of genotypes for all segregating progeny. These technical advancements are easily adapted to mating designs routinely used by plant breeders. Thus, FBAM has the potential to be widely adopted for discovering alleles, common and rare, underlying complex traits in crop species.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  • Abecasis GR, Cardon LR, Cookson WOC (2000) A general test of association for quantitative traits in nuclear families. Am J Hum Genet 66:279–292

    Article  PubMed  CAS  Google Scholar 

  • Allison DB, Heo M, Kaplan N, Martin ER (1999) Sibling based tests of linkage and association for quantitative traits. Am J Hum Genet 64:1754–1764

    Article  PubMed  CAS  Google Scholar 

  • Anderson L, Georges M (2004) Domestic-animal genomics: deciphering the genetics of complex traits. Nat Rev Genetics 5:202–212

    Article  Google Scholar 

  • Bernardo R (2009) Should maize doubled haploids be induced among F1 or F2 plants. Theor Appl Genet 119:255–262

    Article  PubMed  CAS  Google Scholar 

  • Bernardo R (2010) Breeding for quantitative traits in plants. Stemma Press, Minnesota

    Google Scholar 

  • Blanc G, Charcosset A, Mangin B, Gallais A, Moreau L (2006) Connected populations for detecting quantitative trait loci and testing for epistasis: an application in maize. Theor Appl Genet 113:206–224

    Article  PubMed  CAS  Google Scholar 

  • Bordes J, Dumas de Vaul R, Lapierre A, Pollacsek M (1997) Haplodiploidization of maize (Zea mays L.) through induced gynogenesis assisted by glossy markers and its use in breeding. Agronomie 17:291–297

    Article  Google Scholar 

  • Buckler E, Gore M (2007) An Arabidopsis haplotype map takes root. Nat Genet 39:1056–1057

    Article  PubMed  CAS  Google Scholar 

  • Buckler ES, Holland JB, Bradbury PJ, Acharya CB, Brown PJ, Browne C et al (2009) The genetic architecture of maize flowering time. Science 325:714–718

    Article  PubMed  CAS  Google Scholar 

  • Burdick JT, Chen W, Abecasis GR, Cheung VG (2006) In silico method for inferring genotypes in pedigrees. Nat Genet 38:1002–1004

    Article  PubMed  CAS  Google Scholar 

  • Burgueno J, Campos G, Weigel K, Crossa J (2012) Genomic prediction of breeding values when modeling genotype x environment interaction using pedigree and dense molecular markers. Crop Sci 52:707–719

    Article  Google Scholar 

  • Chase SS (1951) Production of homozygous diploids of maize from monoploids. Agron J 44:263–267

    Article  Google Scholar 

  • Churchill GA et al (2004) The collaborative cross: a community resource for the genetic analysis of complex traits. Nat Genet 36d:1133–1137

    Article  Google Scholar 

  • Cirulli ET, Goldstein DB (2010) Uncovering the roles of rare variants in common disease through whole genome sequencing. Nat Rev Genet 11:415–425

    Article  PubMed  CAS  Google Scholar 

  • Darvasi A, Weinreb A, Minke V, Weller JI, Soller M (1993) Detecting marker-QTL linkage and estimating QTL gene effect and map location using a saturated genetic map. Genetics 134:943–951

    PubMed  CAS  Google Scholar 

  • Dickson SP, Wang K, Krantz I, Hakonarson H, Goldstein DB (2010) Rare variants create synthetic genome wide associations. PLoS Biol 6:e1000294

    Article  Google Scholar 

  • Farnir F, Grisart B, Coppieters W, Riquet J, Berzi P, Cambisano N, Karim L et al (2002) Simultaneous mining of linkage and linkage disequilibrium to fine map quantitative trait loci in outbred hal-sib pedigrees: revisiting the location of a quantitative trait locus with major effect on milk production on bovine chromosome 14. Genetics 161:275–287

    PubMed  CAS  Google Scholar 

  • Fulker DW, Cherny SS, Sham PC, Hewitt JK (1999) Combined linkage and association sib-pair analysis for quantitative traits. Am J Hum Genet 64:259–267

    Article  PubMed  CAS  Google Scholar 

  • Gale MD, Youssefian S (1985) Dwarfing genes in wheat. In: Russell GE (ed) Progress in plant breeding 1. Butterworths, London, pp 1–35

    Google Scholar 

  • George AW, Visscher PM, Haley CS (2000) Mapping quantitative trait loci in complex pedigree: a two- step variance component approach. Genetics 156:2081–2092

    PubMed  CAS  Google Scholar 

  • Guo B, Beavis WD (2011) In silico genotyping of the maize nested association mapping population. Mol Breed 27:107–113

    Article  PubMed  Google Scholar 

  • Guo B, Sleper DA, Sun J, Nguyen HT, Arelli PR, Shannon JG (2006) Pooled analysis of data from multiple quantitative trait locus mapping populations. Theor Appl Genet 113:39–48

    Article  PubMed  CAS  Google Scholar 

  • Guo B, Sleper DA, Beavis WD (2010) Nested association mapping for identification of functional markers. Genetics 186:373–383

    Article  PubMed  CAS  Google Scholar 

  • Haley CS, Knott SA (1992) A simple regression method for mapping quantitative trait loci in line crosses using flanking markers. Heredity 89:315–324

    Article  Google Scholar 

  • Hirschhorn JN, Daly MJ (2005) Genome-wide association studies for common diseases and complex traits. Nat Rev Genet 6:95–108

    Article  PubMed  CAS  Google Scholar 

  • Horvath S, Xu X, Lake SL, Silverman EK, Weiss ST, Laird NM (2004) Family based tests for associating haplotypes with general phenotype data: application to asthma genetics. Genet Epidemiol 26:61–69

    Article  PubMed  Google Scholar 

  • Huang X, Wei X, Sang T, Zhao Q, Feng Q et al (2010) Genome-wide association studies of 14 agronomic traits in rice landraces. Nat Genet 42:961–967

    Article  PubMed  CAS  Google Scholar 

  • International Hapmap Consortium (2003) The international HapMap project. Nature 426:789–796

    Article  Google Scholar 

  • Jansen RC, Jannink JL, Beavis WD (2003) Mapping quantitative trait loci in plant breeding populations: use of parental haplotype sharing. Crop Sci 43:829–834

    Article  CAS  Google Scholar 

  • Kiezun A, Garimella K, Do R, Stitziel NO, Neale BM et al (2012) Exome sequencing and the genetic basis of complex traits. Nat Genet 44:623–630

    Article  PubMed  CAS  Google Scholar 

  • Kingsmore SF, Lindquist IE, Mudge J, Gesler DD, Beavis WD (2008) Genome-wide association studies: progress and potential for drug discovery and development. Nat Rev Drug Discov 7:221–230

    Article  PubMed  CAS  Google Scholar 

  • Kump KL, Bradbury PJ, Buckler ES, Belcher AR, Oropeza-Rosas M et al (2011) Genome-wide association study of quantitative resistance to southern leaf blight in the maize nested association mapping population. Nat Genet 43:163–168

    Article  PubMed  CAS  Google Scholar 

  • Laird NM, Lange C (2006) Family based designs in the age of large scale gene association studies. Nat Rev Genetics 7:385–394

    Article  CAS  Google Scholar 

  • Lam HM, Xu X, Liu X, Chen W, Yang G, Wong FL, Li MW, Qin N, Wang B, Li J, Jian M, Wang J, Shao G, Wang J, Sun SM, Zhang G (2010) Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nat Genet 42:1053–1059

    Article  PubMed  CAS  Google Scholar 

  • Lander ES, Bostein (1989) Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185–199

    PubMed  CAS  Google Scholar 

  • Lange C, DeMeo DL, Laird NM (2002) Power and design considerations for a general class of family based association tests: quantitative traits. Am J Hum Genet 71:1330–1341

    Article  PubMed  CAS  Google Scholar 

  • Lee SH, Werf JHJ (2004) The efficiency of designs for fine mapping of quantitative trait loci using combined linkage disequilibrium and linkage. Genet Sel Evol 36:145–161

    Article  PubMed  CAS  Google Scholar 

  • Li R, Lyons MA, Wittenburg H, Paigen B, Churchill GA (2005) Combining data from multiple inbred line crosses improves the power and resolution of quantitative trait loci mapping. Genetics 169:1699–1709

    Article  PubMed  CAS  Google Scholar 

  • Lund MS, Sorensen P, Guldbrandtsen B, Sorensen DA (2003) Multi fine mapping of quantitative trait loci using combined linkage disequilibria and linkage analysis. Genetics 163:405–410

    PubMed  CAS  Google Scholar 

  • Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits. Sinauer Associates, Inc, Sunderland

    Google Scholar 

  • Martin ER, Monks SA, Warren LL, Kaplan NL (2000) A test for linkage and association in general pedigrees: the pedigree disequilibrium test. Am J Hum Genet 67:146–154

    Article  PubMed  CAS  Google Scholar 

  • Metzker M (2010) Sequencing technologies-the next generation. Nat Rev Genetics 11:31–46

    Article  CAS  Google Scholar 

  • Meuwissen THE, Karlsen A, Lien S, Olsaker I, Goddard ME (2002) Fine mapping of a quantitative trait locus for twinning rate using combined linkage and linkage disequilibrium mapping. Genetics 161:373–379

    PubMed  CAS  Google Scholar 

  • Monks SA, Kaplan NL (2000) Removing the sampling restrictions from family based tests of association for a quantitative trait locus. Am J Hum Genet 66:576–592

    Article  PubMed  CAS  Google Scholar 

  • Morrell PL, Buckler ES, Ross-Ibarra J (2012) Crop genomics: advances and applications. Nat Rev Genetics 13:85–96

    CAS  Google Scholar 

  • Rabinowitz D, Laird N (2000) A unified approach to adjusting association tests for population admixture with arbitrary pedigree structure and arbitrary missing marker information. Hum Hered 50:211–223

    Article  PubMed  CAS  Google Scholar 

  • Rival MA, Beaudoin M, Gardet A, Stevens C, Sharma Y et al (2011) Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease. Nat Genet 43:1066–1073

    Article  Google Scholar 

  • SAS Institute (2004) SAS/STAT 9.1 user’s guide. SAS institute, Cary, NC

  • Snape JW (1988) The detection and estimation of linkage using doubled haploid or single seed descent populations. Theor Appl Genet 76:125–128

    Article  Google Scholar 

  • So Y, Edwards J (2011) Predictive ability assessment of linear mixed models in multi-environment trials in corn. Crop Sci 51:542–552

    Article  Google Scholar 

  • Stanklewicz P, Lupski JR (2010) Structural variation in the human genome and its role in disease. Annu Rev Med 61:437–455

    Article  Google Scholar 

  • Steen KV, McQueen MB, Herbert A, Raby B, Lyon H et al (2005) Genomic screening and replication using the same data set in family based association testing. Nat Genet 37:683–691

    Article  PubMed  Google Scholar 

  • Stich B, Melchinger AE, Piepho H, Heckenberger M, Manurer HP, Reif JC (2006) A new test for family based association mapping with inbred lines from plant breeding programs. Theor Appl Genet 113:1121–1130

    Article  PubMed  Google Scholar 

  • Tian F, Bradbury PJ, Brown PJ, Sun Q, Flint-Garcia S et al (2011) Genome-wide association study of maize identifies genes affecting leaf architecture. Nat Genet 43:159–162

    Article  PubMed  CAS  Google Scholar 

  • Valdar W, Holmes CC, Mott R, Flint J (2009) Mapping in structured populations by resample model averaging. Genetics 182:1263–1277

    Article  PubMed  Google Scholar 

  • VSN international (2010) ASReml 3. VSSN international Ltd., Hemel Hempstead

    Google Scholar 

  • Xu S (1998) Mapping quantitative trait loci using multiple families of lines crosses. Genetics 148:517–524

    PubMed  CAS  Google Scholar 

  • Yan J, Kandianis CB, Harjes CE, Bai L, Kim EH et al (2010) Rare genetic variation at Zea mays crtRB1 increases β-carotene in maize grain. Nat Genet 42:322–327

    Article  PubMed  CAS  Google Scholar 

  • Yu J, Pressoir G, Briggs WH, Bi IV, Yamasaki M et al (2006) A unified mixed model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 38:203–208

    Article  PubMed  CAS  Google Scholar 

  • Yu J, Holland JB, McMullen MD, Buckler ES (2008) Genetic design and statistical power of nested association mapping in maize. Genetics 178:539–551

    Article  PubMed  Google Scholar 

Download references

Acknowledgments

The authors thank Drs. Gilles Gay and Robert Bensen, Syngenta biotechnology, Inc, for critical reading of the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Baohong Guo or William D. Beavis.

Additional information

Communicated by R. Varshney.

Appendix: Conditional probabilities of QTL genotypes given flanking markers in DHF2 populations

Appendix: Conditional probabilities of QTL genotypes given flanking markers in DHF2 populations

In maize, DHF2 can be produced through inducing haploids from F2 plants using inducer lines and doubling them using chemical colchicine (Chase 1951; Bordes et al. 1997; Bernardo 2009). Consider three linked loci A, Q and B, which are fixed with AAQQBB and aaqqbb for two parental lines, respectively. r 1 is recombination rate between A and Q loci, r 2 between Q and B loci and r between A and B loci. Due to the high density of markers we assume that probability of double-cross over events per meiosis is zero. Therefore, r = r 1 + r 2.

Production of DHF2 involves three processes. Firstly, F1 plants (AQB//aqb) produce a total of six gametes: (1) No recombination event. AQB and aqb, with a frequency of (1 − r)/2, respectively. (2) A single recombination event between A and Q. aQB and Aqb, with a frequency of r 1/2, respectively. (3) A single recombination between Q and B. AQb and aqB, with a frequency of r 2/2, respectively. Secondly, female and male gametes are randomly mated to produce a total of 21 F2 phased genotypes: (1) No recombination event. AQB//AQB, aqb//aqb and AQB//aqb. (2) A recombination event in one gamete between A and Q. aQB//AQB, aQB//aqb, Aqb//AQB and Aqb//aqb. (3) A recombination event in one gamete between Q and B. AQb//AQB, AQb//aqb, aqB//AQB and aqB//aqb. (4) Recombination events in both gametes between A and Q. aQB//aQB, Aqb//Aqb and aQB//Aqb. (5) Recombination events in both gametes between Q and B. AQb//AQb, aqB//aqB, AQb//aqB. (6) Recombination events in both gametes, one occurs between A and Q, but the other occurs between Q and B. AQb//aQB, aqB//aQB, AQb//Aqb and aqB//Aqb. The frequency of one phased genotype is two times the product of two F1 gamete frequencies if the two gametes have different genotypes and the product of two F1 gamete frequencies if the two gametes have the same genotype. For example, AQB//AQB is (1 − r)2/4 and aQB//AQB is r 1(1 − r)/2. F2 plants produce haploids which are equivalent to gametes. For each phased F2 genotype, the frequencies of gametes are produced as with F1 described above and then be multiplied by its F2 genotype frequency to obtain the frequencies of corresponding double haplotype frequencies. For example, an F2 genotype of aQB//Aqb, which has a frequency of (r 1)2/2, produces haploids: aQB with a frequency of (1 − r)/2, Aqb with a frequency of (1 − r)/2, AQB with a frequency of r 1/2, aqb with a frequency of r 1/2, aQb with a frequency of r 2/2 and AqB with a frequency of r 2/2. The frequencies of haploids aQB, Aqb, AQB, aqb, aQb and Aqb are r 21 (1 − r)/4, r 21 (1 − r)/4, r 31 /4, r 31 /4, r 32 /4, and r 32 /4, respectively. A total of eight double haploid genotypes are produced from F2 plants: AAQQBB, aaqqbb, AAqqBB, aaQQbb, aaQQBB, AAqqbb, aaqqBB and AAQQbb. Their frequencies are as follows:

$$\begin{aligned} & P({\text{AAQQBB}}) = P({\text{aaqqbb}}) = p_{1} (2 - r) + p_{2} (1 - r_{2} /2) + p_{3} (1 - r_{1} /2) + p_{4} r_{1} + p_{5} r_{2} + 0.5p_{6} r \\ & P({\text{AAqqBB}}) = P({\text{aaQQbb}}) = 0.5p_{2} r_{2} + 0.5p_{3} r_{1} + p_{4} r_{2} + p_{5} r_{1} + 0.5p_{6} r \\ & P({\text{aaQQBB}}) = P({\text{AAqqbb}}) = p_{1} r_{1} + p_{2} (1 - r_{2} /2) + 0.5p_{3} r_{1} + p_{4} (2 - r) + p_{6} (1 - r/2) \\ & P({\text{aaqqBB}}) = P({\text{AAQQbb}}) = p_{1} r_{2} + 0.5p_{2} r_{2} + p_{3} (1 - r_{1} /2) + p_{5} (2 - r) + p_{6} (1 - r/2) \\ & \quad {\text{where }}p_{1} = (1 - r)^{2} /4,p_{2} = \, r_{1} (1 - r)/2,p_{3} = r_{2} (1 - r)/2,p_{4} = r_{1}^{2} /4,p_{5} = r_{2}^{2} /4{\text{ and }}p_{6} = r_{1} r_{2} /2. \\ \end{aligned}$$

On the ends of chromosomes a single marker could be used to impute the QTL genotypes. Assume that Q locus is linked with marker locus with recombination rate of r between them. The frequencies of genotypes in DHF2 are:

$$\begin{aligned} & P({\text{AAQQ}}) = P({\text{aaqq}}) = p_{1} (2 - r) + p_{2} + p_{3} r \\ & P({\text{AAqq}}) = P({\text{aaQQ}}) = p_{1} r + p_{2} + p_{3} (2 - r) \\ & \quad {\text{where }}p_{1} = (1 - r)^{2} /4,p_{2} = r(1 - r)/2{\text{ and }}p_{3} = r^{2} /4 \\ \end{aligned}$$

Rights and permissions

Reprints and permissions

About this article

Cite this article

Guo, B., Wang, D., Guo, Z. et al. Family-based association mapping in crop species. Theor Appl Genet 126, 1419–1430 (2013). https://doi.org/10.1007/s00122-013-2100-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00122-013-2100-2

Keywords

Navigation