Skip to main content

Advertisement

Log in

Breeding without breeding: minimum fingerprinting effort with respect to the effective population size

  • Original Paper
  • Published:
Tree Genetics & Genomes Aims and scope Submit manuscript

Abstract

We present a probabilistic model to minimize the fingerprinting effort associated with the implementation of the “breeding without breeding” scheme under partial pedigree reconstruction. Our approach is directed at achieving a declared target population’s minimum effective population size (N e ), following the pedigree reconstruction and genotypic selection and is based on the graph theory algorithm. The primary advantage of the proposed method is to reduce the cost associated with fingerprinting before the implementation of the pedigree reconstruction for seed parent–offspring derived from breeding arboreta and production or natural populations. Stochastic simulation was conducted to test the method’s efficiency assuming a simple polygenic model and a single trait. Hypothetical population consisted of 30 parental trees that were paired at random (selfing excluded), resulting in 600 individuals (potential candidates for forwards selection). The male parentage was assumed initially unknown. The model was used to estimate the minimum genotyping sample size needed to reaching the prescribed N e . Results were compared with the known pedigree data. The model was successful in revealing the true relationship pattern over the whole range of N e . Two to three offspring entered genotyping to meet the N e  = 2 while 41 to 43 were required to satisfy the N e  = 14. Importantly, genetic gain was affected at the lower limits of the genotyping effort. Doubling the number of parents resulted in considerable reduction of the genotyping effort at higher N e values.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  • Agnarsson G, Greenlaw R (2008) Graph theory: modeling, applications, and algorithms. Prentice-Hall, Englewood Cliffs

    Google Scholar 

  • Botstein D, White RL, Skolnick M, Davis RW (1980) Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet 32:314–331

    PubMed  CAS  Google Scholar 

  • Edwardes MD deB (1998) The evaluation of confidence sets with application to binomial intervals. Stat Sinica 8:393–409

    Google Scholar 

  • El-Kassaby YA, Rudin D, Yazdani R (1989) Levels of outcrossing and contamination in two Scots pine seed orchards. Scand J Forest Res 4:41–49

    Article  Google Scholar 

  • El-Kassaby YA, Lstibůrek M (2009) Breeding without breeding. Genet Res 91:111–120

    Article  Google Scholar 

  • Friedman ST, Adams WT (1985) Estimation of gene flow into seed orchards of loblolly pine (Pinus taeda L.). Theor Appl Genet 69:609–615

    Article  Google Scholar 

  • Funda T, Liewlaksaneeyanawin C, Fundova I, Lai BSK, Walsh C, Niejenhuis AV, Cook C, Graham H, Woods J, El-Kassaby YA (2011) Congruence between parental reproductive investment and success determined by DNA-based pedigree reconstruction in conifer seed orchards. Can J For Res 41:380–389

    Article  CAS  Google Scholar 

  • Grattapaglia D, Ribeiro VJ, Resende GDSP (2004) Retrospective selection of elite parent trees using paternity testing with microsatellite markers: an alternative short term breeding tactic for Eucalyptus. Theor Appl Genet 109:192–199

    Article  PubMed  CAS  Google Scholar 

  • Jones AG, Ardren WR (2003) Methods of parentage analysis in natural populations. Mol Ecol 12:2511–2523

    Article  PubMed  CAS  Google Scholar 

  • Lambeth C, Lee B-C, O’Malley D, Wheeler N (2001) Polymix breeding with parental analysis of progeny: an alternative to full-sib breeding and testing. Theor Appl Genet 103:930–943

    Article  Google Scholar 

  • Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits. Sinauer Associates, Sunderland

    Google Scholar 

  • Magun J (1998) Greedy matching algorithms: an experimental study. ACM J Exp Algorithmics 3:22–28

    Google Scholar 

  • Massah N, Wang J, Russell JH, Niejenhuis AV, El-Kassaby YA (2009) Genealogical relationship among members of selection and production populations of Yellow Cedar (Callitropsis nootkatensis [D.Don] Oerst.) in the absence of parental information. J Hered 101:154–163

    Article  PubMed  Google Scholar 

  • Schoen DJ, Stewart SC (1986) Variation in male reproductive investment and male reproductive success in white spruce. Evolution 40:1109–1120

    Article  Google Scholar 

  • Schoen DJ, Stewart SC (1987) Variation in male fertilities and pairwise mating probabilities in Picea glauca. Genetics 116:141–152

    PubMed  CAS  Google Scholar 

Download references

Acknowledgements

We are grateful to Rowland Burdon and two anonymous reviewers for their critical review and many helpful comments on this article. The access to the MetaCentrum supercomputing facilities provided under the research intent MSM6383917201 is highly acknowledged. Support from the Czech Science Foundation (GAČR; grant 521/07/P337; M. Lstibůrek) and the National Agency for Agricultural Research (NAZV; grant QH81172; M. Lstibůrek) and (NAZV; grant QH81160; Jaroslav Kobliha) and the Natural Sciences and Engineering Research Council of Canada (Discovery and IRC Grants) and the Johnson’s Family Forest Biotechnology Endowment to Y. A. El-Kassaby are highly appreciated.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Milan Lstibůrek.

Additional information

Communicated by R. Burdon

Appendix

Appendix

ad(iii):

Probabilities are estimated by the Monte Carlo method:

Algorithm M

Let the number of all outcomes be denoted as A and that of all successful outcomes as S.

(i M ):

Set \(A\leftarrow 0\), \(S \leftarrow 0\).

(ii M ):

\(A \leftarrow A + 1\)

Assign randomly male parents to offspring, so that i th offspring is sired by a male parent y with a probability p i,y.

(iii M ):

Find the biggest subset of offspring M with all parents different. {algorithm MinRedl2}

If \(\vert M \vert \geqq N\), increase \(S \leftarrow S + 1\).

(iv M ):

If \(A \geqq o_{\max}\) or

if P is outside the confidence interval logit(S, A, α), {we reject H 0 : p = P} return result \(\hat{p}= S/A\), else go to (ii M ).

ad(iii M ):

The problem of finding the largest subset of offspring with different parents was converted to the problem of finding a maximum matching in a general graph, a well-developed subject in graph theory (Agnarsson and Greenlaw 2008).

As a graph, we define a pair (V, E) containing a set of vertices V and edges E connecting pairs of vertices. In our settings, the vertices are declared as parents while the edges are respective offspring. A pairing M is a subset of E where no two edges share common vertex. Vertices sharing an edge with a vertex V will be denoted as its neighbors.

For finding maximum matchings in graphs, we utilized the fast approximation algorithm MinRed12 (MR) by Magun (1998).

(i MR ):

\(\vert M \vert \leftarrow 0\)

(ii MR ):

Remove all vertices without edges and all duplicate edges; consider vertex v with the lowest number of neighbors Δ v .

Note: This vertex is always removed. Along with the vertex, potential neighbor is removed as well (this is determined by the number of neighbors):

(iii MR ):

If Δ v  = 1, remove v and its neighbor w,

if Δ v  = 2, remove v and merge its neighbors w 1 and w 2, else remove w with the lowest number of neighbors Δ w .

Note: While the first case (Δ v  = 1) is obvious, the second case (Δ v  = 2) is questionable (which one should be removed). As one vertex must always be removed, the two are simply merged. We are not interested at the exact form of such a pairing. It is sufficient to declare the number of paired vertices. If Δ v  > 2, a heuristic approach is used (this is the only approximation step in the algorithm). Let us choose a particular neighbor w out of all potential neighbors v so that it has the fewest number of neighbors. This one is removed. The general idea is that vertices with the lowest number of neighbors interfere the least to the pairing among other vertices. Details are provided in Magun (1998).

(iv MR ):

\(\vert M \vert \leftarrow \vert M \vert + 1\) {Add edge vw to pairing}

If \(\vert M \vert \geqq N \), finish.

If there are any more edges in the graph, go to (ii MR ).

Note: We are testing a null hypothesis that \(\hat{p} = P\) against the alternative that \(\hat{p}\ne P\). If H 0 is rejected, we can stop testing the current set. As a result, an outcome of this algorithm must always satisfy this testing, irrespective of its exact score value.

For hypothesis testing, we will use the logit interval, based on the approximation of binomial distribution of logit functions. Such a function is used in the next step with the exception that the exact binomial interval is used for X = 0, 1,Z − 1,Z. If \(\hat{p}\) is inside this interval, we are not rejecting the hypothesis.

ad(iv M ):

To test, whether the probability is estimated accurately, an interval logit(X, Z, α) is used in the algorithm, where X = number of successful tests, Z = number of all tests, and α is the probability of the type I error. The value c = − 0.5 is taken from Edwardes (1998).

Set

$$\begin{array}{rll} x&=&X-c,\quad z=Z-2c, \\ \varphi &=& \text{exp} \bigg( \Phi (1-\alpha / 2) \sqrt{\frac{z}{x(z-x)}}\bigg), \end{array}$$

where Φ is the cumulative distribution function for the normal distribution. Then

$$\begin{array}{rll} &&\text{logit}(X,Z,\alpha) \\&& = \left\{ \begin{array}{rl} \Big\langle 0,1-\sqrt[Z]{\alpha/2} \Big\rangle & \text{if } X=0 \\ \Big\langle 1-\sqrt[Z]{1-\alpha}, \displaystyle\frac{x}{x+(z-x)/\varphi} \Big\rangle & \text{if } X=1 \\ \Big\langle \displaystyle\frac{x}{x+(z-x)\varphi},\sqrt[Z]{1-\alpha} \Big\rangle & \text{if } X\!=\!Z\!-\!1 \\ \Big\langle \sqrt[Z]{\alpha/2},1 \Big\rangle & \text{if } X=Z \\ \Big\langle \displaystyle\frac{x}{x+(z-x)\varphi},\displaystyle\frac{x}{x+(z-x)/\varphi} \Big\rangle & \text{else.} \end{array} \right. \end{array} $$
(9)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lstibůrek, M., Ivanková, K., Kadlec, J. et al. Breeding without breeding: minimum fingerprinting effort with respect to the effective population size. Tree Genetics & Genomes 7, 1069–1078 (2011). https://doi.org/10.1007/s11295-011-0395-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11295-011-0395-1

Keywords

Navigation