Breeding without breeding: minimum fingerprinting effort with respect to the effective population size

Lstibůrek, Milan; Ivanková, Kristýna; Kadlec, Jan; Kobliha, Jaroslav; Klápště, Jaroslav; El-Kassaby, Yousry A.

doi:10.1007/s11295-011-0395-1

Breeding without breeding: minimum fingerprinting effort with respect to the effective population size

Original Paper
Published: 11 June 2011

Volume 7, pages 1069–1078, (2011)
Cite this article

Tree Genetics & Genomes Aims and scope Submit manuscript

Milan Lstibůrek¹,
Kristýna Ivanková^1,2,
Jan Kadlec³,
Jaroslav Kobliha¹,
Jaroslav Klápště^1,4 &
…
Yousry A. El-Kassaby⁴

287 Accesses
11 Citations
Explore all metrics

Abstract

We present a probabilistic model to minimize the fingerprinting effort associated with the implementation of the “breeding without breeding” scheme under partial pedigree reconstruction. Our approach is directed at achieving a declared target population’s minimum effective population size (N _e), following the pedigree reconstruction and genotypic selection and is based on the graph theory algorithm. The primary advantage of the proposed method is to reduce the cost associated with fingerprinting before the implementation of the pedigree reconstruction for seed parent–offspring derived from breeding arboreta and production or natural populations. Stochastic simulation was conducted to test the method’s efficiency assuming a simple polygenic model and a single trait. Hypothetical population consisted of 30 parental trees that were paired at random (selfing excluded), resulting in 600 individuals (potential candidates for forwards selection). The male parentage was assumed initially unknown. The model was used to estimate the minimum genotyping sample size needed to reaching the prescribed N _e. Results were compared with the known pedigree data. The model was successful in revealing the true relationship pattern over the whole range of N _e. Two to three offspring entered genotyping to meet the N _e = 2 while 41 to 43 were required to satisfy the N _e = 14. Importantly, genetic gain was affected at the lower limits of the genotyping effort. Doubling the number of parents resulted in considerable reduction of the genotyping effort at higher N _e values.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Mapping and functional characterization of structural variation in 1060 pig genomes

Article Open access 07 May 2024

Isolation, small population size, and management influence inbreeding and reduced genetic variation in K’gari dingoes

Article Open access 19 April 2024

Overview of Statistical Methods for Genome-Wide Association Studies (GWAS)

References

Agnarsson G, Greenlaw R (2008) Graph theory: modeling, applications, and algorithms. Prentice-Hall, Englewood Cliffs
Google Scholar
Botstein D, White RL, Skolnick M, Davis RW (1980) Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet 32:314–331
PubMed CAS Google Scholar
Edwardes MD deB (1998) The evaluation of confidence sets with application to binomial intervals. Stat Sinica 8:393–409
Google Scholar
El-Kassaby YA, Rudin D, Yazdani R (1989) Levels of outcrossing and contamination in two Scots pine seed orchards. Scand J Forest Res 4:41–49
Article Google Scholar
El-Kassaby YA, Lstibůrek M (2009) Breeding without breeding. Genet Res 91:111–120
Article Google Scholar
Friedman ST, Adams WT (1985) Estimation of gene flow into seed orchards of loblolly pine (Pinus taeda L.). Theor Appl Genet 69:609–615
Article Google Scholar
Funda T, Liewlaksaneeyanawin C, Fundova I, Lai BSK, Walsh C, Niejenhuis AV, Cook C, Graham H, Woods J, El-Kassaby YA (2011) Congruence between parental reproductive investment and success determined by DNA-based pedigree reconstruction in conifer seed orchards. Can J For Res 41:380–389
Article CAS Google Scholar
Grattapaglia D, Ribeiro VJ, Resende GDSP (2004) Retrospective selection of elite parent trees using paternity testing with microsatellite markers: an alternative short term breeding tactic for Eucalyptus. Theor Appl Genet 109:192–199
Article PubMed CAS Google Scholar
Jones AG, Ardren WR (2003) Methods of parentage analysis in natural populations. Mol Ecol 12:2511–2523
Article PubMed CAS Google Scholar
Lambeth C, Lee B-C, O’Malley D, Wheeler N (2001) Polymix breeding with parental analysis of progeny: an alternative to full-sib breeding and testing. Theor Appl Genet 103:930–943
Article Google Scholar
Lynch M, Walsh B (1998) Genetics and analysis of quantitative traits. Sinauer Associates, Sunderland
Google Scholar
Magun J (1998) Greedy matching algorithms: an experimental study. ACM J Exp Algorithmics 3:22–28
Google Scholar
Massah N, Wang J, Russell JH, Niejenhuis AV, El-Kassaby YA (2009) Genealogical relationship among members of selection and production populations of Yellow Cedar (Callitropsis nootkatensis [D.Don] Oerst.) in the absence of parental information. J Hered 101:154–163
Article PubMed Google Scholar
Schoen DJ, Stewart SC (1986) Variation in male reproductive investment and male reproductive success in white spruce. Evolution 40:1109–1120
Article Google Scholar
Schoen DJ, Stewart SC (1987) Variation in male fertilities and pairwise mating probabilities in Picea glauca. Genetics 116:141–152
PubMed CAS Google Scholar

Download references

Acknowledgements

We are grateful to Rowland Burdon and two anonymous reviewers for their critical review and many helpful comments on this article. The access to the MetaCentrum supercomputing facilities provided under the research intent MSM6383917201 is highly acknowledged. Support from the Czech Science Foundation (GAČR; grant 521/07/P337; M. Lstibůrek) and the National Agency for Agricultural Research (NAZV; grant QH81172; M. Lstibůrek) and (NAZV; grant QH81160; Jaroslav Kobliha) and the Natural Sciences and Engineering Research Council of Canada (Discovery and IRC Grants) and the Johnson’s Family Forest Biotechnology Endowment to Y. A. El-Kassaby are highly appreciated.

Author information

Authors and Affiliations

Department of Dendrology and Forest Tree Breeding, Faculty of Forestry and Wood Sciences, Czech University of Life Sciences Prague, Kamýcká 129, 165 21, Praha 6, Czech Republic
Milan Lstibůrek, Kristýna Ivanková, Jaroslav Kobliha & Jaroslav Klápště
Institute of Economic Studies, Faculty of Social Sciences, Charles University in Prague, Opletalova 26, 110 00, Praha 1, Czech Republic
Kristýna Ivanková
Faculty of Mathematics and Physics, Charles University in Prague, Ke Karlovu 3, 121 16, Praha 2, Czech Republic
Jan Kadlec
Department of Forest Sciences, Faculty of Forestry, University of British Columbia, 2424 Main Mall, V6T 1Z4, Vancouver, BC, Canada
Jaroslav Klápště & Yousry A. El-Kassaby

Authors

Milan Lstibůrek
View author publications
You can also search for this author in PubMed Google Scholar
Kristýna Ivanková
View author publications
You can also search for this author in PubMed Google Scholar
Jan Kadlec
View author publications
You can also search for this author in PubMed Google Scholar
Jaroslav Kobliha
View author publications
You can also search for this author in PubMed Google Scholar
Jaroslav Klápště
View author publications
You can also search for this author in PubMed Google Scholar
Yousry A. El-Kassaby
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Milan Lstibůrek.

Additional information

Communicated by R. Burdon

Appendix

ad(iii):

Probabilities are estimated by the Monte Carlo method:

Algorithm M

Let the number of all outcomes be denoted as A and that of all successful outcomes as S.

(i_M):

Set $A\leftarrow 0$, $S \leftarrow 0$.

(ii_M):

$A \leftarrow A + 1$

Assign randomly male parents to offspring, so that i ^th offspring is sired by a male parent y with a probability p _i,y.

(iii_M):

Find the biggest subset of offspring M with all parents different. {algorithm MinRedl2}

If $\vert M \vert \geqq N$, increase $S \leftarrow S + 1$.

(iv_M):

If $A \geqq o_{\max}$ or

if P is outside the confidence interval logit(S, A, α), {we reject H ₀ : p = P} return result $\hat{p}= S/A$, else go to (ii_M).

ad(iii_M):

The problem of finding the largest subset of offspring with different parents was converted to the problem of finding a maximum matching in a general graph, a well-developed subject in graph theory (Agnarsson and Greenlaw 2008).

As a graph, we define a pair (V, E) containing a set of vertices V and edges E connecting pairs of vertices. In our settings, the vertices are declared as parents while the edges are respective offspring. A pairing M is a subset of E where no two edges share common vertex. Vertices sharing an edge with a vertex V will be denoted as its neighbors.

For finding maximum matchings in graphs, we utilized the fast approximation algorithm MinRed12 (MR) by Magun (1998).

(i_MR):: $\vert M \vert \leftarrow 0$
(ii_MR):: Remove all vertices without edges and all duplicate edges; consider vertex v with the lowest number of neighbors Δ_v.

Note: This vertex is always removed. Along with the vertex, potential neighbor is removed as well (this is determined by the number of neighbors):

(iii_MR):

If Δ_v = 1, remove v and its neighbor w,

if Δ_v = 2, remove v and merge its neighbors w ₁ and w ₂, else remove w with the lowest number of neighbors Δ_w.

Note: While the first case (Δ_v = 1) is obvious, the second case (Δ_v = 2) is questionable (which one should be removed). As one vertex must always be removed, the two are simply merged. We are not interested at the exact form of such a pairing. It is sufficient to declare the number of paired vertices. If Δ_v > 2, a heuristic approach is used (this is the only approximation step in the algorithm). Let us choose a particular neighbor w out of all potential neighbors v so that it has the fewest number of neighbors. This one is removed. The general idea is that vertices with the lowest number of neighbors interfere the least to the pairing among other vertices. Details are provided in Magun (1998).

(iv_MR):

$\vert M \vert \leftarrow \vert M \vert + 1$ {Add edge vw to pairing}

If $\vert M \vert \geqq N $, finish.

If there are any more edges in the graph, go to (ii_MR).

Note: We are testing a null hypothesis that $\hat{p} = P$ against the alternative that $\hat{p}\ne P$. If H ₀ is rejected, we can stop testing the current set. As a result, an outcome of this algorithm must always satisfy this testing, irrespective of its exact score value.

For hypothesis testing, we will use the logit interval, based on the approximation of binomial distribution of logit functions. Such a function is used in the next step with the exception that the exact binomial interval is used for X = 0, 1,Z − 1,Z. If $\hat{p}$ is inside this interval, we are not rejecting the hypothesis.

ad(iv_M):

To test, whether the probability is estimated accurately, an interval logit(X, Z, α) is used in the algorithm, where X = number of successful tests, Z = number of all tests, and α is the probability of the type I error. The value c = − 0.5 is taken from Edwardes (1998).

Set

$$\begin{array}{rll} x&=&X-c,\quad z=Z-2c, \\ \varphi &=& \text{exp} \bigg( \Phi (1-\alpha / 2) \sqrt{\frac{z}{x(z-x)}}\bigg), \end{array}$$

where Φ is the cumulative distribution function for the normal distribution. Then

$$\begin{array}{rll} &&\text{logit}(X,Z,\alpha) \\&& = \left\{ \begin{array}{rl} \Big\langle 0,1-\sqrt[Z]{\alpha/2} \Big\rangle & \text{if } X=0 \\ \Big\langle 1-\sqrt[Z]{1-\alpha}, \displaystyle\frac{x}{x+(z-x)/\varphi} \Big\rangle & \text{if } X=1 \\ \Big\langle \displaystyle\frac{x}{x+(z-x)\varphi},\sqrt[Z]{1-\alpha} \Big\rangle & \text{if } X\!=\!Z\!-\!1 \\ \Big\langle \sqrt[Z]{\alpha/2},1 \Big\rangle & \text{if } X=Z \\ \Big\langle \displaystyle\frac{x}{x+(z-x)\varphi},\displaystyle\frac{x}{x+(z-x)/\varphi} \Big\rangle & \text{else.} \end{array} \right. \end{array} $$

(9)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lstibůrek, M., Ivanková, K., Kadlec, J. et al. Breeding without breeding: minimum fingerprinting effort with respect to the effective population size. Tree Genetics & Genomes 7, 1069–1078 (2011). https://doi.org/10.1007/s11295-011-0395-1

Download citation

Received: 30 June 2010
Revised: 25 March 2011
Accepted: 19 April 2011
Published: 11 June 2011
Issue Date: October 2011
DOI: https://doi.org/10.1007/s11295-011-0395-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Breeding without breeding: minimum fingerprinting effort with respect to the effective population size

Abstract

Access this article

Similar content being viewed by others

Mapping and functional characterization of structural variation in 1060 pig genomes

Isolation, small population size, and management influence inbreeding and reduced genetic variation in K’gari dingoes

Overview of Statistical Methods for Genome-Wide Association Studies (GWAS)

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Breeding without breeding: minimum fingerprinting effort with respect to the effective population size

Abstract

Access this article

Similar content being viewed by others

Mapping and functional characterization of structural variation in 1060 pig genomes

Isolation, small population size, and management influence inbreeding and reduced genetic variation in K’gari dingoes

Overview of Statistical Methods for Genome-Wide Association Studies (GWAS)

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation