Long-Term and Short-Term Evolutionary Impacts of Transposable Elements on Drosophila

Table 1

Information of candidate genes

Category	Flybase ID	Symbol	Gene name	Chr^a	CDs length^b	Gene functions
piRNA genes	FBgn0004872	piwi	piwi	2L	2532	piRNA generation
	FBgn0000146	aub	aubergine	2L	2601	piRNA generation
	FBgn0250816	AGO3	Argonaute 3	3L	2604	piRNA generation
	FBgn0041164	armi	armitage	3L	3825	RNA helicase
	FBgn0003483	spn-E	spindle E	3R	4305	RNA helicase
	FBgn0002652	squ	squash	2L	726	Nuclease
	FBgn0261266	zuc	zucchini	2L	762	Nuclease
	FBgn0033686	Hen1	PIMET/DmHen1	2R	1176	piRNA methyltransferase
	FBgn0016034	mael	maelstrom	3L	1389	Nuage component
	FBgn0034098	krimp	krimper	2R	2241	Nuage component
	FBgn0262526	vas	vasa	2L	661	Nuage component
	FBgn0004400	rhi	rhino	2R	1257	HP1 paralog, heterochromatin binding
P-element–specific genes	FBgn0014870	Psi	P-element somatic inhibitor	2R	2394	P-element somatic inhibitor; mRNA splicing factor
	FBgn0004838	Hrb27C	Heterogeneous nuclear ribonucleoprotein at 27C	2L	1266	mRNA splicing factor
	FBgn0011774	Irbp	Inverted repeat-binding protein	3R	1897	Recombination repair protein
	FBgn0041627	Ku80	Ku80	2L	2100	Recombination repair protein

Category	Flybase ID	Symbol	Gene name	Chr^a	CDs length^b	Gene functions
piRNA genes	FBgn0004872	piwi	piwi	2L	2532	piRNA generation
	FBgn0000146	aub	aubergine	2L	2601	piRNA generation
	FBgn0250816	AGO3	Argonaute 3	3L	2604	piRNA generation
	FBgn0041164	armi	armitage	3L	3825	RNA helicase
	FBgn0003483	spn-E	spindle E	3R	4305	RNA helicase
	FBgn0002652	squ	squash	2L	726	Nuclease
	FBgn0261266	zuc	zucchini	2L	762	Nuclease
	FBgn0033686	Hen1	PIMET/DmHen1	2R	1176	piRNA methyltransferase
	FBgn0016034	mael	maelstrom	3L	1389	Nuage component
	FBgn0034098	krimp	krimper	2R	2241	Nuage component
	FBgn0262526	vas	vasa	2L	661	Nuage component
	FBgn0004400	rhi	rhino	2R	1257	HP1 paralog, heterochromatin binding
P-element–specific genes	FBgn0014870	Psi	P-element somatic inhibitor	2R	2394	P-element somatic inhibitor; mRNA splicing factor
	FBgn0004838	Hrb27C	Heterogeneous nuclear ribonucleoprotein at 27C	2L	1266	mRNA splicing factor
	FBgn0011774	Irbp	Inverted repeat-binding protein	3R	1897	Recombination repair protein
	FBgn0041627	Ku80	Ku80	2L	2100	Recombination repair protein

a

chromosome

b

length of coding regions (bp)

The second group of candidates contains genes known to interact with P elements via other pathways. The double-stranded breaks left after P-element transpositions in the germline are repaired by host recombination–repair machinery of heterodimers formed by Irbp and Ku80 (Rio and Rubin 1988; Beall et al. 1994). Double-stranded breaks generated by other DNA-based TEs may be repaired using a similar mechanism. Splicing factor Psi is shown to specifically bind to the 5′-splice site of the P-element third intron, suppressing the proper splicing of mRNA of P-element transposase and thereby repressing the somatic transposition of P elements (Siebel and Rio 1990; Siebel et al. 1992, 1994, 1995; Adams et al. 1997). Hrb27C is a nuclear protein that forms an mRNA splicing complex with Psi (Siebel et al. 1992, 1994).

Genes interacting with transposable elements show evidence of positive selection

Recurrent directional selection can lead to an accelerated rate of protein divergence relative to synonymous site divergence. We used maximum-likelihood methods (Yang 2007) to estimate the d_N/d_S ratios on both the D. melanogaster and the D. simulans branches with D. yakuba as an outgroup. We then rank d_N/d_S estimates of candidate genes among other D. melanogaster annotated genes (see Table 2 for d_N/d_S ratio and Table S2 for separate d_N and d_S values). Consistent with a previous report (Vermaak et al. 2005), rhi has a d_N/d_S ratio that is larger than one and is among the fastest-evolving genes in D. melanogaster (d_N/d_S = 1.415, rank = 0.31% genome-wide). Two nuage component genes, krimp and mael, both rank in the top 5% genome-wide while aub and zuc are among top 10%. Estimates of the d_N/d_S ratios on the D. simulans branch, except for that of zuc, gave similar results. Our inferences are generally in agreement with previous reports that used a branch-site model to detect recurrent amino acid substitutions on the phylogeny of 12 Drosophila species (Heger and Ponting 2007; Kolaczkowski et al. 2011). Even though Spn-E was identified as the RNA interference gene showing the most extensive signal of recurrent adaptation on the phylogenetic tree (Heger and Ponting 2007; Kolaczkowski et al. 2011), the branch leading to D. melanogaster was not significant (Kolaczkowski et al. 2011). The differences between studies are not surprising given the fundamental differences in methodology (comparing relative rates of amino acid substitutions of entire coding sequences among all genes vs. identification of a subset of sites or branches that recurrently substituted across the phylogeny).

Linage-specific relative rates of protein evolution

Table 2

Linage-specific relative rates of protein evolution

	mel		sim
Gene	d_N/d_S	Percentile^a	d_N/d_S	Percentile^a
AGO3	0.261	12.67	0.265	18.79
armi	0.297	10.22	0.391	10.88
aub	0.382	6.60	0.530	6.31
Hen1	0.246	13.64	0.266	18.66
Hrb27C	0.176	21.99	0.019	82.02
Irbp	0.101	38.42	0.164	32.83
krimp	0.471	4.71	0.964	2.09
Ku80	0.181	21.28	0.245	20.80
mael	0.902	1.05	0.491	7.17
piwi	0.081	45.87	0.194	27.71
Psi	0.091	42.12	0.027	78.55
rhi	1.415	0.31	0.508	6.77
spn-E	0.219	16.28	0.281	17.42
squ	0.240	14.25	0.338	13.72
vas	0.243	14.05	0.306	15.63
zuc	0.359	7.48	0.207	25.73

	mel		sim
Gene	d_N/d_S	Percentile^a	d_N/d_S	Percentile^a
AGO3	0.261	12.67	0.265	18.79
armi	0.297	10.22	0.391	10.88
aub	0.382	6.60	0.530	6.31
Hen1	0.246	13.64	0.266	18.66
Hrb27C	0.176	21.99	0.019	82.02
Irbp	0.101	38.42	0.164	32.83
krimp	0.471	4.71	0.964	2.09
Ku80	0.181	21.28	0.245	20.80
mael	0.902	1.05	0.491	7.17
piwi	0.081	45.87	0.194	27.71
Psi	0.091	42.12	0.027	78.55
rhi	1.415	0.31	0.508	6.77
spn-E	0.219	16.28	0.281	17.42
squ	0.240	14.25	0.338	13.72
vas	0.243	14.05	0.306	15.63
zuc	0.359	7.48	0.207	25.73

a

d_N/d_S for each candidate gene was ranked among 9172 (D. melanogaster) and 9051 (D. simulans) genes that have PAML results. Candidate genes that ranked among the top 10% among all the genes with PAML results are in boldface type.

Table 2

Linage-specific relative rates of protein evolution

	mel		sim
Gene	d_N/d_S	Percentile^a	d_N/d_S	Percentile^a
AGO3	0.261	12.67	0.265	18.79
armi	0.297	10.22	0.391	10.88
aub	0.382	6.60	0.530	6.31
Hen1	0.246	13.64	0.266	18.66
Hrb27C	0.176	21.99	0.019	82.02
Irbp	0.101	38.42	0.164	32.83
krimp	0.471	4.71	0.964	2.09
Ku80	0.181	21.28	0.245	20.80
mael	0.902	1.05	0.491	7.17
piwi	0.081	45.87	0.194	27.71
Psi	0.091	42.12	0.027	78.55
rhi	1.415	0.31	0.508	6.77
spn-E	0.219	16.28	0.281	17.42
squ	0.240	14.25	0.338	13.72
vas	0.243	14.05	0.306	15.63
zuc	0.359	7.48	0.207	25.73

	mel		sim
Gene	d_N/d_S	Percentile^a	d_N/d_S	Percentile^a
AGO3	0.261	12.67	0.265	18.79
armi	0.297	10.22	0.391	10.88
aub	0.382	6.60	0.530	6.31
Hen1	0.246	13.64	0.266	18.66
Hrb27C	0.176	21.99	0.019	82.02
Irbp	0.101	38.42	0.164	32.83
krimp	0.471	4.71	0.964	2.09
Ku80	0.181	21.28	0.245	20.80
mael	0.902	1.05	0.491	7.17
piwi	0.081	45.87	0.194	27.71
Psi	0.091	42.12	0.027	78.55
rhi	1.415	0.31	0.508	6.77
spn-E	0.219	16.28	0.281	17.42
squ	0.240	14.25	0.338	13.72
vas	0.243	14.05	0.306	15.63
zuc	0.359	7.48	0.207	25.73

a

d_N/d_S for each candidate gene was ranked among 9172 (D. melanogaster) and 9051 (D. simulans) genes that have PAML results. Candidate genes that ranked among the top 10% among all the genes with PAML results are in boldface type.

We used the MK test (McDonald and Kreitman 1991) to detect genes whose evolution does not follow the neutral model of evolution. Rejection of the null hypothesis due to the presence of more than the expected number of amino acid fixations has been interpreted as evidence supporting adaptive protein evolution. To have greater statistical power, we considered polymorphisms from both D. melanogaster (Langley et al. 2012) and D. simulans (Begun et al. 2007). We identified aub and armi as significant while spn-E, krimp, vas, and Ku80 were marginally significant (Table 3). All of these genes rejected the null hypothesis of neutral evolution in the direction of an excess of amino acid divergence (Table 3), consistent with a history of positive selection acting on these genes. To further localize protein domains showing signals of positive selection, we performed MK tests on Pfam-annotated domains. We found the PAZ domain of aub (P-value = 0.011) and the DEAD domain of vas (P-value = 0.0007) showed significant enrichment of amino acid substitutions. When considering only within D. melanogaster polymorphism, we did not find evidence of adaptive evolution for spn-E, vas, and Ku80 (Table 3), perhaps due to a generally lower level of variation in D. melanogaster than in D. simulans (Aquadro et al. 1988; Andolfatto 2001; Andolfatto et al. 2011) and thus lower statistical power.

McDonald–Kreitman test on candidate genes

Table 3

McDonald–Kreitman test on candidate genes

	Two-species MK test^a		mel MK test^b
Gene	No. codons	P-value	No. codons	P-value
AGO3	NA	NA^c	689	0.351
armi	1186	<0.001	1231	<0.001
aub	835	<0.001	855	<0.001
Hen1	389	0.837	391	1.000
Hrb27C	421	0.095	421	0.532
Irbp	628	0.189	628	0.069
krimp	712	0.020	690	0.001
Ku80	694	0.012	699	0.156
mael	NA	NA	459	0.390
piwi	837	0.732	843	0.267
Psi	796	0.189	797	1.000
rhi	413	0.396	413	0.535
spn-E	1427	0.021	1433	0.731
squ	130	0.493	233	1.000
vas	634	0.022	639	0.151
zuc	253	1.000	253	1.000

	Two-species MK test^a		mel MK test^b
Gene	No. codons	P-value	No. codons	P-value
AGO3	NA	NA^c	689	0.351
armi	1186	<0.001	1231	<0.001
aub	835	<0.001	855	<0.001
Hen1	389	0.837	391	1.000
Hrb27C	421	0.095	421	0.532
Irbp	628	0.189	628	0.069
krimp	712	0.020	690	0.001
Ku80	694	0.012	699	0.156
mael	NA	NA	459	0.390
piwi	837	0.732	843	0.267
Psi	796	0.189	797	1.000
rhi	413	0.396	413	0.535
spn-E	1427	0.021	1433	0.731
squ	130	0.493	233	1.000
vas	634	0.022	639	0.151
zuc	253	1.000	253	1.000

MK tests with significant p-values and positive α are in bold type.

a

MK tests using both D. melanogaster and D. simulans polymorphism (see Materials and Methods).

b

MK tests using only D. melanogaster polymorphism.

c

Not available due to lack of D. simulans polymorphism data.

Table 3

McDonald–Kreitman test on candidate genes

	Two-species MK test^a		mel MK test^b
Gene	No. codons	P-value	No. codons	P-value
AGO3	NA	NA^c	689	0.351
armi	1186	<0.001	1231	<0.001
aub	835	<0.001	855	<0.001
Hen1	389	0.837	391	1.000
Hrb27C	421	0.095	421	0.532
Irbp	628	0.189	628	0.069
krimp	712	0.020	690	0.001
Ku80	694	0.012	699	0.156
mael	NA	NA	459	0.390
piwi	837	0.732	843	0.267
Psi	796	0.189	797	1.000
rhi	413	0.396	413	0.535
spn-E	1427	0.021	1433	0.731
squ	130	0.493	233	1.000
vas	634	0.022	639	0.151
zuc	253	1.000	253	1.000

	Two-species MK test^a		mel MK test^b
Gene	No. codons	P-value	No. codons	P-value
AGO3	NA	NA^c	689	0.351
armi	1186	<0.001	1231	<0.001
aub	835	<0.001	855	<0.001
Hen1	389	0.837	391	1.000
Hrb27C	421	0.095	421	0.532
Irbp	628	0.189	628	0.069
krimp	712	0.020	690	0.001
Ku80	694	0.012	699	0.156
mael	NA	NA	459	0.390
piwi	837	0.732	843	0.267
Psi	796	0.189	797	1.000
rhi	413	0.396	413	0.535
spn-E	1427	0.021	1433	0.731
squ	130	0.493	233	1.000
vas	634	0.022	639	0.151
zuc	253	1.000	253	1.000

MK tests with significant p-values and positive α are in bold type.

a

MK tests using both D. melanogaster and D. simulans polymorphism (see Materials and Methods).

b

MK tests using only D. melanogaster polymorphism.

c

Not available due to lack of D. simulans polymorphism data.

Genes interacting with transposable elements show more prevalent evidence of positive selection than immunity genes

The fundamental differences in the mechanism of transmission between TE families and other nongenomic pathogens raised the question regarding the relative intensity of evolutionary impacts they imposed on hosts. To address this question, we compared the proportion of genes exhibiting evidence of positive selection between our candidate genes and immunity genes, using the same population genetic and molecular evolution analysis.

We found 5 of 12 piRNA genes (41.67%) have D. melanogaster d_N/d_S estimates among the top 10% genome-wide, which is significantly greater than that of immunity genes (24 of 214 genes, 11.21%, FET P = 0.01, Figure 1A and Table S2). Studies have found that the rates of adaptive evolution vary with respect to the function of immunity genes (Sackton et al. 2007; Obbard et al. 2009b; reviewed in Lazzaro 2008). We categorized immunity genes into “recognition,” “signaling,” and “effector” categories and still found piRNA genes have a higher proportion of fast-evolving genes than all categories of immunity genes. However, we found statistical significance only when comparing piRNA genes to either signaling or effector categories (Figure 1A). Comparisons considering all candidate genes (both piRNA genes and genes known to interact with P elements; Figure 1A) or focusing on relative rates of amino acid evolution on the D. simulans lineage (data not shown) are consistent with our findings using piRNA pathway gene evolution along the D. melanogaster lineage.

Figure 1

Proportion of candidate and immunity genes showing evidence of positive selection. (A) Proportions of candidate and immunity genes having D. melanogaster d_N/d_S among the top 10% genome-wide. (B) Proportions of candidate genes, immunity genes, and all genes having significant two-species MK tests (P-value < 0.05) and positive α. Dashed lines are the expectations assuming uniformity. The Number of genes with MK test and PAML results in each category is shown in parentheses. (C and D) Maximum-likelihood estimates of averaged α (C) and boxplots (25th, 50th, and 75th percentiles) of estimated ω_α (D) for different classes of genes. Error bars represent the 95% bootstrapping intervals around each estimate. Significant comparisons between piRNA genes and other classes of genes are denoted by * (P-value <0.05), ** (P-value <0.01), and *** (P-value <0.001). Comparisons of proportions (A and B) were based on Fisher’s exact test, comparisons of maximum-likelihood estimated α (C) were based on permutations, and comparisons of ω_α (D) were based on a Mann–Whitney U-test.

We observed even more dramatic enrichment in the proportion of genes showing evidence of recurrent adaptive protein evolution (rejection of MK tests with overabundant amino acid fixation) of piRNA genes (5 of 10 genes, 50%) than that of immunity genes (18 of 180 genes, 9.52%, FET P-values = 0.002; Figure 1B) and the genome-wide proportion (888 of 8085 genes, 10.98%, FET P-values = 0.003). Signaling genes have the largest proportion of genes showing adaptive evolution among three immunity gene categories (13 of 97 genes, 10.92%), which is still significantly lower than that of piRNA genes or all candidate genes (FET P-values = 0.012). We found a consistent pattern when including all candidate genes in the comparisons.

We used the maximum-likelihood method proposed by Welch (2006) to formally test whether averaged α (the proportion of amino acid substitution fixed by positive selection) is different between classes of genes. When comparing either “piRNA vs. immunity genes” or “piRNA vs. all genes,” we found a two-α model consistently fitted the data better [Λ = 446.46 (vs. immunity genes) and 613.82 (vs. all genes); P < 0.001 for both comparisons]. Comparisons between piRNA genes and a specific subset of immunity genes were also highly significant (P < 0.001). Permutation analysis also found the maximum-likelihood estimated α of piRNA genes (α = 0.82) is significantly higher than that of immunity genes (α = 0.40 for all immunity genes) and all genes (α = 0.36), except for recognition immunity genes (Figure 1C). Again, comparisons considering all candidate genes gave consistent results. Another estimate of adaptive protein evolution ω_α, the rate of adaptive substitution relative to the rate of neutral substitutions (Gossmann et al. 2010), supports the same conclusion based on α (Figure 1D). Either piRNA genes or all candidate genes have larger ω_α than immunity genes except for recognition immunity genes (Mann–Whitney U-test, P < 0.05) and all genes (Mann–Whitney U-test, P < 0.001), suggesting that the larger α of TE-interacting genes is not due to differences in proportion of effectively neutral mutations.

Horizontal transfer of TE families does not impose enduringly strong selection on host beneficial variants

We used a deterministic model to analyze the dynamics of host alleles that can reduce the transposition rate of a newly horizontally transferred TE family by a fixed proportion (referred to as “beneficial allele”) during the spread of that TE family in a panmictic host population (see Materials and Methods and Appendix for model details). We specifically considered the well-documented horizontal transfer of P elements, which provided the biological context needed to specify details of the model. The model considered both epistatic selections against increases in P-element copy number (Charlesworth and Charlesworth 1983; Dolgin and Charlesworth 2006, 2008) and host sterility caused by too many double-stranded chromosomal breaks generated through P-element transposition. We set individuals with more than n_HD new P-element transpositions (and thus double-stranded breaks) to be completely sterile. This sterility effect would most likely occur in the M(female) × P(male) hybrid dysgenic cross, as the transposition rate of P elements in the dysgenic cross (u₀) is several orders of magnitude higher than in the nondysgenic cross (u₁) (Eggleston et al. 1988). The host beneficial allele will especially enjoy strong fitness advantages in the dysgenic crosses because hosts with this allele will be more likely to have fewer than n_HD double-stranded breaks and therefore higher expected fertility. Of course, the degree of sterility elicited by P-element transposition can be a continuous phenotype. The usage of the truncation selection model will maximize the selective benefit of a host allele that can reduce the transposition rate of P element, making our overall conclusion conservative.

The increase in frequency of the host beneficial allele (l) is dependent on how fast P elements spread through the population and, during their spread, how likely hybrid dysgenesis is to occur. We found that the spread of P elements is fast in most cases (Figure 2A for u₀ = 1 and n_HD = 5; see Appendix for discussions of other cases tested), a finding that is consistent with several caged experiments introducing P elements into M strain populations (Kidwell et al. 1988; Good et al. 1989). This quick spread leads to the host beneficial allele having selective advantage only during a narrow period (Figure 2B). We found that the largest selective advantage occurs when the proportion of P cytotype individuals (r) is intermediate and the probability of a hybrid dysgenic cross is high (Figure 2C). This is also reflected in the dynamics of l, which has a phase of rapid increase followed by a longer phase of slow increase (Figure 2D).

Figure 2

The dynamics of the host population during the spread of P elements. (A–D) The change of proportion of P-cytotype individuals, r (A); the selection coefficient against the nonbeneficial host allele, s (B); the relationship between s and r (C); and the allele frequency of the host beneficial allele, I (D) when u₀=1 and n_HD=5.

The combined effects of u₀ and n_HD had the most notable influence on the dynamics of the host beneficial allele. With increased n_HD, it takes fewer generations until the P element is found in nearly all genomes (r > 0.99) in the population (t_0.99) (Figure 3A) and host beneficial allele frequency at generation 1000 (l₁₀₀₀) because of the decreased duration during which the probability of hybrid dysgenesis is high (Figure 3B). Generally, P elements spread faster when u₀ is larger (except for n_HD = 2, see below). Yet, the probability of hybrid dysgenesis is also higher, leading to larger l₁₀₀₀ (Figure 3, A and B). For all cases examined except one (see Appendix for all parameters tested), the difference between l₀ and l₁₀₀₀ (<2%) would hardly be detected with the regular size of samples.

Figure 3

The influences of u0 and nHD on the time for near fixation of P elements and host beneficial allele frequency. (A) The generations until the P element is nearly fixed in the population (t0.99) for different u0 and nHD. Green dots are when u0 = 1 and blue dots are when u0 = 10−1. The dashed line denotes generation 1000. (B) The allele frequencies of the host beneficial allele (l1000) at generation 1000 for different u0 and nHD. The dashed line denotes 0.001, which equals l0.

The influences of u₀ and n_HD on the time for near fixation of P elements and host beneficial allele frequency. (A) The generations until the P element is nearly fixed in the population (t_0.99) for different u₀ and n_HD. Green dots are when u₀ = 1 and blue dots are when u₀ = 10⁻¹. The dashed line denotes generation 1000. (B) The allele frequencies of the host beneficial allele (l₁₀₀₀) at generation 1000 for different u₀ and n_HD. The dashed line denotes 0.001, which equals l₀.

The only exception is when u₀ = 1 and n_HD=2 in which the temporal F_st can be as large as 0.15. However, it takes >10,000 generations for the P element to become nearly fixed in the population due to the still high probability of hybrid dysgenesis even when the majority of the hosts have P cytotype. This is much longer than the timescale of actual P-element spread. Furthermore, the tolerance of nonprogrammed double-stranded breaks in the germline can be much higher than this threshold (n_HD = 2; Orsi et al. 2010).

It is worth noting that we modeled the distribution of P-element copy number as approximately Poisson. The derivation and previous application of the Poisson approximation were focused on situations where the TE population is near equilibrium and the mean copy number of TEs is much larger than one (Charlesworth and Charlesworth 1983; Langley et al. 1983). However, the critical aspect of our analytical modeling here is the initial phase of the invasion of a new TE family (the P element), which may lead to a copy number distribution different from Poisson. We thus used Monte Carlo simulations (see File S1 for details of simulations) to investigate how the distribution of TE copy number reaches a Poisson distribution. We found that soon after the P cytotype is common in the population, the TE copy number distribution is close to Poisson (Figure S5 in File S1). More importantly, these simulations show that the estimated change of host beneficial allele frequency from the analytical approximation is within 2% of the simulated results (Figure S3 and Figure S4 in File S1), supporting our overall conclusions.

Recent horizontal transfer of the P element does not have widespread evolutionary impacts on candidate genes

To empirically investigate the short-term evolutionary impacts of TE horizontal transfer on hosts, we compared the genetic differentiation between pre–P-element invasion (pre-P) and current (post-P) populations (temporal differentiation). For five candidate genes that showed strong temporal differentiation in our initial survey with a smaller number of M strains (Irbp, krimp, Hen1, spn-E, and squ; see Materials and Methods and Table S3), only Hen1 and squ still showed highly significant temporal differentiation with increased size of pre–P-element samples (Table 4). However, we observed strong genetic differentiation between North American and African contemporary post-P populations for our candidate genes (geographic differentiation, Table 4). If the geographic differentiation of the current population was also present in the pre-P population, the wide geographic distribution of M strains used may lead to false conclusions about the temporal differentiation (see Materials and Methods and Table S1). We further restricted our analysis to the North American population, which has the largest number of alleles for both pre-P (15) and post-P (37) samples and still found strong temporal differentiation of Hen1 (F_st = 0.173, P = 0.003; Table 4). Such differentiation may reflect divergence in protein functions, as F_st estimated using amino acid sequences also showed significant temporal differentiation (F_st = 0.261, P < 0.001; Table 4).

Temporal differentiation of a subset of candidate genes

Table 4

Temporal differentiation of a subset of candidate genes

	Geographic differentiation^a		Temporal differentiation^b
			All samples		Only North American samples^c
	nucleotide differentiation^d		nucleotide differentiation		nucleotide differentiation		amino acid differentiation^e
	F_st	P-value	F_st	P-value	F_st	P-value	F_st	P-value
Irbp	0.144	0.111	0.170	0.038	0.000	0.334	−0.010	0.455
krimp	0.143	0.004	0.011	0.262	−0.009	0.635	0.006	0.345
Hen1	0.632	<0.001	0.303	0.001	0.173	0.003	0.261	<0.001
Spn-E	0.459	<0.001	0.053	0.08	0.031	0.097	0.071	0.033
squ	0.028	0.25	0.202	0.001	0.071	0.036	0.039	0.103

	Geographic differentiation^a		Temporal differentiation^b
			All samples		Only North American samples^c
	nucleotide differentiation^d		nucleotide differentiation		nucleotide differentiation		amino acid differentiation^e
	F_st	P-value	F_st	P-value	F_st	P-value	F_st	P-value
Irbp	0.144	0.111	0.170	0.038	0.000	0.334	−0.010	0.455
krimp	0.143	0.004	0.011	0.262	−0.009	0.635	0.006	0.345
Hen1	0.632	<0.001	0.303	0.001	0.173	0.003	0.261	<0.001
Spn-E	0.459	<0.001	0.053	0.08	0.031	0.097	0.071	0.033
squ	0.028	0.25	0.202	0.001	0.071	0.036	0.039	0.103

All the significant results are in boldface type.

a

Genetic differentiation between current (post–P-element) African and North American populations.

b

Genetic differentiation between pre–P- and post–P-element invasion populations.

c

The temporal differentiation was estimated considering only North American samples of both populations (before and after P-element invasions).

d

Genetic differentiation estimated using nucleotide sequences.

e

Genetic differentiation estimated using amino acid sequences.

Table 4

Temporal differentiation of a subset of candidate genes

	Geographic differentiation^a		Temporal differentiation^b
			All samples		Only North American samples^c
	nucleotide differentiation^d		nucleotide differentiation		nucleotide differentiation		amino acid differentiation^e
	F_st	P-value	F_st	P-value	F_st	P-value	F_st	P-value
Irbp	0.144	0.111	0.170	0.038	0.000	0.334	−0.010	0.455
krimp	0.143	0.004	0.011	0.262	−0.009	0.635	0.006	0.345
Hen1	0.632	<0.001	0.303	0.001	0.173	0.003	0.261	<0.001
Spn-E	0.459	<0.001	0.053	0.08	0.031	0.097	0.071	0.033
squ	0.028	0.25	0.202	0.001	0.071	0.036	0.039	0.103

	Geographic differentiation^a		Temporal differentiation^b
			All samples		Only North American samples^c
	nucleotide differentiation^d		nucleotide differentiation		nucleotide differentiation		amino acid differentiation^e
	F_st	P-value	F_st	P-value	F_st	P-value	F_st	P-value
Irbp	0.144	0.111	0.170	0.038	0.000	0.334	−0.010	0.455
krimp	0.143	0.004	0.011	0.262	−0.009	0.635	0.006	0.345
Hen1	0.632	<0.001	0.303	0.001	0.173	0.003	0.261	<0.001
Spn-E	0.459	<0.001	0.053	0.08	0.031	0.097	0.071	0.033
squ	0.028	0.25	0.202	0.001	0.071	0.036	0.039	0.103

All the significant results are in boldface type.

a

Genetic differentiation between current (post–P-element) African and North American populations.

b

Genetic differentiation between pre–P- and post–P-element invasion populations.

c

The temporal differentiation was estimated considering only North American samples of both populations (before and after P-element invasions).

d

Genetic differentiation estimated using nucleotide sequences.

e

Genetic differentiation estimated using amino acid sequences.

The strong temporal differentiation of Hen1 is likely the result of genetic hitchhiking from a nearby, strongly selected gene

Given that the spread of P elements is fairly recent and fast, any associated directional selection on suppressive host variants is expected to result in reduced genomic polymorphism around the selected SNPs and increased linkage disequilibrium due to genetic hitchhiking (Maynard Smith and Haigh 1974; Kim and Nielsen 2004; Stephan et al. 2006). Consistent with the analysis of temporal differentiation, we observed marginally significant haplotypic structures for the Hen1 gene region for post-P North American populations [frequency of major haplotypes = 15, P = 0.042 (Hudson et al. 1994) and haplotype heterozygosity = 0.759, P = 0.031 (Depaulis and Veuille 1998)]. This strong haplotypic structure was extended at least 10 kb upstream and downstream of Hen1 (frequency of major haplotypes = 10, P = 0.003 and haplotype heterozygosity = 0.290, P = 0.001). We also observed a dramatic reduction of post–P-element North American variation for an almost 100-kb genomic segment around Hen1 when compared with polymorphism in post–P-element African D. melanogaster, polymorphism in closely related D. simulans, and divergence between D. melanogaster and D. simulans (Figure 4A).

Figure 4

Polymorphism, divergence, and temporal differentiation around Hen1. (A) Divergence between D. melanogaster and D. simulans (red), polymorphism of D. simulans (orange), polymorphism of the post–P-element African D. melanogaster population (green), and polymorphism of the post–P-element North American D. melanogaster population (blue) of 100 kb upstream and downstream of Hen1 (from 8,033,215 to 8,040,257) are shown on a log scale. There is a dramatic drop of polymorphism in the North American D. melanogaster populations around Hen1. (B and C) Temporal differentiation between pre–P- and post–P-element North American populations of control genes (B), and their relative position (C). Genes in (C) are Hen1 (red), Cyp6g1 (orange), jeb (1), CG8378 (2), CG13178 (3), CG8878 (4), CG8407 (5), Oda (6), wash (7), CG33964 (8), Cyp6t3 (9), RpS11 (10), and Sr-CII (11). Sequenced regions of each control gene are shown in blue while unsequenced regions are in light blue. The coordinates of three figures are aligned.

However, the genomic region around Hen1 is highly gene rich, with >20 genes, including Cyp6g1. Studies have identified a recent selective sweep associated with the Cyp6g1 allele that has an Accord transposable element inserted upstream (Daborn et al. 2002; Catania et al. 2004; Chung et al. 2007; Schmidt et al. 2010). Functional analysis confirmed that this Accord insertion confers insecticide resistance (Daborn et al. 2002; Chung et al. 2007), which is the most likely force driving the strong, recent selective sweep on Cyp6g1. The Accord inserted allele was found fixed in non-African populations, yet it was intermediate in African populations (Catania et al. 2004), which may also explain why the reduction of heterozygosity around Hen1 was most apparent in the North American populations while the P element is virtually fixed worldwide.

To further investigate whether the strong temporal differentiation observed on Hen1 is the result of genetic hitchhiking from strongly selected Cyp6g1 or any other genes in the nearby region, we estimated the temporal differentiation of coding regions for another 11 genes that are within 30 kb to either Hen1 or Cyp6g1 and have functions unrelated to either TE suppression or insecticide resistance (“control genes,” Table S4 and Figure 4C). Compared with the observed strong temporal differentiation of these control genes, Hen1 (F_st = 0.173) is no longer exceptional (Table 5 and Figure 4B). We also found that the closer a gene is to Cyp6g1, the stronger the temporal differentiation was (Table 5 and Figure 4B). The geographic differentiation was also strong for these 11 genes (Table 5), which is consistent with the scenario that application of insecticide in the non-African regions is leading to strong genetic hitchhiking on genes around Cyp6g1 in the post-P North American population we studied.

Temporal and geographic differentiation of control genes

Table 5

Temporal and geographic differentiation of control genes

	Distance to Hen1^c	Distance to Cyp6g1^d	M strain π		Temporal differentiation^a		Geographic differentiation^b
			Nonsyn	Syn	F_st	P-value	F_st	P-value
jeb^e	−31,911	−69,718	0.0021	0.0145	0.089	0.037	0.084	0.085
CG8378	−7,273	−45,080	0.0000	0.0013	0.239	0.001	0.372	0.011
CG13178	−5,378	−43,185	0.0007	0.0044	0.271	0.002	0.689	<0.001
CG8878	0	−37,807	0.0001	0.0013	0.195	0.001	0.365	<0.001
CG8407	4,426	−33,381	0.0000	0.0143	0.230	0.005	0.057	0.226
Oda	21,177	−16,630	0.0012	0.0007	0.369	0.001	0.669	<0.001
wash	32,266	−5,541	0.0010	0.0029	0.393	<0.001	0.583	<0.001
CG33964	34,832	−2,975	0.0009	0.0010	0.377	<0.001	0.654	<0.001
Cyp6t3	42,227	4,420	0.0021	0.0059	0.549	<0.001	0.329	0.007
RpS11	50,683	12,876	0.0000	0.0027	NA	NA	0.648	<0.001
Sr-CII	60,799	22,992	0.0025	0.0151	0.238	0.001	0.368	<0.001

	Distance to Hen1^c	Distance to Cyp6g1^d	M strain π		Temporal differentiation^a		Geographic differentiation^b
			Nonsyn	Syn	F_st	P-value	F_st	P-value
jeb^e	−31,911	−69,718	0.0021	0.0145	0.089	0.037	0.084	0.085
CG8378	−7,273	−45,080	0.0000	0.0013	0.239	0.001	0.372	0.011
CG13178	−5,378	−43,185	0.0007	0.0044	0.271	0.002	0.689	<0.001
CG8878	0	−37,807	0.0001	0.0013	0.195	0.001	0.365	<0.001
CG8407	4,426	−33,381	0.0000	0.0143	0.230	0.005	0.057	0.226
Oda	21,177	−16,630	0.0012	0.0007	0.369	0.001	0.669	<0.001
wash	32,266	−5,541	0.0010	0.0029	0.393	<0.001	0.583	<0.001
CG33964	34,832	−2,975	0.0009	0.0010	0.377	<0.001	0.654	<0.001
Cyp6t3	42,227	4,420	0.0021	0.0059	0.549	<0.001	0.329	0.007
RpS11	50,683	12,876	0.0000	0.0027	NA	NA	0.648	<0.001
Sr-CII	60,799	22,992	0.0025	0.0151	0.238	0.001	0.368	<0.001

NA, not available due to no SNP differences between pre–P- and post–P-element North American populations. The differentiations were estimated using nucleotide sequences.

a

Comparisons between pre– and post–P-element invasion populations with samples from North American populations only.

b

Comparisons between North American and African post–P-element invasion populations.

c

Distance in base pairs between the midpoints of the focused gene and Hen1. CG8878 has zero distance because it is nested within Hen1.

d

Distance in base pairs between the midpoints of the focused gene and Cyp6g1.

e

Only the coding region of the first exon of jeb was sequenced.

Table 5

Temporal and geographic differentiation of control genes

	Distance to Hen1^c	Distance to Cyp6g1^d	M strain π		Temporal differentiation^a		Geographic differentiation^b
			Nonsyn	Syn	F_st	P-value	F_st	P-value
jeb^e	−31,911	−69,718	0.0021	0.0145	0.089	0.037	0.084	0.085
CG8378	−7,273	−45,080	0.0000	0.0013	0.239	0.001	0.372	0.011
CG13178	−5,378	−43,185	0.0007	0.0044	0.271	0.002	0.689	<0.001
CG8878	0	−37,807	0.0001	0.0013	0.195	0.001	0.365	<0.001
CG8407	4,426	−33,381	0.0000	0.0143	0.230	0.005	0.057	0.226
Oda	21,177	−16,630	0.0012	0.0007	0.369	0.001	0.669	<0.001
wash	32,266	−5,541	0.0010	0.0029	0.393	<0.001	0.583	<0.001
CG33964	34,832	−2,975	0.0009	0.0010	0.377	<0.001	0.654	<0.001
Cyp6t3	42,227	4,420	0.0021	0.0059	0.549	<0.001	0.329	0.007
RpS11	50,683	12,876	0.0000	0.0027	NA	NA	0.648	<0.001
Sr-CII	60,799	22,992	0.0025	0.0151	0.238	0.001	0.368	<0.001

	Distance to Hen1^c	Distance to Cyp6g1^d	M strain π		Temporal differentiation^a		Geographic differentiation^b
			Nonsyn	Syn	F_st	P-value	F_st	P-value
jeb^e	−31,911	−69,718	0.0021	0.0145	0.089	0.037	0.084	0.085
CG8378	−7,273	−45,080	0.0000	0.0013	0.239	0.001	0.372	0.011
CG13178	−5,378	−43,185	0.0007	0.0044	0.271	0.002	0.689	<0.001
CG8878	0	−37,807	0.0001	0.0013	0.195	0.001	0.365	<0.001
CG8407	4,426	−33,381	0.0000	0.0143	0.230	0.005	0.057	0.226
Oda	21,177	−16,630	0.0012	0.0007	0.369	0.001	0.669	<0.001
wash	32,266	−5,541	0.0010	0.0029	0.393	<0.001	0.583	<0.001
CG33964	34,832	−2,975	0.0009	0.0010	0.377	<0.001	0.654	<0.001
Cyp6t3	42,227	4,420	0.0021	0.0059	0.549	<0.001	0.329	0.007
RpS11	50,683	12,876	0.0000	0.0027	NA	NA	0.648	<0.001
Sr-CII	60,799	22,992	0.0025	0.0151	0.238	0.001	0.368	<0.001

NA, not available due to no SNP differences between pre–P- and post–P-element North American populations. The differentiations were estimated using nucleotide sequences.

a

Comparisons between pre– and post–P-element invasion populations with samples from North American populations only.

b

Comparisons between North American and African post–P-element invasion populations.

c

Distance in base pairs between the midpoints of the focused gene and Hen1. CG8878 has zero distance because it is nested within Hen1.

d

Distance in base pairs between the midpoints of the focused gene and Cyp6g1.

e

Only the coding region of the first exon of jeb was sequenced.

Discussion

TEs are selfish genetic elements in the genome. Their interactions with their hosts are often analogized to the molecular arms race between hosts and other nongenomic pathogens, such as bacteria, viruses, fungi, and protozoa. This analogy deserves further mechanistic specification and analysis because of the fundamental differences in transmission mode between TEs and horizontally transmitted pathogens.

To address this analogy, we first systematically compared the long-term evolution of host TE-interacting genes to that of immunity genes. We found the proportion of TE-interacting genes with evidence of positive selection is at least as large as, if not greater than, that of genes in pathways conferring immunity to pathogens. aub, which showed strong evidence of adaptive protein evolution, is a key component in piRNA-mediated TE silencing (Brennecke et al. 2007; Gunawardane et al. 2007) and its PAZ domain, known to mediate the binding of single-strand RNAs associated with Argonaute/Piwi proteins (Lingel et al. 2003; Song et al. 2003; Yan et al. 2003), also showed an excess of amino acid fixations. Both RNA helicases surveyed, armi and spn-E, showed evidence of adaptive evolution, although the detailed mechanism of their involvement in piRNA biogenesis and TE suppression is still not clear. Most interestingly, all three nuage component genes (mael, krimp, and vas) showed strong evidence of positive selection. The nuage, where many proteins encoded by piRNA genes localize (reviewed in Klattenhoff and Theurkauf 2008), is considered the major battleground for the host–TE arms race (Blumenstiel 2011). In addition, vas is involved in the formation of pole plasm (the future germline) and is essential for the proper localization and translational control of maternally deposited mRNAs at the rear of developing embryos (Lasko and Ashburner 1990; Styhler et al. 1998; Johnstone and Lasko 2004). The RNA genome of TEs that can be preferentially incorporated into the pole plasm and have its RNA tertiary structure be detangled and translated will have large fitness benefits from increased transmission. The DEAD box of VAS, the key domain mediating the unwinding of RNAs with self-annealed structure (reviewed in Linder 2006; Arkov and Ramos 2010), had a strong enrichment of amino acid fixations under the McDonald–Kreitman framework.

Our analytical model found that host alleles suppressing P-element transposition enjoy strong selective benefits only during a limited time frame due to the fast spread of P elements through strict vertical parent–offspring inheritance. This led to negligible host allele frequency differences between pre– and post–P-element populations with all biologically reasonable parameters tested. This theoretical prediction may be extended to other known cases of horizontally transferred TEs in Drosophila, such as I-element, hobo, and mariner (reviewed in Silva et al. 2004). For DNA-based TE families that also transpose with the cut-and-paste mechanism (hobo and mariner), our model should be readily applicable. RNA-based TE families, such as I-element, transpose through a “copy-and-paste mechanism” (replicating through insertions of reversed transcribed cDNA into another position in the genome). The hybrid dysgenic syndrome observed for I-element is caused by the catastrophic meiosis of eggs, which includes failure to produce functional female pronucleus and developmental abnormalities after fertilization (Orsi et al. 2010). If such a meiotic defect is initiated with a higher than tolerable threshold of I-elements transposition activity and is also responsible for the hybrid dysgenic syndromes of other unobserved horizontal transfer of RNA-based TEs, our model could be a reasonable generalization for all TE families.

Consistent with our quantitative analysis, we found no strong evidence supporting recent selection imposed by P elements on host candidate genes. The significant temporal differentiation and haplotypic structures of Hen1, our only strong candidate, are likely the result of the strong genetic hitchhiking effects from the nearby insecticide-resistant gene, Cyp6g1. Certainly it is possible that other types of alleles or loci may have responded to the P-element invasion. We did not find a strong reduction in polymorphism of the post–P-element population 10 kb upstream and downstream of each candidate gene, providing no support for the alternative that selection has acted on polymorphisms of local cis-acting elements (data not shown). Yet, the possibility of strong selection on cis-regulatory elements outside the surveyed region or of trans-acting regulatory variation cannot be ruled out. The possibility that there was instead strong selection for a P-element variant that can reduce the deleterious impact on its host is not plausible because there is only a 1-base difference observed between D. willistoni and D. melanogaster canonical P elements (Daniels et al. 1990). Still another alternative is that prevalent D. melanogaster variants of host genes segregating in the pre–P-element population were already similar to those of D. willistoni, which coevolved with P elements and can thus effectively reduce P-element transposition and minimize their deleterious effects. We found that the amino acid sequence divergence between D. melanogaster and D. willistoni orthologs (Clark et al. 2007) for most candidate genes is greater than the genome-wide median (Figure 5). The only two exceptions [Hrb27C (1.7%) and Psi (16.4%)] both have other essential host functions (Hammond et al. 1997; Labourier et al. 2002; Goodrich et al. 2004; Huynh et al. 2004; Yano et al. 2004; Blanchette et al. 2005) and their evolution is expected to be strongly constrained. These observations therefore do not support the scenario of D. melanogaster preadaptation to P elements.

Figure 5

Divergence between D. melanogaster (D. mel) and D. willistoni (D. wil) of candidate genes among other genes. Amino acid sequence divergences were estimated for genes with an annotated D. willistoni ortholog (11 candidate genes and 10,029 other genes) and the genome-wide distribution of the divergence is shown. The divergences of the 11 candidate genes on the x-axis are shown at the bottom.

The selection coefficient for a host allele that can reduce TE transposition rate in an outbred population with limited linkage disequilibrium is

s \approx δ u (\frac{\bar{n} u}{2 \tilde{H}})

(Charlesworth and Langley 1986), which depends on the change in transposition rate

δ u

and the expected number of new TE copies (average copy number

\bar{n}

times transposition rate u).

\tilde{H}

is the approximated harmonic mean of recombination frequency between pairs of TE insertions, which approaches

\frac{1}{2}

in species with free recombination. This theoretical prediction suggests that the selective benefit of a host allele increases with the number of TE copies whose transposition it can suppress. Given the TE transposition rate [10⁻⁵ ∼ 10⁻⁴ (Nuzhdin and Mackay 1995; Nuzhdin et al. 1997; Maside et al. 2000, 2001; reviewed in Charlesworth and Langley 1989; Le Rouzic and Deceliere 2005)] and the number of active copies of individual TE families [generally <100 (Kaminker et al. 2002; Quesneville et al. 2005; Bergman et al. 2006)] in Drosophila, the selective benefits for a host allele targeting a specific TE family are small. However, Drosophila genomes are occupied by >100 TE families (Kaminker et al. 2002; Quesneville et al. 2005; Bergman et al. 2006; Clark et al. 2007), and TE families can be further classified into clades or superfamilies according to encoded protein products and similarities in sequences [such as Ty1-copia-like or Ty3-gypsy like (reviewed in Wicker et al. 2007)]. In this case, a host variant targeting attributes shared between multiple TE families can enjoy larger selective benefits than a variant targeting a single TE family and is likely to spread in the host population. For example, a host allele that can reduce half of the transposition rate of a subset of TE families with a total of 1000 copies can have a selection coefficient as large as 10⁻⁵, which would be strong enough to overcome the effect of genetic drift in D. melanogaster [N_e ∼ 10⁶ (Langley et al. 1982; Kreitman 1983)]. Accordingly, unlike the arms race between host and horizontally transferred pathogens, where strong selective benefit comes from the precise targeting of the host allele to a specific pathogen variant, the antagonistic interaction between TEs and host variants that can target the aggregated influence of multiple vertically inherited TE families could be the main force driving the fast evolution of host TE-interacting genes

Parameters and varables used in the model

Table A1

Parameters and varables used in the model

Parameters
a, b	Selection coefficient against P elements through synergistic epistasis
d	Proportional reduction of P-element transposition rate in homozygotes of host beneficial allele
h	Dominance coefficient for host beneficial allele
u₀	P-element transposition rate per copy per generation in dysgenic cross
u₁	P-element transposition rate per copy per generation in nondysgenic cross
n_HD	Maximum number of new P-element insertions an individual can tolerate before becoming completely sterile

Variables
n	P-element copy number in a given individual
m	Mean copy number of P element of parents
u	P-element transposition rate per copy per generation
δ	Proportional reduction of P-element transposition rate due to the host genotype
r_t	Proportion of individuals with P cytotype in the population at generation t
l_t	Allele frequency of the host beneficial allele in the population at generation t
μ_t	Average P-element copy number among individuals with P cytotype at generation t
${\bar{W}}_{t}$	Mean fitness of the population at generation t
s	Selection coefficient against host nonbeneficial allele

Parameters
a, b	Selection coefficient against P elements through synergistic epistasis
d	Proportional reduction of P-element transposition rate in homozygotes of host beneficial allele
h	Dominance coefficient for host beneficial allele
u₀	P-element transposition rate per copy per generation in dysgenic cross
u₁	P-element transposition rate per copy per generation in nondysgenic cross
n_HD	Maximum number of new P-element insertions an individual can tolerate before becoming completely sterile

Variables
n	P-element copy number in a given individual
m	Mean copy number of P element of parents
u	P-element transposition rate per copy per generation
δ	Proportional reduction of P-element transposition rate due to the host genotype
r_t	Proportion of individuals with P cytotype in the population at generation t
l_t	Allele frequency of the host beneficial allele in the population at generation t
μ_t	Average P-element copy number among individuals with P cytotype at generation t
${\bar{W}}_{t}$	Mean fitness of the population at generation t
s	Selection coefficient against host nonbeneficial allele

Table A1

Parameters and varables used in the model

Parameters
a, b	Selection coefficient against P elements through synergistic epistasis
d	Proportional reduction of P-element transposition rate in homozygotes of host beneficial allele
h	Dominance coefficient for host beneficial allele
u₀	P-element transposition rate per copy per generation in dysgenic cross
u₁	P-element transposition rate per copy per generation in nondysgenic cross
n_HD	Maximum number of new P-element insertions an individual can tolerate before becoming completely sterile

Variables
n	P-element copy number in a given individual
m	Mean copy number of P element of parents
u	P-element transposition rate per copy per generation
δ	Proportional reduction of P-element transposition rate due to the host genotype
r_t	Proportion of individuals with P cytotype in the population at generation t
l_t	Allele frequency of the host beneficial allele in the population at generation t
μ_t	Average P-element copy number among individuals with P cytotype at generation t
${\bar{W}}_{t}$	Mean fitness of the population at generation t
s	Selection coefficient against host nonbeneficial allele

Parameters
a, b	Selection coefficient against P elements through synergistic epistasis
d	Proportional reduction of P-element transposition rate in homozygotes of host beneficial allele
h	Dominance coefficient for host beneficial allele
u₀	P-element transposition rate per copy per generation in dysgenic cross
u₁	P-element transposition rate per copy per generation in nondysgenic cross
n_HD	Maximum number of new P-element insertions an individual can tolerate before becoming completely sterile

Variables
n	P-element copy number in a given individual
m	Mean copy number of P element of parents
u	P-element transposition rate per copy per generation
δ	Proportional reduction of P-element transposition rate due to the host genotype
r_t	Proportion of individuals with P cytotype in the population at generation t
l_t	Allele frequency of the host beneficial allele in the population at generation t
μ_t	Average P-element copy number among individuals with P cytotype at generation t
${\bar{W}}_{t}$	Mean fitness of the population at generation t
s	Selection coefficient against host nonbeneficial allele

The effects of change in μ₀ and u₁ on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

Table A2

The effects of change in μ₀ and u₁ on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

	μ₀				u₁
μ₀ or u₁	5	10	20	30	10⁻³	10⁻⁴	10⁻⁵
r_0.99 (gen)	87	87	88	88	78	87	88
l₁₀₀₀ (10⁻³)	1.312	1.331	1.331	1.313	1.189	1.331	1.349
μ₁₀₀₀	6.650	6.451	6.451	6.650	9.698	6.451	6.297
max(s) (10⁻³)^a	5.776	5.788	5.777	5.776	5.804	5.788	5.786

	μ₀				u₁
μ₀ or u₁	5	10	20	30	10⁻³	10⁻⁴	10⁻⁵
r_0.99 (gen)	87	87	88	88	78	87	88
l₁₀₀₀ (10⁻³)	1.312	1.331	1.331	1.313	1.189	1.331	1.349
μ₁₀₀₀	6.650	6.451	6.451	6.650	9.698	6.451	6.297
max(s) (10⁻³)^a	5.776	5.788	5.777	5.776	5.804	5.788	5.786

Other not varied parameters are set as a = 10⁻⁵, b = 10⁻⁶, d = 0.5, h = 0.5, u₀ = 1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

Table A2

The effects of change in μ₀ and u₁ on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

	μ₀				u₁
μ₀ or u₁	5	10	20	30	10⁻³	10⁻⁴	10⁻⁵
r_0.99 (gen)	87	87	88	88	78	87	88
l₁₀₀₀ (10⁻³)	1.312	1.331	1.331	1.313	1.189	1.331	1.349
μ₁₀₀₀	6.650	6.451	6.451	6.650	9.698	6.451	6.297
max(s) (10⁻³)^a	5.776	5.788	5.777	5.776	5.804	5.788	5.786

	μ₀				u₁
μ₀ or u₁	5	10	20	30	10⁻³	10⁻⁴	10⁻⁵
r_0.99 (gen)	87	87	88	88	78	87	88
l₁₀₀₀ (10⁻³)	1.312	1.331	1.331	1.313	1.189	1.331	1.349
μ₁₀₀₀	6.650	6.451	6.451	6.650	9.698	6.451	6.297
max(s) (10⁻³)^a	5.776	5.788	5.777	5.776	5.804	5.788	5.786

Other not varied parameters are set as a = 10⁻⁵, b = 10⁻⁶, d = 0.5, h = 0.5, u₀ = 1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

The effects of change in d on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

Table A3

The effects of change in d on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

	h = 0.5					hd = 0.5
d	0.2	0.3	0.5	0.7	0.9	0.3	0.5	0.7	0.9
r_0.99 (gen)	87	87	87	87	87	87	87	87	87
l₁₀₀₀(10⁻³)	1.066	1.201	1.331	1.448	1.543	1.331	1.331	1.331	1.331
μ₁₀₀₀	6.454	6.451	6.451	6.451	6.451	6.451	6.451	6.451	6.451
max(s)(10⁻³)^a	1.850	4.476	5.788	6.189	6.229	4.475	5.788	6.190	6.232

	h = 0.5					hd = 0.5
d	0.2	0.3	0.5	0.7	0.9	0.3	0.5	0.7	0.9
r_0.99 (gen)	87	87	87	87	87	87	87	87	87
l₁₀₀₀(10⁻³)	1.066	1.201	1.331	1.448	1.543	1.331	1.331	1.331	1.331
μ₁₀₀₀	6.454	6.451	6.451	6.451	6.451	6.451	6.451	6.451	6.451
max(s)(10⁻³)^a	1.850	4.476	5.788	6.189	6.229	4.475	5.788	6.190	6.232

Other not varied parameters are set as a = 10⁻⁵, b = 10⁻⁶, u₀ = 1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

Table A3

The effects of change in d on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

	h = 0.5					hd = 0.5
d	0.2	0.3	0.5	0.7	0.9	0.3	0.5	0.7	0.9
r_0.99 (gen)	87	87	87	87	87	87	87	87	87
l₁₀₀₀(10⁻³)	1.066	1.201	1.331	1.448	1.543	1.331	1.331	1.331	1.331
μ₁₀₀₀	6.454	6.451	6.451	6.451	6.451	6.451	6.451	6.451	6.451
max(s)(10⁻³)^a	1.850	4.476	5.788	6.189	6.229	4.475	5.788	6.190	6.232

	h = 0.5					hd = 0.5
d	0.2	0.3	0.5	0.7	0.9	0.3	0.5	0.7	0.9
r_0.99 (gen)	87	87	87	87	87	87	87	87	87
l₁₀₀₀(10⁻³)	1.066	1.201	1.331	1.448	1.543	1.331	1.331	1.331	1.331
μ₁₀₀₀	6.454	6.451	6.451	6.451	6.451	6.451	6.451	6.451	6.451
max(s)(10⁻³)^a	1.850	4.476	5.788	6.189	6.229	4.475	5.788	6.190	6.232

Other not varied parameters are set as a = 10⁻⁵, b = 10⁻⁶, u₀ = 1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

Figure A1

The impact of d on the allele frequency of the host beneficial allele at generation 1000 ( l1000). The dashed line is the beneficial allele frequency of host beneficial allele at generation zero (l0).

The impact of d on the allele frequency of the host beneficial allele at generation 1000 ( l₁₀₀₀). The dashed line is the beneficial allele frequency of host beneficial allele at generation zero (l₀).

The effects of change in a and b on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

Table A4

The effects of change in a and b on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

	a			b
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99 (gen)	87	92	100	87	93	102
l₁₀₀₀(10⁻³)	1.331	1.413	1.521	1.331	1.450	1.564
μ₁₀₀₀	6.451	5.867	5.423	6.451	5.677	5.2829
max(s)(10⁻³)^a	5.788	5.784	5.779	5.788	5.786	5.783

	a			b
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99 (gen)	87	92	100	87	93	102
l₁₀₀₀(10⁻³)	1.331	1.413	1.521	1.331	1.450	1.564
μ₁₀₀₀	6.451	5.867	5.423	6.451	5.677	5.2829
max(s)(10⁻³)^a	5.788	5.784	5.779	5.788	5.786	5.783

Other not varied parameters are set as u₀ = 1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

Table A4

The effects of change in a and b on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

	a			b
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99 (gen)	87	92	100	87	93	102
l₁₀₀₀(10⁻³)	1.331	1.413	1.521	1.331	1.450	1.564
μ₁₀₀₀	6.451	5.867	5.423	6.451	5.677	5.2829
max(s)(10⁻³)^a	5.788	5.784	5.779	5.788	5.786	5.783

	a			b
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99 (gen)	87	92	100	87	93	102
l₁₀₀₀(10⁻³)	1.331	1.413	1.521	1.331	1.450	1.564
μ₁₀₀₀	6.451	5.867	5.423	6.451	5.677	5.2829
max(s)(10⁻³)^a	5.788	5.784	5.779	5.788	5.786	5.783

Other not varied parameters are set as u₀ = 1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

The effects of change in u₀ and n_HD on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

Table A5

The effects of change in u₀ and n_HD on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

u₀ = 1
n_HD	2	3	5	7	9
r_0.99 (gen)	>10,000	3,787	87	64	57
l₁₀₀₀(10⁻³)	592.2	4.701	1.331	1.093	1.016
μ₁₀₀₀	3.441	4.637	6.451	6.994	7.261
max(s)^a	2.590 × 10⁻²	1.688 × 10⁻²	5.788 × 10⁻³	1.503 × 10⁻³	1.380 × 10⁻⁴

u₀ = 0.1
n_HD	2	3	5	7	9
r_0.99 (gen)	687	628	621	621	621
l₁₀₀₀(10⁻³)	1.032	1.003	1.000	1.000	1.000
μ₁₀₀₀	5.138	5.240	5.255	5.255	5.255
max(s)^a	1.366 × 10⁻⁴	1.093 × 10⁻⁵	7.066 × 10⁻⁸	1.092 × 10⁻⁹	1.337 × 10⁻¹²

u₀ = 1
n_HD	2	3	5	7	9
r_0.99 (gen)	>10,000	3,787	87	64	57
l₁₀₀₀(10⁻³)	592.2	4.701	1.331	1.093	1.016
μ₁₀₀₀	3.441	4.637	6.451	6.994	7.261
max(s)^a	2.590 × 10⁻²	1.688 × 10⁻²	5.788 × 10⁻³	1.503 × 10⁻³	1.380 × 10⁻⁴

u₀ = 0.1
n_HD	2	3	5	7	9
r_0.99 (gen)	687	628	621	621	621
l₁₀₀₀(10⁻³)	1.032	1.003	1.000	1.000	1.000
μ₁₀₀₀	5.138	5.240	5.255	5.255	5.255
max(s)^a	1.366 × 10⁻⁴	1.093 × 10⁻⁵	7.066 × 10⁻⁸	1.092 × 10⁻⁹	1.337 × 10⁻¹²

Other not varied parameters are set as a = 10⁻⁵, b = 10⁻⁶, d = 0.5, h = 0.5, u₁ = 10⁻⁴, and μ₀ = 10.

a

The largest s within 1000 generations.

Table A5

The effects of change in u₀ and n_HD on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

u₀ = 1
n_HD	2	3	5	7	9
r_0.99 (gen)	>10,000	3,787	87	64	57
l₁₀₀₀(10⁻³)	592.2	4.701	1.331	1.093	1.016
μ₁₀₀₀	3.441	4.637	6.451	6.994	7.261
max(s)^a	2.590 × 10⁻²	1.688 × 10⁻²	5.788 × 10⁻³	1.503 × 10⁻³	1.380 × 10⁻⁴

u₀ = 0.1
n_HD	2	3	5	7	9
r_0.99 (gen)	687	628	621	621	621
l₁₀₀₀(10⁻³)	1.032	1.003	1.000	1.000	1.000
μ₁₀₀₀	5.138	5.240	5.255	5.255	5.255
max(s)^a	1.366 × 10⁻⁴	1.093 × 10⁻⁵	7.066 × 10⁻⁸	1.092 × 10⁻⁹	1.337 × 10⁻¹²

u₀ = 1
n_HD	2	3	5	7	9
r_0.99 (gen)	>10,000	3,787	87	64	57
l₁₀₀₀(10⁻³)	592.2	4.701	1.331	1.093	1.016
μ₁₀₀₀	3.441	4.637	6.451	6.994	7.261
max(s)^a	2.590 × 10⁻²	1.688 × 10⁻²	5.788 × 10⁻³	1.503 × 10⁻³	1.380 × 10⁻⁴

u₀ = 0.1
n_HD	2	3	5	7	9
r_0.99 (gen)	687	628	621	621	621
l₁₀₀₀(10⁻³)	1.032	1.003	1.000	1.000	1.000
μ₁₀₀₀	5.138	5.240	5.255	5.255	5.255
max(s)^a	1.366 × 10⁻⁴	1.093 × 10⁻⁵	7.066 × 10⁻⁸	1.092 × 10⁻⁹	1.337 × 10⁻¹²

Other not varied parameters are set as a = 10⁻⁵, b = 10⁻⁶, d = 0.5, h = 0.5, u₁ = 10⁻⁴, and μ₀ = 10.

a

The largest s within 1000 generations.

The effects of change in a and b on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

Table A6

The effects of change in a and b on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

a
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴
R_0.99	621	>1000	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.259	3.139
max(s)	7.066 × 10⁻⁸	7.060 × 10⁻⁸	7.018 × 10⁻⁸

b
a or b	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99	621	741	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.973	4.237
max(s)^a	7.066 × 10⁻⁸	7.063 × 10⁻⁸	7.053 × 10⁻⁸

a
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴
R_0.99	621	>1000	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.259	3.139
max(s)	7.066 × 10⁻⁸	7.060 × 10⁻⁸	7.018 × 10⁻⁸

b
a or b	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99	621	741	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.973	4.237
max(s)^a	7.066 × 10⁻⁸	7.063 × 10⁻⁸	7.053 × 10⁻⁸

Other not varied parameters are set as u₀ = 0.1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

Table A6

The effects of change in a and b on r_0.99, l₁₀₀₀, μ₁₀₀₀, and max(s)

a
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴
R_0.99	621	>1000	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.259	3.139
max(s)	7.066 × 10⁻⁸	7.060 × 10⁻⁸	7.018 × 10⁻⁸

b
a or b	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99	621	741	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.973	4.237
max(s)^a	7.066 × 10⁻⁸	7.063 × 10⁻⁸	7.053 × 10⁻⁸

a
a or b	10⁻⁵	5 × 10⁻⁵	10⁻⁴
R_0.99	621	>1000	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.259	3.139
max(s)	7.066 × 10⁻⁸	7.060 × 10⁻⁸	7.018 × 10⁻⁸

b
a or b	10⁻⁶	5 × 10⁻⁶	10⁻⁵
r_0.99	621	741	>1000
l₁₀₀₀(10⁻³)	1.000	1.000	1.000
μ₁₀₀₀	5.255	4.973	4.237
max(s)^a	7.066 × 10⁻⁸	7.063 × 10⁻⁸	7.053 × 10⁻⁸

Other not varied parameters are set as u₀ = 0.1, u₁ = 10⁻⁴, μ₀ = 10, and n_HD = 5.

a

The largest s within 1000 generations.

Figure A2

The dynamics of r (A), l (B), s (C), μ (D), and W– (E) over generations when u0 = 1 and nHD = 5. (F) describes how s changes as r varies.

The dynamics of r (A), l (B), s (C), μ (D), and $\bar{W}$ (E) over generations when u₀ = 1 and n_HD = 5. (F) describes how s changes as r varies.

Figure A3

The dynamics of r (A), l (B), s (C), μ (D), and W– (E) over generations when u0 = 1 and nHD = 2. (F) describes how s changes as r varies.

The dynamics of r (A), l (B), s (C), μ (D), and $\bar{W}$ (E) over generations when u₀ = 1 and n_HD = 2. (F) describes how s changes as r varies.

Figure A4

The dynamics of r (A), l (B), s (C), μ (D), and W– (E) over generations when u0 = 0.1 and nHD = 5. (F) describes how s changes as r varies.

The dynamics of r (A), l (B), s (C), μ (D), and $\bar{W}$ (E) over generations when u₀ = 0.1 and n_HD = 5. (F) describes how s changes as r varies.

Acknowledgments

We thank the Bloomington Drosophila Stock Center and M. Itoh for providing M strains. We appreciate D. J. Begun and M. T. Levine for extensive discussion of the project and D. J. Begun, M. T. Levine, P. L. Ralph, M. Turelli, James A. Chandler, and E. I. Dietrich for critical reading of the manuscript. We also thank D. J. Obbard for helpful discussion and kindly sharing unpublished results regarding MK tests for piRNA genes. We thank C. M. Cardeno, P. Saelao, and T. Trejo-Cantwell for experimental assistance. Finally, we acknowledge the thoughtful, constructive, and patient criticism of two anonymous reviewers. This work was supported by National Science Foundation Doctoral Dissertation Improvement Grant, Daphne and Ted Pengelley Award in Evolutionary Biology, and Center for Population Biology of University of California, Davis, Research Award to Y.C.G.L.

Appendix

In this section, we describe the derivations of r, l, and μ. Parameters and variables used in the model are listed in Table A1.

The expected fitness of offspring of a pair of parents is

\bar{w} (m, u) = \sum_{n = 0}^{\infty} \frac{e^{- m} m^{n}}{n!} \sum_{i = 0}^{n_{HD}} \frac{e^{- n u} {(n u)}^{i}}{i!} e^{- a (n + i) - (b / 2) {(n + i)}^{2}} .

(A1)

m is the mean copy number of P elements of the parents. u, the transposition rate of the P element in the offspring, depends on the type of cross [u = u₀ in M(female) × P(male) dysgenic cross and u = u₁ in other crosses]. u also depends on the host allele passed to the offspring. With the assumption that the host locus is in complete linkage equilibrium with P-element insertions, the expected mean fitness of offspring for a specific type of cytotype cross (ccross) when considering the effect of the host beneficial allele is

\begin{array}{l} {\bar{w}}_{ccross} (m, u, l) = l^{2} \bar{w} (m, u (1 - d)) + 2 l (1 - l) \bar{w} (m, u (1 - h d)) \\ + {(1 - l)}^{2} \bar{w} (m, u) . \end{array}

(A2)

The expected mean fitness of offspring having a specific type of genotype (geno) when considering the effect of P-element cytotype is

\begin{array}{l} {\bar{w}}_{geno} (m, δ, r) = r^{2} \bar{w} (m, (1 - δ) u_{1}) + r (1 - r) \bar{w} (m / 2, (1 - δ) u_{1}) \\ + r (1 - r) \bar{w} (m / 2, (1 - δ) u_{0}) + {(1 - r)}^{2}, \end{array}

(A3)

where δ is the proportional reduction in P-element transposition rate due to the genotype of the host locus, which equals d and hd in the homozygote and the heterozygote of the host beneficial allele.

r of the next generation, the relative mean fitness of offspring with P cytotype to that of the mean population fitness, can be expressed as

\begin{array}{l} r_{t + 1} = \frac{1}{{\bar{W}}_{t + 1}} {r_{t}^{2} {\bar{w}}_{ccross} (μ_{t}, u_{1}, l_{t}) + r_{t} (1 - r_{t}) {\bar{w}}_{ccross} (\frac{μ_{t}}{2}, u_{1}, l_{t}) \\ + r_{t} (1 - r_{t}) {\bar{w}}_{ccross} (\frac{μ_{t}}{2}, u_{0}, l_{t}) - [r_{t}^{2} e^{- μ_{t}} + 2 r_{t} (1 - r_{t}) e^{- μ_{t} / 2}]}, \end{array}

(A4)

where the mean fitness of the population

\bar{W}

is

\begin{array}{l} {\bar{W}}_{t + 1} = r_{t}^{2} {\bar{w}}_{ccross} (μ_{t}, u_{1}, l_{t}) + r_{t} (1 - r_{t}) {\bar{w}}_{ccross} (\frac{μ_{t}}{2}, u_{1}, l_{t}) \\ + r_{t} (1 - r_{t}) {\bar{w}}_{ccross} (\frac{μ_{t}}{2}, u_{0}, l_{t}) + {(1 - r_{t})}^{2} . \end{array}

(A5)

The first three terms on the right side of (A4) are for P × P, P × M, and M × P (dysgenic) crosses. The average parental P-element copy number of P × M and M × P crosses is half of the average of that of all P-cytotype parents (μ/2). The last substraction of (A4) is cases when the offspring inherited zero P elements, which will have M cytotype under the assumption that cytocype is determined by the presence/absence of P elements.

The frequency of the host beneficial allele of the next generation can be expressed as

\begin{matrix} l_{t + 1} = \frac{1}{{\bar{W}}_{t + 1}} [l_{t}^{2} {\bar{w}}_{geno} (μ_{t}, (1 - d), r_{t}) + l_{t} (1 - l_{t}) {\bar{w}}_{geno} (μ_{t}, (1 - h d), r_{t})] \end{matrix} .

(A6)

The expected number of newly transposed P elements in offspring of a pair of parents is

\begin{matrix} \bar{Δ n} (m, u) = \sum_{n = 1}^{\infty} \frac{e^{- m} m^{n}}{n!} \end{matrix} \sum_{i = 0}^{n_{HD}} i \frac{e^{- n u} {(n u)}^{i}}{i!} e^{- a (n + i) - (b / 2) {(n + i)}^{2}} .

(A7)

The expected number of newly transposed P elements of a specific cross when considering the effect of the host beneficial allele is then

\begin{array}{l} {\bar{Δ n}}_{ccross} (m, u, l) = l^{2} \bar{Δ n} (m, u (1 - d)) + 2 l (1 - l) \bar{Δ n} (m, u (1 - h d)) \\ + {(1 - l)}^{2} \bar{Δ n} (m, u) . \end{array}

(A8)

The average P-element copy number among P-cytotype individuals in the next generation is thus

\begin{array}{l} μ_{t + 1} = μ_{t} + \frac{r_{t}^{2} {\bar{Δ n}}_{ccross} (μ_{t}, u_{1}, l_{t}) + r_{t} (1 - r_{t}) {\bar{Δ n}}_{ccross} (μ_{t} / 2, u_{1}, l_{t})}{r_{t + 1}} \\ + \frac{r_{t} (1 - r_{t}) {\bar{Δ n}}_{ccross} (μ_{t} / 2, u_{0}, l_{t})}{r_{t + 1}} . \end{array}

(A9)

The selection coefficient against the host nonbeneficial allele is

s_{t + 1} = 1 - \frac{{\bar{W}}_{a a}}{{\bar{W}}_{A A}} = \frac{{\bar{w}}_{geno} (μ, 0, r_{t})}{{\bar{w}}_{geno} (μ, 1 - d, r_{t})} .

(A10)

We iterated over (A4), (A6), and (A9) over 1000–10,000 generations for each set of parameters chosen. With the assumption that D. melanogaster can have ∼10 generations per year, there are ∼1000 generations between when D. melanogaster was first collected as M strains and now. We thus mainly reported r, l, and μ at generation 1000 for most cases. One aspect of the model we are interested in is the time until P-element fixed in the population. As our model is deterministic, we set when r ≥ 0.99 as the approximate time of fixation of the P cytotype in the population (r_0.99). In a finite population, genetic drift can expediate the fixation when r is nearly fixed in the population.

μ₀ (the average P-element copy number among individuals with P cytotype at generation zero) was tested for several values (5, 10, 20, and 30; Table A2), which did not influence the population dynamics or l₁₀₀₀ significantly. This is because only individuals with low copy number will have low probability of hybrid dysgenesis during the initial spread of the P elements. Transposition rates estimated for other TE families range from 10⁻⁵ to 10⁻⁴ (Nuzhdin and Mackay 1995; Nuzhdin et al. 1997; Maside et al. 2000) while old estimates for P-element transposition rate in a dysgenic cross are ∼10⁻³ (Eggleston et al. 1988). Similarly, for μ₁ tested (10⁻⁵, 10⁻⁴, and 10⁻³), no dramatic differences in population dyanmics were observed (Table A2). We tested the effect of changing d, the proportional reduction of P-element transposition rate in the homozygotes of host beneficial allele, with fixed h and fixed hd (Table A3). With fixed h, l increases as d increases (Table A3 and Figure A1), yet the change is slight. a and b were set as 10⁻⁵ and 10⁻⁶, respectively, according to previous studies (Dolgin and Charlesworth 2006, 2008). Increasing these two parameters, and thus increasing the deleterious effects of P-element insertions, does not have significant effects on l₁₀₀₀ either (Table A4).

To sum up, for all a, b, d, h, u₁, and μ₀ tested, there were no significant changes in the dynamics of r, l, μ, and s over generations and the relationship between r and s. In neither case tested is there temporal allele frequency change of the host beneficial allele (l₀ and l₁₀₀₀) that will be detectable with the usual size of samples. We thus fixed a, b, d, h, u₁, and μ₀ to 10⁻⁵, 10⁻⁶, 0.5, 0.5, 10⁻⁴, and 10.

On the other hand, the combined effects of u₀ and n_HD did show strong impacts on the dynamics of the host beneficial allele as discussed in the main text and shown in Table A5. When either u₀ or n_HD and other parameters are changed at the same time, u₀ or n_HD is the determining factor of the population dynamics observed. For example, when the same values of a and b are tested with u₀ = 0.1, the observed population dynamics follow Case 3 below instead of Case 1 (see below and Tables A4 and A6). We discussed the three most characteristic cases in terms of the dynamics of r, l, μ, and s over generations and the relationship between r and s in the following.

Case 1: u₀ = 1, n_HD ≥ 3 (Figure A2, case of n_HD = 5 is shown)

The dynamics of r, l, s, μ, and $\bar{W}$ (population mean fitness) over generations and the relationship between r and s are shown in Figure A2. The spread of the P element is fast (Figure A2A), even though it did lower the population mean fitness during its spread (Figure A2E). $\bar{W}$ started to bounce back when the proportion of P cytotype is intermediate due to the much reduced probability of hybrid dysgenesis. The change of s increased dramatically during the initial spread of P elements and then dropped as P cytotype is in high frequency in the population (Figure A2, C and F). This is also reflected in the dynamics of l, which has two phases of increase, with the former having a faster rate (Figure A2B).

Case 2: u₀ = 1, n_HD = 2 (Figure A3)

This is the exception of Case 1. The spread of P elements in the population is slowed due to the high probability of hybrid dysgenesis even when the majority of the population has P cytotype (Figure A3A). Because of this delay in P-elements spread, there is an extended period when the host repressing allele has large s (Figure A3C). However, several aspects of the population dynamics under this parameters are not consistent with known P-element biology (see main text). In a finite population, genetic drift may shorten the time of fixation for P cytotype, resulting in a shorter period for the repressing allele to have large selective benefits from lingering incidences of hybrid dysgenesis.

Case 3: u₀ = 0.1 and n_HD ≥ 2 (Figure A4, case of n_HD = 5 is shown)

Because of the lower u₀, the spread of the P element is slower than the case when u₀ = 1 (Figure A4A) and the duration for the host beneficial allele to have large selective benefit is therefore longer (Figure A4C). However, the absolute magnitude of the host allele selection benefit is much lower due to the much reduced probability of hybrid dysgenesis. The increase of μ is also delayed until the proportion of P cytotype is intermediate in the population (Figure A4D), which may explain why the largest s happened when r is relatively large (Figure A4E). The population mean fitness does not bounce back, suggesting that the increase in P-element copy number after fixation of P cytotype led to stronger fitness reduction than hybrid dysgenesis during the P-element spread.

Literature Cited

Adams

M D

,

Tarng

R S

,

Rio

D C

,

1997

The alternative splicing factor PSI regulates P-element third intron splicing in vivo

.

Genes Dev.

11

:

129

–

138

.

Aminetzach

Y T

,

Macpherson

J M

,

Petrov

D A

,

2005

Pesticide resistance via transposition-mediated adaptive gene truncation in Drosophila

.

Science

309

:

764

–

767

.

Andolfatto

P

,

2001

Contrasting patterns of X–linked and autosomal nucleotide variation in Drosophila melanogaster and Drosophila simulans

.

Mol. Biol. Evol.

18

:

279

–

290

.

Andolfatto

P

,

Wong

K M

,

Bachtrog

D

,

2011

Effective population size and the efficacy of selection on the X chromosomes of two closely related Drosophila species

.

Genome Biol. Evol.

3

:

114

–

128

.

Anxolabéhère

D

,

Kidwell

M G

,

Periquet

G

,

1988

Molecular characteristics of diverse populations are consistent with the hypothesis of a recent invasion of Drosophila melanogaster by mobile P elements

.

Mol. Biol. Evol.

5

:

252

–

269

.

Aquadro

C F

,

Desse

S F

,

Bland

M M

,

Langley

C H

,

Laurie-Ahlberg

C C

,

1986

Molecular population genetics of the alcohol dehydrogenase gene region of Drosophila melanogaster

.

Genetics

114

:

1165

–

1190

.

Aquadro

C F

,

Lado

K M

,

Noon

W A

,

1988

The rosy region of Drosophila melanogaster and Drosophila simulans. I. Contrasting levels of naturally occurring DNA restriction map variation and divergence

.

Genetics

119

:

875

–

888

.

Aravin

A A

,

Klenov

M S

,

Vagin

V V

,

Bantignies

F

,

Cavalli

G

et al. ,

2004

Dissection of a natural RNA silencing process in the Drosophila melanogaster germ line

.

Mol. Cell. Biol.

24

:

6742

–

6750

.

Aravin

A A

,

Hannon

G J

,

Brennecke

J

,

2007

The Piwi-piRNA pathway provides an adaptive defense in the transposon arms race

.

Science

318

:

761

–

764

.

Arkov

A L

,

Ramos

A

,

2010

Building RNA-protein granules: insight from the germline

.

Trends Cell Biol.

20

:

482

–

490

.

Beall

E L

,

Admon

A

,

Rio

D C

,

1994

A Drosophila protein homologous to the human p70 Ku autoimmune antigen interacts with the P transposable element inverted repeats

.

Proc. Natl. Acad. Sci. USA

91

:

12681

–

12685

.

Begun

D J

,

Holloway

A K

,

Stevens

K

,

Hillier

L W

,

Poh

Y-P

et al. ,

2007

Population genomics: whole-genome analysis of polymorphism and divergence in Drosophila simulans

.

PLoS Biol.

5

:

e310

.

Bergman

C M

,

Quesneville

H

,

Anxolabéhère

D

,

Ashburner

M

,

2006

Recurrent insertion and duplication generate networks of transposable element sequences in the Drosophila melanogaster genome

.

Genome Biol.

7

:

R112

.

Bingham

P M

,

Kidwell

M G

,

Rubin

G M

,

1982

The molecular basis of P-M hybrid dysgenesis: the role of the P element, a P-strain-specific transposon family

.

Cell

29

:

995

–

1004

.

Blanchette

M

,

Green

R E

,

Brenner

S E

,

Rio

D C

,

2005

Global analysis of positive and negative pre-mRNA splicing regulators in Drosophila

.

Genes Dev.

19

:

1306

–

1314

.

Blumenstiel

J P

,

2011

Evolutionary dynamics of transposable elements in a small RNA world

.

Trends Genet.

27

:

23

–

31

.

Brennecke

J

,

Aravin

A A

,

Stark

A

,

Dus

M

,

Kellis

M

et al. ,

2007

Discrete small RNA-generating loci as master regulators of transposon activity in Drosophila

.

Cell

128

:

1089

–

1103

.

Brennecke

J

,

Malone

C D

,

Aravin

A A

,

Sachidanandam

R

,

Stark

A

et al. ,

2008

An epigenetic role for maternally inherited piRNAs in transposon silencing

.

Science

322

:

1387

–

1392

.

Brookfield

J F

,

Montgomery

E

,

Langley

C H

,

1984

Apparent absence of transposable elements related to the P elements of D. melanogaster in other species of Drosophila

.

Nature

310

:

330

–

332

.

Catania

F

,

Kauer

M O

,

Daborn

P J

,

Yen

J L

,

Ffrench-Constant

R H

et al. ,

2004

World-wide survey of an Accord insertion and its association with DDT resistance in Drosophila melanogaster

.

Mol. Ecol.

13

:

2491

–

2504

.

Charlesworth

B

,

Charlesworth

D

,

1983

The population dynamics of transposable elements

.

Genet. Res.

42

:

1

–

27

.

Charlesworth

B

,

Langley

C H

,

1986

The evolution of self-regulated transposition of transposable elements

.

Genetics

112

:

359

–

383

.

Charlesworth

B

,

Langley

C H

,

1989

The population genetics of Drosophila transposable elements

.

Annu. Rev. Genet.

23

:

251

–

287

.

Chenna

R

,

Sugawara

H

,

Koike

T

,

Lopez

R

,

Gibson

T J

et al. ,

2003

Multiple sequence alignment with the Clustal series of programs

.

Nucleic Acids Res.

31

:

3497

–

3500

.

Chung

H

,

Bogwitz

M R

,

McCart

C

,

Andrianopoulos

A

,

Ffrench-Constant

R H

et al. ,

2007

cis-regulatory elements in the Accord retrotransposon result in tissue-specific expression of the Drosophila melanogaster insecticide resistance gene Cyp6g1

.

Genetics

175

:

1071

–

1077

.

Clark

A G

,

Eisen

M B

,

Smith

D R

,

Bergman

C M

,

Oliver

B

et al. ,

2007

Evolution of genes and genomes on the Drosophila phylogeny

.

Nature

450

:

203

–

218

.

Clark

J B

,

Maddison

W P

,

Kidwell

M G

,

1994

Phylogenetic analysis supports horizontal transfer of P transposable elements

.

Mol. Biol. Evol.

11

:

40

–

50

.

Clark

J B

,

Kim

P C

,

Kidwell

M G

,

1998

Molecular evolution of P transposable elements in the genus Drosophila. III. The melanogaster species group

.

Mol. Biol. Evol.

15

:

746

–

755

.

Daborn

P J

,

Yen

J L

,

Bogwitz

M R

,

Le Goff

G

,

Feil

E

et al. ,

2002

A single P450 allele associated with insecticide resistance in Drosophila

.

Science

297

:

2253

–

2256

.

Daniels

S B

,

Peterson

K R

,

Strausbaugh

L D

,

Kidwell

M G

,

Chovnick

A

,

1990

Evidence for horizontal transmission of the P transposable element between Drosophila species

.

Genetics

124

:

339

–

355

.

Dawkins

R

,

Krebs

J R

,

1979

Arms races between and within species

.

Proc. R. Soc. Lond., B, Biol. Sci.

205

:

489

–

511

.

Depaulis

F

,

Veuille

M

,

1998

Neutrality tests based on the distribution of haplotypes under an infinite-site model

.

Mol. Biol. Evol.

15

:

1788

–

1790

.

Dolgin

E S

,

Charlesworth

B

,

2006

The fate of transposable elements in asexual populations

.

Genetics

174

:

817

–

827

.

Dolgin

E S

,

Charlesworth

B

,

2008

The effects of recombination rate on the distribution and abundance of transposable elements

.

Genetics

178

:

2169

–

2177

.

Eggleston

W B

,

Johnson-Schlitz

D M

,

Engels

W R

,

1988

P-M hybrid dysgenesis does not mobilize other transposable element families in D. melanogaster

.

Nature

331

:

368

–

370

.

Engels

W R

,

1979

Germ line aberrations associated with a case of hybrid dysgenesis in Drosophila melanogaster males

.

Genet. Res.

33

:

137

–

146

.

Engels

W R

,

1997

Invasions of P elements

.

Genetics

145

:

11

–

15

.

Engels

W R

,

Johnson-Schlitz

D M

,

Eggleston

W B

,

Sved

J

,

1990

High-frequency P element loss in Drosophila is homolog dependent

.

Cell

62

:

515

–

525

.

Finn

R D

,

Mistry

J

,

Tate

J

,

Coggill

P

,

Heger

A

et al. ,

2009

The Pfam protein families database

.

Nucleic Acids Res.

38

:

D211

–

D222

.

Finnegan

D J

,

1992

Transposable elements

.

Curr. Opin. Genet. Dev.

2

:

861

–

867

.

González

J

,

Lenkov

K

,

Lipatov

M

,

Macpherson

J M

,

Petrov

D A

,

2008

High rate of recent transposable element-induced adaptation in Drosophila melanogaster

.

PLoS Biol.

6

:

e251

.

Good

A G

,

Meister

G A

,

Brock

H W

,

Grigliatti

T A

,

Hickey

D A

,

1989

Rapid spread of transposable P elements in experimental populations of Drosophila melanogaster

.

Genetics

122

:

387

–

396

.

Goodrich

J S

,

Clouse

K N

,

Schüpbach

T

,

2004

Hrb27C, Sqd and Otu cooperatively regulate gurken RNA localization and mediate nurse cell chromosome dispersion in Drosophila oogenesis

.

Development

131

:

1949

–

1958

.

Gossmann

T I

,

Song

B-H

,

Windsor

A J

,

Mitchell-Olds

T

,

Dixon

C J

et al. ,

2010

Genome wide analyses reveal little evidence for adaptive evolution in many plant species

.

Mol. Biol. Evol.

27

:

1822

–

1832

.

Gunawardane

L S

,

Saito

K

,

Nishida

K M

,

Miyoshi

K

,

Kawamura

Y

et al. ,

2007

A slicer-mediated mechanism for repeat-associated siRNA 5′ end formation in Drosophila

.

Science

315

:

1587

–

1590

.

Haldane

J B

,

1949

Disease and evolution

.

Ric. Sci. Suppl. A

19

:

68

–

76

.

Hammond

L E

,

Rudner

D Z

,

Kanaar

R

,

Rio

D C

,

1997

Mutations in the hrp48 gene, which encodes a Drosophila heterogeneous nuclear ribonucleoprotein particle protein, cause lethality and developmental defects and affect P-element third-intron splicing in vivo

.

Mol. Cell. Biol.

17

:

7260

–

7267

.

Heger

A

,

Ponting

C P

,

2007

Evolutionary rate analyses of orthologs and paralogs from 12 Drosophila genomes

.

Genome Res.

17

:

1837

–

1849

.

Hiraizumi

Y

,

1971

Spontaneous recombination in Drosophila melanogaster males

.

Proc. Natl. Acad. Sci. USA

68

:

268

–

270

.

Hoffmann

A A

,

Weeks

A R

,

2007

Climatic selection on genes and traits after a 100 year-old invasion: a critical look at the temperate-tropical clines in Drosophila melanogaster from eastern Australia

.

Genetica

129

:

133

–

147

.

Hudson

R R

,

2002

Generating samples under a Wright-Fisher neutral model of genetic variation

.

Bioinformatics

18

:

337

–

338

.

Hudson

R R

,

Boos

D D

,

Kaplan

N L

,

1992

A statistical test for detecting geographic subdivision

.

Mol. Biol. Evol.

9

:

138

–

151

.

Hudson

R R

,

Bailey

K

,

Skarecky

D

,

Kwiatowski

J

,

Ayala

F J

,

1994

Evidence for positive selection in the superoxide dismutase (Sod) region of Drosophila melanogaster

.

Genetics

136

:

1329

–

1340

.

Hughes

A L

,

Yeager

M

,

1998

Natural selection at major histocompatibility complex loci of vertebrates

.

Annu. Rev. Genet.

32

:

415

–

435

.

Hughes

A L

,

Ota

T

,

Nei

M

,

1990

Positive Darwinian selection promotes charge profile diversity in the antigen-binding cleft of class I major-histocompatibility-complex molecules

.

Mol. Biol. Evol.

7

:

515

–

524

.

Huynh

J-R

,

Munro

T P

,

Smith-Litière

K

,

Lepesant

J-A

,

St Johnston

D

,

2004

The Drosophila hnRNPA/B homolog, Hrp48, is specifically required for a distinct step in osk mRNA localization

.

Dev. Cell

6

:

625

–

635

.

Jiggins

F M

,

Kim

K W

,

2007

A screen for immunity genes evolving under positive selection in Drosophila

.

J. Evol. Biol.

20

:

965

–

970

.

Johnstone

O

,

Lasko

P

,

2004

Interaction with eIF5B is essential for Vasa function during development

.

Development

131

:

4167

–

4178

.

Kaminker

J S

,

Bergman

C M

,

Kronmiller

B

,

Carlson

J

,

Svirskas

R

et al. ,

2002

The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective. Genome Biol. 3: RESEARCH0084

.

Kidwell

M G

,

Lisch

D R

,

2001

Perspective: transposable elements, parasitic DNA, and genome evolution

.

Evolution

55

:

1

–

24

.

Kidwell

M G

,

Kidwell

J F

,

Nei

M

,

1973

A case of high rate of spontaneous mutation affecting viability in Drosophila melanogaster

.

Genetics

75

:

133

–

153

.

Kidwell

M G

,

Kidwell

J F

,

Sved

J A

,

1977

Hybrid dysgenesis in Drosophila melanogaster: a syndrome of aberrant traits including mutation, sterility and male recombination

.

Genetics

86

:

813

–

833

.

Kidwell

M G

,

Frydryk

T

,

Novy

J

,

1983

The hybrid dysgenesis potential of Drosophila melanogaster strains of diverse temporal and geographical natural origins

.

Drosoph. Inf. Serv.

59

:

63

–

69

.

Kidwell

M G

,

Kimura

K

,

Black

D M

,

1988

Evolution of hybrid dysgenesis potential following P element contamination in Drosophila melanogaster

.

Genetics

119

:

815

–

828

.

Kim

Y

,

Nielsen

R

,

2004

Linkage disequilibrium as a signature of selective sweeps

.

Genetics

167

:

1513

–

1524

.

Klattenhoff

C

,

Theurkauf

W

,

2008

Biogenesis and germline functions of piRNAs

.

Development

135

:

3

–

9

.

Klattenhoff

C

,

Xi

H

,

Li

C

,

Lee

S

,

Xu

J

et al. ,

2009

The Drosophila HP1 homolog Rhino is required for transposon silencing and piRNA production by dual-strand clusters

.

Cell

138

:

1137

–

1149

.

Kolaczkowski

B

,

Hupalo

D N

,

Kern

A D

,

2011

Recurrent adaptation in RNA interference genes across the Drosophila phylogeny

.

Mol. Biol. Evol.

28

:

1033

–

1042

.

Kreitman

M

,

1983

Nucleotide polymorphism at the alcohol dehydrogenase locus of Drosophila melanogaster

.

Nature

304

:

412

–

417

.

Labourier

E

,

Blanchette

M

,

Feiger

J W

,

Adams

M D

,

Rio

D C

,

2002

The KH-type RNA-binding protein PSI is required for Drosophila viability, male fertility, and cellular mRNA processing

.

Genes Dev.

16

:

72

–

84

.

Langley

C H

,

Montgomery

E

,

Quattlebaum

W F

,

1982

Restriction map variation in the Adh region of Drosophila

.

Proc. Natl. Acad. Sci. USA

79

:

5631

–

5635

.

Langley

C H

,

Brookfield

J F

,

Kaplan

N

,

1983

Transposable elements in Mendelian populations. I. A theory

.

Genetics

104

:

457

–

471

.

Langley

C H

,

Montgomery

E

,

Hudson

R

,

Kaplan

N

,

Charlesworth

B

,

1988

On the role of unequal exchange in the containment of transposable element copy number

.

Genet. Res.

52

:

223

–

235

.

Langley

C H

,

Lazzaro

B P

,

Phillips

W

,

Heikkinen

E

,

Braverman

J M

,

2000

Linkage disequilibria and the site frequency spectra in the su(s) and su(w(a)) regions of the Drosophila melanogaster X chromosome

.

Genetics

156

:

1837

–

1852

.

Langley

C H

,

Stevens

K

,

Cardeno

C

,

Lee

Y C G

,

Schrider

D R

et al. ,

2012

Genomic variation in natural populations of Drosophila melanogaster

.

Genetics

192

:

533

–

598

.

Lasko

P F

,

Ashburner

M

,

1990

Posterior localization of vasa protein correlates with, but is not sufficient for, pole cell development

.

Genes Dev.

4

:

905

–

921

.

Lazzaro

B P

,

2008

Natural selection on the Drosophila antimicrobial immune system

.

Curr. Opin. Microbiol.

11

:

284

–

289

.

Lee

Y C G

,

Langley

C H

,

2010

Transposable elements in natural populations of Drosophila melanogaster

.

Philos. Trans. R. Soc. Lond. B Biol. Sci.

365

:

1219

–

1228

.

Le Rouzic

A

,

Deceliere

G

,

2005

Models of the population genetics of transposable elements

.

Genet. Res.

85

:

171

–

181

.

Li

C

,

Vagin

V V

,

Lee

S

,

Xu

J

,

Ma

S

et al. ,

2009

Collapse of germline piRNAs in the absence of Argonaute3 reveals somatic piRNAs in flies

.

Cell

137

:

509

–

521

.

Lim

A K

,

Kai

T

,

2007

Unique germ-line organelle, nuage, functions to repress selfish genetic elements in Drosophila melanogaster

.

Proc. Natl. Acad. Sci. USA

104

:

6714

–

6719

.

Linder

P

,

2006

Dead-box proteins: a family affair—active and passive players in RNP-remodeling

.

Nucleic Acids Res.

34

:

4168

–

4180

.

Lingel

A

,

Simon

B

,

Izaurralde

E

,

Sattler

M

,

2003

Structure and nucleic-acid binding of the Drosophila Argonaute 2 PAZ domain

.

Nature

426

:

465

–

469

.

Long

A D

,

Lyman

R F

,

Langley

C H

,

Mackay

T F

,

1998

Two sites in the Delta gene region contribute to naturally occurring variation in bristle number in Drosophila melanogaster

.

Genetics

149

:

999

–

1017

.

Loreto

E L S

,

Carareto

C M A

,

Capy

P

,

2008

Revisiting horizontal transfer of transposable elements in Drosophila

.

Heredity

100

:

545

–

554

.

Lu

J

,

Clark

A G

,

2010

Population dynamics of PIWI-interacting RNAs (piRNAs) and their targets in Drosophila

.

Genome Res.

20

:

212

–

227

.

Mackay

T F

,

1989

Transposable elements and fitness in Drosophila melanogaster

.

Genome

31

:

284

–

295

.

Maside

X

,

Assimacopoulos

S

,

Charlesworth

B

,

2000

Rates of movement of transposable elements on the second chromosome of Drosophila melanogaster

.

Genet. Res.

75

:

275

–

284

.

Maside

X

,

Bartolomé

C

,

Assimacopoulos

S

,

Charlesworth

B

,

2001

Rates of movement and distribution of transposable elements in Drosophila melanogaster: in situ hybridization vs. Southern blotting data

.

Genet. Res.

78

:

121

–

136

.

Maynard Smith

J

,

Haigh

J

,

1974

The hitch-hiking effect of a favourable gene

.

Genet. Res.

23

:

23

–

35

.

McDonald

J H

,

Kreitman

M

,

1991

Adaptive protein evolution at the Adh locus in Drosophila

.

Nature

351

:

652

–

654

.

Miyashita

N

,

Langley

C H

,

1988

Molecular and phenotypic variation of the white locus region in Drosophila melanogaster

.

Genetics

120

:

199

–

212

.

Montgomery

E

,

Charlesworth

B

,

Langley

C H

,

1987

A test for the role of natural selection in the stabilization of transposable element copy number in a population of Drosophila melanogaster

.

Genet. Res.

49

:

31

–

41

.

Montgomery

E A

,

Huang

S M

,

Langley

C H

,

Judd

B H

,

1991

Chromosome rearrangement by ectopic recombination in Drosophila melanogaster: genome structure and evolution

.

Genetics

129

:

1085

–

1098

.

Nei

M

1987

Molecular Evolutionary Genetics. Columbia University Press, New York

.

Nuzhdin

S V

,

Mackay

T F

,

1995

The genomic rate of transposable element movement in Drosophila melanogaster

.

Mol. Biol. Evol.

12

:

180

–

181

.

Nuzhdin

S V

,

Pasyukova

E G

,

Mackay

T F

,

1997

Accumulation of transposable elements in laboratory lines of Drosophila melanogaster

.

Genetica

100

:

167

–

175

.

Obbard

D J

,

Jiggins

F M

,

Halligan

D L

,

Little

T J

,

2006

Natural selection drives extremely rapid evolution in antiviral RNAi genes

.

Curr. Biol.

16

:

580

–

585

.

Obbard

D J

,

Gordon

K H J

,

Buck

A H

,

Jiggins

F M

,

2009a

The evolution of RNAi as a defence against viruses and transposable elements

.

Philos. Trans. R. Soc. Lond. B Biol. Sci.

364

:

99

–

115

.

Obbard

D J

,

Welch

J J

,

Kim

K-W

,

Jiggins

F M

,

2009b

Quantifying adaptive evolution in the Drosophila immune system

.

PLoS Genet.

5

:

e1000698

.

Obbard

D J

,

Jiggins

F M

,

Bradshaw

N J

,

Little

T J

,

2011

Recent and recurrent selective sweeps of the antiviral RNAi gene Argonaute-2 in three species of Drosophila

.

Mol. Biol. Evol.

28

:

1043

–

1056

.

O’Hare

K

,

Rubin

G M

,

1983

Structures of P transposable elements and their sites of insertion and excision in the Drosophila melanogaster genome

.

Cell

34

:

25

–

35

.

Orsi

G A

,

Joyce

E F

,

Couble

P

,

McKim

K S

,

Loppin

B

,

2010

Drosophila I-R hybrid dysgenesis is associated with catastrophic meiosis and abnormal zygote formation

.

J. Cell Sci.

123

:

3515

–

3524

.

Pane

A

,

Wehr

K

,

Schüpbach

T

,

2007

zucchini and squash encode two putative nucleases required for rasiRNA production in the Drosophila germline

.

Dev. Cell

12

:

851

–

862

.

Pasyukova

E G

,

Nuzhdin

S V

,

Morozova

T V

,

Mackay

T F C

,

2004

Accumulation of transposable elements in the genome of Drosophila melanogaster is associated with a decrease in fitness

.

J. Hered.

95

:

284

–

290

.

Quesneville

H

,

Bergman

C M

,

Andrieu

O

,

Autard

D

,

Nouaud

D

et al. ,

2005

Combined evidence annotation of transposable elements in genome sequences

.

PLOS Comput. Biol.

1

:

e22

.

Rio

D C

,

2002

P transposable elements in Drosophila melanogaster, pp. 484–518 in Mobile DNA II, edited by N. L. Craig, R. Craigie, M. Gellert, and A. M. Lambowitz

.

Rio

D C

,

Rubin

G M

,

1988

Identification and purification of a Drosophila protein that binds to the terminal 31-base-pair inverted repeats of the P transposable element

.

Proc. Natl. Acad. Sci. USA

85

:

8929

–

8933

.

Rose

L E

,

Bittner-Eddy

P D

,

Langley

C H

,

Holub

E B

,

Michelmore

R W

et al. ,

2004

The maintenance of extreme amino acid diversity at the disease resistance gene, RPP13, in Arabidopsis thaliana

.

Genetics

166

:

1517

–

1527

.

Rozen

S

,

Skaletsky

H

,

2000

Primer3 on the WWW for general users and for biologist programmers

.

Methods Mol. Biol.

132

:

365

–

386

.

Rubin

G M

,

Kidwell

M G

,

Bingham

P M

,

1982

The molecular basis of P-M hybrid dysgenesis: the nature of induced mutations

.

Cell

29

:

987

–

994

.

Sackton

T B

,

Lazzaro

B P

,

Schlenke

T A

,

Evans

J D

,

Hultmark

D

et al. ,

2007

Dynamic evolution of the innate immune system in Drosophila

.

Nat. Genet.

39

:

1461

–

1468

.

Schaack

S

,

Gilbert

C

,

Feschotte

C

,

2010

Promiscuous DNA: horizontal transfer of transposable elements and why it matters for eukaryotic evolution

.

Trends Ecol. Evol.

25

:

537

–

546

.

Schlenke

T A

,

Begun

D J

,

2003

Natural selection drives Drosophila immune system evolution

.

Genetics

164

:

1471

–

1480

.

Schmidt

J M

,

Good

R T

,

Appleton

B

,

Sherrard

J

,

Raymant

G C

et al. ,

2010

Copy number variation and transposable elements feature in recent, ongoing adaptation at the Cyp6g1 locus

.

PLoS Genet.

6

:

e1000998

.

Schmidt

P S

,

Matzkin

L

,

Ippolito

M

,

Eanes

W F

,

2005

Geographic variation in diapause incidence, life-history traits, and climatic adaptation in Drosophila melanogaster

.

Evolution

59

:

1721

–

1732

.

Siebel

C W

,

Rio

D C

,

1990

Regulated splicing of the Drosophila P transposable element third intron in vitro: somatic repression

.

Science

248

:

1200

–

1208

.

Siebel

C W

,

Fresco

L D

,

Rio

D C

,

1992

The mechanism of somatic inhibition of Drosophila P-element pre-mRNA splicing: multiprotein complexes at an exon pseudo-5′ splice site control U1 snRNP binding

.

Genes Dev.

6

:

1386

–

1401

.

Siebel

C W

,

Kanaar

R

,

Rio

D C

,

1994

Regulation of tissue-specific P-element pre-mRNA splicing requires the RNA-binding protein PSI

.

Genes Dev.

8

:

1713

–

1725

.

Siebel

C W

,

Admon

A

,

Rio

D C

,

1995

Soma-specific expression and cloning of PSI, a negative regulator of P element pre-mRNA splicing

.

Genes Dev.

9

:

269

–

283

.

Silva

J C

,

Loreto

E L

,

Clark

J B

,

2004

Factors that affect the horizontal transfer of transposable elements

.

Curr. Issues Mol. Biol.

6

:

57

–

71

.

Simmons

G M

,

1992

Horizontal transfer of hobo transposable elements within the Drosophila melanogaster species complex: evidence from DNA sequencing

.

Mol. Biol. Evol.

9

:

1050

–

1060

.

Siomi

M C

,

Saito

K

,

Siomi

H

,

2008

How selfish retrotransposons are silenced in Drosophila germline and somatic cells

.

FEBS Lett.

582

:

2473

–

2478

.

Song

J-J

,

Liu

J

,

Tolia

N H

,

Schneiderman

J

,

Smith

S K

et al. ,

2003

The crystal structure of the Argonaute2 PAZ domain reveals an RNA binding motif in RNAi effector complexes

.

Nat. Struct. Biol.

10

:

1026

–

1032

.

Stephan

W

,

Song

Y S

,

Langley

C H

,

2006

The hitchhiking effect on linkage disequilibrium between linked neutral loci

.

Genetics

172

:

2647

–

2663

.

Styhler

S

,

Nakamura

A

,

Swan

A

,

Suter

B

,

Lasko

P

,

1998

vasa is required for GURKEN accumulation in the oocyte, and is involved in oocyte differentiation and germline cyst development

.

Development

125

:

1569

–

1578

.

Takahata

N

,

Satta

Y

,

Klein

J

,

1992

Polymorphism and balancing selection at major histocompatibility complex loci

.

Genetics

130

:

925

–

938

.

Vagin

V V

,

Klenov

M S

,

Kalmykova

A I

,

Stolyarenko

A D

,

Kotelnikov

R N

et al. ,

2004

The RNA interference proteins and vasa locus are involved in the silencing of retrotransposons in the female germline of Drosophila melanogaster

.

RNA Biol.

1

:

54

–

58

.

Van Valen

L M

,

1973

A new evolutionary law

.

Evolutionary Theory

1

:

1

–

30

.

Vermaak

D

,

Henikoff

S

,

Malik

H S

,

2005

Positive selection drives the evolution of rhino, a member of the heterochromatin protein 1 family in Drosophila

.

PLoS Genet.

1

:

96

–

108

.

Weir

B S

,

Cockerham

C C

,

1984

Estimating F-statistics for the analysis of population structure

.

Evolution

38

:

1358

–

1370

.