Introduction

The mouse is a powerful and versatile research tool to gain a greater understanding of the human condition. A key capability is the ease and speed with which mice can be genetically modified, fueling the development of preclinical mouse models for drug development, human disease modeling and synthetic biology1,2,3,4,5. However, it remains technically challenging to integrate DNA constructs into the mouse genome in vivo in a precise and defined manner, which is essential for appropriate transgene control and expression6. Although targeting nucleases, e.g., Zinc Finger Nucleases (ZFNs), Transcription Activator-Like Effector nucleases (TALENs), and CRISPR/Cas9 can mediate integration of exogenous DNA into the mouse genome in the zygote via homology-directed repair (HDR), the success of this approach is fraught with poorly defined variables and an inherent low efficiency of the intended event occurring, especially when the exogenous DNA exceeds a few kilobases6,7,8,9.

Often where larger constructs are required to be added, researchers resort to the deceptively expedient solution of random (integration) transgenesis10. Although this approach is well established and rapid, the resulting transgene insertions are haphazard and inefficient, often leading to unintentional genetic damage with potentially muddling downstream outcomes. For example, unpredicted local effects on host gene expression and silencing can occur11,12. Random transgenesis is also a misnomer. It has been shown that ~ 50% of “random” transgene insertions disrupt coding sequences of endogenous genes, often invoking large deletions and structural variations at the integration regions13,14. Random transgenesis often results in multiple copy integrations, including concatemers leading to aberrant gene expression or silencing12,15,16,17,18. Although the resulting serendipitous heterogeneity of the transgene expression can be of use, a significant amount of time is required to fully characterize the new strains, including multiple back-crosses to allow allelic segregation to fully stabilize the phenotype.

To overcome these limitations we developed a precision transgenesis platform for integration of DNA constructs with minimal extraneous sequences directly into mouse zygotes. To minimize potential damage to the mouse genome the strategy uses the well-characterized safe harbor locus, Gt(ROSA)26Sor (ROSA26) with the pre-positioning of an integrase “landing pad” (attachment sites) for subsequent recombinant transgene integration. ROSA26 encodes a long non-coding RNA (lncRNA) under the control of a constitutive promoter. The region is transcriptionally active, with an open chromatin configuration leading to expression in all cell types examined. Further, its disruption does not lead to a reduction in fertility or other notable phenotypes19,20,21,22,23. To facilitate the integration of large transgenes into ROSA26 we utilized the highly efficient serine site-specific recombinase derived from the bacteriophage Bxb124,25,26.

The Bxb1 mycobacteriophage was first isolated from Mycobacterium smegmatis in 1990, by Dr. William R. Jacobs, Jr., (see https://phagesdb.org/phages/Bxb1/)25. Subsequently, it was determined that Bxb1 integrase (Int) is exquisitely suited to integrate DNA donor constructs into mammalian cells27,28,29,30. Bxb1 Int-mediated recombination uses a minimal attachment site of 48 bp attP (attachment Phage) and a cognate 38 bp site attB (attachment Bacterial) between the donor and recipient DNAs. The attachment sites B and P represent four half-sites. After Bxb1 recombination, a half-site from each remains, yielding two 43 bp sites (attL and attR, Left and Right) flanking the inserted DNA. Bxb1 Int requires no other secondary factors and under these conditions provides irreversible unidirectional integration of donor DNAs31,32,33. Importantly, pseudo or intrinsic Bxb1 attachment sites that could lead to background off-target integrations (OTI) of donor DNA have not been detected in the mouse genome27,30,34,35,36. Lastly, upon the comparison of fifteen integrases using cell lines, Bxb1 Int yielded approximately two-fold greater and accurate recombinants when compared to the most widely studied recombinase, φC3127,35,37.

Here we demonstrate that Bxb1 Int can efficiently and precisely introduce donor DNA constructs into the ROSA26 locus as single-copy transgenes in the desired orientation, directly in zygotes. In the first phase of this work, we used a single attachment site (attP-GT) in the mouse lines C57BL/6J (B6), FVB/NJ (FVB), and NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ (NSG), successfully generating RMKI alleles using DNA minicircle donors to avoid integration of the prokaryotic backbone. In a second phase, we added a second heterologous attachment site (attP-GA) to the ROSA26 landing pad and show that this improved transgene insertion efficiency by more than threefold. Also, using dual sites enables larger donor DNAs to be inserted via recombinase mediated cassette exchange (RMCE), excluding the vector backbone and eliminating any need for a minicircle conversion step (outline of approach in Fig. 1).

Figure 1
figure 1

Overview of the Bxb1 Int-mediated transgenesis strategy. (A) Schematic showing RMKI allele generated by in vivo recombination between vector free minicircle donor DNA carrying a single attB site and host mouse containing a single cognate attP site in the ROSA26 locus (RosaBxb-GT). (B) Recombinase-Mediated Cassette Exchange (RMCE) allele generated by in vivo recombination between plasmid DNA, carrying two flanking heterologous attB sites and host mouse carrying dual cognate attP sites (RosaBxb-GT/GA) in the ROSA26 locus. (C) Nanopore Cas9-targeted sequencing (nCATS) enables complete verification at the sequence level of the entire transgene including the adjacent genomic DNA (R26).

Finally, combining this strategy with nanopore Cas9-targeted sequencing (nCATS) enables us to completely sequence-verify the resulting transgenic allele rapidly in its genomic context. This complete strategy lends speed and confidence to the creation and characterization of genetically modified animal production with a level of precision and speed not previously obtainable.

Results

The dual attP site landing pad alleles were constructed over two phases enabling testing of multiple CRISPR/Cas9 based strategies for optimal knock-in allele generation. After establishing the single site attP allele on various backgrounds, the B6 and NSG versions were re-targeted adding a second attP site in cis. Both single and dual-site mouse strains were used to generate precision transgenics via Bxb1-mediated recombination. Three single site strains (B6, FVB, NSG) used donor DNA as minicircles to generate vector-free RMKI alleles. In contrast, the two (B6 and NSG) dual-site strains used plasmids or modified BACs to generate vector-free RMCE alleles.

Generation of single site Bxb1 attP-GT landing pad mouse strains (RosaBxb-GT)

A single Bxb1 attP-GT site was introduced 3’ of the XbaI site in intron 1 of the ROSA26 locus (Fig. 2). To reduce potential off-target cutting, we used an 18-mer TRU-guide gRNA38 which consistently cuts at ≥ 85% at this locus. For the mouse strain B6 only, we used a GUIDE-seq strategy39 introducing the attP site using Cas9 mRNA, gRNA#1888 and a 48 bp dsDNA oligonucleotide donor (duplexed oligos #1927/#1928) delivered by microinjection (MIJ). Analysis of the resulting allele in B6 showed successful integration of the complete attP-GT site into the ROSA26 locus. However, this approach was inefficient (1/15) and in this instance led to a lesion at the integration site, with a +6 bp/−3 bp INDEL preceding the 5’ end of the attP-GT site; Fig. 2. As a consequence this strategy was abandoned and for all other attP integrations into other strains of mice strains we used Cas9 mediated HDR with a single-stranded oligo (#1969 or #1970) which included homology arms13. These data are outlined in Table 1.

Figure 2
figure 2

Construction of RosaBxb-GT alleles on multiple genetic backgrounds. (A) For the mouse strain B6 only, a CRISPR/Cas9 GUIDE-seq strategy was used resulting in the insertion of the attP-GT site with an indel. (B) For all other backgrounds a CRISPR/Cas9 mediated HDR strategy was used, resulting in scarless incorporation of the attP site to generate the RosaBxb-GT alleles. (C) Sequence of Bxb1 attP-GT site for B6.RosaBxb1-GT, showed that an indel occurred in B6 (only) and the correct RosaBxb-GT allele with perfect integration occurred in mouse strains 129S1.RosaBxb-GT, AJ.RosaBxb-GT, CAST.RosaBxb-GT, DBA2.RosaBxb-GT, FVB.RosaBxb-GT, NOD.RosaBxb-GT, NSG.RosaBxb-GT, NZO.RosaBxb-GT, and PWK.RosaBxb-GT. Generation of RosaBxb-GT/GA alleles on B6 (D) and NSG (E) genetic backgrounds using a CRISPR/Cas9 HDR strategy wherein the attP-GA site was placed 240 bp 3’ of the attP-GT site, using gRNA #2254 and asymmetric donor oligo #2255. (F) Sequence detail of the Bxb1 attP-GA site region in strains B6.RosaBxb1-GT/GA and NSG.RosaBxb1-GT/GA.

Table 1 Summary of CRISPR/Cas9-mediated Knock-In efficiency of Bxb1 attP site(s) into the ROSA26 locus.

As shown in Table 1, all strain backgrounds attempted resulted in successful generation of the ROSA26 attP-GT allele, with the exception of WSB. The primary challenge for CAST and WSB strains was low embryo production, due to super-ovulation resistance combined with poor embryo survival post-manipulation. For CAST this was overcome using timed matings followed by electroporation of zygotes to introduce duplexed Cas9 protein + guide and donor oligo, producing two carrier Founders. Unfortunately, while CRISPR/Cas9 targeted NHEJ was detected in WSB Founders, we did not detect knock-in of the attP site in the very limited number of offspring obtained. For all other strains listed, the attP knock-in allele was detected, transmitted and confirmed as described. After a minimum of one back-cross to the appropriate strain background, each new strain was bred to homozygosity. Ultimately a single line was selected, Supplemental Table S1.

Use of single site RosaBxb-GT strains to generate RMKI alleles

The generation of vector-free RMKI alleles was achieved as shown in Fig. 3, with the screening strategy as outlined in Fig. 4. A range of conditions were attempted to optimize the system for efficient generation of RMKI alleles with three single-site strains, B6.RosaBxb-GT, FVB.RosaBxb-GT, and NSG.RosaBxb-GT. Results from all conditions tested for each donor minicircle were combined and summarized in Table 2. This table highlights our attempts to integrate twelve unique minicircle plasmid donors ranging in size from 3.2 to 9.9 kb. The highly variable results were expected due to the multiple conditions tested. Overall, we demonstrated that more than half (22/39) of the individual conditions resulted in the successful generation of an RMKI allele, with rates as high as 33%.

Figure 3
figure 3

Schema for using single attP-GT site mouse strains targeted by RMKI. Before microinjection, the plasmid vector containing the donor transgene and a single cognate attB-GT site are converted into (vector-less) DNA minicircles and purified. Along with Bxb1 Int, the minicircles are delivered into zygotes carrying the attP-GT landing pad allele, enabling precision transgene integration as the attP (host genome), and attB (donor DNA) sites recombine to form attR and attL sites.

Figure 4
figure 4

RMCE screening strategy. Screening offspring generated by microinjection follows a reductive series of PCRs to identify candidate animals. Both PCR1 and PCR2 assays are used to screen all potential Founder animals, while PCR3 and PCR4 assays are performed on only those positive by PCR1 and/or PCR2. A “generic” multiplex PCR1 was used to identify the presence of the RMCE allele resulting from any DNA donor, targeting both the ROSA26 locus and the recombined attR-GT site (green/red box). PCR1 serves as a DNA template quality control and used later to determine zygosity when verified animals are inbred. A project-specific PCR2, using transgene-specific forward and reverse primers (GSP), verifies that donor DNA is present in the sample (note—this is not designed to distinguish between random insertion events and the RMCE allele). PCR3 assays verify that the correct (ROSA26) integration event has occurred utilizing a PCR In/Out strategy, where the PCR span each newly created unique junction sites. This is followed by Sanger Sequencing verification of the PCR product. For transgenes < 10 Kb, re-positioning of GSP-R2 and GSP-F2 can be used to generate overlapping PCR products that allow sequence verification of the entire allele from the two long-range amplicons. Assay PCR4 is performed to screen for the presence of aberrant/random insertion using vector-backbone specific primers combined with transgene specific primers (GSP) designed to span the attB sites in the DNA donor.

Table 2 Summary of RMKI attempts.

In 14 independent tests we attempted to integrate twelve unique minicircles into the RosaBxb-GT single site allele across three different genetic backgrounds (B6, FVB, NSG). Minicircle DNA donor size, the number of microinjected embryos transferred and the number of positive Founders over the number of animals screened is shown. Recommended conditions are given in Material and Methods. Note, results from multiple conditions using the same donor were combined. Individual conditions tested ranged from 0 to 33% correct integrations. Aberrant alleles resulting from the minicircle production step occasionally resulted in larger inserts than intended (indicated by*). All projects taken to liveborn and established colonies, except for MC-12, which was used solely to optimize conditions and were collected as embryos for rapid analysis.

In < 5% of identified integrations, we detected aberrant integration events when analyzing RMKI alleles. These appear to be the result of unintended rearrangement occurring during donor DNA minicircle production. Analysis of these alleles suggests that in vitro φC31-based recombination designed to aid minicircle production has combined multiple plasmid DNAs producing some minicircles with 2–3 copies in tandem of the intended transgene. Where identified, these aberrant alleles are shown in Table 2. In the case of MC-1, which resulted in 3 copies of a double-inverted open-reading frame (DIO) allele designed to express a toxin only in cells where Cre is expressed, the allele was maintained and shown to function as intended (data not shown). As a general rule however, such aberrant allele lines were detected and discarded early in the screening process, or after sequencing the region by nCATS.

Addition of second site, attP-GA

Although the RMKI strategy was successful with inserts up to ~ 15 kb, the overall efficiency of donor construct integration, plus operational constraints in making minicircle donor DNA led us to engineer RMCE enabled host mice. This was achieved through the addition of a second (heterologous) attP site, with “GA” as the central dinucleotide as opposed to the wild type “GT” used for the first site. This second site (attP-GA) was placed in cis, 240 bp 3’ of the first attP-GT site, in B6.RosaBxb-GT and NSG.RosaBxb-GT strains (Fig. 2). This sequential modification was achieved using CRISPR/Cas9 and sgRNA#2254 to mediate the HDR integration of donor oligo #2255 into zygotes homozygous for the initial attP-GT insertion, successfully generating the RosaBxb-GT/GA allele in B6 and NSG; Table 1. After a minimum of one back-cross to the appropriate strain background these were bred to homozygosity; see Supplemental Table S1.

Use of dual site RosaBxb-GT/GA strains to generate RMCE Alleles

Typically, homozygous males carrying the dual landing pad are mated to super-ovulated wild-type females, generating heterozygous embryos for MIJ with Bxb1-Int mRNA and various DNA donor plasmids. Highly purified donor plasmid carrying dual cognate attB sites flanking the donor transgene were MIJ’d into zygotes as outlined in Fig. 5, and results summarized in Table 3. Twenty-five unique transgenic alleles were created at efficiencies ranging from 3 to 43%, with a combined average efficiency across all projects of 15% RMCE positive alleles, with constructs ranging in size from 1.5 to ~ 43 kb. As highlighted in Table 3, a few projects initially performed poorly (P-7, P-12, P-15 and P21). However, after repeating the MIJ with a newly purified donor DNA, these generally succeeded, often with dramatically improved results.

Figure 5
figure 5

Schema for the use of dual attP-GT/attP-GA mouse strains targeted for transgenesis by RMCE. Bxb1 integrase mRNA and the vector carrying the donor transgene (flanked by dual attB-GT, attB-GA sites) are microinjected into zygotes carrying the cognate attP-GT, attP-GA sites. Upon integration, RMCE excludes the vector backbone and results in the precise integration of the desired donor transgene into the ROSA26 locus.

Table 3 Successful targeted transgenesis in dual site RosaBxb-GT/GA strains.

Founder offspring carrying the donor insertion were backcrossed to their respective genetic background generating N1 animals for germline transmission and complete validation of the transgenic allele. Screening was performed using a combination of PCR strategies and Sanger sequencing of product as described. A limited number of alleles were subject to nCATS to completely verify the precise nature of the transgene in its genomic context. On rare occasions (> 1%), we did identify instances where either a mis-matched recombination event occurred (i.e., attB-GA + attP-GT) or only one of the two sites successfully recombined. It should be emphasized that these were rare events, see40, and these alleles were easily identified by our screening strategy allowing for the aberrant lines to be detected and discarded. We did detect off-target, random integrations of the donor DNA at expected frequencies for random transgenesis (~ 5% or less). Typically, Founders carrying an off-target allele were discarded. On the rare occasion where a positive founder also carried a random integration these, mice were backcrossed to segregate and select for the correct integration event.

The rapid development of a Krt18 driven human ACE2 transgenic model in B6 and NSG backgrounds

Based on a construct used to make the random transgenic mouse line B6.Cg-Tg(K18-ACE2)2Prlmn/J JAX# 03486041, a near identical 6.84 kb construct (“P-11”) was assembled to express human ACE2 under the direction of a human Krt18 promoter. Using Bxb1 Int-mediated transgenesis this construct was introduced into the B6.RosaBxb1-GT/GA and NSG.RosaBxb1-GT/GA strains. Initial characterization of offspring detected the correctly targeted transgene in Founder animals in the B6 background at 19% (10/54) and in the NSG background, 29% (18/63)—these data are summarized in Table 3. Carrier Founders were backcross to their respective background and confirmed faithful germline transmission. The resulting heterozygous targeted offspring were identified and the transgene confirmed by PCR and Sanger sequencing. These N1 animals were determined to be free of random transgenes and were also verified by nCATS. For both strains, heterozygous animals were available for experiments ~ 4 months from the date of injection. Both lines were bred to homozygosity and deposited at the Jackson Laboratory Repository (see Supplemental Table S1).

Comparative transgene sequence verification of the B6.RMCE[Krt18-ACE2] allele versus a random ACE2 transgenic strain B6.Cg-Tg(K18-ACE2)2Prlmn/J

The development of a targeted transgenesis strategy was driven by the ever-increasing need for efficient and precise genetic engineering of animals. To fully verify insertions of increasingly larger DNA constructs we developed a DNA Nanopore sequencing workflow, based on nCATS, which provides enrichment of reads spanning the transgenic insertion42. Currently, we have applied this targeted sequencing workflow to Bxb1-int mediated transgenic strains with insertions of 5–43 kb. In Fig. 6a we also present an example of the nCATS workflow and the resulting characterization of the B6.RMCE[K18-HuACE2] mouse. This allele was validated by gRNA targeting and subsequent enrichment of an 8.5 kb region, generating an average per-base coverage of 195X over the transgene and surrounding chromosomal region (Fig. 6b). These nCATS-derived data, which include 800 bp and 900 bp flanking genomic (ROSA26) sequence, fully support that the transgenic sequence integrated precisely as expected.

Figure 6
figure 6

Characterization of transgenic mice using nCATS. (A) Post sequencing workflow to construct insert regions and identify genomic locations. (B) nCATS characterization of B6.Cg-Tg(Krt18-ACE2) allele generated using the Bxb1 Int to achieve single copy integration, 215 reads on-target generating 195X coverage across the entire 8.5 kb insertion site and 900/800 bp genomic breakpoints. (C) nCATS characterization B6.Cg-Tg(K18-ACE2)2Prlmn allele, 5’ and 3’ breakpoints identified at least 11 copies of K18-ACE2 in varying orientations.

To illustrate the precision of Bxb1 Int-mediated transgene integration compared to conventional random integration, we applied nCATS to the B6.Cg-Tg(K18-ACE2)2Prlmn/J mouse (JAX Stock #034860). Guides (Supplemental Table S2) targeting the suspected integration site on Chromosome 2 (chr2:99204917–99225067) as well as against the CDS of ACE2 revealed the complex architectural nature of this random transgenic insertion. These data lead to a reconstruction of a 79.3 kb 5’ and 70.5 kb 3’ breakpoint sequences (Fig. 6c) and identified K18-ACE2 concatemers consisting of at least 11 copies in both orientations. In addition, we also detected a 10.5 kb duplication of Chromosome 2 at the periphery of each breakpoint along with K18-ACE2 cassette truncations at sites of inversion throughout the insertion. Due to the repeat-rich complexity of the insertion we were unable to generate a full consensus for the entire integration site.

Discussion

There is a need to create genetically modified animals where large transgenes are integrated precisely, in a defined genomic context efficiently to further advance biological research and transgenic development3,6,43. To address this we developed a strategy enabling efficient precision integration of large DNA constructs into the ROSA6 safe harbor locus directly via the zygote, across multiple mouse genetic backgrounds. Our aim was high efficiency, site-specific recombination which is not significantly impacted by donor constructs of up to at least 10 kb. For this we selected from the limited number of lysogenic bacteriophages tested for targeted integration into the mammalian genome, Bxb1 integrase.

To test the Bxb1 Int approach, we used CRISPR/Cas9-mediated insertion of the Bxb1 attP site with the wild type “GT” central dinucleotide into the ROSA26 locus in mouse background B6, FVB and the immunocompromised line NSG. Once established, these lines B6.RosaBxb1-GT, FVB.RosaBxb1-GT and NSG.RosaBxb1-GT) were used to test Bxb1 Int-mediated integration. We focused on establishing MIJ conditions (Fig. 3). To speed the process we developed a PCR screening and sequencing strategy which also covered the flanking regions of the ROSA26 integration attP-GT site. This provided sequence verification of Bxb1 Int-mediated insertions in the context of the ROSA26 locus. Our tests, summarized in Table 2, show clearly that targeted integration into ROSA26 attP-GT occurs at levels at or more often, considerably higher than random transgenesis.

A key driver of this work was to develop improved access to efficient transgenic targeting using multiple genetic backgrounds to facilitate comparative genetic background studies. Our success in the construction of and use the ROSA26 Bxb1-GT allele in three backgrounds inspired us to add the same attP site/location to a host of novel genetic backgrounds. The choice of these based on the Collaborative Cross and also included DBA/2J due to its importance in developing the BXD RI strains44,45.

The genetic modification of many genetic backgrounds has been hindered as many of these “non-conventional” mouse strains exhibit poor zygote yield and/or survivability post modification attempts. Although mouse embryonic stem cell (mESC) lines exist for many backgrounds and could be used to circumvent this, the introduction of constructs into these is often encumbered by drug selection markers, requires additional time to produce the chimeras and obtain germline transmission, verses direct modification of zygotes from the required background46,47,48. Also, by using extant mouse strains the actual genetic background of the resulting models will represent the present genome of the background and not that of a long-established mESC line49.

Although CRISPR/Cas9 mediated HDR with ssDNA oligonucleotides was relatively efficient, embryo yield and survivability from some genetic background remained problematic. Our initial attempts to introduce the attP-GT site used MIJ of zygotes. However as shown in Table 1, the results were mixed on both success to live born and the efficiency of introduction of the attP-GT site into the ROSA26 locus. This difficulty was especially evident in the wild inbred strains CAST, WSB and PWK, which failed due to a low embryo yield/female, compounded by subsequent poor survivability to term following MIJ. The use of electroporation as an approach to introduce CRISPR/Cas9 and small ssDNA donors has been shown by others to be efficient in B6 mice50,51. Using conditions previously established for B6 zygote electroporation we introduced the attP-GT site into a series of different genetic backgrounds. Although multiple direct comparative experiments of the overall efficiency of MIJ vs electroporation were not part of this study, we did succeed in obtaining attP-GT site integration in multiple strains, including CAST and PWK (Table 1), using electroporation. A review of these data suggests that the success is linked to the higher survivability of zygotes to term post electroporation as compared to MIJ, although further experiments would be needed to fully confirm this hypothesis. Combined, these efforts generated a diverse genetic panel of ten strains (listed in Tables 1 and 2) carrying the attP-GT site in the ROSA26 locus. However, even with our best efforts we failed at this time to generate the desired allele in the wild inbred mouse strain WSB due to poor availability of zygotes combined with low survivability post electroporation.

Out of concern for potential off-target integration of the donor attP-GT sequence, we modulated the concentration of the oligonucleotide delivered by EP in some of our tests. Although we did not screen for off-target integration, we did find that greatly reducing the level of donor oligo in the electroporation does not necessarily result in reduced knock-in efficiency. For example, despite reducing the concentration of the donor by more than 90% (to 70 ng/µl) when targeted NOD/ShiLtJ embryos, we still successfully generated the intended HDR allele with 57% efficiency (Table 1). This parameter should be further investigated, especially in the light of possible off-target integrations at the higher concentrations.

Precise Bxb1 Int-mediated integration of donor constructs into the ROSA attP-GT site was demonstrated in three genetic backgrounds. However, we sought to further improve efficiency and easy of use by exploring the use of a second Bxb1 attachment site (Fig. 2). We reasoned that the second site in close proximity to the initial attP-GT site would increase stochastic interaction and hence integration. A key attribute of using a dual heterologous attachment site strategy is that the donor vector can now be designed to integrate by RMCE, leading to excision and loss of the plasmid backbone and directional integration of the construct (Fig. 5)52. We selected the GA dinucleotide for our second site based on Jusiak et al.’s work which compared wild type Bxb1-GT to the Bxb1-GA variant in cell lines and noted that the GA variant resulted in more than double the integration efficiency40. They also reported minimal crosstalk between Bxb1 GA and GT att sites, an essential requirement when using dual heterologous sites40. We added the Bxb1 attP-GA site in cis to the attP-GT site in B6.RosaBxb-GT and NSG.RosaBxb-GT mouse, creating the strains B6.RosaBxb-GT/GA and NSG.RosaBxb-GT/GA respectively. Using these lines tests were conducted with a range of constructs (Fig. 5), and data summarized in Table 3. This process improvement also lead to operational advantages, including avoiding troublesome minicircle preparation, use and artifacts. Surprisingly, when using these dual attP site mice with cognate constructs we saw a more than three-fold increase in overall efficiency over similar tests with animals carrying a single attP-GT site. This increased efficiency is intriguing, perhaps reflecting synergy or simply an additive effect of the combined Bxb1-GT and the more efficient Bxb1-GA sites. RMCE may also improve operational efficiency, especially as transgene plasmid preparation is considerably simpler and potentially of higher quality52,53,54,55,56. The use of RMCE also opens the possibility of constructs of tens, if not hundreds of kilobases, which would be difficult to prepare as minicircles at the concentration and purity required.

Our data are insufficient to ascertain if there is a strong correlation between construct size and efficiency of integration. However, it would be logical to assume that increasing the size of the insert is likely to decrease the integration efficiency. Using identical DNA concentrations for MIJ (e.g., 30 ng/µl), larger constructs are actually delivered at a lower copy number per zygote, and the reduced molar concentration of donor attB sites would be expected to impact integration efficiency due to a simple reduction in stochastic interactions57. It also should be considered that larger constructs are more susceptible to DNA breakage during handling, including during MIJ, which would be expected to reduce their integration efficiency. To counter this, it may be worthwhile to incorporate some of the strategies previously described for BAC transgenics, notably modifying the MIJ buffer58,59. Preliminary work on this front shows promise, however an increase in aberrant insertion events suggests the need for greater optimization.

To demonstrate the Bxb1-int approach’s speed and to aid SARS-CoV research, two novel mouse models were created based on a previous random Krt18 driven human ACE2 transgenic mouse line41. Using dual aatP sites we produced a large number of ROSA26 targeted transgenics enabling the development of sequence-verified, single copy, N1 heterozygous transgenic alleles in the B6 and NSG backgrounds in under five months. Bxb1-Int mediated transgenesis is directional, for the Krt18-ACE2 lines developed here the transgene was inserted in the same orientation as Gt(Rosa26)Sor lncRNA. Ideas concerning which direction is optimal for transcription of inserted transgenes into the ROSA26 remain unresolved and are undoubtedly complex involving the particular integration site and the construct’s promoter21,23. Preliminary work on these two strains indicated that the ROSA26 transgene human ACE2 under the Krt18 promoter is transcribed at high levels in both B6 and NSG backgrounds (data not shown).

A central tenet of the approach outlined here is to provide precision and definition of DNA integration. In pursuit we also developed methods to deliver complete sequencing of integrated DNAs. using amplification-free targeted long read sequencing by nCATS. This allowed for example here, validation of > 43 kb insertions including ~ 1 kb of the flanking regions. As expected, nCATS verification of the Bxb1 Int-generated Krt18-ACE2 strains showed a single on-target integration with no disruption to the surrounding genomic sequence. These data demonstrate that the Bxb1 Int system can precisely integrate transgenes intact into a known genomic location and this can be rapidly validated including the surrounding genomic region. Combining the Bxb1 system with nCATS as demonstrated here, allows us to construct models with ever increasing confidence with reduced/no host chromosomal deformation. This is in sharp contrast, to the random transgenic B6.Cg-Tg(K18-ACE2)2Prlmn/J mouse where we clearly saw highly abberant integration. This illustrates how random integration can adversely impact the host genomic environment, for example here we identified a partial duplication of Chromosome 2 (Fig. 6c). Additional models generated via random integration transgenesis have been assayed with nCATS, and every model assayed showed concatemerization and often deletion of large sections of the host chromosome.

Conclusion and future directions

The strategy devised and presented here was envisioned to be used in any laboratory with access to zygote MIJ and general assisted reproductive technologies. Operational advantages include: scalability, high efficiency, rapid screening and defined precision integration of constructs to at least 43 kb. The approach can be used on multiple genetic backgrounds extending ideas of genetic diversity and crucially providing a uniform integration site for expression of transgenes. The attP/B approach also makes feasible ideas of subsequent sequential gene addition or “gene-stacking” by embedding other heterologous att sites in introduced constructs or by multipart assembly in vivo33,60,61. The advent of advanced sequencing protocol/methods exemplified here in the single molecule reading of ~ 80 kb further advances the reality of precision in the genetic modification of animals. Future directions with site specific recombinases include the potential to isolate novel integrases and/or manipulate targeting via directed evolution, enhancing efficiency through integrase site engineering, reprogramming specificity, or fusion to, e.g., Cas9 + gRNA, enabling targeting insertions to specific naturally occurring sequences within the genome28,30,37,62,63. Such designer site specific recombinases would have use in synthetic biology, biotechnology and genetic engineering, including human gene therapy.

Materials and methods

Zygote Microinjection

Microinjection was performed using a Zeiss AxioObserver.D1 microscope with Eppendorf NK2 micromanipulators and Narashige IM-5A injectors. Needles for microinjection were pulled fresh daily using WPI TW100F-4 capillary glass and a Sutter P97 horizontal puller. Zygotes were removed from culture and placed onto a slide containing 150µL of fresh room temperature M2 media. Standard zygote microinjection procedure was followed with special care taken to deposit material into both the pronucleus and the cytoplasm of each zygote. Before same-day embryo transfer was initiated, injected zygotes were removed from the slide and rinsed through three 30µL drops of equilibrated (under oil) K-RVCL and pooled in a separate 30µL microdrop of equilibrated K-RVCL.

Zygote electroporation

After collection, zygotes were placed into a 100µL drop of room temperature M2 media, then placed in Acidified Tyrode’s solution (Sigma, T1788) for 10 s followed by three washes through 100uL drops of room temperature M2 media. Zygotes were transferred into a 100µL drop of Opti-MEM media. Zygotes were then removed in a 10 µl volume, combined with 10 µl of the CRISPR reagents and deposited in a 1 mm electroporation cuvette (BTX, P/N 610) and electroporated in a Square Wave Electroporation System (BTX, ECM 830) using a voltage of 30 V, 100 ms pulse interval, 1 ms pulse duration, two pulses with 5 repeats (total of 6). Repeated aliquots of room temperature M2 media (100µL) were used to flush the zygotes from the cuvette to ensure all zygotes were recovered. Electroporated zygotes were then rinsed through three 30µL drops of equilibrated K-RVCL before being placed into a separate 30µL microdrop of equilibrated K-RVCL where they were allowed to recover before being processed for same-day embryo transfer.

Embryo transfer

Zygotes processed for same-day transfer were removed from culture and placed in a 1.8 mL screw-top tube (Thermo Scientific, 363,401) containing 900 µL of pre-warmed/equilibrated K-RVCL media for transport to the surgical station. Zygotes were removed from the tube and transferred via the oviduct into d0.5 pseudopregnant CByB6F1/J females (age 9–11 weeks).

Generation of RosaBxb-GT and RosaBxb-GT/GA landing Pad alleles in host strains

Bxb1 attachment sites (attP) were inserted into the ROSA26 locus by CRISPR/Cas9 using TruGuide gRNAs38 with donor oligonucleotides directly into zygotes using either microinjection (MIJ) or electroporation (EP), as detailed above. Project-specific parameters are indicated in Table 1, and all donor oligonucleotides, CRISPR gRNA targeting sequences and PCR screening primers are listed in Supplemental Table S2. Microinjection reagents were assembled as described previously64. Briefly, a 25 µl solution containing the Cas9 mRNA (60–100 ng/µl, Trilink), sgRNA (30–50 ng/µl), donor oligonucleotide (1–3 ng/µl, IDT) and RNasin® (0.2U/µl, Promega, P/N 2515) was assembled in MIJ TE buffer (10 mM Tris/0.1 mM EDTA/pH7.5; IDT). For some projects, Cas9 protein (100 ng/µl, PNABio P/N CP01), was also included (Table 1). Some strains were generated using electroporation as described by50,51 and detailed above. Briefly, a 10 µl solution containing the Cas9 protein (125–250 ng/µl, PNABio P/N CP01), sgRNA (100–154 ng/µl), and donor oligonucleotide (11–1106 ng/µl, IDT) was prepared in MIJ TE buffer, incubated for 30 min at 37 °C and then placed on ice before combining with zygotes.

For all strains created here, insertion of the initial Bxb1 attP-GT site used CRISPR/Cas9 with gRNA #1888, targeting the ROSA26 locus. For only mouse strain B6, a CRISPR/Cas9 GUIDE-seq strategy using duplexed donor oligos #1927/#1928 was applied39. For all other strains insertion of the initial Bxb1 attP-GT site used CRISPR/Cas9 mediated HDR with single-stranded donor oligo #1969 or #1970. The second Bxb1 site attP-GA, was inserted into zygotes isolated from B6 or NSG RosaBxb-GT mice homozygous for the initial Bxb1 attP-GT site and used CRISPR/Cas9 mediated HDR with gRNA #2254 and donor oligo #2255. The resulting two 48 bp attP sites are positioned 240 bp apart, with the initial attP-GT site placed 4 bp from the XbaI site in intron 1 of the ROSA26 locus. In all cases following MIJ or electroporation zygotes were transferred to pseudopregnant females and brought to term.

Tissue from tail tip or ear punch was used to isolate DNA for PCR analysis as outlined in64. Specifically for the first attP-GT site we used PCR primers #1699 and #2162, yielding a 254 bp product if the RosaBxb-GT allele was present. For the detection of the second site attP-GA, PCR using primers #2161 and #2162 yielded a 336 bp product if the RosaBxb-GT/GA allele was present. To fully validate the modified ROSA26 locus, PCR using primers #1699 & #1703 flanking the entire modified region were used followed by Sanger sequencing. After one back-cross to the unmodified parental strain, lines confirmed to be carrying the attP site/s were inbred to establish homozygous colonies as listed in the Results. The RosaBxb-GT/GA alleles on B6-albino and NOD.ShiLtJ backgrounds were produced by selective back-crossing of B6.RosaBxb-GT/GA mice with B6(Cg)-Tyrc-2J/J (stock#58), and NSG.RosaBxb-GT/GA with NOD/ShiLtJ (stock#1976), respectively.

RMKI host donor plasmid construction

To prepare minicircle DNA, desired transgenes are first cloned into a donor host plasmid (p5087). This plasmid was derived from the MN530A-1 (System Biosciences) by replacing the GFP Reporter sequence with a multiple cloning site using the restriction enzymes XmaI and StuI and duplexed donor oligonucleotides #2020 and #2021 (Supplemental Table S2). This plasmid was further modified to include the Bxb1 attB-GT site using restriction enzymes XbaI and EcoRI and duplexed oligonucleotides #2025 and #2026. Donor plasmid DNA (containing the attB-GT and the desired transgene) is prepared for injection by conversion into a vector-free minicircle using the MC-Easy™ Minicircle DNA Production Kit (SystemBio). Minicircle DNA is then isolated using the Purelink HiPure Plasmid Filter Midiprep kit (Invitrogen). To ensure complete removal of the parental plasmid, a restriction digest is performed to exclusively linearize the parental plasmid followed by an overnight digestion with Plasmid-Safe™ ATP-Dependent DNase (Lucigen). Prior to MIJ, the resultant minicircle DNA is extensively purified by phenol–chloroform extraction, precipitated with ethanol-sodium acetate and reconstituted in nuclease-free 10 mM Tris/0.1 mM EDTA/pH7.5.

Bxb1 Int mRNA

The Bxb1 Integrase mRNA used in RMCE and RMKI microinjections was generated by Trilink Biotechnologies (San Diego, CA USA), as a custom order. The mRNA was fully substituted with Pseudo-U, enzymatically capped and 2’OMethylated (Cap1), and delivered at ~ 1 mg/ml in 1 mM Sodium Citrate pH6.4. Batches of 10 µg single-use aliquots were prepared following phenol–chloroform purification with ethanol-ammonium acetate precipitation and reconstituted in MIJ TE and then stored at -80C until used. The full sequence provided to Trilink is shown in the Supplement, and was derived from pCAG-NLS-HA-Bxb1, a gift from Pawel Pelczar (Addgene plasmid # 51,271; http://n2t.net/addgene:51271; RRID:Addgene_51271)65.

RMKI by Bxb1 Int using RosaBxb-GT Host Mice

RMKI of donor DNA into the single-site RosaBxb-GT host strains was achieved by pronuclear microinjection of 1–30 ng/µl donor minicircle DNA with 100 ng/µl Bxb1 mRNA (Trilink) into zygotes of the desired attP-GT carrier strains. Typically, host zygotes heterozygous for Bxb1-attP were used, as these can be readily generated by super-ovulating wild-type dams, using for example C57BL/6J females mated to B6.RosaBxb-GT homozygous studs. Microinjection preps were prepared in MIJ TE supplemented with RNasin® (0.2U/µl, Promega, P/N 2515).

RMCE donor plasmid construction

For RMCE alleles, DNA vector requires the Bxb1 attB-GT and Bxb1 attB-GA sites are positioned in the correct orientation and flanking the desired sequence. Our donor plasmid p5154 (Addgene reference 175,390) is intended for small-medium sized donor DNAs (< 15 kb) while the p5155 (Addgene reference 175,391) is a low-copy version for use with larger donor DNAs (> 15 kb). Prior to microinjection, plasmid DNA containing the requisite attB-GT and attB-GA sites flanking the transgenic donor DNAs are isolated using the Purelink HiPure Plasmid Filter Midiprep kit (Invitrogen) followed by extensive purification via phenol–chloroform extraction, ethanol-sodium acetate precipitation with a final reconstitution in nuclease-free 10 mM Tris/0.1 mM EDTA/pH7.5.

RMCE in RosaBxb-GT/GA host mice

Recombinase-Mediated Cassette Exchange (RMCE) of donor DNA into the dual-site RosaBxb-GT/GA host strains is achieved by MIJ of 1–30 ng/µl donor DNA with 100 ng/µl Bxb1 mRNA (Trilink) into zygotes of the desired strain. Generally, host zygotes heterozygous for the dual Bxb1-attP sites were generated by super-ovulating wild-type dams, e.g., B6 mated to B6.RosaBxb-GT/GA homozygous studs. MIJ preps were prepared in nuclease-free 10 mM Tris/0.1 mM EDTA/pH7.5 (IDT).

Screening offspring for RMCE and RMKI donor integration

A generalized reductive sequential screening strategy for identifying all Bxb1 Int-mediated alleles was developed using a series of four to six simple PCR assays on crude DNA lysates prepared from tail-tip or ear punch biopsies64. As outlined in Fig. 4, these assays are designed to rapidly identify the correctly integrated alleles as well as screen for founder animals that also carry random insertions of donor DNA. The PCR primers used are defined in Supplemental Table S2. It is strongly recommended that time be taken to optimize these screening primers, see64.

PCR1 Assay is designed to rapidly identify candidate Founders using a generic multiplex PCR with primers #1699, #1703 and #2164. PCR1 also verifies the integrity of the genomic DNA template, yielding a wild type band of 776 bp, simultaneously generating a unique band in samples where the recombination event between attP-GT and attB-GT has occurred (~ 252 bp); Fig. 4. This PCR is performed on all potential Founder animals.

PCR2 Assay is a transgene-specific PCR using gene sequence-specific primers (GSP) designed to target a unique portion of the donor and is performed on all candidate offspring. Note, while this assay does not differentiate between random transgene events and correctly integrated alleles, its increased sensitivity will detect candidates that the multiplex PCR1 may miss due to low copy number.

PCR3 Assays Founder animals that show a positive result by PCR1 or PCR2 are next screened using two project-specific In/Out PCR assays designed to amplify the allele across both left and right ROSA26/Transgene junctions. As shown in Fig. 4, the In/Out-Left (IOL) PCR uses a forward primer in ROSA26 5’ of the attP-GT site (e.g., primer #1699) with a transgene-specific reverse primer (GSP) that is 3’ of the attB-GT site. Similarly, the In/Out-Right (IOR) PCR assay uses a reverse primer in ROSA26 3’ of the attP site (e.g., primer #1703) with a matching donor-specific forward primer. Resulting PCR products are sequenced to confirm the correct junction between host and donor DNA resulted64. Where possible (~ < 10 kb), these assays are modified to enable full-sequence verification using long-range PCR. In these instances, the In/Out PCRs are re-designed so as to overlap, i.e., the forward primer for the IOR PCR should be 5’ of the reverse primer for the IOL PCR.

PCR4 Assays To identify random transgenesis in putative candidate Founders, project-specific PCR for OTI’s are performed. These assays are designed to target across the attB site(s) in the donor DNA using vector backbone-specific primers with gene matching transgene-specific primers (GSP). Products from one or both of these assays indicate either a random insertion or an aberrant integration at the landing pad site has occurred.

RMKI alleles generated by minicircle integration into a single attP-GT landing pad in ROSA26 are screened in the same manner as RMCE alleles, except only one PCR4 Assay is needed to span the single attB site in the minicircle donor. However, an additional PCR is recommended to screen for the presence of any parental plasmid backbone that may have eluded the minicircle production process. These assays are project-specific and should use primers spanning the φC31 and attB sites in the parental plasmid (not shown).

Following the RMKI/RMCE strategy outlined here, Founder animals are expected be heterozygous for any Bxb1 Int-mediated integration. However, donor DNA integration may not occur with 100% efficiency in the single cell zygote, leading to a mosaic founder animal whose actual zygosity will be less than 50%. To confirm germline transmission and establish the integrity of the allele, one to four carrier Founders are back-crossed to the wild type parental strain (never other Founders). The resulting N1 animals are then subjected to the same PCR screening process (including PCR4), regardless of the genotype of the mosaic Founder animal. We recommend that a single verified N1 animal then be used to establish the new strain. Consequently, subsequent genotyping requires only a single multiplex PCR (e.g., IOL PCR3 with primer #1703 included) to determine the presence and zygosity of the new allele.

nCATS and data visualization

Unique gRNA’s for nanopore Cas9-targeted sequencing (nCATS) are designed 5’ and 3’ of the region to be sequenced (see Supplemental Table S2). For genomic DNA extraction, tissue is pulverized on dry ice in a Bessman Tissue Pulverizer (Spectrum™, 189,476). High molecular weight gDNA extraction using a Monarch® HMW DNA Extraction Kit for Tissue (New England Biolabs, T3060L) is performed on tissues isolated from N1 or higher generation mice. We do not recommend this process on founder animals as they are both often mosaic and potentially heterogeneous for the modification. nCATS targeted libraries are constructed as detailed in42 with the exception of ACE2-CDS of B6.Cg-Tg(K18-ACE2)2Prlmn/J JAX# 034860, where a single cut and read library was made per ACE2-CDS targeting gRNA, and pooled prior to Ampure bead purification. nCATS libraries are routinely sequenced for 24 h on a GridION using a R9.4.1 flow cell (Oxford Nanopore Technologies). After 24 h of sequencing, flow cells are nuclease flushed (Flow Cell Wash Kit EXP-WSH004, Oxford Nanopore Technologies) and reloaded with new nCATS libraries up to three times.

Base calling is done using GUPPY (v3.2.10) and the resulting FASTQ files are aligned to a reference sequence using minimap2(v2.17). Custom reference sequences were constructed for transgene insertion sites using the Mus musculus C57BL/6J reference genome (MM10) and corresponding insert sequence. Alignment results are subject to MapQ score filtering using Samtools (v1.11). Subsequent coverage depth for on-target reads are generated using Samtools (v1.11) and Bedtools (v2.29.2). On-target reads are visualized using the Integrative Genomics Viewer (IGV) and consensus sequences were generated using Medaka (v1.0.3) and annotated in Snapgene (v5.3).

Ethics statement

The Institutional Animal Care and Use Committee of The Jackson Laboratory approved all procedures used in this study and all mice were maintained at The Jackson Laboratory (Bar Harbor, ME, USA) in strict accordance with all institutional protocols and the Guide for the Care and Use of Laboratory Animals. Data and experiments reported are in accordance with ARRIVE guidelines (https://arriveguidelines.org) where applicable.