Title
SAM Filtering Pipeline (SFP): Algorithm for the determination of integration sites from next generation sequencing data
Published Date
2019-07-16
Authors
O'Brien, Sofie A; Hu, Wei-Shou
Author Contact
Hu, Wei-Shou (acre@umn.edu)
Type
Dataset
Programming Software Code
Abstract
The locus at which a vector harboring a product transgene integrates into the genome can have a profound effect on the transgene’s transcript level and the stability of the resulting cell line. The SAM filtering pipeline (SFP) was created to identify the integration site(s) of a transfected vector from next-generation genome sequencing data. It is best suited to targeted sequencing data, such as that from sequence capture of probed vector regions. It also works with whole-genome sequencing data, but the memory requirements grow with the number of reads in the data set and can be large. The pipeline requires a .sam file of reads mapped with bwa-mem as input.
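As an illustration of the kind of evidence a bwa-mem mapped .sam file carries for an integration site, the following Python sketch scans SAM records for reads that link the vector to a host contig. It is not the SFP code and does not reproduce its filtering steps. It assumes the vector was included in the mapping reference under a single sequence name, and the function name `candidate_junction_reads` and the parameter `vector_name` are hypothetical. It looks for two standard signals: split reads, where the `SA:Z:` tag written by bwa-mem names a different reference than the primary alignment, and discordant pairs, where the mate maps to a different reference.

```python
def candidate_junction_reads(sam_lines, vector_name):
    """Yield (read name, host contig, host position, evidence) tuples.

    Illustrative sketch only; 'vector_name' is the assumed name of the
    vector sequence in the reference used for mapping.
    """
    for line in sam_lines:
        if line.startswith("@"):          # skip SAM header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:              # not a complete alignment record
            continue
        qname, flag, rname, pos = fields[0], int(fields[1]), fields[2], int(fields[3])
        if flag & 0x4 or rname == "*":    # read itself is unmapped
            continue

        # Split reads: the SA tag lists supplementary alignments as
        # rname,pos,strand,CIGAR,mapQ,NM; separated by semicolons.
        for tag in fields[11:]:
            if not tag.startswith("SA:Z:"):
                continue
            for entry in tag[5:].split(";"):
                if not entry:
                    continue
                sa_rname, sa_pos = entry.split(",")[:2]
                if rname == vector_name and sa_rname != vector_name:
                    yield qname, sa_rname, int(sa_pos), "split"
                elif rname != vector_name and sa_rname == vector_name:
                    yield qname, rname, pos, "split"

        # Discordant pairs: the mate maps to a different reference sequence.
        rnext, pnext = fields[6], int(fields[7])
        if rnext not in ("=", "*") and rnext != rname:
            if rname == vector_name:
                yield qname, rnext, pnext, "pair"
            elif rnext == vector_name:
                yield qname, rname, pos, "pair"
```

The function accepts any iterable of lines, so an open file handle can be passed directly and the file is read one record at a time. The same read can be reported more than once (for example, from both mates of a pair), so a real analysis would still need to deduplicate and cluster the reported positions.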
Referenced by
O'Brien, S.A., Ojha, J., Wu, P. and Hu, W.‐S. (2020), Multiplexed clonality verification of cell lines for protein biologic production. Biotechnology Progress, e2978.
License
Depositor did not specify a license. Material may be reused with appropriate attribution.
Suggested Citation
O'Brien, Sofie A; Hu, Wei-Shou. (2019). SAM Filtering Pipeline (SFP): Algorithm for the determination of integration sites from next generation sequencing data. Retrieved from the Data Repository for the University of Minnesota, https://doi.org/10.13020/9wgm-mj51.