0 483

Cited 6 times in

Vecuum: identification and filtration of false somatic variants caused by recombinant vector contamination

DC Field Value Language
dc.contributor.author김상우-
dc.date.accessioned2017-10-26T07:36:19Z-
dc.date.available2017-10-26T07:36:19Z-
dc.date.issued2016-
dc.identifier.issn1367-4803-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/152260-
dc.description.abstractMOTIVATION: Advances in sequencing technologies have remarkably lowered the detection limit of somatic variants to a low frequency. However, calling mutations at this range is still confounded by many factors including environmental contamination. Vector contamination is a continuously occurring issue and is especially problematic since vector inserts are hardly distinguishable from the sample sequences. Such inserts, which may harbor polymorphisms and engineered functional mutations, can result in calling false variants at corresponding sites. Numerous vector-screening methods have been developed, but none could handle contamination from inserts because they are focusing on vector backbone sequences alone. RESULTS: We developed a novel method-Vecuum-that identifies vector-originated reads and resultant false variants. Since vector inserts are generally constructed from intron-less cDNAs, Vecuum identifies vector-originated reads by inspecting the clipping patterns at exon junctions. False variant calls are further detected based on the biased distribution of mutant alleles to vector-originated reads. Tests on simulated and spike-in experimental data validated that Vecuum could detect 93% of vector contaminants and could remove up to 87% of variant-like false calls with 100% precision. Application to public sequence datasets demonstrated the utility of Vecuum in detecting false variants resulting from various types of external contamination. AVAILABILITY AND IMPLEMENTATION: Java-based implementation of the method is available at http://vecuum.sourceforge.net/ CONTACT: swkim@yuhs.acSupplementary information: Supplementary data are available at Bioinformatics online.-
dc.description.statementOfResponsibilityrestriction-
dc.publisherOxford University Press-
dc.relation.isPartOfBIOINFORMATICS-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/2.0/kr/-
dc.subject.MESHAlleles-
dc.subject.MESHGenetic Vectors-
dc.subject.MESHHigh-Throughput Nucleotide Sequencing*-
dc.subject.MESHMutation*-
dc.subject.MESHRecombination, Genetic-
dc.subject.MESHSoftware-
dc.titleVecuum: identification and filtration of false somatic variants caused by recombinant vector contamination-
dc.typeArticle-
dc.publisher.locationEngland-
dc.contributor.collegeCollege of Medicine-
dc.contributor.departmentDept. of Life Science-
dc.contributor.googleauthorJunho Kim-
dc.contributor.googleauthorJu Heon Maeng-
dc.contributor.googleauthorJae Seok Lim-
dc.contributor.googleauthorHyeonju Son-
dc.contributor.googleauthorJunehawk Lee-
dc.contributor.googleauthorJeong Ho Lee-
dc.contributor.googleauthorSangwoo Kim-
dc.identifier.doi10.1093/bioinformatics/btw383-
dc.contributor.localIdA00524-
dc.relation.journalcodeJ00299-
dc.identifier.eissn1367-4811-
dc.identifier.pmid27334474-
dc.identifier.urlhttps://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/bioinformatics/btw383-
dc.contributor.alternativeNameKim, Sang Woo-
dc.contributor.affiliatedAuthorKim, Sang Woo-
dc.citation.volume32-
dc.citation.number20-
dc.citation.startPage3072-
dc.citation.endPage3080-
dc.identifier.bibliographicCitationBIOINFORMATICS, Vol.32(20) : 3072-3080, 2016-
dc.date.modified2017-10-24-
dc.identifier.rimsid47996-
dc.type.rimsART-
Appears in Collections:
1. College of Medicine (의과대학) > BioMedical Science Institute (의생명과학부) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.