0 674

Cited 3 times in

SoloDel: a probabilistic model for detecting low-frequent somatic deletions from unmatched sequencing data

Authors
 Junho Kim  ;  Sanghyeon Kim  ;  Hojung Nam  ;  Sangwoo Kim  ;  Doheon Lee 
Citation
 BIOINFORMATICS, Vol.31(19) : 3105-3113, 2015 
Journal Title
BIOINFORMATICS
ISSN
 1367-4803 
Issue Date
2015
MeSH
Computer Simulation ; Databases, Genetic ; Humans ; Mental Disorders/genetics ; Models, Statistical* ; Reproducibility of Results ; Sequence Analysis, DNA/methods* ; Sequence Deletion/genetics* ; Software*
Abstract
MOTIVATION: Finding somatic mutations from massively parallel sequencing data is becoming a standard process in genome-based biomedical studies. There are a number of robust methods developed for detecting somatic single nucleotide variations However, detection of somatic copy number alteration has been substantially less explored and remains vulnerable to frequently raised sampling issues: low frequency in cell population and absence of the matched control samples.

RESULTS: We developed a novel computational method SoloDel that accurately classifies low-frequent somatic deletions from germline ones with or without matched control samples. We first constructed a probabilistic, somatic mutation progression model that describes the occurrence and propagation of the event in the cellular lineage of the sample. We then built a Gaussian mixture model to represent the mixed population of somatic and germline deletions. Parameters of the mixture model could be estimated using the expectation-maximization algorithm with the observed distribution of read-depth ratios at the points of discordant-read based initial deletion calls. Combined with conventional structural variation caller, SoloDel greatly increased the accuracy in classifying somatic mutations. Even without control, SoloDel maintained a comparable performance in a wide range of mutated subpopulation size (10-70%). SoloDel could also successfully recall experimentally validated somatic deletions from previously reported neuropsychiatric whole-genome sequencing data.

AVAILABILITY AND IMPLEMENTATION: Java-based implementation of the method is available at http://sourceforge.net/projects/solodel/
Full Text
http://bioinformatics.oxfordjournals.org/content/31/19/3105.long
DOI
10.1093/bioinformatics/btv358
Appears in Collections:
1. College of Medicine (의과대학) > BioMedical Science Institute (의생명과학부) > 1. Journal Papers
Yonsei Authors
Kim, Sangwoo(김상우) ORCID logo https://orcid.org/0000-0001-5356-0827
Kim, Jun Ho(김준호)
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/141180
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links