Reuse of imputed data in microarray analysis increases imputation efficiency

Ki-Yeol Kim; Byoung-Jin Kim; Gwan-Su Yi

doi:10.1186/1471-2105-5-160

YUHSpace

BROWSE

461 1196

Cited 0 times in

Cited 165 times in

Reuse of imputed data in microarray analysis increases imputation efficiency

DC Field	Value	Language
dc.contributor.author	김기열	-
dc.date.accessioned	2015-07-14T17:25:47Z	-
dc.date.available	2015-07-14T17:25:47Z	-
dc.date.issued	2004	-
dc.identifier.uri	https://ir.ymlib.yonsei.ac.kr/handle/22282913/112864	-
dc.description.abstract	BACKGROUND: The imputation of missing values is necessary for the efficient use of DNA microarray data, because many clustering algorithms and some statistical analysis require a complete data set. A few imputation methods for DNA microarray data have been introduced, but the efficiency of the methods was low and the validity of imputed values in these methods had not been fully checked. RESULTS: We developed a new cluster-based imputation method called sequential K-nearest neighbor (SKNN) method. This imputes the missing values sequentially from the gene having least missing values, and uses the imputed values for the later imputation. Although it uses the imputed values, the efficiency of this new method is greatly improved in its accuracy and computational complexity over the conventional KNN-based method and other methods based on maximum likelihood estimation. The performance of SKNN was in particular higher than other imputation methods for the data with high missing rates and large number of experiments. Application of Expectation Maximization (EM) to the SKNN method improved the accuracy, but increased computational time proportional to the number of iterations. The Multiple Imputation (MI) method, which is well known but not applied previously to microarray data, showed a similarly high accuracy as the SKNN method, with slightly higher dependency on the types of data sets. CONCLUSIONS: Sequential reuse of imputed data in KNN-based imputation greatly increases the efficiency of imputation. The SKNN method should be practically useful to save the data of some microarray experiments which have high amounts of missing entries. The SKNN method generates reliable imputed values which can be used for further cluster-based analysis of microarray data.	-
dc.description.statementOfResponsibility	open	-
dc.format.extent	3500~3507	-
dc.relation.isPartOf	BMC BIOINFORMATICS	-
dc.rights	CC BY-NC-ND 2.0 KR	-
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/2.0/kr/	-
dc.subject.MESH	Efficiency, Organizational/standards*	-
dc.subject.MESH	Microarray Analysis/methods*	-
dc.subject.MESH	Microarray Analysis/standards*	-
dc.title	Reuse of imputed data in microarray analysis increases imputation efficiency	-
dc.type	Article	-
dc.contributor.college	Researcher Institutes (부설 연구소)	-
dc.contributor.department	Oral Cancer Research Institute (구강종양연구소)	-
dc.contributor.googleauthor	Ki-Yeol Kim	-
dc.contributor.googleauthor	Byoung-Jin Kim	-
dc.contributor.googleauthor	Gwan-Su Yi	-
dc.identifier.doi	10.1186/1471-2105-5-160	-
dc.admin.author	false	-
dc.admin.mapping	false	-
dc.contributor.localId	A00337	-
dc.relation.journalcode	J00350	-
dc.identifier.eissn	1471-2105	-
dc.identifier.pmid	15504240	-
dc.contributor.alternativeName	Kim, Ki Yeol	-
dc.contributor.affiliatedAuthor	Kim, Ki Yeol	-
dc.rights.accessRights	free	-
dc.citation.volume	5	-
dc.citation.startPage	3500	-
dc.citation.endPage	3507	-
dc.identifier.bibliographicCitation	BMC BIOINFORMATICS, Vol.5 : 3500-3507, 2004	-
dc.identifier.rimsid	36763	-
dc.type.rims	ART	-

Appears in Collections:: 2. College of Dentistry (치과대학) > Others (기타) > 1. Journal Papers

Show simple item record Find it @ YMLIB

License

YUHSpace: Reuse of imputed data in microarray analysis increases imputation efficiency

YUHSpace

BROWSE

Browse

Links