dc.contributor.author: Eng Aik, Lim
dc.contributor.author: Zarita, Zainuddin
dc.date.accessioned: 2009-12-10T03:39:51Z
dc.date.available: 2009-12-10T03:39:51Z
dc.date.issued: 2008-12-01
dc.identifier.citation: p.1-5 [en_US]
dc.identifier.isbn: 978-1-4244-2315-6
dc.identifier.uri: http://ieeexplore.ieee.org/search/wrapper.jsp?arnumber=4786656
dc.identifier.uri: http://dspace.unimap.edu.my/123456789/7393
dc.description: Link to publisher's homepage at http://ieeexplore.ieee.org [en_US]
dc.description.abstract: Missing data is a problem that permeates much of the research being done today. Some data sets frequently contain missing values, such as gene expression data, for which most downstream analyses of microarray experiments require complete data. In the literature, many methods have been proposed to estimate missing values using the correlation patterns within the data matrix. In this report we evaluate three leading current methods, a neural network method and two imputation methods, on multiple types of data, including microarray data and time series data such as air pollutant data and phytoplankton data. Based on overall performance, we then determine the most appropriate method for application to various data sets. We found that the best-performing methods, Local Least Squares Imputation (LLS) and Bayesian Principal Component Analysis (BPCA), are highly competitive with each other in overall results. We also tested the Radial Basis Function (RBF) network, a neural network method, and found that its overall performance is lower than that of both BPCA and LLS. According to the overall NRMSE of the three methods, BPCA provides the most accurate estimates of missing values. [en_US]
dc.language.iso: en [en_US]
dc.publisher: Institute of Electrical and Electronics Engineers (IEEE) [en_US]
dc.relation.ispartofseries: Proceedings of the International Conference on Electronic Design (ICED 2008) [en_US]
dc.subject: Missing data [en_US]
dc.subject: Radial basis function networks [en_US]
dc.subject: Estimation theory [en_US]
dc.subject: Bayes methods [en_US]
dc.subject: Least squares approximations [en_US]
dc.subject: Air pollutant data [en_US]
dc.subject: Missing value estimation [en_US]
dc.title: A comparative study of missing value estimation methods: which method performs better? [en_US]
dc.type: Working Paper [en_US]
dc.contributor.url: ealim@unimap.edu.my [en_US]

