dc.contributor.author | Norazian, Mohamed Noor | |
dc.contributor.author | Mohd Mustafa, Al Bakri Abdullah | |
dc.contributor.author | Ahmad Shukri, Yahaya | |
dc.contributor.author | Nor Azam, Ramli | |
dc.date.accessioned | 2008-05-20T07:38:32Z | |
dc.date.available | 2008-05-20T07:38:32Z | |
dc.date.issued | 2007-06-09 | |
dc.identifier.uri | http://dspace.unimap.edu.my/123456789/1171 | |
dc.description | Organized by Universiti Malaysia Perlis (UniMAP), 9th - 12th June 2007 at Park Royal Hotel, Penang. | en_US |
dc.description.abstract | Missing data is a very frequent problem in many scientific fields, including environmental research. It is usually due to machine failure, routine maintenance, changes in the siting of monitors, and human error. Incomplete datasets can cause bias due to systematic differences between observed and unobserved data. It is therefore important to find the best way of estimating missing values so that the data analysed are of high quality. In this study, two methods were used to estimate the missing values in an environmental data set, and their performances were compared. The two methods are the linear interpolation method and the mean method. Annual hourly monitoring data for PM10 were used to generate simulated missing values. Four randomly simulated missing data patterns, with 10%, 15%, 25% and 40% of the values missing, were generated to evaluate the accuracy of the imputation techniques under different missing data conditions. Three performance indicators, namely mean absolute error (MAE), root mean squared error (RMSE) and coefficient of determination (R2), were calculated to describe the goodness of fit of the two methods. Of the two methods applied, linear interpolation gave better results than the mean method in substituting data for all percentages of missing data considered. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Universiti Malaysia Perlis (UniMAP) | en_US |
dc.relation.ispartofseries | 1st International Conference on Sustainable Materials 2007 (ICoSM2007) | en_US |
dc.subject | Linear interpolation method | en_US |
dc.subject | Mean method | en_US |
dc.subject | Missing values | en_US |
dc.subject | Environmental research -- Missing values | en_US |
dc.subject | Environmental engineering -- Research | en_US |
dc.subject | Missing values -- Analysis | en_US |
dc.title | Comparison of Linear Interpolation Method and Mean Method to Replace the Missing Values in Environmental Data Set | en_US |
dc.title.alternative | 1st International Conference on Sustainable Materials 2007 (ICoSM2007) | en_US |
dc.type | Working Paper | en_US |
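
The comparison described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' code: the series is synthetic (not the PM10 monitoring data), the helper names are hypothetical, and R2 is computed as 1 - SS_res/SS_tot, which is an assumption because the record does not state which definition the paper uses.

```python
import numpy as np


def mask_random(n, fraction, rng):
    """Boolean mask marking round(fraction * n) interior points as missing.

    The first and last points are kept so interpolation never has to
    extrapolate (an assumption made for this sketch).
    """
    k = int(round(fraction * n))
    picked = rng.choice(np.arange(1, n - 1), size=k, replace=False)
    mask = np.zeros(n, dtype=bool)
    mask[picked] = True
    return mask


def impute_linear(values, missing):
    """Fill missing points by linear interpolation between observed neighbours."""
    idx = np.arange(len(values))
    out = values.astype(float)
    out[missing] = np.interp(idx[missing], idx[~missing], values[~missing])
    return out


def impute_mean(values, missing):
    """Fill missing points with the mean of the observed points."""
    out = values.astype(float)
    out[missing] = values[~missing].mean()
    return out


def scores(observed, predicted):
    """Return (MAE, RMSE, R2) for predicted against observed values."""
    err = predicted - observed
    mae = np.mean(np.abs(err))
    rmse = np.sqrt(np.mean(err ** 2))
    ss_res = np.sum(err ** 2)
    ss_tot = np.sum((observed - observed.mean()) ** 2)
    r2 = 1.0 - ss_res / ss_tot  # assumed definition of R2
    return mae, rmse, r2


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 24 * 30  # thirty days of synthetic hourly values
    t = np.arange(n)
    series = 50 + 20 * np.sin(2 * np.pi * t / 24) + rng.normal(0, 3, n)

    # Missing-data percentages taken from the abstract.
    for fraction in (0.10, 0.15, 0.25, 0.40):
        missing = mask_random(n, fraction, rng)
        truth = series[missing]
        for name, impute in (("linear", impute_linear), ("mean", impute_mean)):
            filled = impute(series, missing)
            mae, rmse, r2 = scores(truth, filled[missing])
            print(f"{fraction:.0%} {name:>6}: MAE={mae:.2f} RMSE={rmse:.2f} R2={r2:.2f}")
```

The indicators are evaluated only at the simulated gaps, where the true values are known. Results on this synthetic series say nothing about the figures reported in the paper.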