Please use this identifier to cite or link to this item: http://dspace.unimap.edu.my:80/xmlui/handle/123456789/34373
Title: Mean imputation techniques for filling the missing observations in air pollution dataset
Authors: Noorazian, Mohamed Noor
Ahmad Shukri, Yahaya, Prof. Madya
Nor Azam, Ramli, Prof. Dr.
Mohd Mustafa Al Bakri, Abdullah
norazian@unimap.edu.my
shukri@eng.usm.my
ceazam@eng.usm.my
mustafa_albakri@unimap.edu.my
Keywords: Air pollution
Imputation
Performance indicators
PM₁₀
Issue Date: 2014
Publisher: Trans Tech Publications
Citation: Key Engineering Materials, vol.594-595, 2014, pages 902-908
Abstract: Almost all real life datasets consist missing values. These are usually due to machine failure, routine maintenance, changes in siting monitors and human error. The occurence of missing values requires special attention on analysing the data. Incomplete datasets can cause bias due to systematic differences between observed and unobserved data. Therefore, the need to find the best way in estimating missing values is very important so that the data analysed is ensured of high quality. In this research, three types of mean imputation techniques that are mean, mean above and mean above below methods were used to replace the missing values. Annual hourly monitoring data for PM₁₀ were used to generate missing values. Four randomly simulated missing data were evaluated in order to test the efficiency of the methods used. They are 5%, 10%, 15%, 25% and 40%. Three types of performance indicators that are mean absolute error (MAE), root mean square error (RMSE) and coefficient of determination (R²) were calculated to describe the goodness of fit for all the method. From all the method applied, it was found that mean above below method is the best method for estimating data for all percentages of simulated missing values.
Description: Link to publisher's homepage at http://www.ttp.net/
URI: http://dspace.unimap.edu.my:80/dspace/handle/123456789/34373
ISSN: 1662-9795
Appears in Collections:Norazian, Mohamed Noor, Ts. Dr.
Mohd Mustafa Al Bakri Abdullah, Prof. Dr.
Center of Excellence for Geopolymer and Green Technology (CEGEOGTECH) (Articles)
School of Environmental Engineering (Articles)

Files in This Item:
File Description SizeFormat 
Mean imputation techniques for filling the missing observations in air pollution dataset.pdf172.08 kBAdobe PDFView/Open


Items in UniMAP Library Digital Repository are protected by copyright, with all rights reserved, unless otherwise indicated.