Please use this identifier to cite or link to this item: http://dspace.unimap.edu.my:80/xmlui/handle/123456789/42067
Title: Semi-automated construction of Neglish-Malay machine readable dictionary for technical terms
Authors: Nurfathiah, Abd Ghani
Dr. Nik Adilah Hanin Zahri
Keywords: Machine readable dictionary
English-Malay dictionary
Technical term
Machine readable dictionary -- Design and construction
Keyword density
Issue Date: Jun-2015
Publisher: Universiti Malaysia Perlis (UniMAP)
Abstract: This project presents a method for semi – automated construction of English – Malay machine readable dictionary for technical terms. We proposed to use Keyword Density in order to classify the category for each term by measuring the weight of the term with Visual Studio using visual basic language. In the meantime, Cosine Similarity algorithm is used to measure the similarity between two sentence which are definition and sentence from the journal using C language. In order to calculate the category, 523 trainings data which is a set of journal for each term was collected. Then, we preprocessed the journal by using Brill’s Tagger with Penn-Tree Bank Tagger. We assigned 50 terms to test the algorithm. By using word extraction method the terms occurrence was counted. The total of the word in the category journal are also calculated. To categorize the term, we calculated the keyword density. For example sentence extraction, the data is used from the highest cosine similarity measurement between definition and sentence from journal. The sentence with the highest value was extracted as example sentence by the system. By using this algorithm, the Precision for the example sentence is 79%, Recall 90% and the F-Measure is 84%. It can be considered as a successful since the result is high. As a conclusion, based on the result, the proposed method shows a great potential with further improvement.
Description: Access is limited to UniMAP community.
URI: http://dspace.unimap.edu.my:80/xmlui/handle/123456789/42067
Appears in Collections:School of Computer and Communication Engineering (FYP)

Files in This Item:
File Description SizeFormat 
Abstract,Acknowledgement.pdf452.91 kBAdobe PDFView/Open
Introduction.pdf371.08 kBAdobe PDFView/Open
Literature Review.pdf221.38 kBAdobe PDFView/Open
Methodology.pdf472.13 kBAdobe PDFView/Open
Results and Discussion.pdf336.65 kBAdobe PDFView/Open
Conclusion and Recommendation.pdf194.37 kBAdobe PDFView/Open
Refference and Appendics.pdf659.59 kBAdobe PDFView/Open


Items in UniMAP Library Digital Repository are protected by copyright, with all rights reserved, unless otherwise indicated.