Please use this identifier to cite or link to this item:
http://dspace.unimap.edu.my:80/xmlui/handle/123456789/42067
Title: | Semi-automated construction of Neglish-Malay machine readable dictionary for technical terms |
Authors: | Nurfathiah, Abd Ghani Dr. Nik Adilah Hanin Zahri |
Keywords: | Machine readable dictionary English-Malay dictionary Technical term Machine readable dictionary -- Design and construction Keyword density |
Issue Date: | Jun-2015 |
Publisher: | Universiti Malaysia Perlis (UniMAP) |
Abstract: | This project presents a method for semi – automated construction of English – Malay machine readable dictionary for technical terms. We proposed to use Keyword Density in order to classify the category for each term by measuring the weight of the term with Visual Studio using visual basic language. In the meantime, Cosine Similarity algorithm is used to measure the similarity between two sentence which are definition and sentence from the journal using C language. In order to calculate the category, 523 trainings data which is a set of journal for each term was collected. Then, we preprocessed the journal by using Brill’s Tagger with Penn-Tree Bank Tagger. We assigned 50 terms to test the algorithm. By using word extraction method the terms occurrence was counted. The total of the word in the category journal are also calculated. To categorize the term, we calculated the keyword density. For example sentence extraction, the data is used from the highest cosine similarity measurement between definition and sentence from journal. The sentence with the highest value was extracted as example sentence by the system. By using this algorithm, the Precision for the example sentence is 79%, Recall 90% and the F-Measure is 84%. It can be considered as a successful since the result is high. As a conclusion, based on the result, the proposed method shows a great potential with further improvement. |
Description: | Access is limited to UniMAP community. |
URI: | http://dspace.unimap.edu.my:80/xmlui/handle/123456789/42067 |
Appears in Collections: | School of Computer and Communication Engineering (FYP) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Abstract,Acknowledgement.pdf | 452.91 kB | Adobe PDF | View/Open | |
Introduction.pdf | 371.08 kB | Adobe PDF | View/Open | |
Literature Review.pdf | 221.38 kB | Adobe PDF | View/Open | |
Methodology.pdf | 472.13 kB | Adobe PDF | View/Open | |
Results and Discussion.pdf | 336.65 kB | Adobe PDF | View/Open | |
Conclusion and Recommendation.pdf | 194.37 kB | Adobe PDF | View/Open | |
Refference and Appendics.pdf | 659.59 kB | Adobe PDF | View/Open |
Items in UniMAP Library Digital Repository are protected by copyright, with all rights reserved, unless otherwise indicated.