• Login
    View Item 
    •   DSpace Home
    • Final Year Project Papers & Reports
    • School of Computer and Communication Engineering (FYP)
    • View Item
    •   DSpace Home
    • Final Year Project Papers & Reports
    • School of Computer and Communication Engineering (FYP)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Semi-automated construction of Neglish-Malay machine readable dictionary for technical terms

    Thumbnail
    View/Open
    Abstract,Acknowledgement.pdf (452.9Kb)
    Introduction.pdf (371.0Kb)
    Literature Review.pdf (221.3Kb)
    Methodology.pdf (472.1Kb)
    Results and Discussion.pdf (336.6Kb)
    Conclusion and Recommendation.pdf (194.3Kb)
    Refference and Appendics.pdf (659.5Kb)
    Date
    2015-06
    Author
    Nurfathiah, Abd Ghani
    Metadata
    Show full item record
    Abstract
    This project presents a method for semi – automated construction of English – Malay machine readable dictionary for technical terms. We proposed to use Keyword Density in order to classify the category for each term by measuring the weight of the term with Visual Studio using visual basic language. In the meantime, Cosine Similarity algorithm is used to measure the similarity between two sentence which are definition and sentence from the journal using C language. In order to calculate the category, 523 trainings data which is a set of journal for each term was collected. Then, we preprocessed the journal by using Brill’s Tagger with Penn-Tree Bank Tagger. We assigned 50 terms to test the algorithm. By using word extraction method the terms occurrence was counted. The total of the word in the category journal are also calculated. To categorize the term, we calculated the keyword density. For example sentence extraction, the data is used from the highest cosine similarity measurement between definition and sentence from journal. The sentence with the highest value was extracted as example sentence by the system. By using this algorithm, the Precision for the example sentence is 79%, Recall 90% and the F-Measure is 84%. It can be considered as a successful since the result is high. As a conclusion, based on the result, the proposed method shows a great potential with further improvement.
    URI
    http://dspace.unimap.edu.my:80/xmlui/handle/123456789/42067
    Collections
    • School of Computer and Communication Engineering (FYP) [310]

    Atmire NV

    Perpustakaan Tuanku Syed Faizuddin Putra (PTSFP) | Send Feedback
     

     

    Browse

    All of UniMAP Library Digital RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    Atmire NV

    Perpustakaan Tuanku Syed Faizuddin Putra (PTSFP) | Send Feedback