Please use this identifier to cite or link to this item: http://dspace.unimap.edu.my:80/xmlui/handle/123456789/41827
Title: Plagiarism detection using N-Gram model
Authors: Muhammad Syahir, Shah Kholl Ajam
Dr. Nik Adilah Hanin Zahri
Keywords: Plagiarism
N-Gram model
Plagiarism detection
Plagiarism detection -- Methods
Issue Date: Jun-2015
Publisher: Universiti Malaysia Perlis (UniMAP)
Abstract: The vast increase of available documents in the World Wide Web (WWW) and the ease access to these documents has lead to a serious problem of using other’s works without giving credits. Although many methods have been developed to detect some instances of plagiarism such as changing the structure of sentences or when slightly replacing words by their synonyms, it is often hard to reveal plagiarism when the copied sentences are deliberately modified. This project proposes an algorithm for plagiarism detection by using syntactic plagiarism detection using 1-gram and 2-gram. Jaccard similarity coefficient is applied to detect similarity between documents of English corpus in engineering field by using C programming language. From the value of the results which is precision, recall and f-measure, we considered 2-gram showed the great potential for the plagiarism detection method. The 2-gram extraction achieved values 0.983 for precision, 0.380 for recall and 0.548 for f-measure compared to 1-gram extraction. Jaccard similarity coefficient incorporation with N-gram method is suitable sufficiently to be employed in the word similarity measurement. In efficiency measurement, the program performance can deal appropriately with high stability to calculate the word similarity.
Description: Access is limited to UniMAP community.
URI: http://dspace.unimap.edu.my:80/xmlui/handle/123456789/41827
Appears in Collections:School of Computer and Communication Engineering (FYP)

Files in This Item:
File Description SizeFormat 
Abstract,Acknowledgement.pdf2.66 MBAdobe PDFView/Open
Introduction.pdf3.64 MBAdobe PDFView/Open
Literature Review.pdf885.59 kBAdobe PDFView/Open
Methodology.pdf6.07 MBAdobe PDFView/Open
Results and Discussion.pdf3.77 MBAdobe PDFView/Open
Conclusion and Recommendation.pdf1.57 MBAdobe PDFView/Open
Refference and Appendics.pdf3.1 MBAdobe PDFView/Open


Items in UniMAP Library Digital Repository are protected by copyright, with all rights reserved, unless otherwise indicated.