• Login
    View Item 
    •   DSpace Home
    • Final Year Project Papers & Reports
    • School of Computer and Communication Engineering (FYP)
    • View Item
    •   DSpace Home
    • Final Year Project Papers & Reports
    • School of Computer and Communication Engineering (FYP)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Plagiarism detection using N-Gram model

    Thumbnail
    View/Open
    Abstract,Acknowledgement.pdf (2.593Mb)
    Introduction.pdf (3.559Mb)
    Literature Review.pdf (885.5Kb)
    Methodology.pdf (5.925Mb)
    Results and Discussion.pdf (3.680Mb)
    Conclusion and Recommendation.pdf (1.535Mb)
    Refference and Appendics.pdf (3.028Mb)
    Date
    2015-06
    Author
    Muhammad Syahir, Shah Kholl Ajam
    Metadata
    Show full item record
    Abstract
    The vast increase of available documents in the World Wide Web (WWW) and the ease access to these documents has lead to a serious problem of using other’s works without giving credits. Although many methods have been developed to detect some instances of plagiarism such as changing the structure of sentences or when slightly replacing words by their synonyms, it is often hard to reveal plagiarism when the copied sentences are deliberately modified. This project proposes an algorithm for plagiarism detection by using syntactic plagiarism detection using 1-gram and 2-gram. Jaccard similarity coefficient is applied to detect similarity between documents of English corpus in engineering field by using C programming language. From the value of the results which is precision, recall and f-measure, we considered 2-gram showed the great potential for the plagiarism detection method. The 2-gram extraction achieved values 0.983 for precision, 0.380 for recall and 0.548 for f-measure compared to 1-gram extraction. Jaccard similarity coefficient incorporation with N-gram method is suitable sufficiently to be employed in the word similarity measurement. In efficiency measurement, the program performance can deal appropriately with high stability to calculate the word similarity.
    URI
    http://dspace.unimap.edu.my:80/xmlui/handle/123456789/41827
    Collections
    • School of Computer and Communication Engineering (FYP) [310]

    Atmire NV

    Perpustakaan Tuanku Syed Faizuddin Putra (PTSFP) | Send Feedback
     

     

    Browse

    All of UniMAP Library Digital RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    Atmire NV

    Perpustakaan Tuanku Syed Faizuddin Putra (PTSFP) | Send Feedback