Browse

The Virtual University, Pakistan’s first University based completely on modern Information and Communication Technologies, was established by the Government as a public sector, not-for-profit institution with a clear mission: to provide extremely affordable world class education to aspiring students all over the country.

Using free-to-air satellite television broadcasts and the Internet, the Virtual University allows students to follow its rigorous programs regardless of their physical locations. It thus aims at alleviating the lack of capacity in the existing universities while simultaneously tackling the acute shortage of qualified professors in the country. By identifying the top Professors of the country, regardless of their institutional affiliations, and requesting them to develop and deliver hand-crafted courses, the Virtual University aims at providing the very best courses to not only its own students but also to students of all other universities in the country.

A STUDY OF PLAGIARISM DETECTION USING NATURAL LANGUAGE PROCESSING TECHNIQUE

Download

Author: NASREEN MALIK


Citable URI : https://vspace.vu.edu.pk/detail.aspx?id=157

Publisher : Virtual University

Date Issued: 11/13/2018 12:00:00 AM


Abstract

Now a day’s plagiarism became very common in many fields of life such as research and education. It is an illegal deed used to make others work as own property without any proper references. Plagiarism is defined as showing other’s work as your own or using/stealing other’s ideas without any permission. Due to advancement in plagiarism techniques adopted by plagiarist, it is very difficult to detect plagiarism accurately by existing techniques. Different features are observed to determine the presence of plagiarism in documents such as syntactic, lexical, semantic and structural features. Today lots of techniques are introduced to detect plagiarism i.e. string matching, a bag of words, fingerprinting, citation analysis and stylometry . Advance detectors mostly work with source code or natural language text. To detect similarity in natural language texts, detectors commonly explore the Internet. In text analysis, detectors use very easy and simple comparison procedures based on broad coverage and processing speed. This research explores new and modern plagiarism detection tasks especially text-based plagiarism detection includes monolingual plagiarism detection. The main idea behind this research is that rewritten and original text does not have similar text and differences among these documents can be explored with the help of linguistic and statistical indicators. To investigate above statement, the main research objectives are formulated as follow; a four stage novel framework for plagiarism detection is proposed. Natural Language Processing (NLP) is used by this framework instead of focusing on traditional string-matching approaches. The objective of this model is to use text pre-processing and statistical, shallow and deep linguistic techniques using a corpus-based approach. Proposed framework is tested by comparing its working theoretically with other techniques.


URI : https://vspace.vu.edu.pk/details.aspx?id=157

Citation: Malik, N(2018). A STUDY OF PLAGIARISM DETECTION USING NATURAL LANGUAGE PROCESSING TECHNIQUE. Virtual University of Pakistan.(Lahore, Pakistan).

Version : Final Version

Terms of Use :

Detailed Terms :

Journal :

Files in this item

Name Size Format
Spring 2018_CS720_ms150200150.pdf 1119kb pdf


Copyright 2016 © Virtual University of Pakistan