A context aware approach for detecting elusive vandalism in Wikipedia

Tummalapenta, Raga Sowmya

A context aware approach for detecting elusive vandalism in Wikipedia

Tummalapenta, Raga Sowmya

2012

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Add to Basket

Files

Abstract

The collaborative model of Wikipedia is simple and open. This nature of Wikipedia challenges its trustworthiness, leading to vandalism. There are several current vandalism detection techniques but none of them focus on detecting elusive vandalism. This type do not contain normal characteristics of vandalism and hence difficult to detect. We have proposed multicontext aware detection techniques for determining whether an elusive edit is vandalized or not. The main idea of these techniques is to check whether an edit lies within the context of other words within a particular Wikipedia article. For the experimental purposes, we make use of a PAN corpus, which is a large collection of Wikipedia edits. Then we perform a feature extraction followed by a data trained classification using WEKA. Accuracy of our methods is calculated using f1-measure. Results show that the context aware techniques are efficient since they result in highly less number of false positives and negatives.

Details

Record ID

15543

Record Created

2024-12-05

Title

A context aware approach for detecting elusive vandalism in Wikipedia

Author

Tummalapenta, Raga Sowmya

Contributor

Ramaswamy, Lakshmish Advisor
Li, Kang Committee Member
Rasheed, Khaled Committee Member

College or School

Computer Sciences

Date

2012

Publisher

University of Georgia

Content Type

Thesis

Language

English

Dissertation/ Thesis Note

Graduate

Degree Type

Master of Science (MS)

Name of Granting Institution

University of Georgia, Winter 2012

Year Degree Granted

2012

Keywords

Wikipedia, Vandalism, Elusive Vandalism, WEKA, context, Search Engine, Accuracy

Record Appears in

Electronic Theses and Dissertations > Graduate Thesis
All Resources

System Control Number

9949333882002959

PDF

Statistics

Download Full History