Analyzing & Visualizing Conversational Behavior of Microblogs

Author: SAMEERA SALEEM


Citable URI : https://vspace.vu.edu.pk/detail.aspx?id=31

Publisher : Virtual University of Pakistan

Date Issued: 2/17/2015


Abstract

The World Wide Web and its millions of Internet applications have revolutionized the way people communicate, socialize, stay healthy, learn, conduct business, engage in politics, and do just about anything else. The huge amount of data generated by these applications is also proving valuable for analysis, evaluation and prediction. Such predictions support organizational planning and management with an acceptable degree of accuracy. This thesis is a study of several current research efforts on the extraction and analysis of short text messages, or microblogs, from a social-science perspective. Specifically, it investigates linguistic-frequency and sentiment analysis of microblogs in general by identifying, collating and visualizing the conversational and behavioral biases of people in a very large Twitter dataset of more than 6 million tweets. The study explores short-text analysis using freely available text-processing and visual-analysis tools. A novel contribution of this work is an investigation of the effectiveness of automated labeling of tweets in place of manual labeling. A comparison of the automatically labelled tweets with the manually labelled STS-Gold baseline dataset shows a very small deficiency of 6-8%, making our method viable for very large datasets. The main result of this study is a framework that encapsulates three current techniques for analyzing and visualizing microblogs. Frequency measurements and classification were performed using the NLTK text-processing toolkit. Sentiment analysis was carried out using NLTK and WEKA. Network graphs built in Gephi were used to visualize user trends and behaviors, using metrics from network theory.
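As a rough illustration of the kind of pipeline the abstract describes (token frequency counts, automated sentiment labeling, and comparison against manually labelled data), the following is a minimal sketch only. It is not the thesis's method: it uses the Python standard library rather than NLTK or WEKA so that it runs without extra dependencies, and the word lists, sample tweets, manual labels, and all function names are hypothetical.

```python
import re
from collections import Counter

# Hypothetical toy lexicons (assumptions for illustration, not from the thesis).
POSITIVE = {"good", "great", "love", "happy"}
NEGATIVE = {"bad", "awful", "hate", "sad"}


def tokenize(text):
    """Lowercase a tweet and split it into word tokens.

    @mentions and #hashtags are kept as single tokens.
    """
    return re.findall(r"[@#]?\w+", text.lower())


def auto_label(text):
    """Label a tweet by counting lexicon hits (a naive stand-in for a classifier)."""
    tokens = tokenize(text)
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def agreement(auto, manual):
    """Fraction of tweets where the automatic label matches the manual one."""
    matches = sum(a == m for a, m in zip(auto, manual))
    return matches / len(manual)


# Hypothetical sample data; the manual labels play the role of a gold set.
tweets = [
    "I love this phone, great battery",
    "awful service, I hate waiting",
    "Happy to see @friend at the #conference",
    "this is not good",
]
manual_labels = ["positive", "negative", "positive", "negative"]

# Linguistic-frequency step: count every token across the corpus.
word_counts = Counter(t for tweet in tweets for t in tokenize(tweet))

# Automated labeling step, then comparison with the manual labels.
auto_labels = [auto_label(t) for t in tweets]
score = agreement(auto_labels, manual_labels)

print(word_counts.most_common(3))
print(auto_labels, score)
```

The last sample tweet shows why such a comparison matters: a lexicon count ignores the negation in "not good" and mislabels the tweet, so automatic labels generally fall somewhat short of manual ones. The agreement figure on this toy data is illustrative only and is unrelated to the 6-8% deficiency reported in the abstract.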


URI : https://vspace.vu.edu.pk/details.aspx?id=31

Citation: Saleem, S. (2015). Analyzing & Visualizing Conversational Behavior of Microblogs. Virtual University of Pakistan (Lahore, Pakistan).

Version : Final Version

Terms of Use : All the material and results are copyright of Virtual University of Pakistan

Files in this item

Name | Size | Format
Fall 2014_CS720_ms080400023.pdf | 1886 KB | pdf

