Topic summarisation on tweets

Due to the sheer volume of tweets generated by a micro blog site like Twitter, it is often difficult to summarize the required content of the user or the data analyst to evaluate the stream of Twitter data from tweets in million amount which contain enormous redundancy and amount of noise is large....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:2015 Seventh International Conference on Advanced Computing (ICoAC) S. 1 - 6
Hauptverfasser: Divyasree, T., Sujatha, P. Kola
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 01.12.2015
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Due to the sheer volume of tweets generated by a micro blog site like Twitter, it is often difficult to summarize the required content of the user or the data analyst to evaluate the stream of Twitter data from tweets in million amount which contain enormous redundancy and amount of noise is large. In an attempt to efficiently summarize the Twitter data and achieve better retrieval of the required topic, this research work focuses on the topic summarization on tweet streams, which produce better performance enhancement when processed in the distributed system. This work enhances the Weighted PageRank algorithm which considers both the Inlink and the Outlink value of the Tweets and Summarises the tweets efficiently and compare it with existing concept of similarity which was concerned with the mathematical cosine formulation. It also presents a timeline generation of tweets which enhance the summarization to analyses the tweet content according to the time. In order to enhance a special feature of Twitter posts, this work includes an effective analysis of performance results, that gives more improvement than the existing system. The experimental results on frequency measure when comparing it with the Weighted PageRank algorithm shows more efficiency than the existing cosine similarity computation.
DOI:10.1109/ICoAC.2015.7562805