Topic summarisation on tweets
| Published in: | 2015 Seventh International Conference on Advanced Computing (ICoAC) pp. 1 - 6 |
|---|---|
| Main Authors: | , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: | IEEE, 01.12.2015 |
| Subjects: | |
| Summary: | The sheer volume of tweets generated by a microblogging site like Twitter makes it difficult for a user or data analyst to summarize the content they need: a stream can hold millions of tweets with enormous redundancy and a large amount of noise. To summarize Twitter data efficiently and improve retrieval of a required topic, this work focuses on topic summarization over tweet streams, which performs better when processed on a distributed system. It enhances the Weighted PageRank algorithm, which considers both the inlink and the outlink values of tweets, to summarize tweets efficiently, and compares it with the existing similarity approach based on the cosine formulation. It also presents timeline generation for tweets, which extends the summarization so that tweet content can be analysed over time. To account for the special features of Twitter posts, the work includes a performance analysis that shows an improvement over the existing system. Experimental results on a frequency measure show that the Weighted PageRank approach is more efficient than the existing cosine similarity computation. |
| DOI: | 10.1109/ICoAC.2015.7562805 |
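
The summary contrasts two scoring ideas: cosine similarity between tweets, and a Weighted PageRank that uses both inlink and outlink values. The sketch below illustrates both in their commonly cited textbook forms; it is not the authors' enhanced algorithm, which this record does not describe. The function names, the whitespace tokenizer, the damping factor `d=0.85`, the iteration count, and the example link graph are all assumptions made for illustration, and the record does not say how links between tweets are derived.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity of two tweets over raw term counts (assumed whitespace tokens)."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = (math.sqrt(sum(c * c for c in va.values()))
            * math.sqrt(sum(c * c for c in vb.values())))
    return dot / norm if norm else 0.0


def weighted_pagerank(links, d=0.85, iterations=50):
    """Weighted PageRank on a directed graph given as {node: [nodes it links to]}.

    Each link u -> v is weighted by v's share of inlinks and of outlinks
    among all nodes that u links to (standard formulation, assumed here).
    """
    nodes = set(links) | {v for targets in links.values() for v in targets}
    out_links = {n: list(links.get(n, [])) for n in nodes}
    in_links = {n: [] for n in nodes}
    for u, targets in out_links.items():
        for v in targets:
            in_links[v].append(u)
    in_deg = {n: len(in_links[n]) for n in nodes}
    out_deg = {n: len(out_links[n]) for n in nodes}

    def w_in(u, v):
        total = sum(in_deg[p] for p in out_links[u])
        return in_deg[v] / total if total else 0.0

    def w_out(u, v):
        total = sum(out_deg[p] for p in out_links[u])
        return out_deg[v] / total if total else 0.0

    rank = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Synchronous update: every new score is computed from the previous round.
        rank = {
            v: (1 - d) + d * sum(rank[u] * w_in(u, v) * w_out(u, v)
                                 for u in in_links[v])
            for v in nodes
        }
    return rank


# Hypothetical link graph between three tweets (e.g. reply or retweet edges).
example_links = {"t1": ["t3"], "t2": ["t3"], "t3": ["t1"]}
ranks = weighted_pagerank(example_links)
top_tweets = sorted(ranks, key=ranks.get, reverse=True)
```

In this sketch the highest-ranked tweets would be the candidates for a summary, whereas a cosine-based baseline would instead compare tweets pairwise by their term overlap.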