Ranking of Bangla word graph using graph based ranking algorithms

Bibliographic details
Published in: EICT 2017: 3rd International Conference on Electrical Information and Communication Technology: 7-9 December 2017, pp. 1-5
Main author: Rafiuddin, S. M.
Format: Conference paper
Language: English
Publication details: IEEE, 1 December 2017
ISBN: 9781538623053, 1538623056
Description
Summary: Ranking words is an important way to summarize a text or to retrieve information. A word graph represents the words of a sentence or a text as vertices of a graph and shows the relationships among them. It is also useful for determining the relative importance of a word within the graph. In this research, the ranking of Bangla words is calculated by representing the words of a text as a word graph and applying various graph-based ranking algorithms. There is a lack of a standard Bangla word database, so this research uses the Indian Language POS-tag Corpora, which has a rich collection of Bangla words in the form of sentences with their part-of-speech tags. Several standard preprocessing steps are applied to every word graph before it is passed to the graph-based ranking algorithms, so that the algorithms can be compared. This paper illustrates the entire procedure for calculating the ranking of Bangla words, including the construction of the word graph from text. Experimental analysis on real data reveals the accuracy of each ranking algorithm in terms of F-measure.
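
The pipeline outlined in the summary (build a word graph from text, rank its vertices, score the result with F-measure) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the paper's implementation: the summary does not name the ranking algorithms, so PageRank stands in as one well-known graph-based ranking algorithm, and the co-occurrence window, damping factor, toy sentences, and the function names `build_word_graph`, `pagerank`, and `f_measure` are all hypothetical.

```python
from collections import defaultdict


def build_word_graph(sentences, window=1):
    """Undirected co-occurrence graph (assumed construction).

    Each distinct word is a vertex; an edge joins two words that occur
    within `window` positions of each other in the same sentence.
    """
    graph = defaultdict(set)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            graph[word]  # register the vertex even if it gets no edges
            for other in tokens[i + 1 : i + 1 + window]:
                if other != word:
                    graph[word].add(other)
                    graph[other].add(word)
    return graph


def pagerank(graph, damping=0.85, iterations=50):
    """Plain power-iteration PageRank on an undirected graph."""
    n = len(graph)
    scores = {w: 1.0 / n for w in graph}
    for _ in range(iterations):
        updated = {}
        for w in graph:
            # Each neighbour passes on an equal share of its score.
            incoming = sum(scores[v] / len(graph[v]) for v in graph[w])
            updated[w] = (1 - damping) / n + damping * incoming
        scores = updated
    return scores


def f_measure(predicted, relevant):
    """Harmonic mean of precision and recall over two word sets."""
    predicted, relevant = set(predicted), set(relevant)
    hits = len(predicted & relevant)
    if hits == 0:
        return 0.0
    precision = hits / len(predicted)
    recall = hits / len(relevant)
    return 2 * precision * recall / (precision + recall)


# Toy tokenized sentences, invented for illustration (not from the corpus).
sentences = [
    ["আমি", "ভাত", "খাই"],
    ["তুমি", "ভাত", "খাও"],
    ["আমি", "মাছ", "খাই"],
]
graph = build_word_graph(sentences)
scores = pagerank(graph)
ranked = sorted(scores, key=scores.get, reverse=True)
```

Here `ranked` lists the words from most to least important under this sketch; comparing its top entries against a hand-labelled set of important words with `f_measure` mirrors the kind of evaluation the summary mentions.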
DOI: 10.1109/EICT.2017.8275214