Automatic text summarization based on sentences clustering and extraction

Bibliographic details
Published in: 2009 2nd IEEE International Conference on Computer Science and Information Technology, pp. 167-170
Main authors: Zhang Pei-ying, Li Cun-he
Format: Conference paper
Language: English
Published: IEEE, 1 August 2009
ISBN: 1424445191, 9781424445196
Description
Summary: Automatic text summarization plays an important role in information retrieval and text classification, and may provide a solution to the information overload problem. Text summarization is the process of reducing the size of a text while preserving its information content. This paper proposes a summarization approach based on sentence clustering. The approach consists of three steps: first, it clusters the sentences of the document based on the semantic distance between them; then, within each cluster, it calculates the accumulative sentence similarity using a multi-feature combination method; finally, it chooses the topic sentences according to a set of extraction rules. The purpose of the paper is to show that the summarization result depends not only on the sentence features but also on the sentence similarity measure. Experimental results on the DUC 2003 dataset show that the proposed approach improves performance compared with other summarization methods.
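The three steps in the summary can be sketched roughly as follows. This is a minimal illustration only, not the authors' implementation: the record does not specify the semantic distance, the multi-feature combination, or the extraction rules, so the sketch assumes bag-of-words cosine similarity as the similarity measure, a greedy single-pass clustering with a hypothetical `threshold` parameter, and a stand-in rule that picks the highest-scoring sentence from each cluster. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def cosine(a, b):
    """Cosine similarity of two bag-of-words Counters (an assumed stand-in
    for the paper's semantic similarity measure)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def cluster_sentences(vectors, threshold=0.2):
    """Step 1 (assumed variant): greedy single-pass clustering. A sentence
    joins the first cluster whose seed sentence is similar enough;
    otherwise it starts a new cluster."""
    clusters = []
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vec, vectors[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def accumulative_similarity(i, cluster, vectors):
    """Step 2: sum of similarities between sentence i and the other
    sentences in its cluster."""
    return sum(cosine(vectors[i], vectors[j]) for j in cluster if j != i)


def summarize(sentences, threshold=0.2):
    """Step 3 (assumed rule): take the top-scoring sentence of each
    cluster and return the picks in document order."""
    vectors = [Counter(tokenize(s)) for s in sentences]
    clusters = cluster_sentences(vectors, threshold)
    chosen = [
        max(c, key=lambda i: accumulative_similarity(i, c, vectors))
        for c in clusters
    ]
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    document = [
        "Cats chase mice in the barn.",
        "Cats chase mice at night.",
        "Stock markets fell sharply today.",
        "Cats chase mice.",
        "Stock markets fell.",
    ]
    for sentence in summarize(document):
        print(sentence)
```

Swapping `cosine` for a different measure changes both the clusters and the per-cluster scores, which is one way to see the summary's point that the result depends on the similarity measure as well as on the sentence features.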
DOI: 10.1109/ICCSIT.2009.5234971