Content-based Recommender System for Textual Documents Written in Croatian

Saved in:
Bibliographic Details
Title: Content-based Recommender System for Textual Documents Written in Croatian
Authors: Kavran, Natalija, Kavran, Zvonko, Cvitić, Ivan, Šemanjski, Ivana
Publisher Information: 2013.
Publication Year: 2013
Subject Terms: recommender system, k-nearest neighbour, text mining, content-based classification, document-term matrix
Description: The paper describes a content-based recommender system that classifies textual documents written in Croatian. We describe how documents are pre- processed, including procedures of dimensionality reduction, selection of stop- words and creation of document-term matrix. For the text classification, a combination of v- fold cross validation and k - nearest neighbours (kNN) methods is used. This way, the ‘optimal’ value of k is firstly analyzed, and the results of v-fold cross validation are applied for the selection of value k. Results are given in the form of classification error analysis.
Document Type: Conference object
Accession Number: edsair.dris...01492..28a2c381f360dd797051257b7a1d2532
Database: OpenAIRE
Description
Abstract:The paper describes a content-based recommender system that classifies textual documents written in Croatian. We describe how documents are pre- processed, including procedures of dimensionality reduction, selection of stop- words and creation of document-term matrix. For the text classification, a combination of v- fold cross validation and k - nearest neighbours (kNN) methods is used. This way, the ‘optimal’ value of k is firstly analyzed, and the results of v-fold cross validation are applied for the selection of value k. Results are given in the form of classification error analysis.