A new distance with derivative information for functional k-means clustering algorithm

•Define a novel distance used to measure the similarity among functional samples.•Present the optimal representation of cluster centroids for the functional k-means clustering algorithm based on the new distance.•Experimental results show the effectiveness, robustness and convergence of the function...

Full description

Saved in:
Bibliographic Details
Published in:Information sciences Vol. 463-464; pp. 166 - 185
Main Authors: Meng, Yinfeng, Liang, Jiye, Cao, Fuyuan, He, Yijun
Format: Journal Article
Language:English
Published: Elsevier Inc 01.10.2018
Subjects:
ISSN:0020-0255, 1872-6291
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Define a novel distance used to measure the similarity among functional samples.•Present the optimal representation of cluster centroids for the functional k-means clustering algorithm based on the new distance.•Experimental results show the effectiveness, robustness and convergence of the functional k-means clustering algorithm based on the new distance. The functional k-means clustering algorithm is a widely used method for clustering functional data. However, with this algorithm, the derivative information is not further considered in calculating the similarity between two functional samples. In fact, the derivative information is very important for catching the trend characteristic differences among functional data. In this paper, we define a novel distance used to measure the similarity among functional samples by adding their derivative information. Furthermore, in theory, we construct cluster centroids that can minimize the objective function of the functional k-means clustering algorithm based on the proposed distance. After preprocessing functional data using three types of common basis representation techniques, we compare the clustering performance of the functional k-means clustering algorithms based on four different similarity metrics. The experiments on six data sets with class labels show the effectiveness and robustness of the functional k-means clustering algorithm with the defined distance statistically. In addition, the experimental results on three real-life data sets verify the convergence and practicability of the functional k-means clustering algorithm with the defined distance.
ISSN:0020-0255
1872-6291
DOI:10.1016/j.ins.2018.06.035