Extended Quality (eQual): Radial Threshold Clustering Based on n -ary Similarity

We are transforming Radial Threshold Clustering (RTC), an ( ) algorithm, into Extended Quality Clustering (eQual), an ( ) algorithm with several novel features. Daura et al.'s RTC algorithm is a partitioning clustering algorithm that groups similar frames together based on their similarity to t...

Full description

Saved in:
Bibliographic Details
Published in:Journal of chemical information and modeling Vol. 65; no. 10; p. 5062
Main Authors: Chen, Lexin, Smith, Micah, Roe, Daniel R, Miranda-Quintana, Ramón Alain
Format: Journal Article
Language:English
Published: United States 26.05.2025
Subjects:
ISSN:1549-960X, 1549-960X
Online Access:Get more information
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We are transforming Radial Threshold Clustering (RTC), an ( ) algorithm, into Extended Quality Clustering (eQual), an ( ) algorithm with several novel features. Daura et al.'s RTC algorithm is a partitioning clustering algorithm that groups similar frames together based on their similarity to the seed configuration. RTC has two main issues: it scales as ( ), making it inefficient for large frame counts, and its clustering results depend on the order of input frames whenever there is a tie in the most populated cluster. To address the first issue, we have increased the speed of the seed selection by using -means++ to select the seeds of the available frames. To address the second issue and make the results invariant with respect to frame order, the densest and most compact cluster is chosen using the extended similarity indices. The new algorithm is able to cluster in linear time and produce more compact and separate clusters.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1549-960X
1549-960X
DOI:10.1021/acs.jcim.4c02341