Data Structures and Algorithms for k-th Nearest Neighbours Conformational Entropy Estimation

Entropy of multivariate distributions may be estimated based on the distances of nearest neighbours from each sample from a statistical ensemble. This technique has been applied on biomolecular systems for estimating both conformational and translational/rotational entropy. The degrees of freedom wh...

Full description

Saved in:
Bibliographic Details
Published in:Biophysica Vol. 2; no. 4; pp. 340 - 352
Main Authors: Borelli, Roberto, Dovier, Agostino, Fogolari, Federico
Format: Journal Article
Language:English
Published: Merced MDPI AG 01.12.2022
Subjects:
ISSN:2673-4125, 2673-4125
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Entropy of multivariate distributions may be estimated based on the distances of nearest neighbours from each sample from a statistical ensemble. This technique has been applied on biomolecular systems for estimating both conformational and translational/rotational entropy. The degrees of freedom which mostly define conformational entropy are torsion angles with their periodicity. In this work, tree structures and algorithms to quickly generate lists of nearest neighbours for periodic and non-periodic data are reviewed and applied to biomolecular conformations as described by torsion angles. The effect of dimensionality, number of samples, and number of neighbours on the computational time is assessed. The main conclusion is that using proper data structures and algorithms can greatly reduce the complexity of nearest neighbours lists generation, which is the bottleneck step in nearest neighbours entropy estimation.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2673-4125
2673-4125
DOI:10.3390/biophysica2040031