Comparison of Methods for Testing the Hypothesis of Independence of Random Variables Based on a Nonparametric Classifier and Pearson’s Chi-Squared Test

A technique for testing the hypothesis about the independence of random variables, based on a nonparametric pattern recognition algorithm, is used in the analysis of ambiguous dependencies. The pattern recognition algorithm meets the maximum likelihood criterion. The assessment of distribution laws...

Full description

Saved in:
Bibliographic Details
Published in:Optoelectronics, instrumentation, and data processing Vol. 59; no. 5; pp. 551 - 560
Main Authors: Lapko, A. V., Lapko, V. A., Bakhtina, A. V.
Format: Journal Article
Language:English
Published: Moscow Pleiades Publishing 01.10.2023
Subjects:
ISSN:8756-6990, 1934-7944
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A technique for testing the hypothesis about the independence of random variables, based on a nonparametric pattern recognition algorithm, is used in the analysis of ambiguous dependencies. The pattern recognition algorithm meets the maximum likelihood criterion. The assessment of distribution laws in classes is carried out using initial statistical data under the assumption of independence and dependence of the random variables being compared. To estimate probability densities in classes, nonparametric Rosenblatt–Parzen statistics are used. The blurring coefficients of kernel functions in nonparametric estimates of probability densities in classes are determined from the condition of the minimum of their standard deviations. Under these conditions, estimates of the probabilities of pattern recognition errors in classes are calculated. Based on their minimum value, a decision is made on the independence or dependence of random variables. The hypothesis about a significant difference in the probabilities of pattern recognition errors in classes is tested. The use of the proposed technique allows us to bypass the problem of decomposing the range of values of random variables into intervals, which is characteristic of the Pearson criterion. The effectiveness of the proposed method is compared with the Pearson criterion. The results of computational experiments using the studied criteria in the analysis of ambiguous dependencies between random variables are presented.
ISSN:8756-6990
1934-7944
DOI:10.3103/S8756699023050047