Comparison of Methods for Testing the Hypothesis of Independence of Random Variables Based on a Nonparametric Classifier and Pearson’s Chi-Squared Test

Bibliographic details
Published in: Optoelectronics, instrumentation, and data processing, Vol. 59, No. 5, pp. 551-560
Main authors: Lapko, A. V., Lapko, V. A., Bakhtina, A. V.
Format: Journal Article
Language: English
Publication details: Moscow: Pleiades Publishing, 1 October 2023
ISSN: 8756-6990, 1934-7944
Description
Summary: A technique for testing the hypothesis about the independence of random variables, based on a nonparametric pattern recognition algorithm, is used in the analysis of ambiguous dependencies. The pattern recognition algorithm meets the maximum likelihood criterion. The assessment of distribution laws in classes is carried out using initial statistical data under the assumption of independence and dependence of the random variables being compared. To estimate probability densities in classes, nonparametric Rosenblatt–Parzen statistics are used. The blurring coefficients of kernel functions in nonparametric estimates of probability densities in classes are determined from the condition of the minimum of their standard deviations. Under these conditions, estimates of the probabilities of pattern recognition errors in classes are calculated. Based on their minimum value, a decision is made on the independence or dependence of random variables. The hypothesis about a significant difference in the probabilities of pattern recognition errors in classes is tested. The use of the proposed technique allows us to bypass the problem of decomposing the range of values of random variables into intervals, which is characteristic of the Pearson criterion. The effectiveness of the proposed method is compared with that of the Pearson criterion. The results of computational experiments using the studied criteria in the analysis of ambiguous dependencies between random variables are presented.
DOI: 10.3103/S8756699023050047