A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language

Synthetic data is artificial data that is created based on the statistical properties of the original data. The aim of this study is to generate a synthetic or simulated data for univariate circular data that follow von Mises (VM) distribution with various outliers scenario using Python programming...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of physics. Conference series Jg. 1988; H. 1; S. 12111 - 12119
Hauptverfasser:	Zulkipli, N S, Satari, S Z, Wan Yusoff, W N S
Format:	Journal Article
Sprache:	Englisch
Veröffentlicht:	Bristol IOP Publishing 01.07.2021
Schlagworte:	Data analysis Datasets Inliers (landforms) Outliers (statistics) Programming languages Python Synthetic data
ISSN:	1742-6588, 1742-6596
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Synthetic data is artificial data that is created based on the statistical properties of the original data. The aim of this study is to generate a synthetic or simulated data for univariate circular data that follow von Mises (VM) distribution with various outliers scenario using Python programming language. The procedure of formulation a synthetic data generation is proposed in this study. The synthetic data is generated from various combinations of seven sample size, n and five concentration parameters, K. Moreover, a synthetic data will be generated by formulating a data generation procedure with different condition of outliers scenarios. Three outliers scenarios are proposed in this study to introduce the outliers in synthetic dataset by placing them away from inliers at a specific distance. The number of outliers planted in the dataset are fixed with three outliers. The synthetic data is randomly generated by using Python library and package which are ‘numpy’, ‘random’ and von Mises’. In conclusion, the synthetic data of univariate circular data from von Mises distribution is generated and the outliers are successfully introduced in the dataset with three outliers scenarios using Python. This study will be valuable for those who are interested to study univariate circular data with outliers and choose Python as an analysis tool.
Bibliographie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1742-6588 1742-6596
DOI:	10.1088/1742-6596/1988/1/012111