Machine Learning-Based Business Rule Engine Data Transformation over High-Speed Networks
Raw data processing is a key business operation. Business-specific rules determine how the raw data should be transformed into business-required formats. When source data continuously changes its formats and has keying errors and invalid data, then the effectiveness of the data transformation is a b...
Saved in:
| Published in: | Computer assisted methods in engineering and science Vol. 30; no. 1 |
|---|---|
| Main Authors: | , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Institute of Fundamental Technological Research Polish Academy of Sciences
2023
|
| Subjects: | |
| ISSN: | 2299-3649, 2956-5839 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Raw data processing is a key business operation. Business-specific rules determine how the raw data should be transformed into business-required formats. When source data continuously changes its formats and has keying errors and invalid data, then the effectiveness of the data transformation is a big challenge. The conventional data extraction and transformation technique produces a delay in handling such data because of continuous fluctuations in data formats and requires continuous development of a business rule engine. The best business rule engines require near real-time detection of business rule and data transformation mechanisms utilizing machine learning classification models. Since data is combined from numerous sources and older systems, it is challenging to categorize and cluster the data and apply suitable business rules to turn raw data into the businessrequired format. This paper proposes a methodology for designing ensemble machine learning techniques and approaches for classifying and segmenting registered numbers of registered title records to choose the most suitable business rule that can convert the registered number into the format the business expects, allowing businesses to provide customers with the most recent data in less time. This study evaluates the suggested model by gathering sample data and analyzing classification machine learning (ML) models to determine the relevant business rule. Experimentation employed Python, R, SQL stored procedures, Impala scripts, and Datameer tools. |
|---|---|
| ISSN: | 2299-3649 2956-5839 |
| DOI: | 10.24423/cames.472 |