Research on text fault recognition for on-board equipment of a C3 train control system based on an integrated XGBoost algorithm

The robust guarantee of train control on-board equipment is inextricably linked to the safe functioning of a high-speed train. A fault diagnostic model of on-board equipment is built utilizing the integrated learning XGBoost (eXtreme Gradient Boosting) algorithm to help technicians assess the malfun...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Transportation safety and environment Online Jg. 5; H. 4
Hauptverfasser: Yue, Lili, Liu, Luyue, Li, Maoqing, Xiao, Baodi, Wu, Xiaochun
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Changsha Oxford University Press 01.09.2023
Schlagworte:
ISSN:2631-4428, 2631-4428
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The robust guarantee of train control on-board equipment is inextricably linked to the safe functioning of a high-speed train. A fault diagnostic model of on-board equipment is built utilizing the integrated learning XGBoost (eXtreme Gradient Boosting) algorithm to help technicians assess the malfunction category of high-speed train control on-board equipment accurately and rapidly. The XGBoost algorithm iterates multiple decision tree models to improve the accuracy of fault diagnosis by lifting the predicted residual and adding regular terms. To begin, the text features were extracted using the improved TF-IDF (Term Frequency–Inverse Document Frequency) approach, and 24 fault feature words were chosen and converted into weight word vectors. Secondly, considering the imbalanced fault categories in the data set, the ADASYN (Adaptive Synthetic sampling) adaptive synthetically oversampling technique was used to synthesize a few category fault samples. Finally, the data samples were split into training and test sets based on the fault text data of CTCS-3 train control on-board equipment recorded by Guangzhou Railway Group maintenance personnel. The XGBoost model was utilized to realize the automatic fault location of the test set after optimized parameter tuning through grid search. Compared with other methods, the evaluation index of the XGBoost model was significantly improved. The diagnostic accuracy reached 95.43%, which verifies the effectiveness of the method in text fault diagnosis.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2631-4428
2631-4428
DOI:10.1093/tse/tdac066