Three-Branch BERT-Based Text Classification Network for Gastroscopy Diagnosis Text

During a hospital visit, a significant volume of Gastroscopy Diagnostic Text (GDT) data are produced, representing the unstructured gastric medical records of patients undergoing gastroscopy. As such, GDTs play a crucial role in evaluating the patient’s health, shaping treatment plans, and schedulin...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:International journal of crowd science Ročník 8; číslo 1; s. 56 - 63
Hlavní autoři: Wang, Zhichao, Zheng, Xiangwei, Zhang, Jinsong, Zhang, Mingzhe
Médium: Journal Article
Jazyk:angličtina
Vydáno: Tsinghua University Press 01.03.2024
Témata:
ISSN:2398-7294, 2398-7294
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:During a hospital visit, a significant volume of Gastroscopy Diagnostic Text (GDT) data are produced, representing the unstructured gastric medical records of patients undergoing gastroscopy. As such, GDTs play a crucial role in evaluating the patient’s health, shaping treatment plans, and scheduling follow-up visits. However, given the free-text nature of GDTs, which lack a formal structure, physicians often find it challenging to extract meaningful insights from them. Furthermore, while deep learning has made significant strides in the medical domain, to our knowledge, there are not any readily available text-based pre-trained models tailored for GDT classification and analysis. To address this gap, we introduce a Bidirectional Encoder Representations from Transformers (BERT) based three-branch classification network tailored for GDTs. We leverage the robust representation capabilities of the BERT pre-trained model to deeply encode the texts. A unique three-branch decoder structure is employed to pinpoint lesion sites and determine cancer stages. Experimental outcomes validate the efficacy of our approach in GDT classification, with a precision of 0.993 and a recall of 0.784 in the early cancer category. In pinpointing cancer lesion sites, the weighted F1 score achieved was 0.849.
ISSN:2398-7294
2398-7294
DOI:10.26599/IJCS.2023.9100031