Neural transfer learning for assigning diagnosis codes to EMRs

Bibliographic Details
Published in:Artificial intelligence in medicine Vol. 96; pp. 116 - 122
Main Authors: Rios, Anthony; Kavuluru, Ramakanth
Format: Journal Article
Language:English
Published: Netherlands: Elsevier B.V., 01.05.2019
ISSN:0933-3657, 1873-2860
Description
Summary:
• Transfer learning using convolutional neural networks improves multi-label learning.
• Predicting MeSH terms for biomedical articles is a useful source task for EMR coding.
• Using 2 copies of source task parameters, one fixed and one tuned, helps target models.
• Using both word embeddings and convolutions from source task improves prediction.

Electronic medical records (EMRs) are manually annotated by healthcare professionals and specialized medical coders with a standardized set of alphanumeric diagnosis and procedure codes, specifically from the International Classification of Diseases (ICD). Annotating EMRs with ICD codes is important for medical billing and downstream epidemiological studies. However, manually annotating EMRs is both time-consuming and error-prone. In this paper, we explore the use of convolutional neural networks (CNNs) for automatic ICD coding. Because many codes occur infrequently, CNN performance is inhibited. Therefore, we propose supplementing EMR data with PubMed-indexed biomedical research abstracts through neural transfer learning. Transfer learning is the process of “transferring” knowledge acquired from one task (the source task) to a different (target) task. For the source task, we train a CNN to predict Medical Subject Headings (MeSH) using 1.6 million PubMed-indexed biomedical abstracts. For the target task, we train a CNN on 71,463 real-world EMRs collected from the University of Kentucky (UKY) medical center to predict ICD diagnosis codes. We introduce a simple yet effective transfer learning methodology that avoids forgetting knowledge gained from the source task. Compared to our prior work using EMRs from the UKY medical center, we improve both the micro and macro F-scores by more than 8%. Likewise, compared to other transfer learning methods, our approach results in nearly 2% improvement in macro F-score. We show that transfer learning can improve CNN performance for EMR coding in the presence of data sparsity issues. Furthermore, we find that our proposed transfer learning approach outperforms other methods with respect to macro F-score. Finally, we analyze how transfer learning impacts codes with respect to code frequency. We find that we achieve greater improvement on infrequent codes than on the most frequent codes.
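The summary reports both micro and macro F-scores and notes that gains are larger on infrequent codes. The following is a minimal Python sketch of how the two averages differ for multi-label predictions. It is an illustration under assumed toy data, not the authors' evaluation code; the function names and the labels `FREQ` and `RARE` are hypothetical placeholders, not real ICD codes.

```python
from typing import Dict, List, Set


def f1(tp: int, fp: int, fn: int) -> float:
    """F1 from raw counts; defined here as 0.0 when all counts are zero."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def micro_macro_f1(gold: List[Set[str]], pred: List[Set[str]]) -> Dict[str, float]:
    """Micro- and macro-averaged F1 over per-record label sets."""
    labels = sorted(set().union(*gold, *pred))
    if not labels:
        return {"micro": 0.0, "macro": 0.0}
    counts = {c: [0, 0, 0] for c in labels}  # [tp, fp, fn] per code
    for g, p in zip(gold, pred):
        for c in g & p:
            counts[c][0] += 1  # predicted and correct
        for c in p - g:
            counts[c][1] += 1  # predicted but not in the gold set
        for c in g - p:
            counts[c][2] += 1  # in the gold set but missed
    # Macro: average the per-code F1 scores, so every code weighs the same.
    macro = sum(f1(*counts[c]) for c in labels) / len(labels)
    # Micro: pool the counts over all codes, so frequent codes dominate.
    tp, fp, fn = (sum(col) for col in zip(*counts.values()))
    return {"micro": f1(tp, fp, fn), "macro": macro}


# Toy data (assumed): one frequent code, one rare code that is always missed.
gold = [{"FREQ"}, {"FREQ"}, {"FREQ"}, {"FREQ", "RARE"}]
pred = [{"FREQ"}, {"FREQ"}, {"FREQ"}, {"FREQ"}]
scores = micro_macro_f1(gold, pred)
print(scores)
```

In this toy case the micro F-score is 8/9 while the macro F-score is 0.5, because the single missed rare code counts as much as the frequent one in the macro average. This is why improvements on infrequent codes show up most clearly in the macro F-score.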
DOI:10.1016/j.artmed.2019.04.002