Sometimes the apple does fall far from the tree: a case study on automatic indexing precision errors in PubMed

Objective: This case study identifies the presence and prevalence of precision indexing errors in a subset of automatically indexed MEDLINE records in PubMed (specifically, all MEDLINE records automatically indexed with the MeSH term Malus, the genus name for apple trees). In short, how well does au...

Full description

Saved in:
Bibliographic Details
Published in:Journal of the Medical Library Association Vol. 113; no. 4; pp. 318 - 326
Main Author: Wilson, Paije
Format: Journal Article
Language:English
Published: United States University Library System, University of Pittsburgh 23.10.2025
Subjects:
ISSN:1536-5050, 1558-9439, 1558-9439
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Objective: This case study identifies the presence and prevalence of precision indexing errors in a subset of automatically indexed MEDLINE records in PubMed (specifically, all MEDLINE records automatically indexed with the MeSH term Malus, the genus name for apple trees). In short, how well does automatic indexing compare [figurative] apples to [literal] apples?  Methods: 1,705 MEDLINE records automatically indexed with the MeSH term Malus underwent title/abstract and full text screening to determine whether they were correctly indexed (i.e., the records were about Malus, meaning they discussed the literal fruit or tree) or incorrectly indexed (i.e., they were not about Malus, meaning they did not discuss the literal fruit or tree). The context and type of indexing error were documented for each erroneously indexed record. Results: 135 (7.9%) records were incorrectly indexed with the MeSH term Malus. The most common indexing error was due to the word "apple" being used in similes, metaphors, and idioms (80, or 59.2%), with the next most common error being due to "apple" being present in a name or term (50, or 37%). Additional indexing errors were attributed to the use of "apple" in acronyms, and, in one case, a reference to Sir Isaac Newton. Conclusion: As indicated by this study's findings, automatic indexing can commit errors when indexing records that have words with non-literal or alternative meanings in their titles or abstracts. Librarians should be mindful of the existence of automatic indexing errors, and instruct authors on how best to ameliorate the effects of them within their own manuscripts.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1536-5050
1558-9439
1558-9439
DOI:10.5195/jmla.2025.2110