How Humans and LLMs Organize Conceptual Knowledge: Exploring Subordinate Categories in Italian

Uložené v:
Podrobná bibliografia
Názov: How Humans and LLMs Organize Conceptual Knowledge: Exploring Subordinate Categories in Italian
Autori: Pedrotti A., Rambelli G., Villani C., Bolognesi M.
Zdroj: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). :4464-4482
Publication Status: Preprint
Informácie o vydavateľovi: Association for Computational Linguistics (ACL), 2025.
Rok vydania: 2025
Predmety: FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computation and Language (cs.CL), Artificial Intelligence, Conceptual Knowledge, Knowledge Organization
Popis: People can categorize the same entity at multiple taxonomic levels, such as basic (bear), superordinate (animal), and subordinate (grizzly bear). While prior research has focused on basic-level categories, this study is the first attempt to examine the organization of categories by analyzing exemplars produced at the subordinate level. We present a new Italian psycholinguistic dataset of human-generated exemplars for 187 concrete words. We then use these data to evaluate whether textual and vision LLMs produce meaningful exemplars that align with human category organization across three key tasks: exemplar generation, category induction, and typicality judgment. Our findings show a low alignment between humans and LLMs, consistent with previous studies. However, their performance varies notably across different semantic domains. Ultimately, this study highlights both the promises and the constraints of using AI-generated exemplars to support psychological and linguistic research.
Accepted at ACL 2025
Druh dokumentu: Article
Conference object
Popis súboru: application/pdf
DOI: 10.18653/v1/2025.acl-long.224
DOI: 10.48550/arxiv.2505.21301
Prístupová URL adresa: http://arxiv.org/abs/2505.21301
https://hdl.handle.net/20.500.14243/551583
https://aclanthology.org/2025.acl-long.224/
https://doi.org/10.18653/v1/2025.acl-long.224
Rights: CC BY
Prístupové číslo: edsair.doi.dedup.....adfe94da76c24c3d468f39af01db9201
Databáza: OpenAIRE
Popis
Abstrakt:People can categorize the same entity at multiple taxonomic levels, such as basic (bear), superordinate (animal), and subordinate (grizzly bear). While prior research has focused on basic-level categories, this study is the first attempt to examine the organization of categories by analyzing exemplars produced at the subordinate level. We present a new Italian psycholinguistic dataset of human-generated exemplars for 187 concrete words. We then use these data to evaluate whether textual and vision LLMs produce meaningful exemplars that align with human category organization across three key tasks: exemplar generation, category induction, and typicality judgment. Our findings show a low alignment between humans and LLMs, consistent with previous studies. However, their performance varies notably across different semantic domains. Ultimately, this study highlights both the promises and the constraints of using AI-generated exemplars to support psychological and linguistic research.<br />Accepted at ACL 2025
DOI:10.18653/v1/2025.acl-long.224