ESCOX: A tool for skill and occupation extraction using LLMs from unstructured text

ESCOX, also known as ESCOSkillExtractor, is an open-source, non-proprietary tool for identifying and classifying skills, skillsets, and occupations from job postings and general text. It utilizes the European Skills, Competences, Qualifications and Occupations (ESCO) taxonomy to structure extraction...

Full description

Saved in:
Bibliographic Details
Published in:Software impacts Vol. 25; p. 100772
Main Authors: Kavargyris, Dimitrios Christos, Georgiou, Konstantinos, Papaioannou, Eleanna, Petrakis, Konstantinos, Mittas, Nikolaos, Angelis, Lefteris
Format: Journal Article
Language:English
Published: Elsevier B.V 01.07.2025
Subjects:
ISSN:2665-9638, 2665-9638
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:ESCOX, also known as ESCOSkillExtractor, is an open-source, non-proprietary tool for identifying and classifying skills, skillsets, and occupations from job postings and general text. It utilizes the European Skills, Competences, Qualifications and Occupations (ESCO) taxonomy to structure extraction, addressing the need for taxonomy-aligned skill identification in unstructured labor market data. Developed within the SKILLAB EU Horizon project, ESCOX combines LLMs and text embeddings to map content to standardized categories. It offers a user-friendly graphical interface for researchers, educators, and HR professionals, supporting skills gap analysis, training, recruitment, and policy planning, and contributing to the development of a skills-based economy. [Display omitted] •Identifying relevant skills is essential for shaping the modern workforce.•ESCOX is an open-source tool for skill and occupation extraction using the ESCO taxonomy.•High-speed extraction enabled by precomputed embeddings and LLMs.•User-friendly interface supports use without requiring coding knowledge.•Supports skill extraction across domains with flexible preprocessing options.
ISSN:2665-9638
2665-9638
DOI:10.1016/j.simpa.2025.100772