Automatic music transcription for traditional woodwind instruments sopele

•Prospects of sopele woodwind instrument AMT are inspected on a newly acquired dataset.•Unwanted pitch variation is mitigated using DFT and supervised machine learning.•DFT-coupled RF and CNN models achieve F1=0.92 in the polyphonic setup.•A full-stack system for effortless music preservation of sop...

Full description

Saved in:
Bibliographic Details
Published in:Pattern recognition letters Vol. 128; pp. 340 - 347
Main Authors: Skoki, Arian, Ljubic, Sandi, Lerga, Jonatan, Štajduhar, Ivan
Format: Journal Article
Language:English
Published: Amsterdam Elsevier B.V 01.12.2019
Elsevier Science Ltd
Subjects:
ISSN:0167-8655, 1872-7344
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Prospects of sopele woodwind instrument AMT are inspected on a newly acquired dataset.•Unwanted pitch variation is mitigated using DFT and supervised machine learning.•DFT-coupled RF and CNN models achieve F1=0.92 in the polyphonic setup.•A full-stack system for effortless music preservation of sopele pieces is presented.•The system performs reasonably well for transcribing sopele traditional music pieces. Sopela is a traditional hand-made woodwind instrument, commonly played in pair, characteristic to the Istrian peninsula in western Croatia. Its piercing sound, accompanied by two-part singing in the hexatonic Istrian scale, is registered in the UNESCO Representative List of the Intangible Cultural Heritage of Humanity. This paper presents an insight study of automatic music transcription (AMT) for sopele tunes. The process of converting audio inputs into human-readable musical scores involves multi-pitch detection and note tracking. The proposed solution supports this process by utilising frequency-feature extraction, supervised machine learning (ML) algorithms, and postprocessing heuristics. We determined the most favourable tone-predicting model by applying grid search for two state-of-the-art ML techniques, optionally coupled with frequency-feature extraction. The model achieved promising transcription accuracy for both monophonic and polyphonic music sources encompassed in the originally developed dataset. In addition, we developed a proof-of-concept AMT system, comprised of a client mobile application and a server-side API. While the mobile application records, tags and uploads audio sources, the back-end server applies the presented procedure for converting recorded music into a common notation to be delivered as a transcription result. We thus demonstrate how collecting and preserving traditional sopele music, performed in real-life occasions, can be effortlessly accomplished on-the-go.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2019.09.024