Automatic music transcription for traditional woodwind instruments sopele

•Prospects of sopele woodwind instrument AMT are inspected on a newly acquired dataset.•Unwanted pitch variation is mitigated using DFT and supervised machine learning.•DFT-coupled RF and CNN models achieve F1=0.92 in the polyphonic setup.•A full-stack system for effortless music preservation of sop...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Pattern recognition letters Ročník 128; s. 340 - 347
Hlavní autori: Skoki, Arian, Ljubic, Sandi, Lerga, Jonatan, Štajduhar, Ivan
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Amsterdam Elsevier B.V 01.12.2019
Elsevier Science Ltd
Predmet:
ISSN:0167-8655, 1872-7344
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:•Prospects of sopele woodwind instrument AMT are inspected on a newly acquired dataset.•Unwanted pitch variation is mitigated using DFT and supervised machine learning.•DFT-coupled RF and CNN models achieve F1=0.92 in the polyphonic setup.•A full-stack system for effortless music preservation of sopele pieces is presented.•The system performs reasonably well for transcribing sopele traditional music pieces. Sopela is a traditional hand-made woodwind instrument, commonly played in pair, characteristic to the Istrian peninsula in western Croatia. Its piercing sound, accompanied by two-part singing in the hexatonic Istrian scale, is registered in the UNESCO Representative List of the Intangible Cultural Heritage of Humanity. This paper presents an insight study of automatic music transcription (AMT) for sopele tunes. The process of converting audio inputs into human-readable musical scores involves multi-pitch detection and note tracking. The proposed solution supports this process by utilising frequency-feature extraction, supervised machine learning (ML) algorithms, and postprocessing heuristics. We determined the most favourable tone-predicting model by applying grid search for two state-of-the-art ML techniques, optionally coupled with frequency-feature extraction. The model achieved promising transcription accuracy for both monophonic and polyphonic music sources encompassed in the originally developed dataset. In addition, we developed a proof-of-concept AMT system, comprised of a client mobile application and a server-side API. While the mobile application records, tags and uploads audio sources, the back-end server applies the presented procedure for converting recorded music into a common notation to be delivered as a transcription result. We thus demonstrate how collecting and preserving traditional sopele music, performed in real-life occasions, can be effortlessly accomplished on-the-go.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2019.09.024