Piano multipitch estimation using sparse coding embedded deep learning

As the foundation of many applications, multipitch estimation problem has always been the focus of acoustic music processing; however, existing algorithms perform deficiently due to its complexity. In this paper, we employ deep learning to address piano multipitch estimation problem by proposing MPE...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:EURASIP journal on audio, speech, and music processing Ročník 2018; číslo 1; s. 1 - 23
Hlavní autori: Li, Xingda, Guan, Yujing, Wu, Yingnian, Zhang, Zhongbo
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Cham Springer International Publishing 12.09.2018
Springer Nature B.V
SpringerOpen
Predmet:
ISSN:1687-4722, 1687-4714, 1687-4722
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:As the foundation of many applications, multipitch estimation problem has always been the focus of acoustic music processing; however, existing algorithms perform deficiently due to its complexity. In this paper, we employ deep learning to address piano multipitch estimation problem by proposing MPENet based on a novel multimodal sparse incoherent non-negative matrix factorization (NMF) layer . This layer originates from a multimodal NMF problem with Lorentzian-BlockFrobenius sparsity constraint and incoherentness regularization. Experiments show that MPENet achieves state-of-the-art performance (83.65% F-measure for polyphony level 6) on RAND subset of MAPS dataset. MPENet enables NMF to do online learning and accomplishes multi-label classification by using only monophonic samples as training data. In addition, our layer algorithms can be easily modified and redeveloped for a wide variety of problems.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1687-4722
1687-4714
1687-4722
DOI:10.1186/s13636-018-0132-x