Piano multipitch estimation using sparse coding embedded deep learning
As the foundation of many applications, multipitch estimation problem has always been the focus of acoustic music processing; however, existing algorithms perform deficiently due to its complexity. In this paper, we employ deep learning to address piano multipitch estimation problem by proposing MPE...
Saved in:
| Published in: | EURASIP journal on audio, speech, and music processing Vol. 2018; no. 1; pp. 1 - 23 |
|---|---|
| Main Authors: | , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Cham
Springer International Publishing
12.09.2018
Springer Nature B.V SpringerOpen |
| Subjects: | |
| ISSN: | 1687-4722, 1687-4714, 1687-4722 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | As the foundation of many applications, multipitch estimation problem has always been the focus of acoustic music processing; however, existing algorithms perform deficiently due to its complexity. In this paper, we employ deep learning to address piano multipitch estimation problem by proposing
MPENet
based on a novel
multimodal sparse incoherent non-negative matrix factorization (NMF) layer
. This layer originates from a multimodal NMF problem with Lorentzian-BlockFrobenius sparsity constraint and incoherentness regularization. Experiments show that MPENet achieves state-of-the-art performance (83.65% F-measure for polyphony level 6) on RAND subset of MAPS dataset. MPENet enables NMF to do online learning and accomplishes multi-label classification by using only monophonic samples as training data. In addition, our layer algorithms can be easily modified and redeveloped for a wide variety of problems. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 1687-4722 1687-4714 1687-4722 |
| DOI: | 10.1186/s13636-018-0132-x |