Soundprism: An Online System for Score-Informed Source Separation of Music Audio
Soundprism, as proposed in this paper, is a computer system that separates single-channel polyphonic music audio played by harmonic sources into source signals in an online fashion. It uses a musical score to guide the separation process. To the best of our knowledge, this is the first online system...
Uloženo v:
| Vydáno v: | IEEE journal of selected topics in signal processing Ročník 5; číslo 6; s. 1205 - 1215 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
New York
IEEE
01.10.2011
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Témata: | |
| ISSN: | 1932-4553, 1941-0484 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Soundprism, as proposed in this paper, is a computer system that separates single-channel polyphonic music audio played by harmonic sources into source signals in an online fashion. It uses a musical score to guide the separation process. To the best of our knowledge, this is the first online system that addresses score-informed music source separation that can be made into a real-time system. The proposed system consists of two parts: 1) a score follower that associates a score position to each time frame of the audio performance; 2) a source separator which reconstructs the source signals for each time frame, informed by the score. The score follower uses a hidden Markov approach, where each audio frame is associated with a 2-D state vector (score position and tempo). The observation model is defined as the likelihood of observing the frame given the pitches at the score position. The score position and tempo are inferred using particle filtering. In building the source separator, we first refine the score-informed pitches of the current audio frame by maximizing the multi-pitch observation likelihood. Then, the harmonics of each source's fundamental frequency are extracted to reconstruct the source signal. Overlapping harmonics between sources are identified and their energy is distributed in inverse proportion to the square of their respective harmonic number. Experiments on both synthetic and human-performed music show both the score follower and the source separator perform well. Results also show that the proposed score follower works well for highly polyphonic music with some degree of tempo variations. |
|---|---|
| AbstractList | Soundprism, as proposed in this paper, is a computer system that separates single-channel polyphonic music audio played by harmonic sources into source signals in an online fashion. It uses a musical score to guide the separation process. To the best of our knowledge, this is the first online system that addresses score-informed music source separation that can be made into a real-time system. The proposed system consists of two parts: 1) a score follower that associates a score position to each time frame of the audio performance; 2) a source separator which reconstructs the source signals for each time frame, informed by the score. The score follower uses a hidden Markov approach, where each audio frame is associated with a 2-D state vector (score position and tempo). The observation model is defined as the likelihood of observing the frame given the pitches at the score position. The score position and tempo are inferred using particle filtering. In building the source separator, we first refine the score-informed pitches of the current audio frame by maximizing the multi-pitch observation likelihood. Then, the harmonics of each source's fundamental frequency are extracted to reconstruct the source signal. Overlapping harmonics between sources are identified and their energy is distributed in inverse proportion to the square of their respective harmonic number. Experiments on both synthetic and human-performed music show both the score follower and the source separator perform well. Results also show that the proposed score follower works well for highly polyphonic music with some degree of tempo variations. |
| Author | Zhiyao Duan Pardo, B. |
| Author_xml | – sequence: 1 surname: Zhiyao Duan fullname: Zhiyao Duan email: zhiyao-duan00@gmail.com organization: Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA – sequence: 2 givenname: B. surname: Pardo fullname: Pardo, B. organization: Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA |
| BookMark | eNp9kM1P9CAQh4nRxM9_QC_Ei6euDNCWvreN8TMaTapnwtJpgmlhX2gP_veyrvHgwRMTeH4zzHNIdn3wSMgpsAUAay4f2tf2ZcEZwIJD2dQMdsgBNBIKJpXc3dSCF7IsxT45TOmdsbKuQB6QlzbMvltHl8Z_dOnpsx-cR9p-pAlH2odIWxsiFvc-1yN2NPPRZgDXJprJBU9DT5_m5Cxdzp0Lx2SvN0PCk-_ziLzdXL9e3RWPz7f3V8vHwgpeTYUpmeVccN5VgKLDpgfRG5C1NaY2KyXypeIlMJDGiFVdmqbLLwi96irBVuKIXGz7rmP4P2Oa9OiSxWEwHsOcdMMrIUBymcnzX-R73sHnz2nVMKFqDiJDfAvZGFKK2OvsZDTxQwPTG8X6S7HeKNbfinNI_QpZN31JmaJxw9_Rs23UIeLPrFKpWiguPgFMGIsU |
| CODEN | IJSTGY |
| CitedBy_id | crossref_primary_10_1186_1687_6180_2013_184 crossref_primary_10_1109_TASLP_2013_2285484 crossref_primary_10_1002_cmm4_1040 crossref_primary_10_1109_TASLP_2016_2611938 crossref_primary_10_1186_s13636_019_0168_6 crossref_primary_10_1007_s11227_016_1865_x crossref_primary_10_1007_s11042_013_1398_8 crossref_primary_10_1109_TASLP_2014_2355772 crossref_primary_10_1109_TASLP_2015_2507862 crossref_primary_10_1109_TASLP_2023_3277290 crossref_primary_10_1186_1687_6180_2014_23 crossref_primary_10_1007_s11227_018_2703_0 crossref_primary_10_1016_j_patcog_2017_09_020 crossref_primary_10_1007_s11227_020_03282_2 crossref_primary_10_1007_s11227_018_2265_1 crossref_primary_10_1109_TASLP_2024_3356980 crossref_primary_10_1162_COMJ_a_00286 crossref_primary_10_1038_s41467_020_15367_w crossref_primary_10_1186_s13636_020_00190_4 crossref_primary_10_1109_TMM_2018_2856090 crossref_primary_10_1007_s11042_018_6349_y crossref_primary_10_1109_TASLPRO_2025_3571294 crossref_primary_10_1080_09298215_2014_989174 crossref_primary_10_1109_TASLP_2015_2412464 crossref_primary_10_1007_s11227_016_1647_5 crossref_primary_10_1109_TASLP_2016_2598323 crossref_primary_10_1109_MSP_2013_2296076 crossref_primary_10_1145_2926717 crossref_primary_10_1109_TASLP_2021_3121991 crossref_primary_10_1016_j_engappai_2013_03_010 crossref_primary_10_1109_LSP_2018_2847236 |
| Cites_doi | 10.1007/s10994-006-8415-3 10.1109/TSA.2005.858005 10.1109/ICASSP.2011.5946324 10.1162/comj.2008.32.1.51 10.1109/TASL.2009.2020886 10.1109/TSA.2005.857574 10.1109/TASL.2008.919073 10.1109/TPAMI.2009.106 10.1109/78.978374 10.1109/34.761266 10.1007/978-1-4757-3437-9 10.1109/ICASSP.2009.4959972 10.1155/2007/48317 10.1109/TASL.2011.2134092 10.1109/ICASSP.2010.5496224 10.1109/ASPAA.2003.1285862 10.1109/TASL.2009.2030006 10.1109/TSA.2003.815516 10.1109/TASL.2010.2042119 10.1109/ICASSP.2006.1661258 |
| ContentType | Journal Article |
| Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Oct 2011 |
| Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Oct 2011 |
| DBID | 97E RIA RIE AAYXX CITATION 7SP 8FD H8D L7M |
| DOI | 10.1109/JSTSP.2011.2159701 |
| DatabaseName | IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Xplore CrossRef Electronics & Communications Abstracts Technology Research Database Aerospace Database Advanced Technologies Database with Aerospace |
| DatabaseTitle | CrossRef Aerospace Database Technology Research Database Advanced Technologies Database with Aerospace Electronics & Communications Abstracts |
| DatabaseTitleList | Technology Research Database Aerospace Database |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering Music |
| EISSN | 1941-0484 |
| EndPage | 1215 |
| ExternalDocumentID | 2456647341 10_1109_JSTSP_2011_2159701 5887382 |
| Genre | orig-research |
| GroupedDBID | -~X 0R~ 29I 4.4 5GY 5VS 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACIWK AENEX AETIX AGQYO AGSQL AHBIQ AKJIK AKQYR ALMA_UNASSIGNED_HOLDINGS ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD F5P HZ~ IFIPE IPLJI JAVBF LAI M43 O9- OCL RIA RIE RNS AAYXX CITATION 7SP 8FD H8D L7M RIG |
| ID | FETCH-LOGICAL-c326t-a50c22322d61e3de9f13fa147caa7ab83e3d8251014aa3b75a9daa7e1f8d630b3 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 64 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000295012900011&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1932-4553 |
| IngestDate | Tue Sep 30 20:10:03 EDT 2025 Mon Jun 30 10:17:39 EDT 2025 Sat Nov 29 03:55:50 EST 2025 Tue Nov 18 21:29:46 EST 2025 Tue Aug 26 17:17:22 EDT 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 6 |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c326t-a50c22322d61e3de9f13fa147caa7ab83e3d8251014aa3b75a9daa7e1f8d630b3 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23 |
| PQID | 890387213 |
| PQPubID | 75721 |
| PageCount | 11 |
| ParticipantIDs | proquest_miscellaneous_926331424 crossref_primary_10_1109_JSTSP_2011_2159701 ieee_primary_5887382 proquest_journals_890387213 crossref_citationtrail_10_1109_JSTSP_2011_2159701 |
| PublicationCentury | 2000 |
| PublicationDate | 2011-Oct. 2011-10-00 20111001 |
| PublicationDateYYYYMMDD | 2011-10-01 |
| PublicationDate_xml | – month: 10 year: 2011 text: 2011-Oct. |
| PublicationDecade | 2010 |
| PublicationPlace | New York |
| PublicationPlace_xml | – name: New York |
| PublicationTitle | IEEE journal of selected topics in signal processing |
| PublicationTitleAbbrev | JSTSP |
| PublicationYear | 2011 |
| Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| References | ref35 ref13 orio (ref9) 2001 ref15 orio (ref20) 2001 ref36 ref14 every (ref30) 2004 ref11 ref32 ref10 ref2 ref1 cano (ref12) 1999 raphael (ref21) 2001 macrae (ref7) 2010 goto (ref39) 2002 vercoe (ref17) 1984 grubb (ref22) 1994 yeh (ref33) 2010; 18 li (ref29) 2009; 17 ref24 ref23 ref26 cont (ref38) 2007 dannenberg (ref16) 1984 woodruff (ref4) 2006 ganseman (ref5) 2010 ref25 puckette (ref18) 1995 grubb (ref19) 1997 dixon (ref6) 2005 vincent (ref37) 0 virtanen (ref31) 2003 ref28 ref27 ref8 ref3 ganseman (ref34) 2010 |
| References_xml | – start-page: 35 year: 2003 ident: ref31 article-title: Algorithm for the separation of harmonic sounds with time-frequency smoothness constraint publication-title: Proc Int Conf Digital Audio Effects (DAFx) – ident: ref14 doi: 10.1007/s10994-006-8415-3 – start-page: 94 year: 1994 ident: ref22 article-title: Automated accompaniment of musical ensembles publication-title: Proc 10th Nat Conf Artificial Intell (AAAI-92) – start-page: 315 year: 2007 ident: ref38 article-title: Evaluation of real-time audio-to-score alignment publication-title: Proc Int Conf Music Inf Retrieval (ISMIR) – start-page: 193 year: 1984 ident: ref16 article-title: An on-line algorithm for real-time accompaniment publication-title: Proc Int Comput Music Conf (ICMC) – year: 0 ident: ref37 publication-title: BSS Oracle Toolbox Version 2 1 – ident: ref36 doi: 10.1109/TSA.2005.858005 – start-page: 199 year: 1984 ident: ref17 article-title: The synthetic performer in the context of live performance publication-title: Proc Int Comput Music Conf (ICMC) – ident: ref3 doi: 10.1109/ICASSP.2011.5946324 – ident: ref2 doi: 10.1162/comj.2008.32.1.51 – volume: 17 start-page: 1361 year: 2009 ident: ref29 article-title: Monaural musical sound separation based on pitch and common amplitude modulation publication-title: IEEE Trans Audio Speech Lang Process doi: 10.1109/TASL.2009.2020886 – start-page: 314 year: 2006 ident: ref4 article-title: Remixing stereo music with score-informed source separation publication-title: Proc Int Conf Music Inf Retrieval (ISMIR) – year: 2010 ident: ref5 article-title: Source separation by score synthesis publication-title: Proc Int Comput Music Conf (ICMC) – start-page: 197 year: 2004 ident: ref30 article-title: A spectral-filtering approach to music signal separation publication-title: Proc Int Conf Digital Audio Effects (DAFx) – ident: ref32 doi: 10.1109/TSA.2005.857574 – start-page: 155 year: 2001 ident: ref9 article-title: Alignment of monophonic and polyphonic music to a score publication-title: Proc Int Comput Music Conf (ICMC) – ident: ref1 doi: 10.1109/TASL.2008.919073 – ident: ref24 doi: 10.1109/TPAMI.2009.106 – ident: ref28 doi: 10.1109/78.978374 – year: 2001 ident: ref21 article-title: A Bayesian network for real-time musical accompaniment publication-title: Proc Adv Neural Inf Process Syst (NIPS) – start-page: 199 year: 1995 ident: ref18 article-title: Score following using the sung voice publication-title: Proc Int Comput Music Conf (ICMC) – start-page: 441 year: 1999 ident: ref12 article-title: Score-performance matching using HMMs publication-title: Proc Int Comput Music Conf (ICMC) – ident: ref13 doi: 10.1109/34.761266 – ident: ref27 doi: 10.1007/978-1-4757-3437-9 – ident: ref11 doi: 10.1109/ICASSP.2009.4959972 – start-page: 219 year: 2010 ident: ref34 article-title: Evaluation of a score-informed source separation system publication-title: Proc Int Symp Music Inf Retrieval (ISMIR) – start-page: 301 year: 1997 ident: ref19 article-title: A stochastic method of tracking a vocal performer publication-title: Proc Int Comput Music Conf (ICMC) – ident: ref25 doi: 10.1155/2007/48317 – ident: ref15 doi: 10.1109/TASL.2011.2134092 – ident: ref35 doi: 10.1109/ICASSP.2010.5496224 – ident: ref10 doi: 10.1109/ASPAA.2003.1285862 – start-page: 423 year: 2010 ident: ref7 article-title: Accurate real-time windowed time warping publication-title: Proc Int Conf Music Inf Retrieval (ISMIR) – volume: 18 start-page: 1116 year: 2010 ident: ref33 article-title: Multiple fundamental frequency estimation and polyphony inference of polyphonic music signals publication-title: IEEE Trans Audio Speech Lang Process doi: 10.1109/TASL.2009.2030006 – ident: ref26 doi: 10.1109/TSA.2003.815516 – year: 2001 ident: ref20 article-title: Score following using spectral analysis and hidden markov models publication-title: Proc Int Comput Music Conf (ICMC) – start-page: 287 year: 2002 ident: ref39 article-title: RWC music database: Popular, classical, and jazz music databases publication-title: Proc Int Conf Music Inf Retrieval (ISMIR) – ident: ref8 doi: 10.1109/TASL.2010.2042119 – ident: ref23 doi: 10.1109/ICASSP.2006.1661258 – start-page: 92 year: 2005 ident: ref6 article-title: Live tracking of musical performances using on-line time warping publication-title: Proc Int Conf Digital Audio Effects (DAFx) |
| SSID | ssj0057614 |
| Score | 2.3112166 |
| Snippet | Soundprism, as proposed in this paper, is a computer system that separates single-channel polyphonic music audio played by harmonic sources into source signals... |
| SourceID | proquest crossref ieee |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 1205 |
| SubjectTerms | Followers Frames Harmonic analysis Harmonics Hidden Markov models Instruments Mathematical model Multi-pitch estimation Music On-line systems Online online algorithm Real time systems score following Separation Separators Source separation Studies |
| Title | Soundprism: An Online System for Score-Informed Source Separation of Music Audio |
| URI | https://ieeexplore.ieee.org/document/5887382 https://www.proquest.com/docview/890387213 https://www.proquest.com/docview/926331424 |
| Volume | 5 |
| WOSCitedRecordID | wos000295012900011&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Xplore customDbUrl: eissn: 1941-0484 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0057614 issn: 1932-4553 databaseCode: RIE dateStart: 20070101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NS8MwFH_M6UEPfmyK84scvGlck7RN422Iw9MYVGG3kiYpDKSVbfXvN0nbISiCt9KkobyXl_eSl_f7AdxaF0-V9WRYJizEoY4FFpIJzKlmSuWiCKnyZBN8NksWCzHvwf22FsYY4y-fmQf36HP5ulK1OyobR9YiWGIX3B3O46ZWq1t1bdhM2gwyxWEUsa5AJhBjO8XTeYPWaR2c4C0BTOeEPKvKj6XY-5fp0f_-7BgO2zgSTRrFn0DPlAM4-IYuOIBdz-E8hHnqqJM83uEjmpSoQRdFDVY5skErSh2WJW4qk4xGqT_QR6lpcMGrElUF8oOhSa2X1Sm8TZ9fn15wS6SAlY3ONlhGgbJhAKU6JoZpIwrCCklCrqTkMk-YfelKWO12SUqW80gKbVsMKRIdsyBnZ9Avq9KcA9IkUFzHSVCoPJRhIYWKtVIOpVnnOtQjIJ1kM9WijDuyi_fM7zYCkXltZE4bWauNEdxtv_loMDb-7D108t_2bEU_gstOgVlrhussES47TwkbAdq2WvtxSRFZmqpeZ4LGjLlyv4vfx72Efdpd-yNX0N-sanMNe-pzs1yvbvwc_AKWTNlD |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8QwEB5EBfXgW1yfOXjTaPPoI94WcVlRl4UqeAtpkoIgrbi7_n6TtF0ERfBWmjSUmUxmksl8H8CZc_FUO0-GVcY45iYRWCgmcEoN07oQJac6kE2ko1H28iLGC3Axr4Wx1obLZ_bSP4Zcvqn1zB-VXcXOIljmFtylmHMaNdVa3brrAmfS5pAp5nHMuhKZSFy5SZ6PG7xO5-JE2lLAdG4o8Kr8WIyDhxls_O_fNmG9jSRRv1H9FizYahvWvuELbsNSYHHegXHuyZMC4uE16leowRdFDVo5cmEryj2aJW5qk6xBeTjSR7ltkMHrCtUlCoOh_sy81rvwPLh9uhnilkoBaxefTbGKI-0CAUpNQiwzVpSElYrwVCuVqiJj7qUvYnUbJqVYkcZKGNdiSZmZhEUF24PFqq7sPiBDIp2aJItKXXDFSyV0YrT2OM2mMNz0gHSSlbrFGfd0F28y7DciIYM2pNeGbLXRg_P5N-8NysafvXe8_Oc9W9H34LBToGwNcSIz4fPzlLAeoHmrsyCfFlGVrWcTKWjCmC_4O_h93FNYGT49PsiHu9H9IazS7hIgOYLF6cfMHsOy_py-Tj5Ownz8AgoS3Io |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Soundprism%3A+An+Online+System+for+Score-Informed+Source+Separation+of+Music+Audio&rft.jtitle=IEEE+journal+of+selected+topics+in+signal+processing&rft.au=Duan%2C+Zhiyao&rft.au=Pardo%2C+Bryan&rft.date=2011-10-01&rft.issn=1932-4553&rft.eissn=1941-0484&rft.volume=5&rft.issue=6&rft.spage=1205&rft.epage=1215&rft_id=info:doi/10.1109%2FJSTSP.2011.2159701&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_JSTSP_2011_2159701 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1932-4553&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1932-4553&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1932-4553&client=summon |