A Sparse Plus Low-Rank Exponential Language Model for Limited Resource Scenarios
This paper describes a new exponential language model that decomposes the model parameters into one or more low-rank matrices that learn regularities in the training data and one or more sparse matrices that learn exceptions (e.g., keywords). The low-rank matrices induce continuous-space representat...
| Published in: | IEEE/ACM transactions on audio, speech, and language processing Vol. 23; no. 3; pp. 494 - 504 |
|---|---|
| Main Authors: | Hutchinson, Brian; Ostendorf, Mari; Fazel, Maryam |
| Format: | Journal Article |
| Language: | English |
| Published: | Piscataway: IEEE, 01.03.2015; The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Subjects: | |
| ISSN: | 2329-9290, 2329-9304 |
| Abstract | This paper describes a new exponential language model that decomposes the model parameters into one or more low-rank matrices that learn regularities in the training data and one or more sparse matrices that learn exceptions (e.g., keywords). The low-rank matrices induce continuous-space representations of words and histories. The sparse matrices learn multi-word lexical items and topic/domain idiosyncrasies. This model generalizes the standard $\ell_1$-regularized exponential language model, and has an efficient accelerated first-order training algorithm. Language modeling experiments show that the approach is useful in scenarios with limited training data, including low resource languages and domain adaptation. |
|---|---|
| Author | Fazel, Maryam; Ostendorf, Mari; Hutchinson, Brian |
| Author_xml | 1. Hutchinson, Brian (brian.hutchinson@wwu.edu), Comput. Sci. Dept., Western Washington Univ., Bellingham, WA, USA; 2. Ostendorf, Mari, Electr. Eng. Dept., Univ. of Washington, Seattle, WA, USA; 3. Fazel, Maryam, Electr. Eng. Dept., Univ. of Washington, Seattle, WA, USA |
| CODEN | ITASD8 |
| ContentType | Journal Article |
| Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Mar 2015 |
| Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Mar 2015 |
| DOI | 10.1109/TASLP.2014.2379593 |
| DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE/IET Electronic Library CrossRef Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
| DatabaseTitle | CrossRef Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Computer and Information Systems Abstracts Computer and Information Systems Abstracts |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| EISSN | 2329-9304 |
| EndPage | 504 |
| ExternalDocumentID | 3611613931 10_1109_TASLP_2014_2379593 7050385 |
| Genre | orig-research |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 7 |
| ISSN | 2329-9290 |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 3 |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| LinkModel | DirectLink |
| PQID | 1660111465 |
| PQPubID | 85426 |
| PageCount | 11 |
| PublicationCentury | 2000 |
| PublicationDate | 2015-March 2015-3-00 20150301 |
| PublicationDateYYYYMMDD | 2015-03-01 |
| PublicationDate_xml | – month: 03 year: 2015 text: 2015-March |
| PublicationDecade | 2010 |
| PublicationPlace | Piscataway |
| PublicationPlace_xml | – name: Piscataway |
| PublicationTitle | IEEE/ACM transactions on audio, speech, and language processing |
| PublicationTitleAbbrev | TASLP |
| PublicationYear | 2015 |
| Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| SSID | ssj0001079974 |
| Score | 2.174668 |
| SourceID | proquest crossref ieee |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 494 |
| SubjectTerms | Adaptation models; Algorithms; Data models; exponential; History; Language model; log bilinear; low-hyphen; Mathematical models; Matrix decomposition; Natural language processing; Regularity; Representations; sparse; Sparse matrices; Speech; Training |
| Title | A Sparse Plus Low-Rank Exponential Language Model for Limited Resource Scenarios |
| URI | https://ieeexplore.ieee.org/document/7050385 https://www.proquest.com/docview/1660111465 https://www.proquest.com/docview/1677913678 |
| Volume | 23 |
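
The abstract describes scoring each word/history pair with a low-rank component plus a sparse component inside an exponential model. The record does not give the paper's equations, so the following is only a toy sketch under stated assumptions: the low-rank matrix is written as a product of hypothetical embedding tables `U` (words) and `V` (histories), the sparse matrix `S` is a dictionary of exceptions, and sparsity is illustrated with the standard soft-thresholding operator associated with an ℓ1 penalty. All names and numbers are illustrative and are not taken from the paper.

```python
import math

def soft_threshold(value, tau):
    """Proximal operator of tau * |x|: shrinks toward zero and clips small values to exactly 0."""
    if value > tau:
        return value - tau
    if value < -tau:
        return value + tau
    return 0.0

def score(word, hist, U, V, S):
    """Low-rank part (dot product of embeddings) plus a sparse correction, if one exists."""
    low_rank = sum(u * v for u, v in zip(U[word], V[hist]))
    return low_rank + S.get((word, hist), 0.0)

def next_word_probs(hist, vocab, U, V, S):
    """Exponential model: p(word | hist) is proportional to exp(score)."""
    scores = [score(w, hist, U, V, S) for w in vocab]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return {w: e / z for w, e in zip(vocab, exps)}

# Hand-picked toy values (assumptions for illustration only).
vocab = ["york", "jersey", "car"]
U = {"york": [1.0, 0.0], "jersey": [1.0, 0.0], "car": [0.0, 1.0]}  # word embeddings
V = {"new": [0.5, 0.5]}                                            # history embeddings
S_dense = {("york", "new"): 2.0, ("car", "new"): 0.05}

# Soft-thresholding zeroes the small entry, leaving only a strong exception.
S = {k: soft_threshold(v, 0.1) for k, v in S_dense.items()}
S = {k: v for k, v in S.items() if v != 0.0}

probs = next_word_probs("new", vocab, U, V, S)
```

In this sketch the embeddings alone cannot distinguish "york" from "jersey" after "new"; the single surviving entry of `S` supplies that exception, which mirrors the division of labor the abstract describes for regularities versus multi-word lexical items.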