Learning and controlling the source-filter representation of speech with a variational autoencoder
| Published in: | Speech Communication Volume 148; pp. 53-65 |
|---|---|
| Main authors: | Sadok, Samir; Leglaive, Simon; Girin, Laurent; Alameda-Pineda, Xavier; Seguier, Renaud |
| Format: | Journal Article |
| Language: | Japanese |
| Published: | Elsevier BV, 2023-03-01 |
| Subjects: | |
| ISSN: | 0167-6393 |
| Author | Seguier, Renaud Leglaive, Simon Alameda-Pineda, Xavier Sadok, Samir Girin, Laurent |
|---|---|
| Author_xml | – sequence: 1 fullname: Sadok, Samir – sequence: 2 orcidid: 0000-0002-8219-1298 fullname: Leglaive, Simon – sequence: 3 fullname: Girin, Laurent – sequence: 4 fullname: Alameda-Pineda, Xavier – sequence: 5 fullname: Seguier, Renaud |
| BackLink | https://cir.nii.ac.jp/crid/1871146593093352320$$DView record in CiNii |
| BookMark | eNotj01LAzEURbOoYKv-AHdZuJ36kpdkMkspfkHBTfeSpC82ZUhKJq3-fK26uZfDhQN3wWa5ZGLsVsBSWa3h3tWvdFpKCWoJPfR6xuYgTN8ZHPCSLaZpDwDKWjlnfk2u5pQ_uMtbHkputYzjmduO-FSONVAX09io8kqHShPl5loqmZfIpwNR2PHP1Hbc8ZOr6XdyI3fHViiHsqV6zS6iGye6-e8rtnl63KxeuvXb8-vqYd3tBw2dccLhADEoDd4IiwpJhui10sGjDRa30RnpVRDRSDA_R32I2kWPpAdt8Ird_WlzSu8hnVPYXghl9IAwIGqJEvAbRhRXxw |
| ContentType | Journal Article |
| Contributor | Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ) GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP) ; GIPSA Pôle Parole et Cognition (GIPSA-PPC) ; Grenoble Images Parole Signal Automatique (GIPSA-lab) ; Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP) ; Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP) ; Université Grenoble Alpes (UGA)-Grenoble Images Parole Signal Automatique (GIPSA-lab) ; Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP) ; Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-I |
| Contributor_xml | – sequence: 1 fullname: CentraleSupélec [campus de Rennes] – sequence: 2 fullname: Institut d'Électronique et des Technologies du numéRique (IETR) ; Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) ; Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-Nantes Université - pôle Sciences et technologie ; Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ) – sequence: 3 fullname: GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP) ; GIPSA Pôle Parole et Cognition (GIPSA-PPC) ; Grenoble Images Parole Signal Automatique (GIPSA-lab) ; Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP) ; Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP) ; Université Grenoble Alpes (UGA)-Grenoble Images Parole Signal Automatique (GIPSA-lab) ; Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP) ; Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP) ; Université Grenoble Alpes (UGA) – sequence: 4 fullname: Vers des robots à l’intelligence sociale au travers de l’apprentissage, de la perception et de la commande (ROBOTLEARN) ; Inria Grenoble - Rhône-Alpes ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Grenoble Alpes (UGA) – sequence: 5 fullname: Société Française d'Acoustique (SFA) – sequence: 6 fullname: ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes – sequence: 7 fullname: ANR-19-CE33-0008,ML3RI,Apprentissage de bas-niveau d'ineractions robotiques multi-modales avec plusieurs personnes – sequence: 8 fullname: European Project: 871245,H2020-EU.2.1.1. - INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT),SPRING – sequence: 9 fullname: Leglaive, Simon – sequence: 10 fullname: GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP) ; GIPSA Pôle Parole et Cognition (GIPSA-PPC) ; Grenoble Images Parole Signal Automatique (GIPSA-lab) ; Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ) ; Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ) ; Université Grenoble Alpes (UGA)-Grenoble Images Parole Signal Automatique (GIPSA-lab) ; Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ) ; Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ) ; Université Grenoble Alpes (UGA) – sequence: 11 fullname: Institut d'Électronique et des Technologies du numéRique (IETR) – sequence: 12 fullname: Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes) – sequence: 13 fullname: Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-Nantes Université - pôle Sciences et technologie – sequence: 14 fullname: Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ) – sequence: 15 fullname: CentraleSupélec campus de Rennes – sequence: 16 fullname: GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP) – sequence: 17 fullname: GIPSA Pôle Parole et Cognition (GIPSA-PPC) – sequence: 18 fullname: Grenoble Images Parole Signal Automatique (GIPSA-lab) – sequence: 19 fullname: Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ) – sequence: 20 fullname: Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ) – sequence: 21 fullname: Université Grenoble Alpes (UGA)-Grenoble Images Parole Signal Automatique (GIPSA-lab) – sequence: 22 fullname: Université Grenoble Alpes (UGA) – sequence: 23 fullname: Vers des robots à l’intelligence sociale au travers de l’apprentissage, de la perception et de la commande (ROBOTLEARN) – sequence: 24 fullname: Inria Grenoble - Rhône-Alpes – sequence: 25 fullname: Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Grenoble Alpes (UGA) |
| DBID | RYH |
| DOI | 10.48550/arxiv.2204.07075 10.1016/j.specom.2023.02.005 |
| DatabaseName | CiNii Complete |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Languages & Literatures Social Welfare & Social Work Psychology |
| EndPage | 65 |
| GroupedDBID | --K --M -~X .DC .~1 0R~ 123 1B1 1~. 1~5 4.4 457 4G. 5VS 7-5 71M 8P~ 9JN 9JO AADFP AAEDT AAEDW AAFJI AAGJA AAGUQ AAIKJ AAKOC AALRI AAOAW AAQFI AATTM AAXKI AAXUO AAYFN AAYWO ABBOA ABIVO ABJNI ABMAC ABMMH ABOYX ACDAQ ACGFS ACLOT ACRLP ACVFH ACXNI ACZNC ADBBV ADCNI ADEZE ADTZH AEBSH AECPX AEIPS AEKER AENEX AEUPX AFJKZ AFPUW AFTJW AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIGII AIIUN AIKHN AITUG AKBMS AKRWK AKYEP ALMA_UNASSIGNED_HOLDINGS AMRAJ ANKPU AOMHK AOUOD APXCP AVARZ AXJTR BJAXD BKOJK BLXMC CS3 DU5 EBS EFJIC EFKBS EFLBG EO8 EO9 EP2 EP3 F5P FDB FIRID FNPLU FYGXN G-Q GBLVA GBOLZ IHE J1W JJJVA KOM LG9 M41 MO0 N9A O-L O9- OAUVE OKEIE OZT P-8 P-9 P2P PC. PQQKQ PRBVW Q38 ROL RPZ RYH SDF SDG SDP SES SPC SPCBC SSB SSO SST SSV SSY SSZ T5K XJE ~G- ~HD |
| ID | FETCH-LOGICAL-j950-6a1a390fc450b618343e2cfb545cb38c83dfa62b4c1f6206855bcf5afb3e59563 |
| ISSN | 0167-6393 |
| IngestDate | Mon Nov 10 09:09:35 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Language | Japanese |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-j950-6a1a390fc450b618343e2cfb545cb38c83dfa62b4c1f6206855bcf5afb3e59563 |
| ORCID | 0000-0002-8219-1298 |
| OpenAccessLink | https://cir.nii.ac.jp/crid/1871146593093352320 |
| PageCount | 13 |
| ParticipantIDs | nii_cinii_1871146593093352320 |
| PublicationCentury | 2000 |
| PublicationDate | 2023-03-01 |
| PublicationDateYYYYMMDD | 2023-03-01 |
| PublicationDate_xml | – month: 03 year: 2023 text: 2023-03-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationTitle | Speech Communication |
| PublicationYear | 2023 |
| Publisher | Elsevier BV |
| Publisher_xml | – name: Elsevier BV |
| SSID | ssj0004882 ssib017387539 ssib006543819 |
| Score | 2.4899008 |
| SourceID | nii |
| SourceType | Publisher |
| StartPage | 53 |
| SubjectTerms | [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI] [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG] [INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD] [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing Audio and Speech Processing (eess.AS) Computer Science - Machine Learning Computer Science - Sound Deep generative models Electrical Engineering and Systems Science - Audio and Speech Processing FOS: Computer and information sciences FOS: Electrical engineering, electronic engineering, information engineering Machine Learning (cs.LG) Representation learning Sound (cs.SD) Source-filter model Variational autoencoder |
| Title | Learning and controlling the source-filter representation of speech with a variational autoencoder |
| URI | https://cir.nii.ac.jp/crid/1871146593093352320 |
| Volume | 148 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 issn: 0167-6393 databaseCode: AIEXJ dateStart: 20220201 customDbUrl: isFulltext: true dateEnd: 99991231 titleUrlDefault: https://www.sciencedirect.com omitProxy: false ssIdentifier: ssj0004882 providerName: Elsevier – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 issn: 0167-6393 databaseCode: AIEXJ dateStart: 19950101 customDbUrl: isFulltext: true dateEnd: 99991231 titleUrlDefault: https://www.sciencedirect.com omitProxy: false ssIdentifier: ssj0004882 providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3JbtswECXcpIdcitTdkxQ8FL0ESiVSEqWjEaQbhCBAjdS3gqRIVI4jBbJjuP_Qj85QpBanBdoeeiEMirZkzcNwOJz3iNAbFqtEmK1_RWXoGYU5L4mV9HKYCzUDQFsW_2XGzs-T2Sy9GI1-tlyY9YKVZbLZpDf_1dTQB8Y21Nl_MHf3o9ABn8Ho0ILZof0rw2dtssMy1ppK9EVLirK5ek8XZpP8uFG0bNlHTdy4vFFKfneMt-M1LKTbZCG_XVVG9DJ35bwuoP1iv7DFM-myNjyvrmza-broioAzBRE7-NjmQnHdD_9Q1FbQwHC1B_U4EwCtyrl3AQFx3sS6M25m82HCgtC-Ystm0TomzeUwrwn-GoIluuWYrQanc61WU9hN0vZ8ifvuv1FnM5NbvSnWJ4QYDVvms9-OdVmL-YlhtFZGk4BQq94aPUC7hEUp-Mjdyaez2efeP0VGEK0L_wJGzWov7Xm4SXM4WfdnWqpmU0_4633sJnvz0O_uPzKEOmVRDEKd6T565NYoeGKx9RiN5nyMnmcus73Eb3HWiXEvx2ivm0R_jNGh5Xvjr2qhea1gbNtR1VdPkGjRiQGdeIBODOjEW-jE2-jElcYWndigE3M8QCceoPMpmr4_m55-9NwxH948jXwv5gGnqa9lGPkihhkmpIpILSC0l4ImMqG55jERoQx0TPwYXpeQOuJaUBXB6p4-QztlVaoXCBPBolAF3GymQ6CtUy15GoaxL1geSs5eoiN4qd9kYdogYYaRH6WmFgCWIZT4r_5w_QDt9YA-RDur-lYdoYdyvSqW9WuHljsDfokj |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Learning+and+controlling+the+source-filter+representation+of+speech+with+a+variational+autoencoder&rft.jtitle=Speech+Communication&rft.au=Sadok%2C+Samir&rft.au=Leglaive%2C+Simon&rft.au=Girin%2C+Laurent&rft.au=Alameda-Pineda%2C+Xavier&rft.date=2023-03-01&rft.pub=Elsevier+BV&rft.issn=0167-6393&rft.volume=148&rft.spage=53&rft.epage=65&rft_id=info:doi/10.48550%2Farxiv.2204.07075&rft_id=info:doi/10.1016%2Fj.specom.2023.02.005 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-6393&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-6393&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-6393&client=summon |