Is text-to-speech synthesis ready for use in computer-assisted language learning?
Text-to-speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in Computer-Assisted Language Learning (CALL) environments. Indeed, many potential benefits (ease of creation and editing of speech models, generation of spee...
Saved in:
| Published in: | Speech communication Vol. 51; no. 10; pp. 906 - 919 |
|---|---|
| Main Author: | |
| Format: | Journal Article |
| Language: | English |
| Published: |
Elsevier B.V
01.10.2009
Elsevier : North-Holland |
| Subjects: | |
| ISSN: | 0167-6393, 1872-7182 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | Text-to-speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in Computer-Assisted Language Learning (CALL) environments. Indeed, many potential benefits (ease of creation and editing of speech models, generation of speech models and feedback on demand, etc.) and uses (talking dictionaries, talking texts, dictation, pronunciation training, dialogue partner, etc.) of TTS synthesis in CALL have been put forward. Yet, the use of TTS synthesis in CALL is not widely accepted and only a few applications have found their way onto the market. One potential reason for this is that TTS synthesis has not been adequately evaluated for this purpose. Previous evaluations of TTS synthesis for use in CALL, have only addressed the comprehensibility of TTS synthesis. Yet, CALL places demands on the comprehensibility, naturalness, accuracy, register and expressiveness of the output of TTS synthesis. In this paper, the aforementioned aspects of the quality of the output of four state-of-the-art French TTS synthesis systems are evaluated with respect to their use in the three different roles that TTS synthesis systems may assume within CALL applications, namely: (1) reading machine, (2) pronunciation model and (3) conversational partner [Handley, Z., Hamel, M.-J., 2005. Establishing a methodology for benchmarking speech synthesis for computer-assisted language learning (CALL). Language Learning and Technology Journal 9(3), 99–119. Retrieved from:
http://llt.msu.edu/vol9num3/handley/default.html.]. The results of this evaluation suggest that the best TTS synthesis systems are ready for use in applications in which they ‘add value’ to CALL, i.e. exploit the unique capacity of TTS synthesis to generate speech models on demand. An example of such an application is a dialogue partner. In order to fully meet the requirements of CALL, further attention needs to be paid to accuracy and naturalness, in particular at the prosodic level, and expressiveness. |
|---|---|
| AbstractList | Text-to-speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in Computer-Assisted Language Learning (CALL) environments. Indeed, many potential benefits (ease of creation and editing of speech models, generation of speech models and feedback on demand, etc.) and uses (talking dictionaries, talking texts, dictation, pronunciation training, dialogue partner, etc.) of TTS synthesis in CALL have been put forward. Yet, the use of TTS synthesis in CALL is not widely accepted and only a few applications have found their way onto the market. One potential reason for this is that TTS synthesis has not been adequately evaluated for this purpose. Previous evaluations of TTS synthesis for use in CALL, have only addressed the comprehensibility of TTS synthesis. Yet, CALL places demands on the comprehensibility, naturalness, accuracy, register and expressiveness of the output of TTS synthesis. In this paper, the aforementioned aspects of the quality of the output of four state-of-the-art French TTS synthesis systems are evaluated with respect to their use in the three different roles that TTS synthesis systems may assume within CALL applications, namely: (1) reading machine, (2) pronunciation model and (3) conversational partner [Handley, Z., Hamel, M.-J., 2005. Establishing a methodology for benchmarking speech synthesis for computer-assisted language learning (CALL). Language Learning and Technology Journal 9(3), 99-119. Retrieved from: http://llt.msu.edu/vol9num3/handley/default.html.]. The results of this evaluation suggest that the best TTS synthesis systems are ready for use in applications in which they 'add value' to CALL, i.e. exploit the unique capacity of TTS synthesis to generate speech models on demand. An example of such an application is a dialogue partner. In order to fully meet the requirements of CALL, further attention needs to be paid to accuracy and naturalness, in particular at the prosodic level, and expressiveness. Text-to-speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in Computer-Assisted Language Learning (CALL) environments. Indeed, many potential benefits (ease of creation and editing of speech models, generation of speech models and feedback on demand, etc.) and uses (talking dictionaries, talking texts, dictation, pronunciation training, dialogue partner, etc.) of TTS synthesis in CALL have been put forward. Yet, the use of TTS synthesis in CALL is not widely accepted and only a few applications have found their way onto the market. One potential reason for this is that TTS synthesis has not been adequately evaluated for this purpose. Previous evaluations of TTS synthesis for use in CALL, have only addressed the comprehensibility of TTS synthesis. Yet, CALL places demands on the comprehensibility, naturalness, accuracy, register and expressiveness of the output of TTS synthesis. In this paper, the aforementioned aspects of the quality of the output of four state-of-the-art French TTS synthesis systems are evaluated with respect to their use in the three different roles that TTS synthesis systems may assume within CALL applications, namely: (1) reading machine, (2) pronunciation model and (3) conversational partner [Handley, Z., Hamel, M.-J., 2005. Establishing a methodology for benchmarking speech synthesis for computer-assisted language learning (CALL). Language Learning and Technology Journal 9(3), 99-119. Retrieved from: [URL:http://llt.msu.edu/vol9num3/handley/default.html].]. The results of this evaluation suggest that the best TTS synthesis systems are ready for use in applications in which they 'add value' to CALL, i.e. exploit the unique capacity of TTS synthesis to generate speech models on demand. An example of such an application is a dialogue partner. In order to fully meet the requirements of CALL, further attention needs to be paid to accuracy and naturalness, in particular at the prosodic level, and expressiveness. [Copyright Elsevier B.V.] Text-to-speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in Computer-Assisted Language Learning (CALL) environments. Indeed, many potential benefits (ease of creation and editing of speech models, generation of speech models and feedback on demand, etc.) and uses (talking dictionaries, talking texts, dictation, pronunciation training, dialogue partner, etc.) of TTS synthesis in CALL have been put forward. Yet, the use of TTS synthesis in CALL is not widely accepted and only a few applications have found their way onto the market. One potential reason for this is that TTS synthesis has not been adequately evaluated for this purpose. Previous evaluations of TTS synthesis for use in CALL, have only addressed the comprehensibility of TTS synthesis. Yet, CALL places demands on the comprehensibility, naturalness, accuracy, register and expressiveness of the output of TTS synthesis. In this paper, the aforementioned aspects of the quality of the output of four state-of-the-art French TTS synthesis systems are evaluated with respect to their use in the three different roles that TTS synthesis systems may assume within CALL applications, namely: (1) reading machine, (2) pronunciation model and (3) conversational partner [Handley, Z., Hamel, M.-J., 2005. Establishing a methodology for benchmarking speech synthesis for computer-assisted language learning (CALL). Language Learning and Technology Journal 9(3), 99–119. Retrieved from: http://llt.msu.edu/vol9num3/handley/default.html.]. The results of this evaluation suggest that the best TTS synthesis systems are ready for use in applications in which they ‘add value’ to CALL, i.e. exploit the unique capacity of TTS synthesis to generate speech models on demand. An example of such an application is a dialogue partner. In order to fully meet the requirements of CALL, further attention needs to be paid to accuracy and naturalness, in particular at the prosodic level, and expressiveness. Text-to-Speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in Computer-Assisted Language Learning (CALL) environments. Indeed, many potential benefits (ease of creation and editing of speech models, generation of speech models and feedback on demand, etc.) and uses (talking dictionaries, talking texts, dictation, pronunciation training, dialogue partner, etc.) of TTS synthesis in CALL have been put forward. Yet, the use of TTS synthesis in CALL is not widely accepted and only a few applications have found their way onto the market. One potential reason for this is that TTS synthesis has not been adequately evaluated for this purpose. Previous evaluations of TTS synthesis for use in CALL, have only addressed the comprehensibility of TTS synthesis. Yet, CALL places demands on the comprehensibility, naturalness, accuracy, register and expressiveness of the output of TTS synthesis. In this paper, the aforementioned aspects of the quality of the output of four state-of-the-art French TTS synthesis systems are evaluated with respect to their use in the three different roles that TTS synthesis systems may assume within CALL applications, namely: 1) reading machine, 2) pronunciation model and 3) conversational partner (Handley and Hamel, 2005). The results of this evaluation suggest that the best TTS synthesis systems are ready for use in applications in which they ‘add value' to CALL, i.e. exploit the unique capacity of TTS synthesis to generate speech models on demand. An example of such an application is a dialogue partner. In order to fully meet the requirements of CALL, further attention needs to be paid to accuracy and naturalness, in particular at the prosodic level, and expressiveness. |
| Author | Handley, Zöe |
| Author_xml | – sequence: 1 givenname: Zöe surname: Handley fullname: Handley, Zöe email: zoe.handley@nottingham.ac.uk organization: School of Computer Science, The University of Manchester, Lamb Building, Booth St. East, Manchester, M13 9EP, UK 1 The research was actually carried out in The School of Informatics, which has since merged with The School of Computer Science. 1 |
| BackLink | https://hal.science/hal-00558516$$DView record in HAL |
| BookMark | eNqNkUFr3DAQhUVJoJuk_6AHnwo92B3Jli330BJC0wQWSiA9C1ka7WrxWltJDtl_Xy0OPfTQ9CQYfe_Nk94FOZv8hIS8p1BRoO2nXRUPqP2-YgCioqwCaN6QFRUdKzsq2BlZZawr27qv35KLGHeQCSHYijzcxyLhcyqTL7MJ6m0Rj1PaYnSxCKjMsbA-FHPEwk1F3nGYE4ZSxXyf0BSjmjaz2mAxogqTmzZfr8i5VWPEdy_nJfl5--3x5q5c__h-f3O9LjVveCqV0GBboxUTlBsu2CCaQVsYmO476BFr1Ru0A3TUwKBszbVpwXY9ctOhbepL8nHx3apRHoLbq3CUXjl5d72WpxkA54LT9olm9sPCHoL_NWNMcu-ixjGnRz9HKXgHXc35f4A1bwWHV8G6ZbyvQWSwWUAdfIwB7Z-sFOSpPbmTS3vy1J6kLAc_ve7zXzLtkkrOTykoN74m_rKIMf__k8Mgo3Y4aTQuoE7SePdvg98p8LsU |
| CODEN | SCOMDH |
| CitedBy_id | crossref_primary_10_3390_electronics9020267 crossref_primary_10_1007_s10772_017_9463_8 crossref_primary_10_1109_ACCESS_2020_2988406 crossref_primary_10_4000_apliut_5526 crossref_primary_10_1016_j_cct_2021_106500 crossref_primary_10_1145_3276321 crossref_primary_10_1007_s00530_020_00659_4 crossref_primary_10_1080_09588221_2010_526945 crossref_primary_10_1109_TAFFC_2018_2828429 crossref_primary_10_1111_modl_12929 crossref_primary_10_1057_s41599_025_04672_8 crossref_primary_10_1080_00140139_2015_1058425 crossref_primary_10_1080_19376529_2020_1801689 crossref_primary_10_1080_09588221_2017_1422524 crossref_primary_10_1016_j_csl_2010_05_008 crossref_primary_10_1016_j_csl_2018_03_001 crossref_primary_10_1017_S0958344010000273 crossref_primary_10_1016_j_system_2025_103610 crossref_primary_10_1080_10494820_2014_917111 crossref_primary_10_1109_TLT_2020_2980261 crossref_primary_10_1080_09588221_2014_889711 |
| Cites_doi | 10.1121/1.425343 10.64152/10125/44034 10.21437/Eurospeech.1999-51 10.21437/Eurospeech.2003-10 10.1109/WSS.2002.1224411 10.1023/A:1015416013198 10.1109/TASL.2006.878306 10.21236/ADA461150 10.1121/1.424924 10.1080/0033039870240408 10.1017/S0958344000004286 10.21437/Interspeech.2005-79 10.21437/Interspeech.2005-72 10.21236/ADA459395 10.3115/991250.991307 10.1017/S0958344001000210 10.1016/0360-1315(93)90044-J 10.1023/A:1022390615396 10.1093/llc/2.2.116 10.64152/10125/25030 10.64152/10125/25032 |
| ContentType | Journal Article |
| Copyright | 2008 Elsevier B.V. Distributed under a Creative Commons Attribution 4.0 International License |
| Copyright_xml | – notice: 2008 Elsevier B.V. – notice: Distributed under a Creative Commons Attribution 4.0 International License |
| DBID | AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D 8BM 7T9 1XC VOOES |
| DOI | 10.1016/j.specom.2008.12.004 |
| DatabaseName | CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional ComDisDome Linguistics and Language Behavior Abstracts (LLBA) Hyper Article en Ligne (HAL) Hyper Article en Ligne (HAL) (Open Access) |
| DatabaseTitle | CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional ComDisDome Linguistics and Language Behavior Abstracts (LLBA) |
| DatabaseTitleList | Technology Research Database Linguistics and Language Behavior Abstracts (LLBA) ComDisDome |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Languages & Literatures Social Welfare & Social Work Psychology |
| EISSN | 1872-7182 |
| EndPage | 919 |
| ExternalDocumentID | oai:HAL:hal-00558516v1 10_1016_j_specom_2008_12_004 S0167639308001842 |
| GroupedDBID | --K --M -~X .DC .~1 07C 0R~ 123 1B1 1~. 1~5 4.4 457 4G. 53G 5VS 7-5 71M 8P~ 9JN 9JO AACTN AADFP AAEDT AAEDW AAFJI AAGJA AAGJQ AAGUQ AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN ABBOA ABFNM ABIVO ABJNI ABMAC ABMMH ABOYX ABXDB ABYKQ ACDAQ ACGFS ACNNM ACRLP ACXNI ACZNC ADBBV ADEZE ADIYS ADJOM ADMUD ADTZH AEBSH AECPX AEKER AENEX AFKWA AFTJW AFYLN AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV AKYCK ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOMHK AOUOD ASPBG AVARZ AVWKF AXJTR AZFZN BJAXD BKOJK BLXMC CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F0J F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ IHE J1W JJJVA KOM LG9 M41 MO0 N9A O-L O9- OAUVE OKEIE OZT P-8 P-9 P2P PC. PQQKQ PRBVW Q38 R2- RIG ROL RPZ SBC SDF SDG SDP SES SEW SPC SPCBC SSB SSO SST SSV SSY SSZ T5K WUQ XFK XJE ~G- 9DU AATTM AAXKI AAYWO AAYXX ABDPE ABWVN ACLOT ACRPL ACVFH ADCNI ADNMO AEIPS AEUPX AFJKZ AFPUW AGQPQ AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP CITATION EFKBS ~HD 7SC 7SP 8FD JQ2 L7M L~C L~D 8BM 7T9 1XC VOOES |
| ID | FETCH-LOGICAL-c545t-a8c0f6dca2815d582b84bcf0b2c9709ee3a9defb071d0baf35cd60f79e5d7ef43 |
| ISICitedReferencesCount | 48 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000269092500009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0167-6393 |
| IngestDate | Tue Oct 14 20:38:12 EDT 2025 Thu Oct 02 18:18:13 EDT 2025 Mon Sep 29 06:24:55 EDT 2025 Sat Sep 27 17:04:46 EDT 2025 Sat Nov 29 07:22:35 EST 2025 Tue Nov 18 22:26:51 EST 2025 Fri Feb 23 02:28:31 EST 2024 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 10 |
| Keywords | CALL Evaluation Speech synthesis TTS synthesis |
| Language | English |
| License | Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0 |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c545t-a8c0f6dca2815d582b84bcf0b2c9709ee3a9defb071d0baf35cd60f79e5d7ef43 |
| Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2 |
| OpenAccessLink | https://hal.science/hal-00558516 |
| PQID | 36259308 |
| PQPubID | 23500 |
| PageCount | 14 |
| ParticipantIDs | hal_primary_oai_HAL_hal_00558516v1 proquest_miscellaneous_85707355 proquest_miscellaneous_85356850 proquest_miscellaneous_36259308 crossref_primary_10_1016_j_specom_2008_12_004 crossref_citationtrail_10_1016_j_specom_2008_12_004 elsevier_sciencedirect_doi_10_1016_j_specom_2008_12_004 |
| PublicationCentury | 2000 |
| PublicationDate | 2009-10-01 |
| PublicationDateYYYYMMDD | 2009-10-01 |
| PublicationDate_xml | – month: 10 year: 2009 text: 2009-10-01 day: 01 |
| PublicationDecade | 2000 |
| PublicationTitle | Speech communication |
| PublicationYear | 2009 |
| Publisher | Elsevier B.V Elsevier : North-Holland |
| Publisher_xml | – name: Elsevier B.V – name: Elsevier : North-Holland |
| References | Campbell, Black (bib9) 1997 Chapelle (bib12) 2001; 23 Chapelle (bib13) 2001 Conkie, A. (1999). Robust Unit Selection System for Speech Synthesis. In Procs. Joint meeting of ASA, EAA, and DAGA. Berlin. Germany. Hamel, M.-J., 2003a. Re-using Natural Language Processing Tools in Computer Assisted Language Learning: The Experience of SAFRAN. Unpublished Doctoral Thesis. UMIST, Manchester. Hamel, M.-J., 2003b. FreeText: a “Smart” multimedia web-based computer assisted language learning environment for learners of French. In: Proceedings of the m-ICTE2003, Vol. III, Badajoz, Spain, pp. 1661–1665. Raux, A., Eskenazi, M., 2004. Using task-oriented spoken dialogues for language learning: Potential, practical application and challenges. In: Delmonte, R., Delcloque, P., Tonelli, S.(Eds.), Proceedings of the InSTIL/ICALL 2004 Symposium, Venice, Italy, pp. 147–150. Seneff, S., Wang, C., Zhang, J., 2004. Spoken conversational interaction for language learning. In: Proceedings of the InSTIL/ICALL 2004 – NLP and Speech Technologies in Language Learning Systems, Venice, Italy, pp. 151–154. ELSE (Evaluation in Language and Speech Engineering), 1999. A Blueprint for a General Infrastructure for Natural Language Processing Systems Evaluation using Semi-Automatic Quantitative Black Box Approach in a Multilingual Environment, Report No. D1.1. Sparck Jones, Galliers (bib46) 1996 Schmidt-Nielsen (bib41) 1995 Mercier, Guyomard, Siroux, Bramoullé, Gourmelon, Guillou, Lavannant (bib34) 2000 Campbell, Hamza, Höge, Tao, Bailly (bib10) 2006; 14 Francis, Nusbaum (bib23) 1999 Babel Technologies, 2003. BrightSpeech. Retrieved from Beutnagel, M., Conkie, A., Schroeter, J., Stylianou, Y., Syrdal, A., 1999. The AT&T next-gen TTS system. In: Proceedings of the Joint Meeting of the ASA, EAA, and DAGA. Berlin, Germany. . Hincks (bib30) 2002; 44 Handley, Hamel (bib28) 2005; 9 Stratil, Weston, Burkhardt (bib49) 1987; 2 Henton (bib29) 2002; 5 van Bezooijen, van Heuven (bib51) 1997 Stevens (bib47) 1989 Egan, LaRocca (bib20) 2000 Ehsani, Knodt (bib21) 1998; 2 Pisoni (bib36) 1978/9 TMA Associates, 2003. Nuance: US English. Retrieved from Handley, Z., 2006. Evaluating Text-To-Speech (TTS) Synthesis for use in Computer-Assisted Language Learning (CALL). Unpublished Doctoral Thesis. The University of Manchester. Johnson, W., Narayanan, S., Whitney, R., Das, R., Bulut, M., LaBore, C., 2002. Limited domain synthesis of expressive military speech for animated characters. In: Proceedings of the 7th International Conference on Spoken Language Processing. Denver, Colorado, USA. Chapelle (bib11) 1998; 2 Black, A., Taylor, P., 1994. CHATR: a generic speech synthesis system. In: Proceedings of the COLING 94, the 15th International Conference on Computational Linguistics, Kyoto, Japan, pp. 983–986. Hamel (bib24) 1998; 10 de Pijper (bib16) 1997 Black, A., Lenzo, K., 2000. Limited domain synthesis. In: Proceedings of the ICSLP, Beijing, China. Cohen (bib14) 1993; 21 Santiago-Oriola, C., 1999. Vocal synthesis in a computerized dictation exercise. In: Proceedings of EUROSPEECH’99, Vol. 1, Budapest, pp. 191–194. Bailly, G., Campbell, N., Mobius, B., 2003. ISCA special session: hot topics in speech synthesis. In: Proceedings of the Eurospeech, Geneva, pp. 37–40. Bennett, C., 2005. Large scale evaluation of corpus-based synthesizers: results and lessons from the blizzard challenge 2005. In: Proceedings of the INTERSPEECH, Lisbon, Portugal, pp. 105–108. Multitel, 2005. eLite Documentation. Retrieved from de Saint-Exupéry (bib17) 1999 Dutoit (bib18) 1997 Rodman (bib39) 1999 Auralog, 2002. Talk to me: the conversation method (French). (Version 3.5) from Auralog Polkosky, Lewis (bib37) 2003; 6 Stratil, Burkhardt, Jarratt, Yandle (bib48) 1987; 24 Black, A, Tokuda, K., 2005. The blizzard challenge – 2005: evaluating corpus-based speech synthesis on common datasets. In: Proceedings of the INTERSPEECH, Lisbon, Portugal, pp. 77–80. Sherwood (bib44) 1981; 3 Huang, Acero, Hon (bib31) 2001 Keller, Zellner-Keller (bib33) 2000 Edgington, M., 1997. Investigating the limitations of concatenative synthesis. In: Proceedings of the Eurospeech’97, Rhodes, Greece, pp. 1–4. Schroeter, J., Conkie, A., Syrdal, A., Beutnagel, M., Juka, M., Strom, V., Kim, M.-J., Kang, H.-G., Kapllow, D., 2002. A perspective on the next challenges for TTS research. In: Proceedings of the IEEE 2002 Workshop on Speech Synthesis, Santa Monica, California, pp. 211–214. Sobkowiak (bib45) 1998 Pisoni (10.1016/j.specom.2008.12.004_bib36) 1978 Chapelle (10.1016/j.specom.2008.12.004_bib13) 2001 Hamel (10.1016/j.specom.2008.12.004_bib24) 1998; 10 10.1016/j.specom.2008.12.004_bib15 10.1016/j.specom.2008.12.004_bib19 Chapelle (10.1016/j.specom.2008.12.004_bib12) 2001; 23 de Pijper (10.1016/j.specom.2008.12.004_bib16) 1997 10.1016/j.specom.2008.12.004_bib50 Handley (10.1016/j.specom.2008.12.004_bib28) 2005; 9 Polkosky (10.1016/j.specom.2008.12.004_bib37) 2003; 6 van Bezooijen (10.1016/j.specom.2008.12.004_bib51) 1997 Sparck Jones (10.1016/j.specom.2008.12.004_bib46) 1996 Henton (10.1016/j.specom.2008.12.004_bib29) 2002; 5 Hincks (10.1016/j.specom.2008.12.004_bib30) 2002; 44 10.1016/j.specom.2008.12.004_bib40 Sobkowiak (10.1016/j.specom.2008.12.004_bib45) 1998 de Saint-Exupéry (10.1016/j.specom.2008.12.004_bib17) 1999 Huang (10.1016/j.specom.2008.12.004_bib31) 2001 Campbell (10.1016/j.specom.2008.12.004_bib10) 2006; 14 10.1016/j.specom.2008.12.004_bib42 10.1016/j.specom.2008.12.004_bib43 Egan (10.1016/j.specom.2008.12.004_bib20) 2000 Dutoit (10.1016/j.specom.2008.12.004_bib18) 1997 Campbell (10.1016/j.specom.2008.12.004_bib9) 1997 Rodman (10.1016/j.specom.2008.12.004_bib39) 1999 10.1016/j.specom.2008.12.004_bib38 10.1016/j.specom.2008.12.004_bib35 Sherwood (10.1016/j.specom.2008.12.004_bib44) 1981; 3 10.1016/j.specom.2008.12.004_bib8 Francis (10.1016/j.specom.2008.12.004_bib23) 1999 10.1016/j.specom.2008.12.004_bib7 10.1016/j.specom.2008.12.004_bib6 Schmidt-Nielsen (10.1016/j.specom.2008.12.004_bib41) 1995 10.1016/j.specom.2008.12.004_bib32 Stevens (10.1016/j.specom.2008.12.004_bib47) 1989 10.1016/j.specom.2008.12.004_bib5 10.1016/j.specom.2008.12.004_bib4 10.1016/j.specom.2008.12.004_bib3 10.1016/j.specom.2008.12.004_bib2 10.1016/j.specom.2008.12.004_bib1 Stratil (10.1016/j.specom.2008.12.004_bib49) 1987; 2 Keller (10.1016/j.specom.2008.12.004_bib33) 2000 Chapelle (10.1016/j.specom.2008.12.004_bib11) 1998; 2 Stratil (10.1016/j.specom.2008.12.004_bib48) 1987; 24 10.1016/j.specom.2008.12.004_bib26 10.1016/j.specom.2008.12.004_bib27 Mercier (10.1016/j.specom.2008.12.004_bib34) 2000 10.1016/j.specom.2008.12.004_bib25 Cohen (10.1016/j.specom.2008.12.004_bib14) 1993; 21 Ehsani (10.1016/j.specom.2008.12.004_bib21) 1998; 2 10.1016/j.specom.2008.12.004_bib22 |
| References_xml | – reference: Edgington, M., 1997. Investigating the limitations of concatenative synthesis. In: Proceedings of the Eurospeech’97, Rhodes, Greece, pp. 1–4. – year: 1999 ident: bib39 article-title: Computer Speech Technology – volume: 21 start-page: 25 year: 1993 end-page: 30 ident: bib14 article-title: The use of a voice synthesizer in the discovery of the written language by young children publication-title: Computers in Educations. – reference: Handley, Z., 2006. Evaluating Text-To-Speech (TTS) Synthesis for use in Computer-Assisted Language Learning (CALL). Unpublished Doctoral Thesis. The University of Manchester. – volume: 2 start-page: 22 year: 1998 end-page: 34 ident: bib11 article-title: Multimedia CALL: lessons to be learned from research on instructed SLA publication-title: Language Learning and Technology – start-page: 195 year: 1995 end-page: 231 ident: bib41 article-title: Intelligibility and acceptability testing for speech technology publication-title: Applied Speech Technology – reference: Hamel, M.-J., 2003b. FreeText: a “Smart” multimedia web-based computer assisted language learning environment for learners of French. In: Proceedings of the m-ICTE2003, Vol. III, Badajoz, Spain, pp. 1661–1665. – reference: Auralog, 2002. Talk to me: the conversation method (French). (Version 3.5) from Auralog – reference: Black, A, Tokuda, K., 2005. The blizzard challenge – 2005: evaluating corpus-based speech synthesis on common datasets. In: Proceedings of the INTERSPEECH, Lisbon, Portugal, pp. 77–80. – year: 1998 ident: bib45 article-title: Speech in EFL CALL publication-title: Multimedia CALL: Theory and Practice – volume: 2 start-page: 45 year: 1998 end-page: 60 ident: bib21 article-title: Speech technology in computer-aided language learning: strengths and limitations of a new CALL paradigm publication-title: Language Learning & Technology – volume: 5 start-page: 117 year: 2002 end-page: 131 ident: bib29 article-title: Challenges and rewards in using parametric or concatenative speech synthesis publication-title: International Journal of Speech Technology – year: 1996 ident: bib46 article-title: Evaluating Natural Language Processing Systems: An Analysis and Review – reference: TMA Associates, 2003. Nuance: US English. Retrieved from – start-page: 151 year: 1978/9 end-page: 171 ident: bib36 article-title: Some measures of intelligibility and comprehension publication-title: From Text to Speech: The MITalk System – reference: Babel Technologies, 2003. BrightSpeech. Retrieved from – reference: Beutnagel, M., Conkie, A., Schroeter, J., Stylianou, Y., Syrdal, A., 1999. The AT&T next-gen TTS system. In: Proceedings of the Joint Meeting of the ASA, EAA, and DAGA. Berlin, Germany. – volume: 14 year: 2006 ident: bib10 article-title: Special section on expressive speech synthesis publication-title: IEEE Transactions on Audio, Speech and Language Processing – volume: 9 start-page: 99 year: 2005 end-page: 119 ident: bib28 article-title: Establishing a methodology for benchmarking speech synthesis for computer-assisted language learning (CALL) publication-title: Language Learning and Technology Journal – start-page: 63 year: 1999 end-page: 67 ident: bib23 article-title: Evaluating the quality of synthetic speech publication-title: Human Factors and Voice Interactive Systems – volume: 23 start-page: 3 year: 2001 end-page: 14 ident: bib12 article-title: Innovative language learning: achieving the vision publication-title: ReCALL – start-page: 4 year: 2000 end-page: 9 ident: bib20 article-title: Speech recognition in language learning: a must publication-title: Procs. InSTIL – volume: 6 start-page: 161 year: 2003 end-page: 182 ident: bib37 article-title: Expanding the MOS: development and psychometric evaluation of the MOS-R and MOS-X publication-title: International Journal of Speech Technology – year: 2001 ident: bib13 publication-title: Computer Applications in Second Language Acquisition: Foundations for Teaching, Testing and Research – start-page: 109 year: 2000 end-page: 116 ident: bib33 article-title: Speech synthesis in language learning: challenges and opportunities publication-title: Procs. InSTIL – year: 1999 ident: bib17 article-title: Le Petit Prince – year: 1997 ident: bib18 article-title: An Introduction to Text-to-Speech Synthesis – reference: . – reference: Hamel, M.-J., 2003a. Re-using Natural Language Processing Tools in Computer Assisted Language Learning: The Experience of SAFRAN. Unpublished Doctoral Thesis. UMIST, Manchester. – reference: Bailly, G., Campbell, N., Mobius, B., 2003. ISCA special session: hot topics in speech synthesis. In: Proceedings of the Eurospeech, Geneva, pp. 37–40. – start-page: 31 year: 1989 end-page: 43 ident: bib47 article-title: A direction for CALL: from behaviouristic to humanistic courseware publication-title: Teaching Languages With Computers: The Sate of the Art – reference: Black, A., Lenzo, K., 2000. Limited domain synthesis. In: Proceedings of the ICSLP, Beijing, China. – reference: Raux, A., Eskenazi, M., 2004. Using task-oriented spoken dialogues for language learning: Potential, practical application and challenges. In: Delmonte, R., Delcloque, P., Tonelli, S.(Eds.), Proceedings of the InSTIL/ICALL 2004 Symposium, Venice, Italy, pp. 147–150. – start-page: 481 year: 1997 end-page: 563 ident: bib51 article-title: Assessment of synthesis systems publication-title: Handbook of Standards and Resources for Spoke Language Systems – reference: Santiago-Oriola, C., 1999. Vocal synthesis in a computerized dictation exercise. In: Proceedings of EUROSPEECH’99, Vol. 1, Budapest, pp. 191–194. – start-page: 279 year: 1997 end-page: 292 ident: bib9 article-title: Prosody and the selection of source units for concatenative synthesis publication-title: Progress in Speech Synthesis – reference: Schroeter, J., Conkie, A., Syrdal, A., Beutnagel, M., Juka, M., Strom, V., Kim, M.-J., Kang, H.-G., Kapllow, D., 2002. A perspective on the next challenges for TTS research. In: Proceedings of the IEEE 2002 Workshop on Speech Synthesis, Santa Monica, California, pp. 211–214. – reference: Black, A., Taylor, P., 1994. CHATR: a generic speech synthesis system. In: Proceedings of the COLING 94, the 15th International Conference on Computational Linguistics, Kyoto, Japan, pp. 983–986. – reference: Seneff, S., Wang, C., Zhang, J., 2004. Spoken conversational interaction for language learning. In: Proceedings of the InSTIL/ICALL 2004 – NLP and Speech Technologies in Language Learning Systems, Venice, Italy, pp. 151–154. – reference: Johnson, W., Narayanan, S., Whitney, R., Das, R., Bulut, M., LaBore, C., 2002. Limited domain synthesis of expressive military speech for animated characters. In: Proceedings of the 7th International Conference on Spoken Language Processing. Denver, Colorado, USA. – volume: 10 start-page: 79 year: 1998 end-page: 85 ident: bib24 article-title: Les outils de TALN dans SAFRAN [NLP tools in SAFRAN] publication-title: ReCALL – volume: 24 start-page: 309 year: 1987 end-page: 316 ident: bib48 article-title: Computer-aided language learning with speech synthesis: user reactions publication-title: Programmed Learning and Educational Technology – reference: ELSE (Evaluation in Language and Speech Engineering), 1999. A Blueprint for a General Infrastructure for Natural Language Processing Systems Evaluation using Semi-Automatic Quantitative Black Box Approach in a Multilingual Environment, Report No. D1.1. – reference: Bennett, C., 2005. Large scale evaluation of corpus-based synthesizers: results and lessons from the blizzard challenge 2005. In: Proceedings of the INTERSPEECH, Lisbon, Portugal, pp. 105–108. – reference: Conkie, A. (1999). Robust Unit Selection System for Speech Synthesis. In Procs. Joint meeting of ASA, EAA, and DAGA. Berlin. Germany. – reference: Multitel, 2005. eLite Documentation. Retrieved from – year: 2001 ident: bib31 article-title: Spoken Language Processing: A Guide to Theory, Algorithm, and System Development – start-page: 145 year: 2000 end-page: 148 ident: bib34 article-title: Courseware for breton spelling pronunciation and intonation learning publication-title: Procs. InSTIL – volume: 44 start-page: 153 year: 2002 end-page: 165 ident: bib30 article-title: Speech synthesis for teaching lexical stress publication-title: TMH-QPSR – volume: 3 start-page: 175 year: 1981 end-page: 181 ident: bib44 article-title: Speech synthesis applied to language teaching publication-title: Studies in Language Learning. – start-page: 575 year: 1997 end-page: 588 ident: bib16 article-title: High-quality message-to-speech generation in a practical application publication-title: Progress in Speech Synthesis – volume: 2 start-page: 116 year: 1987 end-page: 119 ident: bib49 article-title: Exploration of foreign language speech synthesis publication-title: Literary and Linguistic Computing – ident: 10.1016/j.specom.2008.12.004_bib15 doi: 10.1121/1.425343 – volume: 9 start-page: 99 issue: 3 year: 2005 ident: 10.1016/j.specom.2008.12.004_bib28 article-title: Establishing a methodology for benchmarking speech synthesis for computer-assisted language learning (CALL) publication-title: Language Learning and Technology Journal doi: 10.64152/10125/44034 – year: 2001 ident: 10.1016/j.specom.2008.12.004_bib13 – ident: 10.1016/j.specom.2008.12.004_bib40 doi: 10.21437/Eurospeech.1999-51 – start-page: 4 year: 2000 ident: 10.1016/j.specom.2008.12.004_bib20 article-title: Speech recognition in language learning: a must – ident: 10.1016/j.specom.2008.12.004_bib22 – start-page: 151 year: 1978 ident: 10.1016/j.specom.2008.12.004_bib36 article-title: Some measures of intelligibility and comprehension – ident: 10.1016/j.specom.2008.12.004_bib26 – ident: 10.1016/j.specom.2008.12.004_bib3 doi: 10.21437/Eurospeech.2003-10 – start-page: 279 year: 1997 ident: 10.1016/j.specom.2008.12.004_bib9 article-title: Prosody and the selection of source units for concatenative synthesis – year: 1997 ident: 10.1016/j.specom.2008.12.004_bib18 – ident: 10.1016/j.specom.2008.12.004_bib42 doi: 10.1109/WSS.2002.1224411 – ident: 10.1016/j.specom.2008.12.004_bib35 – year: 1999 ident: 10.1016/j.specom.2008.12.004_bib39 – ident: 10.1016/j.specom.2008.12.004_bib50 – year: 1996 ident: 10.1016/j.specom.2008.12.004_bib46 – volume: 44 start-page: 153 year: 2002 ident: 10.1016/j.specom.2008.12.004_bib30 article-title: Speech synthesis for teaching lexical stress publication-title: TMH-QPSR – volume: 5 start-page: 117 year: 2002 ident: 10.1016/j.specom.2008.12.004_bib29 article-title: Challenges and rewards in using parametric or concatenative speech synthesis publication-title: International Journal of Speech Technology doi: 10.1023/A:1015416013198 – volume: 14 issue: 4 year: 2006 ident: 10.1016/j.specom.2008.12.004_bib10 article-title: Special section on expressive speech synthesis publication-title: IEEE Transactions on Audio, Speech and Language Processing doi: 10.1109/TASL.2006.878306 – ident: 10.1016/j.specom.2008.12.004_bib25 – ident: 10.1016/j.specom.2008.12.004_bib6 doi: 10.21236/ADA461150 – ident: 10.1016/j.specom.2008.12.004_bib5 doi: 10.1121/1.424924 – volume: 24 start-page: 309 issue: 4 year: 1987 ident: 10.1016/j.specom.2008.12.004_bib48 article-title: Computer-aided language learning with speech synthesis: user reactions publication-title: Programmed Learning and Educational Technology doi: 10.1080/0033039870240408 – year: 1999 ident: 10.1016/j.specom.2008.12.004_bib17 – start-page: 31 year: 1989 ident: 10.1016/j.specom.2008.12.004_bib47 article-title: A direction for CALL: from behaviouristic to humanistic courseware – volume: 10 start-page: 79 issue: 1 year: 1998 ident: 10.1016/j.specom.2008.12.004_bib24 article-title: Les outils de TALN dans SAFRAN [NLP tools in SAFRAN] publication-title: ReCALL doi: 10.1017/S0958344000004286 – ident: 10.1016/j.specom.2008.12.004_bib4 doi: 10.21437/Interspeech.2005-79 – ident: 10.1016/j.specom.2008.12.004_bib19 – start-page: 145 year: 2000 ident: 10.1016/j.specom.2008.12.004_bib34 article-title: Courseware for breton spelling pronunciation and intonation learning – ident: 10.1016/j.specom.2008.12.004_bib43 – ident: 10.1016/j.specom.2008.12.004_bib8 doi: 10.21437/Interspeech.2005-72 – ident: 10.1016/j.specom.2008.12.004_bib32 doi: 10.21236/ADA459395 – ident: 10.1016/j.specom.2008.12.004_bib7 doi: 10.3115/991250.991307 – start-page: 109 year: 2000 ident: 10.1016/j.specom.2008.12.004_bib33 article-title: Speech synthesis in language learning: challenges and opportunities – year: 2001 ident: 10.1016/j.specom.2008.12.004_bib31 – year: 1998 ident: 10.1016/j.specom.2008.12.004_bib45 article-title: Speech in EFL CALL – ident: 10.1016/j.specom.2008.12.004_bib1 – volume: 23 start-page: 3 issue: 10 year: 2001 ident: 10.1016/j.specom.2008.12.004_bib12 article-title: Innovative language learning: achieving the vision publication-title: ReCALL doi: 10.1017/S0958344001000210 – volume: 21 start-page: 25 issue: 1/2 year: 1993 ident: 10.1016/j.specom.2008.12.004_bib14 article-title: The use of a voice synthesizer in the discovery of the written language by young children publication-title: Computers in Educations. doi: 10.1016/0360-1315(93)90044-J – start-page: 575 year: 1997 ident: 10.1016/j.specom.2008.12.004_bib16 article-title: High-quality message-to-speech generation in a practical application – start-page: 481 year: 1997 ident: 10.1016/j.specom.2008.12.004_bib51 article-title: Assessment of synthesis systems – start-page: 63 year: 1999 ident: 10.1016/j.specom.2008.12.004_bib23 article-title: Evaluating the quality of synthetic speech – ident: 10.1016/j.specom.2008.12.004_bib27 – start-page: 195 year: 1995 ident: 10.1016/j.specom.2008.12.004_bib41 article-title: Intelligibility and acceptability testing for speech technology – volume: 6 start-page: 161 year: 2003 ident: 10.1016/j.specom.2008.12.004_bib37 article-title: Expanding the MOS: development and psychometric evaluation of the MOS-R and MOS-X publication-title: International Journal of Speech Technology doi: 10.1023/A:1022390615396 – volume: 2 start-page: 116 issue: 2 year: 1987 ident: 10.1016/j.specom.2008.12.004_bib49 article-title: Exploration of foreign language speech synthesis publication-title: Literary and Linguistic Computing doi: 10.1093/llc/2.2.116 – volume: 3 start-page: 175 year: 1981 ident: 10.1016/j.specom.2008.12.004_bib44 article-title: Speech synthesis applied to language teaching publication-title: Studies in Language Learning. – volume: 2 start-page: 22 issue: 1 year: 1998 ident: 10.1016/j.specom.2008.12.004_bib11 article-title: Multimedia CALL: lessons to be learned from research on instructed SLA publication-title: Language Learning and Technology doi: 10.64152/10125/25030 – ident: 10.1016/j.specom.2008.12.004_bib2 – volume: 2 start-page: 45 issue: 1 year: 1998 ident: 10.1016/j.specom.2008.12.004_bib21 article-title: Speech technology in computer-aided language learning: strengths and limitations of a new CALL paradigm publication-title: Language Learning & Technology doi: 10.64152/10125/25032 – ident: 10.1016/j.specom.2008.12.004_bib38 |
| SSID | ssj0004882 |
| Score | 2.184962 |
| Snippet | Text-to-speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in... Text-to-Speech (TTS) synthesis, the generation of speech from text input, offers another means of providing spoken language input to learners in... |
| SourceID | hal proquest crossref elsevier |
| SourceType | Open Access Repository Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 906 |
| SubjectTerms | CALL Computer Applications Computer Assisted Language Learning Evaluation French as a Second Language Instruction Human Computer Communication Language Teaching Materials Speech synthesis Text to Speech Systems TTS synthesis |
| Title | Is text-to-speech synthesis ready for use in computer-assisted language learning? |
| URI | https://dx.doi.org/10.1016/j.specom.2008.12.004 https://www.proquest.com/docview/36259308 https://www.proquest.com/docview/85356850 https://www.proquest.com/docview/85707355 https://hal.science/hal-00558516 |
| Volume | 51 |
| WOSCitedRecordID | wos000269092500009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: Elsevier SD Freedom Collection Journals 2021 customDbUrl: eissn: 1872-7182 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0004882 issn: 0167-6393 databaseCode: AIEXJ dateStart: 19950101 isFulltext: true titleUrlDefault: https://www.sciencedirect.com providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1bb9MwFLZKx8NeEJRbYYCFEC_IyLk4sZ9QQUMrqibQhqh4ieJL2EZJqqatxr_nOE7Sm8YAiZeoipxbvy_Hn0_OBaEXnox8KqUhqcwECTPhEQHUIb5IJVXMWkRdNZuIj4_5eCw-djqjJhdmOYnznF9eiul_hRr2Adg2dfYv4G5PCjvgN4AOW4Adtn8E_LB8ZaM5yLwg5dQYdWarEoDMs5VHQCFqF6G5KKtqIapu6kBAQ1vAdevAbPpJfNsK_Ttx51TriSUrM2ZLNlSwfbVf4N9GZsOrINr4tNrVtZPu4ryPYFVB0jiLZJzF5DFIdI9vmNS6hmxNHbpmIAWN1uZa4czljhl3HoWL1zbbtPjhIl6t09Z1Kt4qkH1ib8velRW_sGCFCXnPj5ngXbQ3GB6OP6zyZHnVPKx9jCaVsor3273WVVLlxpmNmd2auis9cnob3aoXEnjgCHAHdUzeQw9GNXolfolHbcXssof225nuZw8duKRs_MVMsnRmYGyzo5h9v4s-DUu8SSHcUghXFMJAIQwUwuc53qEQbiiEGwq9uYc-vz88fXdE6tYbRIGknpOUK5pFWqU-95hm3Jc8lCqj0lcipsKYIBXaZBIEqqYyzQKmdESzWBimY5OFwX3UzYvcPEQYVhAsCGnKFNWhZFpwLb2IqlTGgdE-66Og-ZcTVdelt-1RJkkTgHiROGxcy1TPTwCbPiLtUVNXl-Wa8XEDYFJrS6cZE-DcNUc-B7zbi9hy7EeDUWL32QJ2sGKJll4fPWvokIB9th_d0twUizIJrIMBqHn1CBDMLOKM_m5EDDMxY4_--SEeo_3VW36AuvPZwjxBN9Vyfl7OntbvyS_hWM6C |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Is+text-to-speech+synthesis+ready+for+use+in+computer-assisted+language+learning%3F&rft.jtitle=Speech+communication&rft.au=Handley%2C+Z%C3%B6e&rft.date=2009-10-01&rft.pub=Elsevier+B.V&rft.issn=0167-6393&rft.eissn=1872-7182&rft.volume=51&rft.issue=10&rft.spage=906&rft.epage=919&rft_id=info:doi/10.1016%2Fj.specom.2008.12.004&rft.externalDocID=S0167639308001842 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-6393&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-6393&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-6393&client=summon |