Extracting parallel phrases from comparable data for machine translation
Mining parallel data from comparable corpora is a promising approach for overcoming the data sparseness in statistical machine translation and other natural language processing applications. In this paper, we address the task of detecting parallel phrase pairs embedded in comparable sentence pairs....
Uloženo v:
| Vydáno v: | Natural language engineering Ročník 22; číslo 4; s. 549 - 573 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Cambridge, UK
Cambridge University Press
01.07.2016
|
| Témata: | |
| ISSN: | 1351-3249, 1469-8110 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Mining parallel data from comparable corpora is a promising approach for overcoming the data sparseness in statistical machine translation and other natural language processing applications. In this paper, we address the task of detecting parallel phrase pairs embedded in comparable sentence pairs. We present a novel phrase alignment approach that is designed to only align parallel sections bypassing non-parallel sections of the sentence. We compare the proposed approach with two other alignment methods: (1) the standard phrase extraction algorithm, which relies on the Viterbi path of the word alignment, (2) a binary classifier to detect parallel phrase pairs when presented with a large collection of phrase pair candidates. We evaluate the accuracy of these approaches using a manually aligned data set, and show that the proposed approach outperforms the other two approaches. Finally, we demonstrate the effectiveness of the extracted phrase pairs by using them in Arabic–English and Urdu–English translation systems, which resulted in improvements upto 1.2 Bleu over the baseline. The main contributions of this paper are two-fold: (1) novel phrase alignment algorithms to extract parallel phrase pairs from comparable sentences, (2) evaluating the utility of the extracted phrases by using them directly in the MT decoder. |
|---|---|
| AbstractList | Mining parallel data from comparable corpora is a promising approach for overcoming the data sparseness in statistical machine translation and other natural language processing applications. In this paper, we address the task of detecting parallel phrase pairs embedded in comparable sentence pairs. We present a novel phrase alignment approach that is designed to only align parallel sections bypassing non-parallel sections of the sentence. We compare the proposed approach with two other alignment methods: (1) the standard phrase extraction algorithm, which relies on the Viterbi path of the word alignment, (2) a binary classifier to detect parallel phrase pairs when presented with a large collection of phrase pair candidates. We evaluate the accuracy of these approaches using a manually aligned data set, and show that the proposed approach outperforms the other two approaches. Finally, we demonstrate the effectiveness of the extracted phrase pairs by using them in Arabic-English and Urdu-English translation systems, which resulted in improvements upto 1.2 Bleu over the baseline. The main contributions of this paper are two-fold: (1) novel phrase alignment algorithms to extract parallel phrase pairs from comparable sentences, (2) evaluating the utility of the extracted phrases by using them directly in the MT decoder. |
| Author | HEWAVITHARANA, SANJIKA VOGEL, STEPHAN |
| Author_xml | – sequence: 1 givenname: SANJIKA surname: HEWAVITHARANA fullname: HEWAVITHARANA, SANJIKA email: shewavit@bbn.com organization: 1Raytheon BBN Technologies, Cambridge, MA 02138, USA email: shewavit@bbn.com – sequence: 2 givenname: STEPHAN surname: VOGEL fullname: VOGEL, STEPHAN email: svogel@qf.org.qa organization: 2Qatar Computing Research Institute, Doha, Qatar email: svogel@qf.org.qa |
| BookMark | eNp9kEFLAzEQhYNUsFV_gLeA59VMs9k0RynVCgUP6nmZZJN2y-5mTbag_97U9iCKnhLy3pd58yZk1PnOEnIF7AYYyNtn4AL4NFdQMMaAqxMyhrxQ2QyAjdI9ydlePyOTGLfJk4PMx2S5eB8CmqHu1rTHgE1jG9pvAkYbqQu-pca3e0E3llY4IHU-0BbNpu4sTWgXGxxq312QU4dNtJfH85y83i9e5sts9fTwOL9bZYaDHDKli8JqjXrqBFayEmaGKj2wAhwqoXLFkUktRYGsMtKi08BQMilFxYWd8XNyffi3D_5tZ-NQbv0udGlkCVJJEHlaPrnkwWWCjzFYV5p6-MqZItdNCazc11b-qi2R8IPsQ91i-PiX4UcGWx3qam2_hfqT-gRyBoBs |
| CitedBy_id | crossref_primary_10_1145_3168054 crossref_primary_10_1162_tacl_a_00075 |
| Cites_doi | 10.3115/980845.980916 10.1109/NLPKE.2003.1275968 10.21437/ICSLP.2002-181 10.1162/089120105775299168 10.3115/1075096.1075117 10.3115/1620853.1620881 10.1017/S135132491100026X 10.1162/089120103322711578 10.3115/1557769.1557821 10.3115/1220175.1220186 10.3115/1034678.1034756 10.1007/s10590-011-9089-6 10.1007/978-3-642-20128-8_10 10.3115/981658.981709 10.3115/1075096.1075106 |
| ContentType | Journal Article |
| Copyright | Copyright © Cambridge University Press 2016 |
| Copyright_xml | – notice: Copyright © Cambridge University Press 2016 |
| DBID | AAYXX CITATION 3V. 7T9 7XB 88G 8AL 8FE 8FG 8FI 8FJ 8FK ABJCF ABUWG AEUYN AFKRA ALSLI ARAPS AZQEC BENPR BGLVJ CCPQU CPGLG CRLPW DWQXO FYUFA GHDGH GNUQQ HCIFZ JQ2 K7- L6V M0N M2M M7S P5Z P62 PHGZM PHGZT PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PRQQA PSYQQ PTHSS Q9U |
| DOI | 10.1017/S1351324916000139 |
| DatabaseName | CrossRef ProQuest Central (Corporate) Linguistics and Language Behavior Abstracts (LLBA) ProQuest Central (purchase pre-March 2016) Psychology Database (Alumni) Computing Database (Alumni Edition) ProQuest SciTech Collection ProQuest Technology Collection Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest One Sustainability (subscription) ProQuest Central UK/Ireland Social Science Premium Collection Advanced Technologies & Computer Science Collection ProQuest Central Essentials ProQuest Central Technology collection ProQuest One Community College Linguistics Collection Linguistics Database ProQuest Central Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Central Student SciTech Premium Collection ProQuest Computer Science Collection Computer Science Database ProQuest Engineering Collection Computing Database Psychology Database Engineering Database Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Premium ProQuest One Academic (New) ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) One Applied & Life Sciences ProQuest One Academic (retired) ProQuest One Academic UKI Edition ProQuest Central China One Social Sciences ProQuest One Psychology Engineering collection ProQuest Central Basic |
| DatabaseTitle | CrossRef ProQuest One Psychology Computer Science Database ProQuest Central Student Technology Collection ProQuest One Academic Middle East (New) ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Central China ProQuest Central ProQuest One Applied & Life Sciences Linguistics Collection ProQuest One Sustainability ProQuest Engineering Collection Health Research Premium Collection ProQuest Central Korea ProQuest Central (New) Engineering Collection Advanced Technologies & Aerospace Collection Social Science Premium Collection ProQuest Computing Engineering Database ProQuest One Social Sciences ProQuest Central Basic ProQuest Computing (Alumni Edition) ProQuest One Academic Eastern Edition Linguistics and Language Behavior Abstracts (LLBA) ProQuest Hospital Collection ProQuest Technology Collection Health Research Premium Collection (Alumni) ProQuest Psychology Journals (Alumni) ProQuest SciTech Collection ProQuest Hospital Collection (Alumni) Advanced Technologies & Aerospace Database ProQuest Psychology Journals ProQuest One Academic UKI Edition Linguistics Database Materials Science & Engineering Collection ProQuest One Academic ProQuest Central (Alumni) ProQuest One Academic (New) |
| DatabaseTitleList | ProQuest One Psychology CrossRef |
| Database_xml | – sequence: 1 dbid: BENPR name: ProQuest Central url: https://www.proquest.com/central sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| DocumentTitleAlternate | S. Hewavitharana and S. Vogel Extracting parallel phrases from comparable data |
| EISSN | 1469-8110 |
| EndPage | 573 |
| ExternalDocumentID | 4089667251 10_1017_S1351324916000139 |
| GeographicLocations | United States--US Qatar Japan |
| GeographicLocations_xml | – name: Qatar – name: United States--US – name: Japan |
| GroupedDBID | -1D -1F -2P -2V -E. -~6 -~N .DC .FH 09C 09D 0E1 0R~ 123 29M 3V. 4.4 5VS 6~7 6~8 74X 74Y 7~V 8FE 8FG 8FI 8FJ 8I0 8R4 8R5 9M5 AAAZR AABES AABWE AACJB AACJH AAFUK AAGFV AAKTX AALKF AANRG AAPYI AARAB AASVR AATMM AAUIS AAUKB AAYOK ABBXD ABBZL ABCFY ABITZ ABIVO ABJCF ABJNI ABJWI ABKKG ABLJU ABMWE ABQTM ABQWD ABROB ABTCQ ABTND ABUWG ABVFV ABVKB ABVZP ABXAU ABZCX ABZUI ACABY ACAJB ACBMC ACDLN ACETC ACGFS ACHQT ACIMK ACIWK ACRPL ACUIJ ACYZP ACZBM ACZBN ACZUX ACZWT ADBBV ADCGK ADDNB ADFEC ADKIL ADNMO ADOVH ADTCA ADVJH AEBAK AEBPU AEFOJ AEHGV AEMFK AEMTW AENCP AENEX AENGE AEUYN AEYYC AFFUJ AFKQG AFKRA AFKRZ AFLOS AFLVW AFUTZ AFZFC AGABE AGBYD AGHGI AGJUD AGLWM AHQXX AHRGI AIGNW AIHIV AIOIP AISIE AJ7 AJCYY AJPFC AJQAS AKZCZ ALIPV ALMA_UNASSIGNED_HOLDINGS ALSLI ALVPG ALWZO ANFVQ AOWSX AQJOH ARABE ARAPS ARZZG ATUCA AUXHV AVDNQ AYIQA AZQEC BBLKV BBQHK BCGOX BENPR BESQT BGHMG BGLVJ BJBOZ BLZWO BMAJL BPHCQ BQFHP BVXVI C0O CAG CBIIA CCPQU CCQAD CCTKK CCUQV CDIZJ CFAFE CFBFF CGMFO CGQII CHEAL CJCSC COF CPGLG CRLPW CS3 DC4 DOHLZ DU5 DWQXO EBS ED0 EGQIC EJD FYUFA GNUQQ HCIFZ HG- HOVLH HSS HST HZ~ I.5 I.6 I.7 I.9 IH6 IOEEP IOO IS6 I~P J36 J38 J3A JHPGK JOSPZ JPPIE JQKCU JRMXA K6V K7- KAFGG KCGVB KFECR L6V L98 LHUNA LW7 M-V M0N M2M M7S M7~ M8. NIKVX NMFBF NZEOI O9- OYBOY P2P P62 PQQKQ PROAC PSYQQ PTHSS PYCCK Q2X RAMDC RCA RIG ROL RR0 S6- S6U SAAAG T9M UKHRP UT1 WFFJZ WQ3 WXS WXU WYP ZJOSE ZMEZD ZYDXJ ~A4 ~V1 AAKNA AAYXX ABGDZ ABXHF ACEJA ACQPF ADMLS AFFHD AGQPQ AGTDA AKMAY ANOYL CITATION IPYYG PHGZM PHGZT PQGLB PRQQA 7T9 7XB 8AL 8FK JQ2 PKEHL PQEST PQUKI PRINS Q9U |
| ID | FETCH-LOGICAL-c317t-9b66ebbab2f5ad7d5c8a9ebb061fa959493a07b756a0dc7eafb10a70775d35e83 |
| IEDL.DBID | M7S |
| ISICitedReferencesCount | 7 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000379138800004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1351-3249 |
| IngestDate | Mon Nov 10 07:01:17 EST 2025 Sat Nov 29 01:32:22 EST 2025 Tue Nov 18 22:14:09 EST 2025 Tue Jan 21 06:20:16 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 4 |
| Language | English |
| License | https://www.cambridge.org/core/terms |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c317t-9b66ebbab2f5ad7d5c8a9ebb061fa959493a07b756a0dc7eafb10a70775d35e83 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| PQID | 1797154139 |
| PQPubID | 30339 |
| PageCount | 25 |
| ParticipantIDs | proquest_journals_1797154139 crossref_citationtrail_10_1017_S1351324916000139 crossref_primary_10_1017_S1351324916000139 cambridge_journals_10_1017_S1351324916000139 |
| PublicationCentury | 2000 |
| PublicationDate | 20160700 2016-07-00 20160701 |
| PublicationDateYYYYMMDD | 2016-07-01 |
| PublicationDate_xml | – month: 07 year: 2016 text: 20160700 |
| PublicationDecade | 2010 |
| PublicationPlace | Cambridge, UK |
| PublicationPlace_xml | – name: Cambridge, UK – name: Cambridge |
| PublicationTitle | Natural language engineering |
| PublicationTitleAlternate | Nat. Lang. Eng |
| PublicationYear | 2016 |
| Publisher | Cambridge University Press |
| Publisher_xml | – name: Cambridge University Press |
| References | Crammer, Dekel, Keshet, Shalev-Shwartz, Singer 2006; 7 Bourdaillet, Huet, Langlais, Lapalme 2010; 24 Brown, Della Pietra, Della Pietra, Mercer 1993; 19 Tillmann, Hewavitharana 2013; 19 Munteanu, Marcu 2005; 31 Resnik, Smith 2003; 29 Brown (S1351324916000139_ref003) 1993; 19 S1351324916000139_ref005 S1351324916000139_ref027 S1351324916000139_ref006 S1351324916000139_ref028 S1351324916000139_ref007 S1351324916000139_ref029 S1351324916000139_ref008 S1351324916000139_ref009 Crammer (S1351324916000139_ref004) 2006; 7 S1351324916000139_ref030 S1351324916000139_ref010 S1351324916000139_ref011 S1351324916000139_ref012 S1351324916000139_ref013 S1351324916000139_ref014 S1351324916000139_ref015 S1351324916000139_ref016 S1351324916000139_ref017 S1351324916000139_ref018 S1351324916000139_ref019 S1351324916000139_ref020 S1351324916000139_ref021 S1351324916000139_ref022 S1351324916000139_ref023 S1351324916000139_ref001 S1351324916000139_ref024 S1351324916000139_ref002 S1351324916000139_ref025 S1351324916000139_ref026 |
| References_xml | – volume: 7 start-page: 551 issue: (March) year: 2006 article-title: Online passive-agressive algorithms publication-title: Journal of Machine Learning Research – volume: 19 start-page: 263 issue: (2) year: 1993 end-page: 311 article-title: The mathematics of statistical machine translation: parameter estimation publication-title: Computational Linguistics – volume: 29 start-page: 349 issue: (3) year: 2003 article-title: The web as a parallel corpus publication-title: Computational Linguistics – volume: 19 start-page: 33 issue: (01) year: 2013 end-page: 60 article-title: A unified alignment algorithm for bilingual data publication-title: Natural Language Engineering – volume: 24 start-page: 241 issue: (3–4) year: 2010 article-title: TransSearch: from a bilingual concordancer to a translation finder publication-title: Machine Translation – volume: 31 start-page: 477 issue: (4) year: 2005 end-page: 504 article-title: Improving machine translation performance by exploiting non-parallel corpora publication-title: Computational Linguistics – ident: S1351324916000139_ref013 – ident: S1351324916000139_ref011 – ident: S1351324916000139_ref005 – ident: S1351324916000139_ref006 doi: 10.3115/980845.980916 – ident: S1351324916000139_ref027 doi: 10.1109/NLPKE.2003.1275968 – ident: S1351324916000139_ref007 – ident: S1351324916000139_ref009 – ident: S1351324916000139_ref028 – ident: S1351324916000139_ref030 doi: 10.21437/ICSLP.2002-181 – ident: S1351324916000139_ref001 – ident: S1351324916000139_ref022 – ident: S1351324916000139_ref014 doi: 10.1162/089120105775299168 – ident: S1351324916000139_ref016 doi: 10.3115/1075096.1075117 – volume: 19 start-page: 263 year: 1993 ident: S1351324916000139_ref003 article-title: The mathematics of statistical machine translation: parameter estimation publication-title: Computational Linguistics – ident: S1351324916000139_ref025 doi: 10.3115/1620853.1620881 – ident: S1351324916000139_ref017 – ident: S1351324916000139_ref023 doi: 10.1017/S135132491100026X – ident: S1351324916000139_ref021 doi: 10.1162/089120103322711578 – ident: S1351324916000139_ref012 doi: 10.3115/1557769.1557821 – ident: S1351324916000139_ref015 doi: 10.3115/1220175.1220186 – ident: S1351324916000139_ref020 doi: 10.3115/1034678.1034756 – ident: S1351324916000139_ref029 – ident: S1351324916000139_ref008 – ident: S1351324916000139_ref002 doi: 10.1007/s10590-011-9089-6 – ident: S1351324916000139_ref010 doi: 10.1007/978-3-642-20128-8_10 – ident: S1351324916000139_ref024 doi: 10.1017/S135132491100026X – ident: S1351324916000139_ref019 doi: 10.3115/981658.981709 – volume: 7 start-page: 551 year: 2006 ident: S1351324916000139_ref004 article-title: Online passive-agressive algorithms publication-title: Journal of Machine Learning Research – ident: S1351324916000139_ref018 – ident: S1351324916000139_ref026 doi: 10.3115/1075096.1075106 |
| SSID | ssj0004174 |
| Score | 2.10275 |
| Snippet | Mining parallel data from comparable corpora is a promising approach for overcoming the data sparseness in statistical machine translation and other natural... |
| SourceID | proquest crossref cambridge |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 549 |
| SubjectTerms | Algorithms Alignment Arabic language Bilingualism Candidates Computational linguistics Data mining Data processing English language Extraction Hindi language International conferences Language Linguistics Machine translation Natural language processing Parallel corpora Sentences Similarity measures Translation Translation systems Translations Urdu language |
| Title | Extracting parallel phrases from comparable data for machine translation |
| URI | https://www.cambridge.org/core/product/identifier/S1351324916000139/type/journal_article https://www.proquest.com/docview/1797154139 |
| Volume | 22 |
| WOSCitedRecordID | wos000379138800004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVPQU databaseName: Advanced Technologies & Aerospace Database customDbUrl: eissn: 1469-8110 dateEnd: 20241213 omitProxy: false ssIdentifier: ssj0004174 issn: 1351-3249 databaseCode: P5Z dateStart: 20010301 isFulltext: true titleUrlDefault: https://search.proquest.com/hightechjournals providerName: ProQuest – providerCode: PRVPQU databaseName: Computer Science Database customDbUrl: eissn: 1469-8110 dateEnd: 20241213 omitProxy: false ssIdentifier: ssj0004174 issn: 1351-3249 databaseCode: K7- dateStart: 20010301 isFulltext: true titleUrlDefault: http://search.proquest.com/compscijour providerName: ProQuest – providerCode: PRVPQU databaseName: Engineering Database customDbUrl: eissn: 1469-8110 dateEnd: 20241213 omitProxy: false ssIdentifier: ssj0004174 issn: 1351-3249 databaseCode: M7S dateStart: 20010301 isFulltext: true titleUrlDefault: http://search.proquest.com providerName: ProQuest – providerCode: PRVPQU databaseName: Linguistics Database customDbUrl: eissn: 1469-8110 dateEnd: 20241213 omitProxy: false ssIdentifier: ssj0004174 issn: 1351-3249 databaseCode: CRLPW dateStart: 20010301 isFulltext: true titleUrlDefault: https://search.proquest.com/linguistics providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: eissn: 1469-8110 dateEnd: 20241213 omitProxy: false ssIdentifier: ssj0004174 issn: 1351-3249 databaseCode: BENPR dateStart: 20010301 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: Psychology Database customDbUrl: eissn: 1469-8110 dateEnd: 20241213 omitProxy: false ssIdentifier: ssj0004174 issn: 1351-3249 databaseCode: M2M dateStart: 20010301 isFulltext: true titleUrlDefault: https://www.proquest.com/psychology providerName: ProQuest |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3NS8MwFH_o5sGL8xOnc-TgSQx2_UpzEpWNgW4MvxheSpqmItRurlX8833px9wQdvGSQ9pA6HvJ79fkvd8DOFW26yhPWNR0AoPakc0oR1-ipol4FyGBDvIDt-c7Nhx64zEflQduaRlWWe2J-UYdTqQ-I79Ax2EI90hYLqcfVFeN0rerZQmNdahrlYROHrr38JsXWagw6yJ0FIkDr241tWS07tR9HbegQYvaCssYtbxF57jTa_x3xtuwVTJOclW4yA6sqWQXGlU1B1Iu7j3od7-zPGUqeSVaEDyOVUzQ1AhzKdFZKKSIV9e5VkQHlhLku-Q9D8ZUJNOYV8TV7cNTr_t406dlnQUqkT1klAeuq4JABGbkiJCFjvQExw6E-khwh9vcEgZDo7nCCCVTIgo6hmBaPC-00NLWAdSSSaIOgeDvmvQCLrlgNiKd6yE_MZASSS0iIy3RhPP5V_bL1ZL6RaQZ8_8YpQlGZQhflprlunRGvGrI2XzItBDsWPVyqzLdwmzmdjta_fgYNpE-uUXwbgtq2exTncCG_Mre0lkb6tfd4ei-Deu3jGI7MAft3D2xHTkvP9lo5Jg |
| linkProvider | ProQuest |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V3JTsMwEB2xSXChrGIp4ANcEBYhm-MDQohFRS0VB0DcguM4CKmUpWH7Kb6RmTgpIKTeOHB14siOn_2e7VkA1o0fBiZSHneDxOF-5gsuEUvcdZHvMhTQSXHgdtkS7XZ0dSXPhuCj8oUhs8pqTSwW6vRe0xn5NgJHIN2jYNl7eOSUNYpuV6sUGhYWTfP-ilu23u7JIY7vhuseH50fNHiZVYBr5MqcyyQMTZKoxM0ClYo00JGSWIDElikZSF96yhHYxFA5qRZGZcmOowSFiks97JeH3x2GUd-LBM2rpuBffpg26jMlveMoVGR1i0ohqqmQynZCK7u-x3L4yYk_KaHguePaf_tDUzBZKmq2b6fANAyZ7gzUqmwVrFy8ZqFx9JYXLmHdG0YBzzsd02EIZaTxHiMvG2bt8cmXjJHhLEM9z-4KY1PDcuJ0azc4Bxd_0p95GOned80CMNyO6iiRWirhI5OHEeovByWfpiA52lOLsNUf1bhcDXqxtaQT8S8QLIJTDXysy5jslBqkM6jKZr_Kgw1IMujlegWVb63p42Rp8OM1GG-cn7bi1km7uQwTKBVDa6hch5H86dmswJh-yW97T6vFNGBw_deo-gRl7j94 |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V3JTsMwEB2xCXFhR5TVB7ggLNJsjg8IIaACgaoeACEuwXEchFQKtGH7Nb6OmTgpIKTeOHB1FiXxs9-z82YGYMP4YWAi5XE3SBzuZ77gErHEXRf5LkMBnRQbbpdnotmMrq5kawg-qlgYslVWc2IxUacPmvbIdxA4AukeBctOVtoiWoeNvccnThWk6E9rVU7DQuTUvL_i8q23e3KIfb3puo2j84NjXlYY4Bp5M-cyCUOTJCpxs0ClIg10pCQ2IMllSgbSl55yBD5uqJxUC6OypO4oQWnjUg_f0cP7DsOowDUm2QlbwfVXTKbNAE0F8DiKFln9UaV01dRIbfXQSrDveR1-8uNPeig4rzH1n7_WNEyWSpvt26ExA0OmMwtTVRULVk5qc3B89JYXoWKdW0aJ0Ntt02YIcaT3HqPoG2Z9-hRjxshQy1Dns_vChGpYTlxv_YTzcPEn77MAI52HjlkEhstUHSVSSyV8ZPgwQl3moBTUlDxHe6oG2_0ejstZohdbh52IfwGiBk4FgliXudqpZEh70CVb_UsebaKSQSevVLD59jR9zCwNPrwO4wim-OykeboME6ggQ-tfXoGRvPtsVmFMv-R3ve5aMSIY3Pw1qD4BnjpInA |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Extracting+parallel+phrases+from+comparable+data+for+machine+translation&rft.jtitle=Natural+language+engineering&rft.au=HEWAVITHARANA%2C+SANJIKA&rft.au=VOGEL%2C+STEPHAN&rft.date=2016-07-01&rft.pub=Cambridge+University+Press&rft.issn=1351-3249&rft.eissn=1469-8110&rft.volume=22&rft.issue=4&rft.spage=549&rft_id=info:doi/10.1017%2FS1351324916000139&rft.externalDBID=HAS_PDF_LINK&rft.externalDocID=4089667251 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1351-3249&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1351-3249&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1351-3249&client=summon |