Syntax-based Language Models for Statistical Machine Translation
The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine translation quality is split broadly into two aspects: adequacy and fluency. Adequacy measures how faithfully the meaning of the original sentence...
Uloženo v:
| Hlavní autor: | |
|---|---|
| Médium: | Dissertation |
| Jazyk: | angličtina |
| Vydáno: |
ProQuest Dissertations & Theses
01.01.2010
|
| Témata: | |
| ISBN: | 1124481915, 9781124481913 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine translation quality is split broadly into two aspects: adequacy and fluency. Adequacy measures how faithfully the meaning of the original sentence is preserved, whereas fluency measures whether this meaning is expressed in valid sentences in the target language. While both of these criteria are difficult to meet; fluency is a much more difficult goal. Generally, this likely has something to do with the asymmetrical nature of producing and understanding sentences; although humans are quite robust at inferring the meaning of text even in the presence of lots of noise and error, the rules that govern grammatical utterances are exacting, subtle; and elusive. To produce understandable text, we can rely on this robust processing hardware, but to produce grammatical text, we have to understand how it, works. This dissertation attempts to improve the fluency of machine translation output by explicitly incorporating models of the target language structure into machine translation systems. It is organized into three parts. First, we propose a framework for decoding that decouples the structures of the sentences of the source and target languages, and evaluate it with existing grammatical models as language models for machine translation. Next, we apply lessons from that task to the learning of grammars more suitable to the demands of the machine translation. We then incorporate these grammars, called Tree Substitution Grammars, into our decoding framework. |
|---|---|
| AbstractList | The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine translation quality is split broadly into two aspects: adequacy and fluency. Adequacy measures how faithfully the meaning of the original sentence is preserved, whereas fluency measures whether this meaning is expressed in valid sentences in the target language. While both of these criteria are difficult to meet; fluency is a much more difficult goal. Generally, this likely has something to do with the asymmetrical nature of producing and understanding sentences; although humans are quite robust at inferring the meaning of text even in the presence of lots of noise and error, the rules that govern grammatical utterances are exacting, subtle; and elusive. To produce understandable text, we can rely on this robust processing hardware, but to produce grammatical text, we have to understand how it, works. This dissertation attempts to improve the fluency of machine translation output by explicitly incorporating models of the target language structure into machine translation systems. It is organized into three parts. First, we propose a framework for decoding that decouples the structures of the sentences of the source and target languages, and evaluate it with existing grammatical models as language models for machine translation. Next, we apply lessons from that task to the learning of grammars more suitable to the demands of the machine translation. We then incorporate these grammars, called Tree Substitution Grammars, into our decoding framework. |
| Author | Post, Matthew John |
| Author_xml | – sequence: 1 givenname: Matthew surname: Post middlename: John fullname: Post, Matthew John |
| BookMark | eNotjctqwzAQAAVtIU2afxC9GyRLsrS3ltAXOOQQ38MqWiUORmotG9q_b6A9DcxhZsluU050w5ZS1lo7CdIs2LqU3gshQCmh63v2tP9JE35XHgsF3mI6zXgivs2BhsJjHvl-wqkvU3_EgW_xeO4T8W7EVIarz-mB3UUcCq3_uWLd60u3ea_a3dvH5rmtzhpEZQXKAE6EYBtwRKoB7esGIBIqHWTtFQQKZKOOTQQTDVGI1nljgrLeqBV7_Mt-jvlrpjIdLnke0_V4cEY12oAS6hc7Wkc2 |
| ContentType | Dissertation |
| Copyright | Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works. |
| Copyright_xml | – notice: Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works. |
| DBID | 053 0BH 0L7 CBPLH EU9 G20 M8- PHGZT PKEHL PQEST PQQKQ PQUKI |
| DatabaseName | Dissertations & Theses Europe Full Text: Science & Technology ProQuest Dissertations and Theses Professional Dissertations & Theses @ The University of Rochester ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection ProQuest Dissertations & Theses A&I ProQuest Dissertations & Theses Global ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection ProQuest One Academic (New) ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic (retired) ProQuest One Academic UKI Edition |
| DatabaseTitle | Dissertations & Theses Europe Full Text: Science & Technology Dissertations & Theses @ The University of Rochester ProQuest One Academic Middle East (New) ProQuest One Academic UKI Edition ProQuest One Academic Eastern Edition ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection ProQuest Dissertations and Theses Professional ProQuest One Academic ProQuest Dissertations & Theses A&I ProQuest One Academic (New) ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection ProQuest Dissertations & Theses Global |
| DatabaseTitleList | Dissertations & Theses Europe Full Text: Science & Technology |
| Database_xml | – sequence: 1 dbid: G20 name: ProQuest Dissertations & Theses Global url: https://www.proquest.com/pqdtglobal1 sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| ExternalDocumentID | 2275427261 |
| Genre | Dissertation/Thesis |
| GroupedDBID | 053 0BH 0L7 8R4 8R5 CBPLH EU9 G20 M8- PHGZT PKEHL PQEST PQQKQ PQUKI Q2X |
| ID | FETCH-LOGICAL-h490-70a1d980dd7698ee3694b2699fea34d12b39dede7f4f6f95f5eedf78b55d37b53 |
| IEDL.DBID | G20 |
| ISBN | 1124481915 9781124481913 |
| IngestDate | Mon Jun 30 03:51:30 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-h490-70a1d980dd7698ee3694b2699fea34d12b39dede7f4f6f95f5eedf78b55d37b53 |
| Notes | SourceType-Dissertations & Theses-1 ObjectType-Dissertation/Thesis-1 content type line 12 |
| PQID | 853645930 |
| PQPubID | 18750 |
| ParticipantIDs | proquest_journals_853645930 |
| PublicationCentury | 2000 |
| PublicationDate | 20100101 |
| PublicationDateYYYYMMDD | 2010-01-01 |
| PublicationDate_xml | – month: 01 year: 2010 text: 20100101 day: 01 |
| PublicationDecade | 2010 |
| PublicationYear | 2010 |
| Publisher | ProQuest Dissertations & Theses |
| Publisher_xml | – name: ProQuest Dissertations & Theses |
| SSID | ssib000933042 |
| Score | 1.5275241 |
| Snippet | The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine... |
| SourceID | proquest |
| SourceType | Aggregation Database |
| SubjectTerms | Computer science |
| Title | Syntax-based Language Models for Statistical Machine Translation |
| URI | https://www.proquest.com/docview/853645930 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV09T8MwED1BYUAM5VNAAXlgtUhqO7YnkICKoVRIVKhb5cS2GKoU2oLg3-MzTlUJiYUxypI48b17d-f3AC4Y44FIcEOV8pzyKuRwpcGGa-AiKue2cFFS6LkvBwM1GunHNJszT2OVTUyMgdpOK6yRXwZYQd0Tll29vlE0jcLmanLQWIcNPFwbz_quZj9Lsp4jiiE1EUnlqblmv0JwxJVe-59PtAPbtyv99F1Yc_UetBunBpI27j5cP33VC_NJEbMs6acaJUEjtMmchLyVYNIZNZvNhDzEAUtHIo79zModwLB3N7y5p8k7gb5wnVGZmdxqlVkrC62cY4XmZbfQ2jvDuM27JdPWWSc994XXwouAlV6qUgjLZCnYIbTqae2OgOjMcOEM6sop7iumvA8cS_qiyox3Nj-GTrM84_T_z8fLtTn5824Htn668VjSOIXWYvbuzmCz-ghvPDuPX_MbVXipFQ |
| linkProvider | ProQuest |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1NS8NAEB2KCoqH-olaP_agx2DS3SS7B1GwlpamRbBIb2GT3cVDSbWtH_1P_kh30qQUBG89eAw5bWbz3szs7HsAl5QyW0gw6XBumMNSm8MlEg9cbS3CPaYCnUsKPUdhr8cHA_FYge_yLgyOVZaYmAO1GqXYI7-2tIK6J9S9fX1z0DQKD1dLB435rujo2aet2CY37YYN71W93nzo37ecwlTAeWHCdUJXekpwV6kwEFxrGgiW1AMhjJaUKa-eUKG00qFhJjDCN74lERPyxPcVDRP0iLCAv85Q6A6vFi8nW4vegIekiZWQX4hKlc_0F-LnNNas_q8PsAPbjaVpgV2o6GwPqqUPBSlgaR_unmbZVH45yMiKREUHlqDN23BCbFZOMKXOFanlkHTz8VFNcpaeTwIeQH8VaziEtWyU6SMgwpXM1xJV8zgzKeXG2AoyNEHqSqOVdwy1Mhpx8XdP4kUoTv58ewGbrX43iqN2r1ODrfncATZvTmFtOn7XZ7CRftjVj8_zjUQgXnHcfgDQmQbD |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adissertation&rft.genre=dissertation&rft.title=Syntax-based+Language+Models+for+Statistical+Machine+Translation&rft.DBID=053%3B0BH%3B0L7%3BCBPLH%3BEU9%3BG20%3BM8-%3BPHGZT%3BPKEHL%3BPQEST%3BPQQKQ%3BPQUKI&rft.PQPubID=18750&rft.au=Post%2C+Matthew+John&rft.date=2010-01-01&rft.pub=ProQuest+Dissertations+%26+Theses&rft.isbn=1124481915&rft.externalDBID=HAS_PDF_LINK&rft.externalDocID=2275427261 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781124481913/lc.gif&client=summon&freeimage=true |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781124481913/mc.gif&client=summon&freeimage=true |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781124481913/sc.gif&client=summon&freeimage=true |

