Syntax-based Language Models for Statistical Machine Translation

The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine translation quality is split broadly into two aspects: adequacy and fluency. Adequacy measures how faithfully the meaning of the original sentence...

Celý popis

Uloženo v:
Podrobná bibliografie
Hlavní autor: Post, Matthew John
Médium: Dissertation
Jazyk:angličtina
Vydáno: ProQuest Dissertations & Theses 01.01.2010
Témata:
ISBN:1124481915, 9781124481913
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine translation quality is split broadly into two aspects: adequacy and fluency. Adequacy measures how faithfully the meaning of the original sentence is preserved, whereas fluency measures whether this meaning is expressed in valid sentences in the target language. While both of these criteria are difficult to meet; fluency is a much more difficult goal. Generally, this likely has something to do with the asymmetrical nature of producing and understanding sentences; although humans are quite robust at inferring the meaning of text even in the presence of lots of noise and error, the rules that govern grammatical utterances are exacting, subtle; and elusive. To produce understandable text, we can rely on this robust processing hardware, but to produce grammatical text, we have to understand how it, works. This dissertation attempts to improve the fluency of machine translation output by explicitly incorporating models of the target language structure into machine translation systems. It is organized into three parts. First, we propose a framework for decoding that decouples the structures of the sentences of the source and target languages, and evaluate it with existing grammatical models as language models for machine translation. Next, we apply lessons from that task to the learning of grammars more suitable to the demands of the machine translation. We then incorporate these grammars, called Tree Substitution Grammars, into our decoding framework.
AbstractList The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine translation quality is split broadly into two aspects: adequacy and fluency. Adequacy measures how faithfully the meaning of the original sentence is preserved, whereas fluency measures whether this meaning is expressed in valid sentences in the target language. While both of these criteria are difficult to meet; fluency is a much more difficult goal. Generally, this likely has something to do with the asymmetrical nature of producing and understanding sentences; although humans are quite robust at inferring the meaning of text even in the presence of lots of noise and error, the rules that govern grammatical utterances are exacting, subtle; and elusive. To produce understandable text, we can rely on this robust processing hardware, but to produce grammatical text, we have to understand how it, works. This dissertation attempts to improve the fluency of machine translation output by explicitly incorporating models of the target language structure into machine translation systems. It is organized into three parts. First, we propose a framework for decoding that decouples the structures of the sentences of the source and target languages, and evaluate it with existing grammatical models as language models for machine translation. Next, we apply lessons from that task to the learning of grammars more suitable to the demands of the machine translation. We then incorporate these grammars, called Tree Substitution Grammars, into our decoding framework.
Author Post, Matthew John
Author_xml – sequence: 1
  givenname: Matthew
  surname: Post
  middlename: John
  fullname: Post, Matthew John
BookMark eNotjctqwzAQAAVtIU2afxC9GyRLsrS3ltAXOOQQ38MqWiUORmotG9q_b6A9DcxhZsluU050w5ZS1lo7CdIs2LqU3gshQCmh63v2tP9JE35XHgsF3mI6zXgivs2BhsJjHvl-wqkvU3_EgW_xeO4T8W7EVIarz-mB3UUcCq3_uWLd60u3ea_a3dvH5rmtzhpEZQXKAE6EYBtwRKoB7esGIBIqHWTtFQQKZKOOTQQTDVGI1nljgrLeqBV7_Mt-jvlrpjIdLnke0_V4cEY12oAS6hc7Wkc2
ContentType Dissertation
Copyright Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Copyright_xml – notice: Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
DBID 053
0BH
0L7
CBPLH
EU9
G20
M8-
PHGZT
PKEHL
PQEST
PQQKQ
PQUKI
DatabaseName Dissertations & Theses Europe Full Text: Science & Technology
ProQuest Dissertations and Theses Professional
Dissertations & Theses @ The University of Rochester
ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection
ProQuest Dissertations & Theses A&I
ProQuest Dissertations & Theses Global
ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection
ProQuest One Academic (New)
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
DatabaseTitle Dissertations & Theses Europe Full Text: Science & Technology
Dissertations & Theses @ The University of Rochester
ProQuest One Academic Middle East (New)
ProQuest One Academic UKI Edition
ProQuest One Academic Eastern Edition
ProQuest Dissertations & Theses Global: The Sciences and Engineering Collection
ProQuest Dissertations and Theses Professional
ProQuest One Academic
ProQuest Dissertations & Theses A&I
ProQuest One Academic (New)
ProQuest Dissertations and Theses A&I: The Sciences and Engineering Collection
ProQuest Dissertations & Theses Global
DatabaseTitleList Dissertations & Theses Europe Full Text: Science & Technology
Database_xml – sequence: 1
  dbid: G20
  name: ProQuest Dissertations & Theses Global
  url: https://www.proquest.com/pqdtglobal1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
ExternalDocumentID 2275427261
Genre Dissertation/Thesis
GroupedDBID 053
0BH
0L7
8R4
8R5
CBPLH
EU9
G20
M8-
PHGZT
PKEHL
PQEST
PQQKQ
PQUKI
Q2X
ID FETCH-LOGICAL-h490-70a1d980dd7698ee3694b2699fea34d12b39dede7f4f6f95f5eedf78b55d37b53
IEDL.DBID G20
ISBN 1124481915
9781124481913
IngestDate Mon Jun 30 03:51:30 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-h490-70a1d980dd7698ee3694b2699fea34d12b39dede7f4f6f95f5eedf78b55d37b53
Notes SourceType-Dissertations & Theses-1
ObjectType-Dissertation/Thesis-1
content type line 12
PQID 853645930
PQPubID 18750
ParticipantIDs proquest_journals_853645930
PublicationCentury 2000
PublicationDate 20100101
PublicationDateYYYYMMDD 2010-01-01
PublicationDate_xml – month: 01
  year: 2010
  text: 20100101
  day: 01
PublicationDecade 2010
PublicationYear 2010
Publisher ProQuest Dissertations & Theses
Publisher_xml – name: ProQuest Dissertations & Theses
SSID ssib000933042
Score 1.5275241
Snippet The goal of machine translation is to develop algorithms that produce human-quality translations of natural language sentences. The evaluation of machine...
SourceID proquest
SourceType Aggregation Database
SubjectTerms Computer science
Title Syntax-based Language Models for Statistical Machine Translation
URI https://www.proquest.com/docview/853645930
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07SwNBEB40WohFfKJGZQvbxbvs43YqBTVYxCCYIl24fWERLppE0X_v7WYvBAQby2Wbfc18s_P6AK5QKWE0cmqU55SXHmkZ5NFkSivUWjobC4X7xWCgRiN8Trk585RW2ejEqKjt1AQf-XUNK6HvCctu3t5pII0KwdXEoLEJW6G4Ntb6rls_q896HlAsfE1E6vLUjNkvFRxxpdf-54r2YPd-LZ6-DxuuOoB2w9RAkuAewu3Ld7Uov2jALEv6yUdJAhHaZE5qu5UEozP2bC4n5CkmWDoScWyZK3cEw97D8O6RJu4E-soxo0VW5hZVZm0hUTnHJHLdlYjelYzbvKsZ2voaCs-99Ci8qLHSF0oLYVmhBTuGVjWt3AkQUwupy70QjiEX1urcSaV8Jo0xBVf-FDrN8YzT-5-PV2dz9udsB3aW0fjg0jiH1mL24S5g23zWO55dxtv8AYW_qo4
linkProvider ProQuest
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V07T8MwED5VBQnEUJ4CysMDjBZJYyf2gECiVK2aVkh06BbFLzFUKbTl0f_Ej8ROk6oSElsHxshL7LO_73y-uw_gijNGpeAES2YIJqnhOHXnUXpMMC5EqFVeKBxH_T4bDvlTBb7LWhiXVlliYg7UaixdjPzG0orrexJ4d69v2IlGucfVUkFjsSu6ev5pb2zT207Tmve60Wg9Dh7auBAVwC-EezjyUl9x5ikVhZxpHYSciEbIudFpQJTfEAFX9v8iQ0xoODXUkoiJmKBUBZFwGhEW8DeIa3TnSotXna1lbMB3pOluQrRoKlV-B78QP6exVu1_LcAu7DRXsgX2oKKzfaiVOhSogKUDuH-eZ7P0CztGViguIrDIybyNpsh65ci51HlH6nSEenn6qEY5Sy8yAQ9hsI45HEE1G2f6GJC0EKR9Q6kOOKFKCV-HjBkvlFJGhJkTqJfWSIrTPU2Wpjj9c_QSttqDXpzEnX63DtuLvAMXvDmD6mzyrs9hU37Y2U8u8o2EIFmz3X4A3cYIPA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adissertation&rft.genre=dissertation&rft.title=Syntax-based+Language+Models+for+Statistical+Machine+Translation&rft.DBID=053%3B0BH%3B0L7%3BCBPLH%3BEU9%3BG20%3BM8-%3BPHGZT%3BPKEHL%3BPQEST%3BPQQKQ%3BPQUKI&rft.PQPubID=18750&rft.au=Post%2C+Matthew+John&rft.date=2010-01-01&rft.pub=ProQuest+Dissertations+%26+Theses&rft.isbn=1124481915&rft.externalDBID=HAS_PDF_LINK&rft.externalDocID=2275427261
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781124481913/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781124481913/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781124481913/sc.gif&client=summon&freeimage=true