English syntactic disambiguation using Parser's ambiguity type information
- Authors
- Lee, Jae Won; Kim, SD; Chae, JS; Lee, Jongwoo; Kim, DH
- Issue Date
- Aug-2003
- Publisher
- ELECTRONICS TELECOMMUNICATIONS RESEARCH INST
- Citation
- ETRI JOURNAL, v.25, no.4, pp 219 - 230
- Pages
- 12
- Journal Title
- ETRI JOURNAL
- Volume
- 25
- Number
- 4
- Start Page
- 219
- End Page
- 230
- URI
- https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/149111
- DOI
- 10.4218/etrij.03.0102.0401
- ISSN
- 1225-6463
2233-7326
- Abstract
- This paper describes a rule-based approach for syntactic disambiguation used by the English sentence parser in E-TRAN 2001, an English-Korean machine translation system. We propose Parser's Ambiguity Type Information (PATI) to automatically identify the types of ambiguities observed in competing candidate trees produced by the parser and synthesize the types into a formal representation. PATI provides an efficient way of encoding knowledge into grammar rules and calculating rule preference scores from a relatively small training corpus. In the overall scoring scheme for sorting the candidate trees, the rule preference scores are combined with other preference functions that are based on statistical information. We compare the enhanced grammar with the initial one in terms of the amount of ambiguity. The experimental results show that the rule preference scores could significantly increase the accuracy of ambiguity resolution.
- Files in This Item
-
Go to Link
- Appears in
Collections - ETC > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.