Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
The integration of sophisticated inference-based techniques into natural language processing applications first requires a reliable method of encoding the predicate-argument structure of the propositional content of text. Recent statistical approaches to automated predicateargument annotation have utilized parse tree paths as predictive features, which encode the path between a verb predicate and a node in the parse tree that governs its argument. | A Comparison of Alternative Parse Tree Paths for Labeling Semantic Roles Reid Swanson and Andrew S. Gordon Institute for Creative Technologies University of Southern California 13274 Fiji Way Marina del Rey CA 90292 USA swansonr@ict.usc.edu gordon@ict.usc.edu Abstract The integration of sophisticated inference-based techniques into natural language processing applications first requires a reliable method of encoding the predicate-argument structure of the propositional content of text. Recent statistical approaches to automated predicateargument annotation have utilized parse tree paths as predictive features which encode the path between a verb predicate and a node in the parse tree that governs its argument. In this paper we explore a number of alternatives for how these parse tree paths are encoded focusing on the difference between automatically generated constituency parses and dependency parses. After describing five alternatives for encoding parse tree paths we investigate how well each can be aligned with the argument substrings in annotated text corpora their relative precision and recall performance and their comparative learning curves. Results indicate that constituency parsers produce parse tree paths that can more easily be aligned to argument substrings perform better in precision and recall and have more favorable learning curves than those produced by a dependency parser. 1 Introduction A persistent goal of natural language processing research has been the automated transformation of natural language texts into representations that unambiguously encode their propositional content in formal notation. Increasingly first-order predicate calculus representations of textual meaning have been used in natural lanugage processing applications that involve automated inference. For example Moldovan et al. 2003 demonstrate how predicate-argument formulations of questions and candidate answer sentences are unified using logical inference in a top-performing .