TAILIEUCHUNG - Báo cáo khoa học: "Improving English Subcategorization Acquisition with Diathesis Alternations as Heuristic Information"

Automatically acquired lexicons with subcategorization information have already proved accurate and useful enough for some purposes but their accuracy still shows room for improvement. By means of diathesis alternation, this paper proposes a new filtering method, which improved the performance of Korhonen’s acquisition system remarkably, with the precision increased to and recall unchanged, making the acquired lexicon much more practical for further manual proofreading and other NLP uses. . | Improving English Subcategorization Acquisition with Diathesis Alternations as Heuristic Information Xiwu Han Institute of Computational Linguistics Heilongjiang University Harbin City 150080 China hxw@ Tiejun Zhao School of Computer Science and Technology Harbin Institute of Technology Harbin City 150001 China tjzhao@ Xingshang Fu Institute of Computational Linguistics Heilongjiang University Harbin City 150080 China fxs@ Abstract Automatically acquired lexicons with subcategorization information have already proved accurate and useful enough for some purposes but their accuracy still shows room for improvement. By means of diathesis alternation this paper proposes a new filtering method which improved the performance of Korhonen s acquisition system remarkably with the precision increased to and recall unchanged making the acquired lexicon much more practical for further manual proofreading and other NLP uses. 1 Introduction Subcategorization is the process that further classifies a syntactic category into its subsets. Chomsky 1965 defines the function of strict subcategorization features as appointing a set of constraints that dominate the selection of verbs and other arguments in deep structure. Large subcategorized verbal lexicons have proved to be crucially important for many tasks of natural language processing such as probabilistic parsers Korhonen 2001 2002 and verb classifications Schulte im Walde 2002 Korhonen 2003 . Since Brent 1993 a considerable amount of research focusing on large-scaled automatic acquisition of subcategorization frames SCF has met with some success not only in English but also in many other languages including German Schulte im Walde 2002 Spanish Chrupala 2003 Czech Sarkar and Zeman 2000 Portuguese Gamallo et. al 2002 and Chinese Han et al 2004 . The general objective of this research is to acquire from a given corpus the SCF types and numbers for predicate verbs. Two typi cal steps during

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.