TAILIEUCHUNG - Báo cáo khoa học: "ADOP Model for Semantic Interpretation*"

In data-oriented language processing, an annotated language corpus is used as a stochastic grammar. The most probable analysis of a new sentence is constructed by combining fragments from the corpus in the most probable way. This approach has been successfully used for syntactic analysis, using corpora with syntactic annotations such as the Penn Tree-bank. If a corpus with semantically annotated sentences is used, the same approach can also generate the most probable semantic interpretation of an input sentence. The present paper explains this semantic interpretation method. . | A DOP Model for Semantic Interpretation Remko Bonnema Rens Bod and Remko Scha Institute for Logic Language and Computation University of Amsterdam Spuistrầt 134 1012 VB Amsterdam Abstract In data-oriented language processing an annotated language corpus is used as a stochastic grammar. The most probable analysis of a new sentence is constructed by combining fragments from the corpus in the most probable way. This approach has been successfully used for syntactic analysis using corpora with syntactic annotations such as the Penn Tree-bank. If a corpus with semantically annotated sentences is used the same approach can also generate the most probable semantic interpretation of an input sentence. The present paper explains this semantic interpretation method. A data-oriented semantic interpretation algorithm was tested on two semantically annotated corpora the English ATIS corpus and the Dutch OVIS corpus. Experiments show an increase in semantic accuracy if larger corpus-fragments are taken into consideration. 1 Introduction Data-oriented models of language processing embody the assumption that human language perception and production works with representations of concrete past language experiences rather than with abstract grammar rules. Such models therefore maintain large corpora of linguistic representations of previously occurring utterances. When processing a new input utterance analyses of this utterance are constructed by combining fragments from the corpus the occurrence-frequencies of the fragments are used to estimate which analysis is the most probable one. This work was partially supported by NWO the Netherlands Organization for Scientific Research Priority Programme Language and Speech Technology . For the syntactic dimension of language various instantiations of this data-oriented processing or DOP approach have been worked out . Bod 1992-1995 Charniak 1996 Tugwell 1995 Sima an et .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.