Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation"

Quốc Trụ 73 8 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

Instances of a word drawn from different domains may have different sense priors (the proportions of the different senses of a word). This in turn affects the accuracy of word sense disambiguation (WSD) systems trained and applied on different domains. This paper presents a method to estimate the sense priors of words drawn from a new domain, and highlights the importance of using well calibrated probabilities when performing these estimations. By using well calibrated probabilities, we are able to estimate the sense priors effectively to achieve signiﬁcant improvements in WSD accuracy. . | Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation Yee Seng Chan and Hwee Tou Ng Department of Computer Science National University of Singapore 3 Science Drive 2 Singapore 117543 chanys nght @comp.nus.edu.sg Abstract Instances of a word drawn from different domains may have different sense priors the proportions of the different senses of a word . This in turn affects the accuracy of word sense disambiguation WSD systems trained and applied on different domains. This paper presents a method to estimate the sense priors of words drawn from a new domain and highlights the importance of using well calibrated probabilities when performing these estimations. By using well calibrated probabilities we are able to estimate the sense priors effectively to achieve significant improvements in WSD accuracy. 1 Introduction Many words have multiple meanings and the process of identifying the correct meaning or sense of a word in context is known as word sense disambiguation WSD . Among the various approaches to WSD corpus-based supervised machine learning methods have been the most successful to date. With this approach one would need to obtain a corpus in which each ambiguous word has been manually annotated with the correct sense to serve as training data. However supervised WSD systems faced an important issue of domain dependence when using such a corpus-based approach. To investigate this Escudero et al. 2000 conducted experiments using the DSO corpus which contains sentences drawn from two different corpora namely Brown Corpus BC and Wall Street Journal WSJ . They found that training a WSD system on one part BC or WSJ of the DSO corpus and applying it to the other part can result in an accuracy drop of 12 to 19 . One reason for this is the difference in sense priors i.e. the proportions of the different senses of a word between BC and WSJ. For instance the noun interest has these 6 senses in the DSO corpus sense 1 2 3 4 5 and 8. In the BC part of .

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Estimating Compact Yet Rich Tree Insertion Grammars"

Báo cáo khoa học: "Estimating Strictly Piecewise Distributions"

Báo cáo khoa học: "Knowing the Unseen: Estimating Vocabulary Size over Unseen Samples"

Báo cáo khoa học: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation"

Báo cáo khoa học: "Empirically Estimating Order Constraints for Content Planning in Generation"

Báo cáo khoa học: "Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs"

báo cáo khoa học: " Early exit: Estimating and explaining early exit from drug treatment"

Báo cáo khoa hoc:" Estimating covariance functions for longitudinal data using a random regression model"

Báo cáo khoa hoc:" Estimating genetic covariance functions assuming a parametric correlation structure for environmental effects"

Báo cáo khoa hoc:" A sampling method for estimating the accuracy of predicted breeding values in genetic evaluation"

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.