Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Bayesian Word Sense Induction"

Hoàng Ngôn 46 9 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

Sense induction seeks to automatically identify word senses directly from a corpus. A key assumption underlying previous work is that the context surrounding an ambiguous word is indicative of its meaning. Sense induction is thus typically viewed as an unsupervised clustering problem where the aim is to partition a word’s contexts into different classes, each representing a word sense. Our work places sense induction in a Bayesian context by modeling the contexts of the ambiguous word as samples from a multinomial distribution over senses which are in turn characterized as distributions over words. . | Bayesian Word Sense Induction Samuel Brody Dept. of Biomedical Informatics Columbia University samuel.brody@dbmi.columbia.edu Mirella Lapata School of Informatics University of Edinburgh mlap@inf.ed.ac.uk Abstract Sense induction seeks to automatically identify word senses directly from a corpus. A key assumption underlying previous work is that the context surrounding an ambiguous word is indicative of its meaning. Sense induction is thus typically viewed as an unsupervised clustering problem where the aim is to partition a word s contexts into different classes each representing a word sense. Our work places sense induction in a Bayesian context by modeling the contexts of the ambiguous word as samples from a multinomial distribution over senses which are in turn characterized as distributions over words. The Bayesian framework provides a principled way to incorporate a wide range of features beyond lexical cooccurrences and to systematically assess their utility on the sense induction task. The proposed approach yields improvements over state-of-the-art systems on a benchmark dataset. 1 Introduction Sense induction is the task of discovering automatically all possible senses of an ambiguous word. It is related to but distinct from word sense disambiguation WSD where the senses are assumed to be known and the aim is to identify the intended meaning of the ambiguous word in context. Although the bulk of previous work has been devoted to the disambiguation problem1 there are good reasons to believe that sense induction may be able to overcome some of the issues associated with WSD. Since most disambiguation methods assign senses according to and with the aid Approaches to WSD are too numerous to list We refer the interested reader to Agirre et al. 2007 for an overview of the state of the art. of dictionaries or other lexical resources it is difficult to adapt them to new domains or to languages where such resources are scarce. A related problem concerns the .

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Using Rejuvenation to Improve Particle Filtering for Bayesian Word Segmentation"

Báo cáo khoa học: "A Nonparametric Bayesian Approach to Acoustic Model Discovery"

Báo cáo khoa học: "Learning to “Read Between the Lines” using Bayesian Logic Programs"

Báo cáo khoa học: "Bayesian Symbol-Reﬁned Tree Substitution Grammars for Syntactic Parsing"

Báo cáo khoa học: "Semantic Parsing with Bayesian Tree Transducers"

Báo cáo khoa học: "A Bayesian Method for Robust Estimation of Distributional Similarities"

Báo cáo khoa học: "Bayesian Synchronous Tree-Substitution Grammar Induction and its Application to Sentence Compression"

Báo cáo khoa học: "A Bayesian Model for Unsupervised Semantic Parsing"

Báo cáo khoa học: "Blocked Inference in Bayesian Tree Substitution Grammars"

Báo cáo khoa học: "Insertion Operator for Bayesian Tree Substitution Grammars"

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.