Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Latent Semantic Word Sense Induction and Disambiguation"

Diễm Trang 58 10 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

In this paper, we present a uniﬁed model for the automatic induction of word senses from text, and the subsequent disambiguation of particular word instances using the automatically extracted sense inventory. The induction step and the disambiguation step are based on the same principle: words and contexts are mapped to a limited number of topical dimensions in a latent semantic word space. The intuition is that a particular sense is associated with a particular topic, so that different senses can be discriminated through their association with particular topical dimensions; in a similar vein, a particular instance of a word. | Latent Semantic Word Sense Induction and Disambiguation Tim Van de Cruys RCEAL University of Cambridge United Kingdom tv234@cam.ac.uk Marianna Apidianaki Alpage INRIA Univ Paris Diderot Sorbonne Paris Cite UMRI-001 75013 Paris France marianna.apidianaki@inria.fr Abstract In this paper we present a unified model for the automatic induction of word senses from text and the subsequent disambiguation of particular word instances using the automatically extracted sense inventory. The induction step and the disambiguation step are based on the same principle words and contexts are mapped to a limited number of topical dimensions in a latent semantic word space. The intuition is that a particular sense is associated with a particular topic so that different senses can be discriminated through their association with particular topical dimensions in a similar vein a particular instance of a word can be disambiguated by determining its most important topical dimensions. The model is evaluated on the semeval-2010 word sense induction and disambiguation task on which it reaches state-of-the-art results. 1 Introduction Word sense induction WSI is the task of automatically identifying the senses of words in texts without the need for handcrafted resources or manually annotated data. The manual construction of a sense inventory is a tedious and time-consuming job and the result is highly dependent on the annotators and the domain at hand. By applying an automatic procedure we are able to only extract the senses that are objectively present in a particular corpus and it allows for the sense inventory to be straightforwardly adapted to a new domain. Word sense disambiguation WSD on the other hand is the closely related task of assigning a sense 1476 label to a particular instance of a word in context using an existing sense inventory. The bulk of WSD algorithms up till now use pre-defined sense inventories such as WordNet that often contain finegrained sense distinctions which .

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "A Discriminative Latent Variable Model for Statistical Machine Translation"

Báo cáo khoa học: "Fine Granular Aspect Analysis using Latent Structural Models"

Báo cáo khoa học: "Exploiting Latent Information to Predict Diffusions of Novel Topics on Social Networks"

Báo cáo khoa học: "Historical Analysis of Legal Opinions with a Sparse Mixed-Effects Latent Variable Model"

Báo cáo khoa học: "Modeling Sentences in the Latent Space"

Báo cáo khoa học: "A Pilot Study of Implicit Attitude using Latent Textual Semantics"

Báo cáo khoa học: "Learning the Latent Semantics of a Concept from its Deﬁnition"

Báo cáo khoa học: "Spectral Learning of Latent-Variable PCFGs"

Báo cáo khoa học: "A Latent Dirichlet Allocation method for Selectional Preferences"

Báo cáo khoa học: "Latent variable models of selectional preference"

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.