Scientific report: "Learning the Latent Semantics of a Concept from its Definition"

Weiwei Guo
Department of Computer Science, Columbia University, New York, NY, USA
weiwei@

Mona Diab
Center for Computational Learning Systems, Columbia University, New York, NY, USA
mdiab@

Abstract

In this paper we study unsupervised word sense disambiguation (WSD) based on sense definition. We learn low-dimensional latent semantic vectors of concept definitions to construct a more robust sense similarity measure, wmfvec. Experiments on four all-words WSD data sets show significant improvement over the baseline WSD systems and LDA-based similarity measures, achieving results comparable to state-of-the-art WSD systems.

1 Introduction

To date, many unsupervised WSD systems rely on a sense similarity module that returns a similarity score given two senses. Many similarity measures use the taxonomy structure of WordNet [WN] (Fellbaum, 1998), which allows only noun-noun and verb-verb pair similarity computation, since the other parts of speech (adjectives and adverbs) do not have a taxonomic representation structure. For example, the jcn similarity measure (Jiang and Conrath, 1997) computes the sense pair similarity score based on the information content of three senses: the two senses and their least common subsumer in the noun/verb hierarchy.

The most popular sense similarity measure is the Extended Lesk [elesk] measure (Banerjee and Pedersen, 2003). In elesk, the similarity score is computed based on the length of overlapping words/phrases between two extended dictionary definitions. The definitions are extended by definitions of neighbor senses to discover more overlapping words. However, exact word matching is lossy. Below are two definitions from WN:

bank#n#1: a financial institution that accepts deposits and channels the money into lending activities

stock#n#1: the capital raised by a corporation through the issue of shares entitling holders to an ownership interest (equity)

Despite the high semantic .
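
The lossiness of exact word matching can be illustrated with a minimal sketch. It assumes a simplified single-word overlap between the two definitions quoted above, not the full elesk measure (which scores overlapping phrases across extended definitions). The names `tokens` and `word_overlap` are hypothetical helpers introduced for illustration only.

```python
import re

# WordNet definitions as quoted in the text above.
BANK = ("a financial institution that accepts deposits and channels "
        "the money into lending activities")
STOCK = ("the capital raised by a corporation through the issue of shares "
         "entitling holders to an ownership interest (equity)")


def tokens(definition):
    """Lowercase a definition and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", definition.lower())


def word_overlap(def_a, def_b):
    """Return the set of word types that occur in both definitions.

    This is a simplification assumed for illustration: it ignores phrase
    length and the definitions of neighboring senses used by elesk.
    """
    return set(tokens(def_a)) & set(tokens(def_b))


shared = word_overlap(BANK, STOCK)
print(sorted(shared))  # ['a', 'the']
```

Under this simplification the two definitions share only the function words "a" and "the", so an overlap-based score carries almost no information about how related the two senses are.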
