Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Semi-supervised Relation Extraction with Large-scale Word Clustering"

Minh Thắng 68 9 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

We present a simple semi-supervised relation extraction system with large-scale word clustering. We focus on systematically exploring the effectiveness of different cluster-based features. We also propose several statistical methods for selecting clusters at an appropriate level of granularity. When training on different sizes of data, our semi-supervised approach consistently outperformed a state-of-the-art supervised baseline system. | Semi-supervised Relation Extraction with Large-scale Word Clustering Ang Sun Ralph Grishman Satoshi Sekine Computer Science Department New York University asun grishman sekine @cs.nyu.edu Abstract We present a simple semi-supervised relation extraction system with large-scale word clustering. We focus on systematically exploring the effectiveness of different cluster-based features. We also propose several statistical methods for selecting clusters at an appropriate level of granularity. When training on different sizes of data our semi-supervised approach consistently outperformed a state-of-the-art supervised baseline system. 1 Introduction Relation extraction is an important information extraction task in natural language processing NLP with many practical applications. The goal of relation extraction is to detect and characterize semantic relations between pairs of entities in text. For example a relation extraction system needs to be able to extract an Employment relation between the entities US soldier and US in the phrase US soldier. Current supervised approaches for tackling this problem in general fall into two categories feature based and kernel based. Given an entity pair and a sentence containing the pair both approaches usually start with multiple level analyses of the sentence such as tokenization partial or full syntactic parsing and dependency parsing. Then the feature based method explicitly extracts a variety of lexical syntactic and semantic 521 features for statistical learning either generative or discriminative Miller et al. 2000 Kambhatla 2004 Boschee et al. 2005 Grishman et al. 2005 Zhou et al. 2005 Jiang and Zhai 2007 . In contrast the kernel based method does not explicitly extract features it designs kernel functions over the structured sentence representations sequence dependency or parse tree to capture the similarities between different relation instances Zelenko et al. 2003 Bunescu and Mooney 2005a Bunescu and Mooney 2005b Zhao and .

TÀI LIỆU LIÊN QUAN

Kỷ yếu tóm tắt báo cáo khoa học: Hội nghị khoa học tim mạch toàn quốc lần thứ XI - Hội tim mạch Quốc gia Việt Nam

Báo cáo nghiên cứu khoa học: "Danh lục các loài thú ở khu bảo tồn thiên nhiên Pù Huống tỉnh Nghệ An và ý nghĩa bảo tồn nguồn gen quí hiếm của chúng"

Báo cáo khoa học: Hỗ trợ nâng cao năng lực quản lý chất thải sinh hoạt tại thành phố Hội An

Báo cáo nghiên cứu khoa học: "Tính năng động nghệ thuật của văn học hiện đại Việt Nam và một cách nhìn hành trình thể loại"

Báo cáo nghiên cứu khoa học: " DỊCH CHUYỂN TRUY VẤN OQL VÀO CÁC PHÉP TÍNH BAO HÀM"

Báo cáo khoa học: " Áp dụng thủ tục phân tích trong kiểm toán báo cáo tài chính"

Báo cáo nghiên cứu khoa học: "Người lính trở về sau chiến tranh với mặc cảm “ăn mày dĩ vãng’ trong tiểu thuyết Chu Lai"

Báo cáo nghiên cứu khoa học: "Khảo sát hiện tượng chuyển đổi chức năng - nghĩa của động từ tiếng Việt"

Báo cáo nghiên cứu khoa học: " BẢN CHẤT KHOA HỌC VÀ CÁCH MẠNG LÀ CỘI NGUỒN SỨC SỐNG CỦA CHỦ NGHĨA MÁC - LÊNIN"

Báo cáo khoa học: " CẢI TIẾN CÁC THUẬT TOÁN MƯỢN VÀ KHOÁ KÊNH TẦN SỐ MẠNG DI ĐỘNG TẾ BÀO"

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.