TAILIEUCHUNG - Báo cáo khoa học: "Modeling Commonality among Related Classes in Relation Extraction"

This paper proposes a novel hierarchical learning strategy to deal with the data sparseness problem in relation extraction by modeling the commonality among related classes. For each class in the hierarchy either manually predefined or automatically clustered, a linear discriminative function is determined in a topdown way using a perceptron algorithm with the lower-level weight vector derived from the upper-level weight vector. As the upper-level class normally has much more positive training examples than the lower-level class, the corresponding linear discriminative function can be determined more reliably. . | Modeling Commonality among Related Classes in Relation Extraction Zhou GuoDong Su Jian Zhang Min Institute for Infocomm Research 21 Heng Mui Keng Terrace Singapore 119613 Email zhougd sujian mzhang @ Abstract This paper proposes a novel hierarchical learning strategy to deal with the data sparseness problem in relation extraction by modeling the commonality among related classes. For each class in the hierarchy either manually predefined or automatically clustered a linear discriminative function is determined in a topdown way using a perceptron algorithm with the lower-level weight vector derived from the upper-level weight vector. As the upper-level class normally has much more positive training examples than the lower-level class the corresponding linear discriminative function can be determined more reliably. The upperlevel discriminative function then can effectively guide the discriminative function learning in the lower-level which otherwise might suffer from limited training data. Evaluation on the ACE RDC 2003 corpus shows that the hierarchical strategy much improves the performance by and in F-measure on least- and medium- frequent relations respectively. It also shows that our system outperforms the previous best-reported system by in F-measure on the 24 subtypes using the same feature set. 1 Introduction With the dramatic increase in the amount of textual information available in digital archives and the WWW there has been growing interest in techniques for automatically extracting information from text. Information Extraction IE is such a technology that IE systems are expected to identify relevant information usually of predefined types from text documents in a certain domain and put them in a structured format. According to the scope of the ACE program ACE 2000-2005 current research in IE has three main objectives Entity Detection and Tracking EDT Relation Detection and Characterization RDC and Event Detection and .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.