Scientific report: "Exploiting Shallow Linguistic Information for Relation Extraction from Biomedical Literature"

Claudio Giuliano, Alberto Lavelli and Lorenza Romano
ITC-irst, Via Sommarive 18, 38050 Povo (TN), Italy
giuliano lavelli romano @

Abstract

We propose an approach for extracting relations between entities from biomedical literature based solely on shallow linguistic information. We use a combination of kernel functions to integrate two different information sources: (i) the whole sentence where the relation appears, and (ii) the local contexts around the interacting entities. We performed experiments on extracting gene and protein interactions from two different data sets. The results show that our approach outperforms most of the previous methods based on syntactic and semantic information.

1 Introduction

Information Extraction (IE) is the process of finding relevant entities and their relationships within textual documents. Applications of IE range from the Semantic Web to Bioinformatics. For example, there is an increasing interest in automatically extracting relevant information from biomedical literature. Recent evaluation campaigns on bio-entity recognition, such as BioCreAtIvE and the JNLPBA 2004 shared task, have shown that several systems are able to achieve good performance (even if it is a bit worse than that reported on news articles).
However, relation identification is more useful from an applicative perspective, but it is still a considerable challenge for automatic tools. In this work, we propose a supervised machine learning approach to relation extraction which is applicable even when deep linguistic processing is not available or reliable. In particular, we explore a kernel-based approach based solely on shallow linguistic processing, such as tokenization, sentence splitting, Part-of-Speech (PoS) tagging and lemmatization. Kernel methods (Shawe-Taylor and Cristianini, 2004) show their full potential when an explicit computation of the feature map becomes computationally infeasible
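
To make the idea of combining two information sources concrete, the following is a minimal sketch of summing a sentence-level kernel and a local-context kernel. It is an illustration under stated assumptions, not the kernel definitions used in the paper: the bag-of-words features, the window size, the dictionary layout of an example (`tokens`, `e1`, `e2`) and all function names are hypothetical, and the sentences are made-up toy inputs.

```python
import math
from collections import Counter


def dot(a, b):
    """Sparse dot product between two feature-count mappings."""
    if len(a) > len(b):
        a, b = b, a
    return float(sum(v * b[k] for k, v in a.items() if k in b))


def normalized(x, y):
    """Cosine-normalized linear kernel, so that k(x, x) = 1 for non-empty x."""
    denom = math.sqrt(dot(x, x) * dot(y, y))
    return dot(x, y) / denom if denom else 0.0


def global_features(example):
    """Assumed global context: bag of lowercased tokens of the whole sentence."""
    return Counter(t.lower() for t in example["tokens"])


def local_features(example, window=2):
    """Assumed local context: tokens within `window` positions of each entity,
    tagged with the entity role and the relative offset."""
    tokens = example["tokens"]
    feats = Counter()
    for role in ("e1", "e2"):
        idx = example[role]
        for offset in range(-window, window + 1):
            pos = idx + offset
            if offset != 0 and 0 <= pos < len(tokens):
                feats[(role, offset, tokens[pos].lower())] += 1
    return feats


def combined_kernel(x, y, window=2):
    """Sum of the two normalized kernels; a sum of valid kernels is a valid kernel."""
    k_global = normalized(global_features(x), global_features(y))
    k_local = normalized(local_features(x, window), local_features(y, window))
    return k_global + k_local


# Toy examples (invented for illustration); e1/e2 are entity token indices.
a = {"tokens": "ProtA interacts with ProtB in yeast".split(), "e1": 0, "e2": 3}
c = {"tokens": "ProtC interacts with ProtD".split(), "e1": 0, "e2": 3}
print(combined_kernel(a, c))
```

Because each component is normalized, the combined value lies between 0 and 2, and neither information source dominates simply because it produces more features. A matrix of such values over all training pairs could be handed to any learner that accepts a precomputed kernel.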
