TAILIEUCHUNG - Báo cáo khoa học: "The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers"

We present an outline of the genome information acquisition (GENIA) project for automatically extracting biochemical information from journal papers and abstracts. GENIA will be available over the Internet and is designed to aid in information extraction, retrieval and visualisation and to help reduce information overload on researchers. The vast repository of papers available online in databases such as MEDLINE is a natural environment in which to develop language engineering methods and tools and is an opportunity to show how language engineering can play a key role on the Internet. . | Proceedings of EACL 99 The GENIA project corpus-based knowledge acquisition and information extraction from genome research papers Nigel Collier Hyun Seok Park Norihiro Ogata Yuka Tateishi Chikashi Nobata Tomoko Ohta Tateshi Sekimizu Hisao Imai Katsutoshi Ibushi Jun-ichi Tsujii nigel hsp20 Ogata yucca nova okap sekimizu hisao is. . Department of Information Science Graduate School of Science University of Tokyo Hongo 7-3-1 Bunkyo-ku Tokyo 113 Japan Abstract We present an outline of the genome information acquisition GENIA project for automatically extracting biochemical information from journal papers and abstracts. GENIA will be available over the Internet and is designed to aid in information extraction retrieval and visualisation and to help reduce information overload on researchers. The vast repository of papers available online in databases such as MEDLINE is a natural environment in which to develop language engineering methods and tools and is an opportunity to show how language engineering can play a key role on the Internet. 1 Introduction In the context of the global research effort to map the human genome the Genome Informatics Extraction project GENIA GENIA 1999 aims to support such research by automatically extracting information from biochemical papers and their abstracts such as those available from MEDLINE MEDLINE 1999 written by domain specialists. The vast repository of research papers which are the results of genome research are a natural environment in which to develop language engineering tools and methods. This project aims to help reduce the problems caused by information overload on the researchers who want to access the information held inside collections such as MEDLINE. The key elements of the project are centered around the tasks of information extraction and retrieval. These are outlined below and then the interface which integrates them is described. Terminology identification and classification .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.