TAILIEUCHUNG - Báo cáo khoa học: "The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers"

We present an outline of the genome information acquisition (GENIA) project for automatically extracting biochemical information from journal papers and abstracts. GENIA will be available over the Internet and is designed to aid in information extraction, retrieval and visualisation and to help reduce information overload on researchers. The vast repository of papers available online in databases such as MEDLINE is a natural environment in which to develop language engineering methods and tools and is an opportunity to show how language engineering can play a key role on the Internet. . | Proceedings of EACL 99 The GENIA project corpus-based knowledge acquisition and information extraction from genome research papers Nigel Collier Hyun Seok Park Norihiro Ogata Yuka Tateishi Chikashi Nobata Tomoko Ohta Tateshi Sekimizu Hisao Imai Katsutoshi Ibushi Jun-ichi Tsujii nigel hsp20 Ogata yucca nova okap sekimizu hisao is. . Department of Information Science Graduate School of Science University of Tokyo Hongo 7-3-1 Bunkyo-ku Tokyo 113 Japan Abstract We present an outline of the genome information acquisition GENIA project for automatically extracting biochemical information from journal papers and abstracts. GENIA will be available over the Internet and is designed to aid in information extraction retrieval and visualisation and to help reduce information overload on researchers. The vast repository of papers available online in databases such as MEDLINE is a natural environment in which to develop language engineering methods and tools and is an opportunity to show how language engineering can play a key role on the Internet. 1 Introduction In the context of the global research effort to map the human genome the Genome Informatics Extraction project GENIA GENIA 1999 aims to support such research by automatically extracting information from biochemical papers and their abstracts such as those available from MEDLINE MEDLINE 1999 written by domain specialists. The vast repository of research papers which are the results of genome research are a natural environment in which to develop language engineering tools and methods. This project aims to help reduce the problems caused by information overload on the researchers who want to access the information held inside collections such as MEDLINE. The key elements of the project are centered around the tasks of information extraction and retrieval. These are outlined below and then the interface which integrates them is described. Terminology identification and classification .

Ðình Trung 63 2 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461871 55

Giới thiệu :Lập trình mã nguồn mở

14 22664 59

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10897 529

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10069 446

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9533 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8293 1125

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8243 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7866 2220

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6691 253

Vật lý hạt cơ bản (1)

29 5775 85

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Bibliography on Medieval Women, Gender, and Medicine 1980-2009

82 210 0 28-04-2024

Báo cáo tốt nghiệp: Vận hành và bảo dưỡng trong MPLS

92 144 3 28-04-2024

Diseases of the Liver and Biliary System - part 1

33 125 0 28-04-2024

Khóa luận tốt nghiệp: Giải pháp nâng cao chất lượng phương thức thanh toán tín dụng chứng từ phục vụ xuất nhập khẩu tại ngân hàng Thương mại Việt Nam - Trần Thị Tân

12 118 0 28-04-2024

Kỹ thuật nuôi cá rồng part 5

7 128 0 28-04-2024

Lãi suất cơ bản, công cụ quan trọng của chính sách tiền tệ

5 114 0 28-04-2024

Fecal Incontinence Diagnosis and Treatment - part 8

35 103 0 28-04-2024

Gastroenterology an illustrated colour text - part 10

10 89 0 28-04-2024

Báo cáo nghiên cứu nông nghiệp " Introduction of the principles of GAP for citrus through implementation of citrus IPM using Farmer Field Schools "

12 92 0 28-04-2024

Báo cáo nghiên cứu nông nghiệp " Biofertiliser inoculant technology for the growth of rice in Vietnam: Developing technical infrastructure for quality assurance and village production for farmers "

12 87 0 28-04-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7866 2220

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5753 1381

Ebook Chào con ba mẹ đã sẵn sàng

112 3769 1231

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5326 1136

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8293 1125

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3502 643

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10897 529

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3688 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4055 516

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4132 480