TAILIEUCHUNG - Báo cáo khoa học: "An Integrated Term-Based Corpus Query System"

In this paper we describe the X-TRACT workbench, which enables efficient termbased querying against a domain-specific literature corpus. Its main aim is to aid domain specialists in locating and extracting new knowledge from scientific literature corpora. Before querying, a corpus is automatically terminologically analysed by the ATRACT system, which performs terminology recognition based on the C/NCvalue method enhanced by incorporation of term variation handling. The results of terminology processing are annotated in XML, and the produced XML documents are stored in an XML-native database. All corpus retrieval operations are performed against this database using an XML query language. We. | An Integrated Term-Based Corpus Query System Irena Spasic Goran Nenadic Computer Science Dept of Computation University of Salford UMIST I. Spasic@ Kostas Manios Computer Science University of Salford @ Sophia Ananiadou Computer Science University of Salford Abstract In this paper we describe the X-TRACT workbench which enables efficient termbased querying against a domain-specific literature corpus. Its main aim is to aid domain specialists in locating and extracting new knowledge from scientific literature corpora. Before querying a corpus is automatically terminologically analysed by the ATRACT system which performs terminology recognition based on the C NC-value method enhanced by incorporation of term variation handling. The results of terminology processing are annotated in XML and the produced XML documents are stored in an XML-native database. All corpus retrieval operations are performed against this database using an XML query language. We illustrate the way in which the X-TRACT workbench can be utilised for knowledge discovery literature mining and conceptual information extraction. 1 Introduction New scientific discoveries usually result in an abundance of publications verbalising these findings in an attempt to share new knowledge with other scientists. Electronically available texts are continually being created and updated and thus the knowledge represented in such texts is more up-to-date than in any other media. The sheer amount of published papers1 makes it difficult for a human to efficiently 1 For example the Medline database PubMed currently contains over 12 million abstracts in the domains of molecular biology biomedicine and medicine growing by more than abstracts each month. localise the information of interest not only in a collection of documents but also within a single document. The growing number of electronically available .

Tố Loan 95 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Ebook The integrated circuit Hobbyist’s handbook

14 73 2

Ebook Analysis and design of analog integrated circuits (4th edition): Part 1

415 55 0

Ebook Analysis and design of analog integrated circuits (4th edition): Part 2

482 65 0

Ebook Digital integrated circuits prentice hall: Part 1

241 74 0

Ebook Digital integrated circuits prentice hall: Part 2

249 85 0

Lecture Advertising and promotion: An integrated marketing communications perspective (10/e): Chapter 7 - George E. Belch, Michael A. Belch

20 132 3

An integrated multi-stage supply chain inventory model with imperfect production process

16 72 0

Effect of integrated nutrient management on yield and yield attributes of pearl millet [pennisetum glaucum (L.) R. Br. Emend stuntz]Effect of integrated nutrient management on yield and yield attributes of pearl millet [pennisetum glaucum (L.) R. Br. Emend stuntz]

5 87 0

Effect of integrated fertilization on qualitative and quantitative traits of Radish (Raphanus sativus L.)

9 45 1

Effect of integrated nitrogen management on macronutrient content in toria (Brassica campestris L.var.M-27)

8 36 1

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461904 55

Giới thiệu :Lập trình mã nguồn mở

14 22824 64

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10938 531

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10123 449

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9552 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8320 1127

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8264 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7878 2222

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6747 253

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5901 1422

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Bibliography on Medieval Women, Gender, and Medicine 1980-2009

82 214 0 07-05-2024

Trading Strategies Profit Making Techniques For Stock_8

23 178 1 07-05-2024

TƯƠNG QUAN GIỮA MÔ HỌC, GIẢI PHẪU VÀ HÌNH ẢNH CỦA CÁC KHỐI U PHẦN PHỤ

3 170 0 07-05-2024

Management and Services Part 1

10 161 0 07-05-2024

Posted prices versus bargaining in markets_7

23 161 0 07-05-2024

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 154 0 07-05-2024

Đóng mới oto 8 chỗ ngồi part 9

10 122 0 07-05-2024

Đề tài: Tìm hiểu một số yêu cầu đặt ra với một phòng thu âm, để đảm bảo chất lượng âm thanh trong sản phẩm đa phương tiện

8 164 1 07-05-2024

Khurana et al. Journal of Orthopaedic Surgery and Research 2010, 5:23

7 136 0 07-05-2024

XỬ TRÍ CHẤN THƯƠNG SỌ NÃO KÍN

1 117 1 07-05-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7878 2222

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5901 1422

Ebook Chào con ba mẹ đã sẵn sàng

112 3776 1242

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5365 1137

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8320 1127

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3526 646

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10938 531

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3712 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4103 519

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4149 480