TAILIEUCHUNG - Báo cáo khoa học: "INFORMATION RETRIEVAL USING ROBUST NATURAL LANGUAGE PROCESSING"

We developed a prototype information retrieval system which uses advanced natural language processing techniques to enhance the effectiveness of traditional key-word based document retrieval. The backbone of our system is a statistical retrieval engine which performs automated indexing of documents, then search and ranking in response to user queries. This core architecture is augmented with advanced natural language processing tools which are both robust and efficient. | INFORMATION RETRIEVAL USING ROBUST NATURAL LANGUAGE PROCESSING Tomek Sfrzalkowski and Barbara Vautheyt Cour ant Institute of Mathematical Sciences New York University 715 Broadway rm. 704 New York NY 10003 tomek@ ABSTRACT We developed a prototype information retrieval system which uses advanced natural language processing techniques to enhance the effectiveness of traditional key-word based document retrieval. The backbone of our system is a statistical retrieval engine which performs automated indexing of documents then search and ranking in response to user queries. This core architecture is augmented with advanced natural language processing tools which are both robust and efficient. In early experiments the augmented system has displayed capabilities that appear to make it superior to the purely statistical base. INTRODUCTION A typical information retrieval IR task is to select documents from a database in response to a user s query and rank these documents according to relevance. This has been usually accomplished using statistical methods often coupled with manual encoding but it is now widely believed that these traditional methods have reached thefr limits. 1 2 These limits are particularly acute for text databases where natural language processing NLP has long been considered necessary for further progress. Unfortunately the difficulties encountered in applying computational linguistics technologies to text processing have contributed to a wide-spread belief that automated NLP may not be suitable in IR. These difficulties included inefficiency limited coverage and prohibitive cost of manual effort requữeđ to build lexicons and knowledge bases for each new text domain. On the other hand while numerous t Current address Laboratoire d lnformatique Universite de Fribourg ch. du Musee 3 1700 Fribourg Switzerland vauthey@cfnmi51 .bitnet. 1 As far as the automatic document retrieval is concerned. Techniques involving various forms of relevance feedback

Thảo Vy 91 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Information retrieval techniques: Lecture 3

17 12 1

Information retrieval techniques: Lecture 23

9 14 1

Information retrieval techniques: Lecture 2

25 14 1

Information retrieval techniques: Lecture 45

6 12 1

Ebook Multimedia information retrieval: Part 2 - Stefan Rüger

73 22 1

Information retrieval techniques: Lecture 1

15 17 1

Principles of Visual Information Retrieval: Part 1 - Michael S. Lew (Ed.)

209 29 1

Distributed backup of user profiles for information retrieval

5 56 0

Information retrieval techniques: Lecture 4

16 13 1

Information retrieval techniques: Lecture 25

12 14 1

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462283 61

Giới thiệu :Lập trình mã nguồn mở

14 24831 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11281 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10508 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9785 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8462 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8089 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7464 1763

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7185 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

B2B Content Marketing: 2012 Benchmarks, Budgets & Trends

17 213 3 22-11-2024

Quy Trình Canh Tác Cây Bông Vải

8 148 1 22-11-2024

báo cáo hóa học:" Perceptions of rewards among volunteer caregivers of people living with AIDS working in faith-based organizations in South Africa: a qualitative study"

10 146 1 22-11-2024

Báo cáo " Thẩm quyền quản lí nhà nước đối với hoạt động quảng cáo thực trạng và hướng hoàn thiện "

7 196 7 22-11-2024

Báo cáo " Bàn về hành vi pháp luật và hành vi đạo đức "

11 169 2 22-11-2024

Báo cáo nghiên cứu khoa học " Vai trò chính quyền địa phương trong phát triển kinh tế : khu chuyên doanh gốm sứ ( Trung Quốc ) và Bát Tràng ( Việt Nam )("

11 206 1 22-11-2024

Chủ đề 3 : SỰ CÂN BẰNG CỦA VẬT RẮN (4 tiết)

9 197 1 22-11-2024

CUỘC KHÁNG CHIẾN CHỐNG THỰC DÂN PHÁP KẾT THÚC (1953 - 1954)_5

11 133 1 22-11-2024

The Ombudsman Enterprise and Administrative Justice

309 132 0 22-11-2024

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 133 1 22-11-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8089 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7464 1763

Ebook Chào con ba mẹ đã sẵn sàng

112 4364 1369

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6148 1258

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3786 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3909 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4614 562

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11281 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4446 490