TAILIEUCHUNG - Báo cáo khoa học: "Learning to Rank Answers on Large Online QA Collections"

This work describes an answer ranking engine for non-factoid questions built using a large online community-generated question-answer collection (Yahoo! Answers). We show how such collections may be used to effectively set up large supervised learning experiments. Furthermore we investigate a wide range of feature types, some exploiting NLP processors, and demonstrate that using them in combination leads to considerable improvements in accuracy. | Learning to Rank Answers on Large Online QA Collections Mihai Surdeanu Massimiliano Ciaramita Hugo Zaragoza Barcelona Media Innovation Center Yahoo Research Barcelona massi hugo @ Abstract This work describes an answer ranking engine for non-factoid questions built using a large online community-generated question-answer collection Yahoo Answers . We show how such collections may be used to effectively set up large supervised learning experiments. Furthermore we investigate a wide range of feature types some exploiting NLP processors and demonstrate that using them in combination leads to considerable improvements in accuracy. High Quality Low Quality Q How do you quiet a squeaky door A Spray WD-40 directly onto the hinges of the door. Open and close the door several times. Remove hinges if the door still squeaks. Remove any rust dirt or loose paint. Apply WD-40 to removed hinges. Put the hinges back open and close door several times again. Q How to extract html tags from an html documents with c A very carefully Table 1 Sample content from Yahoo Answers. 1 Introduction The problem of Question Answering QA has received considerable attention in the past few years. Nevertheless most of the work has focused on the task of factoid QA where questions match short answers usually in the form of named or numerical entities. Thanks to international evaluations organized by conferences such as the Text REtrieval Conference TREC 1 or the Cross Language Evaluation Forum CLEF Workshop2 annotated corpora of questions and answers have become available for several languages which has facilitated the development of robust machine learning models for the task. The situation is different once one moves beyond the task of factoid QA. Comparatively little research has focused on QA models for non-factoid questions such as causation manner or reason questions. Because virtually no training data is available for this problem most automated

Diễm Phượng 70 9 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Learning Perl - Learning Perl -

6 79 0

Learning Perl - Mảng băm

4 78 0

Learning Perl - Các cấu trúc điều khiển khác

5 95 0

Learning Perl - Tước hiệu tệp và kiểm thử tệp

6 82 1

Learning Perl - Giới thiệu qua về Perl part 1

6 71 1

Learning Perl - Giới thiệu qua về Perl part 2

6 78 1

Learning Perl - Giới thiệu qua về Perl part 3

6 84 1

Learning Perl - Dữ liệu vô hướng part 1

5 70 0

Learning Perl - Dữ liệu vô hướng part 2

5 94 0

Learning Perl - Biểu thức chính qui part 1

5 73 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462351 61

Giới thiệu :Lập trình mã nguồn mở

14 26660 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11375 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10567 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9855 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8518 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7915 1821

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7289 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Báo cáo nghiên cứu khoa học " KẾT QUẢ NGHIÊN CỨU BƯỚC ĐẦU VỀ THIÊN ĐỊCH CHÂN KHỚP TRÊN CÂY THANH TRÀ Ở THỪA THIÊN HUẾ "

7 287 4 08-01-2025

Quy Trình Canh Tác Cây Bông Vải

8 170 3 08-01-2025

báo cáo hóa học:" Quality of data collection in a large HIV observational clinic database in sub-Saharan Africa: implications for clinical research and audit of care"

7 163 4 08-01-2025

Báo cáo " Bàn về hành vi pháp luật và hành vi đạo đức "

11 182 2 08-01-2025

Valve Selection Handbook - Fourth Edition

337 150 2 08-01-2025

ETHICAL CODE HANDBOOK: Demonstrate your commitment to high standards

7 156 1 08-01-2025

Bệnh sán lá gan trên gia súc và cách phòng trị

3 170 1 08-01-2025

Xinh xinh vườn nhà

6 135 0 08-01-2025

Lịch sử Trung Quốc 5000 năm tập 3 part 2

54 157 1 08-01-2025

Báo cáo lâm nghiệp: "Assessment of the effects of below-zero temperatures on photosynthesis and chlorophyll a fluorescence in leaf discs of Eucalyptus globulu"

4 152 0 08-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7915 1821

Ebook Chào con ba mẹ đã sẵn sàng

112 4435 1376

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6353 1276

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3859 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3930 610

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4768 567

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11375 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4533 490