TAILIEUCHUNG - Báo cáo khoa học: "Do Automatic Annotation Techniques Have Any Impact on Supervised Complex Question Answering?"

In this paper, we analyze the impact of different automatic annotation methods on the performance of supervised approaches to the complex question answering problem (deﬁned in the DUC-2007 main task). Huge amount of annotated or labeled data is a prerequisite for supervised training. The task of labeling can be accomplished either by humans or by computer programs. When humans are employed, the whole process becomes time consuming and expensive. | Do Automatic Annotation Techniques Have Any Impact on Supervised Complex Question Answering Yllias Chali University of Lethbridge Lethbridge AB Canada chali@ Sadid A. Hasan University of Lethbridge Lethbridge AB Canada hasan@ Shafiq R. Joty University of British Columbia Vancouver BC Canada rjoty@ Abstract In this paper we analyze the impact of different automatic annotation methods on the performance of supervised approaches to the complex question answering problem defined in the DUC-2007 main task . Huge amount of annotated or labeled data is a prerequisite for supervised training. The task of labeling can be accomplished either by humans or by computer programs. When humans are employed the whole process becomes time consuming and expensive. So in order to produce a large set of labeled data we prefer the automatic annotation strategy. We apply five different automatic annotation techniques to produce labeled data using ROUGE similarity measure Basic Element BE overlap syntactic similarity measure semantic similarity measure and Extended String Subsequence Kernel ESSK . The representative supervised methods we use are Support Vector Machines SVM Conditional Random Fields CRF Hidden Markov Models HMM and Maximum Entropy MaxEnt . Evaluation results are presented to show the impact. 1 Introduction In this paper we consider the complex question answering problem defined in the DUC-2007 main task1. We focus on an extractive approach of summarization to answer complex questions where a subset of the sentences in the original documents are chosen. For supervised learning methods huge amount of annotated or labeled data sets are obviously required as a precondition. The decision as to whether a sentence is important enough 1http projects duc duc2007 to be annotated can be taken either by humans or by computer programs. When humans are employed in the process producing such a large labeled corpora becomes time consuming

Ðình Chương 60 4 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Scaling up Automatic Cross-Lingual Semantic Role Annotation"

6 55 0

Báo cáo khoa học: "Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and POS Tagging – A Case Study"

9 44 0

Báo cáo khoa học: "Do Automatic Annotation Techniques Have Any Impact on Supervised Complex Question Answering?"

4 49 0

Báo cáo khoa học: "Automatic Image Annotation Using Auxiliary Text Information"

9 68 0

Báo cáo khoa học: "Text Analysis for Automatic Image Annotation"

8 63 0

Báo cáo khoa học: "Automatic Annotation for All Semantic Layers in FrameNet"

4 40 0

Automatic semantic annotation of sport news using knowledge base and extraction patterns

8 70 0

Automatic annotation of lecture videos for multimedia driven pedagogical platforms

32 70 0

Automatic landmark annotation and dense correspondence registration for 3D human facial images

12 25 1

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462344 61

Giới thiệu :Lập trình mã nguồn mở

14 26318 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11357 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10554 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9848 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8894 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8511 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8104 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7802 1800

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7283 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Quy Trình Canh Tác Cây Bông Vải

8 167 3 01-01-2025

Báo cáo nghiên cứu khoa học " HÃY LÀM CHO HUẾ XANH HƠN VÀ ĐẸP HƠN "

6 183 3 01-01-2025

Bảng màu theo chữ cái – V

11 171 2 01-01-2025

Chương 10: Các phương pháp tính quá trình quá độ trong mạch điện tuyến tính

57 237 7 01-01-2025

Hướng dẫn chế độ dinh dưỡng cho người bệnh viêm khớp

5 173 2 01-01-2025

Sử dụng mô hình ARCH và GARCH để phân tích và dự báo về giá cổ phiếu trên thị trường chứng khoán

24 1075 2 01-01-2025

báo cáo khoa học: "Malignant peripheral nerve sheath tumor arising from the greater omentum: Case report"

4 145 1 01-01-2025

OPEN SOURCE ERP REASONABLE TOOLS FOR MANUFACTURING SMEs?

1 149 1 01-01-2025

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 142 1 01-01-2025

Lịch sử Trung Quốc 5000 năm tập 3 part 2

54 155 1 01-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8104 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7802 1800

Ebook Chào con ba mẹ đã sẵn sàng

112 4412 1374

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6332 1274

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8894 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3849 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3925 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4735 566

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11357 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4513 490