Do Automatic Annotation Techniques Have Any Impact on Supervised Complex Question Answering?

Yllias Chali
University of Lethbridge
Lethbridge, AB, Canada
chali@cs.uleth.ca

Sadid A. Hasan
University of Lethbridge
Lethbridge, AB, Canada
hasan@cs.uleth.ca

Shafiq R. Joty
University of British Columbia
Vancouver, BC, Canada
rjoty@cs.ubc.ca

Abstract

In this paper, we analyze the impact of different automatic annotation methods on the performance of supervised approaches to the complex question answering problem (defined in the DUC-2007 main task). A huge amount of annotated or labeled data is a prerequisite for supervised training. The task of labeling can be accomplished either by humans or by computer programs. When humans are employed, the whole process becomes time consuming and expensive. So, in order to produce a large set of labeled data, we prefer the automatic annotation strategy. We apply five different automatic annotation techniques to produce labeled data, using the ROUGE similarity measure, Basic Element (BE) overlap, a syntactic similarity measure, a semantic similarity measure, and the Extended String Subsequence Kernel (ESSK). The representative supervised methods we use are Support Vector Machines (SVM), Conditional Random Fields (CRF), Hidden Markov Models (HMM), and Maximum Entropy (MaxEnt). Evaluation results are presented to show the impact.

1 Introduction

In this paper, we consider the complex question answering problem defined in the DUC-2007 main task.[1] We focus on an extractive approach to summarization to answer complex questions, where a subset of the sentences in the original documents is chosen. For supervised learning methods, a huge amount of annotated or labeled data is obviously required as a precondition. The decision as to whether a sentence is important enough to be annotated can be taken either by humans or by computer programs. When humans are employed in the process, producing such large labeled corpora becomes time consuming and expensive.

[1] http://www-nlpir.nist.gov/projects/duc/duc2007/
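To make the automatic annotation idea concrete, below is a minimal sketch (in Python; not the authors' code) of a ROUGE-1-style annotator: each document sentence receives a positive label if its unigram recall against a human reference summary meets a threshold. The threshold of 0.5, the function names, and the toy data are all illustrative assumptions, not values from the paper.

# Hypothetical sketch of ROUGE-1-style automatic annotation.
# Sentences are labeled 1 (summary-worthy) or 0 based on unigram
# recall against a reference summary; 0.5 is an assumed threshold.
from collections import Counter

def rouge1_recall(candidate, reference):
    """Fraction of the reference's unigrams covered by the candidate."""
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())
    overlap = sum(min(cand_counts[w], c) for w, c in ref_counts.items())
    total = sum(ref_counts.values())
    return overlap / total if total else 0.0

def annotate(sentences, reference_summary, threshold=0.5):
    """Return (sentence, label) pairs: 1 = include in extract, 0 = skip."""
    return [(s, int(rouge1_recall(s, reference_summary) >= threshold))
            for s in sentences]

if __name__ == "__main__":
    ref = "automatic annotation produces labeled data for supervised training"
    sents = [
        "Automatic annotation can produce large labeled data sets cheaply.",
        "The weather in Lethbridge was pleasant that spring.",
    ]
    for sentence, label in annotate(sents, ref):
        print(label, sentence)

The same labeling loop could be driven by any of the other similarity measures named above (BE overlap, syntactic, semantic, or ESSK) simply by swapping out the scoring function; the resulting labeled sentences then serve as training data for the supervised learners.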