Scientific report: "Opinion and Generic Question Answering Systems: a Performance Analysis"

Opinion and Generic Question Answering Systems: a Performance Analysis

Alexandra Balahur (1, 2)
(1) DLSI, University of Alicante, Ap. De Correos 99, 03080 Alicante
(2) IPSC, EC Joint Research Centre, Via E. Fermi, 21027 Ispra
abalahur@

Andrés Montoyo
DLSI, University of Alicante, Ap. De Correos 99, 03080 Alicante
montoyo@

Ester Boldrini
DLSI, University of Alicante, Ap. De Correos 99, 03080 Alicante
eboldrini@

Patricio Martínez-Barco
DLSI, University of Alicante, Ap. De Correos 99, 03080 Alicante
patricio@

Abstract

The importance of new textual genres such as blogs or forum entries is growing in parallel with the evolution of the Social Web. This paper presents two corpora of blog posts, in English and in Spanish, annotated according to the EmotiBlog annotation scheme. Furthermore, we created 20 factual and opinionated questions for each language, together with the Gold Standard for their answers in the corpus. The purpose of our work is to study the challenges involved in a mixed fact and opinion question answering setting by comparing the performance of two Question Answering (QA) systems in such a setting. The first one is open domain, while the second one is opinion-oriented. We evaluate the two systems separately in both languages and propose possible solutions to improve QA systems that have to process mixed questions.

Introduction and motivation

In the last few years, the number of blogs has grown exponentially. As a result, the Web contains more and more subjective texts. Research from the Pew Institute shows that blogs are created daily (Pang and Lee, 2008). They address a great variety of topics (computer science, sociology, political science, or economics) and are written by different types of people, and are thus a relevant resource for large-community behavior analysis. Due to the high volume of data contained in blogs, new Natural Language Processing (NLP) resources, tools, and methods are needed in order to manage ...
