TAILIEUCHUNG - Báo cáo khoa học: "A Machine Learning Approach to Pronoun Resolution in Spoken Dialogue"

We apply a decision tree based approach to pronoun resolution in spoken dialogue. Our system deals with pronouns with NPand non-NP-antecedents. We present a set of features designed for pronoun resolution in spoken dialogue and determine the most promising features. We evaluate the system on twenty Switchboard dialogues and show that it compares well to Byron’s (2002) manually tuned system. | A Machine Learning Approach to Pronoun Resolution in Spoken Dialogue Michael Strube and Christoph Miiller European Media Laboratory GmbH Villa Bosch SchloB-Wolfsbrunnenweg 33 69118 Heidelberg Germany @ Abstract We apply a decision tree based approach to pronoun resolution in spoken dialogue. Our system deals with pronouns with NP-and non-NP-antecedents. We present a set of features designed for pronoun resolution in spoken dialogue and determine the most promising features. We evaluate the system on twenty Switchboard dialogues and show that it compares well to Byron s 2002 manually tuned system. 1 Introduction Corpus-based methods and machine learning techniques have been applied to anaphora resolution in written text with considerable success Soon et al. 2001 Ng Cardie 2002 among others . It has been demonstrated that systems based on these approaches achieve a performance that is comparable to hand-crafted systems. Since they can easily be applied to new domains it seems also feasible to port a given corpus-based anaphora resolution system from written text to spoken dialogue. This paper describes the extensions and adaptations needed for applying our anaphora resolution system Muller et al. 2002 Strube et al. 2002 to pronoun resolution in spoken dialogue. There are important differences between written text and spoken dialogue which have to be accounted for. The most obvious difference is that in spoken dialogue there is an abundance of personal and demonstrative pronouns with non-NP-antecedents or no antecedents at all. Corpus studies have shown that a significant amount of pronouns in spoken dialogue have non-NP-antecedents Byron Allen 1998 report that about 50 of the pronouns in the TRAINS93 corpus have non-NP-antecedents. Eckert Strube 2000 note that only about 45 of the pronouns in a set of Switchboard dialogues have NP-antecedents. The remainder consists of 22 which have non-NP-antecedents and 33 without

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.