Scientific report: "Automatic Detection of Nonreferential It in Spoken Multi-Party Dialog"

Christoph Müller
EML Research gGmbH, Villa Bosch, Schloß-Wolfsbrunnenweg 33, 69118 Heidelberg, Germany

Abstract

We present an implemented machine learning system for the automatic detection of nonreferential "it" in spoken dialog. The system builds on shallow features extracted from dialog transcripts. Our experiments indicate a level of performance that makes the system usable as a preprocessing filter for a coreference resolution system. We also report results of an annotation study dealing with the classification of "it" by naive subjects.

1 Introduction

This paper describes an implemented system for the detection of nonreferential "it" in spoken multi-party dialog. The system has been developed on the basis of meeting transcriptions from the ICSI Meeting Corpus (Janin et al., 2003), and it is intended as a preprocessing component for a coreference resolution system in the DIANA-Summ dialog summarization project.

Consider the following utterance:

MN059: Yeah. Yeah. Yeah. I'm sure I could learn a lot about um yeah just how to - how to come up with these structures cuz it's - it's very easy to whip up something quickly but it maybe then makes sense to - to me but not to anybody else and - and if we want to share and integrate things they must - well they must be well designed really. (Bed017)

In this example, only one of the three instances of "it" is a referential pronoun. The first "it" appears in the reparandum part of a speech repair (Heeman & Allen, 1999). It is replaced by a subsequent alteration and is thus not part of the final utterance. The second "it" is the subject of an extraposition construction and serves as the placeholder for the postposed infinitive phrase "to whip up something quickly". Only the third "it" is a referential pronoun, which anaphorically refers to something. The task of the system described in the following is to identify and filter out nonreferential instances of "it".
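The three-way distinction drawn in the example utterance can be made concrete with a small rule-based sketch. This is only an illustration of the task, not the paper's system, which is a machine learning classifier over shallow features that are not listed in this excerpt. The function names, the labels, and the two surface patterns (a repeated token after a dash for speech repairs, and "it's/it is ... to <word>" for extraposition) are all assumptions made here for illustration.

```python
import re

def tokenize(utterance):
    # Lowercased words (clitics such as "it's" stay whole) plus standalone dashes.
    return re.findall(r"[a-z]+(?:'[a-z]+)?|-", utterance.lower())

def is_it(token):
    return token in ("it", "it's")

def classify_it(tokens, i):
    """Rough guess for the 'it' token at index i (hypothetical rules)."""
    # Speech repair: "it's - it's ..." makes the first copy the reparandum.
    if tokens[i + 1:i + 3] == ["-", tokens[i]]:
        return "repair"
    rest = tokens[i + 1:]
    # A bare "it" needs a following copula to count as extraposition here.
    if tokens[i] == "it":
        if not rest or rest[0] not in ("is", "was"):
            return "referential"
        rest = rest[1:]
    # Extraposition: "to" within a short window, preceded by at least one
    # word, followed by another word, and not interrupted by a dash.
    window = rest[:4]
    if "-" not in window and "to" in window[1:] and len(rest) > window.index("to") + 1:
        return "extraposition"
    # Default: treat the pronoun as referential.
    return "referential"

# Excerpt of the example utterance (apostrophes restored).
utterance = ("cuz it's - it's very easy to whip up something quickly "
             "but it maybe then makes sense to - to me but not to anybody else")

tokens = tokenize(utterance)
labels = [classify_it(tokens, i) for i, t in enumerate(tokens) if is_it(t)]
print(labels)  # ['repair', 'extraposition', 'referential']
```

Used as a preprocessing filter, only the tokens labelled "referential" would be passed on to coreference resolution. Hand-written patterns like these are brittle on spontaneous speech, which is one reason a learned classifier is the more plausible choice for this task.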
