TAILIEUCHUNG - Báo cáo khoa học: "Conundrums in Noun Phrase Coreference Resolution: Making Sense of the State-of-the-Art"

We aim to shed light on the state-of-the-art in NP coreference resolution by teasing apart the differences in the MUC and ACE task definitions, the assumptions made in evaluation methodologies, and inherent differences in text corpora. First, we examine three subproblems that play a role in coreference resolution: named entity recognition, anaphoricity determination, and coreference element detection. | Conundrums in Noun Phrase Coreference Resolution Making Sense of the State-of-the-Art Veselin Stoyanov Cornell University Ithaca NY ves@ Nathan Gilbert University of Utah Salt Lake City UT ngilbert@ Claire Cardie Cornell University Ithaca NY cardie@ Ellen Riloff University of Utah Salt Lake City UT riloff@ Abstract We aim to shed light on the state-of-the-art in NP coreference resolution by teasing apart the differences in the MUC and ACE task definitions the assumptions made in evaluation methodologies and inherent differences in text corpora. First we examine three subproblems that play a role in coreference resolution named entity recognition anaphoric-ity determination and coreference element detection. We measure the impact of each subproblem on coreference resolution and confirm that certain assumptions regarding these subproblems in the evaluation methodology can dramatically simplify the overall task. Second we measure the performance of a state-of-the-art coreference resolver on several classes of anaphora and use these results to develop a quantitative measure for estimating coreference resolution performance on new data sets. 1 Introduction As is common for many natural language processing problems the state-of-the-art in noun phrase NP coreference resolution is typically quantified based on system performance on manually annotated text corpora. In spite of the availability of several benchmark data sets . MUC-6 1995 ACE NIST 2004 and their use in many formal evaluations as a field we can make surprisingly few conclusive statements about the state-of-the-art in NP coreference resolution. In particular it remains difficult to assess the effectiveness of different coreference resolution approaches even in relative terms. For example the F-measure reported by McCallum and Wellner 2004 was produced by a system using perfect information for several linguistic subproblems. In contrast the F-measure

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.