Scientific report: "The Same-head Heuristic for Coreference"

Micha Elsner and Eugene Charniak
Brown Laboratory for Linguistic Information Processing (BLLIP)
Brown University, Providence, RI 02912
melsner ec @

Abstract

We investigate coreference relationships between NPs with the same head noun. It is relatively common in unsupervised work to assume that such pairs are coreferent, but this is not always true, especially if realistic mention detection is used. We describe the distribution of non-coreferent same-head pairs in news text, and present an unsupervised generative model which learns not to link some same-head NPs using syntactic features, improving precision.

1 Introduction

Full NP coreference, the task of discovering which non-pronominal NPs in a discourse refer to the same entity, is widely known to be challenging. In practice, however, most work focuses on the subtask of linking NPs with different head words. Decisions involving NPs with the same head word have not attracted nearly as much attention, and many systems, especially unsupervised ones, operate under the assumption that all same-head pairs corefer. This is by no means always the case; there are several systematic exceptions to the rule. In this paper, we show that these exceptions are fairly common, and describe an unsupervised system which learns to distinguish them from coreferent same-head pairs.

There are several reasons why relatively little attention has been paid to same-head pairs. Primarily, this is because they are a comparatively easy subtask in a notoriously difficult area: Stoyanov et al. (2009) show that, among NPs headed by common nouns, those which have an exact match earlier in the document are the easiest to resolve (variant MUC score .82 on MUC-6), and while those with partial matches are quite a bit harder (.53), by far the worst performance is on those without any match at all (.27). This effect is magnified by most popular metrics for coreference, which reward finding links within large clusters more than they ...
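
The same-head assumption discussed in the introduction can be illustrated with a minimal sketch. This is not the authors' system, only the baseline heuristic that it refines: every pair of mentions sharing a head noun is placed in the same cluster. The `Mention` class, the `same_head_clusters` function, and the example NPs are hypothetical and chosen purely for illustration; head words are assumed to be supplied by an upstream parser.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Mention:
    """A non-pronominal NP mention (hypothetical type for this sketch).

    `head` is assumed to come from an upstream parser or head-finder.
    """
    text: str
    head: str


def same_head_clusters(mentions):
    """Apply the same-head heuristic: group mentions by lowercased head noun."""
    clusters = defaultdict(list)
    for mention in mentions:
        clusters[mention.head.lower()].append(mention)
    # dicts keep insertion order, so clusters follow order of first mention
    return list(clusters.values())


# Invented example mentions, not taken from the paper's data.
mentions = [
    Mention("the president of the company", "president"),
    Mention("the president", "president"),
    Mention("the president of the union", "president"),
    Mention("a new factory", "factory"),
]
clusters = same_head_clusters(mentions)
for cluster in clusters:
    print([m.text for m in cluster])
```

In this invented example the heuristic links "the president of the company" with "the president of the union", two NPs that plausibly name different people. Errors of this kind are what a model that learns not to link some same-head pairs is meant to avoid.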
