TAILIEUCHUNG - Báo cáo khoa học: "Improving Name Tagging by Reference Resolution and Relation Detection"

Information extraction systems incorporate multiple stages of linguistic analysis. Although errors are typically compounded from stage to stage, it is possible to reduce the errors in one stage by harnessing the results of the other stages. We demonstrate this by using the results of coreference analysis and relation extraction to reduce the errors produced by a Chinese name tagger. We use an N-best approach to generate multiple hypotheses and have them re-ranked by subsequent stages of processing. We obtained thereby a reduction of 24% in spurious and incorrect name tags, and a reduction of 14% in missed tags. . | Improving Name Tagging by Reference Resolution and Relation Detection Heng Ji Ralph Grishman Department of Computer Science New York University New York NY 10003 UsA heng j i@ grishman@ Abstract Information extraction systems incorporate multiple stages of linguistic analysis. Although errors are typically compounded from stage to stage it is possible to reduce the errors in one stage by harnessing the results of the other stages. We demonstrate this by using the results of coreference analysis and relation extraction to reduce the errors produced by a Chinese name tagger. We use an N-best approach to generate multiple hypotheses and have them re-ranked by subsequent stages of processing. We obtained thereby a reduction of 24 in spurious and incorrect name tags and a reduction of 14 in missed tags. 1 Introduction Systems which extract relations or events from a document typically perform a number of types of linguistic analysis in preparation for information extraction. These include name identification and classification parsing or partial parsing semantic classification of noun phrases and coreference analysis. These tasks are reflected in the evaluation tasks introduced for MUC-6 named entity coreference template element and MUC-7 template relation . In most extraction systems these stages of analysis are arranged sequentially with each stage using the results of prior stages and generating a single analysis that gets enriched by each stage. This provides a simple modular organization for the extraction system. Unfortunately each stage also introduces a certain level of error into the analysis. Furthermore these errors are compounded - for example errors in name recognition may lead to errors in parsing. The net result is that the final output relations or events may be quite inaccurate. This paper considers how interactions between the stages can be exploited to reduce the error rate. For example the results of coreference analysis or .

Hữu Tài 75 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Simplified tcp based communication approach towards domain name system for improving security

11 113 0

Báo cáo khoa học: "How do you pronounce your name? Improving G2P with transliterations"

10 56 0

Báo cáo khoa học: "Improving Name Tagging by Reference Resolution and Relation Detection"

8 63 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461916 55

Giới thiệu :Lập trình mã nguồn mở

14 22912 64

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10961 531

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10149 450

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9557 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8346 1127

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8270 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7883 2224

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6780 255

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5982 1440

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

TƯƠNG QUAN GIỮA MÔ HỌC, GIẢI PHẪU VÀ HÌNH ẢNH CỦA CÁC KHỐI U PHẦN PHỤ

3 171 0 13-05-2024

Công nghiệp gang thép Việt Nam : Một giai đoạn phát triển và chuyển đổi chính sách mới part 5

6 199 0 13-05-2024

Giáo trình CẤU TRÚC DỮ LIỆU VÀ GIẢI THUẬT - Chương 1

5 139 0 13-05-2024

QUẢN LÝ CHẤT LƯỢNG KHÔNG KHÍ

75 142 0 13-05-2024

Data Structures and Algorithms - Chapter 8: Heaps

41 128 0 13-05-2024

Truyện kiếm hiệp - Duy ngã độc tôn phần 5/7

1 100 0 13-05-2024

Thương hiệu sản phẩm làng nghề: Đã ít, lại thiếu tính cạnh tranh

5 121 0 13-05-2024

Báo cáo khoa học: " Principaux critères économiques de gestion des forêts : analyse critique et comparative"

29 92 0 13-05-2024

Bảng màu theo chữ cái – V

11 103 0 13-05-2024

GYNECOLOGIC CANCERS IN PREGNANCY: GUIDELINES OF AN INTERNATIONAL CONSENSUS MEETING

12 97 0 13-05-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7883 2224

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5982 1440

Ebook Chào con ba mẹ đã sẵn sàng

112 3780 1248

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5385 1137

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8346 1127

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3532 651

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10961 531

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3727 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4146 523

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4173 481