TAILIEUCHUNG - Báo cáo khoa học: "CONSTRAINT-BASED EVENT RECOGNITION INFORMATION EXTRACTION"

We present a program for segmenting texts according to the separate events they describe. A modular architecture is described that allows us to examine the contributions made by particular aspects of natural language to event structuring. This is applied in the context of terrorist news articles, and a technique is suggested for evaluating the resulting segmentations. We also examine the usefulness of various heuristics in forming these segmentations. | CONSTRAINT-BASED EVENT RECOGNITION FOR INFORMATION EXTRACTION Jeremy Crowe Department of Artificial Intelligence Edinburgh University Edinburgh EH1 1HN UK j .crowe@ Abstract We present a program for segmenting texts according to the separate events they describe. A modular architecture is described that allows us to examine the contributions made by particular aspects of natural language to event structuring. This is applied in the context of terrorist news articles and a technique is suggested for evaluating the resulting segmentations. We also examine the usefulness of various heuristics in forming these segmentations. Introduction One of the issues to emerge from recent evaluations of information extraction systems Sundheim 1992 is the importance of discourse processing Iwaiiska et al. 1991 and in particular the ability to recognise multiple events in a text. It is this task that we address here. We are developing a program that assigns messagelevel event structures to newswire texts. Although the need to recognise events has been widely acknowledged most approaches to information extraction IE perform this task either as a part of template merging late in the IE process Grishman and Sterling 1993 or in a few cases as an integral part of some deeper reasoning mechanism . Hobbs et al. 1991 . Our approach is based on the assumption that discourse processing should be done early in the information extraction process. This is by no means a new idea. The arguments in favour of an early discourse segmentation are well known - easier coreference of entities a reduced volume of text to be subjected to necessarily deeper analysis and so on. Because of this early position in the IE process an event recognition program is faced with a necessarily shallow textual representation. The purpose of our work is therefore to investigate the quality of text segmentation that is possible given such a surface form. T would like to thank Chris Mellish and the anonymous .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.