Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Full Text Parsing using Cascades of Rules: an Information Extraction Perspective"

Long Giang 49 8 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

This paper proposes an approach to full parsing suitable for Information Extraction from texts. Sequences of cascades of rules deterministically analyze the text, building unambiguous structures. Initially basic chunks are analyzed; then argumental relations are recognized; finally modifier attachment is performed and the global parse tree is built. The approach was proven to work for three languages and different domains. It was implemented in the IE module of FACILE, a EU project for multilingual text classification and !E. . | Proceedings of EACL 99 Full Text Parsing using Cascades of Rules an Information Extraction Perspective Fabio Ciravegna and Alberto Lavelli ITC-irst Centro per la Ricerca Scientifica e Tecnologica via Sommarive 18 38050 Povo TN Italy cirave lavelli @irst.itc.it Abstract This paper proposes an approach to full parsing suitable for Information Extraction from texts. Sequences of cascades of rules deterministically analyze the text building unambiguous structures. Initially basic chunks are analyzed then ar-gumental relations are recognized finally modifier attachment is performed and the global parse tree is built. The approach was proven to work for three languages and different domains. It was implemented in the IE module of FACILE a EU project for multilingual text classification and IE. 1 Introduction Most successful approaches in IE Appelt et al. 1993 Grishman 1995 Aone et al. 1998 make a very poor use of syntactic information. They are generally based on shallow parsing for the analysis of non recursive NPs and Verbal Groups VGs . After such step regular patterns are applied in order to trigger primitive actions that fill template s meta-rules are applied to patterns to cope with different syntactic clausal forms e.g. passive forms . If we consider the most complex MUC-7 task i.e. the Scenario Template task MUC7 1998 the current technology is not able to provide results near an operational level expected F l 75 the best system scored about 50 Aone et al. 1998 . One of the limitations of the current technology is the inability to extract and to represent syntactic relations among elements in the sentence i.e. grammatical functions and thematic roles. Scenario Template recognition needs the correct treatment of syntactic relations at both sentence and text level Aone et al. 1998 . Full parsing systems are generally able to correctly model syntactic relations but they tend to be slow because of huge search spaces and brittle because of gaps in the grammar . The use

TÀI LIỆU LIÊN QUAN

Báo cáo hóa học: " Full-Rate Full-Diversity Linear Quasi-Orthogonal Space-Time Codes for Any Number of Transmit Antennas"

Báo cáo khoa học: "Web-Scale Features for Full-Scale Parsing"

Báo cáo khoa học: "Unsupervised Lexicon-Based Resolution of Unknown Words for Full Morphological Analysis"

Báo cáo khoa học: "Speeding Up Full Syntactic Parsing by Leveraging Partial Parsing Decisions"

Báo cáo nghiên cứu khoa học: "NGHIÊN CỨU TẬN DỤNG NHIỆT TỪ KHÍ XẢ ĐỘNG CƠ REASEACH TO TAKE FULL ADVANTAGE OF EXHAUST GAS FROM ENGINES"

báo cáo khoa học: " A full-length enriched cDNA library and expressed sequence tag analysis of the parasitic weed, Striga hermonthica"

Báo cáo khoa học: " Near full-length genome analysis of low prevalent human immunodeficiency virus type 1 subclade F1 in São Paulo, Brazil"

báo cáo khoa học: " An analysis of expressed sequence tags of developing castor endosperm using a full-length cDNA library"

báo cáo khoa học: " Sequencing analysis of 20,000 full-length cDNA clones from cassava reveals lineage specific expansions in gene families related to stress response"

báo cáo khoa học: "Full recovery of a 13-year-old boy with pediatric Ramsay Hunt syndrome using a shorter course of aciclovir and steroid at lower doses: a case report"

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.