TAILIEUCHUNG - Báo cáo khoa học: "A SIMPLE BUT USEFUL APPROACH TO CONJUNCT IDENTIFICATION"

This paper presents an approach to identifying conjuncts of coordinate conjunctions appearing in text which has been labelled with syntactic and semantic tags. The overall project of which this research is a part is also briefly discussed. The program was tested on a 10,000 word chapter of the Merck Veterinary Manual. The algorithm is deterministic and domain independent and it performs relatively well on a large real-life domain. Constructs not handled by the simple algorithm are also described in some detail. . | A SIMPLE BUT USEFUL APPROACH TO CONJUNCT IDENTIFICATION1 Rajeev Agarwal Lois Boggess Department of Computer Science Mississippi State University Mississippi State MS 39762 e-mail kudzu@ ABSTRACT This paper presents an approach to identifying conjuncts of coordinate conjunctions appearing in text which has been labelled with syntactic and semantic tags. The overall project of which this research is a part is also briefly discussed. The program was tested on a 10 000 word chapter of the Merck Veterinary Manual. The algorithm is deterministic and domain independent and it performs relatively well on a large real-life domain. Constructs not handled by the simple algorithm are also described in some detail. INTRODUCTION Identification of the appropriate conjuncts of the coordinate conjunctions in a sentence is fundamental to the understanding of the sentence. We use the phrase conjunct identification to refer to the process of identifying the components words phrases clauses in a sentence that are conjoined by the coordinate conjunctions in it. Consider the following sentence The president sent a memo to the managers to inform them of the tragic incident and to request their cooperation. In this sentence the coordinate conjunction and conjoins the infinitive phrases to inform them of the tragic incident and to request their cooperation . If a natural language understanding system fails to recognize the correct conjuncts it is likely to misinterpret the sentence or to lose its meaning entirely. The above is an example of a simple sentence where such conjunct identification is easy. In a realistic domain one encounters sentences which are longer and far more complex. 1 This work is supported in part by the National Science Foundation under grant number IRI-9002135. This paper presents an approach to conjunct identification which while not perfect gives reasonably good results with a relatively simple algorithm. Il is deterministic and domain independent in .

Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.