Scientific report: "A STOCHASTIC APPROACH TO SENTENCE PARSING"

Tetsunosuke Fujisaki
Science Institute, IBM Japan, Ltd.
No. 36 Kowa Building, 5-19 Sanbancho, Chiyoda-ku, Tokyo 102, Japan

ABSTRACT

A description will be given of a procedure to assign the most likely probabilities to each of the rules of a given context-free grammar. The grammar developed by S. Kuno at Harvard University was picked as the basis and was successfully augmented with rule probabilities. A brief exposition of the method will be given, with some preliminary results from its use as a device for disambiguating the parsing of English texts picked from a natural corpus.

I. INTRODUCTION

To prepare a grammar which can parse arbitrary sentences taken from a natural corpus is a difficult task. One of the most serious problems is the potentially unbounded number of ambiguities. Pure syntactic analysis with an imprudent grammar will sometimes result in hundreds of parses.

With prepositional phrase attachments and conjunctions, for example, it is known that the actual growth of ambiguities can be approximated by a Catalan number (Knuth), the number of ways to insert parentheses into a formula of N terms: 1, 2, 5, 14, 42, 132, 429, 1430, 4862, ... for N = 2, 3, 4, and so on. The five ambiguities in the following sentence, which has three ambiguous constructions, are well explained by this number:

    I saw a man in a park with a scope.

The Catalan number grows essentially exponentially with N, and Martin reported a syntactically ambiguous sentence with 455 parses:

    List the sales of products produced in 1973 with the products produced in 1972.
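The Catalan growth quoted in the introduction can be checked with a short sketch. The function names `catalan` and `count_bracketings` are illustrative and do not come from the paper. The code counts the ways to fully parenthesize a formula of N terms by dynamic programming and compares the result with the closed-form Catalan number. Treating the example sentence as four attachable units ("saw", "a man", "in a park", "with a scope") is an assumption made here to match the five readings mentioned in the text.

```python
from math import comb


def catalan(n: int) -> int:
    """Closed form for the n-th Catalan number: C(2n, n) / (n + 1)."""
    return comb(2 * n, n) // (n + 1)


def count_bracketings(terms: int) -> int:
    """Count the ways to fully parenthesize a formula of `terms` terms.

    A formula of k terms splits into a left part of i terms and a right
    part of k - i terms, so ways[k] is the sum over i of
    ways[i] * ways[k - i].
    """
    if terms < 1:
        raise ValueError("terms must be at least 1")
    ways = [0] * (terms + 1)
    ways[1] = 1  # a single term has exactly one (trivial) bracketing
    for size in range(2, terms + 1):
        ways[size] = sum(ways[left] * ways[size - left] for left in range(1, size))
    return ways[terms]


# Bracketing counts for N = 2 .. 10 terms.
sequence = [count_bracketings(n) for n in range(2, 11)]
print(sequence)

# Assumed reading of the example sentence as four attachable units.
print(count_bracketings(4))
```

A formula of N terms has `catalan(N - 1)` bracketings, so the two functions should agree for every N. The dynamic-programming version is included because it mirrors how ambiguity compounds as each further attachable phrase is added.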
On the other hand, throughout the long history of natural language understanding work, semantic and pragmatic constraints have been known to be indispensable, and it has been recommended that they be represented in some formal way and referred to during or after the syntactic analysis process. However, representing semantic and pragmatic constraints, which are usually domain sensitive, in a well-formed way is a very difficult and expensive task. A lot of effort in that direction has been
