
Scruffy Text Understanding: Design and Implementation of Tolerant Understanders

Richard H. Granger
Artificial Intelligence Project
Computer Science Department
University of California
Irvine, California 92717

ABSTRACT

Most large text-understanding systems have been designed under the assumption that the input text will be in reasonably "neat" form (e.g., newspaper stories and other edited texts). However, a great deal of natural language text (e.g., memos, rough drafts, conversation transcripts) has features that differ significantly from "neat" texts and pose special problems for readers: misspelled words, missing words, poor syntactic construction, missing periods, etc. Our solution to these problems is to make use of expectations, based both on knowledge of surface English and on world knowledge of the situation being described. These syntactic and semantic expectations can be used to figure out unknown words from context, constrain the possible word-senses of words with multiple meanings (ambiguity), fill in missing words (ellipsis), and resolve referents (anaphora). This method of using expectations to aid the understanding of scruffy texts has been incorporated into a working computer program called NOMAD, which understands scruffy texts in the domain of Navy messages.
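To make the idea of figuring out unknown words from expectations concrete, here is a minimal Python sketch. It is not NOMAD's actual algorithm, which this excerpt does not describe. It assumes that the context has already produced a ranked list of expected words (the hypothetical `EXPECTED` list below), and it treats an unknown token as a possible abbreviation of an expected word when the two share a first letter and the token's letters occur in the same order inside the word. The function names `is_subsequence`, `candidates`, and `resolve` are illustrative only.

```python
def is_subsequence(short, word):
    """Return True if the letters of `short` appear in `word` in the same order."""
    remaining = iter(word)
    # `ch in remaining` advances the iterator until it finds `ch`,
    # so later letters can only match further to the right.
    return all(ch in remaining for ch in short)


def candidates(token, expected_words):
    """Expected words that `token` could abbreviate, in expectation order."""
    token = token.lower()
    if not token:
        return []
    return [
        word
        for word in expected_words
        if word and word[0] == token[0] and is_subsequence(token, word)
    ]


def resolve(token, expected_words):
    """Return the token if it is already expected, else the best candidate, else None."""
    lowered = token.lower()
    if lowered in expected_words:
        return lowered
    matches = candidates(lowered, expected_words)
    return matches[0] if matches else None


# Hypothetical expectations, ordered from strongest to weakest (an assumption
# made for this sketch; a real system would derive them from context).
EXPECTED = ["with", "chairman", "changes", "proposal", "next", "meeting"]

if __name__ == "__main__":
    for tok in ["w", "chrmn", "changes", "prposl", "nxt", "mtg"]:
        print(tok, "->", resolve(tok, EXPECTED))
```

The sketch covers only the unknown-word case. Ambiguity, ellipsis, and anaphora would need expectations about word senses, sentence structure, and the situation being described, none of which a flat word list can express.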
Introduction

Consider the following scribbled message, left by a computer science professor on a colleague's desk:

(1) Met w chrmn agreed on changes to prposl nxt mtg 3 Feb.

A good deal of informal text, such as everyday messages like the one above, is very ill-formed grammatically: it contains misspellings and ad hoc abbreviations, and it lacks important punctuation such as periods between sentences. Yet people seem to understand such messages easily, and in fact most people would probably understand the above message just as readily as they would a more well-formed version:

"I met with the chairman and we agreed on what changes had to be made to the proposal. Our next meeting will be on Feb. 3."

No extra information ...
