TAILIEUCHUNG - Báo cáo khoa học: "ON THE LINGUISTIC CHARACTER OF NON-STANDARD INPUT"

If natural language understanding systems are ever to cope with the full range of English language forms, their designers will have to incorporate a number of features of the spoken vernacular language. This communication discusses such features as non-standard grammatical rules, hesitations and false starts due to self-correction, systematic errors due to mismatches between the grammar and sentence generator, and uncorrected true errors. | ON THE LINGUISTIC CHARACTER OF NON-STANDARD INPUT Anthony s. Kroch and Donald Hindle Department of Linguistics University of Pennsylvania Philadelphia PA 19104 USA ABSTRACT If natural language understanding systems are ever to cope with the full range of English language forms their designers will have to Incorporate a number of features of the spoken vernacular language. This communication discusses such features as non-standard grammatical rules hesitations and false starts due to self-correction systematic errors due to mismatches between the grammar and sentence generator and uncorrected true errors. There are many ways In which the Input to a natural language system can be non-standard without being unlnterpretable . Most obviously such input can be the well-formed output of a grammar other than the standard language grammar with which the interpreter is likely to be equipped. This difference of grammar is presumably what we notice in language that we call non-standard in everyday life. Obviously at least from the perspective of a linguist it Is wrong to think of this difference as being due to errors made by the non-standard language user It is simply a dialect difference. Secondly the non-standard Input can contain hesitations and self-corrections which make the string unlnterpretable unless some parts of It are edited out. This is the normal state of affairs In spoken language so that any system designed to understand spoken communication even at a rudimentary level must be able to edit Its Input as well as Interpret it. Thirdly the input may be ungrammatical even by the rules of the grammar of the speaker but be the expected output of the speaker s sentence generating device. This case has not been much discussed but it is Important because In certain environments speakers and to some extent unskilled writers regularly produce ungrammmatlcal output in preference to grammatically unimpeachable alternatives. Finally the input that the system receives may .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.