TAILIEUCHUNG - Báo cáo khoa học: "PROJECT APRIL -- A PROGRESS REPORT"

Parsing techniques based on rules defining grammaticality are difficult to use with authentic inputs, which are often grammatically messy. Instead, the APRIL system seeks a labelled tree su~cture which maximizes a numerical measure of conformity to statistical norms derived flom a sample of parsed text. No distinction between legal and illegal trees arises: any labelled tree has a value. | PROJECT APRIL A PROGRESS REPORT Robin Haigh Geoffrey Sampson. Eric Atwell Centre for Computer Analysis of Language and Speech University of Leeds Leeds LS2 9JT UK ABSTRACT Parsing techniques based on rules defining grammaticality are difficult to use with authentic inputs which are often grammatically messy. Instead the APRIL system seeks a labelled tree structure which maximizes a numerical measure of conformity to statistical norms derived from a sample of parsed text No distinction between legal and illegal trees arises any labelled tree has a value. Because the search space is large and has an irregular geometry APRIL seeks the best tree using simulated annealing a stochastic optimization technique. Beginning with an arbitrary tree many randomly-generated local modifications are considered and adopted or rejected according to their effect on tree-value acceptance decisions are made probabilistically subject to a bias against adverse moves which is very weak at the outset but is made to increase as the random walk through the search space continues. This enables the system to converge on the global optimum without getting trapped in local optima. Performance of an early version of the APRIL system on authentic inputs is yielding analyses with a mean accuracy of using a schedule which increases processing linearly with sentence-length modifications currently being implemented should eliminate a high proportion of the remaining errors. INTRODUCTION Project APRIL Annealing Parser for Realistic Input Language is constructing a software system that uses the stochastic optimization technique known as simulated annealing Kirkpatrick et al. 1983 van Laarhoven Aarts 1987 to parse authentic English inputs by seeking labelled tree-structures that maximize a measure of plausibility defined in terms of empirical statistics on parse-tree configurations drawn from a database of manually parsed English text This approach is a response to the fact that real-life English .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.