Scientific report: "COMPACT REPRESENTATIONS BY FINITE-STATE TRANSDUCERS"

Mehryar Mohri
Institut Gaspard Monge-LADL
Université Marne-la-Vallée
2, rue de la Butte verte
93160 Noisy-le-Grand, FRANCE
Internet: mohri@

Abstract

Finite-state transducers give efficient representations of many Natural Language phenomena. They make it possible to account for complex lexicon restrictions without resorting to a large set of complex rules that are difficult to analyze. We show here that these representations can be made very compact, indicate how to perform the corresponding minimization, and point out interesting linguistic side-effects of this operation.

1. MOTIVATION

Finite-state transducers constitute appropriate representations of Natural Language phenomena. Indeed, they have been shown to be sufficient tools to describe the morphological and phonetic forms of a language (Karttunen et al. 1992; Kay and Kaplan 1994). Transducers can then be viewed as functions which map lexical representations to surface forms, or inflected forms to their phonetic pronunciations, and vice versa. They make it possible to avoid a large set of complex rules that are often difficult to check, handle, or even understand.

Finite-state automata and transducers can also be used to represent the syntactic constraints of languages such as English or French (Koskenniemi 1990; Mohri 1993; Pereira 1991; Roche 1993). Syntactic analysis can then be reduced to performing the intersection of two automata, or to the application of a transducer to an automaton.
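To make the reduction of syntactic analysis to automata intersection concrete, here is a minimal sketch of the standard product construction for two deterministic automata. It is an illustration only, not the implementation used in the paper: the function names `intersect` and `accepts`, the tuple encoding of an automaton, and the toy tag alphabet (`Det`, `N`, `V`) are all assumptions introduced for this example.

```python
from collections import deque


def intersect(a, b):
    """Product construction for two deterministic automata.

    Each automaton is a hypothetical triple (start, finals, delta), where
    delta maps (state, symbol) -> state. None is assumed not to be a state.
    """
    start_a, finals_a, delta_a = a
    start_b, finals_b, delta_b = b

    # Index outgoing transitions by source state for quick lookup.
    out_a, out_b = {}, {}
    for (q, sym), r in delta_a.items():
        out_a.setdefault(q, {})[sym] = r
    for (q, sym), r in delta_b.items():
        out_b.setdefault(q, {})[sym] = r

    start = (start_a, start_b)
    finals, delta = set(), {}
    seen = {start}
    queue = deque([start])

    # Breadth-first exploration of the reachable pairs of states only.
    while queue:
        p, q = queue.popleft()
        if p in finals_a and q in finals_b:
            finals.add((p, q))
        for sym, p2 in out_a.get(p, {}).items():
            q2 = out_b.get(q, {}).get(sym)
            if q2 is None:
                continue  # symbol not allowed by the second automaton
            delta[((p, q), sym)] = (p2, q2)
            if (p2, q2) not in seen:
                seen.add((p2, q2))
                queue.append((p2, q2))
    return start, finals, delta


def accepts(automaton, word):
    """Return True if the automaton accepts the sequence of symbols."""
    state, finals, delta = automaton
    for sym in word:
        state = delta.get((state, sym))
        if state is None:
            return False
    return state in finals


# Toy "sentence" automaton: a determiner followed by two ambiguous words,
# each of which may be tagged N or V.
sentence = (0, {3}, {
    (0, "Det"): 1,
    (1, "N"): 2, (1, "V"): 2,
    (2, "N"): 3, (2, "V"): 3,
})

# Toy "constraint" automaton: a Det must be immediately followed by an N.
constraint = (0, {0}, {
    (0, "Det"): 1, (0, "N"): 0, (0, "V"): 0,
    (1, "N"): 0,
})

analysis = intersect(sentence, constraint)
```

In this toy setting, the intersection keeps only the taggings of the sentence that satisfy the constraint: `accepts(analysis, ["Det", "N", "V"])` holds, while `accepts(analysis, ["Det", "V", "N"])` does not. Only reachable state pairs are built, which is one reason the size of the automata involved matters in practice.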
However, whereas first results show that the size of the syntactic transducer exceeds several hundred thousand states, no upper bound has been proposed for it, as the representation of all syntactic entries has not yet been completed. Thus one may ask whether such representations could succeed on a large scale. It is therefore crucial to control or limit the size of these transducers in order to avoid a blow-up. Classic minimization algorithms permit to reduce to the minimal the
