Scientific report: "Smart Paradigms and the Predictability and Complexity of Inflectional Morphology"

Grégoire Détrez and Aarne Ranta
Department of Computer Science and Engineering
Chalmers University of Technology and University of Gothenburg

Abstract

Morphological lexica are often implemented on top of morphological paradigms, corresponding to different ways of building the full inflection table of a word. Computationally precise lexica may use hundreds of paradigms, and it can be hard for a lexicographer to choose among them. To automate this task, this paper introduces the notion of a smart paradigm. It is a metaparadigm, which inspects the base form and tries to infer which low-level paradigm applies. If the result is uncertain, more forms are given for discrimination. The number of forms needed on average is a measure of predictability of an inflection system. The overall complexity of the system also has to take into account the code size of the paradigm definitions themselves. This paper evaluates the smart paradigms implemented in the open-source GF Resource Grammar Library. Predictability and complexity are estimated for four different languages: English, French, Swedish, and Finnish. The main result is that predictability does not decrease when the complexity of morphology grows, which means that smart paradigms provide an efficient tool for the manual construction and/or automatic bootstrapping of lexica.

1 Introduction

Paradigms are a cornerstone of grammars in the European tradition. A classical Latin grammar has five paradigms for nouns ("declensions") and four for verbs ("conjugations"). The modern reference on French verbs, Bescherelle (Bescherelle, 1997), has 88 paradigms for verbs. Swedish grammars traditionally have, like Latin, five paradigms for nouns and four for verbs, but a modern computational account (Hellberg, 1978), aiming for more precision, has 235 paradigms for Swedish. Mathematically, a paradigm is a function that produces inflection tables. Its argument is a word string, either a ...
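
As a rough illustration of the two ideas above (a paradigm as a function from a word string to an inflection table, and a smart paradigm that inspects the base form to choose among low-level paradigms), here is a minimal Python sketch. It is not the code of the GF Resource Grammar Library. The function names (`regular`, `sibilant`, `consonant_y`, `smart_noun`, `forms_needed`), the two-cell inflection table, the simplified English noun rules, and the toy lexicon are all assumptions made for illustration.

```python
from typing import Dict, List, Optional, Tuple

# An inflection table maps a form label to a word form.
# Assumption: a two-cell table (singular, plural) for English nouns.
InflectionTable = Dict[str, str]


# Low-level paradigms: each is a function from a base form to a full table.
def regular(base: str) -> InflectionTable:
    return {"sg": base, "pl": base + "s"}


def sibilant(base: str) -> InflectionTable:
    return {"sg": base, "pl": base + "es"}


def consonant_y(base: str) -> InflectionTable:
    return {"sg": base, "pl": base[:-1] + "ies"}


def smart_noun(base: str, plural: Optional[str] = None) -> InflectionTable:
    """Hypothetical smart paradigm: guess the low-level paradigm from the
    base form; a second form, if given, settles the uncertain cases."""
    if plural is not None:
        # The extra form discriminates directly, so no guessing is needed.
        return {"sg": base, "pl": plural}
    if base.endswith(("s", "x", "z", "sh", "ch")):
        return sibilant(base)
    if len(base) >= 2 and base.endswith("y") and base[-2] not in "aeiou":
        return consonant_y(base)
    return regular(base)


def forms_needed(base: str, gold_plural: str) -> int:
    """Number of forms a lexicographer must supply to get the right table."""
    return 1 if smart_noun(base)["pl"] == gold_plural else 2


# Toy lexicon (made up for this example): (base form, correct plural).
lexicon: List[Tuple[str, str]] = [
    ("dog", "dogs"),
    ("fly", "flies"),
    ("bus", "buses"),
    ("man", "men"),
]

# Average number of forms needed: the predictability measure on this toy data.
average = sum(forms_needed(b, p) for b, p in lexicon) / len(lexicon)
print(average)  # 1.25: only "man" needs a second form
```

On this toy data, three of the four nouns are predicted from the base form alone, and the irregular one needs its plural spelled out, so the average is 1.25 forms per word. Averages measured on real lexica will of course depend on the language and on how the smart paradigm is written.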
