TAILIEUCHUNG - Báo cáo khoa học: "Identifying Broken Plurals, Irregular Gender, and Rationality in Arabic Text"

Arabic morphology is complex, partly because of its richness, and partly because of common irregular word forms, such as broken plurals (which resemble singular nouns), and nouns with irregular gender (feminine nouns that look masculine and vice versa). In addition, Arabic morphosyntactic agreement interacts with the lexical semantic feature of rationality, which has no morphological realization. In this paper, we present a series of experiments on the automatic prediction of the latent linguistic features of functional gender and number, and rationality in Arabic. We compare two techniques, using simple maximum likelihood (MLE) with back-off and a support vector machine. | Identifying Broken Plurals Irregular Gender and Rationality in Arabic Text Sarah Alkuhlani and Nizar Habash Center for Computational Learning Systems Columbia University sma2149 nh2142 @ Abstract Arabic morphology is complex partly because of its richness and partly because of common irregular word forms such as broken plurals which resemble singular nouns and nouns with irregular gender feminine nouns that look masculine and vice versa . In addition Arabic morpho-syntactic agreement interacts with the lexical semantic feature of rationality which has no morphological realization. In this paper we present a series of experiments on the automatic prediction of the latent linguistic features of functional gender and number and rationality in Arabic. We compare two techniques using simple maximum likelihood MLE with back-off and a support vector machine based sequence tagger Yamcha . We study a number of orthographic morphological and syntactic learning features. Our results show that the MLE technique is preferred for words seen in the training data while the Yam-cha technique is optimal for unseen words which are our real target. Furthermore we show that for unseen words morphological features help beyond orthographic features and that syntactic features help even more. A combination of the two techniques improves overall performance even further. 1 Introduction Arabic morphology is complex partly because of its richness and partly because of its complex morpho-syntactic agreement rules which depend on functional features not necessarily expressed in word forms. Particularly challenging are broken plurals which resemble singular nouns nouns with irregular gender masculine nouns that look feminine and feminine nouns that look masculine and the semantic feature of rationality which has no morphological realization Smrz 2007b Alkuhlani and Habash 2011 . These features heavily participate in Arabic morpho-syntactic agreement. Alkuhlani and Habash 2011 show .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.