Coping With Derivation in a Morphological Component

Harald Trost
Austrian Research Institute for Artificial Intelligence
Schottengasse 3, A-1010 Wien, Austria
email: harald@ai.univie.ac.at

Abstract

In this paper a morphological component with a limited capability to automatically interpret (and generate) derived words is presented. The system combines an extended two-level morphology [Trost, 1991a; Trost, 1991b] with a feature-based word grammar building on a hierarchical lexicon. Polymorphemic stems not explicitly stored in the lexicon are given a compositional interpretation. In this way the system makes it possible to minimize redundancy in the lexicon, because derived words that are transparent need not be stored explicitly. Words formed ad hoc can also be recognized correctly. The system is implemented in CommonLisp and has been tested on examples from German derivation.

Work on this project was partially sponsored by the Austrian Federal Ministry for Science and Research and the Fonds zur Förderung der wissenschaftlichen Forschung, grant no. P7986-PHY. I would also like to thank John Nerbonne, Klaus Netter and Wolfgang Heinz for comments on earlier versions of this paper.

1 Introduction

This paper is about words. Since "word" is a rather fuzzy term, we will first try to make clear what it means in the context of this paper. Following di Sciullo and Williams [1989], we distinguish two senses. One is the morphological word, which is built from morphs according to the rules of morphology. The other is the syntactic word, which is the atomic entity from which sentences are built according to the rules of syntax.

These two views support two different sets of information, which are to be kept separate but which are not disjoint. The syntactic word carries information about category, valency and semantics, information that is important for the interpretation of a word in the context of the sentence. It also carries information like case, number, gender and person. The former information is .