TAILIEUCHUNG - Báo cáo khoa học: "Ambiguity resolution in a reductionistic "

We are concerned with grammar-based surfacesyntactic analysis of running text. Morphological and syntactic analysis is here based on tags that express surface-syntactic relations between functional categories such as Subject, Modifier, Main verb etc.; consider the following simple sentence: | Ambiguity resolution in a reductionistic parser Pasi Tapanainen Atro Voutilainen Research Unit for Computational Linguistics . Box 4 Keskuskatu 8 FIN-00014 University of Helsinki Finland 1 Introduction We are concerned with grammar-based surfacesyntactic analysis of running text. Morphological and syntactic analysis is here based on tags that express surface-syntactic relations between functional categories such as Subject Modifier Main verb etc. consider the following simple sentence I PROS SUBJECT see V PRES MAINVERB a ART OK bird N OBJECT FULLSTOP 2 Description of the parsing system The parsing system consists of the following modules Preprocessor The preprocessor normalises the input text detects sentence boundaries and punctuation marks and identifies idioms and other fixed syntagms. Morphological analyser The ENGTWOL morphological analyser is a 55 000 entry Koskenniemi-style morphological description of English that assigns all recognised input word forms with all possible morphological readings as a disjunctive list. Those words not recognised by the ENGTWOL analyser are analysed by a heuristic module part-of-speech readings are assigned on the basis of the form of the word endings etc. . The morphologically analysed sentences are enriched with syntactic and word boundary ambiguities and converted into regular expressions by simple awk programs. Finite-State parser The Finite-State parser transforms sentences and rules into finite-state automata. The parser computes the intersection of the sentence automaton and all rule automata the intersection is the parse of the sentence. The grammar also contains a heuristic section that can be used to rank multiple analyses. The lexicon is adopted from the ENGCG parser that has been supported by TEKES the Finnish Technological Development Center and the work on Finite-state syntax has been partly supported by the Academy of Finland. 3 Sample analysis The sentence Its leadership was insulted by editors .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.