TAILIEUCHUNG - Báo cáo khoa học: "A Unified Single Scan Algorithm for Japanese Base Phrase Chunking and Dependency Parsing"

We describe an algorithm for Japanese analysis that does both base phrase chunking and dependency parsing simultaneously in linear-time with a single scan of a sentence. In this paper, we show a pseudo code of the algorithm and evaluate its performance empirically on the Kyoto University Corpus. Experimental results show that the proposed algorithm with the voted perceptron yields reasonably good accuracy. | A Unified Single Scan Algorithm for Japanese Base Phrase Chunking and Dependency Parsing Manabu Sassano Yahoo Japan Corporation Midtown Tower 9-7-1 Akasaka Minato-ku Tokyo 107-6211 Japan msassano@ Sadao Kurohashi Graduate School of Informatics Kyoto University Yoshida-honmachi Sakyo-ku Kyoto 606-8501 Japan kuro@ Abstract We describe an algorithm for Japanese analysis that does both base phrase chunking and dependency parsing simultaneously in linear-time with a single scan of a sentence. In this paper we show a pseudo code of the algorithm and evaluate its performance empirically on the Kyoto University Corpus. Experimental results show that the proposed algorithm with the voted perceptron yields reasonably good accuracy. 1 Introduction Single scan algorithms of parsing are important for interactive applications of NLP. For instance such algorithms would be more suitable for robots accepting speech inputs or chatbots handling natural language inputs which should respond quickly in some situations even when human inputs are not clearly ended. Japanese sentence analysis typically consists of three major steps namely morphological analysis bunsetsu base phrase chunking and dependency parsing. In this paper we describe a novel algorithm that combines the last two steps into a single scan process. The algorithm which is an extension of Sassano s 2004 allows us to chunk morphemes into base phrases and decide dependency relations of the phrases in a strict left-to-right manner. We show a pseudo code of the algorithm and evaluate its performance empirically with the voted perceptron on the Kyoto University Corpus Kurohashi and Nagao 1998 . 2 Japanese Sentence Structure In Japanese NLP it is often assumed that the structure of a sentence is given by dependency relations Meg-ga kare-ni ano pen-wo age-ta. Meg-subj to him that pen-acc give-past. ID 0 12 3 4 Head 4 4 3 4 - Figure 1 Sample sentence bunsetsu-based among bunsetsus. A bunsetsu is a base

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.