Scientific report: "Dependency Parsing of Japanese Spoken Monologue Based on Clause Boundaries"

Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka, Takehiko Maruyama and Yasuyoshi Inagaki

Graduate School of Information Science, Nagoya University, Japan
Information Technology Center, Nagoya University, Japan
ATR Spoken Language Communication Research Laboratories, Japan
The National Institute for Japanese Language, Japan
Faculty of Information Science and Technology, Aichi Prefectural University, Japan
ohno@

Abstract

Spoken monologues feature greater sentence length and structural complexity than do spoken dialogues. To achieve high parsing performance for spoken monologues, it could prove effective to simplify the structure by dividing a sentence into suitable language units. This paper proposes a method for dependency parsing of Japanese monologues based on sentence segmentation. In this method, dependency parsing is executed in two stages: at the clause level and at the sentence level. First, the dependencies within a clause are identified by dividing a sentence into clauses and executing stochastic dependency parsing for each clause. Next, the dependencies over clause boundaries are identified stochastically, and the dependency structure of the entire sentence is thus completed. An experiment using a spoken monologue corpus shows this method to be effective for efficient dependency parsing of Japanese monologue sentences.
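The two-stage procedure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`parse_two_stage`, `nearest_head_score`), the greedy head selection, and the toy scoring function are all hypothetical stand-ins for the stochastic model. The sketch also assumes that every unit depends on a later unit, that only the final unit of a clause depends across a clause boundary, and that every clause is non-empty.

```python
from typing import Callable, Dict, List

# score(dependent, head) -> higher means a more plausible dependency.
# Hypothetical interface; a real system would use a learned probability.
Score = Callable[[int, int], float]


def nearest_head_score(dep: int, head: int) -> float:
    """Toy stand-in for a learned model: prefers the closest later unit."""
    return 1.0 / (head - dep)


def parse_two_stage(
    clauses: List[List[int]], score: Score = nearest_head_score
) -> Dict[int, int]:
    """Return {dependent_index: head_index} for one sentence.

    `clauses` lists the unit indices of each clause, in sentence order.
    Assumption: every clause contains at least one unit.
    """
    heads: Dict[int, int] = {}

    # Stage 1 (clause level): every unit except the clause-final one
    # picks its head among the later units of the same clause.
    for clause in clauses:
        for pos, dep in enumerate(clause[:-1]):
            candidates = clause[pos + 1:]
            heads[dep] = max(candidates, key=lambda h: score(dep, h))

    # Stage 2 (sentence level): each clause-final unit, except the last
    # unit of the sentence, picks its head among units of later clauses.
    for ci, clause in enumerate(clauses[:-1]):
        dep = clause[-1]
        candidates = [u for later in clauses[ci + 1:] for u in later]
        heads[dep] = max(candidates, key=lambda h: score(dep, h))

    return heads
```

For a sentence split into two clauses, `parse_two_stage([[0, 1, 2], [3, 4]])` first resolves the heads of units 0, 1 and 3 inside their own clauses, then attaches unit 2 (the final unit of the first clause) to a unit in the second clause. The last unit of the sentence receives no head. Splitting the search this way keeps each stage's candidate set small, which is the simplification the abstract motivates.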
1 Introduction

Recently, monologue data such as lectures and commentaries by professionals have come to be regarded as valuable human intellectual property and have attracted attention. To use these monologue data effectively and efficiently as intellectual property in applications such as automatic summarization and machine translation, it is necessary not only to accumulate the data but also to structure it. However, few attempts have been made to parse spoken monologues. Spontaneously spoken monologues include a lot of grammatically ...
