Automatic Detection of Grammar Elements that Decrease Readability

Masatoshi Tsuchiya and Satoshi Sato
Department of Intelligence Science and Technology
Graduate School of Informatics, Kyoto University
tsuchiya@pine.kuee.kyoto-u.ac.jp, sato@i.kyoto-u.ac.jp

Abstract

This paper proposes an automatic method of detecting grammar elements that decrease readability in a Japanese sentence. The method consists of two components: (1) the check list of the grammar elements that should be detected; and (2) the detector, which is a program that searches a sentence for those grammar elements. By defining a readability level for every grammar element, we can find which part of the sentence is difficult to read.

1 Introduction

We always prefer readable texts to unreadable ones. Texts that transmit crucial information, such as instructions for strong medicines, must be completely readable. When texts are unreadable, we should rewrite them to improve readability.

In English, measuring readability as reading age is well studied (Johnson, 1978). The reading age is the chronological age of a reader who could just understand the text. The value is usually calculated from the sentence length and the number of syllables. From this value, we can tell whether a text is readable for readers of a specific age; however, we cannot tell which part we should rewrite to improve readability when the text is unreadable.

The goal of our study is to present tools that support the work of rewriting Japanese text to improve its readability.
The first tool helps detect the sentence fragments (words and phrases) that should be rewritten; in other words, it is a checker of hard-to-read words and phrases in a sentence. Such a checker can be realized with two components: the check list and its detector. The check list provides check items and their readability levels. The detector is a program that searches a sentence for the check items. From the detected items and their readability levels, we can identify which part of the sentence is difficult to read.
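To make the two-component design concrete, the following is a minimal sketch of a check list and a detector. It is an illustration under stated assumptions, not the implementation described in this paper: the names `CheckItem`, `Detection`, `CHECK_LIST`, `detect`, and `hard_parts` are hypothetical, the three check items and their readability levels are invented for the example, and plain substring matching stands in for whatever matching procedure the actual detector uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CheckItem:
    """One entry of the check list (hypothetical structure)."""
    surface: str  # surface form of the grammar element
    level: int    # readability level; higher means harder (assumed scale)


@dataclass(frozen=True)
class Detection:
    """A check item found in a sentence, with its character span."""
    start: int
    end: int
    item: CheckItem


# Illustrative check list. The items and levels are assumptions made
# for this example and are not taken from the paper.
CHECK_LIST = [
    CheckItem("にもかかわらず", 3),
    CheckItem("ざるを得ない", 4),
    CheckItem("なければならない", 2),
]


def detect(sentence, check_list=CHECK_LIST):
    """Return every occurrence of every check item, ordered by position.

    Assumption: simple substring search, with no morphological analysis.
    """
    found = []
    for item in check_list:
        pos = sentence.find(item.surface)
        while pos != -1:
            found.append(Detection(pos, pos + len(item.surface), item))
            pos = sentence.find(item.surface, pos + 1)
    return sorted(found, key=lambda d: d.start)


def hard_parts(sentence, max_level, check_list=CHECK_LIST):
    """Keep only detections whose level exceeds what the reader can handle."""
    return [d for d in detect(sentence, check_list) if d.item.level > max_level]


if __name__ == "__main__":
    sentence = "雨にもかかわらず行かざるを得ない"
    for d in hard_parts(sentence, max_level=3):
        print(d.start, d.end, d.item.surface, d.item.level)
```

Keeping the check list as data and the detector as a generic search means the list can be revised, or the levels recalibrated, without touching the program. Substring matching can produce false matches across word boundaries in unsegmented Japanese text, so a practical detector would likely need a more careful matching step.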