TAILIEUCHUNG - Báo cáo khoa học: "Linefeed Insertion into Japanese Spoken Monologue for Captioning"

To support the real-time understanding of spoken monologue such as lectures and commentaries, the development of a captioning system is required. In monologues, since a sentence tends to be long, each sentence is often displayed in multi lines on one screen, it is necessary to insert linefeeds into a text so that the text becomes easy to read. This paper proposes a technique for inserting linefeeds into a Japanese spoken monologue text as an elemental technique to generate the readable captions. . | Linefeed Insertion into Japanese Spoken Monologue for Captioning Tomohiro Ohno Graduate School of International Development Nagoya University Japan ohno@ Masaki Murata Graduate School of Information Science Nagoya University Japan murata@. Shigeki Matsubara Information Technology Center Nagoya University Japan matubara@ Abstract To support the real-time understanding of spoken monologue such as lectures and commentaries the development of a captioning system is required. In monologues since a sentence tends to be long each sentence is often displayed in multi lines on one screen it is necessary to insert linefeeds into a text so that the text becomes easy to read. This paper proposes a technique for inserting linefeeds into a Japanese spoken monologue text as an elemental technique to generate the readable captions. Our method appropriately inserts linefeeds into a sentence by machine learning based on the information such as dependencies clause boundaries pauses and line length. An experiment using Japanese speech data has shown the effectiveness of our technique. 1 Introduction Real-time captioning is a technique for supporting the speech understanding of deaf persons elderly persons or foreigners by displaying transcribed texts of monologue speech such as lectures. In recent years there exist a lot of researches about automatic captioning and the techniques of automatic speech recognition ASR aimed for captioning have been developed Bou-lianne et al. 2006 Holter et al. 2000 Imai et al. 2006 Munteanu et al. 2007 Saraclar et al. 2002 Xue et al. 2006 . However in order to generate captions which is easy to read it is important not only to recognize speech with high recognition rate but also to properly display the transcribed text on a screen Hoogenboom et al. 2008 . Especially in spoken monologue since a sentence tends to be long each sentence is often displayed as a multi-line text on a screen. Therefore proper linefeed .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.