Scientific report: "A Ranking Approach to Stress Prediction for Letter-to-Phoneme Conversion"

Qing Dou, Shane Bergsma, Sittichai Jiampojamarn and Grzegorz Kondrak
Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2E8, Canada
qdou, bergsma, sj, kondrak @

Abstract

Correct stress placement is important in text-to-speech systems, in terms of both the overall accuracy and the naturalness of pronunciation. In this paper, we formulate stress assignment as a sequence prediction problem. We represent words as sequences of substrings, and use the substrings as features in a Support Vector Machine (SVM) ranker, which is trained to rank possible stress patterns. The ranking approach facilitates inclusion of arbitrary features over both the input sequence and output stress pattern. Our system advances the current state of the art, predicting primary stress in English, German, and Dutch with up to 98% word accuracy on phonemes and 96% on letters. The system is also highly accurate in predicting secondary stress. Finally, when applied in tandem with an L2P system, it substantially reduces the word error rate when predicting both phonemes and stress.

1 Introduction

In many languages, certain syllables in words are phonetically more prominent in terms of duration, pitch, and loudness. This phenomenon is referred to as lexical stress. In some languages, the location of stress is entirely predictable. For example, lexical stress regularly falls on the initial syllable in Hungarian and on the penultimate syllable in Polish. In other languages, such as English and Russian, any syllable in the word can be stressed.

Correct stress placement is important in text-to-speech systems because it affects the accuracy of human word recognition (Tagliapietra and Tabossi, 2005; Arciuli and Cupples, 2006). However, the issue has often been ignored in previous letter-to-phoneme (L2P) systems. The systems that do generate stress markers often do not report separate figures on stress prediction accuracy, or they only ...
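
The ranking formulation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes words arrive already split into substrings, considers only candidate patterns with exactly one primary stress, and swaps in a simple perceptron-style ranking update in place of the SVM ranker. All function names, the feature templates, and the toy segmentations are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple

Pattern = Tuple[int, ...]  # 1 = primary stress, 0 = unstressed (assumed encoding)


def candidate_patterns(n: int) -> List[Pattern]:
    """All patterns that place exactly one primary stress over n substrings."""
    return [tuple(1 if i == j else 0 for i in range(n)) for j in range(n)]


def features(substrings: List[str], pattern: Pattern) -> Counter:
    """Joint features over the input substrings and the output stress pattern."""
    feats: Counter = Counter()
    for sub, stress in zip(substrings, pattern):
        feats[f"sub={sub}|stress={stress}"] += 1
    # A feature on the whole output pattern, independent of the input.
    feats["pattern=" + "".join(map(str, pattern))] += 1
    return feats


def score(weights: Dict[str, float], feats: Counter) -> float:
    """Linear score: dot product of the weight vector and the feature counts."""
    return sum(weights.get(name, 0.0) * value for name, value in feats.items())


def predict(weights: Dict[str, float], substrings: List[str]) -> Pattern:
    """Return the highest-scoring candidate stress pattern."""
    return max(
        candidate_patterns(len(substrings)),
        key=lambda p: score(weights, features(substrings, p)),
    )


def train(data: List[Tuple[List[str], Pattern]], epochs: int = 5) -> Dict[str, float]:
    """Perceptron-style ranking update (a stand-in for the SVM ranker)."""
    weights: Dict[str, float] = {}
    for _ in range(epochs):
        for substrings, gold in data:
            guess = predict(weights, substrings)
            if guess != gold:
                # Move weights toward the gold pattern, away from the wrong guess.
                for name, value in features(substrings, gold).items():
                    weights[name] = weights.get(name, 0.0) + value
                for name, value in features(substrings, guess).items():
                    weights[name] = weights.get(name, 0.0) - value
    return weights


# Toy training data with hand-made segmentations (assumptions, not from the paper).
toy_data = [
    (["pre", "dic", "tion"], (0, 1, 0)),
    (["gui", "tar"], (0, 1)),
]
weights = train(toy_data)
print(predict(weights, ["pre", "dic", "tion"]))
```

Because the score is computed over features of the input and output jointly, new feature templates (for example, substring context or whole-pattern shape) can be added in `features` without changing the ranking logic, which is the flexibility the abstract attributes to the ranking approach.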
