TAILIEUCHUNG - Báo cáo khoa học: "Dialect Classification for online podcasts fusing Acoustic and Language based Structural and Semantic Information"

The variation in speech due to dialect is a factor which significantly impacts speech system performance. In this study, we investigate effective methods of combining acoustic and language information to take advantage of (i) speaker based acoustic traits as well as (ii) content based word selection across the text sequence. For acoustics, a GMM based system is employed and for text based dialect classification, we proposed n-gram language models combined with Latent Semantic Analysis (LSA) based dialect classifiers. . | Dialect Classification for online podcasts fusing Acoustic and Language based Structural and Semantic Information Rahul Chitturi John. . Hansen1 Center for Robust Speech Systems CRSS Erik Jonsson School of Engineering and Computer Science University of Texas at Dallas Richardson Texas 75080 Abstract The variation in speech due to dialect is a factor which significantly impacts speech system performance. In this study we investigate effective methods of combining acoustic and language information to take advantage of i speaker based acoustic traits as well as ii content based word selection across the text sequence. For acoustics a GMM based system is employed and for text based dialect classification we proposed n-gram language models combined with Latent Semantic Analysis LSA based dialect classifiers. The performance of the individual classifiers is established for the three dialect family case DC rates vary from . The final combined system achieved a DC accuracy of and significantly outperforms the baseline acoustic classifier with a relative improvement of 30 confirming that an integrated dialect classification system is effective for American British and Australian dialects. 1 Introduction Automatic Dialect Classification has recently gained substantial interest in the speech processing community Gray and Hansen 2005 Hansen et al. 2004 NIST LRE 2005 . Dialect classification systems have been employed to improve the performance for Automatic Speech Recognition ASR by employing dialect dependent acoustic and language models Di-akoloukas et al. 1997 and for Rich Indexing of Spoken Document Retrieval Systems Gray and Hansen 2005 . Huang and Hansen 2005 2006 focused on identifying pronunciation differences for dialect classification. In this study unsupervised MFCC based GMM classifiers are employed for pronunciation modeling. However English dialects differ in many ways other than pronunciation .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.