TAILIEUCHUNG - EURASIP Journal on Applied Signal Processing 2003:8, 814–823 c 2003 Hindawi Publishing

EURASIP Journal on Applied Signal Processing 2003:8, 814–823 c 2003 Hindawi Publishing Corporation On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions Sid-Ahmed Selouani Secteur Gestion de l’Information, Universit´ de Moncton, Campus de Shippagan, 218 boulevard , e Shippagan, Nouveau-Brunswick, Canada E8S 1P6 Email: selouani@ Douglas O’Shaughnessy INRS-Energie-Mat´riaux-T´l´communications, Universit´ du Qu´bec, 800 de la Gaucheti`re Ouest, e ee e e e place Bonaventure, Montr´al, Canada H5A 1K6 e Email: dougo@ Received 14 June 2002 and in revised form 6 December 2002 Limiting the decrease in performance due to acoustic environment changes remains a major challenge for. | EURASIP Journal on Applied Signal Processing 2003 8 814-823 2003 Hindawi Publishing Corporation On the Use of Evolutionary Algorithms to Improve the Robustness of Continuous Speech Recognition Systems in Adverse Conditions Sid-Ahmed Selouani Secteur Gestion de rinformation Universite de Moncton Campus de Shippagan 218 boulevard Shippagan Nouveau-Brunswick Canada E8S 1P6 Email selouani@ Douglas O Shaughnessy INRS-Energìe-Matérìaux-Télécommunìcatìons Universite du Quebec 800 de la Gauchetiere Quest place Bonaventure Montreal Canada H5A 1K6 Email dougo@ Received 14 June 2002 and in revised form 6 December 2002 Limiting the decrease in performance due to acoustic environment changes remains a major challenge for continuous speech recognition CSR systems. We propose a novel approach which combines the Karhunen-Loeve transform KLT in the mel-frequency domain with a genetic algorithm GA to enhance the data representing corrupted speech. The idea consists of projecting noisy speech parameters onto the space generated by the genetically optimized principal axis issued from the KLT. The enhanced parameters increase the recognition rate for highly interfering noise environments. The proposed hybrid technique when included in the front-end of an HTK-based CSR system outperforms that of the conventional recognition process in severe interfering car noise environments for a wide range of signal-to-noise ratios SNRs varying from 16 dB to -4 dB. We also showed the effectiveness of the KLT-GA method in recognizing speech subject to telephone channel degradations. Keywords and phrases speech recognition genetic algorithms Karhunen-Loeve transform hidden Markov models robustness. 1. INTRODUCTION Continuous speech recognition CSR systems remain faced with the serious problem of acoustic condition changes. Their performance often degrades due to unknown adverse conditions . due to room acoustics ambient noise speaker variability sensor .

TÀI LIỆU LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.