TAILIEUCHUNG - Báo cáo: An efficient voice activity detection algorithm by combining statistical model and energy detection

Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: An efficient voice activity detection algorithm by combining statistical model and energy detection | Wu and Zhang EURASIP Journal on Advances in Signal Processing 2011 2011 18 http content 2011 1 18 o EURASIP Journal on Advances in Signal Processing a SpringerOpen Journal RESEARCH Open Access An efficient voice activity detection algorithm by combining statistical model and energy detection Ji Wu and Xiao-Lei Zhang Abstract In this article we present a new voice activity detection VAD algorithm that is based on statistical models and empirical rule-based energy detection algorithm. Specifically it needs two steps to separate speech segments from background noise. For the first step the VAD detects possible speech endpoints efficiently using the empirical rulebased energy detection algorithm. However the possible endpoints are not accurate enough when the signal-to-noise ratio is low. Therefore for the second step we propose a new gaussian mixture model-based multipleobservation log likelihood ratio algorithm to align the endpoints to their optimal positions. Several experiments are conducted to evaluate the proposed VAD on both accuracy and efficiency. The results show that it could achieve better performance than the six referenced VADs in various noise scenarios. Keywords energy detection gaussian mixture model GMM multiple-observation voice activity detection VAD 1 Introduction Voice activity detector VAD segregates speeches from background noise. It finds diverse applications in many modern speech communication systems such as speech recognition speech coding noisy speech enhancement mobile telephony and very small aperture terminals. During the past few decades researchers have tried many approaches to improve the VAD performance. Traditional approaches include energy in time domain 1 2 pitch detection 3 and zero-crossing rate 2 4 . Recently several spectral energy-based new features were proposed including energy-entropy feature 5 spacial signal correlation 6 cepstral feature 7 higher-order statistics 8 9 teager energy 10 spectral .

TÀI LIỆU LIÊN QUAN
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.