Scientific report: "Multilingual Access to Large Spoken Archives"

Douglas W. Oard
College of Information Studies and Institute for Advanced Computer Studies
University of Maryland, College Park, MD, USA

Abstract

Spoken word collections promise access to unique and compelling content, and most of the technology needed to realize that promise is now in place. Decreasing storage costs, increasing network capacity, and the availability of software to encode and exchange digital audio make possible physical access to spoken word collections at a previously unimaginable scale. Effective support for intellectual access (the problem of finding what you are looking for) is much more challenging, however. In this talk I will briefly describe work that has been done on this problem at the Text Retrieval Conferences, the Topic Detection and Tracking evaluations, and in individual research projects around the world. I will then describe a unique resource: a collection of 116,000 hours of oral history interviews, recorded in 32 languages in 57 countries, that has been assembled by the Survivors of the Shoah Visual History Foundation. Nearly 10,000 hours of this audio has been manually segmented, summarized, and indexed, making this an unrivaled resource with which we can explore a broad array of data-driven techniques. My main focus will be to explain how we are leveraging this exceptional resource to develop the ability to index similar materials automatically.
The project, which we call MALACH (Multilingual Access to Large spoken ArCHives), builds on a long heritage of increasingly demanding applications for speech recognition technology. The accented, emotional, and elderly speech in the Shoah Foundation's collection is so challenging that state-of-the-art systems initially yielded a 90% word error rate. We now have speech recognition systems that achieve better than half that error rate for two languages, English and Czech. That's nowhere near good enough to produce readable transcripts, but it is approaching a point where
