TAILIEUCHUNG - Báo cáo khoa học: "Discriminative Language Modeling with Conditional Random Fields and the Perceptron Algorithm"

This paper describes discriminative language modeling for a large vocabulary speech recognition task. We contrast two parameter estimation methods: the perceptron algorithm, and a method based on conditional random fields (CRFs). The models are encoded as deterministic weighted finite state automata, and are applied by intersecting the automata with word-lattices that are the output from a baseline recognizer. The perceptron algorithm has the benefit of automatically selecting a relatively small feature set in just a couple of passes over the training data. . | Discriminative Language Modeling with Conditional Random Fields and the Perceptron Algorithm Brian Roark Murat Saraclar Michael Collins Mark Johnson AT T Labs - Research MIT CSAIL Brown University roark murat @ mcollins@ Mark_Johnson@ Abstract This paper describes discriminative language modeling for a large vocabulary speech recognition task. We contrast two parameter estimation methods the perceptron algorithm and a method based on conditional random fields CRFs . The models are encoded as deterministic weighted finite state automata and are applied by intersecting the automata with word-lattices that are the output from a baseline recognizer. The perceptron algorithm has the benefit of automatically selecting a relatively small feature set in just a couple of passes over the training data. However using the feature set output from the perceptron algorithm initialized with their weights CRF training provides an additional reduction in word error rate for a total absolute reduction from the baseline of . 1 Introduction A crucial component of any speech recognizer is the language model LM which assigns scores or probabilities to candidate output strings in a speech recognizer. The language model is used in combination with an acoustic model to give an overall score to candidate word sequences that ranks them in order of probability or plausibility. A dominant approach in speech recognition has been to use a source-channel or noisy-channel model. In this approach language modeling is effectively framed as density estimation the language model s task is to define a distribution over the source - . the possible strings in the language. Markov n-gram models are often used for this task whose parameters are optimized to maximize the likelihood of a large amount of training text. Recognition performance is a direct measure of the effectiveness of a language model an indirect measure which is frequently proposed within

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.