A Scalable Probabilistic Classifier for Language Modeling

Joel Lang
Institute for Language, Cognition and Computation
School of Informatics, University of Edinburgh
10 Crichton Street, Edinburgh EH8 9AB, UK
J.Lang-3@sms.ed.ac.uk

Abstract

We present a novel probabilistic classifier which scales well to problems that involve a large number of classes and require training on large datasets. A prominent example of such a problem is language modeling. Our classifier is based on the assumption that each feature is associated with a predictive strength, which quantifies how well the feature can predict the class by itself. The predictions of individual features can then be combined according to their predictive strength, resulting in a model whose parameters can be reliably and efficiently estimated. We show that a generative language model based on our classifier consistently matches modified Kneser-Ney smoothing and can outperform it if sufficiently rich features are incorporated.

1 Introduction

A Language Model (LM) is an important component within many natural language applications, including speech recognition and machine translation. The task of a generative LM is to assign a probability p(w) to a sequence of words w = w_1 ... w_L.
It is common to factorize this probability as

p(w) = \prod_{i=1}^{L} p(w_i | w_{i-N+1} ... w_{i-1})    (1)

Thus, the central problem that arises from this formulation consists of estimating the conditional probability p(w_i | w_{i-N+1} ... w_{i-1}). This can be viewed as a classification problem, in which the target word w_i corresponds to the class that must be predicted, based on features extracted from the conditioning context, e.g. a word occurring in the context.

This paper describes a novel approach for modeling such conditional probabilities. We propose a classifier which is based on the assumption that each feature has a predictive strength, quantifying how well the feature can predict the class (the target word) by itself. The predictions made by individual features can then be combined into a mixture model.
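The two ideas above can be sketched in code: a strength-weighted mixture over per-feature conditional distributions gives p(w_i | context), and the chain-rule factorization turns those conditionals into a sentence probability. This is a minimal illustration only; the toy distributions, feature names, and strength values below are invented for the example and are not the paper's actual estimator.

```python
import math

# Hypothetical per-feature conditional distributions p(class | feature).
feature_dists = {
    "prev:the":  {"cat": 0.6, "dog": 0.4},
    "prev2:saw": {"cat": 0.5, "dog": 0.5},
}
# Hypothetical predictive strengths for each feature.
strength = {"prev:the": 0.8, "prev2:saw": 0.2}

def mixture_prob(target, features):
    """p(target | features): mix each feature's prediction, weighting
    by its predictive strength (normalized to sum to one)."""
    z = sum(strength[f] for f in features)
    return sum((strength[f] / z) * feature_dists[f].get(target, 0.0)
               for f in features)

def sentence_logprob(word_feature_pairs):
    """Chain rule: log p(w) = sum_i log p(w_i | context features)."""
    return sum(math.log(mixture_prob(w, feats))
               for w, feats in word_feature_pairs)

p = mixture_prob("cat", ["prev:the", "prev2:saw"])
# 0.8 * 0.6 + 0.2 * 0.5 = 0.58
```

Because the mixture weights are a convex combination of proper distributions, the result is itself a proper distribution over classes, which is what makes the combined model usable inside the generative factorization of Equation (1).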