Scientific report: "Approximation Lasso Methods for Language Modeling"



Approximation Lasso Methods for Language Modeling

Jianfeng Gao
Microsoft Research, One Microsoft Way, Redmond, WA 98052, USA
jfgao@microsoft.com

Hisami Suzuki
Microsoft Research, One Microsoft Way, Redmond, WA 98052, USA
hisamis@microsoft.com

Abstract

Lasso is a regularization method for parameter estimation in linear models. It optimizes the model parameters with respect to a loss function subject to model complexities. This paper explores the use of lasso for statistical language modeling for text input. Owing to the very large number of parameters, directly optimizing the penalized lasso loss function is impossible. Therefore, we investigate two approximation methods: the boosted lasso (BLasso) and the forward stagewise linear regression (FSLR). Both methods, when used with the exponential loss function, bear strong resemblance to the boosting algorithm, which has been used as a discriminative training method for language modeling. Evaluations on the task of Japanese text input show that BLasso is able to produce the best approximation to the lasso solution and leads to a significant improvement, in terms of character error rate, over boosting and the traditional maximum likelihood estimation.

1 Introduction

Language modeling (LM) is fundamental to a wide range of applications. Recently, it has been shown that a linear model estimated using discriminative training methods, such as the boosting and perceptron algorithms, significantly outperforms a traditional word trigram model trained using maximum likelihood estimation (MLE) on several tasks, such as speech recognition and Asian language text input (Bacchiani et al., 2004; Roark et al., 2004; Gao et al., 2005; Suzuki and Gao, 2005). The success of discriminative training methods is largely due to the fact that, unlike the traditional approach (e.g., MLE) that maximizes a function (e.g., the likelihood of training data) that is loosely associated with error rate, discriminative training methods aim to directly minimize the error rate on training data even if
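To make the abstract's idea of forward stagewise linear regression more concrete, the following is a minimal sketch in Python, not the authors' implementation. It rests on several assumptions that are not in the excerpt: squared loss stands in for the exponential loss the paper uses, the data are a made-up three-feature toy problem rather than language model features, and the names `forward_stagewise` and `make_toy_data`, the step size, and the iteration cap are hypothetical. At each iteration it moves one weight by a small fixed step in whichever direction reduces the loss most, and stops when no such step helps. Small-step procedures of this kind are commonly used to approximate the lasso path when solving the penalized problem directly is too costly.

```python
import random


def forward_stagewise(X, y, step=0.01, max_iter=5000):
    """Toy forward stagewise regression with squared loss (illustrative only)."""
    n_features = len(X[0])
    w = [0.0] * n_features
    residual = [float(v) for v in y]
    # Squared norm of each feature column, reused in every iteration.
    col_sq = [sum(row[j] ** 2 for row in X) for j in range(n_features)]

    for _ in range(max_iter):
        best_j, best_delta, best_change = None, 0.0, 0.0
        for j in range(n_features):
            # Inner product of feature j with the current residual.
            corr = sum(row[j] * r for row, r in zip(X, residual))
            sign = 1.0 if corr >= 0 else -1.0
            # Change in squared loss if w[j] moves by sign * step.
            change = -2.0 * step * abs(corr) + step * step * col_sq[j]
            if change < best_change:
                best_j, best_delta, best_change = j, sign * step, change
        if best_j is None:
            # No fixed-size step lowers the loss any further.
            break
        w[best_j] += best_delta
        residual = [r - best_delta * row[best_j] for row, r in zip(X, residual)]
    return w


def make_toy_data(n=200, seed=0):
    """Assumed toy data: y depends on features 0 and 2 only, with no noise."""
    rng = random.Random(seed)
    X = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(n)]
    y = [2.0 * row[0] - 1.0 * row[2] for row in X]
    return X, y


X, y = make_toy_data()
w = forward_stagewise(X, y)
print([round(v, 2) for v in w])
```

On this toy problem the weights for features 0 and 2 grow gradually toward the values used to generate `y`, while the irrelevant feature stays near zero. Stopping the loop early would leave a sparser, more heavily regularized model, which is the sense in which the number of small steps plays the role of a complexity control.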

