TAILIEUCHUNG - Báo cáo khoa học: "Practical very large scale CRFs"

Conditional Random Fields (CRFs) are a widely-used approach for supervised sequence labelling, notably due to their ability to handle large description spaces and to integrate structural dependency between labels. Even for the simple linearchain model, taking structure into account implies a number of parameters and a computational effort that grows quadratically with the cardinality of the label set. In this paper, we address the issue of training very large CRFs, containing up to hundreds output labels and several billion features. Efficiency stems here from the sparsity induced by the use of a 1 penalty term. . | Practical very large scale CRFs Thomas Lavergne LIMSI - CNRS lavergne@ Olivier Cappe Telecom ParisTech LTCI - CNRS cappe@ Francois Yvon Universite Paris-Sud 11 LIMSI - CNRS yvon@ Abstract Conditional Random Fields CRFs are a widely-used approach for supervised sequence labelling notably due to their ability to handle large description spaces and to integrate structural dependency between labels. Even for the simple linear-chain model taking structure into account implies a number of parameters and a computational effort that grows quadrati-cally with the cardinality of the label set. In this paper we address the issue of training very large CRFs containing up to hundreds output labels and several billion features. Efficiency stems here from the sparsity induced by the use of a c penalty term. Based on our own implementation we compare three recent proposals for implementing this regularization strategy. Our experiments demonstrate that very large CRFs can be trained efficiently and that very large models are able to improve the accuracy while delivering compact parameter sets. 1 Introduction Conditional Random Fields CRFs Lafferty et al. 2001 Sutton and McCallum 2006 constitute a widely-used and effective approach for supervised structure learning tasks involving the mapping between complex objects such as strings and trees. An important property of CRFs is their ability to handle large and redundant feature sets and to integrate structural dependency between output labels. However even for simple linear chain CRFs the complexity of learning and inference This work was partly supported by ANR projects CroTaL ANR-07-MDCO-003 and MGA ANR-07-BLAN-0311-02 . grows quadratically with respect to the number of output labels and so does the number of structural features ie. features testing adjacent pairs of labels. Most empirical studies on CRFs thus either consider tasks with a restricted output space typically in the order of few dozens of output .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.