TAILIEUCHUNG - Scientific report: "Training Conditional Random Fields with Multivariate Evaluation Measures"

Jun Suzuki, Erik McDermott and Hideki Isozaki
NTT Communication Science Laboratories, NTT Corp.
2-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237 Japan
{jun, mcd, isozaki}@...

Abstract

This paper proposes a framework for training Conditional Random Fields (CRFs) to optimize multivariate evaluation measures, including non-linear measures such as F-score. Our proposed framework is derived from an error minimization approach that provides a simple solution for directly optimizing any evaluation measure. Specifically focusing on sequential segmentation tasks, i.e. text chunking and named entity recognition, we introduce a loss function that closely reflects the target evaluation measure for these tasks, namely, segmentation F-score. Our experiments show that our method performs better than standard CRF training.

1 Introduction

Conditional random fields (CRFs) are a recently introduced formalism (Lafferty et al., 2001) for representing a conditional model p(y|x), where both a set of inputs, x, and a set of outputs, y, display non-trivial interdependency. CRFs are basically defined as a discriminative model of Markov random fields conditioned on inputs (observations) x. Unlike generative models, CRFs model only the output y's distribution over x. This allows CRFs to use flexible features such as complicated functions of multiple observations.
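
For orientation, the conditional model p(y|x) described above is commonly written in log-linear form. The block below is a sketch in standard textbook notation, not taken from this excerpt: the weight vector \lambda, the global feature vector F(y, x), and the normalizer Z_\lambda(x) are assumed symbols.

```latex
p(y \mid x; \lambda) = \frac{\exp\bigl(\lambda \cdot F(y, x)\bigr)}{Z_\lambda(x)},
\qquad
Z_\lambda(x) = \sum_{y'} \exp\bigl(\lambda \cdot F(y', x)\bigr)
```

Because only the distribution over y is normalized, F may depend on the entire input x, which is what permits features defined over multiple observations.
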
The modeling power of CRFs has been of great benefit in several applications, such as shallow parsing (Sha and Pereira, 2003) and information extraction (McCallum and Li, 2003). Since the introduction of CRFs, intensive research has been undertaken to boost their effectiveness. The first approach to estimating CRF parameters is the maximum likelihood (ML) criterion over conditional probability p(y|x) itself (Lafferty et al., 2001). The ML criterion, however, is prone to over-fitting the training data, especially since CRFs are often trained with a very large number of correlated ...
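
The ML criterion scores whole label sequences by their conditional probability, whereas the measure targeted in the abstract, segmentation F-score, counts segments that match exactly. The following is a minimal sketch of that measure, assuming BIO-style tags such as "B-NP" and "I-NP" and the usual F-beta formula; the function names `extract_segments` and `segmentation_f_score` and the `beta` parameter are illustrative and do not come from the paper.

```python
from typing import List, Set, Tuple

# (start index, end index exclusive, segment type)
Segment = Tuple[int, int, str]


def extract_segments(tags: List[str]) -> Set[Segment]:
    """Collect (start, end, type) segments from one BIO tag sequence."""
    segments: Set[Segment] = set()
    start, seg_type = None, None
    # A trailing sentinel "O" closes any segment still open at the end.
    for i, tag in enumerate(list(tags) + ["O"]):
        prefix, _, label = tag.partition("-")
        continues = prefix == "I" and label == seg_type
        if seg_type is not None and not continues:
            segments.add((start, i, seg_type))
            start, seg_type = None, None
        # Assumption: an "I-" tag with no open segment of its type
        # starts a new segment (a lenient reading of malformed input).
        if prefix == "B" or (prefix == "I" and seg_type is None):
            start, seg_type = i, label
    return segments


def segmentation_f_score(gold: List[List[str]],
                         pred: List[List[str]],
                         beta: float = 1.0) -> float:
    """F-beta over exactly matching segments, pooled across all sequences."""
    tp = n_gold = n_pred = 0
    for gold_tags, pred_tags in zip(gold, pred):
        gold_segs = extract_segments(gold_tags)
        pred_segs = extract_segments(pred_tags)
        tp += len(gold_segs & pred_segs)  # same boundaries and same type
        n_gold += len(gold_segs)
        n_pred += len(pred_segs)
    if tp == 0:
        return 0.0
    precision = tp / n_pred
    recall = tp / n_gold
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    gold = [["B-NP", "I-NP", "O", "B-VP"]]
    pred = [["B-NP", "O", "O", "B-VP"]]
    print(segmentation_f_score(gold, pred))
```

In this toy pair, three of the four tags agree, yet only one of the two gold segments is recovered, so precision and recall are both 1/2. The score depends on all tags of a segment jointly rather than on each tag separately, which is the sense in which the measure is multivariate.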
