TAILIEUCHUNG - Báo cáo khoa học: "Efficient Staggered Decoding for Sequence Labeling"

The Viterbi algorithm is the conventional decoding algorithm most widely adopted for sequence labeling. Viterbi decoding is, however, prohibitively slow when the label set is large, because its time complexity is quadratic in the number of labels. This paper proposes an exact decoding algorithm that overcomes this problem. A novel property of our algorithm is that it efficiently reduces the labels to be decoded, while still allowing us to check the optimality of the solution. | Efficient Staggered Decoding for Sequence Labeling Nobuhiro Kaji Yasuhiro Fujiwara Naoki Yoshinaga Masaru Kitsuregawa Institute of Industrial Science The University of Tokyo 4-6-1 Komaba Meguro-ku Tokyo 153-8505 Japan kaji fujiwara ynaga kisture @ Abstract The Viterbi algorithm is the conventional decoding algorithm most widely adopted for sequence labeling. Viterbi decoding is however prohibitively slow when the label set is large because its time complexity is quadratic in the number of labels. This paper proposes an exact decoding algorithm that overcomes this problem. A novel property of our algorithm is that it efficiently reduces the labels to be decoded while still allowing us to check the optimality of the solution. Experiments on three tasks POS tagging joint POS tagging and chunking and supertagging show that the new algorithm is several orders of magnitude faster than the basic Viterbi and a state-of-the-art algorithm CarpeDiem Esposito and Radi-cioni 2009 . 1 Introduction In the past decade sequence labeling algorithms such as HMMs CRFs and Collins perceptrons have been extensively studied in the field of NLP Rabiner 1989 Lafferty et al. 2001 Collins 2002 . Now they are indispensable in a wide range of NLP tasks including chunking POS tagging NER and so on Sha and Pereira 2003 Tsuruoka and Tsujii 2005 Lin and Wu 2009 . One important task in sequence labeling is how to find the most probable label sequence from among all possible ones. This task referred to as decoding is usually carried out using the Viterbi algorithm Viterbi 1967 . The Viterbi algorithm has O NL2 time complexity 1 where N is the input size and L is the number of labels. Although the Viterbi algorithm is generally efficient 1The first-order Markov assumption is made throughout this paper although our algorithm is applicable to higher-order Markov models as well. it becomes prohibitively slow when dealing with a large number of labels since its computational cost is

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.