TAILIEUCHUNG - Báo cáo khoa học: "Efﬁcient Inference of CRFs for Large-Scale Natural Language Data"

This paper presents an efﬁcient inference algorithm of conditional random ﬁelds (CRFs) for large-scale data. Our key idea is to decompose the output label state into an active set and an inactive set in which most unsupported transitions become a constant. Our method uniﬁes two previous methods for efﬁcient inference of CRFs, and also derives a simple but robust special case that performs faster than exact inference when the active sets are sufﬁciently small. We demonstrate that our method achieves dramatic speedup on six standard natural language processing problems. . | Efficient Inference of CRFs for Large-Scale Natural Language Data Minwoo Jeong Chin-Yew Lin- Gary Geunbae Lee tPohang University of Science Technology Pohang Korea Microsoft Research Asia Beijing China t stardust gblee @ lcyl@ Abstract This paper presents an efficient inference algorithm of conditional random fields CRFs for large-scale data. Our key idea is to decompose the output label state into an active set and an inactive set in which most unsupported transitions become a constant. Our method unifies two previous methods for efficient inference of CRFs and also derives a simple but robust special case that performs faster than exact inference when the active sets are sufficiently small. We demonstrate that our method achieves dramatic speedup on six standard natural language processing problems. 1 Introduction Conditional random fields CRFs are widely used in natural language processing but extending them to large-scale problems remains a significant challenge. For simple graphical structures . linear-chain an exact inference can be obtained efficiently if the number of output labels is not large. However for large number of output labels the inference is often prohibitively expensive. To alleviate this problem researchers have begun to study the methods of increasing inference speeds of CRFs. Pal et al. 2006 proposed a Sparse ForwardBackward SFB algorithm in which marginal distribution is compressed by approximating the true marginals using Kullback-Leibler KL divergence. Cohn 2006 proposed a Tied Potential TP algorithm which constrains the labeling considered in each feature function such that the functions can detect only a relatively small set of labels. Both of these techniques efficiently compute the marginals with a significantly reduced runtime resulting in faster training and decoding of CRFs. This paper presents an efficient inference algorithm of CRFs which unifies the SFB and TP approaches. We first decompose output .

Khánh My 91 4 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Efﬁcient Search for Transformation-based Inference"

9 49 0

Báo cáo khoa học: "Efﬁcient Inference Through Cascades of Weighted Tree Transducers"

9 76 0

Báo cáo khoa học: "Efﬁcient Inference of CRFs for Large-Scale Natural Language Data"

4 73 0

Báo cáo khoa học: "Sequential Labeling with Latent Variables: An Exact Inference Algorithm and Its Efﬁcient Approximation"

9 44 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462348 61

Giới thiệu :Lập trình mã nguồn mở

14 26550 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11372 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10560 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9852 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8897 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8512 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8108 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7867 1809

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7285 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Bảng màu theo chữ cái – V

11 174 2 06-01-2025

báo cáo hóa học:" Perceptions of rewards among volunteer caregivers of people living with AIDS working in faith-based organizations in South Africa: a qualitative study"

10 162 1 06-01-2025

CHƯƠNG 2: RỦI RO THÂM HỤT TÀI KHÓA

28 165 1 06-01-2025

Báo cáo y học: "The Factors Influencing Depression Endpoints Research (FINDER) study: final results of Italian patients with depressio"

9 154 1 06-01-2025

Báo cáo " Bàn về hành vi pháp luật và hành vi đạo đức "

11 181 2 06-01-2025

Word Games with English 1

65 145 1 06-01-2025

Báo cáo nghiên cứu khoa học " Sự nhất quán phát triển kinh tế thị trường XHCN trong xây dựng xã hội hài hoà của Trung Quốc và đổi mới của Việt Nam "

8 148 1 06-01-2025

Chủ đề 3 : SỰ CÂN BẰNG CỦA VẬT RẮN (4 tiết)

9 214 1 06-01-2025

Xinh xinh vườn nhà

6 135 0 06-01-2025

TRẮC NGHIỆM - CÁC BỆNH THIẾU DINH DƯỠNG THƯỜNG GẶP

32 217 2 06-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8108 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7867 1809

Ebook Chào con ba mẹ đã sẵn sàng

112 4429 1376

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6336 1275

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8897 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3856 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3927 610

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4759 567

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11372 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4530 490