TAILIEUCHUNG - Báo cáo khoa học: "Practical very large scale CRFs"

Conditional Random Fields (CRFs) are a widely-used approach for supervised sequence labelling, notably due to their ability to handle large description spaces and to integrate structural dependency between labels. Even for the simple linearchain model, taking structure into account implies a number of parameters and a computational effort that grows quadratically with the cardinality of the label set. In this paper, we address the issue of training very large CRFs, containing up to hundreds output labels and several billion features. Efﬁciency stems here from the sparsity induced by the use of a 1 penalty term. . | Practical very large scale CRFs Thomas Lavergne LIMSI - CNRS lavergne@ Olivier Cappe Telecom ParisTech LTCI - CNRS cappe@ Francois Yvon Universite Paris-Sud 11 LIMSI - CNRS yvon@ Abstract Conditional Random Fields CRFs are a widely-used approach for supervised sequence labelling notably due to their ability to handle large description spaces and to integrate structural dependency between labels. Even for the simple linear-chain model taking structure into account implies a number of parameters and a computational effort that grows quadrati-cally with the cardinality of the label set. In this paper we address the issue of training very large CRFs containing up to hundreds output labels and several billion features. Efficiency stems here from the sparsity induced by the use of a c penalty term. Based on our own implementation we compare three recent proposals for implementing this regularization strategy. Our experiments demonstrate that very large CRFs can be trained efficiently and that very large models are able to improve the accuracy while delivering compact parameter sets. 1 Introduction Conditional Random Fields CRFs Lafferty et al. 2001 Sutton and McCallum 2006 constitute a widely-used and effective approach for supervised structure learning tasks involving the mapping between complex objects such as strings and trees. An important property of CRFs is their ability to handle large and redundant feature sets and to integrate structural dependency between output labels. However even for simple linear chain CRFs the complexity of learning and inference This work was partly supported by ANR projects CroTaL ANR-07-MDCO-003 and MGA ANR-07-BLAN-0311-02 . grows quadratically with respect to the number of output labels and so does the number of structural features ie. features testing adjacent pairs of labels. Most empirical studies on CRFs thus either consider tasks with a restricted output space typically in the order of few dozens of output .

Lương Tài 93 10 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Practical very large scale CRFs"

10 76 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462363 61

Giới thiệu :Lập trình mã nguồn mở

14 26831 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11378 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10573 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9857 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8910 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8524 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8111 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 8000 1826

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7298 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

B2B Content Marketing: 2012 Benchmarks, Budgets & Trends

17 242 3 12-01-2025

báo cáo hóa học:" Increased androgen receptor expression in serous carcinoma of the ovary is associated with an improved survival"

6 163 3 12-01-2025

Báo cáo nghiên cứu nông nghiệp " Field control of pest fruit flies in Vietnam "

14 196 4 12-01-2025

Quy Trình Canh Tác Cây Bông Vải

8 171 3 12-01-2025

báo cáo hóa học:" Perceptions of rewards among volunteer caregivers of people living with AIDS working in faith-based organizations in South Africa: a qualitative study"

10 165 1 12-01-2025

Valve Selection Handbook - Fourth Edition

337 150 2 12-01-2025

Báo cáo nghiên cứu khoa học " Vai trò chính quyền địa phương trong phát triển kinh tế : khu chuyên doanh gốm sứ ( Trung Quốc ) và Bát Tràng ( Việt Nam )("

11 218 1 12-01-2025

Báo cáo nghiên cứu khoa học " NÂNG QUAN HỆ KINH TẾ THƯƠNG MẠI VIỆT NAM - TRUNG QUỐC LÊN TẦM CAO THỜI ĐẠI "

8 178 1 12-01-2025

IT Audit: EMC’s Journey to the Private Cloud

13 163 1 12-01-2025

Sáng kiến kinh nghiệm môn mỹ thuật

5 185 1 12-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8111 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 8000 1826

Ebook Chào con ba mẹ đã sẵn sàng

112 4443 1376

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6386 1279

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8910 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3862 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3930 610

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4784 567

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11378 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4538 490