TAILIEUCHUNG - Báo cáo khoa học: "Compiling Boostexter Rules into a Finite-state Transducer"

A number of NLP tasks have been effectively modeled as classiﬁcation tasks using a variety of classiﬁcation techniques. Most of these tasks have been pursued in isolation with the classiﬁer assuming unambiguous input. In order for these techniques to be more broadly applicable, they need to be extended to apply on weighted packed representations of ambiguous input. One approach for achieving this is to represent the classiﬁcation model as a weighted ﬁnite-state transducer (WFST). | Compiling Boostexter Rules into a Finite-state Transducer Srinivas Bangalore AT T Labs-Research 180 Park Avenue Florham Park NJ 07932 Abstract A number of NLP tasks have been effectively modeled as classification tasks using a variety of classification techniques. Most of these tasks have been pursued in isolation with the classifier assuming unambiguous input. In order for these techniques to be more broadly applicable they need to be extended to apply on weighted packed representations of ambiguous input. One approach for achieving this is to represent the classification model as a weighted finite-state transducer WFST . In this paper we present a compilation procedure to convert the rules resulting from an AdaBoost classifier into an WFST. We validate the compilation technique by applying the resulting WFST on a call-routing application. 1 Introduction Many problems in Natural Language Processing NLP can be modeled as classification tasks either at the word or at the sentence level. For example part-of-speech tagging named-entity identification supertagging1 word sense disambiguation are tasks that have been modeled as classification problems at the word level. In addition there are problems that classify the entire sentence or document into one of a set of categories. These problems are loosely characterized as semantic classification and have been used in many practical applications including call routing and text classification. Most of these problems have been addressed in isolation assuming unambiguous one-best input. Typically however in NLP applications these modules are chained together with each module introducing some amount of error. In order to alleviate the errors introduced by a module it is typical for a module to provide multiple weighted solutions ideally as a packed representation that serve as input to the next module. For example a speech recognizer provides a lattice of possible recognition outputs that is to be annotated with part-of-speech

Sơn Hà 50 4 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Compiling Boostexter Rules into a Finite-state Transducer"

4 40 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462351 61

Giới thiệu :Lập trình mã nguồn mở

14 26696 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11376 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10568 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9855 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8519 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7925 1821

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7290 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Đóng mới oto 8 chỗ ngồi part 9

10 187 3 09-01-2025

Giáo trình phân tích phương trình vi phân viết dưới dạng thuật toán đặc tính của hệ thống p1

5 170 1 09-01-2025

Báo cáo nghiên cứu nông nghiệp " Field control of pest fruit flies in Vietnam "

14 195 4 09-01-2025

Bảng màu theo chữ cái – V

11 177 2 09-01-2025

Chương 10: Các phương pháp tính quá trình quá độ trong mạch điện tuyến tính

57 246 8 09-01-2025

ĐỀ TÀI " ĐÁNH GIÁ HIỆU QUẢ HOẠT ĐỘNG KINH DOANH NGOẠI HỐI CỦA NGÂN HÀNG THƯƠNG MẠI CỔ PHẦN XUẤT NHẬP KHẨU VIỆT NAM "

51 159 3 09-01-2025

Báo cáo nghiên cứu khoa học " Sự nhất quán phát triển kinh tế thị trường XHCN trong xây dựng xã hội hài hoà của Trung Quốc và đổi mới của Việt Nam "

8 151 1 09-01-2025

CUỘC KHÁNG CHIẾN CHỐNG THỰC DÂN PHÁP KẾT THÚC (1953 - 1954)_5

11 153 1 09-01-2025

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 143 1 09-01-2025

SQL và PL/SQLCơ bản.Oracle cơ bản - SQL và PL/SQLMỤC LỤCMỤC LỤC ... CHƯƠNG

104 168 0 09-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7925 1821

Ebook Chào con ba mẹ đã sẵn sàng

112 4436 1376

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6361 1276

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3859 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3930 610

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4778 567

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11376 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4534 490