TAILIEUCHUNG - Báo cáo khoa học: "Stochastic Lexicalized Inversion Transduction Grammar for Alignment"

We present a version of Inversion Transduction Grammar where rule probabilities are lexicalized throughout the synchronous parse tree, along with pruning techniques for efﬁcient training. Alignment results improve over unlexicalized ITG on short sentences for which full EM is feasible, but pruning seems to have a negative impact on longer sentences. | Stochastic Lexicalized Inversion Transduction Grammar for Alignment Hao Zhang and Daniel Gildea Computer Science Department University of Rochester Rochester NY 14627 Abstract We present a version of Inversion Transduction Grammar where rule probabilities are lexicalized throughout the synchronous parse tree along with pruning techniques for efficient training. Alignment results improve over unlexicalized ITG on short sentences for which full EM is feasible but pruning seems to have a negative impact on longer sentences. 1 Introduction The Inversion Transduction Grammar ITG of Wu 1997 is a syntactically motivated algorithm for producing word-level alignments of pairs of transla-tionally equivalent sentences in two languages. The algorithm builds a synchronous parse tree for both sentences and assumes that the trees have the same underlying structure but that the ordering of constituents may differ in the two languages. This probabilistic syntax-based approach has inspired much subsequent reasearch. Alshawi et al. 2000 use hierarchical finite-state transducers. In the tree-to-string model of Yamada and Knight 2001 a parse tree for one sentence of a translation pair is projected onto the other string. Melamed 2003 presents algorithms for synchronous parsing with more complex grammars discussing how to parse grammars with greater than binary branching and lexicalization of synchronous grammars. Despite being one of the earliest probabilistic syntax-based translation models ITG remains state-of-the art. Zens and Ney 2003 found that the constraints of ITG were a better match to the decoding task than the heuristics used in the IBM decoder of Berger et al. 1996 . Zhang and Gildea 2004 found ITG to outperform the tree-to-string model for word-level alignment as measured against human gold-standard alignments. One explanation for this result is that while a tree representation is helpful for modeling translation the trees assigned by the traditional monolingual parsers and

Việt Quốc 81 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Stochastic Lexicalized Inversion Transduction Grammar for Alignment"

8 62 0

Báo cáo khoa học: "Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training"

8 46 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462386 61

Giới thiệu :Lập trình mã nguồn mở

14 27289 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11388 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10588 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9870 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8914 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8539 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8114 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 8077 1836

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7324 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 232 4 23-01-2025

Quy Trình Canh Tác Cây Bông Vải

8 172 3 23-01-2025

Bảng màu theo chữ cái – V

11 177 2 23-01-2025

Chương 10: Các phương pháp tính quá trình quá độ trong mạch điện tuyến tính

57 247 8 23-01-2025

Sử dụng mô hình ARCH và GARCH để phân tích và dự báo về giá cổ phiếu trên thị trường chứng khoán

24 1080 2 23-01-2025

Báo cáo " Bàn về hành vi pháp luật và hành vi đạo đức "

11 182 2 23-01-2025

Bệnh sán lá gan trên gia súc và cách phòng trị

3 171 1 23-01-2025

IT Audit: EMC’s Journey to the Private Cloud

13 165 1 23-01-2025

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 143 1 23-01-2025

Xinh xinh vườn nhà

6 135 0 23-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8114 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 8077 1836

Ebook Chào con ba mẹ đã sẵn sàng

112 4475 1381

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6463 1285

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8914 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3884 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3934 613

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4833 568

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11388 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4551 490