Scientific report: "Log-linear Models for Word Alignment"

Yang Liu, Qun Liu and Shouxun Lin
Institute of Computing Technology, Chinese Academy of Sciences
No. 6 Kexueyuan South Road, Haidian District, P.O. Box 2704, Beijing 100080, China
{yliu, liuqun, sxlin}@

Abstract

We present a framework for word alignment based on log-linear models. All knowledge sources are treated as feature functions, which depend on the source language sentence, the target language sentence and possible additional variables. Log-linear models allow statistical alignment models to be easily extended by incorporating syntactic information. In this paper, we use IBM Model 3 alignment probabilities, POS correspondence, and bilingual dictionary coverage as features. Our experiments show that log-linear models significantly outperform IBM translation models.

1 Introduction

Word alignment, which can be defined as an object for indicating the corresponding words in a parallel text, was first introduced as an intermediate result of statistical translation models (Brown et al., 1993). In statistical machine translation, word alignment plays a crucial role, as word-aligned corpora have been found to be an excellent source of translation-related knowledge.

Various methods have been proposed for finding word alignments between parallel texts. There are generally two categories of alignment approaches: statistical approaches and heuristic approaches. Statistical approaches, which depend on a set of unknown parameters that are learned from training data, try to describe the relationship between a bilingual sentence pair (Brown et al., 1993; Vogel and Ney, 1996). Heuristic approaches obtain word alignments by using various similarity functions between the types of the two languages (Smadja et al., 1996; Ker and Chang, 1997; Melamed, 2000). The central distinction between statistical and heuristic approaches is that statistical approaches are based on well-founded probabilistic models while heuristic ones are not. Studies reveal that statistical ...
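To make the idea of combining knowledge sources as feature functions concrete, the following is a minimal sketch of a generic log-linear model over candidate alignments. It is not the authors' implementation. The names `dictionary_coverage`, `alignment_posteriors`, `TOY_DICTIONARY`, the toy sentence pair, and the weight value are all assumptions made for illustration. Each candidate alignment is scored by a weighted sum of feature values, and the scores are normalized with a softmax over the candidates.

```python
import math

# Hypothetical toy bilingual dictionary (an assumption, not data from the paper).
TOY_DICTIONARY = {("the", "das"), ("house", "Haus")}


def dictionary_coverage(alignment, source, target):
    """Illustrative feature: fraction of links whose word pair is in the toy dictionary."""
    if not alignment:
        return 0.0
    hits = sum((source[i], target[j]) in TOY_DICTIONARY for i, j in alignment)
    return hits / len(alignment)


def alignment_posteriors(weights, features, candidates, source, target):
    """Return p(a | source, target) proportional to exp(sum_m weight_m * h_m(a, source, target))."""
    scores = [
        sum(w * h(a, source, target) for w, h in zip(weights, features))
        for a in candidates
    ]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


# Toy sentence pair and two candidate alignments, each a set of
# (source_index, target_index) links.
source = ["the", "house"]
target = ["das", "Haus"]
candidates = [
    frozenset({(0, 0), (1, 1)}),  # monotone alignment
    frozenset({(0, 1), (1, 0)}),  # crossed alignment
]

weights = [2.0]  # hypothetical feature weight; in practice weights are trained
features = [dictionary_coverage]
probs = alignment_posteriors(weights, features, candidates, source, target)
```

In this sketch, adding another knowledge source only requires appending a function to `features` and a weight to `weights`, which is the kind of extensibility the abstract attributes to log-linear models.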
