TAILIEUCHUNG - Báo cáo khoa học: "M AX S IM: A Maximum Similarity Metric for Machine Translation Evaluation"

We propose an automatic machine translation (MT) evaluation metric that calculates a similarity score (based on precision and recall) of a pair of sentences. Unlike most metrics, we compute a similarity score between items across the two sentences. We then ﬁnd a maximum weight matching between the items such that each item in one sentence is mapped to at most one item in the other sentence. | Max Sim a Maximum Similarity Metric for Machine Translation Evaluation Yee Seng Chan and Hwee Tou Ng Department of Computer Science National University of Singapore Law Link Singapore 117590 chanys nght @ Abstract We propose an automatic machine translation MT evaluation metric that calculates a similarity score based on precision and recall of a pair of sentences. Unlike most metrics we compute a similarity score between items across the two sentences. We then find a maximum weight matching between the items such that each item in one sentence is mapped to at most one item in the other sentence. This general framework allows us to use arbitrary similarity functions between items and to incorporate different information in our comparison such as n-grams dependency relations etc. When evaluated on data from the ACL-07 MT workshop our proposed metric achieves higher correlation with human judgements than all 11 automatic MT evaluation metrics that were evaluated during the workshop. 1 Introduction In recent years machine translation MT research has made much progress which includes the introduction of automatic metrics for MT evaluation. Since human evaluation of MT output is time consuming and expensive having a robust and accurate automatic MT evaluation metric that correlates well with human judgement is invaluable. Among all the automatic MT evaluation metrics BLEU Papineni et al. 2002 is the most widely used. Although BLEU has played a crucial role in the progress of MT research it is becoming evident that BLEU does not correlate with human judgement well enough and suffers from several other deficiencies such as the lack of an intuitive interpretation of its scores. During the recent ACL-07 workshop on statistical MT Callison-Burch et al. 2007 a total of 11 automatic MT evaluation metrics were evaluated for correlation with human judgement. The results show that as compared to BLEU several recently proposed metrics such as Semantic-role overlap .

Thanh Nhã 59 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Prediction of maximum earthquake magnitude for northern Vietnam region based on the gev distribution

6 88 0

Optimal taxation policy maximum-entropy approach

11 130 0

An early prediction of the maximum amplitude of the solar cycle 25

4 41 0

Frequency analysis for one day to six consecutive days of annual maximum rainfall for Mulde, Dist: Sindhudurg

7 75 0

An enhanced genatic algorithm based courses timetabling method for maximal enrollments using maximum matching on bipartite graphs

15 54 0

Inferring differentially expressed pathways using kernel maximum mean discrepancy-based test

7 49 1

Anti-inflammatory properties of amomum maximum roxb and amomum muricarpum elmer in the north of Vietnam

7 19 1

Maximum cladding temperature prediction for nuclear research reactor WWR-SM Tashkent using best estimate code RELAP5/MOD3.3

6 1 1

Building Real Estate Riches How to Invest in New Homes for Maximum Profit

192 46 0

Real Functions in Several Variables - Examples of Maximum, Minimum Integration and Vector Analysis Calculus 2b

1 59 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461928 55

Giới thiệu :Lập trình mã nguồn mở

14 22998 64

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10967 531

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10160 451

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9563 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8361 1127

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8272 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7886 2225

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6808 256

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 6026 1453

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Đánh giá hao mòn và độ tin cậy của chi tiết và kết cấu trên đầu máy diezel part 3

12 320 0 16-05-2024

extremetech Hacking BlackBerry phần 9

31 262 0 16-05-2024

MySQL Basics for Visual Learners PHẦN 9

15 191 0 16-05-2024

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 160 0 16-05-2024

B2B Content Marketing: 2012 Benchmarks, Budgets & Trends

17 146 0 16-05-2024

Báo cáo tốt nghiệp: Vận hành và bảo dưỡng trong MPLS

92 150 3 16-05-2024

GIÁO TRÌNH VI XỬ LÝ 1 - CHƯƠNG 5. LẬP TRÌNH CHO VI ĐIỀU KHIỂN 80C51

23 116 1 16-05-2024

Bài Tiểu Luận Chuyên Đề Tổ Chức Hoạt Động Nhận Thức Trong Dạy Học Vật Lý " Định Luật Ôm Cho Các Loại Đoạn Mạch Chứa Nguồn Điện"

10 159 3 16-05-2024

MẪU CHỨNG CHỈ QUẢN LÝ VŨ KHÍ, VẬT LIỆU NỔ, CCHT

1 126 0 16-05-2024

Giáo trình phân tích phương trình vi phân viết dưới dạng thuật toán đặc tính của hệ thống p1

5 110 0 16-05-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7886 2225

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 6026 1453

Ebook Chào con ba mẹ đã sẵn sàng

112 3784 1250

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5401 1137

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8361 1127

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3541 655

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10967 531

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3745 527

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4157 523

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4183 483