Scientific report: "PORT: a Precision-Order-Recall MT Evaluation Metric for Tuning"


PORT: a Precision-Order-Recall MT Evaluation Metric for Tuning

Boxing Chen, Roland Kuhn and Samuel Larkin
National Research Council Canada
283 Alexandre-Taché Boulevard, Gatineau, Quebec, Canada J8X 3X7
{Boxing.Chen, Roland.Kuhn, Samuel.Larkin}@nrc.ca

Abstract

Many machine translation (MT) evaluation metrics have been shown to correlate better with human judgment than BLEU. In principle, tuning on these metrics should yield better systems than tuning on BLEU. However, due to issues such as speed, requirements for linguistic resources, and optimization difficulty, they have not been widely adopted for tuning. This paper presents PORT (Precision-Order-Recall Tunable metric), a new MT evaluation metric which combines precision, recall and an ordering metric, and which is primarily designed for tuning MT systems. PORT does not require external resources and is quick to compute. It has a better correlation with human judgment than BLEU. We compare PORT-tuned MT systems to BLEU-tuned baselines in five experimental conditions involving four language pairs. PORT tuning achieves consistently better performance than BLEU tuning, according to four automated metrics (including BLEU) and to human evaluation: in comparisons of outputs from 300 source sentences, human judges preferred the PORT-tuned output 45.3% of the time (vs. 32.7% BLEU tuning preferences and 22.0% ties).

1 Introduction

Automatic evaluation metrics for machine translation (MT) quality are a key part of building statistical MT (SMT) systems. They play two roles: to allow rapid (though sometimes inaccurate) comparisons between different systems or between different versions of the same system, and to perform tuning of parameter values during system training. The latter has become important since the invention of minimum error rate training (MERT) (Och, 2003) and related tuning methods. These methods perform repeated decoding runs with different system parameter values, which are tuned to optimize the value of the evaluation metric over a ...
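
The abstract describes PORT as combining precision, recall and an ordering metric, but this excerpt does not give the formula. As a minimal sketch of the first two ingredients only, the Python below computes clipped unigram precision and recall of one hypothesis against one reference. The function name `unigram_precision_recall`, whitespace tokenization, and the single-reference setup are assumptions made for illustration; this is not the PORT definition and it omits the ordering component entirely.

```python
from collections import Counter


def unigram_precision_recall(hypothesis, reference):
    """Clipped unigram precision and recall (illustrative, not PORT itself)."""
    # Assumption: whitespace tokenization; a real MT pipeline would use
    # its own tokenizer.
    hyp_tokens = hypothesis.split()
    ref_tokens = reference.split()
    hyp_counts = Counter(hyp_tokens)
    ref_counts = Counter(ref_tokens)
    # Counter intersection keeps the minimum count per token, so a word
    # repeated in the hypothesis is credited at most as often as it
    # appears in the reference (clipping).
    matches = sum((hyp_counts & ref_counts).values())
    precision = matches / len(hyp_tokens) if hyp_tokens else 0.0
    recall = matches / len(ref_tokens) if ref_tokens else 0.0
    return precision, recall


p, r = unigram_precision_recall("the cat sat on mat", "the cat sat on the mat")
print(f"precision={p:.3f} recall={r:.3f}")
```

In this toy pair every hypothesis token is matched, so precision is high, while the missing second "the" lowers recall. A metric that looks only at precision would not penalize the omission, which is one motivation for combining the two.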

