Scientific report: "Human Evaluation of a German Surface Realisation Ranker"

Aoife Cahill
Institut für Maschinelle Sprachverarbeitung (IMS), University of Stuttgart, 70174 Stuttgart, Germany

Martin Forst
Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto, CA 94304, USA
mforst@

Abstract

In this paper we present a human-based evaluation of surface realisation alternatives. We examine the relative rankings of naturally occurring corpus sentences and automatically generated strings chosen by statistical models (language model, log-linear model), as well as the naturalness of the strings chosen by the log-linear model. We also investigate to what extent preceding context has an effect on choice. We show that native speakers do accept quite some variation in word order, but there are also clearly factors that make certain realisation alternatives more natural.

1 Introduction

An important component of research on surface realisation (the task of generating strings for a given abstract representation) is evaluation, especially if we want to be able to compare across systems. There is consensus that exact match with respect to an actually observed corpus sentence is too strict a metric and that BLEU score measured against corpus sentences can only give a rough impression of the quality of the system output.
It is unclear, however, what kind of metric would be most suitable for the evaluation of string realisations, so that, as a result, there have been a range of automatic metrics applied, including inter alia exact match, string edit distance, NIST SSA, BLEU, NIST, ROUGE, generation string accuracy, generation tree accuracy and word accuracy (Bangalore et al., 2000; Callaway, 2003; Nakanishi et al., 2005; Velldal and Oepen, 2006; Belz and Reiter, 2006). It is not always clear how appropriate these metrics are, especially at the level of individual sentences. Using automatic evaluation metrics cannot be avoided, but ideally a metric for the evaluation of realisation rankers would rank ...
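To make two of the automatic metrics named above concrete, the following is a minimal sketch of exact match and word-level string edit distance, plus a length-normalised accuracy score. It is an illustration under stated assumptions and not the paper's evaluation code: the function names, the whitespace tokenisation, the normalisation by reference length and the two German example sentences are all hypothetical choices made for this sketch.

```python
def tokenize(sentence: str) -> list[str]:
    """Split on whitespace (an assumption; real evaluations use a proper tokeniser)."""
    return sentence.split()


def exact_match(candidate: str, reference: str) -> bool:
    """True only if the candidate reproduces the reference word for word."""
    return tokenize(candidate) == tokenize(reference)


def word_edit_distance(candidate: str, reference: str) -> int:
    """Word-level Levenshtein distance (insertions, deletions, substitutions)."""
    cand, ref = tokenize(candidate), tokenize(reference)
    # previous[j] holds the distance between the candidate prefix processed
    # so far and the first j reference words.
    previous = list(range(len(ref) + 1))
    for i, cand_word in enumerate(cand, start=1):
        current = [i]
        for j, ref_word in enumerate(ref, start=1):
            substitution_cost = 0 if cand_word == ref_word else 1
            current.append(min(
                previous[j] + 1,                      # drop a candidate word
                current[j - 1] + 1,                   # add a reference word
                previous[j - 1] + substitution_cost,  # match or substitute
            ))
        previous = current
    return previous[-1]


def string_accuracy(candidate: str, reference: str) -> float:
    """Hypothetical accuracy score: 1 - edit distance / reference length."""
    reference_length = len(tokenize(reference))
    if reference_length == 0:
        raise ValueError("reference must contain at least one word")
    return 1.0 - word_edit_distance(candidate, reference) / reference_length


# Invented example pair: same words, different constituent order.
reference = "Gestern hat Maria das Buch gelesen"
candidate = "Maria hat gestern das Buch gelesen"

if __name__ == "__main__":
    print(exact_match(candidate, reference))
    print(word_edit_distance(candidate, reference))
    print(string_accuracy(candidate, reference))
```

In this invented pair the candidate is a plausible word order variant of the reference, yet exact match rejects it outright and the edit distance counts two substitutions. This is the kind of sentence-level behaviour that makes it hard to judge how appropriate such metrics are for individual realisations.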
