Scientific report: "Reinforcement Learning for Mapping Instructions to Actions"

S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay
Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology
{branavan, harr, lsz, regina}@

Abstract

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function that defines the quality of the executed actions. During training, the learner repeatedly constructs action sequences for a set of documents, executes those actions, and observes the resulting reward. We use a policy gradient algorithm to estimate the parameters of a log-linear model for action selection. We apply our method to interpret instructions in two domains: Windows troubleshooting guides and game tutorials. Our results demonstrate that this method can rival supervised learning techniques while requiring few or no annotated training examples.

1 Introduction

The problem of interpreting instructions written in natural language has been widely studied since the early days of artificial intelligence (Winograd, 1972; Di Eugenio, 1992). Mapping instructions to a sequence of executable actions would enable the automation of tasks that currently require human participation. Examples include configuring software based on how-to guides and operating simulators using instruction manuals. In this paper, we present a reinforcement learning framework for inducing mappings from text to actions without the need for annotated training examples.

For concreteness, consider instructions from a Windows troubleshooting guide on deleting temporary folders, shown in Figure 1. We aim to map ...

1 Code, data, and annotations used in this work are available at http rbg code rl

o Click start, point to search, and then click for files or folders.
o In the search results dialog box, on the tools menu, click folder options.
o In the folder options dialog ...
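The training procedure outlined in the abstract (construct an action sequence, execute it, observe a reward, and update a log-linear policy with a policy gradient step) might be sketched as follows. This is a toy illustration under stated assumptions, not the authors' implementation: the action names, the indicator features, the `toy_reward` function that stands in for environment feedback, and the hyperparameters are all hypothetical choices made for this example.

```python
import math
import random

# Hypothetical action set for a toy GUI environment (not from the paper).
ACTIONS = ["left_click", "right_click", "type_into"]


def features(word, action):
    """Assumed feature map: one indicator per (word, action) pair."""
    return {(word, action): 1.0}


def action_probs(theta, word):
    """Log-linear policy: p(a | word) proportional to exp(theta . phi(word, a))."""
    scores = [
        sum(theta.get(f, 0.0) * v for f, v in features(word, a).items())
        for a in ACTIONS
    ]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def toy_reward(actions, target):
    """Stand-in for environment feedback: fraction of steps that match."""
    return sum(a == t for a, t in zip(actions, target)) / len(target)


def train(documents, epochs=500, lr=0.5, seed=0):
    """REINFORCE-style updates; step size and epoch count are assumptions."""
    rng = random.Random(seed)
    theta = {}
    for _ in range(epochs):
        for words, target in documents:
            # Construct an action sequence by sampling from the current policy.
            step_probs = [action_probs(theta, w) for w in words]
            actions = [rng.choices(ACTIONS, weights=p, k=1)[0] for p in step_probs]
            # "Execute" the actions and observe the reward.
            r = toy_reward(actions, target)
            # grad log p(a | w) = phi(w, a) - sum_a' p(a' | w) * phi(w, a')
            grad = {}
            for w, a, probs in zip(words, actions, step_probs):
                for f, v in features(w, a).items():
                    grad[f] = grad.get(f, 0.0) + v
                for other, p in zip(ACTIONS, probs):
                    for f, v in features(w, other).items():
                        grad[f] = grad.get(f, 0.0) - p * v
            # Move the parameters along the reward-weighted gradient.
            for f, g in grad.items():
                theta[f] = theta.get(f, 0.0) + lr * r * g
    return theta


def greedy_decode(theta, words):
    """Pick the most probable action for each word under the learned policy."""
    decoded = []
    for w in words:
        probs = action_probs(theta, w)
        decoded.append(ACTIONS[probs.index(max(probs))])
    return decoded


# One toy "document": instruction words paired with a hidden target sequence
# that is used only inside the reward function, never as a direct label.
docs = [(["click", "right-click", "type"],
         ["left_click", "right_click", "type_into"])]
theta = train(docs)
print(greedy_decode(theta, docs[0][0]))
```

The learner never sees the target sequence directly; it only receives the scalar reward, which mirrors the setting described above where a reward function, rather than annotated examples, defines the quality of the executed actions. The gradient is accumulated over the whole sequence before the parameters change, so every step is scored under the same policy that sampled it.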


