Scientific report: "Learning to Win by Reading Manuals in a Monte-Carlo Framework"


S.R.K. Branavan, Regina Barzilay
Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology
{branavan, regina}@csail.mit.edu

David Silver
Department of Computer Science
University College London
d.silver@cs.ucl.ac.uk

Abstract

This paper presents a novel approach for leveraging automatically extracted textual knowledge to improve the performance of control applications such as games. Our ultimate goal is to enrich a stochastic player with high-level guidance expressed in text. Our model jointly learns to identify text that is relevant to a given game state in addition to learning game strategies guided by the selected text. Our method operates in the Monte-Carlo search framework and learns both text analysis and game strategies based only on environment feedback. We apply our approach to the complex strategy game Civilization II, using the official game manual as the text guide. Our results show that a linguistically-informed game-playing agent significantly outperforms its language-unaware counterpart, yielding a 27% absolute improvement and winning over 78% of games when playing against the built-in AI of Civilization II.[1]

1 Introduction

In this paper we study the task of grounding linguistic analysis in control applications such as computer games. In these applications, an agent attempts to optimize a utility function (e.g., game score) by learning to select situation-appropriate actions. In complex domains, finding a winning strategy is challenging even for humans. Therefore, human players typically rely on manuals and guides that describe promising tactics and provide general advice about the underlying task. Surprisingly, such textual information has never been utilized in control algorithms despite its potential to greatly improve performance. The natural ...

[1] The code, data and complete experimental setup for this work are available at http://groups.csail.mit.edu/rbg/code/civ.

