TAILIEUCHUNG - Báo cáo khoa học: "Generation that Exploits Corpus-Based Statistical Knowledge"

We describe novel aspects of a new natural language generator called Nitrogen. This generator has a highly flexible input representation that allows a spectrum of input from syntactic to semantic depth, and shifts' the burden of many linguistic decisions to the statistical post-processor. The generation algorithm is compositional, making it efficient, yet it also handles non-compositional aspects of language. Nitrogen's design makes it robust and scalable, operating with lexicons and knowledge bases of one hundred thousand entities. . | Generation that Exploits Corpus-Based Statistical Knowledge Irene Langkiide and Kevin Knight Information Sciences Institute University of Southern California Marina del Rey CA 90292 and Abstract We describe novel aspects of a new natural language generator called Nitrogen. This generator has a highly flexible input representation that allows a spectrum of input from syntactic to semantic depth and shifts the burden of many linguistic decisions to the statistical post-processor. The generation algorithm is compositional making it efficient yet it also handles non-compositional aspects of language. Nitrogen s design makes it robust and scalable operating with lexicons and knowledge bases of one hundred thousand entities. 1 Introduction Language generation is an important subtask of applications like machine translation humancomputer dialogue explanation and summarization. The recurring need for generation suggests the usefulness of a general-purpose domain-independent natural language generator NLG . However plugin generators available today such as FUF SURGE Elhadad and Robin 1998 MUMBLE Meteer et al. 1987 KPML Bateman 1996 and CoGen-Tex s RealPro Lavoie and Rambow 1997 require inputs with a daunting amount of linguistic detail. As a result many client applications resort instead to simpler template-based methods. An important advantage of templates is that they sidestep linguistic decision-making and avoid the need for large complex knowledge resources and processing. For example the following structure could be a typical result from a database query on the type of food a venue serves obj-type venue obj-name Top_of_the_Mark attribute food-type attrib-value American By using a template like obj-name s attribute is attrib-value . the structure could produce the sentence Top of the Mark s food type is American. Templates avoid the need for detailed linguistic information about lexical items part-of-speech tags number gender definiteness

Khánh Hội 76 7 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Bài giảng Next Generation Network : Tổng quan về NGN part 1

5 75 0

Bài giảng Next Generation Network : Tổng quan về NGN part 2

5 94 1

Bài giảng Next Generation Network : Tổng quan về NGN part 3

5 60 0

Bài giảng Next Generation Network : Tổng quan về NGN part 4

4 77 0

Bài giảng Next Generation Network :Cấu trúc NGN part 1

5 73 0

Bài giảng Next Generation Network :Cấu trúc NGN part 2

5 67 0

Bài giảng Next Generation Network :Cấu trúc NGN part 3

5 63 0

Bài giảng Next Generation Network :Cấu trúc NGN part 4

5 76 0

Bài giảng Next Generation Network :Cấu trúc NGN part 5

5 72 0

Bài giảng Next Generation Network :Cấu trúc NGN part 6

5 72 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461867 55

Giới thiệu :Lập trình mã nguồn mở

14 22642 59

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10892 529

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10066 446

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9519 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8281 1125

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8238 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7864 2220

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6686 253

Vật lý hạt cơ bản (1)

29 5770 85

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Giáo án mầm non chương trình đổi mới: Gia đình vui nhộn

4 312 1 27-04-2024

Đánh giá hao mòn và độ tin cậy của chi tiết và kết cấu trên đầu máy diezel part 3

12 305 0 27-04-2024

Magnetic Bearings Theory and Applications phần 2

14 172 0 27-04-2024

Posted prices versus bargaining in markets_7

23 155 0 27-04-2024

Công nghiệp gang thép Việt Nam : Một giai đoạn phát triển và chuyển đổi chính sách mới part 5

6 194 0 27-04-2024

Lịch sử Đội TNTP Hồ Chí Minh - CHƯƠNG III VÂNG LỜI BÁC DẠY, LÀM NGHÌN VIỆC TỐT, CHỐNG MỸ, CỨU NƯỚC, THIẾU NIÊN SĂN SÀNG

45 137 0 27-04-2024

Hướng dẫn sử dụng Quickoffice cho Ipad và Iphone

13 151 0 27-04-2024

Báo cáo tốt nghiệp: Vận hành và bảo dưỡng trong MPLS

92 144 3 27-04-2024

Diseases of the Liver and Biliary System - part 1

33 124 0 27-04-2024

Data Structures and Algorithms - Chapter 9: Hashing

54 113 0 27-04-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7864 2220

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5735 1368

Ebook Chào con ba mẹ đã sẵn sàng

112 3767 1231

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5319 1136

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8281 1125

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3499 643

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10892 529

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3684 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4046 515

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4128 480