TAILIEUCHUNG - Báo cáo khoa học: "How Many Words is a Picture Worth? Automatic Caption Generation for News Images"

In this paper we tackle the problem of automatic caption generation for news images. Our approach leverages the vast resource of pictures available on the web and the fact that many of them are captioned. Inspired by recent work in summarization, we propose extractive and abstractive caption generation models. They both operate over the output of a probabilistic image annotation model that preprocesses the pictures and suggests keywords to describe their content. | How Many Words is a Picture Worth Automatic Caption Generation for News Images YansongFeng and Mirella Lapata School of Informatics University of Edinburgh 10 Crichton Street Edinburgh Eh8 9AB uK mlap@ Abstract In this paper we tackle the problem of automatic caption generation for news images. Our approach leverages the vast resource of pictures available on the web and the fact that many of them are captioned. Inspired by recent work in summarization we propose extractive and abstractive caption generation models. They both operate over the output of a probabilistic image annotation model that preprocesses the pictures and suggests keywords to describe their content. Experimental results show that an abstractive model defined over phrases is superior to extractive methods. 1 Introduction Recent years have witnessed an unprecedented growth in the amount of digital information available on the Internet. Flickr one of the best known photo sharing websites hosts more than three billion images with approximately million images being uploaded every Many on-line news sites like CNN Yahoo and BBC publish images with their stories and even provide photo feeds related to current events. Browsing and finding pictures in large-scale and heterogeneous collections is an important problem that has attracted much interest within information retrieval. Many of the search engines deployed on the web retrieve images without analyzing their content simply by matching user queries against collocated textual information. Examples include meta-data . the image s file name and format user-annotated tags captions and generally text surrounding the image. As this limits the applicability of search engines images that 1http 2008 11 03 three-billion-photos-at-flickr do not coincide with textual data cannot be retrieved a great deal of work has focused on the development of methods that generate description words for a picture

Tâm Ðoan 47 11 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "How Many Words is a Picture Worth? Automatic Caption Generation for News Images"

11 41 0

Báo cáo toán học: " Binary words containing inﬁnitely many overlaps"

10 36 0

Classifying many-class high-dimensional fingerprint datasets using random forest of oblique decision trees

10 59 0

How many words are Australian children hearing in the first year of life?

9 69 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461886 55

Giới thiệu :Lập trình mã nguồn mở

14 22719 61

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10905 530

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10083 447

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9540 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8301 1127

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8248 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7867 2220

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6711 253

Vật lý hạt cơ bản (1)

29 5793 88

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Động cơ đốt trong và máy kéo công nghiêp tập 1 part 7

23 260 0 01-05-2024

Động cơ đốt trong và máy kéo công nghiêp tập 2 part 8

32 262 0 01-05-2024

Mass Transfer in Multiphase Systems and its Applications Part 19

40 258 1 01-05-2024

CẤU TẠO HẠT NHÂN NGUYÊN TỬ-ĐỘ HỤT KHỐI-NĂNG LƯỢNG LIÊN KẾT-LK RIÊNG

12 270 0 01-05-2024

BeginningMac OS X Tiger Dashboard Widget Development 2006 phần 2

34 215 0 01-05-2024

Trading Strategies Profit Making Techniques For Stock_8

23 176 1 01-05-2024

Bơm máy nén quạt trong công nghiệp part 8

20 199 2 01-05-2024

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 147 0 01-05-2024

BÀI GIẢNG VỀ - MẠCH ĐIỆN II - Chương I: Phân tích mạch trong miền thời gian

38 142 0 01-05-2024

Giáo trình tổng quan khoa học thông tin và thư viện part 7

22 145 2 01-05-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7867 2220

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5790 1388

Ebook Chào con ba mẹ đã sẵn sàng

112 3772 1233

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5333 1136

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8301 1127

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3518 644

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10905 530

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3694 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4071 516

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4136 480