Learning Latent Temporal Structure for Complex Event Detection

Kevin Tang, Li Fei-Fei, Daphne Koller
Computer Science Department, Stanford University
{kdtang, feifeili, koller}@

Abstract

In this paper, we tackle the problem of understanding the temporal structure of complex events in highly varying videos obtained from the Internet. Towards this goal, we utilize a conditional model trained in a max-margin framework that is able to automatically discover discriminative and interesting segments of video, while simultaneously achieving competitive accuracies on difficult detection and recognition tasks. We introduce latent variables over the frames of a video, and allow our algorithm to discover and assign sequences of states that are most discriminative for the event. Our model is based on the variable-duration hidden Markov model, and models durations of states in addition to the transitions between states. The simplicity of our model allows us to perform fast, exact inference using dynamic programming, which is extremely important when we set our sights on being able to process a very large number of videos quickly and efficiently. We show promising results on the Olympic Sports dataset [16] and the 2011 TRECVID Multimedia Event Detection task [18]. We also illustrate and visualize the semantic understanding capabilities of our model.
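To make the claim about exact inference concrete, the following is a minimal sketch of dynamic programming over a variable-duration (semi-Markov) state sequence. It is not the authors' implementation: the function name `segment_video`, the three score tables, and the rule that adjacent segments must have different states are all assumptions made for illustration, and the scores are taken as given rather than learned in a max-margin framework.

```python
import math


def segment_video(frame_scores, trans, dur):
    """Return (best_score, segments) for a variable-duration state chain.

    All inputs are hypothetical score tables, assumed to be given:
      frame_scores[t][s]  score of assigning state s to frame t
      trans[p][s]         score of switching from state p to state s
      dur[s][d - 1]       score of state s lasting d frames, d = 1..max_dur
    Adjacent segments are assumed to have different states.
    """
    T, S, max_dur = len(frame_scores), len(trans), len(dur[0])

    # prefix[s][t] = sum of frame_scores[0..t-1][s], so segment sums are O(1).
    prefix = [[0.0] * (T + 1) for _ in range(S)]
    for s in range(S):
        for t in range(T):
            prefix[s][t + 1] = prefix[s][t] + frame_scores[t][s]

    # best[t][s]: best score for frames 0..t-1 whose last segment has state s.
    best = [[-math.inf] * S for _ in range(T + 1)]
    back = [[None] * S for _ in range(T + 1)]

    for t in range(1, T + 1):
        for s in range(S):
            for d in range(1, min(max_dur, t) + 1):
                start = t - d
                seg = prefix[s][t] - prefix[s][start] + dur[s][d - 1]
                if start == 0:
                    cand, prev = seg, None  # first segment, no transition
                else:
                    cand, prev = -math.inf, None
                    for p in range(S):
                        if p == s:
                            continue
                        c = best[start][p] + trans[p][s] + seg
                        if c > cand:
                            cand, prev = c, p
                if cand > best[t][s]:
                    best[t][s] = cand
                    back[t][s] = (start, prev)

    last = max(range(S), key=lambda k: best[T][k])
    if best[T][last] == -math.inf:
        raise ValueError("no valid segmentation for these durations")

    # Walk the back-pointers to recover (start, end, state) segments.
    score, segments, t, s = best[T][last], [], T, last
    while t > 0:
        start, prev = back[t][s]
        segments.append((start, t, s))
        t, s = start, prev
    segments.reverse()
    return score, segments
```

With T frames, S states, and a maximum duration D, this sketch runs in O(T · D · S²) time, which is the kind of polynomial cost that makes exact inference practical over many videos. Each returned segment is a half-open frame range `(start, end, state)`.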
Figure 1. Examples of Internet videos for the event "Grooming an animal" from the TRECVID MED dataset [18], illustrating the variance in video length and temporal localization of the event. Video 3 is the only video similar to the sequences typically seen in activity recognition tasks, where the event occupies the video in full.

1. Introduction

With the advent of Internet video hosting sites such as YouTube, personal Internet videos are now becoming extremely popular. There are numerous challenges associated with understanding these types of videos; we focus on the task of complex event detection. In our problem definition we are ...
