TAILIEUCHUNG - Báo cáo khoa học: "Heterogeneous Transfer Learning for Image Clustering via the Social Web"

In this paper, we present a new learning scenario, heterogeneous transfer learning, which improves learning performance when the data can be in different feature spaces and where no correspondence between data instances in these spaces is provided. In the past, we have classiﬁed Chinese text documents using English training data under the heterogeneous transfer learning framework. In this paper, we present image clustering as an example to illustrate how unsupervised learning can be improved by transferring knowledge from auxiliary heterogeneous data obtained from the social Web. . | Heterogeneous Transfer Learning for Image Clustering via the Social Web Qiang Yang Hong Kong University of Science and Technology Clearway Bay Kowloon Hong Kong qyang@ Yuqiang Chen Gui-Rong Xue Wenyuan Dai Yong Yu Shanghai Jiao Tong University 800 Dongchuan Road Shanghai 200240 China yuqiangchen grxue dwyak yyu @ Abstract In this paper we present a new learning scenario heterogeneous transfer learning which improves learning performance when the data can be in different feature spaces and where no correspondence between data instances in these spaces is provided. In the past we have classified Chinese text documents using English training data under the heterogeneous transfer learning framework. In this paper we present image clustering as an example to illustrate how unsupervised learning can be improved by transferring knowledge from auxiliary heterogeneous data obtained from the social Web. Image clustering is useful for image sense disambiguation in query-based image search but its quality is often low due to imagedata sparsity problem. We extend PLSA to help transfer the knowledge from social Web data which have mixed feature representations. Experiments on image-object clustering and scene clustering tasks show that our approach in heterogeneous transfer learning based on the auxiliary data is indeed effective and promising. 1 Introduction Traditional machine learning relies on the availability of a large amount of data to train a model which is then applied to test data in the same feature space. However labeled data are often scarce and expensive to obtain. Various machine learning strategies have been proposed to address this problem including semi-supervised learning Zhu 2007 domain adaptation Wu and Diet-terich 2004 Blitzer et al. 2006 Blitzer et al. 2007 Arnold et al. 2007 Chan and Ng 2007 Daume 2007 Jiang and Zhai 2007 Reichart and Rappoport 2007 Andreevskaia and Bergler 2008 multi-task learning Caruana 1997 Re-ichart et al. .

Chí Dũng 90 9 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Heterogeneous Transfer Learning for Image Clustering via the Social Web"

9 72 0

Principles of Electrochemistry , Second Edition

1 41 0

Improving MapReduce Performance in Heterogeneous Environments

14 62 0

Magnetic field effect on exciplex-forming organic acceptor/donor system: a powerful tool for understanding the preferential solvation

11 56 0

The TiO2-graphene oxide-Hemin ternary hybrid composite material as an efficient heterogeneous catalyst for the degradation of organic contaminants

9 57 1

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461990 55

Giới thiệu :Lập trình mã nguồn mở

14 23339 68

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11032 533

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10244 453

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9592 106

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8457 1139

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8311 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7903 2239

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6890 257

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 6321 1529

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Giáo án mầm non chương trình đổi mới: Đề tài: Ôn xác định vị trí trên – dưới, trước- sau của đối tượng khác.

8 388 3 02-06-2024

MySQL Database Usage & Administration PHẦN 7

37 173 0 02-06-2024

B2B Content Marketing: 2012 Benchmarks, Budgets & Trends

17 154 0 02-06-2024

Báo cáo tốt nghiệp: Vận hành và bảo dưỡng trong MPLS

92 157 3 02-06-2024

Hệ thống làm lạnh và điều hòa không khí

21 138 0 02-06-2024

Truyện kiếm hiệp - Duy ngã độc tôn phần 5/7

1 108 0 02-06-2024

Giáo trình phân tích phương trình vi phân viết dưới dạng thuật toán đặc tính của hệ thống p1

5 115 0 02-06-2024

Báo cáo nghiên cứu nông nghiệp " Biofertiliser inoculant technology for the growth of rice in Vietnam: Developing technical infrastructure for quality assurance and village production for farmers "

12 102 0 02-06-2024

Báo cáo khoa học: " Principaux critères économiques de gestion des forêts : analyse critique et comparative"

29 99 0 02-06-2024

Hướng dẫn chế độ dinh dưỡng cho người bệnh viêm khớp

5 133 0 02-06-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7903 2239

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 6321 1529

Ebook Chào con ba mẹ đã sẵn sàng

112 3887 1277

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5504 1148

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8457 1139

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3583 658

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3783 570

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11032 533

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4228 527

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4236 483