TAILIEUCHUNG - Lecture Notes in Computer Science- P16

Lecture Notes in Computer Science- P16:This year, we received about 170 submissions to ICWL 2008. There were a total of 52 full papers, representing an acceptance rate of about 30%, plus one invited paper accepted for inclusion in this LNCS proceedings. The authors of these accepted papers | Web Contents Extracting for Web-Based Learning 65 Training Set CCT2006 in CWIRF1. It consists of 1200 piece of content pages. Dataset3 consists of 184 piece of content page collected from SOHU. Experiment 1 Experiment 1 compares time performance of building Block-List and DOM-Tree. Fig. 4 illustrates accumulating time on building Block-List and DOM-Tree for all web pages in Dataset2. Accumulated time difference on building Block-List and DOM-Tree is increasing while more pages are processed. At the end of experiment building Block-List spends about 30 second lesser than building DOM-Tree. It can be concluded that building Block-List need lesser time than building DOM-Tree. Fig. 4. Comparison of time performance Experiment 2 Experiment 2 evaluates validity of using variance and bending of block distribution to distinguish content pages and non-content pages. Firstly we get block distribution of each web page in Dataset1 and Dataset2 and then compute their variance and bending. Experiment uses Naïve Bayes KNN and ADTree provided by weka2 to conduct classification on dataset1. Table 1 shows results of the classification which use Accuracy as criterion. . correctly labeled documents Accuracy ---------------------------------- all documents Data of experiment in Table 1 shows best classification can be derived by using AD-Tree whose accuracy is . Experiments uses Dataset1 as training set to build classifier based on NB ADTree and KNN respectively. Then it uses these classifiers to conduct classification on dataset2. Table 1 shows result of Experiment where ADTree wrongly classify 81 pieces of web pages which have too short main contents. Experiment and also use Dataset1 as training set to build classifier. Then they conduct classification on Dataset2 by separately using variance and bending. Table 1 shows we can get best accuracy by using variance and bending together than do so by separately using one of two features. 1 http .

Cao Nghiệp 37 5 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

45 câu trắc nghiệm Kỹ thuật máy tính

8 164 1

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH50)

7 224 4

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH1)

7 200 10

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH2)

7 167 6

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH3)

7 170 5

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH4)

7 186 6

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH5)

7 181 3

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH6)

7 185 8

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH7)

6 181 5

Đề thi thực hành Kỹ thuật sửa chữa máy tính năm 2012 (Mã đề TH8)

7 170 5

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 461863 55

Giới thiệu :Lập trình mã nguồn mở

14 22634 59

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10884 529

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10064 446

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9518 104

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8278 1125

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8230 423

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7864 2220

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 6674 253

Vật lý hạt cơ bản (1)

29 5769 85

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Động cơ đốt trong và máy kéo công nghiêp tập 2 part 8

32 259 0 26-04-2024

BeginningMac OS X Tiger Dashboard Widget Development 2006 phần 2

34 210 0 26-04-2024

beginning Ubuntu Linux phần 1

34 212 1 26-04-2024

Trading Strategies Profit Making Techniques For Stock_3

23 184 0 26-04-2024

Management and Services Part 1

10 156 0 26-04-2024

The profit magic of stock Timing The Markets_5

22 119 0 26-04-2024

XỬ TRÍ CHẤN THƯƠNG SỌ NÃO KÍN

1 113 1 26-04-2024

Giáo trình tổng quan khoa học thông tin và thư viện part 7

22 143 2 26-04-2024

Khóa luận tốt nghiệp: Giải pháp nâng cao chất lượng phương thức thanh toán tín dụng chứng từ phục vụ xuất nhập khẩu tại ngân hàng Thương mại Việt Nam - Trần Thị Tân

12 117 0 26-04-2024

Hệ thống làm lạnh và điều hòa không khí

21 125 0 26-04-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 7864 2220

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 5718 1364

Ebook Chào con ba mẹ đã sẵn sàng

112 3767 1231

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 5318 1136

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8278 1125

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3498 643

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 10884 529

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3683 525

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4045 514

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4127 480