TAILIEUCHUNG - Scientific report: "Ensemble Methods for Unsupervised WSD"

Samuel Brody, School of Informatics, University of Edinburgh
Roberto Navigli, Dipartimento di Informatica, Università di Roma La Sapienza, navigli@
Mirella Lapata, School of Informatics, University of Edinburgh, mlap@

Abstract

Combination methods are an effective way of improving system performance. This paper examines the benefits of system combination for unsupervised WSD. We investigate several voting- and arbiter-based combination strategies over a diverse pool of unsupervised WSD systems. Our combination methods rely on predominant senses which are derived automatically from raw text. Experiments using the SemCor and Senseval-3 data sets demonstrate that our ensembles yield significantly better results when compared with the state of the art.

1 Introduction

Word sense disambiguation (WSD), the task of identifying the intended meanings (senses) of words in context, holds promise for many NLP applications requiring broad-coverage language understanding. Examples include summarization, question answering, and text simplification. Recent studies have also shown that WSD can benefit machine translation (Vickrey et al., 2005) and information retrieval (Stokoe, 2005).

Given the potential of WSD for many NLP tasks, much work has focused on the computational treatment of sense ambiguity, primarily using data-driven methods. Most accurate WSD systems to date are supervised and rely on the availability of training data, i.e., corpus occurrences of ambiguous words marked up with labels indicating the appropriate sense given the context (see Mihalcea and Edmonds, 2004, and the references therein). A classifier automatically learns disambiguation cues from these hand-labeled examples. Although supervised methods typically achieve better performance than unsupervised alternatives, their applicability is limited to those words for which sense-labeled data exists, and their accuracy is strongly correlated with the amount of ...
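As a rough illustration of the voting-based combination mentioned in the abstract, the sketch below applies plain majority voting to the sense labels proposed by several systems for one word. It is only a minimal example under stated assumptions, not the specific strategies evaluated in the paper: the function name `majority_vote`, the sense labels, and the tie-breaking rule (the earliest label in the input wins) are all hypothetical.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the sense label proposed by the most systems.

    `predictions` holds one label per system. Ties are broken in favor
    of the label that appears first in the list (an assumption made
    for this sketch only).
    """
    if not predictions:
        raise ValueError("need at least one prediction")
    counts = Counter(predictions)
    best = max(counts.values())
    # Scan in input order so that tie-breaking is deterministic.
    for label in predictions:
        if counts[label] == best:
            return label


# Hypothetical outputs of three unsupervised systems for the word "bank".
system_outputs = ["bank%finance", "bank%river", "bank%finance"]
chosen = majority_vote(system_outputs)
```

Here `chosen` is the label that two of the three hypothetical systems agree on. An arbiter-based scheme would instead hand disagreements to a separate system for the final decision, which this sketch does not attempt.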
