TAILIEUCHUNG - Scientific report: "Proceedings of EACL '99"

Proceedings of EACL '99

Replicability for Manual Word Sense Tagging

Adam Kilgarriff
ITRI, University of Brighton, Lewes Road, Brighton, UK
email: adam@

People have been writing programs for automatic Word Sense Disambiguation (WSD) for forty years now, yet the validity of the task has remained in doubt. At a first pass, the task is simply defined: a word like bank can mean 'river bank' or 'money bank', and the task is to determine which of these applies in a context in which the word bank appears. The problems arise because most sense distinctions are not as clear as the distinction between 'river bank' and 'money bank', so it is not always straightforward for a person to say what the correct answer is. Thus we do not always know what it would mean to say that a computer program got the right answer. The issue is discussed in detail by Gale et al. (1992), who identify the problem as one of identifying the upper bound for the performance of a WSD program. If people can only agree on the correct answer x% of the time, a claim that a program achieves more than x% accuracy is hard to interpret, and x% is the upper bound for what the program can meaningfully achieve.

There have been some discussions as to what this upper bound might be. Gale et al. review a psycholinguistic study (Jorgensen 1990) in which the level of agreement averaged 68%. But an upper bound of 68% is disastrous for the enterprise, since it implies that the best a program could possibly do is still not remotely good enough for any practical purpose. Even worse news comes from Ng and Lee (1996), who re-tagged parts of the manually tagged SEMCOR corpus (Fellbaum 1998). The taggings matched only 57% of the time. If these represent as high a level of inter-tagger agreement as one could ever expect, WSD is a doomed enterprise. However, neither study set out to identify an upper bound for WSD, and it is far from ideal to use their results in this way. In this paper we report on a study which did aim specifically at achieving as ...
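The agreement figures quoted above are simple match rates between two sets of manual taggings. As a minimal sketch of that arithmetic (not the procedure used in any of the cited studies), the following Python computes the fraction of instances on which two taggers chose the same sense. The function name `observed_agreement` and the sense labels are hypothetical, and the tag lists are invented purely for illustration.

```python
def observed_agreement(tags_a, tags_b):
    """Return the fraction of instances on which two taggers chose the same sense."""
    if len(tags_a) != len(tags_b):
        raise ValueError("both taggers must label the same instances")
    if not tags_a:
        raise ValueError("need at least one tagged instance")
    # Count the positions where the two sense tags are identical.
    matches = sum(a == b for a, b in zip(tags_a, tags_b))
    return matches / len(tags_a)


# Invented sense tags for five occurrences of "bank" (illustrative data only).
tagger_1 = ["river", "money", "money", "river", "money"]
tagger_2 = ["river", "money", "river", "river", "money"]

agreement = observed_agreement(tagger_1, tagger_2)
print(f"agreement = {agreement:.0%}")  # prints "agreement = 80%"
```

Under the argument above, a WSD program scored against either of these taggings could not meaningfully claim more than 80% accuracy, since the taggers themselves disagree on the remaining instance.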
