TAILIEUCHUNG - Báo cáo khoa học: "SEXTANT: EXPLORING UNEXPLORED CONTEXTS FOR SEMANTIC EXTRACTION FROM SYNTACTIC ANALYSIS"

For a very long time, it has been considered that the only way of automatically extracting similar groups of words from a text collection for which no semantic information exists is to use docum e n t co-occurrence data. But, with robust syntactic parsers that are becoming more frequently available, syntactically recognizable p h e n o m e n a about word usage can be confidently noted in large collections of texts. | SEXTANT EXPLORING UNEXPLORED CONTEXTS FOR SEMANTIC EXTRACTION FROM SYNTACTIC ANALYSIS Gregory Grefenstette Computer Science Department University of Pittsburgh Pittsburgh PA 15260 grefen@cs .pit t .edu Abstract For a very long time it has been considered that the only way of automatically extracting similar groups of words from a text collection for which no semantic information exists is to use document co-occurrence data. But with robust syntactic parsers that are becoming more frequently available syntactically recognizable phenomena about word usage can be confidently noted in large collections of texts. We present here a new system called SEXTANT which uses these parsers and the finer-grained contexts they produce to judge word similarity. BACKGROUND Many machine-based approaches to term similarity such as found in TRUMP Jacobs and Zemick 1988 and FERRET Mauldin 1991 can be characterized as knowledge-rich in that they presuppose that known lexical items possess Conceptual Dependence CD -like descriptions. Such an approach necessitates a great amount of manual encoding of semantic information and suffers from the drawbacks of cost in terms of initial coding coherence checking maintenance after modifications and costs derivable from a host of other software engineering concern of domain dependence a semantic structure developed for one domain would not be applicable to another. For example sugar would have very different semantic relations in a medical domain than in a commodities exchange domain and of rigidity even within well-established domain new subdomains spring up . AIDS. Can hand-coded systems keep up with new discoveries and new relations with an acceptable latency In the Information Retrieval community researchers have consistently considered that the linguistic apparatus required for effective domain-independent analysis is not yet at hand and have concentrated on counting document co-occurrence statistics Peat and Willet 1991 based on the idea .

Minh Phượng 83 3 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "SEXTANT: EXPLORING UNEXPLORED CONTEXTS FOR SEMANTIC EXTRACTION FROM SYNTACTIC ANALYSIS"

3 60 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462386 61

Giới thiệu :Lập trình mã nguồn mở

14 27348 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11389 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10589 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9871 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8914 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8539 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8114 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 8079 1836

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7326 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 232 4 24-01-2025

báo cáo hóa học:" Increased androgen receptor expression in serous carcinoma of the ovary is associated with an improved survival"

6 165 3 24-01-2025

BÀI GIẢNG Biến Đổi Năng Lượng Điện Cơ - TS. Hồ Phạm Huy

137 167 1 24-01-2025

Valve Selection Handbook - Fourth Edition

337 151 2 24-01-2025

ETHICAL CODE HANDBOOK: Demonstrate your commitment to high standards

7 156 1 24-01-2025

ĐỀ TÀI " ĐÁNH GIÁ HIỆU QUẢ HOẠT ĐỘNG KINH DOANH NGOẠI HỐI CỦA NGÂN HÀNG THƯƠNG MẠI CỔ PHẦN XUẤT NHẬP KHẨU VIỆT NAM "

51 160 3 24-01-2025

Word Games with English 1

65 149 1 24-01-2025

Báo cáo nghiên cứu khoa học " NÂNG QUAN HỆ KINH TẾ THƯƠNG MẠI VIỆT NAM - TRUNG QUỐC LÊN TẦM CAO THỜI ĐẠI "

8 179 1 24-01-2025

Chủ đề 3 : SỰ CÂN BẰNG CỦA VẬT RẮN (4 tiết)

9 222 1 24-01-2025

CUỘC KHÁNG CHIẾN CHỐNG THỰC DÂN PHÁP KẾT THÚC (1953 - 1954)_5

11 155 1 24-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8114 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 8079 1836

Ebook Chào con ba mẹ đã sẵn sàng

112 4475 1381

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6464 1285

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8914 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3886 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3934 616

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4834 568

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11389 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4551 490