TAILIEUCHUNG - Báo cáo khoa học: "Finding Word Substitutions Using a Distributional Similarity Baseline and Immediate Context Overlap"

This paper deals with the task of ﬁnding generally applicable substitutions for a given input term. We show that the output of a distributional similarity system baseline can be ﬁltered to obtain terms that are not simply similar but frequently substitutable. Our ﬁlter relies on the fact that when two terms are in a common entailment relation, it should be possible to substitute one for the other in their most frequent surface contexts. Using the Google 5-gram corpus to ﬁnd such characteristic contexts, we show that for the given task, our ﬁlter improves the precision of a distributional similarity. | Finding Word Substitutions Using a Distributional Similarity Baseline and Immediate Context Overlap Aurelie Herbelot University of Cambridge Computer Laboratory . Thompson Avenue Cambridge ah433@ Abstract This paper deals with the task of finding generally applicable substitutions for a given input term. We show that the output of a distributional similarity system baseline can be filtered to obtain terms that are not simply similar but frequently substitutable. Our filter relies on the fact that when two terms are in a common entailment relation it should be possible to substitute one for the other in their most frequent surface contexts. Using the Google 5-gram corpus to find such characteristic contexts we show that for the given task our filter improves the precision of a distributional similarity system from 41 to 56 on a test set comprising common transitive verbs. 1 Introduction This paper looks at the task of finding word substitutions for simple statements in the context of KB querying. Let us assume that we have a knowledge base made of statements of the type subject - verb - object 1. Bank of America - acquire - Merrill Lynch 2. Lloyd s-buy-HBOS 3. Iceland - nationalise - Kaupthing Let us also assume a simple querying facility where the user can enter a word and be presented with all statements containing that word in a typical search engine fashion. If we want to return all acquisition events present in the knowledge base above as opposed to nationalisation events we might search for acquire . This will return the first statement about the acquisition of Merrill Lynch but not the second statement about HBOS. Ideally we would like a system able to generate words similar to our query so that a statement containing the verb buy gets returned when we search for acquire . This problem is closely related to the clustering of semantically similar terms which has received much attention in the literature. Systems that perform such clustering usually

Duy Cẩn 44 9 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Word Epoch Disambiguation: Finding How Words Change Over Time"

5 72 0

Báo cáo khoa học: "Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity"

8 48 0

Báo cáo khoa học: "Finding Predominant Word Senses in Untagged Text"

8 55 0

Báo cáo khoa học: "Toward Evaluation of Writing Style: Finding Overly Repetitive Word Use in Student Essays"

8 59 0

Báo cáo khoa học: "Finding Word Substitutions Using a Distributional Similarity Baseline and Immediate Context Overlap"

9 33 0

Graduation Thesis Computer Science: Finding the semantic similarity in Vietnamese

55 94 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462351 61

Giới thiệu :Lập trình mã nguồn mở

14 26588 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11375 543

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10565 468

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9854 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8518 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7875 1810

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7286 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

THE ANTHROPOLOGY OF ONLINE COMMUNITIES BY Samuel M.Wilson and Leighton C. Peterson

19 231 4 07-01-2025

CHƯƠNG 2: RỦI RO THÂM HỤT TÀI KHÓA

28 165 1 07-01-2025

Giáo án điện tử tiểu học môn lịch sử: Cách mạng mùa thu

39 168 1 07-01-2025

Sử dụng mô hình ARCH và GARCH để phân tích và dự báo về giá cổ phiếu trên thị trường chứng khoán

24 1077 2 07-01-2025

Đề tài " Dự báo về tác động của Tổ chức Thương mại Thế giới WTO đối với các doanh nghiệp xuất khẩu vừa và nhỏ Việt Nam – Những giải pháp đề xuất "

72 193 2 07-01-2025

Báo cáo " Thẩm quyền quản lí nhà nước đối với hoạt động quảng cáo thực trạng và hướng hoàn thiện "

7 216 7 07-01-2025

Báo cáo nghiên cứu khoa học " Sự nhất quán phát triển kinh tế thị trường XHCN trong xây dựng xã hội hài hoà của Trung Quốc và đổi mới của Việt Nam "

8 151 1 07-01-2025

CUỘC KHÁNG CHIẾN CHỐNG THỰC DÂN PHÁP KẾT THÚC (1953 - 1954)_5

11 153 1 07-01-2025

Báo cáo lâm nghiệp: "Assessment of the effects of below-zero temperatures on photosynthesis and chlorophyll a fluorescence in leaf discs of Eucalyptus globulu"

4 152 0 07-01-2025

Phạm trù Chủ nghĩa cá nhân của tư tưởng phương Tây trong sự lý giải của Phan Khôi _1

9 138 0 07-01-2025

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8109 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7875 1810

Ebook Chào con ba mẹ đã sẵn sàng

112 4432 1376

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6346 1276

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8906 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3858 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3930 610

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4768 567

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11375 543

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4533 490