TAILIEUCHUNG - Báo cáo khoa học: "An Error Analysis of Relation Extraction in Social Media Documents"

The annotated mentions in the Corpus are single or multi-word expressions which refer to a particular real world or abstract entity. The mentions are annotated to indicate sets of mentions which constitute co-reference groups referring to the same entity. Five relationships are annotated between these entities: PartOf, FeatureOf, Produces, InstanceOf, and MemberOf. One signiﬁcant difference between these relation annotations and those in the ACE Corpus is that the former are relations between sets of mentions (the co-reference groups) rather than between individual mentions | An Error Analysis of Relation Extraction in Social Media Documents Gregory Ichneumon Brown University of Colorado at Boulder Boulder Colorado browngp@ Abstract Relation extraction in documents allows the detection of how entities being discussed in a document are related to one another . part-of . This paper presents an analysis of a relation extraction system based on prior work but applied to the . Power and Associates Sentiment Corpus to examine how the system works on documents from a range of social media. The results are examined on three different subsets of the JDPA Corpus showing that the system performs much worse on documents from certain sources. The proposed explanation is that the features used are more appropriate to text with strong editorial standards than the informal writing style of blogs. 1 Introduction To summarize accurately determine the sentiment or answer questions about a document it is often necessary to be able to determine the relationships between entities being discussed in the document such as part-of or member-of . In the simple sentiment example Example I bought a new car yesterday. I love the powerful engine. determining the sentiment the author is expressing about the car requires knowing that the engine is a part of the car so that the positive sentiment being expressed about the engine can also be attributed to the car. In this paper we examine our preliminary results from applying a relation extraction system to the 64 . Power and Associates JDPA Sentiment Corpus Kessler et al. 2010 . Our system uses lexical features from prior work to classify relations and we examine how the system works on different subsets from the JDPA Sentiment Corpus breaking the source documents down into professionally written reviews blog reviews and social networking reviews. These three document types represent quite different writing styles and we see significant difference in how the relation extraction system performs .

Phú Hải 61 5 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Ebook Data reduction and error analysis for the physical sciences (3rd edition): Part 1

155 67 0

Ebook Data reduction and error analysis for the physical sciences (3rd edition): Part 2

183 58 0

Comparative analysis of LDPC and BCH codes error-correcting capabilities

5 81 0

A classification of electrical component failures and their human error types in South Korean NPPs during last 10 years

10 69 0

A novel measure and significance testing in data analysis of cell image segmentation

13 28 1

Software Error Detection through Testing and Analysis

271 47 0

Báo cáo khoa học: "A Graphical Interface for MT Evaluation and Error Analysis"

6 50 0

Báo cáo khoa học: "An Error Analysis of Relation Extraction in Social Media Documents"

5 45 0

Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output"

6 52 0

Báo cáo khoa học: "Using Generation for Grammar Analysis and Error Detection"

4 39 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462337 61

Giới thiệu :Lập trình mã nguồn mở

14 25975 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11341 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10546 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9838 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8889 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8502 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8100 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7727 1790

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7245 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Báo cáo nghiên cứu khoa học " KẾT QUẢ NGHIÊN CỨU BƯỚC ĐẦU VỀ THIÊN ĐỊCH CHÂN KHỚP TRÊN CÂY THANH TRÀ Ở THỪA THIÊN HUẾ "

7 276 4 25-12-2024

báo cáo hóa học:" Increased androgen receptor expression in serous carcinoma of the ovary is associated with an improved survival"

6 156 3 25-12-2024

Đề tài " Dự báo về tác động của Tổ chức Thương mại Thế giới WTO đối với các doanh nghiệp xuất khẩu vừa và nhỏ Việt Nam – Những giải pháp đề xuất "

72 184 2 25-12-2024

Báo cáo " Bàn về hành vi pháp luật và hành vi đạo đức "

11 177 2 25-12-2024

ETHICAL CODE HANDBOOK: Demonstrate your commitment to high standards

7 147 1 25-12-2024

Báo cáo nghiên cứu khoa học " Vai trò chính quyền địa phương trong phát triển kinh tế : khu chuyên doanh gốm sứ ( Trung Quốc ) và Bát Tràng ( Việt Nam )("

11 213 1 25-12-2024

Báo cáo nghiên cứu khoa học " Sự nhất quán phát triển kinh tế thị trường XHCN trong xây dựng xã hội hài hoà của Trung Quốc và đổi mới của Việt Nam "

8 144 1 25-12-2024

Chủ đề 3 : SỰ CÂN BẰNG CỦA VẬT RẮN (4 tiết)

9 207 1 25-12-2024

5 thói quen ăn uống hủy hoại hàm răng đẹp

5 167 1 25-12-2024

Determini prounoun 1

6 139 0 25-12-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8100 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7727 1790

Ebook Chào con ba mẹ đã sẵn sàng

112 4406 1371

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6281 1266

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8889 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3837 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3919 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4705 565

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11341 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4504 490