Scientific report: "Learning Document-Level Semantic Properties from Free-text Annotations"



S.R.K. Branavan, Harr Chen, Jacob Eisenstein, Regina Barzilay
Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology
{branavan, harr, jacobe, regina}@csail.mit.edu

Abstract

This paper demonstrates a new method for leveraging free-text annotations to infer semantic properties of documents. Free-text annotations are becoming increasingly abundant, due to the recent dramatic growth in semistructured, user-generated online content. An example of such content is product reviews, which are often annotated by their authors with pros/cons keyphrases such as “a real bargain” or “good value.” To exploit such noisy annotations, we simultaneously find a hidden paraphrase structure of the keyphrases, a model of the document texts, and the underlying semantic properties that link the two. This allows us to predict properties of unannotated documents. Our approach is implemented as a hierarchical Bayesian model with joint inference, which increases the robustness of the keyphrase clustering and encourages the document model to correlate with semantically meaningful properties. We perform several evaluations of our model and find that it substantially outperforms alternative approaches.

1 Introduction

A central problem in language understanding is transforming raw text into structured representations. Learning-based approaches have dramatically increased the scope and robustness of this type of automatic language processing, but they are typically dependent on large expert-annotated datasets, which are costly to produce. In this paper, we show how novice-generated free-text annotations available online can be leveraged to automatically infer document-level semantic properties.

With the rapid increase of online content created by end users, noisy free-text annotations have

[Figure: example pros/cons keyphrases from reviews, including “great nutritional value”, “an amazing product”, and “quick and friendly service”.]

