Đang chuẩn bị liên kết để tải về tài liệu:
Báo cáo khoa học: "Using Search-Logs to Improve Query Tagging"

Như Mai 47 5 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

Syntactic analysis of search queries is important for a variety of information-retrieval tasks; however, the lack of annotated data makes training query analysis models difﬁcult. We propose a simple, efﬁcient procedure in which part-of-speech tags are transferred from retrieval-result snippets to queries at training time. | Using Search-Logs to Improve Query Tagging Kuzman Ganchev Keith Hall Ryan McDonald Slav Petrov Google Inc. kuzman kbhall ryanmcd slav @google.com Abstract Syntactic analysis of search queries is important for a variety of information-retrieval tasks however the lack of annotated data makes training query analysis models difficult. We propose a simple efficient procedure in which part-of-speech tags are transferred from retrieval-result snippets to queries at training time. Unlike previous work our final model does not require any additional resources at run-time. Compared to a state-of-the-art approach we achieve more than 20 relative error reduction. Additionally we annotate a corpus of search queries with part-of-speech tags providing a resource for future work on syntactic query analysis. 1 Introduction Syntactic analysis of search queries is important for a variety of tasks including better query refinement improved matching and better ad targeting Barr et al. 2008 . However search queries differ substantially from traditional forms of written language e.g. no capitalization few function words fairly free word order etc. and are therefore difficult to process with natural language processing tools trained on standard corpora Barr et al. 2008 . In this paper we focus on part-of-speech POS tagging queries entered into commercial search engines and compare different strategies for learning from search logs. The search logs consist of user queries and relevant search results retrieved by a search engine. We use a supervised POS tagger to label the result snippets and then transfer the tags to the queries producing a set of noisy labeled queries. These labeled queries are then added to the training data and 238 the tagger is retrained. We evaluate different strategies for selecting which annotation to transfer and find that using the result that was clicked by the user gives comparable performance to using just the top result or to aggregating over the top-k .

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: " Using the reduced La(Co,Cu)O3 nanoperovskites as catalyst precursors for CO hydrogenation"

báo cáo khoa học: " Improving benchmarking by using an explicit framework for the development of composite indicators: an example using pediatric quality of care"

Báo cáo y học: "Improving benchmarking by using an explicit framework for the development of composite indicators: an example using pediatric quality of care"

Báo cáo y học: "The effectiveness of hand-disinfection by a flow water system using electrolytic products of sodium chloride, compared with a conventional method using alcoholic solution in an"

BÁO CÁO NGHIÊN CỨU KHOA HỌC KỸ THUẬT: 75 USING IN VITRO PROPAGATION TO PRESERVE Glyptostrobus pensilis (Staunton ex.)

Báo cáo khoa học: "Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation"

Báo cáo khoa học: "Historical Change in Language Using Monte Carlo Techniques"

Báo cáo khoa học: "Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia"

Báo cáo khoa học: "Classifying French Verbs Using French and English Lexical Resources"

Báo cáo khoa học: "Text Segmentation by Language Using Minimum Description Length"

Đã phát hiện trình chặn quảng cáo AdBlock

Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.