TAILIEUCHUNG - Báo cáo khoa học: "Resolving Personal Names in Email Using Context Expansion"

This paper describes a computational approach to resolving the true referent of a named mention of a person in the body of an email. A generative model of mention generation is used to guide mention resolution. Results on three relatively small collections indicate that the accuracy of this approach compares favorably to the best known techniques, and results on the full CMU Enron collection indicate that it scales well to larger collections. | Resolving Personal Names in Email Using Context Expansion Tamer Elsayed Douglas W. Oard and Galileo Namata Human Language Technology Center of Excellence and UMIACS Laboratory for Computational Linguistics and Information Processing CLIP University of Maryland College Park MD 20742 telsayed oard gnamata @ Abstract This paper describes a computational approach to resolving the true referent of a named mention of a person in the body of an email. A generative model of mention generation is used to guide mention resolution. Results on three relatively small collections indicate that the accuracy of this approach compares favorably to the best known techniques and results on the full CMU Enron collection indicate that it scales well to larger collections. 1 Introduction The increasing prevalence of informal text from which a dialog structure can be reconstructed . email or instant messaging raises new challenges if we are to help users make sense of this cacophony. Large collections offer greater scope for assembling evidence to help with that task but they pose additional challenges as well. With well over 100 000 unique email addresses in the CMU version of the Enron collection Klimt and Yang 2004 common names . John might easily refer to any one of several hundred people. In this paper we associate named mentions in unstructured text . the body of an email and or the subject line to modeled identities. We see at least two direct applications for this work 1 helping searchers who are unfamiliar with the contents of an email collection . historians or lawyers better understand the context of emails that they find and 2 augmenting more typical social networks based on senders and recipients with additional links based on references found in unstructured text. Most approaches to resolving identity can be decomposed into four sub-problems 1 finding a reference that requires resolution 2 identifying candidates 3 assembling evidence and 4 choosing .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.