TAILIEUCHUNG - Báo cáo khoa học: "A Method for Relating Multiple Newspaper Articles by Using Graphs, and Its Application to Webcasting"

This paper describes methods for relating (threading) multiple newspaper articles, and for visualizing various characteristics of them by using a directed graph. A set of articles is represented by a set of word vectors, and the similarity between the vectors is then calculated. The graph is constructed from the similarity matrix. By applying some constraints on the chronological ordering of articles, an efficient threading algorithm that runs in O(n) time (where n is the number of articles) is obtained. . | A Method for Relating Multiple Newspaper Articles by Using Graphs and Its Application to Webcasting Naohiko Uramoto and Koichi Takeda IBM Research Tokyo Research Laboratory 1623-14 Shimo-tsuruma Yamato-shi Kanagawa-ken 242 Japan uramoto takeda @ Abstract This paper describes methods for relating threading multiple newspaper articles and for visualizing various characteristics of them by using a directed graph. A set of articles is represented by a set of word vectors and the similarity between the vectors is then calculated. The graph is constructed from the similarity matrix. By applying some constraints on the chronological ordering of articles an efficient threading algorithm that runs in 0 n time where n is the number of articles is obtained. The constructed graph is visualized with words that represent the topics of the threads and words that represent new information in each article. The threading technique is suitable for Webcasting push applications. A threading server determines relationships among articles from various news sources and creates files containing their threading information. This information is represented in extended Markup Language XML and can be visualized on most Web browsers. The XML-based representation and a current prototype are described in this paper. 1 Introduction The vast quantity of information available today makes it difficult to search for and understand the information that we want. If there are many related documents about a topic it is important to capture their relationships so that we can obtain a clearer overview. However most information resources including newspaper articles do not have explicit relationships. For example although documents on the Web are connected by hyperlinks relationships cannot be specified. Webcasting push applications such as PointCast 1 constitute a promising solution to the problem of information overloading but the articles they provide do not have links or else must be .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.