TAILIEUCHUNG - Báo cáo khoa học: "Trimming CFG Parse Trees for Sentence Compression Using Machine Learning Approaches"

Sentence compression is a task of creating a short grammatical sentence by removing extraneous words or phrases from an original sentence while preserving its meaning. Existing methods learn statistics on trimming context-free grammar (CFG) rules. However, these methods sometimes eliminate the original meaning by incorrectly removing important parts of sentences, because trimming probabilities only depend on parents’ and daughters’ non-terminals in applied CFG rules. We apply a maximum entropy model to the above method. . | Trimming CFG Parse Trees for Sentence Compression Using Machine Learning Approaches Yuya Unno1 Takashi Ninomiya2 Yusuke Miyao1 Jun ichi Tsujii134 department of Computer Science University of Tokyo information Technology Center University of Tokyo 3School of Informatics University of Manchester 4SORST JST Hongo 7-3-1 Bunkyo-ku Tokyo Japan unno yusuke tsujii @ ninomi@ Abstract Sentence compression is a task of creating a short grammatical sentence by removing extraneous words or phrases from an original sentence while preserving its meaning. Existing methods learn statistics on trimming context-free grammar CFG rules. However these methods sometimes eliminate the original meaning by incorrectly removing important parts of sentences because trimming probabilities only depend on parents and daughters non-terminals in applied CFG rules. We apply a maximum entropy model to the above method. Our method can easily include various features for example other parts of a parse tree or words the sentences contain. We evaluated the method using manually compressed sentences and human judgments. We found that our method produced more grammatical and informative compressed sentences than other methods. 1 Introduction In most automatic summarization approaches text is summarized by extracting sentences from a given document without modifying the sentences themselves. Although these methods have been significantly improved to extract good sentences as summaries they are not intended to shorten sentences . the output often has redundant words or phrases. These methods cannot be used to make a shorter sentence from an input sentence or for other applications such as generating headline news Dorr et al. 2003 or messages for the small screens of mobile devices. We need to compress sentences to obtain short and useful summaries. This task is called sentence compression. While several methods have been proposed for sentence compression Witbrock .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.