TAILIEUCHUNG - Báo cáo khoa học: "Speeding Up Full Syntactic Parsing by Leveraging Partial Parsing Decisions"

Parsing is a computationally intensive task due to the combinatorial explosion seen in chart parsing algorithms that explore possible parse trees. In this paper, we propose a method to limit the combinatorial explosion by restricting the CYK chart parsing algorithm based on the output of a chunk parser. When tested on the three parsers presented in (Collins, 1999), we observed an approximate three–fold speedup with only an average decrease of in both precision and recall. | Speeding Up Full Syntactic Parsing by Leveraging Partial Parsing Decisions Elliot Glaysher and Dan Moldovan Language Computer Corporation 1701 N. Collins Blvd. Suite 2000 Richardson TX 75080 eglaysher moldovan @ Abstract Parsing is a computationally intensive task due to the combinatorial explosion seen in chart parsing algorithms that explore possible parse trees. In this paper we propose a method to limit the combinatorial explosion by restricting the CYK chart parsing algorithm based on the output of a chunk parser. When tested on the three parsers presented in Collins 1999 we observed an approximate three-fold speedup with only an average decrease of in both precision and recall. 1 Introduction Motivation Syntactic parsing is a computationally intensive and slow task. The cost of parsing quickly becomes prohibitively expensive as the amount of text to parse grows. Even worse syntactic parsing is a prerequisite for many natural language processing tasks. These costs make it impossible to work with large collections of documents in any reasonable amount of time. We started looking into methods and improvements that would speed up syntactic parsing. These are divided into simple software engineering solutions which are only touched on briefly and an optimization to the CYK parsing algorithm which is the main topic of this paper. While we made large speed gains through simple software engineering improvements such as internal symbolization optimizing critical areas optimization of the training data format et cetera the largest individual gain in speed was made by modifying the CYK parsing algorithm to leverage the decisions of a syntactic chunk parser so that it avoided combinations that conflicted with the output of the chunk parser. Previous Work Chart parsing is a method of building a parse tree that systematically explores combinations based on a set of grammatical rules while using a chart to store partial results. The general

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.