Scientific report: "Effects of Noun Phrase Bracketing in Dependency Parsing and Machine Translation"

Nathan Green
Charles University in Prague
Institute of Formal and Applied Linguistics
Faculty of Mathematics and Physics
green@

Abstract

Flat noun phrase structure was, up until recently, the standard in annotation for the Penn Treebanks. With the recent addition of internal noun phrase annotation, dependency parsing and applications down the NLP pipeline are likely affected. Some machine translation systems, such as TectoMT, use deep syntax as a language transfer layer. It is proposed that changes to the noun phrase dependency parse will have a cascading effect down the NLP pipeline and, in the end, improve machine translation output, even with a reduction in parser accuracy that the noun phrase structure might cause. This paper examines this noun phrase structure's effect on dependency parsing in English with a maximum spanning tree parser and shows a BLEU score improvement for English-to-Czech machine translation.

1 Introduction

Until recently, noun phrase structure in the Penn Treebank was, due to underspecification, treated only as a flat structure. Thanks to the annotation work of Vadas and Curran (2007a, 2007b, 2008), we are now able to create Natural Language Processing (NLP) systems that take advantage of the internal structure of noun phrases in the Penn Treebank.
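To make the contrast between flat and bracketed noun phrases concrete, the following minimal sketch encodes one sentence as two lists of dependency heads and compares them. The example sentence, the head assignments, and the helper names `unlabeled_attachment_score` and `changed_attachments` are illustrative assumptions and are not taken from the paper. The sketch also assumes that a flat noun phrase is converted to dependencies by attaching every modifier to the head noun.

```python
# Illustrative assumption: heads are 1-based token positions, 0 marks the root.
tokens = ["crude", "oil", "prices", "rose"]

# Flat noun phrase: every modifier attaches directly to the head noun "prices".
flat_heads = [3, 3, 4, 0]

# Bracketed noun phrase ((crude oil) prices): "crude" now modifies "oil".
bracketed_heads = [2, 3, 4, 0]


def unlabeled_attachment_score(gold, predicted):
    """Return the fraction of tokens whose predicted head equals the gold head."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    correct = sum(g == p for g, p in zip(gold, predicted))
    return correct / len(gold)


def changed_attachments(words, before, after):
    """List (word, old head word, new head word) for every head that differs."""

    def head_word(head):
        return "ROOT" if head == 0 else words[head - 1]

    return [
        (words[i], head_word(b), head_word(a))
        for i, (b, a) in enumerate(zip(before, after))
        if b != a
    ]


if __name__ == "__main__":
    # Score the flat analysis against the bracketed one, treated here as gold.
    print(unlabeled_attachment_score(bracketed_heads, flat_heads))
    print(changed_attachments(tokens, flat_heads, bracketed_heads))
```

In this toy case only one attachment differs, yet a parser trained on flat structure would be penalized for it when scored against bracketed gold data. This is one way a richer annotation could lower measured accuracy while still providing structure that a downstream transfer layer can use.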
This extra internal structure introduces additional complications in NLP applications such as parsing. Dependency parsing has been a prime focus of NLP research of late due to its ability to help parse languages with a free word order. Dependency parsing has been shown to improve NLP systems in certain languages and in many cases is considered the state of the art in the field. Dependency parsing made many improvements due to the CoNLL-X shared task (Buchholz and Marsi, 2006). However, in most cases, these systems were trained with a flat noun phrase structure in the Penn Treebank. Vadas internal noun
