TAILIEUCHUNG - Báo cáo khoa học: "Towards robust multi-tool tagging. An OWL/DL-based approach"

This paper describes a series of experiments to test the hypothesis that the parallel application of multiple NLP tools and the integration of their results improves the correctness and robustness of the resulting analysis. It is shown how annotations created by seven NLP tools are mapped onto toolindependent descriptions that are defined with reference to an ontology of linguistic annotations, and how a majority vote and ontological consistency constraints can be used to integrate multiple alternative analyses of the same token in a consistent way. . | Towards robust multi-tool tagging. An OWL DL-based approach Christian Chiarcos University of Potsdam Germany chiarcos@ Abstract This paper describes a series of experiments to test the hypothesis that the parallel application of multiple NLP tools and the integration of their results improves the correctness and robustness of the resulting analysis. It is shown how annotations created by seven NLP tools are mapped onto toolindependent descriptions that are defined with reference to an ontology of linguistic annotations and how a majority vote and ontological consistency constraints can be used to integrate multiple alternative analyses of the same token in a consistent way. For morphosyntactic parts of speech and morphological annotations of three German corpora the resulting merged sets of ontological descriptions are evaluated in comparison to ontological representation of existing reference annotations. 1 Motivation and overview NLP systems for higher-level operations or complex annotations often integrate redundant modules that provide alternative analyses for the same linguistic phenomenon in order to benefit from their respective strengths and to compensate for their respective weaknesses . in parsing Crys-mann et al. 2002 or in machine translation Carl et al. 2000 . The current trend to parallel and distributed NLP architectures Aschenbrenner et al. 2006 Gietz et al. 2006 Egner et al. 2007 Luis and de Matos 2009 opens the possibility of exploring the potential of redundant parallel annotations also for lower levels of linguistic analysis. This paper evaluates the potential benefits of such an approach with respect to morphosyntax parts of speech pos and morphology in German In comparison to English German shows a rich and polysemous morphology and a considerable number of NLP tools are available making it a promising candidate for such an experiment. Previous research indicates that the integration of multiple part of speech taggers leads to

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.