Scientific report: "PDTB-style Discourse Annotation of Chinese Text"

PDTB-style Discourse Annotation of Chinese Text

Yuping Zhou
Computer Science Department, Brandeis University, Waltham, MA 02452
yzhou@

Nianwen Xue
Computer Science Department, Brandeis University, Waltham, MA 02452
xuen@

Abstract

We describe a discourse annotation scheme for Chinese and report on the preliminary results. Our scheme, inspired by the Penn Discourse TreeBank (PDTB), adopts the lexically grounded approach; at the same time, it makes adaptations based on the linguistic and statistical characteristics of Chinese text. Annotation results show that these adaptations work well in practice. Our scheme, taken together with other PDTB-style schemes (e.g., for English, Turkish, Hindi, and Czech), affords a broader perspective on how the generalized lexically grounded approach can flesh itself out in the context of cross-linguistic annotation of discourse relations.

1 Introduction

In the realm of discourse annotation, the Penn Discourse TreeBank (PDTB) (Prasad et al., 2008) separates itself by adopting a lexically grounded approach: discourse relations are lexically anchored by discourse connectives (e.g., because, but, therefore), which are viewed as predicates that take abstract objects such as propositions, events, and states as their arguments. In the absence of explicit discourse connectives, the PDTB asks the annotator to fill in a discourse connective that best describes the discourse relation between these two sentences, instead of selecting from an inventory of predefined discourse relations. By keeping the discourse annotation lexically grounded even in the case of implicit discourse relations, the PDTB appeals to the annotator's judgment at an intuitive level.

This is in contrast with an approach in which the set of discourse relations is pre-determined by linguistic experts and the role of the annotator is just to select from those choices (Mann and Thompson, 1988; Carlson et al., 2003). This lexically grounded approach led to consistent and reliable discourse ...
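
To make the lexically grounded idea concrete, the following is a minimal Python sketch of a connective treated as a predicate over two arguments, with the explicit and implicit cases handled as the introduction describes. The names `DiscourseRelation` and `annotate`, the field layout, and the example sentences are all hypothetical illustrations; they are not taken from the PDTB or from the scheme described here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DiscourseRelation:
    """Hypothetical record: a connective acting as a predicate over two arguments."""
    connective: str  # e.g. "because", "but", "therefore"
    arg1: str        # text span of one abstract object (proposition, event, state)
    arg2: str        # text span of the other abstract object
    kind: str        # "explicit" if the connective occurs in the text, else "implicit"


def annotate(arg1: str, arg2: str,
             connective_in_text: Optional[str] = None,
             inserted_connective: Optional[str] = None) -> DiscourseRelation:
    """Build a relation that is anchored by a connective in either case."""
    if connective_in_text is not None:
        # Explicit case: the relation is anchored by a word found in the text.
        return DiscourseRelation(connective_in_text, arg1, arg2, "explicit")
    if inserted_connective is None:
        # Implicit case requires the annotator to supply a connective;
        # there is no fallback to a predefined relation label.
        raise ValueError("an implicit relation needs an annotator-supplied connective")
    return DiscourseRelation(inserted_connective, arg1, arg2, "implicit")


# Made-up example sentences, for illustration only.
explicit = annotate("The game was cancelled", "it rained",
                    connective_in_text="because")
implicit = annotate("It rained all day.", "The game was cancelled.",
                    inserted_connective="so")
```

In this sketch both kinds of relation end up with a connective string rather than a label drawn from a fixed inventory, which is the property the lexically grounded approach relies on.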
