TAILIEUCHUNG - Báo cáo khoa học: "A Seed-driven Bottom-up Machine Learning Framework for Extracting Relations of Various Complexity"

A minimally supervised machine learning framework is described for extracting relations of various complexity. Bootstrapping starts from a small set of n-ary relation instances as “seeds”, in order to automatically learn pattern rules from parsed data, which then can extract new instances of the relation and its projections. We propose a novel rule representation enabling the composition of n-ary relation rules on top of the rules for projections of the relation. | A Seed-driven Bottom-up Machine Learning Framework for Extracting Relations of Various Complexity Feiyu Xu Hans Uszkoreit and Hong Li Language Technology Lab DFKI GmbH Stuhlsatzenhausweg 3 D-66123 Saarbruecken feiyu uszkoreit hongli @ Abstract A minimally supervised machine learning framework is described for extracting relations of various complexity. Bootstrapping starts from a small set of n-ary relation instances as seeds in order to automatically learn pattern rules from parsed data which then can extract new instances of the relation and its projections. We propose a novel rule representation enabling the composition of n-ary relation rules on top of the rules for projections of the relation. The compositional approach to rule construction is supported by a bottom-up pattern extraction method. In comparison to other automatic approaches our rules cannot only localize relation arguments but also assign their exact target argument roles. The method is evaluated in two tasks the extraction of Nobel Prize awards and management succession events. Performance for the new Nobel Prize task is strong. For the management succession task the results compare favorably with those of existing pattern acquisition approaches. 1 Introduction Information extraction IE has the task to discover n-tuples of relevant items entities belonging to an n-ary relation in natural language documents. One of the central goals of the ACE program1 is to develop a more systematically grounded approach to IE starting from elementary entities binary rela 1 http ace tions to n-ary relations such as events. Current semi- or unsupervised approaches to automatic pattern acquisition are either limited to a certain linguistic representation . subject-verb-object or only deal with binary relations or cannot assign slot filler roles to the extracted arguments or do not have good selection and filtering methods to handle the large number of tree patterns Riloff 1996 .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.