TAILIEUCHUNG - Báo cáo khoa học: "DICTIONARIES, DICTIONARY GRAMMARS AND DICTIONARY ENTRY PARSING"

Computerist: . But, great Scott, what about structure? You can't just bang that lot into a machine without structure. Half a gigabyte of sequential file . Lexicographer: Oh, we know all about structure. Take this entry for example. You see here italics as the typical ambiguous structural element marker, being apparently used as an undefined phrase-entry lemrna, but in fact being the subordinate entry headword address preceding the small-cap cross-reference headword address which is nested within the gloss to a defined phrase entry, itself nested within a subordinate (bold lower-case letter) sense section in the second branch of a. | DICTIONARIES DICTIONARY GRAMMARS AND DICTIONARY ENTRY PARSING _ _ _ . _ Mary s. Neff . IBM T. J. Watson Research Center p. o. Box 704 Yorktown Heights New York 10598 _ . _ Branimir K. Boguraev IBM T. J. Watson Research Center p. o. Box 704 Yorktown Heights New York 10598- Computer Laboratory University of Cambridge New Museums Site Cambridge CB2 3QG Computerist . But great Scott what about structure You can t just bang that lot into a machine without structure. Half a gigabyte of sequential file . Lexicographer Oh we know all about structure. Take this entry for example. You see here italics as the typical ambiguous structural element marker being apparendy used as an undefined phrase-entry lemma but in fact being the subordinate entry headword address preceding the small-cap cross-reference headword address which is nested within the gloss to a defined phrase entry itself nested within a subordinate bold lower-case letter sense section in the second branch of a forked multiple part of speech main entry. Now that s typical of the kind of structural relationship that must be made crystal-clear in the eventual database. from Taking the Words out of His Mouth Edmund Weiner on computerising the Oxford English Dictionary The Guardian London March 1985 ABSTRACT We identify two complementary processes in the conversion of machine-readable dictionaries into lexical databases recoveiy of the dictionary structure from the typograpnical markings which persist on the dictionary distribution tapes and embody the publishers notational conventions followed by making explicit all of the codified and ellided information packed into individual entries. We discuss notational conventions and tape formats outline structural properties of dictionaries observe a range of representational phenomena particularly relevant to dictionary parsing and derive a set of minimal requirements for a dictionary grammar formalism. We present a general purpose dictionary entry parser which uses a formal

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.