TAILIEUCHUNG - Báo cáo khoa học: "The OpenGrm open-source finite-state grammar software libraries"

In this paper, we present a new collection of open-source software libraries that provides command line binary utilities and library classes and functions for compiling regular expression and context-sensitive rewrite rules into finite-state transducers, and for n-gram language modeling. The OpenGrm libraries use the OpenFst library to provide an efficient encoding of grammars and general algorithms for building, modifying and applying models. | The OpenGrm open-source finite-state grammar software libraries Brian Roark Richard Sproat Cyril Allauzen0 Michael Riley0 Jeffrey Sorensen0 Terry Tai Oregon Health Science University Portland Oregon Google Inc. New York Abstract In this paper we present a new collection of open-source software libraries that provides command line binary utilities and library classes and functions for compiling regular expression and context-sensitive rewrite rules into finite-state transducers and for n-gram language modeling. The OpenGrm libraries use the OpenFst library to provide an efficient encoding of grammars and general algorithms for building modifying and applying models. 1 Introduction The OpenGrm libraries1 are a growing collection of open-source software libraries for building and applying various kinds of formal grammars. The C libraries use the OpenFst library2 for the underlying finite-state representation which allows for easy inspection of the resulting grammars and models as well as straightforward combination with other finite-state transducers. Like OpenFst there are easy-to-use command line binaries for frequently used operations as well as a C library interface allowing library users to create their own algorithms from the basic classes and functions provided. The libraries can be used for a range of common string processing tasks such as text normalization as well as for building and using large statistical models for applications like speech recognition. In the rest of the paper we will present each of the two libraries starting with the Thrax grammar compiler and then the NGram library. First though we will briefly present some preliminary informal background on weighted finite-state transducers WFST just as needed for this paper. 1http 2http 61 2 Informal WFST preliminaries A weighted finite-state transducer consists of a set of states and transitions between states. There is an initial state and a subset of states are .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.