Scientific report: "An Interface for Rapid Natural Language Processing Development in UIMA"

Balaji R. Soundrarajan, Thomas Ginter, Scott L. DuVall
VA Salt Lake City Health Care System and University of Utah

Abstract

This demonstration presents the Annotation Librarian, an application programming interface that supports rapid development of natural language processing (NLP) projects built in Apache Unstructured Information Management Architecture (UIMA). The flexibility of UIMA to support all types of unstructured data (images, audio, and text) increases the complexity of some of the most common NLP development tasks. The Annotation Librarian interface handles these common functions and allows the creation and management of annotations by mirroring Java methods used to manipulate Strings. The familiar syntax and NLP-centric design allow developers to adopt the interface and rapidly develop NLP algorithms in UIMA. The general functionality of the interface is described in relation to the use cases that necessitated its creation.

1 Introduction

In the days when public libraries were the center of information exchange, the job of the librarian was to serve as an interface between the complex library system and the average user. The librarian made it possible for one to access specific sources of information without memorizing the Dewey Decimal System or flipping through the card catalog.
Analogous to the great librarians of yesteryear, the Annotation Librarian serves the average Java developer in the creation and management of annotations within natural language processing (NLP) projects built using the open source Apache Unstructured Information Management Architecture (UIMA).

Many NLP tasks are performed in processing steps that build upon one another. Systems designed in this fashion are called pipelines because text is processed and then passed from one step to the next, like water flowing through a pipe. Each step in the pipeline adds structured data on
