TAILIEUCHUNG - Báo cáo khoa học: "A HARDWARE ALGORITHM FOR HIGH SPEED MORPHEME EXTRACTION AND ITS IMPLEMENTATION"

This paper describes a new hardware algorithm for morpheme extraction and its implementation on a specific machine (MEX-I), as the first step toward achieving natural language parsing accelerators. It also shows the machine's performance, 100-1,000 times faster than a personal computer. This machine can extract morphemes from 10,000 character Japanese text by searching an 80,000 morpheme dictionary in I second. It can treat multiple text streams, which are composed of character candidates, as well as one text stream. The algorithm is implemented on the machine in linear time for the number of candidates, while conventional sequential algorithms are implemented. | A HARDWARE ALGORITHM FOR HIGH SPEED MORPHEME EXTRACTION AND ITS IMPLEMENTATION Toshikazu Fukushima Yutaka Ohyama and Hitoshi Miyai CfcC Systems Research Laboratories NEC Corporation 1-1 Miyazaki 4-chome Miyamae-ku Kawasaki City Kanagawa 213 Japan fuku@ ohyMna@ miya@ ABSTRACT This paper describes a new hardware algorithm for morpheme extraction and its implementation on a specific machine MEX-I as the first step toward achieving natural language parsing accelerators. It also shows the machine s performance 100-1 000 times faster than a personal computer. This machine can extract morphemes from 10 000 character Japanese text by searching an 80 000 morpheme dictionary ìn 1 second. It can treat multiple text streams which are composed of character candidates as well as one text stream. The algorithm is implemented on the machine in linear time for the number of candidates while conventional sequential algorithms are implemented in combinational time. 1 INTRODUCTION Recent advancement in natural language parsing technology has especially extended the word processor market and the machine translation system market. For further market extension or new market creation for natural language applications parsing speed-up as well as improving parsing accuracy is required. First thè parsing speed-up directly reduces system response time required in such interactive natural language application systems as those using natural language interface speech recognition KanSrto-Kanji 1 conversion which is the most popular Japanese text input method and so on. Second it also increases the advantage of such applications as machine translation document proofreading automatic indexing and so on which are used to treat a large amount of documents. Third it realizes parsing methods based on larger scale dictionary or knowledge database which are necessary to improve parsing accuracy. Until now in the natural language processing field the speed-up

Minh Lý 67 8 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

A Hardware implementation of Winograd Fourier Transform algorithm for Cryptography

7 66 0

Báo cáo khoa học: "A HARDWARE ALGORITHM FOR HIGH SPEED MORPHEME EXTRACTION AND ITS IMPLEMENTATION"

8 52 0

Efﬁcient Pattern Matching Algorithm for Memory Architecture

9 42 0

A Review of Modular Multiplication Methods and Respective Hardware Implementations

20 48 0

Parallel Implementation of MAFFT on CUDA-Enabled Graphics Hardware

14 63 0

Báo cáo hóa học: "Research Article Hardware Implementation of a Spline-Based Genetic Algorithm for Embedded Stereo Vision Sensor Providing Real-Time Visual Guidance to the Visually Impaired"

10 37 0

Báo cáo hóa học: " Research Article Hardware Implementation of a Modiﬁed Delay-Coordinate Mapping-Based QRS Complex Detection Algorithm"

13 42 0

Báo cáo hóa học: " Segmentation algorithm via Cellular Neural/ Nonlinear Network: implementation on Bioinspired hardware platform"

11 33 0

Báo cáo hóa học: " Research Article A Near-Lossless Image Compression Algorithm Suitable for Hardware Design in Wireless Endoscopy System"

13 41 0

Báo cáo hóa học: " Design of Low-Cost FPGA Hardware for Real-time ICA-Based Blind Source Separation Algorithm"

11 34 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462282 61

Giới thiệu :Lập trình mã nguồn mở

14 24826 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11280 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10506 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9784 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8461 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8089 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7463 1763

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7184 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Đóng mới oto 8 chỗ ngồi part 9

10 171 3 22-11-2024

Giáo trình phân tích phương trình vi phân viết dưới dạng thuật toán đặc tính của hệ thống p1

5 149 1 22-11-2024

báo cáo hóa học:" Perceptions of rewards among volunteer caregivers of people living with AIDS working in faith-based organizations in South Africa: a qualitative study"

10 146 1 22-11-2024

BÀI GIẢNG Biến Đổi Năng Lượng Điện Cơ - TS. Hồ Phạm Huy

137 146 1 22-11-2024

báo cáo hóa học:" Quality of data collection in a large HIV observational clinic database in sub-Saharan Africa: implications for clinical research and audit of care"

7 146 4 22-11-2024

CHƯƠNG 2: RỦI RO THÂM HỤT TÀI KHÓA

28 152 1 22-11-2024

Sử dụng mô hình ARCH và GARCH để phân tích và dự báo về giá cổ phiếu trên thị trường chứng khoán

24 1064 2 22-11-2024

Báo cáo " Bàn về hành vi pháp luật và hành vi đạo đức "

11 169 2 22-11-2024

báo cáo khoa học: "Malignant peripheral nerve sheath tumor arising from the greater omentum: Case report"

4 135 1 22-11-2024

Báo cáo nghiên cứu khoa học " NÂNG QUAN HỆ KINH TẾ THƯƠNG MẠI VIỆT NAM - TRUNG QUỐC LÊN TẦM CAO THỜI ĐẠI "

8 158 1 22-11-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8089 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7463 1763

Ebook Chào con ba mẹ đã sẵn sàng

112 4364 1369

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6147 1258

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8876 1160

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3785 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3909 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4613 562

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11280 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4445 490