Scientific report: "Practical Glossing by Prioritised Tiling"

Victor Poznanski, Pete Whitelock, Jan IJdens, Steffan Corley
Sharp Laboratories of Europe Ltd.
Oxford Science Park, Oxford OX4 4GA, United Kingdom
vp pete jan steffan @

Abstract

We present the design of a practical context-sensitive glosser, incorporating current techniques for lightweight linguistic analysis based on large-scale lexical resources. We outline a general model for ranking the possible translations of the words and expressions that make up a text. This information can be used by a simple resource-bounded algorithm, of complexity O(n log n) in sentence length, that determines a consistent gloss of best translations. We then describe how the results of the general ranking model may be approximated using a simple heuristic prioritisation scheme. Finally, we present a preliminary evaluation of the glosser's performance.

1 Introduction

In a lexicalist MT framework such as Shake-and-Bake (Whitelock, 1994), translation equivalence is defined between collections of suitably constrained lexical material in the two languages. Such an approach has been shown to be effective in the description of many types of complex bilingual equivalence. However, the complexity of the associated parsing and generation phases leaves a system of this type some way from commercial exploitation. The parsing phase that is needed to establish adequate constraints on the words is of cubic complexity, while the most general generation algorithm needed to order the words in the target text is O(n^4) (Poznanski et al., 1996).

In this paper we show how a novel application domain, glossing, can be explored within such a framework by omitting generation entirely and replacing syntactic parsing by a simple combination of morphological analysis and tagging. The poverty of constraints established in this way, and the consequent inaccuracy in translation, is mitigated by providing a menu of alternatives for each gloss. The gloss is automatically updated in the light
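The excerpt does not show the tiling algorithm itself, so the following is only a minimal sketch of one plausible reading of "prioritised tiling": candidate translations are treated as tiles over spans of source tokens, sorted by priority, and accepted greedily whenever they do not overlap a tile already chosen. The `Tile` class, its fields, the `prioritised_tiling` function and the example scores are all hypothetical and are not taken from the paper. Under the assumptions that the number of candidate tiles grows linearly with sentence length and that each tile covers a bounded number of tokens, the sort dominates and the sketch runs in O(n log n).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Tile:
    """A candidate gloss covering source tokens [start, end). Hypothetical structure."""
    start: int       # index of the first token covered (inclusive)
    end: int         # index one past the last token covered (exclusive)
    gloss: str       # proposed translation for the covered span
    priority: float  # higher means preferred; assumed to come from a ranking model


def prioritised_tiling(tiles: List[Tile], n_tokens: int) -> List[Tile]:
    """Greedily pick non-overlapping tiles in descending priority order."""
    covered = [False] * n_tokens
    chosen: List[Tile] = []
    # Sorting is the O(m log m) step, where m is the number of candidate tiles.
    for tile in sorted(tiles, key=lambda t: t.priority, reverse=True):
        # Accept a tile only if none of its tokens is already glossed.
        if not any(covered[tile.start:tile.end]):
            for i in range(tile.start, tile.end):
                covered[i] = True
            chosen.append(tile)
    # Return the gloss in source order for display.
    return sorted(chosen, key=lambda t: t.start)


# Illustrative data only: a multi-word expression outranks its single-word parts.
candidates = [
    Tile(0, 1, "kick", 1.0),
    Tile(1, 2, "the", 1.0),
    Tile(2, 3, "bucket", 1.0),
    Tile(0, 3, "die", 3.0),
    Tile(3, 4, "today", 1.0),
]
glosses = [t.gloss for t in prioritised_tiling(candidates, n_tokens=4)]
```

In this toy input the three-token tile has the highest priority, so it is placed first and the single-word tiles it overlaps are rejected. The greedy choice keeps the gloss consistent (no token is glossed twice) without searching over all combinations, which is the kind of resource-bounded behaviour the abstract describes.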
