TAILIEUCHUNG - Báo cáo khoa học: "Domain Adaptation of Maximum Entropy Language Models"

We investigate a recently proposed Bayesian adaptation method for building style-adapted maximum entropy language models for speech recognition, given a large corpus of written language data and a small corpus of speech transcripts. Experiments show that the method consistently outperforms linear interpolation which is typically used in such cases. | Domain Adaptation of Maximum Entropy Language Models Tanel Alumae Adaptive Informatics Research Centre School of Science and Technology Aalto University Helsinki Finland tanel@ Abstract We investigate a recently proposed Bayesian adaptation method for building style-adapted maximum entropy language models for speech recognition given a large corpus of written language data and a small corpus of speech transcripts. Experiments show that the method consistently outperforms linear interpolation which is typically used in such cases. 1 Introduction In large vocabulary speech recognition a language model LM is typically estimated from large amounts of written text data. However recognition is typically applied to speech that is stylistically different from written language. For example in an often-tried setting speech recognition is applied to broadcast news that includes introductory segments conversations and spontaneous interviews. To decrease the mismatch between training and test data often a small amount of speech data is human-transcribed. A LM is then built by interpolating the models estimated from large corpus of written language and the small corpus of transcribed data. However in practice different models might be of different importance depending on the word context. Global interpolation doesn t take such variability into account and all predictions are weighted across models identically regardless of the context. In this paper we investigate a recently proposed Bayesian adaptation approach Daume III 2007 Finkel and Manning 2009 for adapting a conditional maximum entropy ME LM Rosenfeld 1996 to a new domain given a large corpus of out-of-domain training data and a small corpus of in-domain data. The main contribution of this Currently with Tallinn University of Technology Estonia Mikko Kurimo Adaptive Informatics Research Centre School of Science and Technology Aalto University Helsinki Finland paper is that we show how the .

Diễm Lệ 81 6 pdf

Upload

Bấm vào đây để xem trước nội dung

Tải xuống

TÀI LIỆU LIÊN QUAN

Báo cáo khoa học: "Domain Adaptation by Constraining Inter-Domain Variability of Latent Feature Representation"

10 58 0

Báo cáo khoa học: "Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation"

5 58 0

Báo cáo khoa học: "Information-theoretic Multi-view Domain Adaptation"

5 34 0

Báo cáo khoa học: "Domain Adaptation of Maximum Entropy Language Models"

6 54 0

Báo cáo khoa học: "Domain Adaptation for Machine Translation by Mining Unseen Words"

6 62 0

Báo cáo khoa học: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation"

8 59 0

Báo cáo khoa học: "Frustratingly Easy Domain Adaptation"

8 58 0

Báo cáo khoa học: "Instance Weighting for Domain Adaptation in NLP"

8 62 0

Báo cáo khoa học: "Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classiﬁcation"

8 43 0

Báo cáo khoa học: "Self-Training for Enhancement and Domain Adaptation of Statistical Parsers Trained on Small Datasets"

8 43 0

TÀI LIỆU XEM NHIỀU

Một Case Về Hematology (1)

8 462336 61

Giới thiệu :Lập trình mã nguồn mở

14 25915 79

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11335 542

Câu hỏi và đáp án bài tập tình huống Quản trị học

14 10543 466

Phân tích và làm rõ ý kiến sau: “Bài thơ Tự tình II vừa nói lên bi kịch duyên phận vừa cho thấy khát vọng sống, khát vọng hạnh phúc của Hồ Xuân Hương”

3 9835 108

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8885 1161

Tiểu luận: Nội dung tư tưởng Hồ Chí Minh về đạo đức

16 8499 426

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8098 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7709 1788

Đề tài: Dự án kinh doanh thời trang quần áo nữ

17 7240 268

TỪ KHÓA LIÊN QUAN

TÀI LIỆU MỚI ĐĂNG

Giáo án mầm non chương trình đổi mới: Gia đình vui nhộn

4 391 3 23-12-2024

B2B Content Marketing: 2012 Benchmarks, Budgets & Trends

17 228 3 23-12-2024

Báo cáo nghiên cứu nông nghiệp " Biofertiliser inoculant technology for the growth of rice in Vietnam: Developing technical infrastructure for quality assurance and village production for farmers "

12 144 2 23-12-2024

BÀI GIẢNG Biến Đổi Năng Lượng Điện Cơ - TS. Hồ Phạm Huy

137 157 1 23-12-2024

Báo cáo " Bàn về hành vi pháp luật và hành vi đạo đức "

11 177 2 23-12-2024

ĐỀ TÀI " ĐÁNH GIÁ HIỆU QUẢ HOẠT ĐỘNG KINH DOANH NGOẠI HỐI CỦA NGÂN HÀNG THƯƠNG MẠI CỔ PHẦN XUẤT NHẬP KHẨU VIỆT NAM "

51 149 3 23-12-2024

Word Games with English 1

65 137 1 23-12-2024

Lập trình Java cơ bản : Luồng và xử lý file part 8

5 140 1 23-12-2024

Báo cáo lâm nghiệp: "Assessment of the effects of below-zero temperatures on photosynthesis and chlorophyll a fluorescence in leaf discs of Eucalyptus globulu"

4 140 0 23-12-2024

TRẮC NGHIỆM - CÁC BỆNH THIẾU DINH DƯỠNG THƯỜNG GẶP

32 208 2 23-12-2024

TÀI LIỆU HOT

Mẫu đơn thông tin ứng viên ngân hàng VIB

8 8098 2279

Giáo trình Tư tưởng Hồ Chí Minh - Mạch Quang Thắng (Dành cho bậc ĐH - Không chuyên ngành Lý luận chính trị)

152 7709 1788

Ebook Chào con ba mẹ đã sẵn sàng

112 4406 1371

Ebook Tuyển tập đề bài và bài văn nghị luận xã hội: Phần 1

62 6273 1266

Ebook Facts and Figures – Basic reading practice: Phần 1 – Đặng Tuấn Anh (Dịch)

249 8885 1161

Giáo trình Văn hóa kinh doanh - PGS.TS. Dương Thị Liễu

561 3835 680

Giáo trình Sinh lí học trẻ em: Phần 1 - TS Lê Thanh Vân

122 3917 609

Giáo trình Pháp luật đại cương: Phần 1 - NXB ĐH Sư Phạm

274 4700 565

Tiểu luận: Tư tưởng Hồ Chí Minh về xây dựng nhà nước trong sạch vững mạnh

13 11335 542

Bài tập nhóm quản lý dự án: Dự án xây dựng quán cafe

35 4501 490