TAILIEUCHUNG - Báo cáo khoa học: "Tree-based Analysis of Simple Recurrent Network Learning"

Using this approach, an SRN. trained to the phonotactics of a Dutch monosyllabic corpus containing 4500 words, was reported to distinguish words from non-words with 7 % error, Since the phonotactics of a given language is represented by the constraints allowing a given sequence to be a word or not, and the SRN managed to distinguish words from random strings with tolerable error, the authors claim that SRNs are able to learn the phonotactics of Dutch language. | Tree-based Analysis of Simple Recurrent Network Learning Ivelin Stoianov Dept. Alfa-Informatica Faculty of Arts Groningen University POBox 716 9700 AS Groningen The Netherlands Email stoianov@ 1 Simple recurrent networks for natural language phonotactics analysis. In searching for a connectionist paradigm capable of natural language processing many researchers have explored the Simple Recurrent Network SRN such as Elman 1990 Cleermance 1993 Reilly 1995 and Lawrence I996 . SRNs have a context layer that keeps track of the past hidden neuron activations and enables them to deal with sequential data. The events in Natural Language span time so SRNs are needed to deal with them. Among the various levels of language processing a phonological level can be distinguished. The Phonology deals with phonemes or graphemes - the latter in the case when one works with orthographic word representations. The principles governing the combinations of these symbols is called phonotactics Laver 1994 . It is a good starting point for connectionist language analysis because there are not too many basic entities. The number of the symbols varies between 26 for the Latin graphemes and 50 for the phonemes . Recently some experiments considering phonotactics modelling with SRNs have been carried out by Stoianov 1997 Rodd 1997 . The neural network in Stoianov 1997 was trained to study the phonotactics of a large Dutch word corpus. This problem was implemented as an SRN learning task -to predict the symbol following the left context given to the input layer so far. Words were applied to the network symbol by symbol which in turn were encoded orthogonally that is one node standing for one symbol Fig. 1 . An extra symbol was used as a delimiter. After the ưaining the network responded to the input with different neuron activations at the output layer. The more active a given output neuron is the higher the probability is that it is a successor. The authors used a so-called optimal .

TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.