TAILIEUCHUNG - báo cáo hóa học: " Text localization using standard deviation analysis of structure elements and support vector machines"

Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: Text localization using standard deviation analysis of structure elements and support vector machines | Zagoris et al. EURASIP Journal on Advances in Signal Processing 2011 2011 47 http content 2011 1 47 o EURASIP Journal on Advances in Signal Processing a SpringerOpen Journal RESEARCH Open Access Text localization using standard deviation analysis of structure elements and support vector machines Konstantinos Zagoris Savvas A Chatzichristofis and Nikos Papamarkos Abstract A text localization technique is required to successfully exploit document images such as technical articles and letters. The proposed method detects and extracts text areas from document images. Initially a connected components analysis technique detects blocks of foreground objects. Then a descriptor that consists of a set of suitable document structure elements is extracted from the blocks. This is achieved by incorporating an algorithm called Standard Deviation Analysis of Structure Elements SDASE which maximizes the separability between the blocks. Another feature of the SDASE is that its length adapts according to the requirements of the application. Finally the descriptor of each block is used as input to a trained support vector machines that classify the block as text or not. The proposed technique is also capable of adjusting to the text structure of the documents. Experimental results on benchmarking databases demonstrate the effectiveness of the proposed method. 1 Introduction The present electronic age produces vast quantities of many digital document images such as technical articles business letters and faxes. In order to effectively exploit them by many systems such as optical character recognition Word Spotting 1 2 and Document Retrieval Systems the contained text must be located by a detection technique. The research community is engaged on an ongoing attempt to address this problem by using a variety of approaches. There are top-down techniques employing recursive algorithms to segment the whole page to small regions. The subdivision is based on a .

TÀI LIỆU LIÊN QUAN
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.