TAILIEUCHUNG - Báo cáo khoa học: "Structuring E-Commerce Inventory"

Large e-commerce enterprises feature millions of items entered daily by a large variety of sellers. While some sellers provide rich, structured descriptions of their items, a vast majority of them provide unstructured natural language descriptions. In the paper we present a 2 steps method for structuring items into descriptive properties. | Structuring E-Commerce Inventory Karin Mauge eBay Research Labs 2145 Hamilton Avenue San Jose CA 95125 kmauge@ Khash Rohanimanesh eBay Research Labs 2145 Hamilton Avenue San Jose CA 95125 krohanimanesh@ Jean-David Ruvini eBay Research Labs 2145 Hamilton Avenue San Jose CA 95125 jruvini@ Abstract Large e-commerce enterprises feature millions of items entered daily by a large variety of sellers. While some sellers provide rich structured descriptions of their items a vast majority of them provide unstructured natural language descriptions. In the paper we present a 2 steps method for structuring items into descriptive properties. The first step consists in unsupervised property discovery and extraction. The second step involves supervised property synonym discovery using a maximum entropy based clustering algorithm. We evaluate our method on a year worth of ecommerce data and show that it achieves excellent precision with good recall. 1 Introduction Online commerce has gained a lot of popularity over the past decade. Large on-line C2C marketplaces like eBay and Amazon feature a very large and long-tail inventory with millions of items product offers entered into the marketplace every day by a large variety of sellers. While some sellers generally large professional ones provide rich structured description of their products using schemas or via a global trade item number the vast majority only provide unstructured natural language descriptions. To manage items effectively and provide the best user experience it is critical for these marketplaces to structure their inventory into descriptive namevalue pairs called properties and ensure that items of the same kind digital cameras for instance are described using a unique set of property names 805 brand model zoom resolution etc. and values. For example this is important for measuring item similarity and complementarity in merchandising providing faceted navigation and various business .

TỪ KHÓA LIÊN QUAN
TAILIEUCHUNG - Chia sẻ tài liệu không giới hạn
Địa chỉ : 444 Hoang Hoa Tham, Hanoi, Viet Nam
Website : tailieuchung.com
Email : tailieuchung20@gmail.com
Tailieuchung.com là thư viện tài liệu trực tuyến, nơi chia sẽ trao đổi hàng triệu tài liệu như luận văn đồ án, sách, giáo trình, đề thi.
Chúng tôi không chịu trách nhiệm liên quan đến các vấn đề bản quyền nội dung tài liệu được thành viên tự nguyện đăng tải lên, nếu phát hiện thấy tài liệu xấu hoặc tài liệu có bản quyền xin hãy email cho chúng tôi.
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.