TAILIEUCHUNG - Enhance preprocessing technique distinct user identification using web log usage data
In this paper a complete preprocessing methodology having data cleaning, Enhanced preprocessing technique one of the User Identification which is key issue in preprocessing technique phase is to identify the web users. Traditional User Identification is based on the site structure by using some heuristic rules. | ISSN:2249-5789 Sheetal A Raiyani et al , International Journal of Computer Science & Communication Networks,Vol 2(4), 526-530 Enhance Preprocessing Technique Distinct User Identification using Web Log Usage data Sheetal A. Raiyani1, Shailendra Jain2 Dept. of CSE(SS),TIT,Bhopal1, Dept. of CSE,TIT,Bhopal2 , shailendrajein78@ Abstract Millions of visitors interact daily with web sites around the world. Huge amount of data are being generated and these information could be very prized to the company in the field of understanding Customer’s behaviors. In this paper a complete preprocessing methodology having data cleaning, Enhanced preprocessing technique one of the User Identification which is key issue in preprocessing technique phase is to identify the web users. Traditional User Identification is based on the site structure by using some heuristic rules. In most cases relationship between pages are based on the site topology which reduced the efficiency of identification solve this problem we introduced proposed Technique DUI (Distinct User Identification) based on IP address ,Agent ,Referred pages on desired session time. Which can be used in counter terrorism, fraud detection and detection of unusual access of secure data, as well as through detection of frequent access behavior improve the overall designing and performance of future access. Experiments have proved that advanced data preprocessing technique can enhanced the quality of data preprocessing results. Keywords: server log, Web preprocssing, user identification usage pattern. It has quickly become one of the most important areas in Computer and Information Sciences because of its direct applications in e-commerce, CRM, Web analytics, information retrieval and filtering, and Web information systems. According to the differences of the mining objects, there are roughly three knowledge discovery domains that pertain to web mining: Web Content Mining, Web .
đang nạp các trang xem trước