WEB USAGE MINING
Monu Chaudhary
071BCT522
INTRODUCTION
Web Usage mining is the process of
applying data mining techniques for the
discovery of usage patterns from Web data,
targeted towards various applications.
INTRODUCTION
Data collected at different levels:
➢ Server level
➢ Client level
➢ Proxy level
INTRODUCTION
Goal:
➢ analyze the behavioral patterns and
profiles of users interacting with a Web
site
➢ Understand and better serve the needs
of Web-based applications
INTRODUCTION
Classification based on Usage Data:
➢ Web server Data
➢ Application Server Data
➢ Application Level Data
INTRODUCTION
Importance:
➢ Growth of e-commerce
○ Provides a cost-effective way of
doing business.
➢ Hidden useful information
○ Visitors’ profile
○ Measure online marketing effort
INTRODUCTION
3 Phases:
➢ Preprocessing
➢ Pattern Discovery
➢ Pattern Analysis
PREPROCESSING
Preprocessing consists of converting the:
➢ usage information
➢ content information
➢ structure information
contained in the various available data
sources into the data abstractions necessary
for pattern discovery.
Web Usage Mining Process
Preprocessing of Web Usage Mining
Data Cleaning removes irrelevant references
and fields from server logs, removes erroneous
references, and adds missing references due
to caching.
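The removal part of data cleaning can be sketched as a simple filter. This is a minimal illustration, not the method used in the slides: the entry fields (`url`, `status`, `agent`), the suffix list, and the crawler check are all assumptions, and adding missing references is left to path completion.

```python
from urllib.parse import urlparse

# Hypothetical list of suffixes treated as embedded resources, not page requests.
IGNORED_SUFFIXES = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")


def clean_log(entries):
    """Keep only successful, human-made page requests from a list of log-entry dicts."""
    cleaned = []
    for entry in entries:
        path = urlparse(entry["url"]).path.lower()
        if path.endswith(IGNORED_SUFFIXES):
            continue  # irrelevant reference: embedded image, stylesheet or script
        if not 200 <= entry["status"] < 400:
            continue  # erroneous reference: client or server error
        if "bot" in entry["agent"].lower():
            continue  # crude crawler check (assumption); real systems use richer lists
        cleaned.append(entry)
    return cleaned


# Made-up sample entries for illustration only.
sample_log = [
    {"url": "/index.html", "status": 200, "agent": "Mozilla/5.0"},
    {"url": "/logo.png", "status": 200, "agent": "Mozilla/5.0"},
    {"url": "/missing.html", "status": 404, "agent": "Mozilla/5.0"},
    {"url": "/products.html?id=7", "status": 200, "agent": "ExampleBot/1.0"},
    {"url": "/products.html?id=7", "status": 200, "agent": "Mozilla/5.0"},
]
cleaned_log = clean_log(sample_log)
```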
Preprocessing of Web Usage Mining
Sessionization: dividing each user's activity
into sessions. A session is the set of activities
performed by a user from the moment she enters
the site until the moment she leaves it.
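Server logs do not record when a user leaves, so the end of a session usually has to be inferred. A minimal sketch, assuming a time-gap heuristic with a 30-minute threshold (the threshold and the `(timestamp, page)` input shape are assumptions, not taken from the slides):

```python
SESSION_TIMEOUT = 30 * 60  # seconds of inactivity; assumed threshold


def sessionize(requests, timeout=SESSION_TIMEOUT):
    """Split one user's (timestamp, page) requests into sessions by inactivity gap."""
    sessions = []
    last_time = None
    for timestamp, page in sorted(requests):
        if last_time is None or timestamp - last_time > timeout:
            sessions.append([])  # first request, or gap too long: start a new session
        sessions[-1].append(page)
        last_time = timestamp
    return sessions


# Made-up requests: timestamps are seconds since an arbitrary start.
requests = [(0, "A"), (60, "B"), (120, "C"), (4000, "A"), (4100, "D")]
sessions = sessionize(requests)
```

Here the gap between the third and fourth request exceeds the threshold, so the requests fall into two sessions.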
Sessionization
Preprocessing of Web Usage Mining
User Identification groups the multiple sessions
that belong to the same user. The resulting log
is called the user activity record.
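When no login or cookie is available, user identity has to be approximated. The sketch below assumes one common heuristic, treating each distinct (IP address, user agent) pair as one user; the field names are hypothetical and the heuristic can both merge and split real users.

```python
from collections import defaultdict


def identify_users(entries):
    """Build a user activity record keyed by (IP, user agent).

    The key is a heuristic stand-in for user identity (assumption).
    """
    activity = defaultdict(list)
    for entry in entries:
        key = (entry["ip"], entry["agent"])
        activity[key].append((entry["time"], entry["url"]))
    # Sort each user's hits chronologically.
    return {key: sorted(hits) for key, hits in activity.items()}


# Made-up entries for illustration only.
entries = [
    {"ip": "10.0.0.1", "agent": "Firefox", "time": 5, "url": "/b"},
    {"ip": "10.0.0.1", "agent": "Firefox", "time": 1, "url": "/a"},
    {"ip": "10.0.0.1", "agent": "Chrome", "time": 2, "url": "/a"},
]
records = identify_users(entries)
```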
User Identification
Preprocessing of Web Usage Mining
A page view consists of every file that
contributes to the display on a user's browser
at one time.
Preprocessing of Web Usage Mining
Conceptually, each Page view can be viewed
as a collection of Web objects or resources
representing a specific “user event,” e.g.,
reading an article, viewing a product page, or
adding a product to the shopping cart.
Preprocessing of Web Usage Mining
Path Completion: Client- or proxy-side
caching can often result in missing access
references to those pages or objects that
have been cached.
Preprocessing of Web Usage Mining
Path Completion: For instance,
➢ if a user returns to a page A during the
same session, the second access to A will
likely result in viewing the previously
downloaded version of A that was
cached on the client side, and therefore,
no request is made to the server.
Preprocessing of Web Usage Mining
Path Completion:
➢ This results in the second reference to A
not being recorded on the server logs.
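One way to recover such missing references is to use the referrer of each logged request. The following is a minimal sketch under that assumption (the slides do not name a specific technique): if a request's referrer is not the page the user was last seen on, the user is assumed to have navigated back through cached pages, and those back-steps are inserted.

```python
def complete_path(requests):
    """Infer cached back-navigation for one session.

    requests: list of (page, referrer) pairs in time order; referrer may be None.
    Returns the page sequence with the inferred back-steps inserted.
    """
    path = []
    for page, referrer in requests:
        if path and referrer is not None and path[-1] != referrer and referrer in path:
            # Position of the most recent visit to the referrer.
            idx = len(path) - 1 - path[::-1].index(referrer)
            # Replay the pages walked back through, ending at the referrer.
            path.extend(path[idx:-1][::-1])
        path.append(page)
    return path


# The server logged A, B, C, D; D was reached from A, so the user went back C -> B -> A.
logged = [("A", None), ("B", "A"), ("C", "B"), ("D", "A")]
completed = complete_path(logged)
```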
Path Completion
Preprocessing of Web Usage Mining
An episode is a subset or subsequence of a
session composed of semantically or
functionally related page views.
PATTERN DISCOVERY
Pattern discovery draws upon methods and
algorithms developed from several fields such as
statistics, data mining, machine learning and
pattern recognition.
PATTERN DISCOVERY
Methods:
➢ Statistical Analysis
➢ Association Rules
➢ Clustering
➢ Classification
➢ Sequential Patterns
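As an illustration of one of these methods, association rules over sessions are usually scored by support and confidence. The sketch below computes both for a single candidate rule; the session data and page names are made up.

```python
def support(sessions, pages):
    """Fraction of sessions that contain every page in `pages`."""
    pages = set(pages)
    return sum(1 for s in sessions if pages <= set(s)) / len(sessions)


def confidence(sessions, antecedent, consequent):
    """Estimate of P(consequent in session | antecedent in session).

    Raises ZeroDivisionError if the antecedent never occurs.
    """
    both = set(antecedent) | set(consequent)
    return support(sessions, both) / support(sessions, antecedent)


# Hypothetical sessions, each a list of visited pages.
sessions = [
    ["home", "products", "cart"],
    ["home", "products"],
    ["home", "about"],
    ["products", "cart"],
]
rule_support = support(sessions, ["products", "cart"])
rule_confidence = confidence(sessions, ["products"], ["cart"])
```

A rule such as products → cart is typically kept only if both values exceed user-chosen minimum thresholds.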
PATTERN ANALYSIS
The motivation behind pattern analysis is to filter
out uninteresting rules or patterns from the set
found in the pattern discovery phase.
PATTERN ANALYSIS
Methods:
➢ Use a knowledge query mechanism such as SQL.
➢ Load usage data into a data cube in order to perform Online
Analytical Processing (OLAP) operations.
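A minimal sketch of the SQL approach, using Python's built-in `sqlite3` module with an in-memory database. The table layout, rule values, and thresholds are all hypothetical and serve only to show how uninteresting rules can be filtered out with a query.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE rules (antecedent TEXT, consequent TEXT, support REAL, confidence REAL)"
)
# Made-up discovered rules for illustration only.
conn.executemany(
    "INSERT INTO rules VALUES (?, ?, ?, ?)",
    [
        ("/products", "/cart", 0.50, 0.67),
        ("/home", "/about", 0.25, 0.33),
        ("/cart", "/checkout", 0.05, 0.90),
    ],
)
# Keep only rules above assumed minimum support and confidence thresholds.
interesting = conn.execute(
    "SELECT antecedent, consequent FROM rules "
    "WHERE support >= ? AND confidence >= ? ORDER BY confidence DESC",
    (0.10, 0.50),
).fetchall()
conn.close()
```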
PATTERN ANALYSIS
Methods:
➢ Visualization techniques, such as graphing
patterns or assigning colors to different
values.
➢ Content and structure information can be
used to filter out patterns containing pages of
a certain usage type, content type, or pages
that match a certain hyperlink structure.
Application of Web Usage Mining
Advantages
➢ Personalized marketing.
➢ Fight against terrorism.
➢ Customer Relationship.
➢ Increase profitability by target pricing.
COLLABORATIVE FILTERING
Subodh chandra shakya
071BCT543
What is collaborative filtering?
Collaborative filtering is a method of making
automatic predictions about the interests of a
user by collecting preference or taste
information from many other users (i.e.
collaborating on interests).
Application
Mostly in e-commerce recommendation
systems, for example:
Amazon
Netflix
How it works:
1. Weight all users with respect to their similarity with the active user
2. Select a subset of users to use as a set of predictors
3. Compute a prediction from a weighted combination of the
selected neighbors' ratings
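The three steps can be sketched as follows. This is an illustrative sketch only: cosine similarity over rating dictionaries is one possible weighting (an assumption here), `k` is a hypothetical neighborhood size, and the ratings are made up.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two {item: rating} dicts (missing ratings count as 0)."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[i] * v[i] for i in common)
    norm_u = sqrt(sum(r * r for r in u.values()))
    norm_v = sqrt(sum(r * r for r in v.values()))
    return dot / (norm_u * norm_v)


def predict(ratings, active, item, k=2):
    """Predict the active user's rating of `item` from the k most similar raters."""
    # Step 1: weight every other user who rated the item by similarity to the active user.
    weights = [
        (cosine(ratings[active], r), user)
        for user, r in ratings.items()
        if user != active and item in r
    ]
    # Step 2: select the k most similar users as predictors.
    neighbors = sorted(weights, reverse=True)[:k]
    # Step 3: similarity-weighted average of the neighbors' ratings.
    total = sum(w for w, _ in neighbors)
    if total == 0:
        return None  # no usable neighbors
    return sum(w * ratings[user][item] for w, user in neighbors) / total


# Made-up ratings on a 1-5 scale.
ratings = {
    "ann": {"A": 5, "B": 3},
    "bob": {"A": 5, "B": 3, "C": 4},
    "cat": {"A": 1, "B": 5, "C": 2},
    "dan": {"A": 4, "C": 5},
}
prediction = predict(ratings, "ann", "C", k=2)
```

With these ratings the two selected neighbors are bob and cat, and the prediction lies between their ratings of C, closer to bob's because bob is more similar to ann.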
Collaborative filtering types
Memory Based: uses user rating data to compute
similarity between users or items, e.g.
neighbourhood-based and item-based methods.
Model Based: uses data mining and machine learning
to build a predictive model, e.g. Bayesian networks,
neural embedding models, clustering models, and
latent semantic models such as SVD.
Approaches for CF (memory based)
User-Based CF: compute similarity between users
Item-Based CF: compute similarity between items
User based CF
Look for users who share the same rating
patterns with the active user (the user for whom
the prediction is made)
Use the ratings from those like-minded users
to calculate a prediction for the active user
Item based CF
1. Build an item-item matrix determining
relationships between pairs of items
2. Infer the tastes of the current user by
examining the matrix and matching that
user's data
Simple similarity measures: cosine similarity
and Pearson correlation similarity
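Both similarity measures, and their use in building an item-item matrix, can be sketched as below. The helper names and the choice to compare two items only over users who rated both are assumptions made for illustration; the ratings are made up.

```python
from math import sqrt


def cosine_sim(x, y):
    """Cosine of the angle between two equal-length rating vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm = sqrt(sum(a * a for a in x)) * sqrt(sum(b * b for b in y))
    return dot / norm if norm else 0.0


def pearson_sim(x, y):
    """Pearson correlation: cosine similarity of the mean-centred vectors."""
    n = len(x)
    if n == 0:
        return 0.0
    mean_x, mean_y = sum(x) / n, sum(y) / n
    return cosine_sim([a - mean_x for a in x], [b - mean_y for b in y])


def item_item_matrix(ratings, items, sim):
    """Similarity for every ordered item pair, over users who rated both items."""
    matrix = {}
    for i in items:
        for j in items:
            if i == j:
                continue
            users = [u for u, r in ratings.items() if i in r and j in r]
            x = [ratings[u][i] for u in users]
            y = [ratings[u][j] for u in users]
            matrix[(i, j)] = sim(x, y) if users else 0.0
    return matrix


# Made-up ratings on a 1-5 scale.
ratings = {
    "bob": {"A": 5, "B": 3, "C": 4},
    "cat": {"A": 1, "B": 5, "C": 2},
    "dan": {"A": 4, "C": 5},
}
matrix = item_item_matrix(ratings, ["A", "B", "C"], pearson_sim)
```

In this toy data items A and C are rated alike (positive correlation) while A and B are rated oppositely (negative correlation), so a user who liked A would be pointed towards C rather than B.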
Collaborative Filtering problems
Cold-start: There must be enough other users
already in the system to find a match, and new items
need to collect enough ratings before they can be recommended
Popularity Bias: Hard to recommend items to
someone with unique tastes
RECOMMENDER SYSTEMS
Atul Khatri
071bct509
Definition
● Estimate a utility function that automatically predicts how a
user will like an item
● Based on
○ Past Behavior
○ Relations to other users
○ Item similarity
○ Context
Impact
Apparent
● Advertisement
● Restaurants, cafes
● Movies, TV shows, Music
● Books
● News articles
● Social sites including dating services
Impact (continued)
Not so apparent
● Courses in E-learning
● Drug components
● Research papers
● Citations
● Code modules
Architecture
Types
● Collaborative Filtering system
● Content-based system
● Hybrid recommender system
○ Context-based system
○ Knowledge-based system
Paradigms of recommender
systems
Content-Based
Recommender System
● The system creates a user profile based on the user's likes or
dislikes, which are explicitly stated
● Every purchase updates the user profile.
● A content-based recommender system matches an item's profile
against the user profile to decide its relevance to the user
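A minimal sketch of this matching, assuming items are described by sets of attributes and the user profile is a count of the attributes of liked or purchased items. The item identifiers, attributes, and scoring rule are hypothetical.

```python
from collections import Counter


def build_profile(liked_items, catalog):
    """User profile: how often each attribute occurs among liked or purchased items."""
    profile = Counter()
    for item in liked_items:
        profile.update(catalog[item])
    return profile


def score(profile, attributes):
    """Relevance of an item: summed profile weight of its attributes."""
    return sum(profile[a] for a in attributes)


def recommend(profile, catalog, exclude, n=1):
    """Top-n items not yet seen, ranked by score (ties broken by item name)."""
    candidates = [i for i in catalog if i not in exclude]
    return sorted(candidates, key=lambda i: (-score(profile, catalog[i]), i))[:n]


# Made-up catalog: item id -> set of attributes.
catalog = {
    "item_a": {"sci-fi", "thriller"},
    "item_b": {"sci-fi", "drama"},
    "item_c": {"romance", "comedy"},
    "item_d": {"thriller", "sci-fi"},
}
profile = build_profile(["item_a"], catalog)
suggestions = recommend(profile, catalog, exclude={"item_a"}, n=2)
```

A new purchase would be folded in by calling `profile.update(catalog[item])` again, which mirrors the "every purchase updates the user profile" point.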
Storage of items in database
Content Representation
● Structured data
○ Small number of attributes
○ Each item described by same set of attributes
○ Known set of values of attributes
Content Representation (contd.)
● Unstructured data
○ No attribute names with well defined values
○ Need to impose structure on text before use
○ Natural language complexity
■ Same word with different meaning
■ Different word with same meaning
Context-Based
Recommender Systems
● The system uses additional data about the context in which an item is
consumed.
● Example: time of day may be used as an additional component when
recommending restaurants, i.e. different restaurants for breakfast,
lunch and so on. Whether you are going out to eat with friends or
with family should also change the recommendation.
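One simple way to use such context is to filter the rating data down to the current context before ranking. The sketch below assumes that approach; the restaurant names, the `(meal, company)` context tuple, and the ratings are all made up.

```python
def recommend_in_context(ratings, context, n=2):
    """Rank restaurants by mean rating, using only ratings given in the same context.

    ratings: list of (restaurant, meal, company, rating) tuples.
    context: (meal, company) pair describing the current situation.
    """
    totals = {}
    for restaurant, meal, company, rating in ratings:
        if (meal, company) != context:
            continue  # rating was given in a different context: ignore it
        rating_sum, count = totals.get(restaurant, (0, 0))
        totals[restaurant] = (rating_sum + rating, count + 1)
    means = {r: s / c for r, (s, c) in totals.items()}
    return sorted(means, key=lambda r: (-means[r], r))[:n]


# Hypothetical ratings on a 1-5 scale.
ratings = [
    ("Cafe Uno", "breakfast", "friends", 5),
    ("Cafe Uno", "breakfast", "friends", 4),
    ("Diner Two", "breakfast", "friends", 3),
    ("Grill Three", "dinner", "family", 5),
    ("Diner Two", "dinner", "family", 4),
]
breakfast_picks = recommend_in_context(ratings, ("breakfast", "friends"))
dinner_picks = recommend_in_context(ratings, ("dinner", "family"))
```

The same restaurant can rank differently, or not appear at all, depending on the context supplied.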
Major obstacles for contextual computing
● Obtaining sufficient and reliable data describing user context
● Understanding the impact of contextual dimensions on the
personalisation process
● Building a computational model of contextual dimensions within more
classical recommendation technology
● For instance: how can collaborative filtering be extended to
include contextual dimensions?
Collective Intelligence
Sagun Nakarmi
071bct533
● A shared or group intelligence that emerges
from the collaboration and competition of many
individuals.
● Groups of people and computers, connected by
the Internet, collectively doing intelligent
things.
It can be understood as an emergent property from
the synergies among:
1) Data - knowledge-information
2) Software-hardware
3) Experts
For instance,
Google technology harvests knowledge generated
by millions of people creating and linking web
pages and then uses this knowledge to answer
queries in ways that often seem amazingly
intelligent.
In Wikipedia, thousands of people around the world
have collectively created a very large and high quality
intellectual product with almost no centralized control,
and almost all as volunteers!
Online multi-player games are another example
of collective intelligence. Games such as Dota 2,
Second Life and Call of Duty rely on gamers
coming together as a community to form the
game's identity.
Other examples:
● Social networking (perhaps the
most popular form of collective intelligence)
● Amazon, Hamrobazaar & other ecommerce sites
● etc
THANK YOU FOR YOUR
PATIENCE!!

Web Usage Mining Process and its Applications

  • 1. WEB USAGE MINING Monu Chaudhary 071BCT522
  • 2. INTRODUCTION Web Usage mining is the process of applying data mining techniques for the discovery of usage patterns from Web data, targeted towards various applications.
  • 3. INTRODUCTION Data collected at different levels: ➢ Server level ➢ Client level ➢ Proxy level
  • 4. INTRODUCTION Goal: ➢ Analyze the behavioral patterns and profiles of users interacting with a Web site ➢ Understand and better serve the needs of Web-based applications
  • 5. INTRODUCTION Classification based on Usage Data: ➢ Web server Data ➢ Application Server Data ➢ Application Level Data
  • 6. INTRODUCTION Importance: ➢ Growth of e-commerce ○ Provides a cost-effective way of doing business. ➢ Hidden useful information ○ Visitors’ profiles ○ Measuring online marketing effort
  • 7. INTRODUCTION 3 Phases: ➢ Preprocessing ➢ Pattern Discovery ➢ Pattern Analysis
  • 8. PREPROCESSING Preprocessing consists of converting the: ➢ usage information ➢ content information ➢ structure information contained in the various available data sources into the data abstractions necessary for pattern discovery.
  • 10. Web Usage Mining Process
  • 12. Preprocessing of Web Usage Mining
  • 13. Preprocessing of Web Usage Mining Data Cleaning removes irrelevant references and fields from server logs, removes erroneous references, and adds references that are missing because of caching.
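As a rough illustration of the cleaning step, the sketch below filters parsed log entries. The entry layout (a dict with `url` and `status` keys), the extension list, and the function name `clean_log` are assumptions made for this example, not something the slides specify.

```python
# Assumed list of extensions treated as embedded resources rather than page requests.
IRRELEVANT_EXTENSIONS = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")


def clean_log(entries):
    """Drop irrelevant references (embedded resources) and erroneous ones (HTTP errors).

    `entries` is assumed to be a list of dicts with "url" and "status" keys.
    """
    cleaned = []
    for entry in entries:
        # Ignore the query string when checking the file extension.
        path = entry["url"].split("?", 1)[0].lower()
        if path.endswith(IRRELEVANT_EXTENSIONS):
            continue  # irrelevant reference: image, stylesheet, script
        if entry["status"] >= 400:
            continue  # erroneous reference: client or server error
        cleaned.append(entry)
    return cleaned


log = [
    {"url": "/index.html", "status": 200},
    {"url": "/logo.png", "status": 200},
    {"url": "/missing.html", "status": 404},
    {"url": "/products.html?id=7", "status": 200},
]
pages = [e["url"] for e in clean_log(log)]
```

Adding the references lost to caching is a separate step (path completion) and is not covered by this filter.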
  • 14. Preprocessing of Web Usage Mining Sessionization: grouping a user's requests into sessions, where a session is the set of activities performed by a user from the moment she enters the site until the moment she leaves it.
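Server logs do not record when a user leaves the site, so session boundaries are usually estimated. The sketch below uses an inactivity timeout, which is one common heuristic and not necessarily the method intended in the slides. The 30-minute threshold, the `(user_id, timestamp, url)` tuple layout, and the name `sessionize` are assumptions.

```python
from collections import defaultdict

SESSION_TIMEOUT = 30 * 60  # seconds; an assumed inactivity threshold


def sessionize(requests, timeout=SESSION_TIMEOUT):
    """Split each user's requests into sessions at gaps longer than `timeout`.

    `requests` is an iterable of (user_id, timestamp_in_seconds, url) tuples.
    Returns {user_id: [[url, ...], ...]} with one inner list per session.
    """
    by_user = defaultdict(list)
    for user, ts, url in requests:
        by_user[user].append((ts, url))

    sessions = {}
    for user, hits in by_user.items():
        hits.sort()  # order each user's requests by time
        user_sessions = []
        last_ts = None
        for ts, url in hits:
            # Start a new session on the first hit or after a long gap.
            if last_ts is None or ts - last_ts > timeout:
                user_sessions.append([])
            user_sessions[-1].append(url)
            last_ts = ts
        sessions[user] = user_sessions
    return sessions


result = sessionize([
    ("u1", 0, "/a"),
    ("u1", 100, "/b"),
    ("u1", 5000, "/c"),  # more than 30 minutes after the previous hit
    ("u2", 50, "/a"),
])
```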
  • 16. Preprocessing of Web Usage Mining User Identification groups the multiple sessions that belong to the same user. The resulting log is called the user activity record.
  • 18. Preprocessing of Web Usage Mining A page view consists of every file that contributes to the display on a user's browser at one time.
  • 19. Preprocessing of Web Usage Mining Conceptually, each page view can be viewed as a collection of Web objects or resources representing a specific “user event,” e.g., reading an article, viewing a product page, or adding a product to the shopping cart.
  • 20. Preprocessing of Web Usage Mining Path Completion: Client- or proxy-side caching can often result in missing access references to those pages or objects that have been cached.
  • 21. Preprocessing of Web Usage Mining Path Completion: For instance, ➢ if a user returns to a page A during the same session, the second access to A will likely result in viewing the previously downloaded version of A that was cached on the client side, and therefore no request is made to the server.
  • 22. Preprocessing of Web Usage Mining Path Completion: ➢ This results in the second reference to A not being recorded on the server logs.
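One way to recover such missing references is to use the site's link structure: if the next logged page is not reachable from the current one, assume the user pressed "back" through cached pages until reaching a page that links to it. The sketch below implements that heuristic. The `links` dictionary and the function name `complete_path` are assumptions, and the inferred path is only one of possibly several consistent paths.

```python
def complete_path(session, links):
    """Insert page references that were probably served from cache.

    `session` is the list of pages as logged by the server.
    `links` maps each page to the set of pages it links to (assumed known).
    """
    completed = []
    for page in session:
        if completed and page not in links.get(completed[-1], set()):
            # Walk back through the history to find a page that links to `page`.
            for i in range(len(completed) - 2, -1, -1):
                if page in links.get(completed[i], set()):
                    # Assume the user pressed "back" through these cached pages.
                    backtrack = completed[i:len(completed) - 1][::-1]
                    completed.extend(backtrack)
                    break
        completed.append(page)
    return completed


site_links = {"A": {"B", "C"}, "B": {"D"}, "C": set(), "D": set()}
# The server logged A, B, D, C, but C is only reachable from A.
full_path = complete_path(["A", "B", "D", "C"], site_links)
```

Here the two back-navigations (to B, then to A) are inferred because neither produced a request on the server.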
  • 24. Preprocessing of Web Usage Mining An episode is a subset or subsequence of a session composed of semantically or functionally related page views.
  • 25. PATTERN DISCOVERY Pattern discovery draws upon methods and algorithms developed from several fields such as statistics, data mining, machine learning and pattern recognition.
  • 26. PATTERN DISCOVERY Methods: ➢ Statistical Analysis ➢ Association Rules ➢ Clustering ➢ Classification ➢ Sequential Patterns
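To make one of these methods concrete, the sketch below computes the two standard measures behind association rules, support and confidence, over a set of sessions. The session data and the function names are made up for illustration; a real miner (e.g. Apriori) would also enumerate candidate itemsets, which is omitted here.

```python
def support(sessions, itemset):
    """Fraction of sessions that contain every page in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= set(s) for s in sessions) / len(sessions)


def confidence(sessions, antecedent, consequent):
    """Estimate of P(consequent in session | antecedent in session)."""
    base = support(sessions, antecedent)
    if base == 0:
        return 0.0
    return support(sessions, set(antecedent) | set(consequent)) / base


sessions = [
    ["/home", "/products", "/cart"],
    ["/home", "/products"],
    ["/home", "/about"],
    ["/products", "/cart"],
]
s = support(sessions, {"/home", "/products"})
c = confidence(sessions, {"/products"}, {"/cart"})
```

A rule such as "/products implies /cart" would be kept only if both values exceed thresholds chosen by the analyst.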
  • 27. PATTERN ANALYSIS The motivation behind pattern analysis is to filter out uninteresting rules or patterns from the set found in the pattern discovery phase.
  • 28. PATTERN ANALYSIS Methods: ➢ A knowledge query mechanism such as SQL. ➢ Another method is to load usage data into a data cube in order to perform Online Analytical Processing (OLAP) operations.
  • 29. PATTERN ANALYSIS Methods: ➢ Visualization techniques, such as graphing patterns or assigning colors to different values. ➢ Content and structure information can be used to filter out patterns containing pages of a certain usage type, content type, or pages that match a certain hyperlink structure.
  • 30. Application of Web Usage Mining
  • 31. Advantages ➢ Personalized marketing. ➢ Fight against terrorism. ➢ Customer Relationship. ➢ Increase profitability by target pricing.
  • 33. What is collaborative filtering? Collaborative filtering is a method of making automatic predictions about the interests of a user by collecting preference or taste information from other users (i.e., collaborating on interests).
  • 34. Application: Mostly in e-commerce recommendation systems, e.g., Amazon, Netflix
  • 35. This is how it works: 1. Weight all users with respect to their similarity with the active user 2. Select a subset of users to use as a set of predictors 3. Compute a prediction from a weighted combination of the selected neighbors’ ratings
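The three steps above can be sketched as follows. This is a minimal illustration, assuming ratings are stored as nested dictionaries and using cosine similarity as the weight. The names `cosine` and `predict`, the sample ratings, and the neighborhood size `k` are all assumptions made for the example.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two sparse rating vectors (dicts: item -> rating)."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[i] * v[i] for i in common)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def predict(ratings, active, item, k=2):
    """Predict the active user's rating for `item` from the k most similar raters."""
    # Step 1: weight every other user who rated the item by similarity.
    weights = [
        (cosine(ratings[active], r), user)
        for user, r in ratings.items()
        if user != active and item in r
    ]
    # Step 2: keep the k most similar users as predictors.
    neighbors = sorted(weights, reverse=True)[:k]
    # Step 3: similarity-weighted average of the neighbors' ratings.
    total = sum(w for w, _ in neighbors)
    if total == 0:
        return None  # no usable neighbors
    return sum(w * ratings[user][item] for w, user in neighbors) / total


ratings = {
    "alice": {"a": 5, "b": 3},
    "bob": {"a": 5, "b": 3, "c": 4},
    "carol": {"a": 1, "b": 5, "c": 2},
}
p2 = predict(ratings, "alice", "c", k=2)
p1 = predict(ratings, "alice", "c", k=1)
```

With `k=1` only the most similar user (here `bob`) contributes, so the prediction equals that user's rating.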
  • 36. Collaborative filtering types: Memory-based: uses user rating data to compute the similarity between users or items (e.g., neighbourhood-based, item-based). Model-based: uses data mining and machine learning (e.g., Bayesian networks, neural embedding models, clustering models, latent semantic models such as SVD).
  • 37. Approaches for CF (memory-based): User-based CF: compute similarity between users. Item-based CF: compute similarity between items.
  • 38. User-based CF: Look for users who share the same rating patterns with the active user (the user for whom the prediction is made). Use the ratings from those like-minded users to calculate a prediction for the active user.
  • 40. Item-based CF: 1. Build an item-item matrix determining relationships between pairs of items 2. Infer the tastes of the current user by examining the matrix and matching that user's data
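A minimal sketch of these two steps, assuming the same nested-dictionary rating layout (user -> item -> rating) and cosine similarity between item columns. The helper names and sample data are hypothetical, and the item-item "matrix" is stored as a dictionary keyed by item pairs.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[k] * v[k] for k in common)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def item_similarity(ratings):
    """Step 1: build the item-item matrix as {(item_a, item_b): similarity}."""
    vectors = {}
    for user, user_ratings in ratings.items():
        for item, score in user_ratings.items():
            vectors.setdefault(item, {})[user] = score
    sim = {}
    for a, b in combinations(sorted(vectors), 2):
        sim[(a, b)] = sim[(b, a)] = cosine(vectors[a], vectors[b])
    return sim


def predict(ratings, sim, user, item):
    """Step 2: weight the user's own ratings by similarity to the target item."""
    rated = ratings[user]
    num = sum(sim.get((item, j), 0.0) * score for j, score in rated.items())
    den = sum(sim.get((item, j), 0.0) for j in rated)
    return num / den if den else None


ratings = {
    "alice": {"a": 5, "b": 3},
    "bob": {"a": 5, "b": 3, "c": 4},
    "carol": {"a": 1, "b": 5, "c": 2},
}
sim = item_similarity(ratings)
p = predict(ratings, sim, "alice", "c")
```

Because the item-item matrix depends only on the ratings, it can be computed offline and reused for every user.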
  • 44. A simple similarity measure is cosine similarity
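For reference, cosine similarity between two users can be written as below. The notation is an assumption, since the slide's own formula did not survive extraction: $r_{u,i}$ denotes the rating of user $u$ for item $i$, with unrated items counted as zero.

```latex
\mathrm{sim}(u, v) = \cos\theta
  = \frac{\sum_{i} r_{u,i}\, r_{v,i}}
         {\sqrt{\sum_{i} r_{u,i}^{2}}\;\sqrt{\sum_{i} r_{v,i}^{2}}}
```

The same expression applies to item-based CF if the roles of users and items are swapped.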
  • 46. Collaborative filtering problems: Cold start: there must already be enough other users in the system to find a match, and new items need enough ratings before they can be recommended. Popularity bias: it is hard to recommend items to someone with unique tastes.
  • 48. Definition ● Estimate a utility function that automatically predicts how a user will like an item ● Based on ○ Past Behavior ○ Relations to other users ○ Item similarity ○ Context
  • 49. Impact: Apparent ● Advertisement ● Restaurants, cafes ● Movies, TV shows, music ● Books ● News articles ● Social sites including dating services
  • 50. Impact (continued): Not so apparent ● Courses in e-learning ● Drug components ● Research papers ● Citations ● Code modules
  • 53. Types ● Collaborative Filtering system ● Content-based system ● Hybrid recommender system ○ Context-based system ○ Knowledge-based system
  • 59. ● The system creates a user profile based on the user's explicitly stated likes and dislikes ● Every purchase updates the user profile. ● A content-based recommender system matches the profile of an item to the user profile to decide its relevance to the user
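The profile matching described above can be sketched as follows, assuming each item is described by a set of attribute tags and the user profile is a per-attribute score built from liked and disliked items. The catalogue, the scoring rule (+1 per like, -1 per dislike), and the function names are all assumptions for illustration.

```python
from collections import Counter


def build_profile(items, liked, disliked):
    """User profile: +1 for each attribute of a liked item, -1 for a disliked one."""
    profile = Counter()
    for name in liked:
        for attr in items[name]:
            profile[attr] += 1
    for name in disliked:
        for attr in items[name]:
            profile[attr] -= 1
    return profile


def recommend(items, profile, exclude=()):
    """Rank unseen items by how well their attributes match the profile."""
    scores = {
        name: sum(profile[attr] for attr in attrs)
        for name, attrs in items.items()
        if name not in exclude
    }
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores, key=lambda name: (-scores[name], name))


catalogue = {
    "m1": {"action", "scifi"},
    "m2": {"romance"},
    "m3": {"scifi", "drama"},
    "m4": {"romance", "drama"},
}
profile = build_profile(catalogue, liked=["m1"], disliked=["m2"])
ranking = recommend(catalogue, profile, exclude={"m1", "m2"})
```

Updating the profile after each purchase amounts to calling `build_profile` again with the new item added to `liked`.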
  • 61. Storage of items in database
  • 63. Content Representation ● Structured data ○ Small number of attributes ○ Each item described by the same set of attributes ○ Known set of values for the attributes
  • 64. Content Representation (contd.) ● Unstructured data ○ No attribute names with well-defined values ○ Need to impose structure on text before use ○ Natural language complexity ■ Same word with different meanings ■ Different words with the same meaning
  • 66. ● The system uses additional data about the context in which an item is consumed. ● Example: The time of day may be used as an additional component when recommending restaurants to consumers, i.e., different restaurants for breakfast, lunch and so on. Further, whether you are going out to eat with friends or with family should also change the recommendation.
  • 67. Major obstacles for contextual computing ● Obtaining sufficient and reliable data describing the user context ● Understanding the impact of contextual dimensions on the personalisation process ● Building a computational model of contextual dimensions within more classical recommendation technology ● For instance: How to extend collaborative filtering to include contextual dimensions?
  • 69. ● A shared or group intelligence that emerges from the collaboration and competition of many individuals. ● Groups of people and computers, connected by the Internet, collectively doing intelligent things.
  • 71. It can be understood as an emergent property from the synergies among: 1) Data, knowledge and information 2) Software and hardware 3) Experts
  • 72. For instance, Google technology harvests knowledge generated by millions of people creating and linking web pages and then uses this knowledge to answer queries in ways that often seem amazingly intelligent.
  • 73. In Wikipedia, thousands of people around the world have collectively created a very large, high-quality intellectual product with almost no centralized control, and almost all of them as volunteers!
  • 74. Online multi-player games are another example of collective intelligence. Games such as Dota 2, Second Life and Call of Duty rely on gamers coming together as a community to form the game’s identity.
  • 75. Other examples: ● Social networking (perhaps the most popular form of collective intelligence) ● Amazon, Hamrobazaar and other e-commerce sites ● etc.
  • 76. THANK YOU FOR YOUR PATIENCE!!