SlideShare ist ein Scribd-Unternehmen logo
1 von 10
WEB GRAPHS
2
Internet/Web as Graphs
• Graph of the physical layer with routers ,
computers etc as nodes and physical
connections as edges
– It is limited
– Does not capture the graphical connections
associated with the information on the Internet
• Web Graph where nodes represent web
pages and edges are associated with
hyperlinks
3
Web Graph
4
Web Graph Considerations
• Edges can be directed or undirected
• Graph is highly dynamic
– Nodes and edges are added/deleted often
– Content of existing nodes is also subject to
change
– Pages and hyperlinks created on the fly
• Apart from primary connected component
there are also smaller disconnected
components
5
Why the Web Graph?
• Example of a large,dynamic and
distributed graph
• Possibly similar to other complex graphs
in social, biological and other systems
• Reflects how humans organize
information (relevance, ranking) and their
societies
• Efficient navigation algorithms
• Study behavior of users as they traverse
the web graph (e-commerce)
6
Statistics of Interest
• Size and connectivity of the graph
• Number of connected components
• Distribution of pages per site
• Distribution of incoming and outgoing
connections per site
• Average and maximal length of the
shortest path between any two vertices
(diameter)
7
Properties of Web Graphs
• Connectivity follows a power law
distribution
• The graph is sparse
– |E| = O(n) or atleast o(n2
)
– Average number of hyperlinks per page
roughly a constant
• A small world graph
8
Small World Networks
• It is a ‘small world’
– Millions of people. Yet, separated by “six
degrees” of acquaintance relationships
– Popularized by Milgram’s famous experiment
• Mathematically
– Diameter of graph is small (log N) as
compared to overall size
• 3. Property seems interesting given ‘sparse’ nature
of graph but …
• This property is ‘natural’ in ‘pure’ random graphs
Conclusion
The compression techniques are
specializedspecialized for Web Graphs.
The average link sizelink size decreases with the
increase of the graph.
The average link access timelink access time increases
with the increase of the graph.
9
Conclusion
The compression techniques are
specializedspecialized for Web Graphs.
The average link sizelink size decreases with the
increase of the graph.
The average link access timelink access time increases
with the increase of the graph.
9

Weitere ähnliche Inhalte

Was ist angesagt?

Market oriented Cloud Computing
Market oriented Cloud ComputingMarket oriented Cloud Computing
Market oriented Cloud Computing
Jithin Parakka
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data mining
Mai Mustafa
 

Was ist angesagt? (20)

Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
A Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningA Friendly Introduction to Machine Learning
A Friendly Introduction to Machine Learning
 
Market oriented Cloud Computing
Market oriented Cloud ComputingMarket oriented Cloud Computing
Market oriented Cloud Computing
 
Cloud Computing and Services | PPT
Cloud Computing and Services | PPTCloud Computing and Services | PPT
Cloud Computing and Services | PPT
 
Fraud detection with Machine Learning
Fraud detection with Machine LearningFraud detection with Machine Learning
Fraud detection with Machine Learning
 
Cloud computing notes
Cloud computing notesCloud computing notes
Cloud computing notes
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
 
Semantic Web
Semantic WebSemantic Web
Semantic Web
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with R
 
Loan approval prediction based on machine learning approach
Loan approval prediction based on machine learning approachLoan approval prediction based on machine learning approach
Loan approval prediction based on machine learning approach
 
Grid computing
Grid computingGrid computing
Grid computing
 
Machine learning
Machine learningMachine learning
Machine learning
 
Grid based method & model based clustering method
Grid based method & model based clustering methodGrid based method & model based clustering method
Grid based method & model based clustering method
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data mining
 
Clustering
ClusteringClustering
Clustering
 
Web content mining
Web content miningWeb content mining
Web content mining
 
Machine learning ppt
Machine learning pptMachine learning ppt
Machine learning ppt
 
Clustering in Data Mining
Clustering in Data MiningClustering in Data Mining
Clustering in Data Mining
 
Web Scraping With Python
Web Scraping With PythonWeb Scraping With Python
Web Scraping With Python
 

Andere mochten auch

AIPPLE Catalogue 2016
AIPPLE Catalogue 2016AIPPLE Catalogue 2016
AIPPLE Catalogue 2016
sam zhu
 
Beth presngksgkis;og
Beth presngksgkis;ogBeth presngksgkis;og
Beth presngksgkis;og
BethVannet
 
Trabajo contencioso
Trabajo contenciosoTrabajo contencioso
Trabajo contencioso
YELVER
 
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOMEΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
YOUR NEXT HOME
 
Production schedule
Production scheduleProduction schedule
Production schedule
SamFrullo
 

Andere mochten auch (20)

Case study of goggle map
Case study of goggle mapCase study of goggle map
Case study of goggle map
 
Oinarri-plakaren osagaiak
Oinarri-plakaren osagaiakOinarri-plakaren osagaiak
Oinarri-plakaren osagaiak
 
AIPPLE Catalogue 2016
AIPPLE Catalogue 2016AIPPLE Catalogue 2016
AIPPLE Catalogue 2016
 
Nouth Phouthavongsa - Asian TYA Network event presentation at ricca ricca*fes...
Nouth Phouthavongsa - Asian TYA Network event presentation at ricca ricca*fes...Nouth Phouthavongsa - Asian TYA Network event presentation at ricca ricca*fes...
Nouth Phouthavongsa - Asian TYA Network event presentation at ricca ricca*fes...
 
Hugh Brown - Asian TYA Network event presentation at ricca ricca*festa 2016
Hugh Brown - Asian TYA Network event presentation at ricca ricca*festa 2016Hugh Brown - Asian TYA Network event presentation at ricca ricca*festa 2016
Hugh Brown - Asian TYA Network event presentation at ricca ricca*festa 2016
 
Beth presngksgkis;og
Beth presngksgkis;ogBeth presngksgkis;og
Beth presngksgkis;og
 
Cs ss 2016
Cs ss 2016Cs ss 2016
Cs ss 2016
 
evolució del web
evolució del webevolució del web
evolució del web
 
Irteerako periferikoak
Irteerako periferikoakIrteerako periferikoak
Irteerako periferikoak
 
Presentacion infogobierno slideshare.
Presentacion infogobierno slideshare. Presentacion infogobierno slideshare.
Presentacion infogobierno slideshare.
 
Melissa Tan - Asian TYA Network event presentation at ricca ricca*festa 2016
Melissa Tan - Asian TYA Network event presentation at ricca ricca*festa 2016Melissa Tan - Asian TYA Network event presentation at ricca ricca*festa 2016
Melissa Tan - Asian TYA Network event presentation at ricca ricca*festa 2016
 
B uppsats den kyrkliga försvenskningen av bohuslänska hisingen
B uppsats  den kyrkliga försvenskningen av bohuslänska hisingenB uppsats  den kyrkliga försvenskningen av bohuslänska hisingen
B uppsats den kyrkliga försvenskningen av bohuslänska hisingen
 
CV Tamer Abbass
CV Tamer AbbassCV Tamer Abbass
CV Tamer Abbass
 
Trabajo contencioso
Trabajo contenciosoTrabajo contencioso
Trabajo contencioso
 
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOMEΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
ΠΑΡΟΥΣΙΑΣΗ YOUR NEXT HOME
 
Production schedule
Production scheduleProduction schedule
Production schedule
 
2016 CV 1
2016 CV 12016 CV 1
2016 CV 1
 
eczema treatment singapore
eczema treatment singaporeeczema treatment singapore
eczema treatment singapore
 
Teta Tulay - Asian TYA Network event presentation at ricca ricca*festa 2016.
Teta Tulay - Asian TYA Network event presentation at ricca ricca*festa 2016.Teta Tulay - Asian TYA Network event presentation at ricca ricca*festa 2016.
Teta Tulay - Asian TYA Network event presentation at ricca ricca*festa 2016.
 
Contents page powerpoint
Contents page powerpointContents page powerpoint
Contents page powerpoint
 

Ähnlich wie Case Study Of Webgraph

Scalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduceScalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduce
sscdotopen
 

Ähnlich wie Case Study Of Webgraph (20)

CS6010 Social Network Analysis Unit V
CS6010 Social Network Analysis Unit VCS6010 Social Network Analysis Unit V
CS6010 Social Network Analysis Unit V
 
Ling liu part 01:big graph processing
Ling liu part 01:big graph processingLing liu part 01:big graph processing
Ling liu part 01:big graph processing
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
 
Seeing Graphs - How to successfully visualize connected data
Seeing Graphs - How to successfully visualize connected dataSeeing Graphs - How to successfully visualize connected data
Seeing Graphs - How to successfully visualize connected data
 
10. Graph Databases
10. Graph Databases10. Graph Databases
10. Graph Databases
 
Database
DatabaseDatabase
Database
 
Large graph analysis using g mine system
Large graph analysis using g mine systemLarge graph analysis using g mine system
Large graph analysis using g mine system
 
Database Models, Client-Server Architecture, Distributed Database and Classif...
Database Models, Client-Server Architecture, Distributed Database and Classif...Database Models, Client-Server Architecture, Distributed Database and Classif...
Database Models, Client-Server Architecture, Distributed Database and Classif...
 
20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks
 
Graph analytic and machine learning
Graph analytic and machine learningGraph analytic and machine learning
Graph analytic and machine learning
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial
 
Data visualization
Data visualizationData visualization
Data visualization
 
Chapter 3.pptx
Chapter 3.pptxChapter 3.pptx
Chapter 3.pptx
 
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
Lambda Data Grid: An Agile Optical Platform for Grid Computing and Data-inten...
 
Openstreetmap
OpenstreetmapOpenstreetmap
Openstreetmap
 
Southwick britain gr_nsight_cmsi402-presentation_20140508
Southwick britain gr_nsight_cmsi402-presentation_20140508Southwick britain gr_nsight_cmsi402-presentation_20140508
Southwick britain gr_nsight_cmsi402-presentation_20140508
 
Scalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduceScalable Similarity-Based Neighborhood Methods with MapReduce
Scalable Similarity-Based Neighborhood Methods with MapReduce
 
Graph Database
Graph DatabaseGraph Database
Graph Database
 
Visualizing big data in the browser using spark
Visualizing big data in the browser using sparkVisualizing big data in the browser using spark
Visualizing big data in the browser using spark
 
Social Network Analysis Using Gephi
Social Network Analysis Using Gephi Social Network Analysis Using Gephi
Social Network Analysis Using Gephi
 

Mehr von Suraksha Sanghavi (6)

Case study
Case studyCase study
Case study
 
Sampling and its type
Sampling and its typeSampling and its type
Sampling and its type
 
Air pollution monitoring system
Air pollution monitoring systemAir pollution monitoring system
Air pollution monitoring system
 
An eye with an atm
An eye with an atmAn eye with an atm
An eye with an atm
 
Smart water tank
Smart water tankSmart water tank
Smart water tank
 
K tree
K  treeK  tree
K tree
 

Kürzlich hochgeladen

Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 

Kürzlich hochgeladen (20)

Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 

Case Study Of Webgraph

  • 2. 2 Internet/Web as Graphs • Graph of the physical layer with routers , computers etc as nodes and physical connections as edges – It is limited – Does not capture the graphical connections associated with the information on the Internet • Web Graph where nodes represent web pages and edges are associated with hyperlinks
  • 4. 4 Web Graph Considerations • Edges can be directed or undirected • Graph is highly dynamic – Nodes and edges are added/deleted often – Content of existing nodes is also subject to change – Pages and hyperlinks created on the fly • Apart from primary connected component there are also smaller disconnected components
  • 5. 5 Why the Web Graph? • Example of a large,dynamic and distributed graph • Possibly similar to other complex graphs in social, biological and other systems • Reflects how humans organize information (relevance, ranking) and their societies • Efficient navigation algorithms • Study behavior of users as they traverse the web graph (e-commerce)
  • 6. 6 Statistics of Interest • Size and connectivity of the graph • Number of connected components • Distribution of pages per site • Distribution of incoming and outgoing connections per site • Average and maximal length of the shortest path between any two vertices (diameter)
  • 7. 7 Properties of Web Graphs • Connectivity follows a power law distribution • The graph is sparse – |E| = O(n) or atleast o(n2 ) – Average number of hyperlinks per page roughly a constant • A small world graph
  • 8. 8 Small World Networks • It is a ‘small world’ – Millions of people. Yet, separated by “six degrees” of acquaintance relationships – Popularized by Milgram’s famous experiment • Mathematically – Diameter of graph is small (log N) as compared to overall size • 3. Property seems interesting given ‘sparse’ nature of graph but … • This property is ‘natural’ in ‘pure’ random graphs
  • 9. Conclusion The compression techniques are specializedspecialized for Web Graphs. The average link sizelink size decreases with the increase of the graph. The average link access timelink access time increases with the increase of the graph. 9
  • 10. Conclusion The compression techniques are specializedspecialized for Web Graphs. The average link sizelink size decreases with the increase of the graph. The average link access timelink access time increases with the increase of the graph. 9