SlideShare ist ein Scribd-Unternehmen logo
1 von 51
Downloaden Sie, um offline zu lesen
Intro to Web Science
September 19, 2013
ITWS 1100
John Erickson, Kristine Gloria, Qingpeng Zhang
Tetherless World Constellation
Rensselaer Polytechnic Institute
Agenda
1. A Science of The Web and why it matters
2. Web Architecture/Engineering the Web
3. Measuring the Web
4. The Web Science Method
5. Social Aspects of the Web
a. Evolution of methodology
b. Hurdles of incorporating the “social”
c. Why humans aren’t just "nodes" in a network
6. Web and other Governance
What is Web Science?
● Positions the World Wide Web as an object
of scientific study unto itself
● Recognizes the Web as a transformational,
disruptive technology
● Its practitioners focus on understanding the
Web...
○ ...its components, facets and characteristics
● The Web Science Method: “the process of
designing things in a very large space..."
What does Web Science ask?
● What processes have driven the Web’s
growth, and will they persist?
● How does large-scale structure emerge from
a simple set of protocols?
● How does the Web function as a socio-
technical system?
● What drives the viral uptake of certain Web
phenomena?
Bottom line: What might fragment the Web?
What is the Web?
● "The Web is not a thing..."
● Continuously changing due to coordinated
and conflicting processes
● An evolving large-scale structure
dependant on static and emerging protocols
● A socio-technical system that reflects and
obfuscates social and technical structures
● Always goes where we allow it to go...but
seldom where we want or expect it to go!
Les Carr, et.al. http://slidesha.re/142MFrV
Clare Hooper, et.al. http://bit.ly/R813sC
Web Architecture
It's quite simple, really! ;)
● A standard system for identifying resources
● Standard formats for representing
resources
● A standard protocol for exchanging
resources
Relevant core standards:
● URIs (URLs): Universal Resource Identifiers
● HTML: Hypertext Markup Language
● HTTP: Hypertext Transfer Protocol
Architecture of the World Wide Web, Volume One http://www.w3.org/TR/webarch/
Data Mining: Mapping the Blogosphere http://bit.ly/18MuXdD
Mapping the Internet http://bit.ly/18MuWWZ
Identifying Resources (1)
● A global identification system is essential
○ to share information about resources
○ to reason about resources
○ to modify or exchange resources
● "Resources" are anything that can be linked
to or spoken of
○ Documents, cat videos, people, ideas...
● Not all resources are "on" the Web
○ They might be referenced from the Web...
○ ...while not being retrievable from it
○ These are (so called) "information resources"
Les Carr, et.al. http://slidesha.re/142MFrV
Identifying Resources (2)
● A global standard is required; the URI is it
● Others systems are possible...
○ ...but added value of a single global system of
identifiers is high
○ Enables linking, bookmarking and other functions
across heterogeneous applications
● How are URI used?
○ All resources have URIs associated with them
○ Each URI identifies a single resource in a context-
independent manner
○ URIs act as names and (usually) addresses
○ In general URIs are "opaque"Uniform Resource Identifier (URI): Generic Syntax (RFC 3986) http://www.ietf.org/rfc/rfc3986.txt
Identifying Resources (4)
● "URIs identify and URLs locate..."
○ ...and identify
● URLs are URIs aligned with protocols
○ URLs include the "access mechanism" or "network
location", e.g. http:// or ftp://
○ How to "dereference" the URI and retrieve the thing
● URL examples
○ ftp://ftp.is.co.za/rfc/rfc1808.txt
○ http://www.ietf.org/rfc/rfc2396.txt
○ mailto:John.Doe@example.com
○ telnet://192.0.2.16:80/
Uniform Resource Identifier (URI): Generic Syntax (RFC 3986) http://www.ietf.org/rfc/rfc3986.txt
Representing Resources (1)
● Resources are manifest as digital files
● The Web recognizes a (growing) set of file
formats
○ The original and workhorse is HTML...
○ ...but there are many others
● Retrievable resources on the web serve
multiple purposes
○ Resources encode information and data
○ Resources aggregate links to other resources
● This is what makes The Web(tm) a "web..."
Resources (nodes)
aggregate links to
other resources to
create a Web
Retrieving Resources (1)
● Review: URIs that reference retrievable
resources -- URLs -- must specify a protocol
for retrieval
● The original and most common Web protocol
is HTTP
● Specialized protocols are possible but
resources may appear "off the grid..."
URIs, HTTP, many formats...
Principles for creating a healthy Web
Tim Berners-Lee http://www.w3.org/DesignIssues/LinkedData.html
● Use URIs as names for things
● Use HTTP URIs so people can "look up"
those names
● When someone "looks up" a URI, return
useful information
○ use the standards to do it
● Include links to other URIs, so the
Consumer can discover more things
○ People or applications
Why is linking important???
Implications of a well-connected
Web: Google PageRank
● Links to other nodes as a "vote" of quality
and/or relevance
PageRank https://en.wikipedia.org/wiki/PageRank
Measuring the Web
Web as a
Network
Router network through the Internet
Measuring the Web
● The rich variety of networks on the Web
○ Router network
○ Web page network (linking via hyperlinks)
○ Document network (citation network on DBLP*, etc.)
○ Social networks
■ Facebook: friendship, comment-reply, tag, and all
kinds of social relationship on Facebook
■ Twitter: follower, retweet, mention, reply, etc.
■ Blogosphere: friendship, visiting, comment, etc.
■ LinkedIn: colleague, classmate, etc.
■ Crowdsourcing: collaboration, co-worker, etc.
■ Other social media...
*The DBLP Computer Science Bibliography http://www.informatik.uni-trier.de/~ley/db/
Measuring the Web - Blogosphere
Political Blogosphere
2004 US Presidential Election
Bloggers:
Blue -Democrat
Red - Republican
Pink - Neutral
L. Adamic, N. Glance, The political blogosphere and the 2004 U.S. election:
divided they blog, LinkKDD’05
Measuring the Web: Recommender
System
Measuring the Web: Terrorism
Measuring the Web: Twitter
M. D. Conover, et al. Political Polarization on Twitter, ICWSM’11
Measuring the Web:
@olyerickson
Analyzing networks on the Web
Measure...
■ # of nodes
■ # of edges
■ Diameter and radius
■ Network density
■ Degree distribution
■ Clustering coefficient
■ Average shortest path length
■ Strongly/weakly connected components
■ Betweenness/Closeness centrality
■ Bow-tie structure
■ Community discovery
■ Key nodes discovery
■ etc...
Measuring the Web
“Bow-tie” structure
Overall view of the structure
of the Web
SCC
IN
OUT
Tendrils
Tubes
Disconnected
Measuring the Web
● It's a “Small World” after all...
○ Most pairs of pages separated by small # of links
○ Almost always by fewer than 20 links
○ "Diameter" of central core is 28, very small
compared to the size of the Web
○ Analysis suggests diameter will grow logarithmically
with the size of the Web (ie slowly)
○ Diameter of social networks decreases over time
● Conclusion: The Web is “smaller” than we thought!
● “Six degrees of separation” verified in Social Web
R. Albert, H. Jeong and A.-L. Barabasi, Diameter of the World Wide Web,
Nature 401 (1999) 130–131. http://bit.ly/18atsYA
J. Leskovec, etc. Graphs over Time: Densification Laws, Shrinking Diameters
and Possible Explanations, KDD (2005)
Measuring the Web
“Scale-free” property
Highly
Connected
Hubs
“Rich
get
richer”
A live model: http://ccl.northwestern.edu/netlogo/models/PreferentialAttachment
The Web Science Method
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(10)
The Web Science Method
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(10)
"Science"
"Engineering"
Applied to email...
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(16)
Applied to the original Web...
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(18)
Applied to Google's Web...
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(19)
Applied to Wikis...
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(20)
Applied to Blogs...
Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(21)
Social Aspects of the Web
"Visual complexity produces opacity.
Massive individualizing data produces
beautiful, playful hairballs which show us
nothing."
- Bruno Latour @ CHI2013
For discussion see, "What baboon notebooks, monads, state surveillance and network diagrams have
in common: Bruno Latour at CHI2013" http://bit.ly/14Y3d3u
Multiple disciplines, multiple
methods
● Given it’s multiple disciplines, the argument is for a mixed-
methods approach to measuring the web. This means both
quantitative AND qualitative methods should be employed
by researchers.
○ Pros: More robust, comprehensive understanding of
“human social behavior”
○ Cons: Diametrically opposed philosophies in data
gathering and analysis
● Unanswered questions:
○ Replicability
○ Bias
○ Objectivity and Accuracy
● Ethics
Web Governance, Security and Standards
Who should govern the
Web?
Intro to Web Science (Fall 2013)
Intro to Web Science (Fall 2013)
Intro to Web Science (Fall 2013)
Intro to Web Science (Fall 2013)
Intro to Web Science (Fall 2013)
Intro to Web Science (Fall 2013)
Intro to Web Science (Fall 2013)
Intro to Web Science (Fall 2013)
Web Science meets (Web) Governance?
Policy Design
Policy
Implementation
Analysis and
understanding of
policy
implications
Policy
Conception
Review...
1. A Science of The Web
2. Web Architecture
3. Measuring the Web
4. The Web Science Method
5. Social Aspects of the Web
6. Web and other Governance
Assignment:
1. Preferential Attachment Simulator: http://bit.ly/18bd0p2
○ Try the THINGS TO TRY!
2. Excel-based Network Analysis Tutorial
○ Following instructions at: http://bit.ly/1a3mtzW
○ Install NodeXL from: http://bit.ly/1a3mnbx
○ Use Senate 2007 data from: http://bit.ly/1a3mhAI
○ Play with other data at: http://bit.ly/1a3mJ1X
3. Social Network Exploration
○ Twitalizer: http://twitalyzer.com
○ TweetArchivist: http://tweetarchivist.com
○ MentionMap: http://mentionmapp.com
4. Create a Web Science Scenario:
○ Identify a (social) problem
○ Proposed an engineered solution
○ Identify how to measure, analyze, evaluate, iterate

Más contenido relacionado

Was ist angesagt?

Using Dataverse Virtual Archive Technology for Research Data Management
Using Dataverse Virtual Archive Technology for Research Data ManagementUsing Dataverse Virtual Archive Technology for Research Data Management
Using Dataverse Virtual Archive Technology for Research Data ManagementGary Wilhelm
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016Sebastian Hellmann
 
2011 05-02 linked data intro
2011 05-02 linked data intro2011 05-02 linked data intro
2011 05-02 linked data introvafopoulos
 
2011 05-01 linked data
2011 05-01 linked data2011 05-01 linked data
2011 05-01 linked datavafopoulos
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerWiLS
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesChristophe Guéret
 
Information Retrieval and Social Media
Information Retrieval and Social MediaInformation Retrieval and Social Media
Information Retrieval and Social MediaArjen de Vries
 
Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)Rebekah Cummings
 
Linked Data: A short(-ish) introduction
Linked Data: A short(-ish) introductionLinked Data: A short(-ish) introduction
Linked Data: A short(-ish) introductionPete Johnston
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataSebastian Hellmann
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataStefan Dietze
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital librariesSören Auer
 

Was ist angesagt? (20)

Using Dataverse Virtual Archive Technology for Research Data Management
Using Dataverse Virtual Archive Technology for Research Data ManagementUsing Dataverse Virtual Archive Technology for Research Data Management
Using Dataverse Virtual Archive Technology for Research Data Management
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016
 
2011 05-02 linked data intro
2011 05-02 linked data intro2011 05-02 linked data intro
2011 05-02 linked data intro
 
2011 05-01 linked data
2011 05-01 linked data2011 05-01 linked data
2011 05-01 linked data
 
Web RDF
Web RDFWeb RDF
Web RDF
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve Meyer
 
An Introduction to EZID
An Introduction to EZIDAn Introduction to EZID
An Introduction to EZID
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital Humanities
 
Sanderson Shout It Out: LOUD
Sanderson Shout It Out: LOUDSanderson Shout It Out: LOUD
Sanderson Shout It Out: LOUD
 
Information Retrieval and Social Media
Information Retrieval and Social MediaInformation Retrieval and Social Media
Information Retrieval and Social Media
 
Meadows apr28-1
Meadows apr28-1Meadows apr28-1
Meadows apr28-1
 
Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)
 
Linked Data: A short(-ish) introduction
Linked Data: A short(-ish) introductionLinked Data: A short(-ish) introduction
Linked Data: A short(-ish) introduction
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of Data
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Library Linked Data and the Future of Bibliographic Control
Library Linked Data and the Future of Bibliographic ControlLibrary Linked Data and the Future of Bibliographic Control
Library Linked Data and the Future of Bibliographic Control
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Demo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open DataDemo: Profiling & Exploration of Linked Open Data
Demo: Profiling & Exploration of Linked Open Data
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 

Andere mochten auch

Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...Balachandar Radhakrishnan
 
Tutorial 6 (web graph attributes)
Tutorial 6 (web graph attributes)Tutorial 6 (web graph attributes)
Tutorial 6 (web graph attributes)Kira
 
Wwsss intro2016-final
Wwsss intro2016-finalWwsss intro2016-final
Wwsss intro2016-finalSteffen Staab
 
Graph Structure In The Web
Graph Structure In The WebGraph Structure In The Web
Graph Structure In The Webdailyye
 
Graph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science TrackGraph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science TrackChris Bizer
 
Web Science Framework and InterDataNet
Web Science Framework and InterDataNetWeb Science Framework and InterDataNet
Web Science Framework and InterDataNetmaria chiara pettenati
 
4Science presentes: ORCiD API Tutorial
4Science presentes: ORCiD API Tutorial4Science presentes: ORCiD API Tutorial
4Science presentes: ORCiD API Tutorial4Science
 
ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...
ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...
ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...ORCID, Inc
 
Graphs, Edges & Nodes - Untangling the Social Web
Graphs, Edges & Nodes - Untangling the Social WebGraphs, Edges & Nodes - Untangling the Social Web
Graphs, Edges & Nodes - Untangling the Social WebJoël Perras
 
Case Study: Thomson Reuters
Case Study: Thomson ReutersCase Study: Thomson Reuters
Case Study: Thomson ReutersForgeRock
 
Yu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingYu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingECNOfficer
 
Casa design 2014
Casa design 2014Casa design 2014
Casa design 2014casadesign
 
Leptospirosis
LeptospirosisLeptospirosis
LeptospirosisDej8vu
 
INGORCEWA Y SUS ASSOCIADOS.
INGORCEWA Y SUS ASSOCIADOS.INGORCEWA Y SUS ASSOCIADOS.
INGORCEWA Y SUS ASSOCIADOS.Orcino Malivert
 
Akka testing
Akka testingAkka testing
Akka testingAvi Levi
 
Delaware Native Plants for Native Bees
Delaware Native Plants for Native BeesDelaware Native Plants for Native Bees
Delaware Native Plants for Native BeesKardatou54a
 

Andere mochten auch (20)

Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...Google Scholar and Web of Science: Similarities and Differences in Citation A...
Google Scholar and Web of Science: Similarities and Differences in Citation A...
 
Tutorial 6 (web graph attributes)
Tutorial 6 (web graph attributes)Tutorial 6 (web graph attributes)
Tutorial 6 (web graph attributes)
 
Wwsss intro2016-final
Wwsss intro2016-finalWwsss intro2016-final
Wwsss intro2016-final
 
Graph Structure In The Web
Graph Structure In The WebGraph Structure In The Web
Graph Structure In The Web
 
Graph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science TrackGraph Structure in the Web - Revisited. WWW2014 Web Science Track
Graph Structure in the Web - Revisited. WWW2014 Web Science Track
 
Web Science Framework and InterDataNet
Web Science Framework and InterDataNetWeb Science Framework and InterDataNet
Web Science Framework and InterDataNet
 
4Science presentes: ORCiD API Tutorial
4Science presentes: ORCiD API Tutorial4Science presentes: ORCiD API Tutorial
4Science presentes: ORCiD API Tutorial
 
ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...
ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...
ORCID in the research lifecycle,Thomson Reuters: Web of Science, Converis, In...
 
Graphs, Edges & Nodes - Untangling the Social Web
Graphs, Edges & Nodes - Untangling the Social WebGraphs, Edges & Nodes - Untangling the Social Web
Graphs, Edges & Nodes - Untangling the Social Web
 
Mendeley: On the Web and on Your Desktop
Mendeley:  On the Web and on Your DesktopMendeley:  On the Web and on Your Desktop
Mendeley: On the Web and on Your Desktop
 
Case Study: Thomson Reuters
Case Study: Thomson ReutersCase Study: Thomson Reuters
Case Study: Thomson Reuters
 
comments_daniels
comments_danielscomments_daniels
comments_daniels
 
Yu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingYu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasing
 
Casa design 2014
Casa design 2014Casa design 2014
Casa design 2014
 
Leptospirosis
LeptospirosisLeptospirosis
Leptospirosis
 
INGORCEWA Y SUS ASSOCIADOS.
INGORCEWA Y SUS ASSOCIADOS.INGORCEWA Y SUS ASSOCIADOS.
INGORCEWA Y SUS ASSOCIADOS.
 
Actian Vectorwise Brochure
Actian Vectorwise BrochureActian Vectorwise Brochure
Actian Vectorwise Brochure
 
Akka testing
Akka testingAkka testing
Akka testing
 
Delaware Native Plants for Native Bees
Delaware Native Plants for Native BeesDelaware Native Plants for Native Bees
Delaware Native Plants for Native Bees
 
Biossegurança
BiossegurançaBiossegurança
Biossegurança
 

Ähnlich wie Intro to Web Science (Fall 2013)

ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)Rensselaer Polytechnic Institute
 
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)Rensselaer Polytechnic Institute
 
The CSO Open Data Experience
The CSO Open Data ExperienceThe CSO Open Data Experience
The CSO Open Data ExperienceDublinked .
 
Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11
Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11
Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11William Hall
 
Analysis Of Hypertext Transfer Protocol, Http
Analysis Of Hypertext Transfer Protocol, HttpAnalysis Of Hypertext Transfer Protocol, Http
Analysis Of Hypertext Transfer Protocol, HttpLaurie Gunlicks
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RESChristophe Guéret
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked dataLaura Po
 
Social Semantic (Sensor) Web
Social Semantic (Sensor) WebSocial Semantic (Sensor) Web
Social Semantic (Sensor) WebDavid Crowley
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and TechniquesBernhard Haslhofer
 
RDFa From Theory to Practice
RDFa From Theory to PracticeRDFa From Theory to Practice
RDFa From Theory to PracticeAdrian Stevenson
 
UNIT-1.pptx that contains the web and internet of tecnology
UNIT-1.pptx that contains the web and internet of tecnologyUNIT-1.pptx that contains the web and internet of tecnology
UNIT-1.pptx that contains the web and internet of tecnologyrssvsa181514
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchFrancesca Di Donato
 

Ähnlich wie Intro to Web Science (Fall 2013) (20)

Intro to Web Science (Oct 2022)
Intro to Web Science (Oct 2022)Intro to Web Science (Oct 2022)
Intro to Web Science (Oct 2022)
 
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
 
ITWS Capstone: Engineering a Semantic Web (Fall 2022)
ITWS Capstone: Engineering a Semantic Web (Fall 2022)ITWS Capstone: Engineering a Semantic Web (Fall 2022)
ITWS Capstone: Engineering a Semantic Web (Fall 2022)
 
Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)
 
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
 
The CSO Open Data Experience
The CSO Open Data ExperienceThe CSO Open Data Experience
The CSO Open Data Experience
 
Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11
Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11
Episode 3(3): Birth & explosion of the World Wide Web - Meetup session11
 
Analysis Of Hypertext Transfer Protocol, Http
Analysis Of Hypertext Transfer Protocol, HttpAnalysis Of Hypertext Transfer Protocol, Http
Analysis Of Hypertext Transfer Protocol, Http
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Ld4 dh tutorial
Ld4 dh tutorialLd4 dh tutorial
Ld4 dh tutorial
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RES
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Semantic Linked Data
Semantic Linked DataSemantic Linked Data
Semantic Linked Data
 
Social Semantic (Sensor) Web
Social Semantic (Sensor) WebSocial Semantic (Sensor) Web
Social Semantic (Sensor) Web
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
RDFa From Theory to Practice
RDFa From Theory to PracticeRDFa From Theory to Practice
RDFa From Theory to Practice
 
UNIT-1.pptx that contains the web and internet of tecnology
UNIT-1.pptx that contains the web and internet of tecnologyUNIT-1.pptx that contains the web and internet of tecnology
UNIT-1.pptx that contains the web and internet of tecnology
 
Web Information Systems Introduction and Origin of World Wide Web
Web Information Systems Introduction and Origin of World Wide WebWeb Information Systems Introduction and Origin of World Wide Web
Web Information Systems Introduction and Origin of World Wide Web
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for research
 
Here Comes Everything
Here Comes EverythingHere Comes Everything
Here Comes Everything
 

Mehr von Rensselaer Polytechnic Institute

Mehr von Rensselaer Polytechnic Institute (6)

ITWS Capstone (RPI, Fall 2013)
ITWS Capstone (RPI, Fall 2013)ITWS Capstone (RPI, Fall 2013)
ITWS Capstone (RPI, Fall 2013)
 
ITWS Capstone Lecture (Spring 2013)
ITWS Capstone Lecture (Spring 2013)ITWS Capstone Lecture (Spring 2013)
ITWS Capstone Lecture (Spring 2013)
 
The Semantic Web: RPI ITWS Capstone (Fall 2012)
The Semantic Web: RPI ITWS Capstone (Fall 2012)The Semantic Web: RPI ITWS Capstone (Fall 2012)
The Semantic Web: RPI ITWS Capstone (Fall 2012)
 
First they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and UsedFirst they have to find it: Getting Open Government Data Discovered and Used
First they have to find it: Getting Open Government Data Discovered and Used
 
Where is the World is my Open Government Data?
Where is the World is my Open Government Data?Where is the World is my Open Government Data?
Where is the World is my Open Government Data?
 
The Future of DSpace: Making it Personal (Making it Social)
The Future of DSpace: Making it Personal (Making it Social)The Future of DSpace: Making it Personal (Making it Social)
The Future of DSpace: Making it Personal (Making it Social)
 

Último

AI Workshops at Computers In Libraries 2024
AI Workshops at Computers In Libraries 2024AI Workshops at Computers In Libraries 2024
AI Workshops at Computers In Libraries 2024Brian Pichman
 
Stobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through TokenizationStobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through TokenizationStobox
 
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENTSIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENTxtailishbaloch
 
Explore the UiPath Community and ways you can benefit on your journey to auto...
Explore the UiPath Community and ways you can benefit on your journey to auto...Explore the UiPath Community and ways you can benefit on your journey to auto...
Explore the UiPath Community and ways you can benefit on your journey to auto...DianaGray10
 
IT Service Management (ITSM) Best Practices for Advanced Computing
IT Service Management (ITSM) Best Practices for Advanced ComputingIT Service Management (ITSM) Best Practices for Advanced Computing
IT Service Management (ITSM) Best Practices for Advanced ComputingMAGNIntelligence
 
UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4DianaGray10
 
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdfQ4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdfTejal81
 
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptxEmil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptxNeo4j
 
Introduction - IPLOOK NETWORKS CO., LTD.
Introduction - IPLOOK NETWORKS CO., LTD.Introduction - IPLOOK NETWORKS CO., LTD.
Introduction - IPLOOK NETWORKS CO., LTD.IPLOOK Networks
 
UiPath Studio Web workshop series - Day 1
UiPath Studio Web workshop series  - Day 1UiPath Studio Web workshop series  - Day 1
UiPath Studio Web workshop series - Day 1DianaGray10
 
Introduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its applicationIntroduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its applicationKnoldus Inc.
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
 
.NET 8 ChatBot with Azure OpenAI Services.pptx
.NET 8 ChatBot with Azure OpenAI Services.pptx.NET 8 ChatBot with Azure OpenAI Services.pptx
.NET 8 ChatBot with Azure OpenAI Services.pptxHansamali Gamage
 
Graphene Quantum Dots-Based Composites for Biomedical Applications
Graphene Quantum Dots-Based Composites for  Biomedical ApplicationsGraphene Quantum Dots-Based Composites for  Biomedical Applications
Graphene Quantum Dots-Based Composites for Biomedical Applicationsnooralam814309
 
Keep Your Finger on the Pulse of Your Building's Performance with IES Live
Keep Your Finger on the Pulse of Your Building's Performance with IES LiveKeep Your Finger on the Pulse of Your Building's Performance with IES Live
Keep Your Finger on the Pulse of Your Building's Performance with IES LiveIES VE
 
Scenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenariosScenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenariosErol GIRAUDY
 
How to become a GDSC Lead GDSC MI AOE.pptx
How to become a GDSC Lead GDSC MI AOE.pptxHow to become a GDSC Lead GDSC MI AOE.pptx
How to become a GDSC Lead GDSC MI AOE.pptxKaustubhBhavsar6
 
Top 10 Squarespace Development Companies
Top 10 Squarespace Development CompaniesTop 10 Squarespace Development Companies
Top 10 Squarespace Development CompaniesTopCSSGallery
 
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedInOutage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedInThousandEyes
 
Planetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile BrochurePlanetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile BrochurePlanetek Italia Srl
 

Último (20)

AI Workshops at Computers In Libraries 2024
AI Workshops at Computers In Libraries 2024AI Workshops at Computers In Libraries 2024
AI Workshops at Computers In Libraries 2024
 
Stobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through TokenizationStobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
Stobox 4: Revolutionizing Investment in Real-World Assets Through Tokenization
 
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENTSIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
SIM INFORMATION SYSTEM: REVOLUTIONIZING DATA MANAGEMENT
 
Explore the UiPath Community and ways you can benefit on your journey to auto...
Explore the UiPath Community and ways you can benefit on your journey to auto...Explore the UiPath Community and ways you can benefit on your journey to auto...
Explore the UiPath Community and ways you can benefit on your journey to auto...
 
IT Service Management (ITSM) Best Practices for Advanced Computing
IT Service Management (ITSM) Best Practices for Advanced ComputingIT Service Management (ITSM) Best Practices for Advanced Computing
IT Service Management (ITSM) Best Practices for Advanced Computing
 
UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4
 
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdfQ4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
Q4 2023 Quarterly Investor Presentation - FINAL - v1.pdf
 
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptxEmil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
Emil Eifrem at GraphSummit Copenhagen 2024 - The Art of the Possible.pptx
 
Introduction - IPLOOK NETWORKS CO., LTD.
Introduction - IPLOOK NETWORKS CO., LTD.Introduction - IPLOOK NETWORKS CO., LTD.
Introduction - IPLOOK NETWORKS CO., LTD.
 
UiPath Studio Web workshop series - Day 1
UiPath Studio Web workshop series  - Day 1UiPath Studio Web workshop series  - Day 1
UiPath Studio Web workshop series - Day 1
 
Introduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its applicationIntroduction to RAG (Retrieval Augmented Generation) and its application
Introduction to RAG (Retrieval Augmented Generation) and its application
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
.NET 8 ChatBot with Azure OpenAI Services.pptx
.NET 8 ChatBot with Azure OpenAI Services.pptx.NET 8 ChatBot with Azure OpenAI Services.pptx
.NET 8 ChatBot with Azure OpenAI Services.pptx
 
Graphene Quantum Dots-Based Composites for Biomedical Applications
Graphene Quantum Dots-Based Composites for  Biomedical ApplicationsGraphene Quantum Dots-Based Composites for  Biomedical Applications
Graphene Quantum Dots-Based Composites for Biomedical Applications
 
Keep Your Finger on the Pulse of Your Building's Performance with IES Live
Keep Your Finger on the Pulse of Your Building's Performance with IES LiveKeep Your Finger on the Pulse of Your Building's Performance with IES Live
Keep Your Finger on the Pulse of Your Building's Performance with IES Live
 
Scenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenariosScenario Library et REX Discover industry- and role- based scenarios
Scenario Library et REX Discover industry- and role- based scenarios
 
How to become a GDSC Lead GDSC MI AOE.pptx
How to become a GDSC Lead GDSC MI AOE.pptxHow to become a GDSC Lead GDSC MI AOE.pptx
How to become a GDSC Lead GDSC MI AOE.pptx
 
Top 10 Squarespace Development Companies
Top 10 Squarespace Development CompaniesTop 10 Squarespace Development Companies
Top 10 Squarespace Development Companies
 
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedInOutage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
Outage Analysis: March 5th/6th 2024 Meta, Comcast, and LinkedIn
 
Planetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile BrochurePlanetek Italia Srl - Corporate Profile Brochure
Planetek Italia Srl - Corporate Profile Brochure
 

Intro to Web Science (Fall 2013)

  • 1. Intro to Web Science September 19, 2013 ITWS 1100 John Erickson, Kristine Gloria, Qingpeng Zhang Tetherless World Constellation Rensselaer Polytechnic Institute
  • 2. Agenda 1. A Science of The Web and why it matters 2. Web Architecture/Engineering the Web 3. Measuring the Web 4. The Web Science Method 5. Social Aspects of the Web a. Evolution of methodology b. Hurdles of incorporating the “social” c. Why humans aren’t just "nodes" in a network 6. Web and other Governance
  • 3. What is Web Science? ● Positions the World Wide Web as an object of scientific study unto itself ● Recognizes the Web as a transformational, disruptive technology ● Its practitioners focus on understanding the Web... ○ ...its components, facets and characteristics ● The Web Science Method: “the process of designing things in a very large space..."
  • 4. What does Web Science ask? ● What processes have driven the Web’s growth, and will they persist? ● How does large-scale structure emerge from a simple set of protocols? ● How does the Web function as a socio- technical system? ● What drives the viral uptake of certain Web phenomena? Bottom line: What might fragment the Web?
  • 5. What is the Web? ● "The Web is not a thing..." ● Continuously changing due to coordinated and conflicting processes ● An evolving large-scale structure dependant on static and emerging protocols ● A socio-technical system that reflects and obfuscates social and technical structures ● Always goes where we allow it to go...but seldom where we want or expect it to go! Les Carr, et.al. http://slidesha.re/142MFrV
  • 6. Clare Hooper, et.al. http://bit.ly/R813sC
  • 7. Web Architecture It's quite simple, really! ;) ● A standard system for identifying resources ● Standard formats for representing resources ● A standard protocol for exchanging resources Relevant core standards: ● URIs (URLs): Universal Resource Identifiers ● HTML: Hypertext Markup Language ● HTTP: Hypertext Transfer Protocol
  • 8. Architecture of the World Wide Web, Volume One http://www.w3.org/TR/webarch/
  • 9. Data Mining: Mapping the Blogosphere http://bit.ly/18MuXdD
  • 10. Mapping the Internet http://bit.ly/18MuWWZ
  • 11. Identifying Resources (1) ● A global identification system is essential ○ to share information about resources ○ to reason about resources ○ to modify or exchange resources ● "Resources" are anything that can be linked to or spoken of ○ Documents, cat videos, people, ideas... ● Not all resources are "on" the Web ○ They might be referenced from the Web... ○ ...while not being retrievable from it ○ These are (so called) "information resources" Les Carr, et.al. http://slidesha.re/142MFrV
  • 12. Identifying Resources (2) ● A global standard is required; the URI is it ● Others systems are possible... ○ ...but added value of a single global system of identifiers is high ○ Enables linking, bookmarking and other functions across heterogeneous applications ● How are URI used? ○ All resources have URIs associated with them ○ Each URI identifies a single resource in a context- independent manner ○ URIs act as names and (usually) addresses ○ In general URIs are "opaque"Uniform Resource Identifier (URI): Generic Syntax (RFC 3986) http://www.ietf.org/rfc/rfc3986.txt
  • 13. Identifying Resources (4) ● "URIs identify and URLs locate..." ○ ...and identify ● URLs are URIs aligned with protocols ○ URLs include the "access mechanism" or "network location", e.g. http:// or ftp:// ○ How to "dereference" the URI and retrieve the thing ● URL examples ○ ftp://ftp.is.co.za/rfc/rfc1808.txt ○ http://www.ietf.org/rfc/rfc2396.txt ○ mailto:John.Doe@example.com ○ telnet://192.0.2.16:80/ Uniform Resource Identifier (URI): Generic Syntax (RFC 3986) http://www.ietf.org/rfc/rfc3986.txt
  • 14. Representing Resources (1) ● Resources are manifest as digital files ● The Web recognizes a (growing) set of file formats ○ The original and workhorse is HTML... ○ ...but there are many others ● Retrievable resources on the web serve multiple purposes ○ Resources encode information and data ○ Resources aggregate links to other resources ● This is what makes The Web(tm) a "web..."
  • 15. Resources (nodes) aggregate links to other resources to create a Web
  • 16. Retrieving Resources (1) ● Review: URIs that reference retrievable resources -- URLs -- must specify a protocol for retrieval ● The original and most common Web protocol is HTTP ● Specialized protocols are possible but resources may appear "off the grid..."
  • 17. URIs, HTTP, many formats...
  • 18. Principles for creating a healthy Web Tim Berners-Lee http://www.w3.org/DesignIssues/LinkedData.html ● Use URIs as names for things ● Use HTTP URIs so people can "look up" those names ● When someone "looks up" a URI, return useful information ○ use the standards to do it ● Include links to other URIs, so the Consumer can discover more things ○ People or applications Why is linking important???
  • 19. Implications of a well-connected Web: Google PageRank ● Links to other nodes as a "vote" of quality and/or relevance PageRank https://en.wikipedia.org/wiki/PageRank
  • 20. Measuring the Web Web as a Network Router network through the Internet
  • 21. Measuring the Web ● The rich variety of networks on the Web ○ Router network ○ Web page network (linking via hyperlinks) ○ Document network (citation network on DBLP*, etc.) ○ Social networks ■ Facebook: friendship, comment-reply, tag, and all kinds of social relationship on Facebook ■ Twitter: follower, retweet, mention, reply, etc. ■ Blogosphere: friendship, visiting, comment, etc. ■ LinkedIn: colleague, classmate, etc. ■ Crowdsourcing: collaboration, co-worker, etc. ■ Other social media... *The DBLP Computer Science Bibliography http://www.informatik.uni-trier.de/~ley/db/
  • 22. Measuring the Web - Blogosphere Political Blogosphere 2004 US Presidential Election Bloggers: Blue -Democrat Red - Republican Pink - Neutral L. Adamic, N. Glance, The political blogosphere and the 2004 U.S. election: divided they blog, LinkKDD’05
  • 23. Measuring the Web: Recommender System
  • 24. Measuring the Web: Terrorism
  • 25. Measuring the Web: Twitter M. D. Conover, et al. Political Polarization on Twitter, ICWSM’11
  • 27. Analyzing networks on the Web Measure... ■ # of nodes ■ # of edges ■ Diameter and radius ■ Network density ■ Degree distribution ■ Clustering coefficient ■ Average shortest path length ■ Strongly/weakly connected components ■ Betweenness/Closeness centrality ■ Bow-tie structure ■ Community discovery ■ Key nodes discovery ■ etc...
  • 28. Measuring the Web “Bow-tie” structure Overall view of the structure of the Web SCC IN OUT Tendrils Tubes Disconnected
  • 29. Measuring the Web ● It's a “Small World” after all... ○ Most pairs of pages separated by small # of links ○ Almost always by fewer than 20 links ○ "Diameter" of central core is 28, very small compared to the size of the Web ○ Analysis suggests diameter will grow logarithmically with the size of the Web (ie slowly) ○ Diameter of social networks decreases over time ● Conclusion: The Web is “smaller” than we thought! ● “Six degrees of separation” verified in Social Web R. Albert, H. Jeong and A.-L. Barabasi, Diameter of the World Wide Web, Nature 401 (1999) 130–131. http://bit.ly/18atsYA J. Leskovec, etc. Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations, KDD (2005)
  • 30. Measuring the Web “Scale-free” property Highly Connected Hubs “Rich get richer” A live model: http://ccl.northwestern.edu/netlogo/models/PreferentialAttachment
  • 31. The Web Science Method Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(10)
  • 32. The Web Science Method Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(10) "Science" "Engineering"
  • 33. Applied to email... Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(16)
  • 34. Applied to the original Web... Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(18)
  • 35. Applied to Google's Web... Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(19)
  • 36. Applied to Wikis... Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(20)
  • 37. Applied to Blogs... Berners-Lee, T. (2007). W3C. http://www.w3.org/2007/Talks/0509-www-keynote-tbl/#(21)
  • 38. Social Aspects of the Web "Visual complexity produces opacity. Massive individualizing data produces beautiful, playful hairballs which show us nothing." - Bruno Latour @ CHI2013 For discussion see, "What baboon notebooks, monads, state surveillance and network diagrams have in common: Bruno Latour at CHI2013" http://bit.ly/14Y3d3u
  • 39. Multiple disciplines, multiple methods ● Given it’s multiple disciplines, the argument is for a mixed- methods approach to measuring the web. This means both quantitative AND qualitative methods should be employed by researchers. ○ Pros: More robust, comprehensive understanding of “human social behavior” ○ Cons: Diametrically opposed philosophies in data gathering and analysis ● Unanswered questions: ○ Replicability ○ Bias ○ Objectivity and Accuracy ● Ethics
  • 40. Web Governance, Security and Standards Who should govern the Web?
  • 49. Web Science meets (Web) Governance? Policy Design Policy Implementation Analysis and understanding of policy implications Policy Conception
  • 50. Review... 1. A Science of The Web 2. Web Architecture 3. Measuring the Web 4. The Web Science Method 5. Social Aspects of the Web 6. Web and other Governance
  • 51. Assignment: 1. Preferential Attachment Simulator: http://bit.ly/18bd0p2 ○ Try the THINGS TO TRY! 2. Excel-based Network Analysis Tutorial ○ Following instructions at: http://bit.ly/1a3mtzW ○ Install NodeXL from: http://bit.ly/1a3mnbx ○ Use Senate 2007 data from: http://bit.ly/1a3mhAI ○ Play with other data at: http://bit.ly/1a3mJ1X 3. Social Network Exploration ○ Twitalizer: http://twitalyzer.com ○ TweetArchivist: http://tweetarchivist.com ○ MentionMap: http://mentionmapp.com 4. Create a Web Science Scenario: ○ Identify a (social) problem ○ Proposed an engineered solution ○ Identify how to measure, analyze, evaluate, iterate