SlideShare a Scribd company logo
1 of 58
10 Jahre Web Science
Ein Blick auf die nächsten 10 Jahre
http://www.webscience.org/webscience10/tv-channel-webscience10/
Steffen Staab
Chair of WSTNet
WAIS Univ. of
Southampton
WeST Univ. of Koblenz-
Landau
Wolfgang Nejdl
L3S
Leibniz Universität
Hannover
Nikolaus Forgó
L3S
Leibniz Universität
Hannover
World Wide Web
Work
Dating
17% marriages in US due to
online dating
Traveling
Learning
Leisure
Science
Open access papers cited
more often 11:7
https://flic.kr/p/F37KoU
Web Science Network of Laboratories
Wendy Hall - CeBIT 2013
10 Jahre Web Science Research
Initiative
Keynotes
• Ricardo Baeza-Yatest, Yahoo!
• Andrew Tomkins, Google
• Daniel Olmedilla, Facebook
• Jure Leskovec, Stanford & Pinterest
• Daniel Miller, UCL, ERC Grant
„Social Network Sites and Social
Science”
• Helen Margetts, Oxford Internet
Institute
Panels
10 Years of Web Science
Computational Social Science
Privacy and Internet Governance
8th ACM Web Science Conference 2016
22.-25. Mai 2016 in Hannover
http://websci17.org/
Troy, NY, USA, 26-18 June 2017
WWSSS – WSTNet Web Science Summer School
Koblenz 2016 St. Petersburg 2017
30/11/16 8Thomas Risse
Next one: St. Petersburg, July 2017
Web Science
the grand challenge Web Science Observatory
Researchers around the world
gathering and sharing data
and evidence
Sharing tools, methods and
techniques
Web Science Collaboratories
Longitudinal studies
Wendy Hall - CeBIT 2013
Spam
Attack on Copts
Gun running from Sudan
Verlieren wir die
Vergangenheit des Web?
ALEXANDRIA (ERC Advanced Grant, 2.5 Mio. Euro)
World Wide Web – Digitales Erbe der Gesellschaft
 Was bleibt vom Web in 100 / 1000 Jahren, wenn es
niemand bewahrt?
 Datensammlung durch Deutsche
Nationalbibliothek, British Library, Internet
Archive, u.a.
 Suche und Analyse durch ALEXANDRIA
 Entwicklung neuer Modelle und Algorithmen, die
es
ermöglichen, nicht nur auf die Gegenwart,
sondern auch auf die Vergangenheit des Web
zuzugreifen
Semantische und zeitliche Suche nach Rudolph Giuliani
1997
2000
2006
2014
Mayoral
Campaign
Mayoral
Campaign
Mayoral
Campaign 9/11
Post politics
endeavours
Senate,
Cancer,
Allegations
NumberofDocuments
Mayor
SoBigData - Social Mining & Big Data Ecosystem
Big Data Analytics & Social Mining
as a tool to measure,
understand and possibly
predict human behavior
Research infrastructure (RI) for
ethic-sensitive scientific
discoveries and advanced
applications of social data
mining to the various
dimensions of social life, as
recorded by “big data”.
Integrating key national infrastructures and
centers of excellence
CNR & Uni Pisa (SoBigData.it)
Social Data
Big Data Analytics and Social Mining Services
Uni Hannover/L3S (Alexandria)
German Web Archive (80 TB)
Services and expertise on Web Archives
Uni Sheffield (GATE Cloud)
Natural language processing and text mining
FhG IGD & FhG IAIS
Information Visualization and Visual Analytics
Aalto University
Data, services and competences on
social network analysis
Uni Tartu (E-Gov.data)
Estonian e-government and ehealth data
ETH Zürich:
Search engine for Open Data
1st Call SoBigData-funded Transnational Access
Forschungsaufenthalte (bis zu 2 Monate) bei SoBigData Partnern
zu den Themen:
* City of Citizens * Well-being and Economy
* Societal Debates * Migration Studies
Tracking User Behavior
About 75% of websites track user behavior across sites.
[Zhonghao Yu et al. WWW16]
Bias in the
Data
Bias in the
Algorithm
Bias in the
Social Machine
WebObservatory
Observing Bias in Social Networks
(Lerman et al 15)
Part of US election/Brexit misprediction?
Check out: http://www.kdnuggets.com/2016/07/big-data-bible-codes-bonferroni.html
„Torture the data, and it will confess to anything.“
Ronald Coase, economist, Nobel Prize Laureate.
Bonferroni Effect
DemocraZy
WashingtonPost
http://wpo.st/5WdH2
Reality Sensing, Mining and Augmentation for Mobile
Citizen-eGovernment Dialogue
Web for Everyone
Uber, the world‘s largest taxi company, owns no vehicles.
Facebook ...most popular media owner, creates no content.
Alibaba, the most valuable retailer, has no inventory.
Airbnb... largest accommodation provider, owns no real estate.
Data Oligopolists
Uber Whom do you take a ride with?
- the right picture – also for online dating ...
Facebook Which source do you trust?
- rumor checks change the trust ....
Alibaba Whom do you trust to buy from?
- others‘ ratings
Airbnb Whom do you want for a sleepover?
Vertrauen
Das Recht
27
Vor langer Zeit …
1984 won‘t be like 1984
Quelle: http://oldcomputers.net/macintosh.html
1981 (1987) – Volkszählung
Seither …
Computer überall
Trends
Cloud Mobile
Social Big
Trends
Gratismentalität Kontrollverlust
If the product is for free, you are the product
Zwei große Erzählungen
(1)
Digitale Agenda (2010 ff.)
Diagnose
30% of Europeans have still never used the internet;
Europe has only 1% penetration of fibre-based high-speed
networks whereas (Japan 12%, South Korea is at 15%)
EU spending on ICT research and development stands at only
40% of US levels.
Four times as many legal music downloads in the US as in the
EU
(2)
Europeans have a long tradition of declaring abstract
privacy rights in theory that they fail to enforce in practice.
Neuordnung des europäischen (Datenschutz)rechts
47
01/2012
Zentrale Versprechen
One Continent, one Law
Internetfit
Aber
Diskussion im Parlament
Albrecht: 350 Änderungsvorschläge
3.133 Änderungsanträge
Aber
Themen
Boundless Informant
Genie
XKeyScore
…
FAIRVIEW
Tempora
BULLRUN
Mail
Isolation
Control and
Tracking
PRISM
Und heute …
54
https://isc.sans.edu/diary/Port+7547+SOAP+Remote+Code+Execution+Attack+Against+DSL+Modems/21759
55
In particular, Austria is
experiencing a strong increase in
TR-069 traffic within the last 24
hours.
Ergebnis
04/2016
Broken Law
Delay
Unclarity/
Complexity
Speed
Irrelevance
Fragmentation
Web Science – The next 10 Years
Social challenges
 Discrimination
 Trust
 Moral AI
Legal challenges
 regulation of
infrastructure
for economic competition
 tracking everywhere
Political challenges
 Misinformation
 Participation
 Internet governance
Technical challenges
 Artificial Intelligence
 Security
 ...

More Related Content

What's hot

Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3
SMCFrance
 
Timeline History of Internet
Timeline History of InternetTimeline History of Internet
Timeline History of Internet
gladyslising
 
All about internet2 (1)
All about internet2 (1)All about internet2 (1)
All about internet2 (1)
troye8418
 
Download PPT file
Download PPT fileDownload PPT file
Download PPT file
Videoguy
 

What's hot (20)

Critical Data Studies in the Academy
Critical Data Studies in the AcademyCritical Data Studies in the Academy
Critical Data Studies in the Academy
 
The future of the internet: version 4
The future of the internet: version 4The future of the internet: version 4
The future of the internet: version 4
 
Internet benefits and pitfalls
Internet   benefits and pitfallsInternet   benefits and pitfalls
Internet benefits and pitfalls
 
New Data `New Computation
New Data `New ComputationNew Data `New Computation
New Data `New Computation
 
Open Government Data for transparency, innovation and public engagement in so...
Open Government Data for transparency, innovation and public engagement in so...Open Government Data for transparency, innovation and public engagement in so...
Open Government Data for transparency, innovation and public engagement in so...
 
Homelessness Data Discussion
Homelessness Data DiscussionHomelessness Data Discussion
Homelessness Data Discussion
 
The persistent environmental digital divide(s) -RGS-IBG 2018
The persistent environmental digital divide(s) -RGS-IBG 2018The persistent environmental digital divide(s) -RGS-IBG 2018
The persistent environmental digital divide(s) -RGS-IBG 2018
 
Short and Long of Data Driven Innovation
Short and Long of Data Driven InnovationShort and Long of Data Driven Innovation
Short and Long of Data Driven Innovation
 
The Future of the Internet
The Future of the Internet The Future of the Internet
The Future of the Internet
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3
 
Timeline History of Internet
Timeline History of InternetTimeline History of Internet
Timeline History of Internet
 
Community Data Program Submitted letter to Open Government Partneship
Community Data Program Submitted letter to Open Government PartneshipCommunity Data Program Submitted letter to Open Government Partneship
Community Data Program Submitted letter to Open Government Partneship
 
Living online
Living onlineLiving online
Living online
 
Data! Action! Data journalism issues to watch in the next 10 years
Data! Action! Data journalism issues to watch in the next 10 yearsData! Action! Data journalism issues to watch in the next 10 years
Data! Action! Data journalism issues to watch in the next 10 years
 
Web tech evol
Web tech evolWeb tech evol
Web tech evol
 
Teaching AI in data journalism
Teaching AI in data journalismTeaching AI in data journalism
Teaching AI in data journalism
 
All about internet2 (1)
All about internet2 (1)All about internet2 (1)
All about internet2 (1)
 
Big Data Challenges for the Social Sciences
Big Data Challenges for the Social SciencesBig Data Challenges for the Social Sciences
Big Data Challenges for the Social Sciences
 
What is internet
What is internetWhat is internet
What is internet
 
Download PPT file
Download PPT fileDownload PPT file
Download PPT file
 

Similar to 10 Jahre Web Science

Similar to 10 Jahre Web Science (20)

British Academy - SoBigData - ppt.ppt
British Academy - SoBigData - ppt.pptBritish Academy - SoBigData - ppt.ppt
British Academy - SoBigData - ppt.ppt
 
Web Science Intro Session-Spring2023.pptx
Web Science Intro Session-Spring2023.pptxWeb Science Intro Session-Spring2023.pptx
Web Science Intro Session-Spring2023.pptx
 
Digital Trails Dave King 1 5 10 Part 1 D3
Digital Trails   Dave King   1 5 10   Part 1 D3Digital Trails   Dave King   1 5 10   Part 1 D3
Digital Trails Dave King 1 5 10 Part 1 D3
 
The Internet is shaping the future
The Internet is shaping the futureThe Internet is shaping the future
The Internet is shaping the future
 
Web Observatories and e-Research
Web Observatories and e-ResearchWeb Observatories and e-Research
Web Observatories and e-Research
 
Big Data and Social Machines
Big Data and Social MachinesBig Data and Social Machines
Big Data and Social Machines
 
Internet Safety And The Internet
Internet Safety And The InternetInternet Safety And The Internet
Internet Safety And The Internet
 
European librarians theatre - Social Media Spotlight
European librarians theatre - Social Media SpotlightEuropean librarians theatre - Social Media Spotlight
European librarians theatre - Social Media Spotlight
 
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON THE HISTORY OF THE INTERNET
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON  THE HISTORY OF THE INTERNETCOMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON  THE HISTORY OF THE INTERNET
COMPLETE GUIDE ON WRITING A PROFICIENT ESSAY ON THE HISTORY OF THE INTERNET
 
Big Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic WebBig Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic Web
 
ISWC 2013 Tutorial on the Web of Things
ISWC 2013 Tutorial on the Web of ThingsISWC 2013 Tutorial on the Web of Things
ISWC 2013 Tutorial on the Web of Things
 
Internet and Bioinformatics for Biologists
Internet and Bioinformatics for BiologistsInternet and Bioinformatics for Biologists
Internet and Bioinformatics for Biologists
 
e-Research: A Social Informatics Perspective
e-Research: A Social Informatics Perspectivee-Research: A Social Informatics Perspective
e-Research: A Social Informatics Perspective
 
Internet
InternetInternet
Internet
 
The Future of the Internet
The Future of the InternetThe Future of the Internet
The Future of the Internet
 
citizens scale scholarly social machines
citizens scale scholarly social machinescitizens scale scholarly social machines
citizens scale scholarly social machines
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Presentation internet programming report
Presentation internet programming reportPresentation internet programming report
Presentation internet programming report
 
Introduction to the Social Data on the Web Workshop
Introduction to the Social Data on the Web WorkshopIntroduction to the Social Data on the Web Workshop
Introduction to the Social Data on the Web Workshop
 
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social MachinesDigital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
 

More from Steffen Staab

More from Steffen Staab (20)

Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Knowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sureKnowledge graphs for knowing more and knowing for sure
Knowledge graphs for knowing more and knowing for sure
 
Symbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine LearningSymbolic Background Knowledge for Machine Learning
Symbolic Background Knowledge for Machine Learning
 
Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...
Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...
Soziale Netzwerke und Medien: Multi-disziplinäre Ansätze für ein multi-dimens...
 
Web Futures: Inclusive, Intelligent, Sustainable
Web Futures: Inclusive, Intelligent, SustainableWeb Futures: Inclusive, Intelligent, Sustainable
Web Futures: Inclusive, Intelligent, Sustainable
 
Eyeing the Web
Eyeing the WebEyeing the Web
Eyeing the Web
 
Concepts in Application Context ( How we may think conceptually )
Concepts in Application Context ( How we may think conceptually )Concepts in Application Context ( How we may think conceptually )
Concepts in Application Context ( How we may think conceptually )
 
Storing and Querying Semantic Data in the Cloud
Storing and Querying Semantic Data in the CloudStoring and Querying Semantic Data in the Cloud
Storing and Querying Semantic Data in the Cloud
 
Semantics reloaded
Semantics reloadedSemantics reloaded
Semantics reloaded
 
Ontologien und Semantic Web - Impulsvortrag Terminologietag
Ontologien und Semantic Web - Impulsvortrag TerminologietagOntologien und Semantic Web - Impulsvortrag Terminologietag
Ontologien und Semantic Web - Impulsvortrag Terminologietag
 
Opinion Formation and Spreading
Opinion Formation and SpreadingOpinion Formation and Spreading
Opinion Formation and Spreading
 
The Web We Want
The Web We WantThe Web We Want
The Web We Want
 
(Semi-)Automatic analysis of online contents
(Semi-)Automatic analysis of online contents(Semi-)Automatic analysis of online contents
(Semi-)Automatic analysis of online contents
 
Programming with Semantic Broad Data
Programming with Semantic Broad DataProgramming with Semantic Broad Data
Programming with Semantic Broad Data
 
Text Mining using LDA with Context
Text Mining using LDA with ContextText Mining using LDA with Context
Text Mining using LDA with Context
 
Wwsss intro2016-final
Wwsss intro2016-finalWwsss intro2016-final
Wwsss intro2016-final
 
10 Years Web Science
10 Years Web Science10 Years Web Science
10 Years Web Science
 
Semantic Web Technologies: Principles and Practices
Semantic Web Technologies: Principles and PracticesSemantic Web Technologies: Principles and Practices
Semantic Web Technologies: Principles and Practices
 
Closing Session ISWC 2015
Closing Session ISWC 2015Closing Session ISWC 2015
Closing Session ISWC 2015
 
ISWC2015 Opening Session
ISWC2015 Opening SessionISWC2015 Opening Session
ISWC2015 Opening Session
 

Recently uploaded

Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 

Recently uploaded (20)

Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
American Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxAmerican Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptx
 

10 Jahre Web Science

Editor's Notes

  1. Gestern fielen hundertausende Internet Anschluesse aus – heute waren mehrere Erfahrungsberichte wie Leute nicht mehr wussten, was sie tun sollten For example, in the US it is now reported that between 15-20% of newly married couples met their spouses on line (cf. http://www.statisticbrain.com/online-dating-statistics/). https://www.timeshighereducation.com/home/open-access-papers-gain-more-traffic-and-citations/2014850.article
  2. Out of 350K dierent sites visited by 200K users over a 7 day period, 273K sites contained trackers that were sending information that we deemed unsafe. Data elements that are only and always sent by a single user, or a reduced set of users, are considered unsafe with regard to privacy. 50% of news site carry at least 11 different trackers
  3. - The majority of your friends in facebook have more friends than you do here: a. the majority of your friends are colored b. the majority of your friends are non-colored (same network) Practical example might be media biases
  4. Big Data presents opportunities for data mining and machine learning previously unimaginable, given the vast size of datasets from which we are able to learn, cluster upon, find associations within, and generally search for insights not before attainable. Mining Big Data is not a plug-and-play, one-size-fit-all, (insert another cliche here) process, however; though there seems to be alarmingly little discussion anymore of their importance in relation to Big Data, statistical thinking, methods, and processes matter. It is possible that the lack of discussion is because most people understand this fundamental truth already, which I find doubtful. Perhaps I simply have not come across relevant such topics of late, and they do, in fact, exist. I also find this doubtful. I fear that oversight or an essential lack of understanding are more likely to blame. Big Data This article is not a blanket criticism of learning from Big Data; instead, it is much more accurately a reminder that time-tested statistical methods are more valid now than ever, in this Era of Big Data. In that regard, this discussion will focus on 2 particular statistical issues to be on the look out for in your own work and in the work of others mining and learning from Big Data. And for the practitioners out there, this is not about abstract statistical theory. This is about practicality. And the highly improbable probabilities that can be improperly gleaned from Big Data. The Bonferroni Principle There is a concept in statistics that goes like this: even in completely random datasets, you can expect particular events of interest to occur, and to occur in increasing numbers as the amount of data grows. These occurrences are nothing more than collections of random features that appear to be instances of interest, but are not. This bears repeating: even amounts of random data lead to what seem to be events of interest, and the number of these seemingly interesting events grows as does the size of the dataset. The Bonferroni Principle1 is a statistical method for accounting for these random events. To employ it, determine the number of expected random events of interest in the dataset, and if the observed number is significantly greater than this number, the chances of any observations providing useful insight are almost nonexistent. The Bonferroni Correction is a technique for helping to avoid such observations. Torture the data, and it will confess to anything. — Ronald Coase, economist, Nobel Prize Laureate. One of the most prominent and easy to understand examples of the Bonferroni Principle is that of the George W. Bush administration's Total Information Awareness data-collection and data mining plan of 20021. The criticism of the plan's effectiveness, and its relationship to the Bonferroni Principal, are as follows. Suppose we are looking for terrorists, from a potential pool made up of a very large number of individuals. Let's say that, in actuality, however, there is an incredibly small number of individuals who are terrorists. Now suppose these potential terrorists are thought to be deliberately visiting particular locations in pairs for meetings, but let's further suppose that these potential terrorists are actually non-terrorists moving about randomly. By using hard numbers for such a scenario and working out the probabilities, Rajaraman & Ullman gives the example of one billion potential "evil-doers," and though the actual number may be something very small (they give the example of 10 pairs), statistical probabilities could put the number of suspected pairs meeting at given locations due to pure randomness at 250,000 (again, in this particular example). Now, this is clearly a problem. In purely practical terms, imagine having to recruit, train, and pay enough police personnel to investigate each of these flagged individuals! If a Big Data mining practitioner had first computed some number which could be proven a reasonable number of expected random events (the Bonferroni Principle in action), the entire investigation would have been immediately recognized as flawed, given the near-absolute certainty that this Bonferroni number would have been less than a quarter of a million, the suspected number of significant events shown above. Knowing when our out-of-the-gate quantitative assumptions are off base is critically useful in the Era of Big Data. The Bonferroni Principle is one example of how Big Data can result in highly unlikely outcomes masquerading as statistically sound.