SlideShare ist ein Scribd-Unternehmen logo
1 von 31
1
The Yotta is not Enough!
The Need for Rethinking Information Science.
Dr Bruno Jacobfeuerborn
Telekom Deutschland GmbH
“Information Science in an Age of Change”, 2nd Conference
Institute for Information Science and Book Studies, University of Warsaw
Warsaw, April 15-16th, 2013
Yotta (Y)
=
1024 or 1 000 000 000 000 000 000 000 000
2© B. Jacobfeuerborn
Metric Prefixes (ISO)
3© B. Jacobfeuerborn
Dr. Bruno Jacobfeuerborn
- Moved to Deutsche Telekom in 1989.
- Head of Radio and Transmission Department in Hanover , 1991.
- Regional Director in Leipzig, 1991.
- Regional Director Technology and later Regional Director Business, responsible for Sales,
Marketing and Technology, Hanover, 1995.
- T-Mobile; the acquisition of the GSM license in Poland, 1996.
- Technical Director T-Mobile Netherlands and Member of the Management Board, 2002.
- Head of Service Management Europe in the T-Mobile International, 2004.
- Technical Director PTC and Member of the Management Board, 2007.
- Director and Management Board Member responsible for technology (fixed and mobile)
in Germany at Telekom Deutschland GmbH, 2009.
- Invited speaker to international conferences and coach of workshops.
- MOST Foundation General Assembly member.
4
5
Contents
─ Prologue
─ Thesis
─ Big Data
─ 4 Paradigms of Science
─ Data Science
─ Epilogue
New Scientific
Paradigm
Big Data
© B. Jacobfeuerborn
6
Prologue.
© B. Jacobfeuerborn
7
Data is the raw material of the XXI century.
Credo
© B. Jacobfeuerborn
8
Thesis.
© B. Jacobfeuerborn
9
A new scientific paradigm emerges.
Information science has to face and cope with it!
Thesis
Source: Cartoonbank.com
© B. Jacobfeuerborn
10
Big Data.
© B. Jacobfeuerborn
11© B. Jacobfeuerborn
12
“Big data refers to datasets whose size is beyond the
ability of typical database software tools to capture,
store, manage, and analyze”.
--- McKinsey, 2011
Big Data
© B. Jacobfeuerborn
How Big is Big?
Today: between Exabytes (1018) and Zettabytes (1021)
Tomorrow: over Zettabytes
13© B. Jacobfeuerborn
Big Data – the Flood
Walmart drags a million hourly retail transactions into a database that
long ago passed 2.5 petabytes; Facebook processes 2.5 billion pieces of
content and 500 terabytes of data each day; and Google, whose YouTube
division alone gains 72 hours of new video every minute, accumulates 24
petabytes of data in a single day.
− David Rowan, Editor, WIRED UK,
http://www.edge.org/response-detail/23859
“Each day, according to IBM,
we collectively generate 2.5
quintillion bytes—a tsunami of
structured and unstructured
data that's growing, in IDC's
reckoning, at 60 per cent a
year.
14© B. Jacobfeuerborn
15
Four Paradigms of Science.
© B. Jacobfeuerborn
Scientific Revolutions
T.S. Kuhn
1922 - 1996
16© B. Jacobfeuerborn
17
Science has been developing
from idea-centricity to data-centricity.
Data leverage ideas!
My Addendum to Kuhn’s Claim
Data
Idea
© B. Jacobfeuerborn
The School of Athens, Raphael, 1509
18© B. Jacobfeuerborn
1. Platonic Approach
In the Greek language science means
knowledge. According to Aristotle and
Plato science/knowledge is:
universal, necessary, certain, and
timeless. Deduction is the only
allowed way of reasoning.
Mathematics is a prototype (model) of
science and a language of nature.
19© B. Jacobfeuerborn
2. Baconian Approach
Francis Bacon’s new methodology of
science and knowledge, empiricism,
that relayed on observation,
collection of data, and experimenting,
along with accepting induction as a
legal inference method for scientific
endeavors can be characterized as
data-centric.
20© B. Jacobfeuerborn
Francis Bacon, 1561 - 1626
3. Computers at Work (Simulation, Modelling)
−J.P. Rini
“The idea is to use a computer program
to perform lengthy computations, and to
provide a proof that the result of these
computations implies the given theorem.
In 1976, the four color theorem was the
first major theorem to be verified using a
computer program.”
http://en.wikipedia.org/wiki/Computer-assisted_proof
21© B. Jacobfeuerborn
22
“It is a capital mistake to
theorize before one has
data.”
− Sherlock Holmes,
A Study in Scarlett
(Arthur Conan Doyle)
The Role of Data
© B. Jacobfeuerborn
23
“We can stop looking for models. We
can analyze the data without
hypotheses about what it might show.
We can throw the numbers into the
biggest computing clusters the world
has ever seen and let algorithms find
patterns where science cannot.”
–Chris Anderson
4. Big Data at Work
© B. Jacobfeuerborn
24
Data Science.
© B. Jacobfeuerborn
25
Data science is a set of scientific theories, methods, tools, and best
practices (including hacking!) aimed to analyse and explore big
datasets in order to discover hidden knowledge thru inference.
Data Science
source: Data Science: An Introduction,
http://en.wikibooks.org/wiki/Data_Science:_An_Introduction
© B. Jacobfeuerborn
!
26© B. Jacobfeuerborn
27
My Vision
Information
Science
Data
Science
© B. Jacobfeuerborn
28
− Definitions of data, information, and knowledge.
− Data structures and databases.
− Big data and analytics trends.
− Elements of logics and non-standard inference mechanisms for big data.
− Assorted methods of knowledge representation.
− Elements of machine learning and artificial intelligence.
− Methods of browsing and retrieval of big data, with a focus on methods to fast delivery of the retrieved
hits.
− Methods and tools to create metadata.
− Data integration.
− Deep data analysis: statistics and data mining technologies.
− Architecture of scalable big data systems.
− Cloud computing; methods of physical storage of big data; virtualization technologies for sharing
processing power and memory.
− Security and privacy within big data infrastructures.
− Big data case studies (e.g. social networking, governance, marketing, health).
Data Science Curriculum for Information Science Students
© B. Jacobfeuerborn
29
Epilogue.
© B. Jacobfeuerborn
30
“With too little data, you won’t be
able to make any conclusions
that you trust. …
Big data isn’t about bits, it’s
about talent.”
–Douglas Merrill
http://www.forbes.com/sites/douglasmerrill/
2012/05/01/r-is-not-enough-for-big-data/
To Remember
© B. Jacobfeuerborn
Thank you for listening!

Weitere ähnliche Inhalte

Was ist angesagt?

ADED 7330 Introduction
ADED 7330 IntroductionADED 7330 Introduction
ADED 7330 Introductionqueenofrug
 
Digitisation Infrastructure - June 2007
Digitisation Infrastructure - June 2007Digitisation Infrastructure - June 2007
Digitisation Infrastructure - June 2007Alastair Dunning
 
InfoFest Kent 2017: Panel discussion - combating fake news
InfoFest Kent 2017: Panel discussion - combating fake newsInfoFest Kent 2017: Panel discussion - combating fake news
InfoFest Kent 2017: Panel discussion - combating fake newsUKC Library and IT
 
Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?Martin Hamilton
 
Health and clinical research - data futures, NIHR accelerating digital programme
Health and clinical research - data futures, NIHR accelerating digital programmeHealth and clinical research - data futures, NIHR accelerating digital programme
Health and clinical research - data futures, NIHR accelerating digital programmeMartin Hamilton
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data ScienceFeyzi R. Bagirov
 
Assignment1.final version
Assignment1.final versionAssignment1.final version
Assignment1.final versionmarianozanetti
 
Open Data - a goldmine (JavaZone 2009)
Open Data - a goldmine (JavaZone 2009)Open Data - a goldmine (JavaZone 2009)
Open Data - a goldmine (JavaZone 2009)Svein-Magnus Sørensen
 
Aula 4 27032015 sii-v1
Aula 4   27032015 sii-v1Aula 4   27032015 sii-v1
Aula 4 27032015 sii-v1Aneesh Zutshi
 
Science and Culture in the EU‘s Digital Agenda
Science and Culture  in the EU‘s Digital AgendaScience and Culture  in the EU‘s Digital Agenda
Science and Culture in the EU‘s Digital AgendaCarl-Christian Buhr
 
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case StudiesSetting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case StudiesBYTE Project
 
Moldova ICT Summit Open Data Session
Moldova ICT Summit Open Data SessionMoldova ICT Summit Open Data Session
Moldova ICT Summit Open Data SessionePSI Platform
 

Was ist angesagt? (19)

ADED 7330 Introduction
ADED 7330 IntroductionADED 7330 Introduction
ADED 7330 Introduction
 
Digitisation Infrastructure - June 2007
Digitisation Infrastructure - June 2007Digitisation Infrastructure - June 2007
Digitisation Infrastructure - June 2007
 
Prescottimperialbigdata
PrescottimperialbigdataPrescottimperialbigdata
Prescottimperialbigdata
 
Philipine internet
Philipine internetPhilipine internet
Philipine internet
 
InfoFest Kent 2017: Panel discussion - combating fake news
InfoFest Kent 2017: Panel discussion - combating fake newsInfoFest Kent 2017: Panel discussion - combating fake news
InfoFest Kent 2017: Panel discussion - combating fake news
 
Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?Supercomputing and the cloud - the next big paradigm shift?
Supercomputing and the cloud - the next big paradigm shift?
 
Internet
InternetInternet
Internet
 
Health and clinical research - data futures, NIHR accelerating digital programme
Health and clinical research - data futures, NIHR accelerating digital programmeHealth and clinical research - data futures, NIHR accelerating digital programme
Health and clinical research - data futures, NIHR accelerating digital programme
 
Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data Science
 
Assignment1.final version
Assignment1.final versionAssignment1.final version
Assignment1.final version
 
Open Data - a goldmine (JavaZone 2009)
Open Data - a goldmine (JavaZone 2009)Open Data - a goldmine (JavaZone 2009)
Open Data - a goldmine (JavaZone 2009)
 
Aula 4 27032015 sii-v1
Aula 4   27032015 sii-v1Aula 4   27032015 sii-v1
Aula 4 27032015 sii-v1
 
Science and Culture in the EU‘s Digital Agenda
Science and Culture  in the EU‘s Digital AgendaScience and Culture  in the EU‘s Digital Agenda
Science and Culture in the EU‘s Digital Agenda
 
Polinter01
Polinter01Polinter01
Polinter01
 
Polinter01
Polinter01Polinter01
Polinter01
 
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case StudiesSetting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
Setting the Scene for Big Data in Europe, Looking Ahead to the Case Studies
 
Internet
InternetInternet
Internet
 
Moldova ICT Summit Open Data Session
Moldova ICT Summit Open Data SessionMoldova ICT Summit Open Data Session
Moldova ICT Summit Open Data Session
 
University of Texas, Austin
University of Texas, AustinUniversity of Texas, Austin
University of Texas, Austin
 

Ähnlich wie The Yotta is not Enough! / Bruno Jacobfeuerborn

The_Information_Age.pptx
The_Information_Age.pptxThe_Information_Age.pptx
The_Information_Age.pptxAllisaAlcober1
 
Steps towards a Data Value Chain
Steps towards a Data Value ChainSteps towards a Data Value Chain
Steps towards a Data Value ChainPRELIDA Project
 
Internet and Sudan
Internet and SudanInternet and Sudan
Internet and SudanHala Nur
 
Information Overload
Information OverloadInformation Overload
Information OverloadMiro Pusnik
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Jisc
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?Anna Fensel
 
Technology innovation 1
Technology innovation 1Technology innovation 1
Technology innovation 1Rayspalding13
 
ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017petermurrayrust
 
Why not use ict in sudan
Why not use ict in sudanWhy not use ict in sudan
Why not use ict in sudanHala Nur
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014VinothkumaR Ramu
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science JournalismLiliana Bounegru
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science JournalismJonathan Gray
 
Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014Marios Kyriazis
 
Introducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityIntroducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityLeandro Agro'
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Wesley De Neve
 

Ähnlich wie The Yotta is not Enough! / Bruno Jacobfeuerborn (20)

The_Information_Age.pptx
The_Information_Age.pptxThe_Information_Age.pptx
The_Information_Age.pptx
 
Steps towards a Data Value Chain
Steps towards a Data Value ChainSteps towards a Data Value Chain
Steps towards a Data Value Chain
 
Big dataorig
Big dataorigBig dataorig
Big dataorig
 
Internet and Sudan
Internet and SudanInternet and Sudan
Internet and Sudan
 
Information Overload
Information OverloadInformation Overload
Information Overload
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Data science general
Data science generalData science general
Data science general
 
top 10 Data Mining Algorithms
top 10 Data Mining Algorithmstop 10 Data Mining Algorithms
top 10 Data Mining Algorithms
 
What is opendata
What is opendata What is opendata
What is opendata
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 
Technology innovation 1
Technology innovation 1Technology innovation 1
Technology innovation 1
 
ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017ContentMining and Copyright at CopyCamp2017
ContentMining and Copyright at CopyCamp2017
 
Why not use ict in sudan
Why not use ict in sudanWhy not use ict in sudan
Why not use ict in sudan
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
 
Using Data for Science Journalism
Using Data for Science JournalismUsing Data for Science Journalism
Using Data for Science Journalism
 
The Internet of Things
The Internet of ThingsThe Internet of Things
The Internet of Things
 
Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014Internet of things and nanothings workshop may 2014
Internet of things and nanothings workshop may 2014
 
Introducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM UniversityIntroducing the Internet of Things: lecture @IULM University
Introducing the Internet of Things: lecture @IULM University
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 

Mehr von Zakład Systemów Informacyjnych, Instytut Informacji Naukowej i Studiów Bibliologicznych (UW)

Mehr von Zakład Systemów Informacyjnych, Instytut Informacji Naukowej i Studiów Bibliologicznych (UW) (20)

O postaci książek w bibliotekach cyfrowych / Zdzisław Dobrowolski
O postaci książek w bibliotekach cyfrowych / Zdzisław DobrowolskiO postaci książek w bibliotekach cyfrowych / Zdzisław Dobrowolski
O postaci książek w bibliotekach cyfrowych / Zdzisław Dobrowolski
 
Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...
Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...
Współautorstwo publikacji naukowych jako wyznacznik współpracy między naukowc...
 
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
 
Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...
Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...
Zachowania związane z indywidualnym zarządzaniem wiedzą i informacją – w świe...
 
Koncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-Szakiel
Koncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-SzakielKoncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-Szakiel
Koncepcje trwałej ochrony polskich zasobów cyfrowych /Aneta Januszko-Szakiel
 
Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...
Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...
Publikowanie naukowe jako proces organizacji wiedzy. Zarys koncepcji / Marek ...
 
The Library as Place at the Digital Age / Laurence Favier
The Library as Place at the Digital Age /  Laurence FavierThe Library as Place at the Digital Age /  Laurence Favier
The Library as Place at the Digital Age / Laurence Favier
 
JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...
JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...
JHP BN – rekonfiguracja modelu chwd i dekonstrukcja jiw. Zmiany organizacji t...
 
e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...
e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...
e-Urząd-Biblioteka-Obywatel. Biblioteki jako pośrednicy w dostępie do informa...
 
Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...
Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...
Metodologia badań w nauce o informacji – brakujący element / Arkadiusz Puliko...
 
Transitioning from Technical Services to Center for Digital Scholarship and S...
Transitioning from Technical Services to Center for Digital Scholarship and S...Transitioning from Technical Services to Center for Digital Scholarship and S...
Transitioning from Technical Services to Center for Digital Scholarship and S...
 
School library as a heterotopic place / Béatrice Micheau
School library as a heterotopic place / Béatrice Micheau School library as a heterotopic place / Béatrice Micheau
School library as a heterotopic place / Béatrice Micheau
 
Social Media Strategy Design / Robin Effing
Social Media Strategy Design / Robin EffingSocial Media Strategy Design / Robin Effing
Social Media Strategy Design / Robin Effing
 
Information culture as a social cultural practice: (re)defining the concept i...
Information culture as a social cultural practice: (re)defining the concept i...Information culture as a social cultural practice: (re)defining the concept i...
Information culture as a social cultural practice: (re)defining the concept i...
 
Digitization for Access and Preservation: The Evolving Debate in the Cultural...
Digitization for Access and Preservation: The Evolving Debate in the Cultural...Digitization for Access and Preservation: The Evolving Debate in the Cultural...
Digitization for Access and Preservation: The Evolving Debate in the Cultural...
 
Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...
Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...
Collaborative platformin the agricultural sectorin Algeria. Towards a Knowled...
 
The road to providing access to Iran’s heritage resources: Iranian Consortium...
The road to providing access to Iran’s heritage resources: Iranian Consortium...The road to providing access to Iran’s heritage resources: Iranian Consortium...
The road to providing access to Iran’s heritage resources: Iranian Consortium...
 
Information Overload and Information Science / Mieczysław Muraszkiewicz
Information Overload and Information Science / Mieczysław MuraszkiewiczInformation Overload and Information Science / Mieczysław Muraszkiewicz
Information Overload and Information Science / Mieczysław Muraszkiewicz
 
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
Zachowania informacyjne humanistów – badanie potrzeb informacyjnych pracownik...
 
Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...
Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...
Naukowe systemy informacyjno-wyszukiwawcze – ogólne lub specjalistyczne – pro...
 

Kürzlich hochgeladen

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 

Kürzlich hochgeladen (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 

The Yotta is not Enough! / Bruno Jacobfeuerborn

  • 1. 1 The Yotta is not Enough! The Need for Rethinking Information Science. Dr Bruno Jacobfeuerborn Telekom Deutschland GmbH “Information Science in an Age of Change”, 2nd Conference Institute for Information Science and Book Studies, University of Warsaw Warsaw, April 15-16th, 2013
  • 2. Yotta (Y) = 1024 or 1 000 000 000 000 000 000 000 000 2© B. Jacobfeuerborn
  • 3. Metric Prefixes (ISO) 3© B. Jacobfeuerborn
  • 4. Dr. Bruno Jacobfeuerborn - Moved to Deutsche Telekom in 1989. - Head of Radio and Transmission Department in Hanover , 1991. - Regional Director in Leipzig, 1991. - Regional Director Technology and later Regional Director Business, responsible for Sales, Marketing and Technology, Hanover, 1995. - T-Mobile; the acquisition of the GSM license in Poland, 1996. - Technical Director T-Mobile Netherlands and Member of the Management Board, 2002. - Head of Service Management Europe in the T-Mobile International, 2004. - Technical Director PTC and Member of the Management Board, 2007. - Director and Management Board Member responsible for technology (fixed and mobile) in Germany at Telekom Deutschland GmbH, 2009. - Invited speaker to international conferences and coach of workshops. - MOST Foundation General Assembly member. 4
  • 5. 5 Contents ─ Prologue ─ Thesis ─ Big Data ─ 4 Paradigms of Science ─ Data Science ─ Epilogue New Scientific Paradigm Big Data © B. Jacobfeuerborn
  • 7. 7 Data is the raw material of the XXI century. Credo © B. Jacobfeuerborn
  • 9. 9 A new scientific paradigm emerges. Information science has to face and cope with it! Thesis Source: Cartoonbank.com © B. Jacobfeuerborn
  • 10. 10 Big Data. © B. Jacobfeuerborn
  • 12. 12 “Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze”. --- McKinsey, 2011 Big Data © B. Jacobfeuerborn
  • 13. How Big is Big? Today: between Exabytes (1018) and Zettabytes (1021) Tomorrow: over Zettabytes 13© B. Jacobfeuerborn
  • 14. Big Data – the Flood Walmart drags a million hourly retail transactions into a database that long ago passed 2.5 petabytes; Facebook processes 2.5 billion pieces of content and 500 terabytes of data each day; and Google, whose YouTube division alone gains 72 hours of new video every minute, accumulates 24 petabytes of data in a single day. − David Rowan, Editor, WIRED UK, http://www.edge.org/response-detail/23859 “Each day, according to IBM, we collectively generate 2.5 quintillion bytes—a tsunami of structured and unstructured data that's growing, in IDC's reckoning, at 60 per cent a year. 14© B. Jacobfeuerborn
  • 15. 15 Four Paradigms of Science. © B. Jacobfeuerborn
  • 16. Scientific Revolutions T.S. Kuhn 1922 - 1996 16© B. Jacobfeuerborn
  • 17. 17 Science has been developing from idea-centricity to data-centricity. Data leverage ideas! My Addendum to Kuhn’s Claim Data Idea © B. Jacobfeuerborn
  • 18. The School of Athens, Raphael, 1509 18© B. Jacobfeuerborn
  • 19. 1. Platonic Approach In the Greek language science means knowledge. According to Aristotle and Plato science/knowledge is: universal, necessary, certain, and timeless. Deduction is the only allowed way of reasoning. Mathematics is a prototype (model) of science and a language of nature. 19© B. Jacobfeuerborn
  • 20. 2. Baconian Approach Francis Bacon’s new methodology of science and knowledge, empiricism, that relayed on observation, collection of data, and experimenting, along with accepting induction as a legal inference method for scientific endeavors can be characterized as data-centric. 20© B. Jacobfeuerborn Francis Bacon, 1561 - 1626
  • 21. 3. Computers at Work (Simulation, Modelling) −J.P. Rini “The idea is to use a computer program to perform lengthy computations, and to provide a proof that the result of these computations implies the given theorem. In 1976, the four color theorem was the first major theorem to be verified using a computer program.” http://en.wikipedia.org/wiki/Computer-assisted_proof 21© B. Jacobfeuerborn
  • 22. 22 “It is a capital mistake to theorize before one has data.” − Sherlock Holmes, A Study in Scarlett (Arthur Conan Doyle) The Role of Data © B. Jacobfeuerborn
  • 23. 23 “We can stop looking for models. We can analyze the data without hypotheses about what it might show. We can throw the numbers into the biggest computing clusters the world has ever seen and let algorithms find patterns where science cannot.” –Chris Anderson 4. Big Data at Work © B. Jacobfeuerborn
  • 24. 24 Data Science. © B. Jacobfeuerborn
  • 25. 25 Data science is a set of scientific theories, methods, tools, and best practices (including hacking!) aimed to analyse and explore big datasets in order to discover hidden knowledge thru inference. Data Science source: Data Science: An Introduction, http://en.wikibooks.org/wiki/Data_Science:_An_Introduction © B. Jacobfeuerborn !
  • 28. 28 − Definitions of data, information, and knowledge. − Data structures and databases. − Big data and analytics trends. − Elements of logics and non-standard inference mechanisms for big data. − Assorted methods of knowledge representation. − Elements of machine learning and artificial intelligence. − Methods of browsing and retrieval of big data, with a focus on methods to fast delivery of the retrieved hits. − Methods and tools to create metadata. − Data integration. − Deep data analysis: statistics and data mining technologies. − Architecture of scalable big data systems. − Cloud computing; methods of physical storage of big data; virtualization technologies for sharing processing power and memory. − Security and privacy within big data infrastructures. − Big data case studies (e.g. social networking, governance, marketing, health). Data Science Curriculum for Information Science Students © B. Jacobfeuerborn
  • 30. 30 “With too little data, you won’t be able to make any conclusions that you trust. … Big data isn’t about bits, it’s about talent.” –Douglas Merrill http://www.forbes.com/sites/douglasmerrill/ 2012/05/01/r-is-not-enough-for-big-data/ To Remember © B. Jacobfeuerborn
  • 31. Thank you for listening!