never trust a

scientist

datajournalist

dataset

	
Tilburg University - data journalism
missing data, no value stored	
“I need to solve this”	
missing data, no value stored	
“I need to write a story about this”	
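Whichever reaction you have, the first step is the same: find out exactly where no value was stored. A minimal sketch of that step, assuming a small, made-up table of split times (the runner names, checkpoint columns and times are hypothetical and not taken from the marathon dataset):

```python
import csv
import io

# Hypothetical sample: split times per runner at each checkpoint.
# An empty cell means no value was stored for that checkpoint.
SAMPLE = """runner,5k,10k,finish
A,00:24:10,00:49:02,03:41:55
B,00:26:40,,04:02:13
C,,00:55:30,04:20:01
"""


def find_missing(rows, key="runner"):
    """Return (runner, column) pairs where no value was stored."""
    missing = []
    for row in rows:
        for column, value in row.items():
            if column != key and (value is None or value.strip() == ""):
                missing.append((row[key], column))
    return missing


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
gaps = find_missing(rows)
for runner, column in gaps:
    print(f"runner {runner}: no value stored at {column}")
```

The list of gaps is only a starting point: the scientist can use it to decide how to repair the dataset, the journalist to decide whom to ask why the value is missing.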
forreporters.com/andrew-lehren/	
scientist to journalist: “You twist everything”	
journalist to scientist: “Your articles are useless”	
“I am right”	
can I trust (and use) this dataset?	
“Trustworthiness and data
management are vital to the success of
qualitative studies … There is a lack of
scientific literature regarding the
structures and processes for managing
large qualitative data sets.”	
	
(White, Oelken, Friesen, 2012)	
	
	
“A simple answer to objective reporting
is the kind of reporting that uses relevant
and reliable sources which is not bias or
slanted to a certain party.”	
	
Ibrahim, Pawanteh, Kee (2011)	
	
	
question:	
how to validate	
a dataset?	
check the data source	
	
what are his/her/its intentions?	
what is the citation index	
of the data owner?	
	
	
do other journalists	
cite the data owner?	
	
	
check the data	
	
benefit	
	
do I need this?	
	
	
	
do I need to use it?	
	
	
  
check	
	
data gathering? 	
is this correct?	
	
	
clarification of the data?
do I understand?	
	
	
missing data	
	
what is wrong? 	
I need to solve	
	
	
what is the story?	
I need to write	
	
  
trouble?	
	
TEST!	
	
	
	
CALL!	
	
  
I need more sources! (do I?)	
	
give me data	
check consistency	
	
	
give me humans	
check my story	
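For the "give me data / check consistency" route, one way to picture the check is to lay a second source next to the first and list where they disagree. A rough sketch under stated assumptions: the function name `compare_sources`, the yearly figures and the tolerance parameter are all hypothetical and only illustrate the idea.

```python
def compare_sources(primary, secondary, tolerance=0.0):
    """Compare two {key: number} mappings and report where they disagree."""
    report = {"only_primary": [], "only_secondary": [], "different": []}
    for key in sorted(set(primary) | set(secondary)):
        if key not in secondary:
            report["only_primary"].append(key)
        elif key not in primary:
            report["only_secondary"].append(key)
        elif abs(primary[key] - secondary[key]) > tolerance:
            report["different"].append((key, primary[key], secondary[key]))
    return report


# Hypothetical figures for the same indicator from two data owners.
source_a = {"2010": 1200, "2011": 1250, "2012": 1310}
source_b = {"2010": 1200, "2011": 1400, "2013": 1290}

report = compare_sources(source_a, source_b)
print(report)
```

Every entry in the report is a question rather than an answer: a reason to test the data again, or to call the data owner.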
	
  
same steps	
different interpretation	
	
  
“Dear datajournalist,	
	
Please take a look at the
research method yourself
and act a bit more like a
scientist.”	
“Dear scientist,	
	
Try to avoid intellectual
arrogance. There are
other people who are just
as smart.”	
	
“practice what you preach”	
	
  
step  scientists                   data journalists
1     check the source (citation)  check the source (citation)
2     check the data               check the data
3     check benefit                check benefit
4     check data gathering         check clarification
5     TEST!                        CALL!
6     more data sources            more human sources
@Hillevanderkaa	
Tilburg University

How to validate a dataset? Six steps.

Editor's notes

  1. Name. Work at university; work as a writer / data journalist. Somewhere in between: I do research, sometimes with a scientific goal and sometimes with a journalistic aim.
  2. If you are in between, it is interesting that the worlds of social science and data journalism are sometimes really different in the field, but sometimes not. If we take for example this dataset, which is the dataset Andrew Lehren from the New York Times used in a Pulitzer Prize winning story about the New York Marathon, you can see a blind spot.
  3. … if a scientist sees this, in general his first response is that the dataset is technically not right. There is some missing data: a problem which needs to be solved.
  4. While, if a journalist sees a white spot, he is really interested in the story behind the missing data. Why is the data missing?
  5. In this case, both approaches were right: some runners missed a checkpoint, but there were also some technical flaws.
  6. If I talk about journalists with scientists, they are not always as enthusiastic as they could be: journalists can't deal with data, they use data in a superficial way.
  7. Journalists, in turn: scientists are really egocentric, and their stories are not useful for the real world. They just do research to please themselves and their colleagues at university.
  8. At least on one thing they agree: they both assume they are right.
  9. Because I live in both worlds, I am interested in seeing whether there are real differences or not. One of those possible differences is how scientists as well as data journalists decide whether they trust and use a dataset. What I would like to discuss today is really just a starting point for this topic.
  10. So if you dig into the literature on the trustworthiness of data from the perspective of a scientist, you will find a broad variety of articles in different scientific fields. And it's not easy to detect a specific line in the variety of articles in all these different fields. And there is a lack of specific guidelines on how scientists determine the trustworthiness of a dataset.
  11. And if you read scientific articles about what makes a dataset trustworthy for journalists, you will find nothing. You will only find general readings about the trustworthiness of news sources in general, like the main principles of Gans. And a dataset could simply be one of these news sources. But on a literature level, it is hard to compare.
  12. So, with no clear starting point, it seemed right to start with a very general question. And that's what I did. I asked ten of my scientific as well a
  13. Are the intentions of any influence on the dataset?
  14. So they both use their colleagues as peers.
  15. Using a dataset from another source is not really common in social science.
  16. Experiments – case study