SlideShare ist ein Scribd-Unternehmen logo
1 von 6
Downloaden Sie, um offline zu lesen
Summary	
  
Data	
  coding	
  ,	
  analysis,	
  archiving,	
  and	
  
   sharing	
  for	
  open	
  collabora9on	
  


                Richard	
  Aslin	
  
            University	
  of	
  Rochester	
  
1.	
  	
  What	
  is	
  your	
  hypothesis?	
  
•  9/11	
  occurred	
  because	
  the	
  intelligence	
  
   community	
  suffered	
  from	
  a	
  “failure	
  of	
  
   imagina9on”	
  
   –  BoGom-­‐up	
  data	
  mining	
  (“connec9ng	
  the	
  dots”)	
  
   –  Top-­‐down	
  predic9ons	
  (“what	
  are	
  vulnerabili9es??”)	
  
•  Clearly,	
  you	
  need	
  both	
  
•  Must	
  apply	
  approaches	
  itera9vely	
  and	
  repeatedly	
  
2.	
  	
  Observa9ons	
  are	
  DVs	
  
•  Are	
  the	
  paGerns	
  you	
  “see”	
  the	
  ones	
  that	
  are	
  
   “relevant”	
  or	
  causal?	
  	
  
•  Problem	
  of	
  data	
  sparsity	
  and	
  false	
  correla9ons	
  
•  Hypothesis	
  tes9ng	
  requires	
  an	
  experiment	
  
   (manipula9ng	
  an	
  IV)	
  
•  Tension	
  between	
  “ecology”	
  and	
  “control	
  of	
  
   variables”	
  (sociology	
  of	
  preferred	
  methods)	
  
3.	
  	
  How	
  expand	
  hypothesis	
  space?	
  
•  If	
  large/standard	
  datasets,	
  then	
  evalua9on	
  
   becomes	
  stagnant	
  (only	
  evaluated	
  with	
  that	
  
   dataset)	
  
•  If	
  evalua9on	
  only	
  uses	
  standard	
  (sta9s9cal)	
  
   tools,	
  same	
  problem	
  of	
  stagna9on	
  
•  Is	
  clever	
  visualiza9on	
  the	
  key	
  to	
  hypothesis	
  
   forma9on,	
  even	
  if	
  “simple”	
  variables?	
  

               TED	
  talk	
  by	
  Deb	
  Roy	
  from	
  MIT	
  
4.	
  	
  When	
  do	
  you	
  give	
  up?	
  
•  Reliance	
  on	
  visual	
  paGern	
  recogni9on	
  by	
  
   human	
  coder	
  may	
  not	
  reveal	
  relevant	
  
   (informa9ve)	
  features	
  (sound	
  spectrogram	
  
   cannot	
  be	
  “read”)	
  
•  Failure	
  at	
  macro	
  level	
  prompts	
  search	
  for	
  info	
  
   at	
  micro	
  level	
  (fMRI	
  univariate	
  vs.	
  mul9variate	
  
   analysis):	
  need	
  to	
  “drill	
  down”	
  
•  Failure	
  at	
  micro	
  level	
  may	
  indicate	
  
   indeterminacy	
  of	
  causal	
  hierarchy	
  (Fodor)	
  
5.	
  	
  Rules	
  of	
  sharing	
  
•  When	
  does	
  “your”	
  data	
  become	
  accessible	
  by:	
  
    –  Your	
  collaborators	
  
    –  Friends	
  who	
  ask	
  
    –  Strangers	
  
    –  Anyone	
  
•  Who	
  gets	
  credit?	
  
•  How	
  should	
  junior	
  researchers	
  “share”?	
  	
  
   Especially	
  with	
  senior	
  labs	
  that	
  have	
  $$$.	
  

Weitere ähnliche Inhalte

Ähnlich wie Aslin.discussion

Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds
 
Altman pitt 2013_v3
Altman pitt 2013_v3Altman pitt 2013_v3
Altman pitt 2013_v3
Micah Altman
 
Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"
summersocialwebshop
 

Ähnlich wie Aslin.discussion (20)

NGP Retreat Open Science 2015
NGP Retreat Open Science 2015NGP Retreat Open Science 2015
NGP Retreat Open Science 2015
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Data Science Folk Knowledge
Data Science Folk KnowledgeData Science Folk Knowledge
Data Science Folk Knowledge
 
Biswa research
Biswa researchBiswa research
Biswa research
 
NeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimagingNeuroVault and the vision for data sharing in neuroimaging
NeuroVault and the vision for data sharing in neuroimaging
 
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?Computational Reproducibility vs. Transparency: Is It FAIR Enough?
Computational Reproducibility vs. Transparency: Is It FAIR Enough?
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next level
 
The Architecture of Understanding
The Architecture of UnderstandingThe Architecture of Understanding
The Architecture of Understanding
 
Altman pitt 2013_v3
Altman pitt 2013_v3Altman pitt 2013_v3
Altman pitt 2013_v3
 
(One Possible) Future of Scholarly Communication
(One Possible) Future of Scholarly Communication(One Possible) Future of Scholarly Communication
(One Possible) Future of Scholarly Communication
 
From byte to mind
From byte to mindFrom byte to mind
From byte to mind
 
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona
 
Editing Digital Imagery in Research: Exploring the Fidelity-to-Artificiality...
Editing Digital Imagery in Research:  Exploring the Fidelity-to-Artificiality...Editing Digital Imagery in Research:  Exploring the Fidelity-to-Artificiality...
Editing Digital Imagery in Research: Exploring the Fidelity-to-Artificiality...
 
Jsm big-data
Jsm big-dataJsm big-data
Jsm big-data
 
Waves keynote2c
Waves keynote2cWaves keynote2c
Waves keynote2c
 
Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"Elizabeth Churchill, "Data by Design"
Elizabeth Churchill, "Data by Design"
 

Mehr von Jesse Lingeman

Its About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral PatternsIts About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Jesse Lingeman
 
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDASupporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Jesse Lingeman
 
Messinger.openshapa.091511
Messinger.openshapa.091511Messinger.openshapa.091511
Messinger.openshapa.091511
Jesse Lingeman
 
Hoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.pptHoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.ppt
Jesse Lingeman
 
Alibali mult data streams a
Alibali mult data streams aAlibali mult data streams a
Alibali mult data streams a
Jesse Lingeman
 

Mehr von Jesse Lingeman (12)

Its About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral PatternsIts About Time: Analyzing Temporal MicroLevel Behavioral Patterns
Its About Time: Analyzing Temporal MicroLevel Behavioral Patterns
 
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDASupporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
 
Messinger.openshapa.091511
Messinger.openshapa.091511Messinger.openshapa.091511
Messinger.openshapa.091511
 
Mac whinney macw
Mac whinney macwMac whinney macw
Mac whinney macw
 
Hoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.pptHoffman nsf presentation hoffman-25-aug11.ppt
Hoffman nsf presentation hoffman-25-aug11.ppt
 
Gray 110916 ns-fwkshp
Gray 110916 ns-fwkshpGray 110916 ns-fwkshp
Gray 110916 ns-fwkshp
 
Davis kean.open shapa
Davis kean.open shapaDavis kean.open shapa
Davis kean.open shapa
 
Borner links
Borner linksBorner links
Borner links
 
Altman links
Altman linksAltman links
Altman links
 
Alibali mult data streams a
Alibali mult data streams aAlibali mult data streams a
Alibali mult data streams a
 
Test1
Test1Test1
Test1
 
Test2
Test2Test2
Test2
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Kürzlich hochgeladen (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

Aslin.discussion

  • 1. Summary   Data  coding  ,  analysis,  archiving,  and   sharing  for  open  collabora9on   Richard  Aslin   University  of  Rochester  
  • 2. 1.    What  is  your  hypothesis?   •  9/11  occurred  because  the  intelligence   community  suffered  from  a  “failure  of   imagina9on”   –  BoGom-­‐up  data  mining  (“connec9ng  the  dots”)   –  Top-­‐down  predic9ons  (“what  are  vulnerabili9es??”)   •  Clearly,  you  need  both   •  Must  apply  approaches  itera9vely  and  repeatedly  
  • 3. 2.    Observa9ons  are  DVs   •  Are  the  paGerns  you  “see”  the  ones  that  are   “relevant”  or  causal?     •  Problem  of  data  sparsity  and  false  correla9ons   •  Hypothesis  tes9ng  requires  an  experiment   (manipula9ng  an  IV)   •  Tension  between  “ecology”  and  “control  of   variables”  (sociology  of  preferred  methods)  
  • 4. 3.    How  expand  hypothesis  space?   •  If  large/standard  datasets,  then  evalua9on   becomes  stagnant  (only  evaluated  with  that   dataset)   •  If  evalua9on  only  uses  standard  (sta9s9cal)   tools,  same  problem  of  stagna9on   •  Is  clever  visualiza9on  the  key  to  hypothesis   forma9on,  even  if  “simple”  variables?   TED  talk  by  Deb  Roy  from  MIT  
  • 5. 4.    When  do  you  give  up?   •  Reliance  on  visual  paGern  recogni9on  by   human  coder  may  not  reveal  relevant   (informa9ve)  features  (sound  spectrogram   cannot  be  “read”)   •  Failure  at  macro  level  prompts  search  for  info   at  micro  level  (fMRI  univariate  vs.  mul9variate   analysis):  need  to  “drill  down”   •  Failure  at  micro  level  may  indicate   indeterminacy  of  causal  hierarchy  (Fodor)  
  • 6. 5.    Rules  of  sharing   •  When  does  “your”  data  become  accessible  by:   –  Your  collaborators   –  Friends  who  ask   –  Strangers   –  Anyone   •  Who  gets  credit?   •  How  should  junior  researchers  “share”?     Especially  with  senior  labs  that  have  $$$.