Oriol Pujol
ABOUT DATA FROM A MACHINE LEARNING PERSPECTIVE
TWO FLAVOURS AND ONE PRESENTATION
Share experiences of openness from the machine learning community
Propose potential solutions to the problem of sharing sensitive data
A COMMUNITY SHIFT
THE ROLE OF DATA IN MACHINE LEARNING
• Machine learning is sometimes called learning from data. Thus we are talking about building/coding algorithms that learn from data.
• Data is a key component of most research, and it is especially crucial in machine learning.
EXAMPLES OF TASKS IN MACHINE LEARNING
Automatic captioning
CURRENT MACHINE LEARNING RESEARCH FOCUS
• Improving results in terms of generalization, robustness, speed, and interpretability.
• Comparison with other techniques is fundamental.
• Many different data sets are needed to assess each property the research tackles, e.g., large data sets are needed to assess speed and big-data behaviour.
• Sometimes the applicability of a data set depends largely on the domain, e.g., forecasting for financial products or targeted gene identification.
AS A RESEARCHER
DISSEMINATION: REPORTING PROCESS, CLAIMING AUTHORSHIP
QUALITY: COMMUNITY VALIDITY ASSESSMENT, PEER REVIEW (KPI USED BY ORGANISATIONS FOR RESEARCH QUALITY)
We aim to contribute to the advancement of science: reproducible experimentation helps to eliminate individual biases.
FOR THIS REASON WE USE PUBLISHING
DISSEMINATION
QUALITY ASSESSMENT
FOR THIS REASON WE USE PUBLISHING
LONG PROCESS > 1 YEAR
NOT A GOOD DISSEMINATION SOURCE, OR ATTRIBUTION PROCESS
PEER QUALITY ASSESSMENT
CRITICAL FEEDBACK
GUARD AGAINST ERRORS
MAJOR SOURCE FOR RANKING RESEARCHERS
NON-PUBLISHED RESEARCH IS NOT RESEARCH
FOR THIS REASON WE USE PUBLISHING
FAIRER PROCESS > 3-6 MONTHS
QUALITY ASSESSMENT
RANKING RESEARCHERS
NON-PUBLISHED RESEARCH IS NOT RESEARCH
THE NIPS CASE: WHEN CONFERENCE PAPERS ARE OLD NEWS
IN MACHINE LEARNING 3-6 MONTHS IS TOO LONG
DECEMBER 5TH – 10TH, 2016
24/7 PUBLISHING CYCLE
IMMEDIACY, 24/7 PUBLISHING CYCLE
IMMEDIATE ATTRIBUTION
NO QUALITY ASSESSMENT
RANKING RESEARCHERS
HOW TO DISTINGUISH GOOD RESEARCH FROM FRAUD
HOWEVER, IN REALITY
TOWARDS REPRODUCIBILITY: WE NEED DATA
AN EXAMPLE: PLOS ONE
FOCUS SHIFT:
METHODOLOGICAL CORRECTNESS
DATA
MACHINE LEARNING DATASET REPOSITORIES
Data Sets - Raw data as a collection of similarly structured objects.
Material and Methods - Descriptions of the computational pipeline.
Learning Tasks - Learning tasks defined on raw data.
Challenges - Collections of tasks which have a particular theme.
MACHINE LEARNING DATASET REPOSITORIES
• Under a 2010 proposal by the U.S. Securities and Exchange Commission (SEC), financial products would have to include Python code.
WHEN DESCRIPTION AND DATA ARE NOT ENOUGH
GITHUB PROJECTS
BUT … THE DEVIL IS IN THE DETAILS
DOCUMENTING THE DETAILS
JUPYTER NOTEBOOKS
EXECUTABLE	
  CODE
LATEX
MARKDOWN
HTML+CSS
HYPERLINKS
EMBEDDED FRAMES
… and more.
PUTTING IT ALL TOGETHER
BOTTOM LINE
1. SCIENCE IS MORE THAN A PUBLICATION
2. AT ITS CORE LIES REPRODUCIBILITY… THIS INVOLVES DATA
3. BUT DATA IS NOT EVEN ENOUGH… CODE AND PRECISE EXPERIMENT REPRODUCIBILITY
4. DATA AND PROGRAMMING LITERACY ARE NEEDED
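As a rough illustration of points 2 and 3, the sketch below records what a third party would need to repeat a run: a fingerprint of the data, the random seed, and the result. It is a toy example under stated assumptions, not a method taken from the talk; the function names (fingerprint, run_experiment, manifest) and the "experiment" itself (a random split followed by a mean) are hypothetical and only the Python standard library is used.

```python
import hashlib
import json
import random


def fingerprint(rows):
    """Return a SHA-256 digest of the data so others can check they hold the same data."""
    payload = json.dumps(rows, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def run_experiment(rows, seed):
    """Hypothetical experiment: seeded random split, then the mean of the held-out half."""
    rng = random.Random(seed)  # local generator, global random state is untouched
    shuffled = list(rows)
    rng.shuffle(shuffled)
    held_out = shuffled[: len(shuffled) // 2]
    return sum(held_out) / len(held_out)


def manifest(rows, seed):
    """Bundle data fingerprint, seed, and result into one shareable record."""
    return {
        "data_sha256": fingerprint(rows),
        "seed": seed,
        "result": run_experiment(rows, seed),
    }


data = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
first = manifest(data, seed=42)
second = manifest(data, seed=42)
print(first == second)  # same data, same seed, same code: identical records
```

Publishing such a record next to the code and the data lets a reader notice when any of the three has changed; a real study would also need to pin library versions and the execution environment.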
BUT … SOMETHING ELSE ABOUT QUALITY ASSESSMENT: OPEN REVIEW CASE
THE RULES
• Submissions are posted on arXiv before being submitted to the conference.
• The program committee assigns blind reviewers (as always), marked as designated reviewers.
• Anyone can ask the program chairs for permission to become an anonymous designated reviewer (open bidding).
• Open commenters will have to use their real names, linked with their Google Scholar profiles.
• Papers that are presented in the workshop track or are not accepted will be considered non-archival, and may be submitted elsewhere (modified or not), although the ICLR site will maintain the reviews, the comments, and the links to the arXiv versions.
SOME POINTS ABOUT OPEN REVIEW
• Reading the answers from others is valuable
• Rejected papers have influence
• May become a popularity contest
SHARING DATA IN THE AGE OF DEEP LEARNING
Disclosure?
[Figure: relative mortality and disclosure risk plotted against privacy budget, ranging from more private to less private]
[Diagram: classifiers and a learner]
CONCLUSIONS
Research is based on reproducibility: documentation, data, code, and details.
Machine learning methods can help with the challenge of democratizing data.
@oriolpujolvila

Weitere ähnliche Inhalte

Was ist angesagt?

Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Michael Habib
 
preSCORE Presentation by Adam Etkin for Highwire Fall 2013 Meeting
preSCORE Presentation by Adam Etkin for Highwire Fall 2013 MeetingpreSCORE Presentation by Adam Etkin for Highwire Fall 2013 Meeting
preSCORE Presentation by Adam Etkin for Highwire Fall 2013 MeetingAdam Etkin
 
Confidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power AnalysisConfidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power AnalysisStatistics Solutions
 
Scientific research: What Anna Karenina teaches us about useful negative results
Scientific research: What Anna Karenina teaches us about useful negative resultsScientific research: What Anna Karenina teaches us about useful negative results
Scientific research: What Anna Karenina teaches us about useful negative resultsDaniel S. Katz
 
General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyStatistics Solutions
 
Risk Based De-identification for Sharing Health Data
Risk Based De-identification for Sharing Health DataRisk Based De-identification for Sharing Health Data
Risk Based De-identification for Sharing Health DataKhaled El Emam
 
LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11
LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11
LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11CORE Group
 
National archetype governance in Norway
National archetype governance in NorwayNational archetype governance in Norway
National archetype governance in NorwaySilje Ljosland Bakke
 
Dawson checklist presentation
Dawson checklist presentationDawson checklist presentation
Dawson checklist presentationangeladpmp
 
Lecture 1 ba 1 overview
Lecture 1 ba 1 overviewLecture 1 ba 1 overview
Lecture 1 ba 1 overviewyogesh ingle
 
RESUME_ZEYU XIE
RESUME_ZEYU XIERESUME_ZEYU XIE
RESUME_ZEYU XIEZeyu Xie
 
Selecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative StudySelecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative StudyStatistics Solutions
 
You Got Your Engineering in my Data Science - Addressing the Reproducibility ...
You Got Your Engineering in my Data Science - Addressing the Reproducibility ...You Got Your Engineering in my Data Science - Addressing the Reproducibility ...
You Got Your Engineering in my Data Science - Addressing the Reproducibility ...jonbodner
 
Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...John Kinmonth
 

Was ist angesagt? (17)

What does "data publication" mean to researchers?
What does "data publication" mean to researchers?What does "data publication" mean to researchers?
What does "data publication" mean to researchers?
 
Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...Measure for Measure: The role of metrics in assessing research performance - ...
Measure for Measure: The role of metrics in assessing research performance - ...
 
preSCORE Presentation by Adam Etkin for Highwire Fall 2013 Meeting
preSCORE Presentation by Adam Etkin for Highwire Fall 2013 MeetingpreSCORE Presentation by Adam Etkin for Highwire Fall 2013 Meeting
preSCORE Presentation by Adam Etkin for Highwire Fall 2013 Meeting
 
Confidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power AnalysisConfidently Conduct and Write Your Power Analysis
Confidently Conduct and Write Your Power Analysis
 
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
 
Scientific research: What Anna Karenina teaches us about useful negative results
Scientific research: What Anna Karenina teaches us about useful negative resultsScientific research: What Anna Karenina teaches us about useful negative results
Scientific research: What Anna Karenina teaches us about useful negative results
 
General Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative MethodologyGeneral Tips to Fast-Track Your Quantitative Methodology
General Tips to Fast-Track Your Quantitative Methodology
 
Snowball Metrics: University-owned Benchmarking to Reveal Strengths within Al...
Snowball Metrics: University-owned Benchmarking to Reveal Strengths within Al...Snowball Metrics: University-owned Benchmarking to Reveal Strengths within Al...
Snowball Metrics: University-owned Benchmarking to Reveal Strengths within Al...
 
Risk Based De-identification for Sharing Health Data
Risk Based De-identification for Sharing Health DataRisk Based De-identification for Sharing Health Data
Risk Based De-identification for Sharing Health Data
 
LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11
LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11
LQAS: Pitfalls, Controvery & Addressing Concerns_Luna, Nitkin, Yaggy_5.10.11
 
National archetype governance in Norway
National archetype governance in NorwayNational archetype governance in Norway
National archetype governance in Norway
 
Dawson checklist presentation
Dawson checklist presentationDawson checklist presentation
Dawson checklist presentation
 
Lecture 1 ba 1 overview
Lecture 1 ba 1 overviewLecture 1 ba 1 overview
Lecture 1 ba 1 overview
 
RESUME_ZEYU XIE
RESUME_ZEYU XIERESUME_ZEYU XIE
RESUME_ZEYU XIE
 
Selecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative StudySelecting an Ideal Survey Instrument for a Quantitative Study
Selecting an Ideal Survey Instrument for a Quantitative Study
 
You Got Your Engineering in my Data Science - Addressing the Reproducibility ...
You Got Your Engineering in my Data Science - Addressing the Reproducibility ...You Got Your Engineering in my Data Science - Addressing the Reproducibility ...
You Got Your Engineering in my Data Science - Addressing the Reproducibility ...
 
Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...Umm, how did you get that number? Managing Data Integrity throughout the Data...
Umm, how did you get that number? Managing Data Integrity throughout the Data...
 

Andere mochten auch

Developing a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management ProtocolsDeveloping a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management ProtocolsLEARN Project
 
Opening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and ChallengesOpening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and ChallengesLEARN Project
 
The Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARNThe Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARNLEARN Project
 
Managing Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challengesManaging Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challengesLEARN Project
 
LEARN Project: The Story So Far
LEARN Project: The Story So FarLEARN Project: The Story So Far
LEARN Project: The Story So FarLEARN Project
 
Data for Development in the Caribbean
Data for Development in the CaribbeanData for Development in the Caribbean
Data for Development in the CaribbeanLEARN Project
 
The Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research OrganisationsThe Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research OrganisationsLEARN Project
 
Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López LEARN Project
 
Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...
Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...
Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...LEARN Project
 
Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...
Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...
Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...LEARN Project
 
Gestion de Datos de Investigacion
Gestion de Datos de InvestigacionGestion de Datos de Investigacion
Gestion de Datos de InvestigacionLEARN Project
 
Research Data Management in São Paulo by Fabio Kon FAPESP
Research Data Management in São Paulo by Fabio Kon FAPESPResearch Data Management in São Paulo by Fabio Kon FAPESP
Research Data Management in São Paulo by Fabio Kon FAPESPLEARN Project
 
LEARN: Helping Institutions to Build and Implement a Policy on research Data
LEARN: Helping Institutions to Build and Implement a Policy on research DataLEARN: Helping Institutions to Build and Implement a Policy on research Data
LEARN: Helping Institutions to Build and Implement a Policy on research DataLEARN Project
 
Datos Abiertos de Investigacion - Caso Mexico
Datos Abiertos de Investigacion - Caso MexicoDatos Abiertos de Investigacion - Caso Mexico
Datos Abiertos de Investigacion - Caso MexicoLEARN Project
 
TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...
TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...
TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...LEARN Project
 
Open Data in the world of Science” by Dr. Claudio Gutiérrez
Open Data in the world of Science” by Dr. Claudio GutiérrezOpen Data in the world of Science” by Dr. Claudio Gutiérrez
Open Data in the world of Science” by Dr. Claudio GutiérrezLEARN Project
 
Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)
Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)
Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)LEARN Project
 
El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...
El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...
El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...LEARN Project
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for librariesLEARN Project
 

Andere mochten auch (20)

Developing a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management ProtocolsDeveloping a Framework for Research Data Management Protocols
Developing a Framework for Research Data Management Protocols
 
Opening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and ChallengesOpening Research Data in EU Universities: Policies, Motivators and Challenges
Opening Research Data in EU Universities: Policies, Motivators and Challenges
 
The Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARNThe Needs of Stakeholders in the RDM Process - the role of LEARN
The Needs of Stakeholders in the RDM Process - the role of LEARN
 
Managing Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challengesManaging Research Data in the Caribbean: Good practices and challenges
Managing Research Data in the Caribbean: Good practices and challenges
 
LEARN Project: The Story So Far
LEARN Project: The Story So FarLEARN Project: The Story So Far
LEARN Project: The Story So Far
 
Data for Development in the Caribbean
Data for Development in the CaribbeanData for Development in the Caribbean
Data for Development in the Caribbean
 
The Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research OrganisationsThe Data Deluge: the Role of Research Organisations
The Data Deluge: the Role of Research Organisations
 
Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López Open Data in a Big World by Fernando Ariel López
Open Data in a Big World by Fernando Ariel López
 
Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...
Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...
Avances en torno a la Ley 26.899 e iniciativa regional de datos primarios de...
 
Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...
Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...
Gestion de datos para la investigacion: el caso peruano by Edward Mezones, Su...
 
Gestion de Datos de Investigacion
Gestion de Datos de InvestigacionGestion de Datos de Investigacion
Gestion de Datos de Investigacion
 
CENTRO DE DATOS
CENTRO DE DATOSCENTRO DE DATOS
CENTRO DE DATOS
 
Research Data Management in São Paulo by Fabio Kon FAPESP
Research Data Management in São Paulo by Fabio Kon FAPESPResearch Data Management in São Paulo by Fabio Kon FAPESP
Research Data Management in São Paulo by Fabio Kon FAPESP
 
LEARN: Helping Institutions to Build and Implement a Policy on research Data
LEARN: Helping Institutions to Build and Implement a Policy on research DataLEARN: Helping Institutions to Build and Implement a Policy on research Data
LEARN: Helping Institutions to Build and Implement a Policy on research Data
 
Datos Abiertos de Investigacion - Caso Mexico
Datos Abiertos de Investigacion - Caso MexicoDatos Abiertos de Investigacion - Caso Mexico
Datos Abiertos de Investigacion - Caso Mexico
 
TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...
TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...
TALLER LEARN SOBRE DATOS DE INVESTIGACIÓN IMPLEMENTACIÓN DE POLÍTICAS Y ESTRA...
 
Open Data in the world of Science” by Dr. Claudio Gutiérrez
Open Data in the world of Science” by Dr. Claudio GutiérrezOpen Data in the world of Science” by Dr. Claudio Gutiérrez
Open Data in the world of Science” by Dr. Claudio Gutiérrez
 
Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)
Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)
Conicyt Y Mandato OECD by Patricia Muñoz, CONICYT (Chile)
 
El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...
El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...
El camino hacia la Gestión de Datos de Investigación: la experiencia de Perú ...
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
 

Ähnlich wie About Data From A Machine Learning Perspective

Cadth 2015 b3 symp 2015 b3 - rr panel - shannon kelly
Cadth 2015 b3 symp 2015   b3 - rr panel - shannon kellyCadth 2015 b3 symp 2015   b3 - rr panel - shannon kelly
Cadth 2015 b3 symp 2015 b3 - rr panel - shannon kellyCADTH Symposium
 
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...Crossref
 
The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...NASIG
 
Research protocol & Systematic Review.docx
Research protocol & Systematic Review.docxResearch protocol & Systematic Review.docx
Research protocol & Systematic Review.docxAmaraZahid
 
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big DataBruce Kozuma
 
Screening and Selecting Studies - Moriam Chibuzor
Screening and Selecting Studies - Moriam ChibuzorScreening and Selecting Studies - Moriam Chibuzor
Screening and Selecting Studies - Moriam ChibuzorACSRM
 
Inside Lean Kanban (#lkuk14 keynote)
Inside Lean Kanban (#lkuk14 keynote)Inside Lean Kanban (#lkuk14 keynote)
Inside Lean Kanban (#lkuk14 keynote)Mike Burrows
 
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...Matt Stubbs
 
Research data spring: CREAM
Research data spring: CREAMResearch data spring: CREAM
Research data spring: CREAMJisc RDM
 
Important.ppt
Important.pptImportant.ppt
Important.pptAcademics
 
SystematicreviewJan2017.pptx
SystematicreviewJan2017.pptxSystematicreviewJan2017.pptx
SystematicreviewJan2017.pptxssuserf2f9e1
 
Systematic literature review | Meta analysis | Retrospective versus
Systematic literature review | Meta analysis | Retrospective versusSystematic literature review | Meta analysis | Retrospective versus
Systematic literature review | Meta analysis | Retrospective versusPubrica
 
ACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignAmanda Dinscore
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementNick Lynch
 
Willmers&King open con2016-ct-14.11.16
Willmers&King open con2016-ct-14.11.16Willmers&King open con2016-ct-14.11.16
Willmers&King open con2016-ct-14.11.16Michelle Willmers
 
Botor_project_research_methodology_2016
Botor_project_research_methodology_2016Botor_project_research_methodology_2016
Botor_project_research_methodology_2016Shayne Botor
 

Ähnlich wie About Data From A Machine Learning Perspective (20)

Cadth 2015 b3 symp 2015 b3 - rr panel - shannon kelly
Cadth 2015 b3 symp 2015   b3 - rr panel - shannon kellyCadth 2015 b3 symp 2015   b3 - rr panel - shannon kelly
Cadth 2015 b3 symp 2015 b3 - rr panel - shannon kelly
 
RBM Links - Sep15
RBM Links - Sep15RBM Links - Sep15
RBM Links - Sep15
 
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
 
Systematic Literature Review
Systematic Literature ReviewSystematic Literature Review
Systematic Literature Review
 
The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...The New Dimensions in Scholcomm: How a global scholarly community collaborati...
The New Dimensions in Scholcomm: How a global scholarly community collaborati...
 
Research protocol & Systematic Review.docx
Research protocol & Systematic Review.docxResearch protocol & Systematic Review.docx
Research protocol & Systematic Review.docx
 
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data2018 Bio-IT World Agile in Wet Labs Speeds Big Data
2018 Bio-IT World Agile in Wet Labs Speeds Big Data
 
Screening and Selecting Studies - Moriam Chibuzor
Screening and Selecting Studies - Moriam ChibuzorScreening and Selecting Studies - Moriam Chibuzor
Screening and Selecting Studies - Moriam Chibuzor
 
Inside Lean Kanban (#lkuk14 keynote)
Inside Lean Kanban (#lkuk14 keynote)Inside Lean Kanban (#lkuk14 keynote)
Inside Lean Kanban (#lkuk14 keynote)
 
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
Big Data LDN 2017: Preserving The Key Principles Of Academic Research In A Bu...
 
Research data spring: CREAM
Research data spring: CREAMResearch data spring: CREAM
Research data spring: CREAM
 
Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"Strasser "Effective data management and its role in open research"
Strasser "Effective data management and its role in open research"
 
Important.ppt
Important.pptImportant.ppt
Important.ppt
 
Rapid Reviews 101
Rapid Reviews 101 Rapid Reviews 101
Rapid Reviews 101
 
SystematicreviewJan2017.pptx
SystematicreviewJan2017.pptxSystematicreviewJan2017.pptx
SystematicreviewJan2017.pptx
 
Systematic literature review | Meta analysis | Retrospective versus
Systematic literature review | Meta analysis | Retrospective versusSystematic literature review | Meta analysis | Retrospective versus
Systematic literature review | Meta analysis | Retrospective versus
 
ACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web Design
 
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge ManagementCurlew Research Brussels 2014 Electronic Data & Knowledge Management
Curlew Research Brussels 2014 Electronic Data & Knowledge Management
 
Willmers&King open con2016-ct-14.11.16
Willmers&King open con2016-ct-14.11.16Willmers&King open con2016-ct-14.11.16
Willmers&King open con2016-ct-14.11.16
 
Botor_project_research_methodology_2016
Botor_project_research_methodology_2016Botor_project_research_methodology_2016
Botor_project_research_methodology_2016
 

About Data From A Machine Learning Perspective

  • 1. ABOUT DATA FROM A MACHINE LEARNING PERSPECTIVE. Oriol Pujol
  • 2. TWO FLAVOURS AND ONE PRESENTATION
      • To share experiences in openness from the machine learning community.
      • To propose potential solutions to the problem of sharing sensitive data.
  • 4. THE ROLE OF DATA IN MACHINE LEARNING
      • Machine learning is sometimes called learning from data: it is about building (coding) algorithms that learn from data.
      • Data is a key component in most research, and it is especially crucial in machine learning.
  • 5. EXAMPLES OF TASKS IN MACHINE LEARNING: automatic captioning
  • 6. CURRENT MACHINE LEARNING RESEARCH FOCUS
      • Improving results in terms of generalization, robustness, speed, and interpretability.
      • Comparison with other techniques is fundamental.
      • Many different datasets are needed to assess each property under study, e.g., large datasets to assess speed and big-data settings.
      • Sometimes the application of a technique depends largely on the domain, e.g., financial product forecasting or targeted gene identification.
  • 7. AS A RESEARCHER
      • DISSEMINATION: REPORTING PROCESS, CLAIMING AUTHORSHIP
      • QUALITY: COMMUNITY VALIDITY ASSESSMENT, PEER REVIEW (KPI USED BY ORGANISATIONS FOR RESEARCH QUALITY)
      • We aim to advance science: reproducible experimentation helps to eliminate individual biases.
  • 8. FOR THIS REASON WE USE PUBLISHING
      • DISSEMINATION
      • QUALITY ASSESSMENT
  • 9. FOR THIS REASON WE USE PUBLISHING
      • LONG PROCESS > 1 YEAR
      • NOT A GOOD DISSEMINATION SOURCE, OR ATTRIBUTION PROC.
      • PEER QUALITY ASSESSMENT
      • CRITICAL FEEDBACK
      • GUARD AGAINST ERRORS
      • MAJOR SOURCE FOR RANKING RESEARCHERS
      • NON-PUBLISHED RESEARCH IS NOT RESEARCH
  • 10. FOR THIS REASON WE USE PUBLISHING
      • FAIRER PROCESS
      • > 3-6 MONTHS
      • QUALITY ASSESSMENT
      • RANKING RESEARCHERS
      • NON-PUBLISHED RESEARCH IS NOT RESEARCH
  • 11. THE NIPS CASE: WHEN CONFERENCE PAPERS ARE OLD NEWS
      • IN MACHINE LEARNING 3-6 MONTHS IS TOO LONG
      • DECEMBER 5TH – 10TH, 2016
  • 12. THE NIPS CASE: WHEN CONFERENCE PAPERS ARE OLD NEWS
  • 13. 24/7 PUBLISHING CYCLE
      • IMMEDIACY, 24/7 PUBLISHING CYCLE
      • IMMEDIATE ATTRIBUTION
      • NO QUALITY ASSESSMENT
      • RANKING RESEARCHERS
      • HOW TO DISTINGUISH GOOD RESEARCH FROM FRAUD
  • 16. AN EXAMPLE: PLOS ONE
      • FOCUS SHIFT: METHODOLOGICAL CORRECTNESS, DATA
  • 19. MACHINE LEARNING DATASET REPOSITORIES
      • Data Sets - Raw data as a collection of similarly structured objects.
      • Material and Methods - Descriptions of the computational pipeline.
      • Learning Tasks - Learning tasks defined on raw data.
      • Challenges - Collections of tasks which have a particular theme.
  • 20. WHEN DESCRIPTION AND DATA ARE NOT ENOUGH
      • A 2010 proposal by the U.S. Securities and Exchange Commission (SEC) would require financial products to include Python code.
  • 21. WHEN DESCRIPTION AND DATA ARE NOT ENOUGH
  • 23. BUT … THE DEVIL IS IN THE DETAILS
  • 28. BOTTOM LINE
      1. SCIENCE IS MORE THAN A PUBLICATION
      2. AT ITS CORE LIES REPRODUCIBILITY… THIS INVOLVES DATA
      3. BUT DATA ALONE IS NOT ENOUGH… CODE AND PRECISE EXPERIMENT REPRODUCIBILITY ARE ALSO REQUIRED
      4. DATA AND PROGRAMMING LITERACY ARE NEEDED
  • 29. BUT … SOMETHING ELSE ABOUT QUALITY ASSESSMENT: THE OPEN REVIEW CASE
  • 30. THE RULES
      • Submissions are posted on arXiv before being submitted to the conference.
      • The program committee assigns blind reviewers (as always), marked as designated reviewers.
      • Anyone can ask the program chairs for permission to become an anonymous designated reviewer (open bidding).
      • Open commenters have to use their real names, linked with their Google Scholar profiles.
      • Papers that are presented in the workshop track or are not accepted are considered non-archival and may be submitted elsewhere (modified or not), although the ICLR site will maintain the reviews, the comments, and the links to the arXiv versions.
  • 31. SOME POINTS ABOUT OPEN REVIEW
      • Reading the answers from others is valuable.
      • Rejected papers have influence.
      • It may become a popularity contest.
  • 32. SHARING DATA IN THE AGE OF DEEP LEARNING
  • 41. CONCLUSIONS
      • Research is based on reproducibility: documentation, data, code, and details.
      • Machine learning methods can help with the challenge of democratizing data.