Data Collection Methods
Pros and Cons of Primary and
Secondary Data
Where do data come
from?
   We’ve seen our data for this lab, all
    nice and collated in a database – from:
    – Insurance companies (claims,
      medications, procedures, diagnoses, etc.)
    – Firms (demographic data, productivity
      data, etc.)
Where do data come
from?
   Take a step back – if we’re starting
    from scratch, how do we collect / find
    data?
    – Secondary data
    – Primary data
Secondary Data

   Secondary data – data someone else
    has collected
    – This is what you were looking for in your
      assignment.
Secondary Data –
Examples of Sources
   County health departments
   Vital Statistics – birth, death certificates
   Hospital, clinic, school nurse records
   Private and foundation databases
   City and county governments
   Surveillance data from state government
    programs
   Federal agency statistics - Census, NIH, etc.
Secondary Data –
Limitations
   What did you find on the frustrating
    side as you looked for data on the
    state’s websites?
Secondary Data –
Limitations
   When was it collected? For how long?
    – May be out of date for what you want to
      analyze.
    – May not have been collected long enough
      for detecting trends.
    – E.g. Have new anticorruption laws
      impacted Russia’s government
      accountability ratings?
Secondary Data –
Limitations
   Is the data set complete?
    – There may be missing information on
      some observations
    – Unless such missing information is caught
      and corrected for, analysis may be biased.
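A completeness check along these lines can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: the records, the field names (`age`, `income`, `county`), and the helper `missing_share` are all hypothetical, and `None` is assumed to mark a missing value.

```python
# Hypothetical records; None is assumed to mark a missing value.
records = [
    {"age": 34, "income": 52000, "county": "A"},
    {"age": None, "income": 61000, "county": "B"},
    {"age": 45, "income": None, "county": "A"},
    {"age": 29, "income": None, "county": None},
]


def missing_share(rows, field):
    """Return the fraction of rows in which `field` is missing (None or absent)."""
    if not rows:
        return 0.0
    missing = sum(1 for row in rows if row.get(field) is None)
    return missing / len(rows)


# Share of missing values for each variable of interest.
shares = {field: missing_share(records, field) for field in ("age", "income", "county")}

for field, share in shares.items():
    print(f"{field}: {share:.0%} missing")
```

A variable with a high missing share is a signal to look at why values are missing before running any analysis on it.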
Secondary Data –
Limitations
   Are there confounding problems?
    – Sample selection bias?
    – Source choice bias?
    – In time series, did some observations
      drop out over time?
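One way to check for drop-out in a time series is to compare which units appear in the first and last periods. The sketch below assumes a hypothetical panel stored as a mapping from year to the set of unit IDs observed that year; the IDs and the helper names `dropouts` and `retention_rate` are illustrative only.

```python
# Hypothetical panel: each year maps to the set of unit IDs observed that year.
observed = {
    2019: {"u1", "u2", "u3", "u4"},
    2020: {"u1", "u2", "u4"},
    2021: {"u1", "u4"},
}


def dropouts(panel):
    """Return units present in the first period but absent from the last."""
    periods = sorted(panel)
    return panel[periods[0]] - panel[periods[-1]]


def retention_rate(panel):
    """Return the share of first-period units still observed in the last period."""
    periods = sorted(panel)
    first = panel[periods[0]]
    if not first:
        return 0.0
    return len(first & panel[periods[-1]]) / len(first)


lost = dropouts(observed)
rate = retention_rate(observed)
print(sorted(lost), rate)
```

If the units that drop out differ systematically from those that stay, estimates based on the remaining units may not describe the original population.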
Secondary Data –
Limitations
   Are the data consistent/reliable?
    – Did variables drop out over time?
    – Did variables change in definition over
      time?
          E.g. number of years of education versus
           highest degree obtained.
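A quick consistency check is to compare the variable lists of two collection waves. The sketch below is only an illustration: the wave years, the variable names, and the helper `variable_changes` are assumptions, loosely modeled on the education example above.

```python
# Hypothetical variable lists for two waves of the same data set.
waves = {
    2010: {"id", "age", "years_of_education"},
    2015: {"id", "age", "highest_degree"},
}


def variable_changes(earlier, later):
    """List variables that disappeared and variables that appeared between waves."""
    return {
        "dropped": sorted(earlier - later),
        "added": sorted(later - earlier),
    }


changes = variable_changes(waves[2010], waves[2015])
print(changes)
```

A variable that is dropped while a similar one is added often indicates a change in definition rather than a new measurement, so the two should not be pooled without checking the documentation.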
Secondary Data –
Limitations
   Is the information exactly what you need?
    – In some cases, you may have to use “proxy
      variables” – variables that approximate
      something you really wanted to measure. Are
      they reliable? Are they correlated with what
      you actually want to measure?
    – E.g. gauging student interest in U.W. by their
      ranking on FAFSA – subject to gamesmanship.
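If a small validation sample records both the proxy and the quantity of real interest, their correlation can be checked directly. The sketch below computes a Pearson correlation by hand; the two series are made-up numbers and the helper `pearson` is illustrative, not part of the original lab.

```python
import math


def pearson(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical validation sample: proxy values and the measure of real interest.
proxy = [1, 2, 3, 4, 5]
target = [2.1, 3.9, 6.2, 8.1, 9.8]

r = pearson(proxy, target)
print(f"correlation between proxy and target: {r:.3f}")
```

A strong correlation in a validation sample does not rule out gamesmanship elsewhere, but a weak one is a clear warning against using the proxy.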
Secondary Data –
Advantages
   No need to reinvent the wheel.
    – If someone has already found the data,
      take advantage of it.
Secondary Data –
Advantages
   It will save you money.
    – Even if you have to pay for access, it is
      often cheaper than collecting your own
      data. (More on this later.)
Secondary Data –
Advantages
   It will save you time.
    – Primary data collection is very
      time-consuming. (More on this later, too!)
Secondary Data –
Advantages
   It may be very accurate.
    – Especially when a government agency
      has collected the data, a great deal of
      time and money likely went into it, so
      it is probably highly accurate.
Secondary Data –
Advantages
   It has great exploratory value
    – Exploring research questions and
      formulating hypotheses to test.
Primary Data

   Primary data – data you collect
Primary Data - Examples

   Surveys
   Focus groups
   Questionnaires
   Personal interviews
   Experiments and observational studies
Primary Data -
Limitations
   Do you have the time and money for:
    – Designing your collection instrument?
    – Selecting your population or sample?
    – Pretesting/piloting the instrument to work
      out sources of bias?
    – Administration of the instrument?
    – Entry/collation of data?
Primary Data -
Limitations
   Uniqueness
    – May not be able to compare to other
      populations
Primary Data -
Limitations
   Researcher error
    – Sample bias
    – Other confounding factors
Data collection choice

   What you must ask yourself:
    – Will the data answer my research
      question?
Data collection choice

   To answer that
    – You must first decide what your research
      question is
    – Then you need to decide what
      data/variables are needed to scientifically
      answer the question
Data collection choice

   If those data exist in secondary form,
    then use them to the extent you can,
    keeping in mind limitations.
   But if they do not, and you are able to
    fund primary collection, then primary
    collection is the method of choice.
