SlideShare ist ein Scribd-Unternehmen logo
1 von 43
RATIONALE
The storage in a smart
 phone would cost
 (in 2011 dollars)

  $7,571 in 2001
 $212,040 in 1991
 $3,796,800 in 1981
$56,168,800 in 1971
$1,233,179,000 in 1961
The Explosion of Scientific Data

Because of the massive decline in the cost of data
collection, storage, and analysis, the quantity of scientific
data being collected is growing at an extraordinary pace

 New opportunities for analysis
 New methods are being applied
 Marked acceleration in the pace of discovery
The Big Challenges

The quantity of scientific data is exploding, but we lack
basic infrastructure to maintain them or capitalize on
opportunities for analysis and discovery

 Most scientific data is at risk of loss
 Most scientific data is inaccessible
 Metadata are usually incomplete and inadequate
 Little interoperability across datasets or data types
 Data are trapped in disciplinary silos
Why Population and Environment?

  Massive Planetary Change
  between 1950 and 2000
 Population
   population doubled
   economy grew seven-fold
 Agriculture
   food consumption tripled
   water use tripled
 Energy use
     fossil fuels increased four-fold
The Temporal Dimension
                                                                           TerraPop
                                      World Population, 1000-2000
                        6000


                        5000
Population (millions)




                        4000


                        3000


                        2000


                        1000


                           0
                           1000     1200      1400          1600    1800       2000
                                                     Year
TerraPop Goals


Provide an organizational and technical framework to
preserve, integrate, disseminate, and analyze global-
scale spatiotemporal data describing population and the
environment.
Primary Objective
Lower barriers to conducting interdisciplinary human-
environment interactions research by making data with
different formats from different scientific domains easily
interoperable

   Population microdata
   Government land-use statistics
   Land cover data from satellite imagery
   Historical climate records (temperature, precipitation,
    cloud cover)
TerraPop Collaborating Organizations
Project Elements

1. Archival Development
2. Data Integration, Dissemination, and
   Analysis
3. Education and Outreach
4. Organizational Development
1. Archival Development
 Collect, integrate, describe, and
    preserve data describing
      changes in the world’s
  population and environment.
Data Collection:
   Initial Population Data Sources

 Population microdata from censuses
 Focus on Brazil and Malawi
Age                   Birthplace
                          Sex                    Mother’s birthplace
           Relationship   Race                        Occupation
H910000240000000088001001000220100
P910000020101032120010010010011504                             Population
P910000010201036220010010010011999
P910201000301011220060010010011999                             Microdata
P 9 1 0 2 0 1 0 0 0 3 0 1 0 0 9 1 2 0 0 6 0 0 1 0 0 1 0 0 1 1 9 9 9 Geographic and housing
                                                                Structure
P 9 1 0 2 0 1 0 0 0 3 0 1 0 0 7 1 2 0 0 6 0 0 1 0 0 1 0 0 1 1 9 9 9 characteristics
P910201000301006120060010010011999
P910201000301004220060010010011999                                     Household record
P910201000301003220060010010011999
P910201000301002220060010010011999                                     (shaded) followed
H910000240000000088001001000110100                                     by a person record
P910000020101030110010290510511310
P910000010201021210010290290171999
                                                                       for each member
P910201000301001110060010290291999                                     of the household
H910000240000000088001001000220100
P910000020101045120010010010011100
P910000010201025220010010010011820                                     For each type of
P910201000301007220060010010011999
H910000240000000088001001000220100                                     record, columns
P910000020101049120010010010011100                                     correspond to
P910000010201049220010010010011820
P910201000301019220060010010011820
                                                                       specific variables
P910201000301015220060010010012820
The Power of Microdata
 Customized measures: Variables based on combined
  characteristics of family and household members,
  capitalizing on the hierarchical structure of the data
 Multivariate analysis: Analyze many individual,
  household, and community characteristics simultaneously
 Interoperability: Harmonize data across time and space

  For each person, detailed information about geographic
                 Age classification for school enrollment
  location, economic activities,U.S. Census for School Enrollment
                      Tablepublished educational attainment,
                         in
                            2. Age Classifications
  literacy, fertility history, child mortality, migration,Imputed
                              1970        1990    Common   place
  of former residence, marital status, consensual unions,
                               3-4         3-4        3-4     3-4
                               5-6         5-6        5-6
  family composition, disabilities, water supply, sewage,     5-6
                              7-14         7-9       7-17    7-14
  building materials (floor, roof, etc.), and many other
                             14-15       10-14             14-15
  characteristics.           16-17       15-17             16-17
Participating Countries
Facebook has data on      We have data on
  800 million people      912 million people




                       USA                 165
                       International       481
                       Historical          266
                       Total               912
Data Collection:
Initial Sources of Environmental Data

   Land cover data from satellite images
    (Global Land Cover 2000)
   Land use data from satellites and government
    records (Global Landscapes Initiative)
   Climate data from weather stations (WorldClim)
Land Cover Data




    Global Land Cover 2000
 Grid of 1 km sq cells
 Cell values are dominant
  land cover
 Derived from satellite
  images
Land Use Data
Global Landscapes
Initiative / Farming the
World
 Grid of 10 km cells
 Values are % of cell used for
  given purpose
 Derived from satellite and
  agricultural census data




                   Additional data sets for 175 specific crops and yields
Climate Data


WorldClim
 Grid of 1 km cells
 Interpolated from climate
  station data
 Incorporate data from
  1950-2000
2. Integration, Dissemination, and Analysis
          Create tools and procedures to
           integrate, disseminate, and
             analyze population and
               environmental data.
Three Source Data Formats


               Microdata:
               Characteristics of individuals
               and households

Area-level data:
Characteristics of places defined
by administrative boundaries



              Raster data:
              Values tied to spatial
              coordinates
Three Output Formats
1.   Census microdata with attached characteristics
     describing land use, land cover, and climate for local
     areas
2.   Aggregate data for administrative districts with tabulated
     population data and environmental characteristics
3.   Gridded data with characteristics of population and
     environment
TerraPop Prototype Data Transformations
 Input Formats            Output Formats


   Microdata                Microdata




  Areal data                Areal data




  Raster data              Raster data
Analysis tool needed for microdata conversion
   Input Formats             Output Formats


     Microdata                 Microdata




     Areal data                Areal data




    Raster data               Raster data
TerraPop Data Integration
Input Formats             Output Formats

                             Microdata
 Microdata                with characteristics
                          of surrounding area


                             Area-level
 Area-level                with summaries of
   data                      microdata and
                               raster data


                            Raster data
                              with gridded
 Raster data               representations of
                             microdata and
                             area-level data
Integration – Microdata Output
Census microdata with attached characteristics describing
land use, land cover, and climate for local areas

 Individuals and households
 with their environmental
 and social context
Integration – Area-Level Output
      Aggregate data for
      administrative districts
      with tabulated population
      data and environmental
      characteristics




               Mean Ann. Max. Ann. Rent, Rent, Own, Own, Vacant, Vacant,
County ID      Temp.     Precip.   Rural Urban Rural Urban Rural Urban
G17003100001         21.2     768   3129 1063 637      365     34     33
G17003100002         23.4     589   2949 1075 1469     717      0      0
G17003100003         24.3     867   3418 1589 1108     617      0      0
G17003100004         21.5     943   1882 425 202       142    123      0
G17003100005         24.1     867   2416 572 426       197    189      0
G17003100006         24.4     697   2560 934 950       563    220     14
G17003100007         25.6     701   2126 653 321       215    209     46
Integration – Raster Output
Gridded data with characteristics of population and
environment




Raster format
compatible with
environmental
models
Data Access System
Browse and select variables
Data Access System
Browse and select variables
Data Access System
Choose output format
Data Access System
Choose output format
Data Access System
Select data transformation options
TerraPop Prototype
 Data to be included
   Population microdata for Brazil (1960-2000) and Malawi (1998 &
    2008)
   Aggregate population data at first and second administrative levels
    for Brazil and Malawi
   Land cover, agricultural land use, and climate data

 Timeline
   Available for beta testing: May 2013
   Initial public version available by the end of 2013
3. Education and Outreach
 Engage the scientific community
         and the public
Education and Outreach
    for the Research Community

 Curriculum of web-based training
 Workshops at conferences
 User support
 Community tools to promote user engagement
Public Education and Outreach
 Partner with educational software developers
   Fathom
 Integration with museum programs
   Science on a Sphere
4. Organizational Development
      Develop structures to ensure
    long-run financial and technical
             sustainability.
Sustainability

Create a sustainable organization that can guarantee
preservation and access over multiple decades


   Organizational sustainability
   Financial sustainability
   Technological sustainability
agriculture


            demography                  transportation



  criminology                                        hazards

                  Population        Climate
                         Terra
pollution
                     Populus
                   Land Use  Land Cover                  health


   economics
                                                   politics
                bio-
              diversity
                                 hydrology
Terra Populus Overview

Weitere ähnliche Inhalte

Was ist angesagt?

Analysis of Sand Dunes Accumulation using Remote Sensing and GIS
Analysis of Sand Dunes Accumulation using Remote Sensing and GISAnalysis of Sand Dunes Accumulation using Remote Sensing and GIS
Analysis of Sand Dunes Accumulation using Remote Sensing and GIS
ijtsrd
 
Use of remote sensing for generation of agricultural statistics
Use of remote sensing for generation of agricultural statisticsUse of remote sensing for generation of agricultural statistics
Use of remote sensing for generation of agricultural statistics
jagannatharao
 
Land use Land Cover Highlight for Jibia Local Government, Nigeria
Land use Land Cover Highlight for Jibia Local Government, NigeriaLand use Land Cover Highlight for Jibia Local Government, Nigeria
Land use Land Cover Highlight for Jibia Local Government, Nigeria
ijtsrd
 
CROP MONITORING HILLARY
CROP MONITORING HILLARYCROP MONITORING HILLARY
CROP MONITORING HILLARY
Hillary Mugiyo
 
Spatial analysis of flooding in delta state,Nigeria
Spatial analysis of flooding in delta state,NigeriaSpatial analysis of flooding in delta state,Nigeria
Spatial analysis of flooding in delta state,Nigeria
Glory Enaruvbe
 

Was ist angesagt? (8)

Analysis of Sand Dunes Accumulation using Remote Sensing and GIS
Analysis of Sand Dunes Accumulation using Remote Sensing and GISAnalysis of Sand Dunes Accumulation using Remote Sensing and GIS
Analysis of Sand Dunes Accumulation using Remote Sensing and GIS
 
Spatial analysis of flood disaster in Delta state, Nigeria
Spatial analysis of flood disaster in Delta state, NigeriaSpatial analysis of flood disaster in Delta state, Nigeria
Spatial analysis of flood disaster in Delta state, Nigeria
 
use of gis and remote sensing in wildlife and forestry
use of gis and remote sensing in wildlife and forestryuse of gis and remote sensing in wildlife and forestry
use of gis and remote sensing in wildlife and forestry
 
Use of remote sensing for generation of agricultural statistics
Use of remote sensing for generation of agricultural statisticsUse of remote sensing for generation of agricultural statistics
Use of remote sensing for generation of agricultural statistics
 
Land use Land Cover Highlight for Jibia Local Government, Nigeria
Land use Land Cover Highlight for Jibia Local Government, NigeriaLand use Land Cover Highlight for Jibia Local Government, Nigeria
Land use Land Cover Highlight for Jibia Local Government, Nigeria
 
CROP MONITORING HILLARY
CROP MONITORING HILLARYCROP MONITORING HILLARY
CROP MONITORING HILLARY
 
Spatial analysis of flooding in delta state,Nigeria
Spatial analysis of flooding in delta state,NigeriaSpatial analysis of flooding in delta state,Nigeria
Spatial analysis of flooding in delta state,Nigeria
 
Remote sensing in agriculture
Remote sensing in agricultureRemote sensing in agriculture
Remote sensing in agriculture
 

Ähnlich wie Terra Populus Overview

Designing & Managing Resilient Landscapes
Designing & Managing Resilient LandscapesDesigning & Managing Resilient Landscapes
Designing & Managing Resilient Landscapes
Erick Fernandes
 
Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...
Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...
Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...
J. Kevin Byrne
 

Ähnlich wie Terra Populus Overview (20)

Terra Populus: Integrated Data on Population and Environment
Terra Populus: Integrated Data on Population and EnvironmentTerra Populus: Integrated Data on Population and Environment
Terra Populus: Integrated Data on Population and Environment
 
Digital Soil Mapping by Ronald Vargas Rojas
Digital Soil Mapping by Ronald Vargas RojasDigital Soil Mapping by Ronald Vargas Rojas
Digital Soil Mapping by Ronald Vargas Rojas
 
Designing & Managing Resilient Landscapes
Designing & Managing Resilient LandscapesDesigning & Managing Resilient Landscapes
Designing & Managing Resilient Landscapes
 
Geospatial Tools for Targeting and Prioritisation in Agriculture
Geospatial Tools for Targeting and Prioritisation in AgricultureGeospatial Tools for Targeting and Prioritisation in Agriculture
Geospatial Tools for Targeting and Prioritisation in Agriculture
 
Black Soils and Digital Soil Mapping in Canada - Xiaoyuan Geng, Brian McConke...
Black Soils and Digital Soil Mapping in Canada - Xiaoyuan Geng, Brian McConke...Black Soils and Digital Soil Mapping in Canada - Xiaoyuan Geng, Brian McConke...
Black Soils and Digital Soil Mapping in Canada - Xiaoyuan Geng, Brian McConke...
 
bnwebinar07-24-13_1.pptx
bnwebinar07-24-13_1.pptxbnwebinar07-24-13_1.pptx
bnwebinar07-24-13_1.pptx
 
bnwebinar07-24-13_1 (2).pptx
bnwebinar07-24-13_1 (2).pptxbnwebinar07-24-13_1 (2).pptx
bnwebinar07-24-13_1 (2).pptx
 
RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus RDAP14: David Van Riper of Terra Populus
RDAP14: David Van Riper of Terra Populus
 
Application of web ontology to harvest estimation of rice in Thailand
Application of web ontology to harvest estimation of rice in ThailandApplication of web ontology to harvest estimation of rice in Thailand
Application of web ontology to harvest estimation of rice in Thailand
 
Application of web ontology to harvest estimation of rice in thailand
Application of web ontology to harvest estimation of rice in thailandApplication of web ontology to harvest estimation of rice in thailand
Application of web ontology to harvest estimation of rice in thailand
 
A Water Quality Valuation Approach To Strategic Planning
A Water Quality Valuation Approach To Strategic PlanningA Water Quality Valuation Approach To Strategic Planning
A Water Quality Valuation Approach To Strategic Planning
 
Vital Signs: An integrated monitoring system for agricultural landscapes
Vital Signs: An integrated monitoring system for agricultural landscapesVital Signs: An integrated monitoring system for agricultural landscapes
Vital Signs: An integrated monitoring system for agricultural landscapes
 
The status, research progress, and new application of soil inventory in Japan...
The status, research progress, and new application of soil inventory in Japan...The status, research progress, and new application of soil inventory in Japan...
The status, research progress, and new application of soil inventory in Japan...
 
A knowledge-based model for identifying and mapping tropical wetlands and pea...
A knowledge-based model for identifying and mapping tropical wetlands and pea...A knowledge-based model for identifying and mapping tropical wetlands and pea...
A knowledge-based model for identifying and mapping tropical wetlands and pea...
 
CCAFS meeting Hanoi
CCAFS meeting HanoiCCAFS meeting Hanoi
CCAFS meeting Hanoi
 
Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...
Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...
Kevin Byrne’s Presentation: Sustainability Storyboarded and Geovisualized Acr...
 
National Geospatial Program Presentation - IMIA Asia Pacific Conference
National Geospatial Program Presentation - IMIA Asia Pacific ConferenceNational Geospatial Program Presentation - IMIA Asia Pacific Conference
National Geospatial Program Presentation - IMIA Asia Pacific Conference
 
Data specifications and information structures for soils data - Stephen Halle...
Data specifications and information structures for soils data - Stephen Halle...Data specifications and information structures for soils data - Stephen Halle...
Data specifications and information structures for soils data - Stephen Halle...
 
Introduction to GSOC map
Introduction to GSOC mapIntroduction to GSOC map
Introduction to GSOC map
 
GSOC17 Introduction, Product specifications, Existing SOC maps and methodologies
GSOC17 Introduction, Product specifications, Existing SOC maps and methodologiesGSOC17 Introduction, Product specifications, Existing SOC maps and methodologies
GSOC17 Introduction, Product specifications, Existing SOC maps and methodologies
 

Kürzlich hochgeladen

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
SoniaTolstoy
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
fonyou31
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 

Kürzlich hochgeladen (20)

social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 

Terra Populus Overview

  • 1.
  • 3. The storage in a smart phone would cost (in 2011 dollars) $7,571 in 2001 $212,040 in 1991 $3,796,800 in 1981 $56,168,800 in 1971 $1,233,179,000 in 1961
  • 4. The Explosion of Scientific Data Because of the massive decline in the cost of data collection, storage, and analysis, the quantity of scientific data being collected is growing at an extraordinary pace  New opportunities for analysis  New methods are being applied  Marked acceleration in the pace of discovery
  • 5. The Big Challenges The quantity of scientific data is exploding, but we lack basic infrastructure to maintain them or capitalize on opportunities for analysis and discovery  Most scientific data is at risk of loss  Most scientific data is inaccessible  Metadata are usually incomplete and inadequate  Little interoperability across datasets or data types  Data are trapped in disciplinary silos
  • 6. Why Population and Environment? Massive Planetary Change between 1950 and 2000  Population  population doubled  economy grew seven-fold  Agriculture  food consumption tripled  water use tripled  Energy use  fossil fuels increased four-fold
  • 7. The Temporal Dimension TerraPop World Population, 1000-2000 6000 5000 Population (millions) 4000 3000 2000 1000 0 1000 1200 1400 1600 1800 2000 Year
  • 8. TerraPop Goals Provide an organizational and technical framework to preserve, integrate, disseminate, and analyze global- scale spatiotemporal data describing population and the environment.
  • 9. Primary Objective Lower barriers to conducting interdisciplinary human- environment interactions research by making data with different formats from different scientific domains easily interoperable  Population microdata  Government land-use statistics  Land cover data from satellite imagery  Historical climate records (temperature, precipitation, cloud cover)
  • 11. Project Elements 1. Archival Development 2. Data Integration, Dissemination, and Analysis 3. Education and Outreach 4. Organizational Development
  • 12. 1. Archival Development Collect, integrate, describe, and preserve data describing changes in the world’s population and environment.
  • 13. Data Collection: Initial Population Data Sources  Population microdata from censuses  Focus on Brazil and Malawi
  • 14. Age Birthplace Sex Mother’s birthplace Relationship Race Occupation H910000240000000088001001000220100 P910000020101032120010010010011504 Population P910000010201036220010010010011999 P910201000301011220060010010011999 Microdata P 9 1 0 2 0 1 0 0 0 3 0 1 0 0 9 1 2 0 0 6 0 0 1 0 0 1 0 0 1 1 9 9 9 Geographic and housing Structure P 9 1 0 2 0 1 0 0 0 3 0 1 0 0 7 1 2 0 0 6 0 0 1 0 0 1 0 0 1 1 9 9 9 characteristics P910201000301006120060010010011999 P910201000301004220060010010011999 Household record P910201000301003220060010010011999 P910201000301002220060010010011999 (shaded) followed H910000240000000088001001000110100 by a person record P910000020101030110010290510511310 P910000010201021210010290290171999 for each member P910201000301001110060010290291999 of the household H910000240000000088001001000220100 P910000020101045120010010010011100 P910000010201025220010010010011820 For each type of P910201000301007220060010010011999 H910000240000000088001001000220100 record, columns P910000020101049120010010010011100 correspond to P910000010201049220010010010011820 P910201000301019220060010010011820 specific variables P910201000301015220060010010012820
  • 15. The Power of Microdata  Customized measures: Variables based on combined characteristics of family and household members, capitalizing on the hierarchical structure of the data  Multivariate analysis: Analyze many individual, household, and community characteristics simultaneously  Interoperability: Harmonize data across time and space For each person, detailed information about geographic Age classification for school enrollment location, economic activities,U.S. Census for School Enrollment Tablepublished educational attainment, in 2. Age Classifications literacy, fertility history, child mortality, migration,Imputed 1970 1990 Common place of former residence, marital status, consensual unions, 3-4 3-4 3-4 3-4 5-6 5-6 5-6 family composition, disabilities, water supply, sewage, 5-6 7-14 7-9 7-17 7-14 building materials (floor, roof, etc.), and many other 14-15 10-14 14-15 characteristics. 16-17 15-17 16-17
  • 17. Facebook has data on We have data on 800 million people 912 million people USA 165 International 481 Historical 266 Total 912
  • 18. Data Collection: Initial Sources of Environmental Data  Land cover data from satellite images (Global Land Cover 2000)  Land use data from satellites and government records (Global Landscapes Initiative)  Climate data from weather stations (WorldClim)
  • 19. Land Cover Data Global Land Cover 2000  Grid of 1 km sq cells  Cell values are dominant land cover  Derived from satellite images
  • 20. Land Use Data Global Landscapes Initiative / Farming the World  Grid of 10 km cells  Values are % of cell used for given purpose  Derived from satellite and agricultural census data Additional data sets for 175 specific crops and yields
  • 21. Climate Data WorldClim  Grid of 1 km cells  Interpolated from climate station data  Incorporate data from 1950-2000
  • 22. 2. Integration, Dissemination, and Analysis Create tools and procedures to integrate, disseminate, and analyze population and environmental data.
  • 23. Three Source Data Formats Microdata: Characteristics of individuals and households Area-level data: Characteristics of places defined by administrative boundaries Raster data: Values tied to spatial coordinates
  • 24. Three Output Formats 1. Census microdata with attached characteristics describing land use, land cover, and climate for local areas 2. Aggregate data for administrative districts with tabulated population data and environmental characteristics 3. Gridded data with characteristics of population and environment
  • 25. TerraPop Prototype Data Transformations Input Formats Output Formats Microdata Microdata Areal data Areal data Raster data Raster data
  • 26. Analysis tool needed for microdata conversion Input Formats Output Formats Microdata Microdata Areal data Areal data Raster data Raster data
  • 27. TerraPop Data Integration Input Formats Output Formats Microdata Microdata with characteristics of surrounding area Area-level Area-level with summaries of data microdata and raster data Raster data with gridded Raster data representations of microdata and area-level data
  • 28. Integration – Microdata Output Census microdata with attached characteristics describing land use, land cover, and climate for local areas Individuals and households with their environmental and social context
  • 29. Integration – Area-Level Output Aggregate data for administrative districts with tabulated population data and environmental characteristics Mean Ann. Max. Ann. Rent, Rent, Own, Own, Vacant, Vacant, County ID Temp. Precip. Rural Urban Rural Urban Rural Urban G17003100001 21.2 768 3129 1063 637 365 34 33 G17003100002 23.4 589 2949 1075 1469 717 0 0 G17003100003 24.3 867 3418 1589 1108 617 0 0 G17003100004 21.5 943 1882 425 202 142 123 0 G17003100005 24.1 867 2416 572 426 197 189 0 G17003100006 24.4 697 2560 934 950 563 220 14 G17003100007 25.6 701 2126 653 321 215 209 46
  • 30. Integration – Raster Output Gridded data with characteristics of population and environment Raster format compatible with environmental models
  • 31. Data Access System Browse and select variables
  • 32. Data Access System Browse and select variables
  • 33. Data Access System Choose output format
  • 34. Data Access System Choose output format
  • 35. Data Access System Select data transformation options
  • 36. TerraPop Prototype  Data to be included  Population microdata for Brazil (1960-2000) and Malawi (1998 & 2008)  Aggregate population data at first and second administrative levels for Brazil and Malawi  Land cover, agricultural land use, and climate data  Timeline  Available for beta testing: May 2013  Initial public version available by the end of 2013
  • 37. 3. Education and Outreach Engage the scientific community and the public
  • 38. Education and Outreach for the Research Community  Curriculum of web-based training  Workshops at conferences  User support  Community tools to promote user engagement
  • 39. Public Education and Outreach  Partner with educational software developers  Fathom  Integration with museum programs  Science on a Sphere
  • 40. 4. Organizational Development Develop structures to ensure long-run financial and technical sustainability.
  • 41. Sustainability Create a sustainable organization that can guarantee preservation and access over multiple decades  Organizational sustainability  Financial sustainability  Technological sustainability
  • 42. agriculture demography transportation criminology hazards Population Climate Terra pollution Populus Land Use Land Cover health economics politics bio- diversity hydrology