SlideShare ist ein Scribd-Unternehmen logo
1 von 34
GBIF Nodes training– Berlin, 04-05 october 2013
Promoting data use III :
Most frequent data analysis
techniques
Anne-Sophie Archambeau (archambeau@gbif.fr)
GBIF France
04.10.13
1. INTRODUCTION: Some basic concepts
of data analysis and species distribution
modeling.
2. TECHNIQUES: DOMAIN, GARP, MaxEnt...
3. ORGANIZING TRAINING: Workshops and
events about ecological niche modelling.
4. RESOURCES
04.10.13
1. INTRODUCTION: Some basic concepts
of data analysis and species distribution
modeling.
2. TECHNIQUES: DOMAIN, GARP, MaxEnt...
3. ORGANIZING TRAINING: Workshops and
events about ecological niche modelling.
4. RESOURCES
04.10.13
Data analyses and modelling :
what for?
4
Guisan & Thuiller (2005) Ecology Letters 8: 993-1009
Type of use References
1. Quantifying the environmental niche of species Austin et al. 1990; Vetaas 2002
2. Testing biogeographical, ecological and evolutionary hypotheses
(e.g. in phylogeographical)
Leathwick 1998; Anderson et al. 2002;
Graham et al. 2004b, Hugall et al. 2005
3. Assessing species invasion and proliferation Beerling et al. 1995; Peterson 2003
4. Assessing the impact of climate, land use and other
environmental changes on species’ distributions
Thomas et al. 2004; Thuiller 2004
5. Suggesting unsurveyed sites of high potential of occurrence for
rare species
Elith & Burgman 2002; Raxworthy et al.
2003; Engler et al. 2004
6. Supporting appropriate management plans for species recovery
and mapping suitable sites for species’ reintroduction
Pearce & Lindenmayer 1998
7. Supporting conservation planning and reserve selection Ferrier 2002; Araújo et al. 2004
8. Modelling species’ assemblages (biodiversity, composition) from
individual species' predictions
Leathwick et al. 1996; Guisan & Theurillat
2000; Ferrier et al. 2002
9. Building bio- or ecogeographic regions None; but see Kreft & Jetz 2010
10. Improving patch delineation and ecological distance in meta-
population models
Keith et al. 2009, Anderson et al. 2009
A key concept: the environmental niche
G. Evelyn Hutchinson (1957)
Temperature
Water
Hutchinson: species' requirement (environmental niche)
Fundamental
Niche
Realized
Niche
~ ensemble of a species’ suitable habitats
Where can we find a species?
?
E.g. Otter in Europe
Cianfrani et al. (in review)
observed
predicted
1. To test an hypothesis
2. To describe (quantify) the relationship between
a response variable (y) and one or several
explanatory variables (or predictor variables; xi)
y = ƒ(Xi)
3. To predict the likely value of response variable
from values of Xi :
- for another time period (temporal model)
- for another region (spatial model)
Models: why and what for ?
Guisan et al. 2002 Ecol. Modelling
 
+ +++++ -
Guisan et al. (2002) Ecol. Model. | Guisan et al. (2006) J. Appl. Ecol.
Potential
distribution of
the species
Field
data
Environmental
variables
(precipitation,
geology,
topography, water
distribution…..)
Fitting the niche
Spatial
predictions
Data
collection
Statistical
modelling
Response
curves
presence
absence
Temperature
Water
Light
Fundamental
Niche
Realized
Niche
Realized
Niche
Realized
Niche
Principle of species distribution modelling
presence-absence presence
abundance abundance abundance abundance*
1. 3.
most frequent case with data from
natural history collections
4.
Which species data?
2.
most frequent case with data from
vegetation surveys
GLM/GAM (Gaussian,
Poisson), RTREE, GBM...
GLM/GAM (Binomial), RF, BRT, CTREE,
GBM, Almost all models ...
Specific methods: BIOCLIM, ENFA...
GLM/GAM (Gaussian,
Poisson), RTREE, ... 4.2.
pseudo-
absences
Guisan et al
Presence-only
• Much of the occurrence data from herbaria and
museum specimen collections and data that are
made available in GBIF are of the so-called
“presence-only” category.
• To deal with the lack of accurate and reliable
absence data, new modeling methods have been
developed. The latter are based on only presence
data to predict the species distribution and
extrapolate local observation, across the study
site, in function of eco-geographical variables
(Hirzel and Guisan 2002; Elith et al 2006. Dudik
Phillips and 2008. Elith et al 2010),
Pseudo-absence
• Many of the new methods developed to analyze
presence-only data address the lack of absence data
by:
– Creating pseudo-absences, e.g. randomly sample an equal
number “absence” points by different strategies.
– Analyzing (all) background points as representatives of
unsuitable environments.
• Other methods can model what is characteristic of the
sites of recorded species occurrence without looking at
sites where the species is assumed absent.
– E.g. rule-based, principal component, factorial, clustering or
machine-learning methods can be used.
Data quality issues
• Notice also disturbed areas that can be
environmentally suitable for the species even if no
species occurrences are found here.
• Low or variable detectability of the species can
provide similar problems.
• Sample bias often provide another fundamental
problem of species occurrence data where some areas
in the landscape are sampled more intensively than
others (high density of ecologists and other biologists,
accessibility by car etc…).
Data cleaning -
importance of data input
A. Nomenclatural and Taxonomic Error
• Identification certainty (synonyms)
• Spelling of names
- Scientific names
- Common names
- Infraspecific rank
- Cultivars and Hybrids
- Unpublished Names
- Author names
- Collector’s names
B. Spatial Data
• Data Entry
• Georeferencing
C. Descriptive Data
D. Documentation of Error
E. Visualisation of Error
04.10.13
1. INTRODUCTION: Some basic concepts
of data analysis and species distribution
modeling.
2. TECHNIQUES: DOMAIN, GARP, MaxEnt...
3. ORGANIZING TRAINING: Workshops and
events about ecological niche modelling.
4. RESOURCES
Domain (gower metric)
• Uses a similarity metric to predict suitability based on
the minimum distance in environmental space to any
presence record.
• Continuous predictions
(threshold required for binary)
• Does not account for potential
interactions between variables
• Gives equal weight to all variables
DOMAIN
See: Carpenter et al. 1993 Biodiv. Conservation 2: 667-680.
Freeware: http://www.cifor.cgiar.org/docs/_ref/research_tools/domain/
(min distance)
GARP (1999, 2002)
16
• Genetic Algorithm for Rule-set
Production (GARP).
• Originally released as “GARP
algorithm” around 1999.
• “Desktop GARP” software released
around 2002 by the University of
Kansas and CRIA in Brazil.
• http://www.nhm.ku.edu/desktopgarp
/
Maxent (2004)
17
• Maxent Java SDM software released in 2004.
• Well suited and with high performance for presence-only data.
• By default Maxent randomly samples 10,000 background points.
• Maxent currently has six feature classes: linear, product,
quadratic, hinge, threshold and categorical.
• It is common to mask the study area – i.e. setting no-data values
outside the area of interest.
• Assumption: Maxent relies on an unbiased sample.
– One fix is to provide background data of similar bias.
• Assumption: environment layers have grid cells of equal area.
– In un-projected latitude-longitude-degree data, grids cells to the north
and south of the equator have smaller area.
– On fix could be to re-project to an equal-area-projection.
• Available at:
http://www.cs.princeton.edu/~schapire/maxent/
Data Modeling methods
• Parallel Factor Analysis (PARAFAC) (Multi-way)
• Multi-linear Partial Least Squares (N-PLS) (Multi-way)
• Soft Independent Modeling of Class Analogy (SIMCA)
• k-Nearest Neighbor (kNN)
• Partial Least Squares Discriminant Analysis (PLS-DA)
• Linear Discriminant Analysis (LDA)
• Principal component logistic regression (PCLR)
• Generalized Partial Least Squares (GPLS)
• Random Forests (RF)
• Neural Networks (NN)
• Support Vector Machines (SVM)
• Boosted Regression Trees (BRT)
• Multivariate Regression Trees (MRT)
• Bayesian Regression Trees
• MARS (Multivariate adaptive regression splines)
• Classification-like models (CART, MDA) 18
Model performance
19
• The relatively large number of species
distribution (SDM) modeling methods and
software implementations lead to studies
comparing SDM method performances.
• The high score of Maxent observed by Elith
et al (2006) contributed to the increased
popularity of this method.
Elith*, J., H. Graham*, C., P. Anderson, R., Dudík, M., Ferrier, S., Guisan, A., J. Hijmans, R.,
Huettmann, F., R. Leathwick, J., Lehmann, A., Li, J., G. Lohmann, L., A. Loiselle, B., Manion, G.,
Moritz, C., Nakamura, M., Nakazawa, Y., McC. M. Overton, J., Townsend Peterson, A., J. Phillips,
S., Richardson, K., Scachetti-Pereira, R., E. Schapire, R., Soberón, J., Williams, S., S. Wisz, M.
and E. Zimmermann, N. (2006), Novel methods improve prediction of species’ distributions from
occurrence data. Ecography, 29: 129–151. doi: 10.1111/j.2006.0906-7590.04596.x
Data analysis: Software
20
• Many generic and many
specialized software tools exists to
assist you in analyzing data.
• One of the most popular is R
programming language.
DIVA-GIS
A visual background map can often be useful for plotting your
locations.
Relevant links and data at DIVA-GIS website (country level,
global level, global climate, species occurrence); near global 90-
meter resolution elevation data, high-resolution satellite images
(LandSat), www.diva-gis.org/Data
The DIVA-GIS project provides a useful collection of country
based vector data on borders, roads, water bodies, place
names, etc., www.diva-gis.org/Data
21
22
• Thullier W., Georges D., and Engler R. (2013).
Biomod2: Ensemble platform for species
distribution modeling [biomod2 R package].
Available at http://cran.r-
project.org/web/packages/biomod2/index.html
• Thullier W., Georges D., and Engler R. (2013).
[BIOMOD R Package]. Available at https://r-
forge.r-project.org/projects/biomod/
• Thuiller W., Lafourcade B., Engler R. & Araujo
M.B. (2009). BIOMOD – A platform for ensemble
forecasting of species distributions. Ecography,
32, 369-373.
• BIOMOD released around 2008, biomod2
released in 2012. See also: http://www.will.chez-
alice.fr/Software.html
04.10.13
1. INTRODUCTION: Some basic concepts
of data analysis and species distribution
modeling.
2. TECHNIQUES: DOMAIN, GARP, MaxEnt...
3. ORGANIZING TRAINING: Workshops and
events about ecological niche modelling.
4. RESOURCES
04.10.13
- Look at what has already been done
- Clearly define the prerequisites :
• Level of the participants : basic or advanced
• field of research
• knowledge of software
- Test the motivation of candidates, letter or
recommendation (increasing the potential for
dissemination after the training).
- Diversity of representations if possible.
How to proceed ?
04.10.13
- Give access to the presentations and exercises to
participants a few weeks before the beginning of the
training to familiarize them with the concepts covered.
- Prepare some datasets (local or accessible online – be
careful with technical issues such as internet problems)
for the practical part and / or ask participants to bring
their own data set (best solution) .
- Ask participants to bring their own laptop if possible
- Start training with individual presentation of the
training team + presentation of each participant.
How to proceed ?
04.10.13
- Clearly define the context and history of the software /
workflow presented:
scientific context , implementation , disciplines,
future perspectives
- Detailed presentation of the tool and its
functionalities
- On line test : access/download data , statistical
analysis, workflow ...
- Exercises on the tool and on each of its feature : first
test with the data made ​​available to participants and
with their own data to a concrete implementation .
How to proceed ?
04.10.13
- breaks ! Time to discuss on the tool and their own
research topics → opportunity for networking and
cooperation
- End of training:
- conclusion on the effectiveness of the tool,
- concrete applications (links with other software /
workflows, other uses ... )
- If relevant, present the results of exercises by the
participants or teams of participants
After training : give access to all documents , urls...
List of participants to share experiments
How to proceed ?
04.10.13
GBIF web site/training
http://uat.gbif.org/resources/2691
04.10.13
Data Quality, Data Cleaning : Problems, Tools, and approaches
by Arthur D. Chapman.
04.10.13
04.10.13
=> two-day hands-on training event. 10-15 advanced researchers
http://www.biovel.eu/index.php/events/training-events/23-events/training-events/142-using-biovel-
workflows-for-enm-studies
Application deadline: 18 October, 2013 => a month and a half before
Day 1 : preparation of data using BioVeL. general demonstration of the
tool and exercise with a provided data set. In the afternoon, participants
will work with their own data set and/or data from public sources.
Day 2 : demonstrate the ENM workflows, practice with a provided data
set. Model testing, Statistical analysis of GIS data, Invasive and
endangered species distribution modelling
Afternoon : practice with datasets prepared on the first day
04.10.13
1. INTRODUCTION: Some basic concepts
of data analysis and species distribution
modeling.
2. TECHNIQUES: DOMAIN, GARP, MaxEnt...
3. ORGANIZING TRAINING: Workshops and
events about ecological niche modelling.
4. RESOURCES
04.10.13
Resources
• Peterson, A.T., J. Soberón, R.G. Pearson, R.P. Anderson, E.
Martínez-Meyer, M. Nakamura, and M.B. Araújo (2011). Ecological
niches and geographic distributions. Monographs in population
biology 49. Princeton University Press. ISBN: 97806911136882.
• Franklin, J. (2010). Mapping Species Distributions: Spatial Inference
and Prediction. Cambridge University Press. ISBN: 9780521700023.
04.10.13
Resources
• Scheldeman, Xavier and van Zonneveld, Maarten. 2010. Training Manual
on Spatial Analysis of Plant Diversity and Distribution. Bioversity
International, Rome, Italy. ISBN 978-92-9043-880-9. Available in English,
French and Spanish at http://www.gbif.org/orc/?doc_id=4928.
• Hijmans, R.J. and J. Elith (2013). Species distribution modeling with R.
[Dismo vignette manual]. Available at http://cran.r-
project.org/web/packages/dismo/vignettes/sdm.pdf.

Weitere ähnliche Inhalte

Was ist angesagt?

AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)
AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)
AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)
Blaise Pezold
 
715BI_PostscriptsReprint
715BI_PostscriptsReprint715BI_PostscriptsReprint
715BI_PostscriptsReprint
Caren Les
 

Was ist angesagt? (20)

Frontiers in Distributional Ecology
Frontiers in Distributional EcologyFrontiers in Distributional Ecology
Frontiers in Distributional Ecology
 
Niche comparisons 201606 para curso Lichos
Niche comparisons 201606 para curso LichosNiche comparisons 201606 para curso Lichos
Niche comparisons 201606 para curso Lichos
 
Curso Lichos - MOP and (separately) Niche conservatism 201606
Curso Lichos - MOP and (separately) Niche conservatism 201606Curso Lichos - MOP and (separately) Niche conservatism 201606
Curso Lichos - MOP and (separately) Niche conservatism 201606
 
BaltezarMangroves2016Final
BaltezarMangroves2016FinalBaltezarMangroves2016Final
BaltezarMangroves2016Final
 
Rnglnd cnd0120806 folker
Rnglnd cnd0120806 folkerRnglnd cnd0120806 folker
Rnglnd cnd0120806 folker
 
Providence_MSc defense University of Twente
Providence_MSc defense University of TwenteProvidence_MSc defense University of Twente
Providence_MSc defense University of Twente
 
AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)
AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)
AGU2012_SSC_LA_Eco_Poster_RRR-BB (Develop)
 
Linear regression
Linear regressionLinear regression
Linear regression
 
GIS as a Land Planning Resource
GIS as a Land Planning ResourceGIS as a Land Planning Resource
GIS as a Land Planning Resource
 
Filazzola NSTP Presentation 2014
Filazzola NSTP Presentation 2014Filazzola NSTP Presentation 2014
Filazzola NSTP Presentation 2014
 
diss_shuber
diss_shuberdiss_shuber
diss_shuber
 
Mound_paper
Mound_paperMound_paper
Mound_paper
 
715BI_PostscriptsReprint
715BI_PostscriptsReprint715BI_PostscriptsReprint
715BI_PostscriptsReprint
 
Cushman parsimony in landscape metrics strength, universality
Cushman parsimony in landscape metrics strength, universalityCushman parsimony in landscape metrics strength, universality
Cushman parsimony in landscape metrics strength, universality
 
Genotype x environment interaction analysis of tef grown in southern ethiopia...
Genotype x environment interaction analysis of tef grown in southern ethiopia...Genotype x environment interaction analysis of tef grown in southern ethiopia...
Genotype x environment interaction analysis of tef grown in southern ethiopia...
 
Stability analysis
Stability analysisStability analysis
Stability analysis
 
Stability analysis and G*E interactions in plants
Stability analysis and G*E interactions in plantsStability analysis and G*E interactions in plants
Stability analysis and G*E interactions in plants
 
Large genotypes by environmental interaction (G x E) for yield
Large genotypes by environmental interaction (G x E) for yieldLarge genotypes by environmental interaction (G x E) for yield
Large genotypes by environmental interaction (G x E) for yield
 
Soil spectroscopy in the africa soil information service: Getting the best ou...
Soil spectroscopy in the africa soil information service: Getting the best ou...Soil spectroscopy in the africa soil information service: Getting the best ou...
Soil spectroscopy in the africa soil information service: Getting the best ou...
 
Modeling GHG emissions and carbon sequestration
Modeling GHG emissions and carbon sequestrationModeling GHG emissions and carbon sequestration
Modeling GHG emissions and carbon sequestration
 

Andere mochten auch (9)

Resolution on Methodological scheme of the report on macroeconomic forecasts ...
Resolution on Methodological scheme of the report on macroeconomic forecasts ...Resolution on Methodological scheme of the report on macroeconomic forecasts ...
Resolution on Methodological scheme of the report on macroeconomic forecasts ...
 
UCA: Data Analysis Techniques
UCA: Data Analysis TechniquesUCA: Data Analysis Techniques
UCA: Data Analysis Techniques
 
060 techniques of_data_analysis
060 techniques of_data_analysis060 techniques of_data_analysis
060 techniques of_data_analysis
 
Qualitative Data Analysis (Steps)
Qualitative Data Analysis (Steps)Qualitative Data Analysis (Steps)
Qualitative Data Analysis (Steps)
 
060 techniques of_data_analysis
060 techniques of_data_analysis060 techniques of_data_analysis
060 techniques of_data_analysis
 
Data analysis powerpoint
Data analysis powerpointData analysis powerpoint
Data analysis powerpoint
 
Chapter 10-DATA ANALYSIS & PRESENTATION
Chapter 10-DATA ANALYSIS & PRESENTATIONChapter 10-DATA ANALYSIS & PRESENTATION
Chapter 10-DATA ANALYSIS & PRESENTATION
 
Quantitative Data Analysis
Quantitative Data AnalysisQuantitative Data Analysis
Quantitative Data Analysis
 
Qualitative data analysis
Qualitative data analysisQualitative data analysis
Qualitative data analysis
 

Ähnlich wie Module 5 - EN - Promoting data use III: Most frequent data analysis techniques

Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square
IJECEIAES
 
DENSA:An effective negative selection algorithm with flexible boundaries for ...
DENSA:An effective negative selection algorithm with flexible boundaries for ...DENSA:An effective negative selection algorithm with flexible boundaries for ...
DENSA:An effective negative selection algorithm with flexible boundaries for ...
Mario Pavone
 

Ähnlich wie Module 5 - EN - Promoting data use III: Most frequent data analysis techniques (20)

Modeling present and prospective distribution of Phyteuma genus in Carpathian...
Modeling present and prospective distribution of Phyteuma genus in Carpathian...Modeling present and prospective distribution of Phyteuma genus in Carpathian...
Modeling present and prospective distribution of Phyteuma genus in Carpathian...
 
GB20 Nodes Training Course 2013, module 5B: Latest trends in data analysis
GB20 Nodes Training Course 2013, module 5B: Latest trends in data analysisGB20 Nodes Training Course 2013, module 5B: Latest trends in data analysis
GB20 Nodes Training Course 2013, module 5B: Latest trends in data analysis
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 
RPG iEvoBio 2010 Keynote
RPG iEvoBio 2010 KeynoteRPG iEvoBio 2010 Keynote
RPG iEvoBio 2010 Keynote
 
iEvoBio Keynote Talk 2010
iEvoBio Keynote Talk 2010iEvoBio Keynote Talk 2010
iEvoBio Keynote Talk 2010
 
Ecosystem science requirements for uas remote sensing
Ecosystem science requirements for uas remote sensing Ecosystem science requirements for uas remote sensing
Ecosystem science requirements for uas remote sensing
 
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
TERN eMAST : Observations and terrestrial ecosystem models : Terrestrial Ecos...
 
Perth ausplots presentation_070616_internet_qu
Perth ausplots presentation_070616_internet_quPerth ausplots presentation_070616_internet_qu
Perth ausplots presentation_070616_internet_qu
 
Topological Data Analysis (TDA) for volumetric X-ray CT data
Topological Data Analysis (TDA) for volumetric X-ray CT dataTopological Data Analysis (TDA) for volumetric X-ray CT data
Topological Data Analysis (TDA) for volumetric X-ray CT data
 
Esri and the Scientific Community
Esri and the Scientific CommunityEsri and the Scientific Community
Esri and the Scientific Community
 
Statistics for Geography and Environmental Science: an introductory lecture c...
Statistics for Geography and Environmental Science:an introductory lecture c...Statistics for Geography and Environmental Science:an introductory lecture c...
Statistics for Geography and Environmental Science: an introductory lecture c...
 
Open Science and Ecological meta-anlaysis
Open Science and Ecological meta-anlaysisOpen Science and Ecological meta-anlaysis
Open Science and Ecological meta-anlaysis
 
Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square Feature Selection Approach based on Firefly Algorithm and Chi-square
Feature Selection Approach based on Firefly Algorithm and Chi-square
 
Morton presentation6
Morton presentation6Morton presentation6
Morton presentation6
 
Bayesian network-based predictive analytics applied to invasive species distr...
Bayesian network-based predictive analytics applied to invasive species distr...Bayesian network-based predictive analytics applied to invasive species distr...
Bayesian network-based predictive analytics applied to invasive species distr...
 
Lehnert_EGU201_SampleMetadataStandards
Lehnert_EGU201_SampleMetadataStandardsLehnert_EGU201_SampleMetadataStandards
Lehnert_EGU201_SampleMetadataStandards
 
DENSA:An effective negative selection algorithm with flexible boundaries for ...
DENSA:An effective negative selection algorithm with flexible boundaries for ...DENSA:An effective negative selection algorithm with flexible boundaries for ...
DENSA:An effective negative selection algorithm with flexible boundaries for ...
 
Using drone data in modelling:A case study applying the BCCVL
Using drone data in modelling:A case study applying the BCCVLUsing drone data in modelling:A case study applying the BCCVL
Using drone data in modelling:A case study applying the BCCVL
 
6 x 9 = 42
6 x 9 = 426 x 9 = 42
6 x 9 = 42
 
Presentation1 - Basis of application of Ecogeography in PGR
Presentation1 - Basis of application of Ecogeography in PGRPresentation1 - Basis of application of Ecogeography in PGR
Presentation1 - Basis of application of Ecogeography in PGR
 

Mehr von Alberto González-Talaván

Mehr von Alberto González-Talaván (20)

BID CE Workshop 1 - Activity X.01 - Wrap-up and Evaluation
BID CE Workshop 1 - Activity X.01 - Wrap-up and EvaluationBID CE Workshop 1 - Activity X.01 - Wrap-up and Evaluation
BID CE Workshop 1 - Activity X.01 - Wrap-up and Evaluation
 
Bid ce workshop 1 Activity V.01 - Planning a biodiversity data mobilization...
Bid ce workshop 1   Activity V.01 - Planning a biodiversity data mobilization...Bid ce workshop 1   Activity V.01 - Planning a biodiversity data mobilization...
Bid ce workshop 1 Activity V.01 - Planning a biodiversity data mobilization...
 
BID CE Workshop 1 - Activity IV.02 - BID Community
BID CE Workshop 1 - Activity IV.02 - BID CommunityBID CE Workshop 1 - Activity IV.02 - BID Community
BID CE Workshop 1 - Activity IV.02 - BID Community
 
Bid CE Workshop 1 - ONLINE VERSION - Activity 01 - Welcome and Introduction
Bid CE Workshop 1 - ONLINE VERSION - Activity 01 - Welcome and IntroductionBid CE Workshop 1 - ONLINE VERSION - Activity 01 - Welcome and Introduction
Bid CE Workshop 1 - ONLINE VERSION - Activity 01 - Welcome and Introduction
 
BID CE Workshop 1 - Session 13 - Advanced Biodiversity Data Publishing
BID CE Workshop 1 -  Session 13 - Advanced Biodiversity Data PublishingBID CE Workshop 1 -  Session 13 - Advanced Biodiversity Data Publishing
BID CE Workshop 1 - Session 13 - Advanced Biodiversity Data Publishing
 
BID CE Workshop 1 - session 11 - Basic concepts about biodiversity data quality
BID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data qualityBID CE Workshop 1 -  session 11 - Basic concepts about biodiversity data quality
BID CE Workshop 1 - session 11 - Basic concepts about biodiversity data quality
 
BID CE Workshop 1 Session 09 - Biodiversity Data Management Tools
BID CE Workshop 1   Session 09 - Biodiversity Data Management ToolsBID CE Workshop 1   Session 09 - Biodiversity Data Management Tools
BID CE Workshop 1 Session 09 - Biodiversity Data Management Tools
 
BID CE Workshop 1 session 07 - Digitization Software Example - BIOTA
BID CE Workshop 1   session 07 - Digitization Software Example - BIOTABID CE Workshop 1   session 07 - Digitization Software Example - BIOTA
BID CE Workshop 1 session 07 - Digitization Software Example - BIOTA
 
BID CE Workshop 1 - session 12 - Basic use of the GBIF IPT
BID CE Workshop 1 -  session 12 - Basic use of the GBIF IPTBID CE Workshop 1 -  session 12 - Basic use of the GBIF IPT
BID CE Workshop 1 - session 12 - Basic use of the GBIF IPT
 
Bid CE Workshop 1 session 06 - Data quality during digitization
Bid CE Workshop 1   session 06 - Data quality during digitizationBid CE Workshop 1   session 06 - Data quality during digitization
Bid CE Workshop 1 session 06 - Data quality during digitization
 
BID CE Workshop 1 Session 15 - Wrap-up and Evaluation
BID CE Workshop 1   Session 15 - Wrap-up and EvaluationBID CE Workshop 1   Session 15 - Wrap-up and Evaluation
BID CE Workshop 1 Session 15 - Wrap-up and Evaluation
 
BID CE Workshop 1 session 02 - Foundations for the Workshop
BID CE Workshop 1   session 02 - Foundations for the WorkshopBID CE Workshop 1   session 02 - Foundations for the Workshop
BID CE Workshop 1 session 02 - Foundations for the Workshop
 
BID CE workshop 1 session 05 - Origins of Biodiversity Data
BID CE workshop 1   session 05  - Origins of Biodiversity DataBID CE workshop 1   session 05  - Origins of Biodiversity Data
BID CE workshop 1 session 05 - Origins of Biodiversity Data
 
BID CE workshop 1 session 03 - Data mobilization planning
BID CE workshop 1   session 03 - Data mobilization planningBID CE workshop 1   session 03 - Data mobilization planning
BID CE workshop 1 session 03 - Data mobilization planning
 
BID CE workshop 1 session 08 - Biodiversity Data Cleaning
BID CE workshop 1   session 08 - Biodiversity Data CleaningBID CE workshop 1   session 08 - Biodiversity Data Cleaning
BID CE workshop 1 session 08 - Biodiversity Data Cleaning
 
BID CE workshop 1 session 10 - presentation - Open Refine
BID CE workshop 1   session 10 - presentation - Open RefineBID CE workshop 1   session 10 - presentation - Open Refine
BID CE workshop 1 session 10 - presentation - Open Refine
 
BID CE WORKSHOP 1 - Session 01 - Introduction
BID CE WORKSHOP 1 -  Session 01 - IntroductionBID CE WORKSHOP 1 -  Session 01 - Introduction
BID CE WORKSHOP 1 - Session 01 - Introduction
 
GBIF Work Programme 2016 Update
GBIF Work Programme 2016 UpdateGBIF Work Programme 2016 Update
GBIF Work Programme 2016 Update
 
GBIF Pilot Experience Using Mozilla Open Badges
GBIF Pilot Experience Using Mozilla Open BadgesGBIF Pilot Experience Using Mozilla Open Badges
GBIF Pilot Experience Using Mozilla Open Badges
 
Séance 07. Démonstration de la publication des données d'échantillonnage. For...
Séance 07. Démonstration de la publication des données d'échantillonnage. For...Séance 07. Démonstration de la publication des données d'échantillonnage. For...
Séance 07. Démonstration de la publication des données d'échantillonnage. For...
 

Kürzlich hochgeladen

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Kürzlich hochgeladen (20)

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 

Module 5 - EN - Promoting data use III: Most frequent data analysis techniques

  • 1. GBIF Nodes training– Berlin, 04-05 october 2013 Promoting data use III : Most frequent data analysis techniques Anne-Sophie Archambeau (archambeau@gbif.fr) GBIF France
  • 2. 04.10.13 1. INTRODUCTION: Some basic concepts of data analysis and species distribution modeling. 2. TECHNIQUES: DOMAIN, GARP, MaxEnt... 3. ORGANIZING TRAINING: Workshops and events about ecological niche modelling. 4. RESOURCES
  • 3. 04.10.13 1. INTRODUCTION: Some basic concepts of data analysis and species distribution modeling. 2. TECHNIQUES: DOMAIN, GARP, MaxEnt... 3. ORGANIZING TRAINING: Workshops and events about ecological niche modelling. 4. RESOURCES
  • 4. 04.10.13 Data analyses and modelling : what for? 4 Guisan & Thuiller (2005) Ecology Letters 8: 993-1009 Type of use References 1. Quantifying the environmental niche of species Austin et al. 1990; Vetaas 2002 2. Testing biogeographical, ecological and evolutionary hypotheses (e.g. in phylogeographical) Leathwick 1998; Anderson et al. 2002; Graham et al. 2004b, Hugall et al. 2005 3. Assessing species invasion and proliferation Beerling et al. 1995; Peterson 2003 4. Assessing the impact of climate, land use and other environmental changes on species’ distributions Thomas et al. 2004; Thuiller 2004 5. Suggesting unsurveyed sites of high potential of occurrence for rare species Elith & Burgman 2002; Raxworthy et al. 2003; Engler et al. 2004 6. Supporting appropriate management plans for species recovery and mapping suitable sites for species’ reintroduction Pearce & Lindenmayer 1998 7. Supporting conservation planning and reserve selection Ferrier 2002; Araújo et al. 2004 8. Modelling species’ assemblages (biodiversity, composition) from individual species' predictions Leathwick et al. 1996; Guisan & Theurillat 2000; Ferrier et al. 2002 9. Building bio- or ecogeographic regions None; but see Kreft & Jetz 2010 10. Improving patch delineation and ecological distance in meta- population models Keith et al. 2009, Anderson et al. 2009
  • 5. A key concept: the environmental niche G. Evelyn Hutchinson (1957) Temperature Water Hutchinson: species' requirement (environmental niche) Fundamental Niche Realized Niche ~ ensemble of a species’ suitable habitats
  • 6. Where can we find a species? ? E.g. Otter in Europe Cianfrani et al. (in review) observed predicted
  • 7. 1. To test an hypothesis 2. To describe (quantify) the relationship between a response variable (y) and one or several explanatory variables (or predictor variables; xi) y = ƒ(Xi) 3. To predict the likely value of response variable from values of Xi : - for another time period (temporal model) - for another region (spatial model) Models: why and what for ? Guisan et al. 2002 Ecol. Modelling   + +++++ -
  • 8. Guisan et al. (2002) Ecol. Model. | Guisan et al. (2006) J. Appl. Ecol. Potential distribution of the species Field data Environmental variables (precipitation, geology, topography, water distribution…..) Fitting the niche Spatial predictions Data collection Statistical modelling Response curves presence absence Temperature Water Light Fundamental Niche Realized Niche Realized Niche Realized Niche Principle of species distribution modelling
  • 9. presence-absence presence abundance abundance abundance abundance* 1. 3. most frequent case with data from natural history collections 4. Which species data? 2. most frequent case with data from vegetation surveys GLM/GAM (Gaussian, Poisson), RTREE, GBM... GLM/GAM (Binomial), RF, BRT, CTREE, GBM, Almost all models ... Specific methods: BIOCLIM, ENFA... GLM/GAM (Gaussian, Poisson), RTREE, ... 4.2. pseudo- absences Guisan et al
  • 10. Presence-only • Much of the occurrence data from herbaria and museum specimen collections and data that are made available in GBIF are of the so-called “presence-only” category. • To deal with the lack of accurate and reliable absence data, new modeling methods have been developed. The latter are based on only presence data to predict the species distribution and extrapolate local observation, across the study site, in function of eco-geographical variables (Hirzel and Guisan 2002; Elith et al 2006. Dudik Phillips and 2008. Elith et al 2010),
  • 11. Pseudo-absence • Many of the new methods developed to analyze presence-only data address the lack of absence data by: – Creating pseudo-absences, e.g. randomly sample an equal number “absence” points by different strategies. – Analyzing (all) background points as representatives of unsuitable environments. • Other methods can model what is characteristic of the sites of recorded species occurrence without looking at sites where the species is assumed absent. – E.g. rule-based, principal component, factorial, clustering or machine-learning methods can be used.
  • 12. Data quality issues • Notice also disturbed areas that can be environmentally suitable for the species even if no species occurrences are found here. • Low or variable detectability of the species can provide similar problems. • Sample bias often provide another fundamental problem of species occurrence data where some areas in the landscape are sampled more intensively than others (high density of ecologists and other biologists, accessibility by car etc…).
  • 13. Data cleaning - importance of data input A. Nomenclatural and Taxonomic Error • Identification certainty (synonyms) • Spelling of names - Scientific names - Common names - Infraspecific rank - Cultivars and Hybrids - Unpublished Names - Author names - Collector’s names B. Spatial Data • Data Entry • Georeferencing C. Descriptive Data D. Documentation of Error E. Visualisation of Error
  • 14. 04.10.13 1. INTRODUCTION: Some basic concepts of data analysis and species distribution modeling. 2. TECHNIQUES: DOMAIN, GARP, MaxEnt... 3. ORGANIZING TRAINING: Workshops and events about ecological niche modelling. 4. RESOURCES
  • 15. Domain (gower metric) • Uses a similarity metric to predict suitability based on the minimum distance in environmental space to any presence record. • Continuous predictions (threshold required for binary) • Does not account for potential interactions between variables • Gives equal weight to all variables DOMAIN See: Carpenter et al. 1993 Biodiv. Conservation 2: 667-680. Freeware: http://www.cifor.cgiar.org/docs/_ref/research_tools/domain/ (min distance)
  • 16. GARP (1999, 2002) 16 • Genetic Algorithm for Rule-set Production (GARP). • Originally released as “GARP algorithm” around 1999. • “Desktop GARP” software released around 2002 by the University of Kansas and CRIA in Brazil. • http://www.nhm.ku.edu/desktopgarp /
  • 17. Maxent (2004) 17 • Maxent Java SDM software released in 2004. • Well suited and with high performance for presence-only data. • By default Maxent randomly samples 10,000 background points. • Maxent currently has six feature classes: linear, product, quadratic, hinge, threshold and categorical. • It is common to mask the study area – i.e. setting no-data values outside the area of interest. • Assumption: Maxent relies on an unbiased sample. – One fix is to provide background data of similar bias. • Assumption: environment layers have grid cells of equal area. – In un-projected latitude-longitude-degree data, grids cells to the north and south of the equator have smaller area. – On fix could be to re-project to an equal-area-projection. • Available at: http://www.cs.princeton.edu/~schapire/maxent/
  • 18. Data Modeling methods • Parallel Factor Analysis (PARAFAC) (Multi-way) • Multi-linear Partial Least Squares (N-PLS) (Multi-way) • Soft Independent Modeling of Class Analogy (SIMCA) • k-Nearest Neighbor (kNN) • Partial Least Squares Discriminant Analysis (PLS-DA) • Linear Discriminant Analysis (LDA) • Principal component logistic regression (PCLR) • Generalized Partial Least Squares (GPLS) • Random Forests (RF) • Neural Networks (NN) • Support Vector Machines (SVM) • Boosted Regression Trees (BRT) • Multivariate Regression Trees (MRT) • Bayesian Regression Trees • MARS (Multivariate adaptive regression splines) • Classification-like models (CART, MDA) 18
  • 19. Model performance 19 • The relatively large number of species distribution (SDM) modeling methods and software implementations lead to studies comparing SDM method performances. • The high score of Maxent observed by Elith et al (2006) contributed to the increased popularity of this method. Elith*, J., H. Graham*, C., P. Anderson, R., Dudík, M., Ferrier, S., Guisan, A., J. Hijmans, R., Huettmann, F., R. Leathwick, J., Lehmann, A., Li, J., G. Lohmann, L., A. Loiselle, B., Manion, G., Moritz, C., Nakamura, M., Nakazawa, Y., McC. M. Overton, J., Townsend Peterson, A., J. Phillips, S., Richardson, K., Scachetti-Pereira, R., E. Schapire, R., Soberón, J., Williams, S., S. Wisz, M. and E. Zimmermann, N. (2006), Novel methods improve prediction of species’ distributions from occurrence data. Ecography, 29: 129–151. doi: 10.1111/j.2006.0906-7590.04596.x
  • 20. Data analysis: Software 20 • Many generic and many specialized software tools exists to assist you in analyzing data. • One of the most popular is R programming language.
  • 21. DIVA-GIS A visual background map can often be useful for plotting your locations. Relevant links and data at DIVA-GIS website (country level, global level, global climate, species occurrence); near global 90- meter resolution elevation data, high-resolution satellite images (LandSat), www.diva-gis.org/Data The DIVA-GIS project provides a useful collection of country based vector data on borders, roads, water bodies, place names, etc., www.diva-gis.org/Data 21
  • 22. 22 • Thullier W., Georges D., and Engler R. (2013). Biomod2: Ensemble platform for species distribution modeling [biomod2 R package]. Available at http://cran.r- project.org/web/packages/biomod2/index.html • Thullier W., Georges D., and Engler R. (2013). [BIOMOD R Package]. Available at https://r- forge.r-project.org/projects/biomod/ • Thuiller W., Lafourcade B., Engler R. & Araujo M.B. (2009). BIOMOD – A platform for ensemble forecasting of species distributions. Ecography, 32, 369-373. • BIOMOD released around 2008, biomod2 released in 2012. See also: http://www.will.chez- alice.fr/Software.html
  • 23. 04.10.13 1. INTRODUCTION: Some basic concepts of data analysis and species distribution modeling. 2. TECHNIQUES: DOMAIN, GARP, MaxEnt... 3. ORGANIZING TRAINING: Workshops and events about ecological niche modelling. 4. RESOURCES
  • 24. 04.10.13 - Look at what has already been done - Clearly define the prerequisites : • Level of the participants : basic or advanced • field of research • knowledge of software - Test the motivation of candidates, letter or recommendation (increasing the potential for dissemination after the training). - Diversity of representations if possible. How to proceed ?
  • 25. 04.10.13 - Give access to the presentations and exercises to participants a few weeks before the beginning of the training to familiarize them with the concepts covered. - Prepare some datasets (local or accessible online – be careful with technical issues such as internet problems) for the practical part and / or ask participants to bring their own data set (best solution) . - Ask participants to bring their own laptop if possible - Start training with individual presentation of the training team + presentation of each participant. How to proceed ?
  • 26. 04.10.13 - Clearly define the context and history of the software / workflow presented: scientific context , implementation , disciplines, future perspectives - Detailed presentation of the tool and its functionalities - On line test : access/download data , statistical analysis, workflow ... - Exercises on the tool and on each of its feature : first test with the data made ​​available to participants and with their own data to a concrete implementation . How to proceed ?
  • 27. 04.10.13 - breaks ! Time to discuss on the tool and their own research topics → opportunity for networking and cooperation - End of training: - conclusion on the effectiveness of the tool, - concrete applications (links with other software / workflows, other uses ... ) - If relevant, present the results of exercises by the participants or teams of participants After training : give access to all documents , urls... List of participants to share experiments How to proceed ?
  • 29. 04.10.13 Data Quality, Data Cleaning : Problems, Tools, and approaches by Arthur D. Chapman.
  • 31. 04.10.13 => two-day hands-on training event. 10-15 advanced researchers http://www.biovel.eu/index.php/events/training-events/23-events/training-events/142-using-biovel- workflows-for-enm-studies Application deadline: 18 October, 2013 => a month and a half before Day 1 : preparation of data using BioVeL. general demonstration of the tool and exercise with a provided data set. In the afternoon, participants will work with their own data set and/or data from public sources. Day 2 : demonstrate the ENM workflows, practice with a provided data set. Model testing, Statistical analysis of GIS data, Invasive and endangered species distribution modelling Afternoon : practice with datasets prepared on the first day
  • 32. 04.10.13 1. INTRODUCTION: Some basic concepts of data analysis and species distribution modeling. 2. TECHNIQUES: DOMAIN, GARP, MaxEnt... 3. ORGANIZING TRAINING: Workshops and events about ecological niche modelling. 4. RESOURCES
  • 33. 04.10.13 Resources • Peterson, A.T., J. Soberón, R.G. Pearson, R.P. Anderson, E. Martínez-Meyer, M. Nakamura, and M.B. Araújo (2011). Ecological niches and geographic distributions. Monographs in population biology 49. Princeton University Press. ISBN: 97806911136882. • Franklin, J. (2010). Mapping Species Distributions: Spatial Inference and Prediction. Cambridge University Press. ISBN: 9780521700023.
  • 34. 04.10.13 Resources • Scheldeman, Xavier and van Zonneveld, Maarten. 2010. Training Manual on Spatial Analysis of Plant Diversity and Distribution. Bioversity International, Rome, Italy. ISBN 978-92-9043-880-9. Available in English, French and Spanish at http://www.gbif.org/orc/?doc_id=4928. • Hijmans, R.J. and J. Elith (2013). Species distribution modeling with R. [Dismo vignette manual]. Available at http://cran.r- project.org/web/packages/dismo/vignettes/sdm.pdf.

Hinweis der Redaktion

  1. * ANDERSON, R. P., LEW, D. & PETERSON, A. T. 2003. Evaluating predictive models of species' distributions: criteria for selecting optimal models. Ecological Modelling, v. 162, p. 211 232.* Townsend Peterson, A., Papeş, M. and Eaton, M. (2007), Transferability and model evaluation in ecological niche modeling: a comparison of GARP and Maxent. Ecography, 30: 550–560. doi: 10.1111/j.0906-7590.2007.05102.x* Stockwell D.R.B. and D. Peters 1999. The GARP Modeling System: problems and solutions to automated spatial prediction. International Journal of Geographical Information Science 13:2 143-158.
  2. Available at http://www.cs.princeton.edu/~schapire/maxent/* Elith, J., Phillips, S. J., Hastie, T., Dudík, M., Chee, Y. E. and Yates, C. J. (2011), A statistical explanation of MaxEnt for ecologists. Diversity and Distributions, 17: 43–57. doi: 10.1111/j.1472-4642.2010.00725.x* Elith*, J., H. Graham*, C., P. Anderson, R., Dudík, M., Ferrier, S., Guisan, A., J. Hijmans, R., Huettmann, F., R. Leathwick, J., Lehmann, A., Li, J., G. Lohmann, L., A. Loiselle, B., Manion, G., Moritz, C., Nakamura, M., Nakazawa, Y., McC. M. Overton, J., Townsend Peterson, A., J. Phillips, S., Richardson, K., Scachetti-Pereira, R., E. Schapire, R., Soberón, J., Williams, S., S. Wisz, M. and E. Zimmermann, N. (2006), Novel methods improve prediction of species’ distributions from occurrence data. Ecography, 29: 129–151. doi: 10.1111/j.2006.0906-7590.04596.x
  3. Biomod2: http://www.will.chez-alice.fr/Software.html R-Forge: http://r-forge.r-project.org/R/?group_id=302