SlideShare a Scribd company logo
1 of 13
Why R?
Jeffrey Stanton
Syracuse University
What is R?
• R is a statistics, data management, and
graphics platform
• R is open source, maintained and developed
by a community of developers.
• The R code repository, as well as compiled
binaries (ready-to-install software) available
at: http://cran.r-project.org
• R comprises a core program plus 1000s of
freely available add-in packages.
CRAN
So Why or Why Not R?
• Most popular statistics software (other than R)
and some of their audiences:
– SPSS: Social Scientists
– Stata: Social Scientists
– Mathematica/Matlab: Engineers, mathematicians,
computer scientists, and physicists
– Python/NumPy: Computer scientists, web developers
– SAS: Data intensive industries (e.g., financial services)
– Excel: All types of organizations

• R is more popular and used by a larger number of
analysts than each of these
http://r4stats.com/articles/popularity/
But. . .
• Statistics users like point and click
• R is command line oriented; there are GUIs that
can be loaded as add-on packages;
• R-Studio is a Integrated Development
Environment (IDE) for R, but more for code
development than statistical analysis
• R is free, but this also means that there is no
formal support mechanism; large organizations
often like to contract with a commercial provider
R-Studio
Command Line? Advantages?
• In social sciences there has been a lot of talk
lately about replication, the necessity of having
results that are reproducible
• In the world of “big data,” analysts want to
produce systems that are transparent, reliable,
and that maintain a chain of provenance for each
transformation that affects the data
• Looking at statistical analysis as a kind of
“programming” task (like the old days!) has
immense advantages
Look Out! Real Code!
# Read U.S. States shape data from census GIS data set
usShape <- readShapeSpatial("gz_2010_us_040_00_500k.shp")
# Attach the delta CPI data to the states
usShape@data$delta <- stateCPIdelta # Consumer price indices in this table
# This sets up break points for color designations.
# We want 20 gradations of color across all choropleths.
bfloor <- floor(min(usShape@data[,"delta"],na.rm=TRUE)*10)/10
bceil <- (ceiling(max(usShape@data[,"delta"],na.rm=TRUE)*10)/10) + 20
breaks <- seq(bfloor, bceil, 20)
# Attach the color cut points to the shape data
usShape@data$zCat <- cut(usShape@data[,"delta"],breaks,include.lowest=TRUE)
cutpoints <- levels(usShape@data$zCat) # For later use with the legend
Colorful!
Many Packages - CRAN Task View
ChemPhys
Econometrics
Environmetrics
ExperimentalDesign
Finance
Genetics
Graphics
HighPerformanceComputing
MachineLearning
MedicalImaging
MetaAnalysis
Multivariate
NaturalLanguageProcessing
Optimization
Pharmacokinetics
Phylogenetics
Psychometrics
ReproducibleResearch
SocialSciences
Spatial
Survival
TimeSeries
WebTechnologies

Chemometrics and Computational Physics
Computational Econometrics
Analysis of Ecological and Environmental Data
Design of Experiments (DoE) & Analysis of Experimental Data
Empirical Finance
Statistical Genetics
Graphic Displays & Dynamic Graphics & Graphic Devices & Visualization
High-Performance and Parallel Computing with R
Machine Learning & Statistical Learning
Medical Image Analysis
Meta-Analysis
Multivariate Statistics
Natural Language Processing
Optimization and Mathematical Programming
Analysis of Pharmacokinetic Data
Phylogenetics, Especially Comparative Methods
Psychometric Models and Methods
Reproducible Research
Statistics for the Social Sciences
Analysis of Spatial Data
Survival Analysis
Time Series Analysis
Web Technologies and Services
Why R?
• Free and open source
• Huge community of users, enormous
repository of working code examples, many
sources of online expertise/support
• Dizzying array of add-on packages for almost
any imaginable data application
• Encourages good data practice: coding a
reproducible chain of data transformations
Jsresearch.net

More Related Content

What's hot

What's hot (20)

R Programming
R ProgrammingR Programming
R Programming
 
R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-i
 
R program
R programR program
R program
 
Introduction to data analysis using R
Introduction to data analysis using RIntroduction to data analysis using R
Introduction to data analysis using R
 
A short tutorial on r
A short tutorial on rA short tutorial on r
A short tutorial on r
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming Fundamentals
 
LSESU a Taste of R Language Workshop
LSESU a Taste of R Language WorkshopLSESU a Taste of R Language Workshop
LSESU a Taste of R Language Workshop
 
R tutorial
R tutorialR tutorial
R tutorial
 
R programming
R programmingR programming
R programming
 
An Intoduction to R
An Intoduction to RAn Intoduction to R
An Intoduction to R
 
R programming
R programmingR programming
R programming
 
R programming
R programmingR programming
R programming
 
R programming presentation
R programming presentationR programming presentation
R programming presentation
 
Introduction to statistical software R
Introduction to statistical software RIntroduction to statistical software R
Introduction to statistical software R
 
Introduction to R
Introduction to RIntroduction to R
Introduction to R
 
1 R Tutorial Introduction
1 R Tutorial Introduction1 R Tutorial Introduction
1 R Tutorial Introduction
 
A brief introduction to 'R' statistical package
A brief introduction to 'R' statistical packageA brief introduction to 'R' statistical package
A brief introduction to 'R' statistical package
 
Introducing The R Software
Introducing The R Software  Introducing The R Software
Introducing The R Software
 
R language tutorial
R language tutorialR language tutorial
R language tutorial
 
A Workshop on R
A Workshop on RA Workshop on R
A Workshop on R
 

Viewers also liked

R Introduction
R IntroductionR Introduction
R Introduction
schamber
 

Viewers also liked (20)

Class ppt intro to r
Class ppt intro to rClass ppt intro to r
Class ppt intro to r
 
An Interactive Introduction To R (Programming Language For Statistics)
An Interactive Introduction To R (Programming Language For Statistics)An Interactive Introduction To R (Programming Language For Statistics)
An Interactive Introduction To R (Programming Language For Statistics)
 
R programming Basic & Advanced
R programming Basic & AdvancedR programming Basic & Advanced
R programming Basic & Advanced
 
Introduction to R Programming
Introduction to R ProgrammingIntroduction to R Programming
Introduction to R Programming
 
R learning by examples
R learning by examplesR learning by examples
R learning by examples
 
Introduction to R
Introduction to RIntroduction to R
Introduction to R
 
R Introduction
R IntroductionR Introduction
R Introduction
 
R programming language
R programming languageR programming language
R programming language
 
Statistics with R
Statistics with RStatistics with R
Statistics with R
 
Introduction to basic statistics
Introduction to basic statisticsIntroduction to basic statistics
Introduction to basic statistics
 
Data analysis with R
Data analysis with RData analysis with R
Data analysis with R
 
Step By Step Guide to Learn R
Step By Step Guide to Learn RStep By Step Guide to Learn R
Step By Step Guide to Learn R
 
Iris data analysis example in R
Iris data analysis example in RIris data analysis example in R
Iris data analysis example in R
 
R programming language: conceptual overview
R programming language: conceptual overviewR programming language: conceptual overview
R programming language: conceptual overview
 
Using the R Language in BI and Real Time Applications (useR 2015)
Using the R Language in BI and Real Time Applications (useR 2015)Using the R Language in BI and Real Time Applications (useR 2015)
Using the R Language in BI and Real Time Applications (useR 2015)
 
R Programming Features
R Programming FeaturesR Programming Features
R Programming Features
 
Seefeld stats r_bio
Seefeld stats r_bioSeefeld stats r_bio
Seefeld stats r_bio
 
Docopt, beautiful command-line options for R, user2014
Docopt, beautiful command-line options for R,  user2014Docopt, beautiful command-line options for R,  user2014
Docopt, beautiful command-line options for R, user2014
 
Extending and customizing ibm spss statistics with python, r, and .net (2)
Extending and customizing ibm spss statistics with python, r, and .net (2)Extending and customizing ibm spss statistics with python, r, and .net (2)
Extending and customizing ibm spss statistics with python, r, and .net (2)
 
R Statistics
R StatisticsR Statistics
R Statistics
 

Similar to Why R? A Brief Introduction to the Open Source Statistics Platform

Revolution Analytics
Revolution AnalyticsRevolution Analytics
Revolution Analytics
templedf
 
Sas Enterprise Guide A Revolutionary Tool
Sas Enterprise Guide A Revolutionary ToolSas Enterprise Guide A Revolutionary Tool
Sas Enterprise Guide A Revolutionary Tool
sysseminar
 

Similar to Why R? A Brief Introduction to the Open Source Statistics Platform (20)

An R primer for SQL folks
An R primer for SQL folksAn R primer for SQL folks
An R primer for SQL folks
 
Interactive SQL POC on Hadoop (Hive, Presto and Hive-on-Tez)
Interactive SQL POC on Hadoop (Hive, Presto and Hive-on-Tez)Interactive SQL POC on Hadoop (Hive, Presto and Hive-on-Tez)
Interactive SQL POC on Hadoop (Hive, Presto and Hive-on-Tez)
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data science
 
Revolution R Enterprise - Portland R User Group, November 2013
Revolution R Enterprise - Portland R User Group, November 2013Revolution R Enterprise - Portland R User Group, November 2013
Revolution R Enterprise - Portland R User Group, November 2013
 
CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners
 
Clio infra Collabs data analysis tools
Clio infra Collabs data analysis toolsClio infra Collabs data analysis tools
Clio infra Collabs data analysis tools
 
Overview of tools for data analysis and visualisation (2021)
Overview of tools for data analysis and visualisation (2021)Overview of tools for data analysis and visualisation (2021)
Overview of tools for data analysis and visualisation (2021)
 
ITCamp 2018 - Andrea Martorana Tusa - Failure prediction for manufacturing in...
ITCamp 2018 - Andrea Martorana Tusa - Failure prediction for manufacturing in...ITCamp 2018 - Andrea Martorana Tusa - Failure prediction for manufacturing in...
ITCamp 2018 - Andrea Martorana Tusa - Failure prediction for manufacturing in...
 
R programming & Machine Learning
R programming & Machine LearningR programming & Machine Learning
R programming & Machine Learning
 
Data Analytics with R and SQL Server
Data Analytics with R and SQL ServerData Analytics with R and SQL Server
Data Analytics with R and SQL Server
 
Statistical softwares
Statistical softwaresStatistical softwares
Statistical softwares
 
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
 
An introduction to R is a document useful
An introduction to R is a document usefulAn introduction to R is a document useful
An introduction to R is a document useful
 
What Is SAS | SAS Tutorial For Beginners | SAS Training | SAS Programming | E...
What Is SAS | SAS Tutorial For Beginners | SAS Training | SAS Programming | E...What Is SAS | SAS Tutorial For Beginners | SAS Training | SAS Programming | E...
What Is SAS | SAS Tutorial For Beginners | SAS Training | SAS Programming | E...
 
Revolution Analytics
Revolution AnalyticsRevolution Analytics
Revolution Analytics
 
Overview data analyis and visualisation tools 2020
Overview data analyis and visualisation tools 2020Overview data analyis and visualisation tools 2020
Overview data analyis and visualisation tools 2020
 
Sas Enterprise Guide A Revolutionary Tool
Sas Enterprise Guide A Revolutionary ToolSas Enterprise Guide A Revolutionary Tool
Sas Enterprise Guide A Revolutionary Tool
 
Syamali
SyamaliSyamali
Syamali
 
Graph Database Use Cases - StampedeCon 2015
Graph Database Use Cases - StampedeCon 2015Graph Database Use Cases - StampedeCon 2015
Graph Database Use Cases - StampedeCon 2015
 
Graph database Use Cases
Graph database Use CasesGraph database Use Cases
Graph database Use Cases
 

More from Syracuse University

Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale development
Syracuse University
 
Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)
Syracuse University
 
Carma internet research module: Future data collection
Carma internet research module: Future data collectionCarma internet research module: Future data collection
Carma internet research module: Future data collection
Syracuse University
 

More from Syracuse University (20)

Discovery informaticsstanton
Discovery informaticsstantonDiscovery informaticsstanton
Discovery informaticsstanton
 
Basic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University FacultyBasic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University Faculty
 
Chapter9 r studio2
Chapter9 r studio2Chapter9 r studio2
Chapter9 r studio2
 
Basic Overview of Data Mining
Basic Overview of Data MiningBasic Overview of Data Mining
Basic Overview of Data Mining
 
Strategic planning
Strategic planningStrategic planning
Strategic planning
 
Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale development
 
Carma internet research module getting started with question pro
Carma internet research module   getting started with question proCarma internet research module   getting started with question pro
Carma internet research module getting started with question pro
 
Carma internet research module visual design issues
Carma internet research module   visual design issuesCarma internet research module   visual design issues
Carma internet research module visual design issues
 
Siop impact of social media
Siop impact of social mediaSiop impact of social media
Siop impact of social media
 
Basic Graphics with R
Basic Graphics with RBasic Graphics with R
Basic Graphics with R
 
R-Studio Vs. Rcmdr
R-Studio Vs. RcmdrR-Studio Vs. Rcmdr
R-Studio Vs. Rcmdr
 
Getting Started with R
Getting Started with RGetting Started with R
Getting Started with R
 
Moving Data to and From R
Moving Data to and From RMoving Data to and From R
Moving Data to and From R
 
Introduction to Advance Analytics Course
Introduction to Advance Analytics CourseIntroduction to Advance Analytics Course
Introduction to Advance Analytics Course
 
Installing R and R-Studio
Installing R and R-StudioInstalling R and R-Studio
Installing R and R-Studio
 
Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Reducing Response Burden
Reducing Response BurdenReducing Response Burden
Reducing Response Burden
 
PACIS Survey Workshop
PACIS Survey WorkshopPACIS Survey Workshop
PACIS Survey Workshop
 
Carma internet research module: Future data collection
Carma internet research module: Future data collectionCarma internet research module: Future data collection
Carma internet research module: Future data collection
 

Recently uploaded

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
ssuserdda66b
 

Recently uploaded (20)

On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 

Why R? A Brief Introduction to the Open Source Statistics Platform

  • 2. What is R? • R is a statistics, data management, and graphics platform • R is open source, maintained and developed by a community of developers. • The R code repository, as well as compiled binaries (ready-to-install software) available at: http://cran.r-project.org • R comprises a core program plus 1000s of freely available add-in packages.
  • 4. So Why or Why Not R? • Most popular statistics software (other than R) and some of their audiences: – SPSS: Social Scientists – Stata: Social Scientists – Mathematica/Matlab: Engineers, mathematicians, computer scientists, and physicists – Python/NumPy: Computer scientists, web developers – SAS: Data intensive industries (e.g., financial services) – Excel: All types of organizations • R is more popular and used by a larger number of analysts than each of these
  • 6. But. . . • Statistics users like point and click • R is command line oriented; there are GUIs that can be loaded as add-on packages; • R-Studio is a Integrated Development Environment (IDE) for R, but more for code development than statistical analysis • R is free, but this also means that there is no formal support mechanism; large organizations often like to contract with a commercial provider
  • 8. Command Line? Advantages? • In social sciences there has been a lot of talk lately about replication, the necessity of having results that are reproducible • In the world of “big data,” analysts want to produce systems that are transparent, reliable, and that maintain a chain of provenance for each transformation that affects the data • Looking at statistical analysis as a kind of “programming” task (like the old days!) has immense advantages
  • 9. Look Out! Real Code! # Read U.S. States shape data from census GIS data set usShape <- readShapeSpatial("gz_2010_us_040_00_500k.shp") # Attach the delta CPI data to the states usShape@data$delta <- stateCPIdelta # Consumer price indices in this table # This sets up break points for color designations. # We want 20 gradations of color across all choropleths. bfloor <- floor(min(usShape@data[,"delta"],na.rm=TRUE)*10)/10 bceil <- (ceiling(max(usShape@data[,"delta"],na.rm=TRUE)*10)/10) + 20 breaks <- seq(bfloor, bceil, 20) # Attach the color cut points to the shape data usShape@data$zCat <- cut(usShape@data[,"delta"],breaks,include.lowest=TRUE) cutpoints <- levels(usShape@data$zCat) # For later use with the legend
  • 11. Many Packages - CRAN Task View ChemPhys Econometrics Environmetrics ExperimentalDesign Finance Genetics Graphics HighPerformanceComputing MachineLearning MedicalImaging MetaAnalysis Multivariate NaturalLanguageProcessing Optimization Pharmacokinetics Phylogenetics Psychometrics ReproducibleResearch SocialSciences Spatial Survival TimeSeries WebTechnologies Chemometrics and Computational Physics Computational Econometrics Analysis of Ecological and Environmental Data Design of Experiments (DoE) & Analysis of Experimental Data Empirical Finance Statistical Genetics Graphic Displays & Dynamic Graphics & Graphic Devices & Visualization High-Performance and Parallel Computing with R Machine Learning & Statistical Learning Medical Image Analysis Meta-Analysis Multivariate Statistics Natural Language Processing Optimization and Mathematical Programming Analysis of Pharmacokinetic Data Phylogenetics, Especially Comparative Methods Psychometric Models and Methods Reproducible Research Statistics for the Social Sciences Analysis of Spatial Data Survival Analysis Time Series Analysis Web Technologies and Services
  • 12. Why R? • Free and open source • Huge community of users, enormous repository of working code examples, many sources of online expertise/support • Dizzying array of add-on packages for almost any imaginable data application • Encourages good data practice: coding a reproducible chain of data transformations