A presentation on the history, design, and use of R. The talk will focus on companies that use and support R, use cases, where it is going, competitors, advantages and disadvantages, and resources to learn more about R. Speaker Bio
Joseph Kambourakis has been the Lead Data Science Instructor at EMC for over two years. He has taught in eight countries and been interviewed by Japanese and Saudi Arabian media about his expertise in Data Science. He holds a Bachelors in Electrical and Computer Engineering from Worcester Polytechnic Institute and an MBA from Bentley University with a concentration in Business Analytics.
2. Ground Rules
• Interrupt me
• These are all my opinions and
not of EMC or Big Data
Analytics, Discovery &
Visualization Meetup
• Slides will be available
13. What is
R is a free software environment for statistical
computing and graphics
A language plus a run-time environment with
graphics, a debugger, access to certain system
functions, and the ability to run programs stored
in script files
21. Open Source
• GNU General Public License
• Freedom 0: The freedom to run the program for any
purpose.
• Freedom 1: The freedom to study how the program
works, and change it to make it do what you wish.
• Freedom 2: The freedom to redistribute copies so you
can help your neighbor.
• Freedom 3: The freedom to improve the program, and
release your improvements (and modified versions in
general) to the public, so that the whole community
benefits.
• source: GNU.org
22. R Project
• The R Foundation is a not for profit organization working in
the public interest. It has been founded by the members of
the R Development Core Team in order to
– Provide support for the R project and other innovations in
statistical computing. We believe that R has become a mature
and valuable tool and we would like to ensure its continued
development and the development of future innovations in
software for statistical and computational research.
– Provide a reference point for individuals, institutions or
commercial enterprises that want to support or interact with
the R development community.
– Hold and administer the copyright of R software and
documentation.
• source: R Project
27. How it Works: Install
• Hosted on Comprehensive R Archive Network
(CRAN)
• 54 megabytes
28. http://cran.rstudio.com/
• Download and Install R
• Precompiled binary distributions of the base system
and contributed packages, Windows and Mac users
most likely want one of these versions of R:
• Download R for Linux
• Download R for (Mac) OS X
• Download R for Windows
• R is part of many Linux distributions, you should check
with your Linux package management system in
addition to the link above.
43. How Does it Compare?
R SAS SPSS Professional MATLAB
Cost Free! Very VERY High High - $9,975 High
Documentation Yes
Very
comprehensive
OK Some examples
Training Course NA Yes Yes Yes
User interface Low Medium Best Medium
Output
Separate
commands
Automatically produce
diagnosis graph and
forecast
Totally automated
Some automated via GUI,
some specific command
Models*
Does not STL moving
average
Does not have
ARCH/GARCH + and
other moving average
models
Does not have MA &
decomposition models
Certification
Program
Yes Yes Yes