SlideShare ist ein Scribd-Unternehmen logo
1 von 32
Downloaden Sie, um offline zu lesen
Revolution Confidential




R evolution R :
100% R and More



P res ented by:
David S mith @ revodavid
V P Marketing and C ommunity
R evolution A nalytic s
Revolution Confidential




P oll Ques tion
    Which stats package do you use
                 most?
Marc h 13, 2013: Welc ome!                               Revolution Confidential




 Thanks for coming.
 Slides and replay available (soon) at:
   http://bit.ly/YbfQo1



                    David Smith
                    VP Marketing & Community, Revolution Analytics
                    Editor, Revolutions blog
                              http://blog.revolutionanalytics.com
                    Twitter: @revodavid




                                                                           3
In today’s webc as t:                 Revolution Confidential




 About Revolution Analytics and R

 How Revolution R Enterprise enhances R

 Resources for getting more from R

 Q&A


                                                        4
Revolution Confidential
      Revolution Analytics is the leading commercial
         provider of software and support for the
       open-source R statistical computing language



R evolution R E nterpris e is
      Enterprise-ready
      Multi-platform
      Scalable from desktop to big data
      Delivers high performance analytics
      Easier to build and deploy analytic applications



                                                                        5
What is R ?                          Download the White PaperConfidential
                                            R is Hot
                                                      Revolution



                                            bit.ly/r-is-hot
 Data analysis software
 A powerful programming language
   Development platform designed by and for statisticians
 A complete environment
   Huge library of algorithms for data access, data
    manipulation, analysis and graphics
 An open-source software project
   Free, open, and active
 A vibrant community
   Thousands of contributors, 2 million users
   Resources and help in every domain

                                                                     6
R is exploding in popularity and
func tionality                                                                       Revolution Confidential




                                                    “I’ve been astonished by the rate at which
                                                       R has been adopted. Four years ago,
                                                    everyone in my economics department [at
                                                        the University of Chicago] was using
                                                       Stata; now, as far as I can tell, R is the
                                                     standard tool, and students learn it first.”



                                                  Deputy Editor for New Products at Forbes




                                                    “A key benefit of R is that it provides near-
                                                          instant availability of new and
                                                    experimental methods created by its user
                                                          base — without waiting for the
                                                    development/release cycle of commercial
                                                     software. SAS recognizes the value of R
                                                             to our customer base…”


                                                  Product Marketing Manager SAS Institute, Inc


                Source: http://r4stats.com/popularity; “Why R is a name to know in 2011”,
                Forbes; number of packages is now 4,250                                                7
A Vibrant R Us er    More: The R Ecosystem

C ommunity          bit.ly/R-ecosystem
                               Revolution Confidential




  Local R User
  Groups (93)
  Groups (102)




                                                  8
R evolution A nalytic s
S c ales R to the E nterpris e                     Revolution Confidential
                                      Revolution R Enterprise


                                   Power
                                    Distributed high
            Power                    performance analytics


                                   Productivity
                                    Build & deploy analytics
                                     applications easily
                    Productivity

   Enterprise                      Enterprise Readiness
   Readiness
                                    Enterprise landscape
                                    Full-service customer
                                     support, consulting and
                                     training

                                                                     9
R evolution R E nterpris e                                                      Revolution Confidential
High P erformanc e, Multi-P latform A nalytic s P latform

                   Revolution R Enterprise
                                                             DevelopR
                DeployR                                Integrated Development
               Web Services
                                                             Environment


                                      ScaleR
                        High Performance Big Data Analytics

                                      RevoR
                      Performance Enhanced Open Source R
                            Open Source R packages

                                    PlatformR
                           Parallel Distributed Computing
              IBM/Netezza, IBM/Platform LSF, MS HPC Server, MS Azure Burst


                                    ConnectR
                               High Speed Connectors
                               HDFS, Hbase, ODBC, SAS


                                                                                                 10
R evolution R E nterpris e:                                       Revolution Confidential




Performance Enhancements
                                            Enterprise     Performance
                                           Deployment
Greater Productivity & Ease of Use
                                                   Open Source

Tackle “Big Data”                    Technical                     Productivity
                                      Support


IT-Friendly Enterprise Deployment
                                            Training           Big
                                          & Consulting     Data Analysis
On-Call Experts




                                                                                   11
Revolution Confidential




R evolution R E nterpris e

       Productivity




                                              12
T he s tandard R interfac e   Revolution Confidential




                                               13
DevelopR Integrated Development E nvironment
                                                                                              Revolution Confidential
                                         Script with type
                                         ahead and code                           Solutions window
                                            snippets                               for organizing
                                                                                   code and data

    Sophisticated
   debugging with
breakpoints , variable                              Objects
     values etc.                                 loaded in the
                                                      R
                                                 Environment
                 Packages                                                                           Object
               installed and                                                                        details
                  loaded




            http://www.revolutionanalytics.com/demos/revolution-productivity-environment/demo.htm

                                                                                                               14
Revolution Confidential




R evolution R E nterpris e

       Performance




                                              15
P erformanc e: Multi-threaded Math                                                               Revolution Confidential




Open                                                    Revolution R
Source R                                                  Enterprise




  Computation (4-core laptop)                Open Source R              Revolution R                Speedup
  Linear Algebra1
        Matrix Multiply                               176 sec                  9.3 sec                    18x
        Cholesky Factorization                       25.5 sec                  1.3 sec                    19x
        Linear Discriminant Analysis                  189 sec                  74 sec                       3x
  General R Benchmarks2
        R Benchmarks (Matrix Functions)                22 sec                  3.5 sec                      5x
        R Benchmarks (Program Control)                 5.6 sec                 5.4 sec        Not appreciable

                                          1. http://www.revolutionanalytics.com/why-revolution-r/benchmarks.php
                                          2. http://r.research.att.com/benchmarks/

                                                                                                                  16
Revolution Confidential




R evolution R E nterpris e

Big Data Analysis with ScaleR




                                                 17
R evoS c aleR brings the power of
B ig Data to R                                               Revolution Confidential




Parallel External
                                                                    Abstracted
Memory Algorithms
                                                       communications layer
exploit available                     Communications
                      Distributed                      provides portability of
compute resources                       Framework
                      Statistical                               code between
(cores & computers)   Algorithms                            platforms: server,
independent of
                                                       cluster, or in-database
platform


Use the high-speed
local data mart                           R Language          Familiar, high-
(XDF), or stream                           Interface            productivity
                        Data Source
data from SAS,                                                 programming
                            API
ODBC, HDFS or other                                       environment for R
remote data                                                             users
sources.




                                                                              18
S c aleR A ddres s es P erformanc e and
C apac ity L imitations of Open S ourc e R
                                        Revolution Confidential




                                                         19
High P erformanc e B ig Data A nalytic s with
S c aleR                                                 Revolution Confidential




  R Data Step      Descriptive     Statistical      Sampling
                    Statistics       Tests




      Predictive       Data        Machine       Simulation
       Models      Visualization   Learning


                                                                          20
R evolution R E nterpris e S c aleR :                                            Revolution Confidential


    High P erformanc e B ig Data A nalytic s
                          Data Prep, Distillation & Descriptive Analytics

              R Data Step                   Descriptive Statistics              Statistical Tests


    Data import – Delimited,         Min / Max                          Chi Square Test
     Fixed, SAS, SPSS, OBDC           Mean                               Kendall Rank Correlation
    Variable creation &              Median (approx.)                   Fisher’s Exact Test
     transformation                   Quantiles (approx.)                Student’s t-Test
    Recode variables                 Standard Deviation
    Factor variables                 Variance
    Missing value handling           Correlation                                   Sampling
    Sort                             Covariance
    Merge                            Sum of Squares (cross product
    Split                             matrix for set variables)          Subsample (observations &
    Aggregate by category            Pairwise Cross tabs                 variables)
     (means, sums)                    Risk Ratio & Odds Ratio            Random Sampling
                                      Cross-Tabulation of Data
                                       (standard tables & long form)
                                      Marginal Summaries of Cross
                                       Tabulations

                                                                                                      21
R evolution R E nterpris e S c aleR :                                                    Revolution Confidential


    High P erformanc e B ig Data A nalytic s
                           Statistical Modeling                                    Machine Learning

            Predictive Models                           Data Visualization                Cluster Analysis

   Sum of Squares (cross product               Histogram                        K-Means
    matrix for set variables)                   Line Plot
   Multiple Linear Regression                  Scatter Plot
   Generalized Linear Models (GLM)             Lorenz Curve
    - All exponential family                    ROC Curves (actual data and                 Classification
    distributions: binomial, Gaussian,           predicted values)
    inverse Gaussian, Poisson,
    Tweedie. Standard link functions                                              Decision Trees
    including: cauchit, identity, log,
    logit, probit. User defined                        Simulation
    distributions & link functions.
   Covariance & Correlation
    Matrices
   Logistic Regression
   Classification & Regression Trees
   Predictions/scoring for models               Monte Carlo
   Residuals for all models

                                                                                                              22
Revolution Confidential




R evolution R E nterpris e

  Enterprise Deployment




                                              23
C reate c us tom, on-demand analytic s applic ations
                                            Revolution Confidential
S ome examples :
                                    On-demand sales
                                    forecasting




                                                      Real-time social
                                 Leveraging the       media sentiment
                                 power of R from              analysis
                                 Microsoft tools

                                                                   24
R evolution R E nterpris e DeployR
integrates R with applic ations                                                Revolution Confidential



                                                                                Data Analysis

                                       DeployR
            R / Statistical                                     Deployment
           Modeling Expert                                        Expert


                                                                             Business Intelligence




                                                                              Mobile Web Apps
              Seamless
                 Bring the power of R to any web enabled application
              Simple
                 Leverage common APIs including JS, Java, .NET
              Scalable
                 Robustly scale user and compute workloads
              Secure                                                            Cloud / SaaS
                 Manage enterprise security with LDAP & SSO




                                                                                                     25
R evolution R             Revolution Confidential

E nterpris e

A rc hitec ture
Use a connected MPP
server or cluster for:
 Data exploration
 On-demand R
   applications
 Big-data predictive
   models
 Offline (batch)
   operations
 Code generation for
   real-time deployment
C onnec tR for Hadoop: S tream data from
                                    Revolution Confidential
Hadoop to R evolution R E nterpris e
Revolution Confidential




 On-Call Technical Support
 Consulting
   Migration | Analytics | Applications | Validation
 Training
   R | Revolution R | Statistical Topics
 Systems Integration
   BI | ERP | Databases | Cloud

                                                                28
Revolution Confidential




P oll Ques tion
     What interests you most about
      Revolution R Enterprise?
Why c us tomers c hoos e R evolution R
E nterpris e                             Revolution Confidential




     INNOVATION            MULTI-PLATFORM




    TIME-to-VALUE               VALUE




                                                          30
T hank You!                                  Revolution Confidential


 Download slides, replay
   http://bit.ly/YbfQo1

 Resources for getting started with R
    http://bit.ly/ZnZGt2

 Get Revolution R Enterprise
    Contact Sales: http://bit.ly/hey-revo
    Free to Academics:
     www.revolutionanalytics.com/academic

 We’re Hiring!
    www.revolutionanalytics.com/careers

                                                              31
T hank you.                                                                      Revolution Confidential




           The leading commercial provider of software and support for the popular
                            open source R statistics language.




 www.revolutionanalytics.com            650.646.9545                  Twitter: @RevolutionR




                                                                                                  32

Weitere ähnliche Inhalte

Ähnlich wie Revolution R Enterprise: 100% R and More (14 Mar 2013)

Creating Value That Scales with Revolution Analytics & Alteryx
Creating Value That Scales with Revolution Analytics & AlteryxCreating Value That Scales with Revolution Analytics & Alteryx
Creating Value That Scales with Revolution Analytics & AlteryxRevolution Analytics
 
Big data analytics on teradata with revolution r enterprise bill jacobs
Big data analytics on teradata with revolution r enterprise   bill jacobsBig data analytics on teradata with revolution r enterprise   bill jacobs
Big data analytics on teradata with revolution r enterprise bill jacobsBill Jacobs
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with RGreat Wide Open
 
Revolution R: 100% R and more
Revolution R: 100% R and moreRevolution R: 100% R and more
Revolution R: 100% R and moreMasayoshi Ootsuka
 
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...Revolution Analytics
 
Batter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormBatter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormRevolution Analytics
 
Kristof Coussement - The Debate: the Future of (Big) Data Analytics Software
Kristof Coussement - The Debate: the Future of (Big) Data Analytics SoftwareKristof Coussement - The Debate: the Future of (Big) Data Analytics Software
Kristof Coussement - The Debate: the Future of (Big) Data Analytics SoftwareBAQMaR
 
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...Revolution Analytics
 
100% R and More: Plus What's New in Revolution R Enterprise 6.0
100% R and More: Plus What's New in Revolution R Enterprise 6.0100% R and More: Plus What's New in Revolution R Enterprise 6.0
100% R and More: Plus What's New in Revolution R Enterprise 6.0Revolution Analytics
 
R and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopR and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopRevolution Analytics
 
Revolution R Enterprise - 100% R and More Webinar Presentation
Revolution R Enterprise - 100% R and More Webinar PresentationRevolution R Enterprise - 100% R and More Webinar Presentation
Revolution R Enterprise - 100% R and More Webinar PresentationRevolution Analytics
 
Robert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans ExcelRobert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans ExcelMSDEVMTL
 
Microsoft and Revolution Analytics -- what's the add-value? 20150629
Microsoft and Revolution Analytics -- what's the add-value? 20150629Microsoft and Revolution Analytics -- what's the add-value? 20150629
Microsoft and Revolution Analytics -- what's the add-value? 20150629Mark Tabladillo
 
Revolution Analytics: a 5-minute history
Revolution Analytics: a 5-minute historyRevolution Analytics: a 5-minute history
Revolution Analytics: a 5-minute historyRevolution Analytics
 
Applications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the MarketplaceApplications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the MarketplaceRevolution Analytics
 
Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...
Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...
Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...Revolution Analytics
 

Ähnlich wie Revolution R Enterprise: 100% R and More (14 Mar 2013) (20)

Creating Value That Scales with Revolution Analytics & Alteryx
Creating Value That Scales with Revolution Analytics & AlteryxCreating Value That Scales with Revolution Analytics & Alteryx
Creating Value That Scales with Revolution Analytics & Alteryx
 
Big data analytics on teradata with revolution r enterprise bill jacobs
Big data analytics on teradata with revolution r enterprise   bill jacobsBig data analytics on teradata with revolution r enterprise   bill jacobs
Big data analytics on teradata with revolution r enterprise bill jacobs
 
Revolution R: 100% R and more
Revolution R: 100% R and moreRevolution R: 100% R and more
Revolution R: 100% R and more
 
Big Data Analytics with R
Big Data Analytics with RBig Data Analytics with R
Big Data Analytics with R
 
Revolution R: 100% R and more
Revolution R: 100% R and moreRevolution R: 100% R and more
Revolution R: 100% R and more
 
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
Turbo-Charge Your Analytics with IBM Netezza and Revolution R Enterprise: A S...
 
Batter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and StormBatter Up! Advanced Sports Analytics with R and Storm
Batter Up! Advanced Sports Analytics with R and Storm
 
Kristof Coussement - The Debate: the Future of (Big) Data Analytics Software
Kristof Coussement - The Debate: the Future of (Big) Data Analytics SoftwareKristof Coussement - The Debate: the Future of (Big) Data Analytics Software
Kristof Coussement - The Debate: the Future of (Big) Data Analytics Software
 
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
Big Data Predictive Analytics with Revolution R Enterprise (Gartner BI Summit...
 
100% R and More: Plus What's New in Revolution R Enterprise 6.0
100% R and More: Plus What's New in Revolution R Enterprise 6.0100% R and More: Plus What's New in Revolution R Enterprise 6.0
100% R and More: Plus What's New in Revolution R Enterprise 6.0
 
R and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with HadoopR and Big Data using Revolution R Enterprise with Hadoop
R and Big Data using Revolution R Enterprise with Hadoop
 
Revolution R Enterprise - 100% R and More Webinar Presentation
Revolution R Enterprise - 100% R and More Webinar PresentationRevolution R Enterprise - 100% R and More Webinar Presentation
Revolution R Enterprise - 100% R and More Webinar Presentation
 
Robert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans ExcelRobert Luong: Analyse prédictive dans Excel
Robert Luong: Analyse prédictive dans Excel
 
Big Data Analysis Starts with R
Big Data Analysis Starts with RBig Data Analysis Starts with R
Big Data Analysis Starts with R
 
Microsoft and Revolution Analytics -- what's the add-value? 20150629
Microsoft and Revolution Analytics -- what's the add-value? 20150629Microsoft and Revolution Analytics -- what's the add-value? 20150629
Microsoft and Revolution Analytics -- what's the add-value? 20150629
 
Revolution Analytics: a 5-minute history
Revolution Analytics: a 5-minute historyRevolution Analytics: a 5-minute history
Revolution Analytics: a 5-minute history
 
Applications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the MarketplaceApplications in R - Success and Lessons Learned from the Marketplace
Applications in R - Success and Lessons Learned from the Marketplace
 
Kailash resume
Kailash resumeKailash resume
Kailash resume
 
Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...
Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...
Introducing Revolution R Open: Enhanced, Open Source R distribution from Revo...
 
Kailash resume
Kailash resumeKailash resume
Kailash resume
 

Mehr von Revolution Analytics

Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudRevolution Analytics
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureRevolution Analytics
 
Speed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudSpeed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudRevolution Analytics
 
Predicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondPredicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondRevolution Analytics
 
The Value of Open Source Communities
The Value of Open Source CommunitiesThe Value of Open Source Communities
The Value of Open Source CommunitiesRevolution Analytics
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with RRevolution Analytics
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceRevolution Analytics
 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudRevolution Analytics
 
The Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorThe Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorRevolution Analytics
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalRevolution Analytics
 
Simple Reproducibility with the checkpoint package
Simple Reproducibilitywith the checkpoint packageSimple Reproducibilitywith the checkpoint package
Simple Reproducibility with the checkpoint packageRevolution Analytics
 

Mehr von Revolution Analytics (20)

Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the Cloud
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to Azure
 
R in Minecraft
R in Minecraft R in Minecraft
R in Minecraft
 
The case for R for AI developers
The case for R for AI developersThe case for R for AI developers
The case for R for AI developers
 
Speed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudSpeed up R with parallel programming in the Cloud
Speed up R with parallel programming in the Cloud
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
R Then and Now
R Then and NowR Then and Now
R Then and Now
 
Predicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondPredicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per Second
 
Reproducible Data Science with R
Reproducible Data Science with RReproducible Data Science with R
Reproducible Data Science with R
 
The Value of Open Source Communities
The Value of Open Source CommunitiesThe Value of Open Source Communities
The Value of Open Source Communities
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
R at Microsoft (useR! 2016)
R at Microsoft (useR! 2016)R at Microsoft (useR! 2016)
R at Microsoft (useR! 2016)
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with R
 
R at Microsoft
R at MicrosoftR at Microsoft
R at Microsoft
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data Science
 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the Cloud
 
The Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorThe Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductor
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 final
 
Simple Reproducibility with the checkpoint package
Simple Reproducibilitywith the checkpoint packageSimple Reproducibilitywith the checkpoint package
Simple Reproducibility with the checkpoint package
 
R at Microsoft
R at MicrosoftR at Microsoft
R at Microsoft
 

Revolution R Enterprise: 100% R and More (14 Mar 2013)

  • 1. Revolution Confidential R evolution R : 100% R and More P res ented by: David S mith @ revodavid V P Marketing and C ommunity R evolution A nalytic s
  • 2. Revolution Confidential P oll Ques tion Which stats package do you use most?
  • 3. Marc h 13, 2013: Welc ome! Revolution Confidential  Thanks for coming.  Slides and replay available (soon) at:  http://bit.ly/YbfQo1 David Smith VP Marketing & Community, Revolution Analytics Editor, Revolutions blog http://blog.revolutionanalytics.com Twitter: @revodavid 3
  • 4. In today’s webc as t: Revolution Confidential  About Revolution Analytics and R  How Revolution R Enterprise enhances R  Resources for getting more from R  Q&A 4
  • 5. Revolution Confidential Revolution Analytics is the leading commercial provider of software and support for the open-source R statistical computing language R evolution R E nterpris e is  Enterprise-ready  Multi-platform  Scalable from desktop to big data  Delivers high performance analytics  Easier to build and deploy analytic applications 5
  • 6. What is R ? Download the White PaperConfidential R is Hot Revolution bit.ly/r-is-hot  Data analysis software  A powerful programming language  Development platform designed by and for statisticians  A complete environment  Huge library of algorithms for data access, data manipulation, analysis and graphics  An open-source software project  Free, open, and active  A vibrant community  Thousands of contributors, 2 million users  Resources and help in every domain 6
  • 7. R is exploding in popularity and func tionality Revolution Confidential “I’ve been astonished by the rate at which R has been adopted. Four years ago, everyone in my economics department [at the University of Chicago] was using Stata; now, as far as I can tell, R is the standard tool, and students learn it first.” Deputy Editor for New Products at Forbes “A key benefit of R is that it provides near- instant availability of new and experimental methods created by its user base — without waiting for the development/release cycle of commercial software. SAS recognizes the value of R to our customer base…” Product Marketing Manager SAS Institute, Inc Source: http://r4stats.com/popularity; “Why R is a name to know in 2011”, Forbes; number of packages is now 4,250 7
  • 8. A Vibrant R Us er More: The R Ecosystem C ommunity bit.ly/R-ecosystem Revolution Confidential Local R User Groups (93) Groups (102) 8
  • 9. R evolution A nalytic s S c ales R to the E nterpris e Revolution Confidential Revolution R Enterprise Power  Distributed high Power performance analytics Productivity  Build & deploy analytics applications easily Productivity Enterprise Enterprise Readiness Readiness  Enterprise landscape  Full-service customer support, consulting and training 9
  • 10. R evolution R E nterpris e Revolution Confidential High P erformanc e, Multi-P latform A nalytic s P latform Revolution R Enterprise DevelopR DeployR Integrated Development Web Services Environment ScaleR High Performance Big Data Analytics RevoR Performance Enhanced Open Source R Open Source R packages PlatformR Parallel Distributed Computing IBM/Netezza, IBM/Platform LSF, MS HPC Server, MS Azure Burst ConnectR High Speed Connectors HDFS, Hbase, ODBC, SAS 10
  • 11. R evolution R E nterpris e: Revolution Confidential Performance Enhancements Enterprise Performance Deployment Greater Productivity & Ease of Use Open Source Tackle “Big Data” Technical Productivity Support IT-Friendly Enterprise Deployment Training Big & Consulting Data Analysis On-Call Experts 11
  • 12. Revolution Confidential R evolution R E nterpris e Productivity 12
  • 13. T he s tandard R interfac e Revolution Confidential 13
  • 14. DevelopR Integrated Development E nvironment Revolution Confidential Script with type ahead and code Solutions window snippets for organizing code and data Sophisticated debugging with breakpoints , variable Objects values etc. loaded in the R Environment Packages Object installed and details loaded http://www.revolutionanalytics.com/demos/revolution-productivity-environment/demo.htm 14
  • 15. Revolution Confidential R evolution R E nterpris e Performance 15
  • 16. P erformanc e: Multi-threaded Math Revolution Confidential Open Revolution R Source R Enterprise Computation (4-core laptop) Open Source R Revolution R Speedup Linear Algebra1 Matrix Multiply 176 sec 9.3 sec 18x Cholesky Factorization 25.5 sec 1.3 sec 19x Linear Discriminant Analysis 189 sec 74 sec 3x General R Benchmarks2 R Benchmarks (Matrix Functions) 22 sec 3.5 sec 5x R Benchmarks (Program Control) 5.6 sec 5.4 sec Not appreciable 1. http://www.revolutionanalytics.com/why-revolution-r/benchmarks.php 2. http://r.research.att.com/benchmarks/ 16
  • 17. Revolution Confidential R evolution R E nterpris e Big Data Analysis with ScaleR 17
  • 18. R evoS c aleR brings the power of B ig Data to R Revolution Confidential Parallel External Abstracted Memory Algorithms communications layer exploit available Communications Distributed provides portability of compute resources Framework Statistical code between (cores & computers) Algorithms platforms: server, independent of cluster, or in-database platform Use the high-speed local data mart R Language Familiar, high- (XDF), or stream Interface productivity Data Source data from SAS, programming API ODBC, HDFS or other environment for R remote data users sources. 18
  • 19. S c aleR A ddres s es P erformanc e and C apac ity L imitations of Open S ourc e R Revolution Confidential 19
  • 20. High P erformanc e B ig Data A nalytic s with S c aleR Revolution Confidential R Data Step Descriptive Statistical Sampling Statistics Tests Predictive Data Machine Simulation Models Visualization Learning 20
  • 21. R evolution R E nterpris e S c aleR : Revolution Confidential High P erformanc e B ig Data A nalytic s Data Prep, Distillation & Descriptive Analytics R Data Step Descriptive Statistics Statistical Tests  Data import – Delimited,  Min / Max  Chi Square Test Fixed, SAS, SPSS, OBDC  Mean  Kendall Rank Correlation  Variable creation &  Median (approx.)  Fisher’s Exact Test transformation  Quantiles (approx.)  Student’s t-Test  Recode variables  Standard Deviation  Factor variables  Variance  Missing value handling  Correlation Sampling  Sort  Covariance  Merge  Sum of Squares (cross product  Split matrix for set variables)  Subsample (observations &  Aggregate by category  Pairwise Cross tabs variables) (means, sums)  Risk Ratio & Odds Ratio  Random Sampling  Cross-Tabulation of Data (standard tables & long form)  Marginal Summaries of Cross Tabulations 21
  • 22. R evolution R E nterpris e S c aleR : Revolution Confidential High P erformanc e B ig Data A nalytic s Statistical Modeling Machine Learning Predictive Models Data Visualization Cluster Analysis  Sum of Squares (cross product  Histogram  K-Means matrix for set variables)  Line Plot  Multiple Linear Regression  Scatter Plot  Generalized Linear Models (GLM)  Lorenz Curve - All exponential family  ROC Curves (actual data and Classification distributions: binomial, Gaussian, predicted values) inverse Gaussian, Poisson, Tweedie. Standard link functions  Decision Trees including: cauchit, identity, log, logit, probit. User defined Simulation distributions & link functions.  Covariance & Correlation Matrices  Logistic Regression  Classification & Regression Trees  Predictions/scoring for models  Monte Carlo  Residuals for all models 22
  • 23. Revolution Confidential R evolution R E nterpris e Enterprise Deployment 23
  • 24. C reate c us tom, on-demand analytic s applic ations Revolution Confidential S ome examples : On-demand sales forecasting Real-time social Leveraging the media sentiment power of R from analysis Microsoft tools 24
  • 25. R evolution R E nterpris e DeployR integrates R with applic ations Revolution Confidential Data Analysis DeployR R / Statistical Deployment Modeling Expert Expert Business Intelligence Mobile Web Apps  Seamless Bring the power of R to any web enabled application  Simple Leverage common APIs including JS, Java, .NET  Scalable Robustly scale user and compute workloads  Secure Cloud / SaaS Manage enterprise security with LDAP & SSO 25
  • 26. R evolution R Revolution Confidential E nterpris e A rc hitec ture Use a connected MPP server or cluster for:  Data exploration  On-demand R applications  Big-data predictive models  Offline (batch) operations  Code generation for real-time deployment
  • 27. C onnec tR for Hadoop: S tream data from Revolution Confidential Hadoop to R evolution R E nterpris e
  • 28. Revolution Confidential  On-Call Technical Support  Consulting  Migration | Analytics | Applications | Validation  Training  R | Revolution R | Statistical Topics  Systems Integration  BI | ERP | Databases | Cloud 28
  • 29. Revolution Confidential P oll Ques tion What interests you most about Revolution R Enterprise?
  • 30. Why c us tomers c hoos e R evolution R E nterpris e Revolution Confidential INNOVATION MULTI-PLATFORM TIME-to-VALUE VALUE 30
  • 31. T hank You! Revolution Confidential  Download slides, replay  http://bit.ly/YbfQo1  Resources for getting started with R  http://bit.ly/ZnZGt2  Get Revolution R Enterprise  Contact Sales: http://bit.ly/hey-revo  Free to Academics: www.revolutionanalytics.com/academic  We’re Hiring!  www.revolutionanalytics.com/careers 31
  • 32. T hank you. Revolution Confidential The leading commercial provider of software and support for the popular open source R statistics language. www.revolutionanalytics.com 650.646.9545 Twitter: @RevolutionR 32