SlideShare ist ein Scribd-Unternehmen logo
1 von 29
Downloaden Sie, um offline zu lesen
CERTIFICATE
PROGRAM – USING R
 Statistics.com
 THE INSTITUTE FOR STATISTICS EDUCATION
About Statistics.com

• First course 2002 (resampling methods)
• 2003-2004 added courses in data mining, modeling, intro stats
• Now 100+ courses
• Hybrid model between
  • Professional development (topic centered, scheduling accommodates working
    professionals)
  • Academic (homework and assessment)

• Taught by noted authorities
• Statistics, predictive modeling, data mining, R, optimization, risk
 modeling, clinical trials…
Why learn R?

Why take classes?
The spread of R:

Phase 1: R started in 1993 by academics, and gained popularity
in universities around the world – open source & free!

Phase 2: PhD statisticians who used R in university took it to
their complex quant modeling jobs in industry.

Phase 3 (now): R is ubiquitous:
   •Industry - now that R is seeded by the PhD statisticians,
   other analysts in their companies need to know it.
   •Academic – researchers in a variety of fields who do
   statistics but are not primarily statisticians use R
Why learn R? Let’s look at what
                           employers are looking for:




                                          R

                                                             SAS


                                           SPSS




                         Relative proportion of mentions of statistical tools in “job
                         requirements” section of job postings. A single job may
                         mention more than one tool.



Source: Statistics.com survey of approx. 4000 analytics/statistics job postings on various job sites, May,
2012
ConAgra Foods’ Human Capital Analytics/Reporting (HCA/R) program is searching for a project
manager/statistician ….development of predictive modeling processes to answer different business issues.
Excellent computer skills specifically with advanced Excel (v-lookups, pivot tables, macros), R, and other open
source software. Experience in configuration of data to support complex data mining & statistical analysis.


                 SAS- is seeking a Research Statistician to apply cutting-edge econometric models ...
                 demonstrated experience or knowledge of computer programming; ... particularly with
                 applied econometric modeling or time series analysis; the SAS system; statistical software
                 products, such as WinBugs, R, Stata, EViews, OxMetrics, or S-Plus.



    AmazonLocal, Applied Machine Learning Scientist · Run sampling, clustering, classification, etc on large datasets using a
  variety of analytics software (e.g. SAS, Python, R, etc).


            SRA International, ...
            * Use of statistical algorithms, techniques and models to define data for data integrity and process
            analytics
            * Use of data mining techniques to define data
            ...
            * Experience and demonstrated expertise with at multiple data mining tools including SAS, SPSS, R,
            Weka, etc.


      Need for R goes hand-in-hand with need for higher-level stats skills.
How is R being used in the real world?
Who are your fellow students?
                                                                                                                      China
   Australia
                                                   Germany

                                                                      India
                                                                                                      Academia
                        Industry                                                                               Bioinformaticist
                                                                                            PhD candidate in
                                                                                            epidemiology
    Database marketer,                                          Canada                                           Prof. of
                                  Survey researcher
    international bank                                                                                           medicine
                                                                                          Health researcher
      Digital Marketer J&J                                                                                      Plant ecologist
                                  Project manager, large
                                  consulting firm                                            PhD student in animal
 Circulation manager,                                                                        embryology
 Countryside Pubs.
                                 Statistical geneticist
                                                                                                    Anthropologist, human
                                                                    UK                              remains
                             Web developer
    Casualty actuary                  Farmer, Calif. Central
                                      Valley
                Forecaster, Walt Disney
                                                                     Government                                     Netherlands
                   Commodities analyst, hedge
                                                                                 Researcher, K-12
                   fund
                                                               Risk analyst,     school dist.
                                                               agriculture dept.
                                                                         CDC
                                                                         epidemiologist
                                                                  Team coordinator                                      Brazil
                  Denmark                                         aerospace
                                                                  medicine
                                                                         Forest
                                                                         monitor
Executive and an assistant professor in an academic medical center: I have extensive
   experience with SPSS …I see R as the future for quantitative work and need to begin
   doing more of my work in R.
             Analyst with state government natural resource agency: We have survey designs for regional
             monitoring that we continually need to evaluate and improve. Currently, I program in C and rely
             heavily on Monte Carlo modeling. I plot in Excel and have wanted to learn R to get greater
             flexibility.

     Analyst with health and human service agency: My job is mostly data analysis and some statistical
     modeling which is handled via SAS and PL/SQL. Other agencies have incorporated R. I am
     looking to be prepared should our agency adopt R as well as understand how R compares with SAS
     with the hope of drawing from the strengths of both in the future.
Marketing analyst, international banking: Since we are manipulating tons of data at customer
level for more than 27 countries, R would be the perfect complement tool (we have been using
SAS) for customer analytics.
               Analyst with non-profit organization: We do quite a bit of data analysis (mostly descriptive
               work and GIS mapping) and I started teaching myself R a few years ago in order to
               automate our routine data cleaning.

Database marketer, banking: I have used SAS for 8 years, also have experience in
FICO Model Builder, but am new to R and want to learn those comprehensive
packages which are not available in base SAS to do more advance analytics.

         Commodities analyst at a hedge fund: I'm looking to use R to build more robust, stable and
         dynamic econometric models.
Why take classes? Why not learn on your own?

• R is not like SAS, SPSS
  • SAS has two very distinct user types:
     • Programmer
     • Statistical modeler & analyst
  • SPSS the latter
  • R is powerful, but has more programming and “messiness,” even when used purely in
    analysis/modeling mode.
• Often it is helpful to have an expert on hand while learning R
• 4-week courses allow an iterative process – a short intensive learning
 period, ask lots of questions. Apply what you learn. Come back later for
 another 4-week class. Learn more. Apply. Repeat.
Certificate Program Content
     PREREQUISITES: None for entry into program, but
     introductory statistics is a prerequisite for some courses.



                                                     6 ELECTIVES
       6 REQUIRED
                                          R-Specific:             Include R:
  •Intro to R – Data Handling
  •Intro to R – Statistical               •Data mining            •Probability
  Analysis                                •Spatial                Distributions
  •Programming in R                       •Microarray             •Resampling
  •Programming in R – Adv.                •SVM                    •Bootstrap
  •Modeling in R                          •Clinical Trials Apps   •Logistic
  •Graphics in R                          •ggplot2                Regression
                                          •Smoothing with P-      •GLM
                                          splines                 •Count data
                                          •Survey Analysis
1. The principles of R programming:
• Introduction to R – Data Handling (Paul Murrell) introduces basic
 expressions, symbols, assignment, functions, packages, use of code
 editors (emacs), workspace, data types & structures, subsetting,
 assessor functions, classes, type coercion, text files, binary files, large
 files, memory management, apply function, tabulate, aggregation,
 merging and splitting data, reshape, text processing.

• Programming in R (2 courses with Hadley Wickham) covers lexical
 scoping, dynamic scoping, frames, environments, namespaces, active
 bindings, quoting, evaluation, calling from other functions, string
 processing (stringr), dates and times (lubridate), regular expressions, xml
 and xpath, extracting data with SQL, executing SQL in R, writing compact
 and efficient code (helper function, lapply), anonymous functions, first
 class functions, object oriented programming, S3, tips for producing
 reliable code, functions and options to help debug, speed, testing.
2. Plotting and visualizing data in R:
Graphics in R (Paul Murrell, covers the core R capabilities for
graphing, and teaches you to produce key statistical plots such as
scatterplots, )

R ggplot2 (Hadley Wickham teaches how to use his package,
which is a package with its own language that rests on R, to create
graphs)
3. Application/method/domain specific:
Other classes are application oriented, where syntax and
programming are discussed, as necessary, on the path to
getting R to accomplish something specific. Intro stats,
statistical modeling, microarray analysis, data mining, survey
analysis.

In the most basic of these, Introduction to R – Statistical
Analysis, some familiarity with statistical procedures is assumed
and you learn R by executing these procedures (t-tests, chi-
square, correlation, regression, etc.) in R. In other cases, the
emphasis is on learning the method and R is simply the chosen
tool.

Let’s see an example from the Statistical Analysis course.
Snapshot: Regression. The instructions are given
step-by-step in Lesson 3 of “Introduction to R –
Statistical Analysis.”

The lm function will estimate the regression parameters for the
simple linear regression model. For the two models specified above
we have:

> lm(total ~ w.class, data = d)

Call:
lm(formula = total ~ w.class, data = d)

Coefficients:
(Intercept) w.class

159.815 2.732

which gives estimates ˆb0 = 159.815 and ˆb1 = 2.732.
KIM ASKS

Hi John,
I was plotting the residuals from a linear regression (example on page 19 of the lesson 3), and there was a delay before the
plots would show. The message on the R console was "Waiting to confirm page change." By clicking on the graphics, I could
switch from one plot to the next. Is there anyway to make them tile so I can see all of them at once, or any way to go back and
forth once they've 'printed' on the graphics page?

             JOHN VERZANI REPLIES:

             A couple of possibilities exist:

             You can partition your graphics device so that more than one graphic will appear. For example, par(mfrow=c(2,2)) will set up a 2
             by 2 grid, perfect for the plot function called on the output of the lm function.
             On some implementations you can record plots and scroll back through them. For windows users, the RGui application (your
             basic interface) allows you to turn on recording, I think by right clicking on a plot (if I'm wrong let me know, and I'll check).
             For RStudio, the graphs are already recorded. There are arrows to scroll.
             Hope one of those works for you. --J

 SABINA CHIMES IN

 Where do you type it? In the plot command? I have tried:
 > plot(res.pipeline, par(mfrow=c(2,2)))
 and get
 Error in plot.lm(res.pipeline, par(mfrow = c(2, 2))) :
 'which' must be in 1:6
 ....How do you keep track of all these different ways of doing things. I find that your comments are amazing...

             JOHN REPLIES

             The par settings are done in their own command (well some are). Try:
                par(mfrow=c(2,2))
                plot(res.pipeline)
             The ".lm" extra bit isn't necessary (though doesn't hurt), as R will use the class of res.pipeline to find that function in most usual
             cases.
             Let me know if that doesn't help   .
ALTA ASKS

John, what does masked in the following error message mean? and what is '.GlobeEnv'? thnx in advance
>attach(kid.weights)
The following object(S) are masked _by_ '.GlobeEnv':
age




             JOHN REPLIES

             R looks for objects by traversing a series of nested environments. In this case, when you
             attach(kid.weights) it includes a variable 'age'. However you already have a variable 'age' in your global
             workspace (.GlobalEnv is the secret name for that). Which one do you want? Well, R is answering which
             one it will find. In this case the one in the global workspace, not that in kid.weights. For that one, you will
             need to work harder (using $ or with or ...)

             Does that help?




  gotcha! very helpful--thnx
• Polling question #3 – how many analytics professionals
How courses work


    Discussion                        Homework
    Forum




                 Readings, notes, videos
Weekly Course Schedule
                               Most courses are 4 weeks.
                                                 ~ March 2013 ~
     Sun             Mon          Tue                 Wed              Thu          Fri            Sat
                                                                             1                2
                                                                             Lesson 1 opens



3              4           5                     6                7          8                9
                                                                             Lesson 2 opens



10             11          12                    13               14         15               16
Homework 1 due             Feedback Homework 1                               Lesson 3 opens



17             18          19                    20               21         22               23
Homework 2 due             Feedback Homework 2                               Lesson 4 opens



24             25          26                    27               28         29               30
Homework 3 due             Feedback Homework 3




31             April 1     2                     3                4          5                6
Homework 4 due             Feedback Homework 4
Time Required


                • Estimate 15 hours per week
                • Don’t need to be online at
                particular times or days
                • Time zone does not matter
                • Best not to leave all work until the
                end of the week
                • Materials remain open for a
                couple of weeks after end-of-
                course
                • Most students are working
                professionals, take courses one at
                a time
Faculty


Paul Murrell   John Verzani   Hadley Wickham   Sudha Purohit




Luis Torgo     David Unwin    Thomas Lumley      Din Chen




Karl Peace      Garrett         Brian Marx      Paul Eilers
               Grolemund
Typical Course Contents – R Programming


• “Headquarters” Page
• Lesson Page
• Readings/notes/videos
• Homework
• Discussion Forum
Typical Course Contents – R Programming


• “Headquarters” Page
• Lesson Page
• Readings/notes/videos
• Homework
• Discussion Forum
Typical Course Contents – R Programming


• “Headquarters” Page
• Lesson Page
• Readings/notes/videos
• Homework
• Discussion Forum
Typical Course Contents – R Programming


• “Headquarters” Page
• Lesson Page
• Readings/notes/videos
• Homework
• Discussion Forum
Typical Course Contents – R Programming


• “Headquarters” Page
• Lesson Page
• Readings/notes/videos
• Homework
• Discussion Forum
Equiv. to
                18
                credits,
                US
                system



$5900 approx.
Next Step.
For certificate program application, contact
sales@revolutionanalytics.com or call
1-855-GET-REVO (1-855-438-7386)

• Application fee will be waived (until July 30th)
• Up to 50% discount offered for Revolution Analytics
  software when purchased in combination with training

Weitere ähnliche Inhalte

Andere mochten auch

Docopt, beautiful command-line options for R, user2014
Docopt, beautiful command-line options for R,  user2014Docopt, beautiful command-line options for R,  user2014
Docopt, beautiful command-line options for R, user2014Edwin de Jonge
 
R Programming: First Steps
R Programming: First StepsR Programming: First Steps
R Programming: First StepsRsquared Academy
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming FundamentalsRagia Ibrahim
 
R Statistics
R StatisticsR Statistics
R Statisticsr content
 
Extending and customizing ibm spss statistics with python, r, and .net (2)
Extending and customizing ibm spss statistics with python, r, and .net (2)Extending and customizing ibm spss statistics with python, r, and .net (2)
Extending and customizing ibm spss statistics with python, r, and .net (2)Armand Ruis
 
R programming intro with examples
R programming intro with examplesR programming intro with examples
R programming intro with examplesDennis
 
R Programming: Introduction to Matrices
R Programming: Introduction to MatricesR Programming: Introduction to Matrices
R Programming: Introduction to MatricesRsquared Academy
 
3 descriptive statistics with R
3 descriptive statistics with R3 descriptive statistics with R
3 descriptive statistics with Rnaroranisha
 
Presentation on use of r statistics
Presentation on use of r statisticsPresentation on use of r statistics
Presentation on use of r statisticsKrishna Dhakal
 
Descriptive Statistics with R
Descriptive Statistics with RDescriptive Statistics with R
Descriptive Statistics with RKazuki Yoshida
 
Introduction to basic statistics
Introduction to basic statisticsIntroduction to basic statistics
Introduction to basic statisticsIBM
 
R programming language
R programming languageR programming language
R programming languageKeerti Verma
 
How to use Logistic Regression in GIS using ArcGIS and R statistics
How to use Logistic Regression in GIS using ArcGIS and R statisticsHow to use Logistic Regression in GIS using ArcGIS and R statistics
How to use Logistic Regression in GIS using ArcGIS and R statisticsOmar F. Althuwaynee
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformSyracuse University
 

Andere mochten auch (20)

R program
R programR program
R program
 
Seefeld stats r_bio
Seefeld stats r_bioSeefeld stats r_bio
Seefeld stats r_bio
 
Docopt, beautiful command-line options for R, user2014
Docopt, beautiful command-line options for R,  user2014Docopt, beautiful command-line options for R,  user2014
Docopt, beautiful command-line options for R, user2014
 
R Programming: First Steps
R Programming: First StepsR Programming: First Steps
R Programming: First Steps
 
R programming Fundamentals
R programming  FundamentalsR programming  Fundamentals
R programming Fundamentals
 
R Statistics
R StatisticsR Statistics
R Statistics
 
Extending and customizing ibm spss statistics with python, r, and .net (2)
Extending and customizing ibm spss statistics with python, r, and .net (2)Extending and customizing ibm spss statistics with python, r, and .net (2)
Extending and customizing ibm spss statistics with python, r, and .net (2)
 
Statistics with R
Statistics with RStatistics with R
Statistics with R
 
R programming intro with examples
R programming intro with examplesR programming intro with examples
R programming intro with examples
 
R Programming: Introduction to Matrices
R Programming: Introduction to MatricesR Programming: Introduction to Matrices
R Programming: Introduction to Matrices
 
3 descriptive statistics with R
3 descriptive statistics with R3 descriptive statistics with R
3 descriptive statistics with R
 
Presentation on use of r statistics
Presentation on use of r statisticsPresentation on use of r statistics
Presentation on use of r statistics
 
Descriptive Statistics with R
Descriptive Statistics with RDescriptive Statistics with R
Descriptive Statistics with R
 
Using R For Statistics
Using R For StatisticsUsing R For Statistics
Using R For Statistics
 
Introduction to basic statistics
Introduction to basic statisticsIntroduction to basic statistics
Introduction to basic statistics
 
Chapter 1
Chapter 1Chapter 1
Chapter 1
 
Chapter 4
Chapter 4Chapter 4
Chapter 4
 
R programming language
R programming languageR programming language
R programming language
 
How to use Logistic Regression in GIS using ArcGIS and R statistics
How to use Logistic Regression in GIS using ArcGIS and R statisticsHow to use Logistic Regression in GIS using ArcGIS and R statistics
How to use Logistic Regression in GIS using ArcGIS and R statistics
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics Platform
 

Ähnlich wie Getting Up to Speed with R: Certificate Program in R for Statistical Analysis, Visualization and Modeling

Resume-Vishwajit-Sen.pdf
Resume-Vishwajit-Sen.pdfResume-Vishwajit-Sen.pdf
Resume-Vishwajit-Sen.pdfSen, Vishwajit
 
What is The Importance of SPSS How Will I Get SPSS help online in Australia.pdf
What is The Importance of SPSS How Will I Get SPSS help online in Australia.pdfWhat is The Importance of SPSS How Will I Get SPSS help online in Australia.pdf
What is The Importance of SPSS How Will I Get SPSS help online in Australia.pdfWilliamJhons
 
Resume_Analytics_Nidhi
Resume_Analytics_NidhiResume_Analytics_Nidhi
Resume_Analytics_NidhiNidhi Gupta
 
Marilyn Kaye Mowrey Resume 2016
Marilyn Kaye Mowrey Resume 2016Marilyn Kaye Mowrey Resume 2016
Marilyn Kaye Mowrey Resume 2016Kaye Mowrey
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data AnalyticsUtkarsh Sharma
 
Data science using r multisoft systems
Data science using r  multisoft systemsData science using r  multisoft systems
Data science using r multisoft systemsMultisoft Systems
 
Curriculum Vitaeof Dr.Mohit Saxena
Curriculum Vitaeof Dr.Mohit SaxenaCurriculum Vitaeof Dr.Mohit Saxena
Curriculum Vitaeof Dr.Mohit SaxenaMohit Saxena
 
Bo Chen Resume
Bo Chen ResumeBo Chen Resume
Bo Chen ResumeBo Chen
 
Paramdeep Singh'S resume
Paramdeep Singh'S resume Paramdeep Singh'S resume
Paramdeep Singh'S resume Paramdeep Singh
 
Kim bonett cv_2016
Kim bonett cv_2016Kim bonett cv_2016
Kim bonett cv_2016koko bonet
 
LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic InnovationLFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic InnovationAmazon Web Services
 

Ähnlich wie Getting Up to Speed with R: Certificate Program in R for Statistical Analysis, Visualization and Modeling (20)

2011phdfair orgs9 08_11
2011phdfair orgs9 08_112011phdfair orgs9 08_11
2011phdfair orgs9 08_11
 
Resume-Vishwajit-Sen.pdf
Resume-Vishwajit-Sen.pdfResume-Vishwajit-Sen.pdf
Resume-Vishwajit-Sen.pdf
 
Resume Diego Marinho de Oliveira
Resume Diego Marinho de OliveiraResume Diego Marinho de Oliveira
Resume Diego Marinho de Oliveira
 
What is The Importance of SPSS How Will I Get SPSS help online in Australia.pdf
What is The Importance of SPSS How Will I Get SPSS help online in Australia.pdfWhat is The Importance of SPSS How Will I Get SPSS help online in Australia.pdf
What is The Importance of SPSS How Will I Get SPSS help online in Australia.pdf
 
CV KH Jul 2016
CV KH Jul 2016CV KH Jul 2016
CV KH Jul 2016
 
Resume_Analytics_Nidhi
Resume_Analytics_NidhiResume_Analytics_Nidhi
Resume_Analytics_Nidhi
 
Marilyn Kaye Mowrey Resume 2016
Marilyn Kaye Mowrey Resume 2016Marilyn Kaye Mowrey Resume 2016
Marilyn Kaye Mowrey Resume 2016
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
RISK EVALUATION-1
RISK EVALUATION-1RISK EVALUATION-1
RISK EVALUATION-1
 
Cv zamir siddiqui
Cv zamir siddiquiCv zamir siddiqui
Cv zamir siddiqui
 
Resume_Shilpi Singhwal
Resume_Shilpi SinghwalResume_Shilpi Singhwal
Resume_Shilpi Singhwal
 
Data science using r multisoft systems
Data science using r  multisoft systemsData science using r  multisoft systems
Data science using r multisoft systems
 
Rasa
RasaRasa
Rasa
 
Go Predictive Analytics
Go Predictive AnalyticsGo Predictive Analytics
Go Predictive Analytics
 
Curriculum Vitaeof Dr.Mohit Saxena
Curriculum Vitaeof Dr.Mohit SaxenaCurriculum Vitaeof Dr.Mohit Saxena
Curriculum Vitaeof Dr.Mohit Saxena
 
Bo Chen Resume
Bo Chen ResumeBo Chen Resume
Bo Chen Resume
 
Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Paramdeep Singh'S resume
Paramdeep Singh'S resume Paramdeep Singh'S resume
Paramdeep Singh'S resume
 
Kim bonett cv_2016
Kim bonett cv_2016Kim bonett cv_2016
Kim bonett cv_2016
 
LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic InnovationLFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
LFS302_Real-World Evidence Platform to Enable Therapeutic Innovation
 

Mehr von Revolution Analytics

Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudRevolution Analytics
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureRevolution Analytics
 
Speed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudSpeed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudRevolution Analytics
 
Predicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondPredicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondRevolution Analytics
 
The Value of Open Source Communities
The Value of Open Source CommunitiesThe Value of Open Source Communities
The Value of Open Source CommunitiesRevolution Analytics
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with RRevolution Analytics
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceRevolution Analytics
 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudRevolution Analytics
 
The Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorThe Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorRevolution Analytics
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalRevolution Analytics
 
Simple Reproducibility with the checkpoint package
Simple Reproducibilitywith the checkpoint packageSimple Reproducibilitywith the checkpoint package
Simple Reproducibility with the checkpoint packageRevolution Analytics
 

Mehr von Revolution Analytics (20)

Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the Cloud
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to Azure
 
R in Minecraft
R in Minecraft R in Minecraft
R in Minecraft
 
The case for R for AI developers
The case for R for AI developersThe case for R for AI developers
The case for R for AI developers
 
Speed up R with parallel programming in the Cloud
Speed up R with parallel programming in the CloudSpeed up R with parallel programming in the Cloud
Speed up R with parallel programming in the Cloud
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
R Then and Now
R Then and NowR Then and Now
R Then and Now
 
Predicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per SecondPredicting Loan Delinquency at One Million Transactions per Second
Predicting Loan Delinquency at One Million Transactions per Second
 
Reproducible Data Science with R
Reproducible Data Science with RReproducible Data Science with R
Reproducible Data Science with R
 
The Value of Open Source Communities
The Value of Open Source CommunitiesThe Value of Open Source Communities
The Value of Open Source Communities
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
R at Microsoft (useR! 2016)
R at Microsoft (useR! 2016)R at Microsoft (useR! 2016)
R at Microsoft (useR! 2016)
 
Building a scalable data science platform with R
Building a scalable data science platform with RBuilding a scalable data science platform with R
Building a scalable data science platform with R
 
R at Microsoft
R at MicrosoftR at Microsoft
R at Microsoft
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data Science
 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the Cloud
 
The Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductorThe Network structure of R packages on CRAN & BioConductor
The Network structure of R packages on CRAN & BioConductor
 
The network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 finalThe network structure of cran 2015 07-02 final
The network structure of cran 2015 07-02 final
 
Simple Reproducibility with the checkpoint package
Simple Reproducibilitywith the checkpoint packageSimple Reproducibilitywith the checkpoint package
Simple Reproducibility with the checkpoint package
 
R at Microsoft
R at MicrosoftR at Microsoft
R at Microsoft
 

Kürzlich hochgeladen

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Kürzlich hochgeladen (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Getting Up to Speed with R: Certificate Program in R for Statistical Analysis, Visualization and Modeling

  • 1. CERTIFICATE PROGRAM – USING R Statistics.com THE INSTITUTE FOR STATISTICS EDUCATION
  • 2. About Statistics.com • First course 2002 (resampling methods) • 2003-2004 added courses in data mining, modeling, intro stats • Now 100+ courses • Hybrid model between • Professional development (topic centered, scheduling accommodates working professionals) • Academic (homework and assessment) • Taught by noted authorities • Statistics, predictive modeling, data mining, R, optimization, risk modeling, clinical trials…
  • 3. Why learn R? Why take classes?
  • 4. The spread of R: Phase 1: R started in 1993 by academics, and gained popularity in universities around the world – open source & free! Phase 2: PhD statisticians who used R in university took it to their complex quant modeling jobs in industry. Phase 3 (now): R is ubiquitous: •Industry - now that R is seeded by the PhD statisticians, other analysts in their companies need to know it. •Academic – researchers in a variety of fields who do statistics but are not primarily statisticians use R
  • 5. Why learn R? Let’s look at what employers are looking for: R SAS SPSS Relative proportion of mentions of statistical tools in “job requirements” section of job postings. A single job may mention more than one tool. Source: Statistics.com survey of approx. 4000 analytics/statistics job postings on various job sites, May, 2012
  • 6. ConAgra Foods’ Human Capital Analytics/Reporting (HCA/R) program is searching for a project manager/statistician ….development of predictive modeling processes to answer different business issues. Excellent computer skills specifically with advanced Excel (v-lookups, pivot tables, macros), R, and other open source software. Experience in configuration of data to support complex data mining & statistical analysis. SAS- is seeking a Research Statistician to apply cutting-edge econometric models ... demonstrated experience or knowledge of computer programming; ... particularly with applied econometric modeling or time series analysis; the SAS system; statistical software products, such as WinBugs, R, Stata, EViews, OxMetrics, or S-Plus. AmazonLocal, Applied Machine Learning Scientist · Run sampling, clustering, classification, etc on large datasets using a variety of analytics software (e.g. SAS, Python, R, etc). SRA International, ... * Use of statistical algorithms, techniques and models to define data for data integrity and process analytics * Use of data mining techniques to define data ... * Experience and demonstrated expertise with at multiple data mining tools including SAS, SPSS, R, Weka, etc. Need for R goes hand-in-hand with need for higher-level stats skills.
  • 7. How is R being used in the real world?
  • 8. Who are your fellow students? China Australia Germany India Academia Industry Bioinformaticist PhD candidate in epidemiology Database marketer, Canada Prof. of Survey researcher international bank medicine Health researcher Digital Marketer J&J Plant ecologist Project manager, large consulting firm PhD student in animal Circulation manager, embryology Countryside Pubs. Statistical geneticist Anthropologist, human UK remains Web developer Casualty actuary Farmer, Calif. Central Valley Forecaster, Walt Disney Government Netherlands Commodities analyst, hedge Researcher, K-12 fund Risk analyst, school dist. agriculture dept. CDC epidemiologist Team coordinator Brazil Denmark aerospace medicine Forest monitor
  • 9. Executive and an assistant professor in an academic medical center: I have extensive experience with SPSS …I see R as the future for quantitative work and need to begin doing more of my work in R. Analyst with state government natural resource agency: We have survey designs for regional monitoring that we continually need to evaluate and improve. Currently, I program in C and rely heavily on Monte Carlo modeling. I plot in Excel and have wanted to learn R to get greater flexibility. Analyst with health and human service agency: My job is mostly data analysis and some statistical modeling which is handled via SAS and PL/SQL. Other agencies have incorporated R. I am looking to be prepared should our agency adopt R as well as understand how R compares with SAS with the hope of drawing from the strengths of both in the future. Marketing analyst, international banking: Since we are manipulating tons of data at customer level for more than 27 countries, R would be the perfect complement tool (we have been using SAS) for customer analytics. Analyst with non-profit organization: We do quite a bit of data analysis (mostly descriptive work and GIS mapping) and I started teaching myself R a few years ago in order to automate our routine data cleaning. Database marketer, banking: I have used SAS for 8 years, also have experience in FICO Model Builder, but am new to R and want to learn those comprehensive packages which are not available in base SAS to do more advance analytics. Commodities analyst at a hedge fund: I'm looking to use R to build more robust, stable and dynamic econometric models.
  • 10. Why take classes? Why not learn on your own? • R is not like SAS, SPSS • SAS has two very distinct user types: • Programmer • Statistical modeler & analyst • SPSS the latter • R is powerful, but has more programming and “messiness,” even when used purely in analysis/modeling mode. • Often it is helpful to have an expert on hand while learning R • 4-week courses allow an iterative process – a short intensive learning period, ask lots of questions. Apply what you learn. Come back later for another 4-week class. Learn more. Apply. Repeat.
  • 11. Certificate Program Content PREREQUISITES: None for entry into program, but introductory statistics is a prerequisite for some courses. 6 ELECTIVES 6 REQUIRED R-Specific: Include R: •Intro to R – Data Handling •Intro to R – Statistical •Data mining •Probability Analysis •Spatial Distributions •Programming in R •Microarray •Resampling •Programming in R – Adv. •SVM •Bootstrap •Modeling in R •Clinical Trials Apps •Logistic •Graphics in R •ggplot2 Regression •Smoothing with P- •GLM splines •Count data •Survey Analysis
  • 12. 1. The principles of R programming: • Introduction to R – Data Handling (Paul Murrell) introduces basic expressions, symbols, assignment, functions, packages, use of code editors (emacs), workspace, data types & structures, subsetting, assessor functions, classes, type coercion, text files, binary files, large files, memory management, apply function, tabulate, aggregation, merging and splitting data, reshape, text processing. • Programming in R (2 courses with Hadley Wickham) covers lexical scoping, dynamic scoping, frames, environments, namespaces, active bindings, quoting, evaluation, calling from other functions, string processing (stringr), dates and times (lubridate), regular expressions, xml and xpath, extracting data with SQL, executing SQL in R, writing compact and efficient code (helper function, lapply), anonymous functions, first class functions, object oriented programming, S3, tips for producing reliable code, functions and options to help debug, speed, testing.
  • 13. 2. Plotting and visualizing data in R: Graphics in R (Paul Murrell, covers the core R capabilities for graphing, and teaches you to produce key statistical plots such as scatterplots, ) R ggplot2 (Hadley Wickham teaches how to use his package, which is a package with its own language that rests on R, to create graphs)
  • 14. 3. Application/method/domain specific: Other classes are application oriented, where syntax and programming are discussed, as necessary, on the path to getting R to accomplish something specific. Intro stats, statistical modeling, microarray analysis, data mining, survey analysis. In the most basic of these, Introduction to R – Statistical Analysis, some familiarity with statistical procedures is assumed and you learn R by executing these procedures (t-tests, chi- square, correlation, regression, etc.) in R. In other cases, the emphasis is on learning the method and R is simply the chosen tool. Let’s see an example from the Statistical Analysis course.
  • 15. Snapshot: Regression. The instructions are given step-by-step in Lesson 3 of “Introduction to R – Statistical Analysis.” The lm function will estimate the regression parameters for the simple linear regression model. For the two models specified above we have: > lm(total ~ w.class, data = d) Call: lm(formula = total ~ w.class, data = d) Coefficients: (Intercept) w.class 159.815 2.732 which gives estimates ˆb0 = 159.815 and ˆb1 = 2.732.
  • 16. KIM ASKS Hi John, I was plotting the residuals from a linear regression (example on page 19 of the lesson 3), and there was a delay before the plots would show. The message on the R console was "Waiting to confirm page change." By clicking on the graphics, I could switch from one plot to the next. Is there anyway to make them tile so I can see all of them at once, or any way to go back and forth once they've 'printed' on the graphics page? JOHN VERZANI REPLIES: A couple of possibilities exist: You can partition your graphics device so that more than one graphic will appear. For example, par(mfrow=c(2,2)) will set up a 2 by 2 grid, perfect for the plot function called on the output of the lm function. On some implementations you can record plots and scroll back through them. For windows users, the RGui application (your basic interface) allows you to turn on recording, I think by right clicking on a plot (if I'm wrong let me know, and I'll check). For RStudio, the graphs are already recorded. There are arrows to scroll. Hope one of those works for you. --J SABINA CHIMES IN Where do you type it? In the plot command? I have tried: > plot(res.pipeline, par(mfrow=c(2,2))) and get Error in plot.lm(res.pipeline, par(mfrow = c(2, 2))) : 'which' must be in 1:6 ....How do you keep track of all these different ways of doing things. I find that your comments are amazing... JOHN REPLIES The par settings are done in their own command (well some are). Try: par(mfrow=c(2,2)) plot(res.pipeline) The ".lm" extra bit isn't necessary (though doesn't hurt), as R will use the class of res.pipeline to find that function in most usual cases. Let me know if that doesn't help .
  • 17. ALTA ASKS John, what does masked in the following error message mean? and what is '.GlobeEnv'? thnx in advance >attach(kid.weights) The following object(S) are masked _by_ '.GlobeEnv': age JOHN REPLIES R looks for objects by traversing a series of nested environments. In this case, when you attach(kid.weights) it includes a variable 'age'. However you already have a variable 'age' in your global workspace (.GlobalEnv is the secret name for that). Which one do you want? Well, R is answering which one it will find. In this case the one in the global workspace, not that in kid.weights. For that one, you will need to work harder (using $ or with or ...) Does that help? gotcha! very helpful--thnx
  • 18. • Polling question #3 – how many analytics professionals
  • 19. How courses work Discussion Homework Forum Readings, notes, videos
  • 20. Weekly Course Schedule Most courses are 4 weeks. ~ March 2013 ~ Sun Mon Tue Wed Thu Fri Sat 1 2 Lesson 1 opens 3 4 5 6 7 8 9 Lesson 2 opens 10 11 12 13 14 15 16 Homework 1 due Feedback Homework 1 Lesson 3 opens 17 18 19 20 21 22 23 Homework 2 due Feedback Homework 2 Lesson 4 opens 24 25 26 27 28 29 30 Homework 3 due Feedback Homework 3 31 April 1 2 3 4 5 6 Homework 4 due Feedback Homework 4
  • 21. Time Required • Estimate 15 hours per week • Don’t need to be online at particular times or days • Time zone does not matter • Best not to leave all work until the end of the week • Materials remain open for a couple of weeks after end-of- course • Most students are working professionals, take courses one at a time
  • 22. Faculty Paul Murrell John Verzani Hadley Wickham Sudha Purohit Luis Torgo David Unwin Thomas Lumley Din Chen Karl Peace Garrett Brian Marx Paul Eilers Grolemund
  • 23. Typical Course Contents – R Programming • “Headquarters” Page • Lesson Page • Readings/notes/videos • Homework • Discussion Forum
  • 24. Typical Course Contents – R Programming • “Headquarters” Page • Lesson Page • Readings/notes/videos • Homework • Discussion Forum
  • 25. Typical Course Contents – R Programming • “Headquarters” Page • Lesson Page • Readings/notes/videos • Homework • Discussion Forum
  • 26. Typical Course Contents – R Programming • “Headquarters” Page • Lesson Page • Readings/notes/videos • Homework • Discussion Forum
  • 27. Typical Course Contents – R Programming • “Headquarters” Page • Lesson Page • Readings/notes/videos • Homework • Discussion Forum
  • 28. Equiv. to 18 credits, US system $5900 approx.
  • 29. Next Step. For certificate program application, contact sales@revolutionanalytics.com or call 1-855-GET-REVO (1-855-438-7386) • Application fee will be waived (until July 30th) • Up to 50% discount offered for Revolution Analytics software when purchased in combination with training