SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Downloaden Sie, um offline zu lesen
Training Session
on
Enterprise Guide and Enterprise
Miner:
A Peep into the world of SAS
By
Pratima
02-06-2008
2
Exploration of Analytical Possibilities in SAS
Process Flow
Techniques and Concepts
Contents
2
Getting Started
Practical Exercises
3
Exploration of Analytical possibilties in SAS
Process Flow
Getting Started
Contents
3
Credit Scoring Project – An Example of SAS Usage
Practical Observations
Enterprise Guide (EG) & Enterprise
Miner (Eminer)
4
EG is an interactive interface for data compilation,
transformation, analysis and presentation
Version used:4
Eminer is the advanced analytical interface for Data
Mining
Version used:5.1
4
EG offers:
Data Compilation
&
Transformation
• Datasets in different formats accepted: Excel, csv, txt
files, Microsoft Access
• Data size is no constraint (upto 9 lacs rows downloaded)
• Functions like APPEND, JOIN, SORT, RANDOM
SAMPLE, RANK etc
• Host of Statistical Techniques like
• Regressions-Linear, Non-Linear, Logistic
• Time-Series Forecasting
• Correlations, Principal Components, Factor
Analysis
• Final files to be used for Periodic Reporting like in SBR
• Graphs ,Data summary tables etc
• Final SAS Datasets to be used in EMiner
Data Analysis
Data Export and
Presentation
5
Data Mining
6
Dealing
with Big
Datasets
Extracting
“implicit”,
“potentially
important”
information
EMiner offers:
EMiner
Explore
Sampling
Modify Model
Assess
•Sampling
•Cluster analysis
•Variable Selection
•Imputation
•Variable
Transformation
•Decision tree
•Neural Networks
•Regressions
7
8
Exploration of Analytical Possibilities in SAS
Process Flow
Techniques and Concepts
Contents
8
Getting Started
Practical Exercises
Process Flow For SAS Applications
Database
Servers
8563,
8561
Excel,
CSV,
M.
Access
, txt
files
+
Enterprise Guide
• Data Compilation
• Data Analysis
• Presentation
•Excel files for
Reporting
Purposes
•Html files as
graphs, tables
etc
• SAS Data files to
be used for
advanced
analytics in SAS
EMiner
9
Enterprise Miner
Cluster Analysis
Decision trees
Regressions
Neural networks
Output
• Excel files for
Reporting
Purposes
• Html files as
graphs, tables
etc
10
Exploration of Analytical Possibilities in SAS
Process Flow
Techniques and Concepts
Contents
10
Getting Started
Practical Exercises
‘What’ &‘How’ of ANN
 Artificial Neural Networks
 “Non-linear” Statistical Data Modeling Tool
 Models complex relationships between inputs and
outputs
 Consists of interconnected group of ‘Artificial
Neurons’
 Uses ‘connectionist’ approach to computation –
Multi-layer Perceptron (MLP) most common
approach
Example:
Classification of Good and Bad Credit Risks based on most relevant variables
out of occupation, financials, Age, past Banking Record etc by training neural
network on historic data
‘What’ & ‘How’ of Regression
 Linear and Non-linear Regressions
 Consists of dependent variable, independent
variables, parameter and random error term
 Rely heavily on assumptions for probability
distribution of error term
 Used for modeling of causal relationships,
hypothesis testing and prediction (as of time-
series data)
 NLMs are logarithmic, exponential functions etc
Example:
Examining the relationship between the performance of a Channel partner
(dealer) with his market share, vintage, geo- distribution etc
E
‘What’ & ‘How’ of DT
 Decision Trees
 Predictive Model with ‘leaves’ and ‘branches’
 Leaves mean the cuts or classifications and
Branches mean the criteria for those cuts
 Maps observations into conclusions based on the
target value
Example:
A two level tree showing best performance in ACL & SAL for West Zone and
business profile services
E
‘What’ & ‘How’ of Cluster Analysis
 K- Means Clustering
 Partitioning of data into K clusters
 Data point assigned a cluster which has the
‘centre’ or ‘centroid’ nearest to it.
 “Iterative” refinement of centroids of a cluster
 Convergence when intra –cluster distance
minimized and inter cluster distance maximized
Example:
Dataset has two dimensions - churn and limit utilization in the first MoB. Then
if there are two clusters to start with…1st cluster has a centroid (mean of
vector points) of 5% and 70% lim utiz, 2nd has 10% and 85% and 3rd has 15%
and 99%. Then if new data-point is 17% churn and 95% lim utiz, then it will
most likely fall in Cluster 3. (Distance criteria can be selected)
BIU Concepts Un-coded
Loss forecasting
• Prediction of Delinquencies for a time period in future
• Use of roll rates and flow rates data
• Application of time series tools like ARIMA modeling
• Best results with greater number of data-points
• Analysis of long-term portfolio delinquency trends
• Grouping of data points based on the age in the
portfolio
• Tracking of bad rate over time for each vintage
• Estimation of losses over a period of time
• Statistical expression of the credit worthiness
• Use of client credit files
• Use of tools like logistic regression which give the
probability of default
15
Credit Scoring
Vintage Analysis
16
Exploration of Analytical Possibilities in SAS
Process Flow
Techniques and Concepts
Contents
16
Getting Started
Practical Exercises
EG Page View
Enterprise Guide
17
Basic Functions in SAS EG
18
Importing Data
Exporting final
datasets
Open command can be used also for opening SAS datasets and Projects or
where a change in format of variables not required.
Import of very big datasets can be done directly to the servers
18
Basic Functions in SAS EG
19
Creating a Query
Writing a query
19
Filter Query Page
Joining
datasets
Adding
Tables
Creating derived variables
Filter
data
Changing name
of output
Grouping
data
Join command needs to be executed with the option “select distinct rows only” or
be followed by “Sort” in Data segment of Main toolbar to avoid duplication of
entries
20
Most common Functionalities in EG
•Open
Project/Data/
Code
•Import Data
21
•Append
•Sort
•Random
•Sample
•Summary
Statistics
•Characterize
data
•Frequency
Tables
•Pie Charts
•Bar Chart
•Line Chart
•Anova
•Regression
Linear/Logisti
c
•Multivariate
Analysis
•Time Series
Analysis
File Data Describe Graph Analyze
22
Enterprise Miner
23
Getting Started in Eminer - Import Data
•Source: Server Eminer/FTPLIB, exported from 8561 Server of EG
•Data import is critical
•Column headings should not have special characters or >32 characters/ should not
start with numbers
•Creation of Diagrams
•Adjustment of the Role and the level of variables
E Miner- Modify Function
24
25
Exploration of Analytical Possibilities in SAS
Process Flow
Techniques and Concepts
Contents
25
Getting Started
Practical Exercises
Training Module: Credit Scoring For
Farm Equipments
Use of SAS EG & EMiner
Credit Scoring Project
 A 4-Step exercise
 Acquiring the data - Consolidation (includes addition of
variables), Rolling up, etc
 Knowing the data - Critical Step
 Segregating important variables
 Modelling
Data Acquisition
EG-
Append,
join,
Group
•Choice of Performance Indicator
•Choice of Independent Variables
•Cleaning of dataEG-
Random
Sample
Credit Scoring Project
-Knowing the Data
Knowing Data
EG –
characterise
data
EM-
Stat
Explore
EM-
Explore
•Removal of outliers (based on Summ. Stats and
Domain knowledge)
•Missing values
• Imputation - mean,mode, Percentage
wise Dist. For Categorical Variables
through“Impute” Function
• Full case analysis - Trade-off is Loss
of data
• Detection of Outliers & Errors
•Data issues and solutions (from MFI
Experience)
• Need for Oversampling - Adjustments
to be made later or use alternative
performance indicator
• Need to tackle Undercoverage -
Reject Data
CreditScoringProject
Multi
Plot
Varia
ble
Select
ion
• Why use all techniques?
• Important variables not left out - Need to create Derived variables
• Common variables from all techniques give validation to results
• Factor Analysis, Principal Components Analysis can be used to remove redundant
variables
• All techniques except Cluster Analysis require a ‘target’ variable.
• In cluster Analysis,
• Standardization of data is a must- -”Internal Standardization” option
• Technique more biased of Categorical variables
Segregating Imp Variables Cluster
Analysis
Decision
Tree
Var
selection
Practical Observations in EG & EMiner
 Running a Query and Running the whole Branch
 Refreshing files with the same name and location in the Project
 Refreshing the file with the same name but different location
 Refreshing the file with a different name and a different location
 Creating a code and linking it with adjacent files
 “E:/biu” location to be specified while making new Eminer project
 Data standardization required in Cluster Analysis (option present
in Eminer)





30


31
Thank You
31
ANNEXURE
32
Location of Projects and Data
33
8563 Server
•SAS Main:Files/BI_RA Folder
•SAS Main:Libraries/rmagtrg
8561 Server •SAS Main:Files/ftpdir/sasbilogs
33
ACL AND SAL SBR STREAM ON EG
34
ANN Model
35
Decision Tree Model

Weitere ähnliche Inhalte

Was ist angesagt?

Basics Of SAS Programming Language
Basics Of SAS Programming LanguageBasics Of SAS Programming Language
Basics Of SAS Programming Languageguest2160992
 
SAS Mainframe -Program-Tips
SAS Mainframe -Program-TipsSAS Mainframe -Program-Tips
SAS Mainframe -Program-TipsSrinimf-Slides
 
Sas short course_presentation_11-4-09
Sas short course_presentation_11-4-09Sas short course_presentation_11-4-09
Sas short course_presentation_11-4-09Prashant Ph
 
Data Verification In QA Department Final
Data Verification In QA Department FinalData Verification In QA Department Final
Data Verification In QA Department FinalWayne Yaddow
 
Understanding Tableau's Visual Pipeline
Understanding Tableau's Visual PipelineUnderstanding Tableau's Visual Pipeline
Understanding Tableau's Visual PipelineMrunal Shridhar
 
Introduction to-sas-1211594349119006-8
Introduction to-sas-1211594349119006-8Introduction to-sas-1211594349119006-8
Introduction to-sas-1211594349119006-8thotakoti
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseBugRaptors
 
Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process Omid Vahdaty
 
What is sap hana
What is sap hanaWhat is sap hana
What is sap hanai Mark
 
Introduction to Stata
Introduction to StataIntroduction to Stata
Introduction to Stataizahn
 
ETL Testing Training Presentation
ETL Testing Training PresentationETL Testing Training Presentation
ETL Testing Training PresentationApurba Biswas
 

Was ist angesagt? (19)

Basics Of SAS Programming Language
Basics Of SAS Programming LanguageBasics Of SAS Programming Language
Basics Of SAS Programming Language
 
Sas demo
Sas demoSas demo
Sas demo
 
SAS Mainframe -Program-Tips
SAS Mainframe -Program-TipsSAS Mainframe -Program-Tips
SAS Mainframe -Program-Tips
 
Sas training in hyderabad
Sas training in hyderabadSas training in hyderabad
Sas training in hyderabad
 
Sas short course_presentation_11-4-09
Sas short course_presentation_11-4-09Sas short course_presentation_11-4-09
Sas short course_presentation_11-4-09
 
SAS - Training
SAS - Training SAS - Training
SAS - Training
 
Data stage
Data stageData stage
Data stage
 
SAS Programming Notes
SAS Programming NotesSAS Programming Notes
SAS Programming Notes
 
Data Verification In QA Department Final
Data Verification In QA Department FinalData Verification In QA Department Final
Data Verification In QA Department Final
 
Understanding Tableau's Visual Pipeline
Understanding Tableau's Visual PipelineUnderstanding Tableau's Visual Pipeline
Understanding Tableau's Visual Pipeline
 
Abap dictionary 1
Abap dictionary 1Abap dictionary 1
Abap dictionary 1
 
Introduction to-sas-1211594349119006-8
Introduction to-sas-1211594349119006-8Introduction to-sas-1211594349119006-8
Introduction to-sas-1211594349119006-8
 
Stata tutorial
Stata tutorialStata tutorial
Stata tutorial
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data Wharehouse
 
Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process
 
What is sap hana
What is sap hanaWhat is sap hana
What is sap hana
 
Introduction to Stata
Introduction to StataIntroduction to Stata
Introduction to Stata
 
Etl testing
Etl testingEtl testing
Etl testing
 
ETL Testing Training Presentation
ETL Testing Training PresentationETL Testing Training Presentation
ETL Testing Training Presentation
 

Andere mochten auch

Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...
Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...
Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...Francesco Marelli
 
SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20
SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20
SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20Ayapparaj SKS
 
BAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 LectureBAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 LectureWake Tech BAS
 
Where Vs If Statement
Where Vs If StatementWhere Vs If Statement
Where Vs If StatementSunil Gupta
 
Learning SAS by Example -A Programmer’s Guide by Ron CodySolution
Learning SAS by Example -A Programmer’s Guide by Ron CodySolutionLearning SAS by Example -A Programmer’s Guide by Ron CodySolution
Learning SAS by Example -A Programmer’s Guide by Ron CodySolutionVibeesh CS
 
SAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleSAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleCloudera, Inc.
 
SAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data AnalyticsSAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data AnalyticsDeepak Ramanathan
 
Base SAS Full Sample Paper
Base SAS Full Sample Paper Base SAS Full Sample Paper
Base SAS Full Sample Paper Jimmy Rana
 
Base SAS Exam Questions
Base SAS Exam QuestionsBase SAS Exam Questions
Base SAS Exam Questionsguestc45097
 

Andere mochten auch (12)

Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...
Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...Administrative Reporting of SAS Visual Analytics 7.1  and Integration with  E...
Administrative Reporting of SAS Visual Analytics 7.1 and Integration with E...
 
SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20
SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20
SAS Ron Cody Solutions for even Number problems from Chapter 16 to 20
 
BAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 LectureBAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 Lecture
 
BAS 250 Lecture 2
BAS 250 Lecture 2BAS 250 Lecture 2
BAS 250 Lecture 2
 
Where Vs If Statement
Where Vs If StatementWhere Vs If Statement
Where Vs If Statement
 
Learning SAS by Example -A Programmer’s Guide by Ron CodySolution
Learning SAS by Example -A Programmer’s Guide by Ron CodySolutionLearning SAS by Example -A Programmer’s Guide by Ron CodySolution
Learning SAS by Example -A Programmer’s Guide by Ron CodySolution
 
SAS BASICS
SAS BASICSSAS BASICS
SAS BASICS
 
SAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at ScaleSAS and Cloudera – Analytics at Scale
SAS and Cloudera – Analytics at Scale
 
SAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data AnalyticsSAS Modernization architectures - Big Data Analytics
SAS Modernization architectures - Big Data Analytics
 
Base SAS Full Sample Paper
Base SAS Full Sample Paper Base SAS Full Sample Paper
Base SAS Full Sample Paper
 
Base SAS Exam Questions
Base SAS Exam QuestionsBase SAS Exam Questions
Base SAS Exam Questions
 
SAS basics Step by step learning
SAS basics Step by step learningSAS basics Step by step learning
SAS basics Step by step learning
 

Ähnlich wie SAS Training session - By Pratima

Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark MLAhmet Bulut
 
How to Analyze and Tune MySQL Queries for Better Performance
How to Analyze and Tune MySQL Queries for Better PerformanceHow to Analyze and Tune MySQL Queries for Better Performance
How to Analyze and Tune MySQL Queries for Better Performanceoysteing
 
Nandyal_20150513_1145_1230.original.1431384030
Nandyal_20150513_1145_1230.original.1431384030Nandyal_20150513_1145_1230.original.1431384030
Nandyal_20150513_1145_1230.original.1431384030Raghav Nandyal
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.pptBsMath3rdsem
 
Performance management capability
Performance management capabilityPerformance management capability
Performance management capabilitydesigner DATA
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)Syaifuddin Ismail
 
Renew power - ReLead Case Competition
Renew power - ReLead Case CompetitionRenew power - ReLead Case Competition
Renew power - ReLead Case CompetitionArush Sharma
 
Smart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend AnalysisSmart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend AnalysisIRJET Journal
 
Top 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfTop 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfShaikSikindar1
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentationNeerajNishad4
 
Application Middleware Overview
Application Middleware OverviewApplication Middleware Overview
Application Middleware OverviewChristalin Nelson
 

Ähnlich wie SAS Training session - By Pratima (20)

Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark ML
 
CV_Gangadhar 1
CV_Gangadhar 1CV_Gangadhar 1
CV_Gangadhar 1
 
Complete unit ii notes
Complete unit ii notesComplete unit ii notes
Complete unit ii notes
 
How to Analyze and Tune MySQL Queries for Better Performance
How to Analyze and Tune MySQL Queries for Better PerformanceHow to Analyze and Tune MySQL Queries for Better Performance
How to Analyze and Tune MySQL Queries for Better Performance
 
Nandyal_20150513_1145_1230.original.1431384030
Nandyal_20150513_1145_1230.original.1431384030Nandyal_20150513_1145_1230.original.1431384030
Nandyal_20150513_1145_1230.original.1431384030
 
3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt3._DWH_Architecture__Components.ppt
3._DWH_Architecture__Components.ppt
 
Data Science for Retail Broking
Data Science for Retail BrokingData Science for Retail Broking
Data Science for Retail Broking
 
Data Science for Retail Broking
Data Science for Retail BrokingData Science for Retail Broking
Data Science for Retail Broking
 
Performance management capability
Performance management capabilityPerformance management capability
Performance management capability
 
big-data-anallytics.pptx
big-data-anallytics.pptxbig-data-anallytics.pptx
big-data-anallytics.pptx
 
BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)
 
Internship Presentation.pdf
Internship Presentation.pdfInternship Presentation.pdf
Internship Presentation.pdf
 
Renew power - ReLead Case Competition
Renew power - ReLead Case CompetitionRenew power - ReLead Case Competition
Renew power - ReLead Case Competition
 
Smart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend AnalysisSmart E-Logistics for SCM Spend Analysis
Smart E-Logistics for SCM Spend Analysis
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
Business analysis
Business analysisBusiness analysis
Business analysis
 
Top 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfTop 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdf
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentation
 
Application Middleware Overview
Application Middleware OverviewApplication Middleware Overview
Application Middleware Overview
 
DNA: an overview
DNA: an overviewDNA: an overview
DNA: an overview
 

Mehr von Pratima Pandey

Master Plan for Waste Management for Swachh India.pptx
Master Plan for Waste Management for Swachh India.pptxMaster Plan for Waste Management for Swachh India.pptx
Master Plan for Waste Management for Swachh India.pptxPratima Pandey
 
Management of Plastic, Glass, Metal, and Hazardous Waste
Management of Plastic, Glass, Metal, and Hazardous Waste Management of Plastic, Glass, Metal, and Hazardous Waste
Management of Plastic, Glass, Metal, and Hazardous Waste Pratima Pandey
 
India Union Budget 2022 - Five Takeaways for Boosting Green Economy in India
India Union Budget 2022 - Five Takeaways for Boosting Green Economy in IndiaIndia Union Budget 2022 - Five Takeaways for Boosting Green Economy in India
India Union Budget 2022 - Five Takeaways for Boosting Green Economy in IndiaPratima Pandey
 
Presentation on Management of Sanitary Waste and E waste
Presentation on Management of Sanitary Waste and E waste Presentation on Management of Sanitary Waste and E waste
Presentation on Management of Sanitary Waste and E waste Pratima Pandey
 
Composting for Wet Waste Management_IBS Software Session
Composting for Wet Waste Management_IBS Software Session Composting for Wet Waste Management_IBS Software Session
Composting for Wet Waste Management_IBS Software Session Pratima Pandey
 
Magic of Rs: Presentation on Waste management for IBS Software
Magic of Rs: Presentation on Waste management for IBS Software Magic of Rs: Presentation on Waste management for IBS Software
Magic of Rs: Presentation on Waste management for IBS Software Pratima Pandey
 
Waste Management - What and Why
Waste Management - What and Why Waste Management - What and Why
Waste Management - What and Why Pratima Pandey
 
Electric vehicles in india in 2021_economic and feasibility analysis of ownin...
Electric vehicles in india in 2021_economic and feasibility analysis of ownin...Electric vehicles in india in 2021_economic and feasibility analysis of ownin...
Electric vehicles in india in 2021_economic and feasibility analysis of ownin...Pratima Pandey
 
Electric vehicles in india in 2021
Electric vehicles in india in 2021Electric vehicles in india in 2021
Electric vehicles in india in 2021Pratima Pandey
 
Master plan for Solid Waste Management in Mumbai, India
Master plan for Solid Waste Management in Mumbai, IndiaMaster plan for Solid Waste Management in Mumbai, India
Master plan for Solid Waste Management in Mumbai, IndiaPratima Pandey
 
We just need to know! Env awareness
We just need to know! Env awarenessWe just need to know! Env awareness
We just need to know! Env awarenessPratima Pandey
 
Plastics and global warming
Plastics and global warmingPlastics and global warming
Plastics and global warmingPratima Pandey
 
Reduce Plastics Use And Global Warming
Reduce Plastics Use And Global WarmingReduce Plastics Use And Global Warming
Reduce Plastics Use And Global WarmingPratima Pandey
 

Mehr von Pratima Pandey (15)

Master Plan for Waste Management for Swachh India.pptx
Master Plan for Waste Management for Swachh India.pptxMaster Plan for Waste Management for Swachh India.pptx
Master Plan for Waste Management for Swachh India.pptx
 
Management of Plastic, Glass, Metal, and Hazardous Waste
Management of Plastic, Glass, Metal, and Hazardous Waste Management of Plastic, Glass, Metal, and Hazardous Waste
Management of Plastic, Glass, Metal, and Hazardous Waste
 
India Union Budget 2022 - Five Takeaways for Boosting Green Economy in India
India Union Budget 2022 - Five Takeaways for Boosting Green Economy in IndiaIndia Union Budget 2022 - Five Takeaways for Boosting Green Economy in India
India Union Budget 2022 - Five Takeaways for Boosting Green Economy in India
 
Presentation on Management of Sanitary Waste and E waste
Presentation on Management of Sanitary Waste and E waste Presentation on Management of Sanitary Waste and E waste
Presentation on Management of Sanitary Waste and E waste
 
Composting for Wet Waste Management_IBS Software Session
Composting for Wet Waste Management_IBS Software Session Composting for Wet Waste Management_IBS Software Session
Composting for Wet Waste Management_IBS Software Session
 
Magic of Rs: Presentation on Waste management for IBS Software
Magic of Rs: Presentation on Waste management for IBS Software Magic of Rs: Presentation on Waste management for IBS Software
Magic of Rs: Presentation on Waste management for IBS Software
 
Waste Management - What and Why
Waste Management - What and Why Waste Management - What and Why
Waste Management - What and Why
 
Electric vehicles in india in 2021_economic and feasibility analysis of ownin...
Electric vehicles in india in 2021_economic and feasibility analysis of ownin...Electric vehicles in india in 2021_economic and feasibility analysis of ownin...
Electric vehicles in india in 2021_economic and feasibility analysis of ownin...
 
Electric vehicles in india in 2021
Electric vehicles in india in 2021Electric vehicles in india in 2021
Electric vehicles in india in 2021
 
Recycling plastics
Recycling plasticsRecycling plastics
Recycling plastics
 
Recycling plastics
Recycling plasticsRecycling plastics
Recycling plastics
 
Master plan for Solid Waste Management in Mumbai, India
Master plan for Solid Waste Management in Mumbai, IndiaMaster plan for Solid Waste Management in Mumbai, India
Master plan for Solid Waste Management in Mumbai, India
 
We just need to know! Env awareness
We just need to know! Env awarenessWe just need to know! Env awareness
We just need to know! Env awareness
 
Plastics and global warming
Plastics and global warmingPlastics and global warming
Plastics and global warming
 
Reduce Plastics Use And Global Warming
Reduce Plastics Use And Global WarmingReduce Plastics Use And Global Warming
Reduce Plastics Use And Global Warming
 

Kürzlich hochgeladen

Ch. 9- __Skin, hair and nail Assessment (1).pdf
Ch. 9- __Skin, hair and nail Assessment (1).pdfCh. 9- __Skin, hair and nail Assessment (1).pdf
Ch. 9- __Skin, hair and nail Assessment (1).pdfJamalYaseenJameelOde
 
原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证
原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证
原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证diploma001
 
AI ppt introduction , advandtage pros and cons.pptx
AI ppt introduction , advandtage pros and cons.pptxAI ppt introduction , advandtage pros and cons.pptx
AI ppt introduction , advandtage pros and cons.pptxdeepakkrlkr2002
 
办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一
办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一
办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一A SSS
 
定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一
 定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一 定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一
定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一Fs sss
 
办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改
办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改
办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改yuu sss
 
Jumark Morit Diezmo- Career portfolio- BPED 3A
Jumark Morit Diezmo- Career portfolio- BPED 3AJumark Morit Diezmo- Career portfolio- BPED 3A
Jumark Morit Diezmo- Career portfolio- BPED 3Ajumarkdiezmo1
 
ME 205- Chapter 6 - Pure Bending of Beams.pdf
ME 205- Chapter 6 - Pure Bending of Beams.pdfME 205- Chapter 6 - Pure Bending of Beams.pdf
ME 205- Chapter 6 - Pure Bending of Beams.pdfaae4149584
 
定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一
定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一
定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一z xss
 
Protection of Children in context of IHL and Counter Terrorism
Protection of Children in context of IHL and  Counter TerrorismProtection of Children in context of IHL and  Counter Terrorism
Protection of Children in context of IHL and Counter TerrorismNilendra Kumar
 
格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档
格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档
格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一
办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一
办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一A SSS
 
Introduction to Political Parties (1).ppt
Introduction to Political Parties (1).pptIntroduction to Political Parties (1).ppt
Introduction to Political Parties (1).pptSohamChavan9
 
Digital Marketing Training Institute in Mohali, India
Digital Marketing Training Institute in Mohali, IndiaDigital Marketing Training Institute in Mohali, India
Digital Marketing Training Institute in Mohali, IndiaDigital Discovery Institute
 
Nathan_Baughman_Resume_copywriter_and_editor
Nathan_Baughman_Resume_copywriter_and_editorNathan_Baughman_Resume_copywriter_and_editor
Nathan_Baughman_Resume_copywriter_and_editorNathanBaughman3
 
定制英国克兰菲尔德大学毕业证成绩单原版一比一
定制英国克兰菲尔德大学毕业证成绩单原版一比一定制英国克兰菲尔德大学毕业证成绩单原版一比一
定制英国克兰菲尔德大学毕业证成绩单原版一比一z zzz
 
Human Rights are notes and helping material
Human Rights are notes and helping materialHuman Rights are notes and helping material
Human Rights are notes and helping materialnadeemcollege26
 
Crack JAG. Guidance program for entry to JAG Dept. & SSB interview
Crack JAG. Guidance program for entry to JAG Dept. & SSB interviewCrack JAG. Guidance program for entry to JAG Dept. & SSB interview
Crack JAG. Guidance program for entry to JAG Dept. & SSB interviewNilendra Kumar
 
AICTE PPT slide of Engineering college kr pete
AICTE PPT slide of Engineering college kr peteAICTE PPT slide of Engineering college kr pete
AICTE PPT slide of Engineering college kr peteshivubhavv
 
Unlock Your Creative Potential: 7 Skills for Content Creator Evolution
Unlock Your Creative Potential: 7 Skills for Content Creator EvolutionUnlock Your Creative Potential: 7 Skills for Content Creator Evolution
Unlock Your Creative Potential: 7 Skills for Content Creator EvolutionRhazes Ghaisan
 

Kürzlich hochgeladen (20)

Ch. 9- __Skin, hair and nail Assessment (1).pdf
Ch. 9- __Skin, hair and nail Assessment (1).pdfCh. 9- __Skin, hair and nail Assessment (1).pdf
Ch. 9- __Skin, hair and nail Assessment (1).pdf
 
原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证
原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证
原版定制卡尔加里大学毕业证(UC毕业证)留信学历认证
 
AI ppt introduction , advandtage pros and cons.pptx
AI ppt introduction , advandtage pros and cons.pptxAI ppt introduction , advandtage pros and cons.pptx
AI ppt introduction , advandtage pros and cons.pptx
 
办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一
办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一
办理学位证(Massey证书)新西兰梅西大学毕业证成绩单原版一比一
 
定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一
 定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一 定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一
定制(UOIT学位证)加拿大安大略理工大学毕业证成绩单原版一比一
 
办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改
办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改
办澳洲詹姆斯库克大学毕业证成绩单pdf电子版制作修改
 
Jumark Morit Diezmo- Career portfolio- BPED 3A
Jumark Morit Diezmo- Career portfolio- BPED 3AJumark Morit Diezmo- Career portfolio- BPED 3A
Jumark Morit Diezmo- Career portfolio- BPED 3A
 
ME 205- Chapter 6 - Pure Bending of Beams.pdf
ME 205- Chapter 6 - Pure Bending of Beams.pdfME 205- Chapter 6 - Pure Bending of Beams.pdf
ME 205- Chapter 6 - Pure Bending of Beams.pdf
 
定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一
定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一
定制(SCU毕业证书)南十字星大学毕业证成绩单原版一比一
 
Protection of Children in context of IHL and Counter Terrorism
Protection of Children in context of IHL and  Counter TerrorismProtection of Children in context of IHL and  Counter Terrorism
Protection of Children in context of IHL and Counter Terrorism
 
格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档
格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档
格里菲斯大学毕业证(Griffith毕业证)#文凭成绩单#真实留信学历认证永久存档
 
办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一
办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一
办理学位证(UoM证书)北安普顿大学毕业证成绩单原版一比一
 
Introduction to Political Parties (1).ppt
Introduction to Political Parties (1).pptIntroduction to Political Parties (1).ppt
Introduction to Political Parties (1).ppt
 
Digital Marketing Training Institute in Mohali, India
Digital Marketing Training Institute in Mohali, IndiaDigital Marketing Training Institute in Mohali, India
Digital Marketing Training Institute in Mohali, India
 
Nathan_Baughman_Resume_copywriter_and_editor
Nathan_Baughman_Resume_copywriter_and_editorNathan_Baughman_Resume_copywriter_and_editor
Nathan_Baughman_Resume_copywriter_and_editor
 
定制英国克兰菲尔德大学毕业证成绩单原版一比一
定制英国克兰菲尔德大学毕业证成绩单原版一比一定制英国克兰菲尔德大学毕业证成绩单原版一比一
定制英国克兰菲尔德大学毕业证成绩单原版一比一
 
Human Rights are notes and helping material
Human Rights are notes and helping materialHuman Rights are notes and helping material
Human Rights are notes and helping material
 
Crack JAG. Guidance program for entry to JAG Dept. & SSB interview
Crack JAG. Guidance program for entry to JAG Dept. & SSB interviewCrack JAG. Guidance program for entry to JAG Dept. & SSB interview
Crack JAG. Guidance program for entry to JAG Dept. & SSB interview
 
AICTE PPT slide of Engineering college kr pete
AICTE PPT slide of Engineering college kr peteAICTE PPT slide of Engineering college kr pete
AICTE PPT slide of Engineering college kr pete
 
Unlock Your Creative Potential: 7 Skills for Content Creator Evolution
Unlock Your Creative Potential: 7 Skills for Content Creator EvolutionUnlock Your Creative Potential: 7 Skills for Content Creator Evolution
Unlock Your Creative Potential: 7 Skills for Content Creator Evolution
 

SAS Training session - By Pratima

  • 1. Training Session on Enterprise Guide and Enterprise Miner: A Peep into the world of SAS By Pratima 02-06-2008
  • 2. 2 Exploration of Analytical Possibilities in SAS Process Flow Techniques and Concepts Contents 2 Getting Started Practical Exercises
  • 3. 3 Exploration of Analytical possibilties in SAS Process Flow Getting Started Contents 3 Credit Scoring Project – An Example of SAS Usage Practical Observations
  • 4. Enterprise Guide (EG) & Enterprise Miner (Eminer) 4 EG is an interactive interface for data compilation, transformation, analysis and presentation Version used:4 Eminer is the advanced analytical interface for Data Mining Version used:5.1 4
  • 5. EG offers: Data Compilation & Transformation • Datasets in different formats accepted: Excel, csv, txt files, Microsoft Access • Data size is no constraint (upto 9 lacs rows downloaded) • Functions like APPEND, JOIN, SORT, RANDOM SAMPLE, RANK etc • Host of Statistical Techniques like • Regressions-Linear, Non-Linear, Logistic • Time-Series Forecasting • Correlations, Principal Components, Factor Analysis • Final files to be used for Periodic Reporting like in SBR • Graphs ,Data summary tables etc • Final SAS Datasets to be used in EMiner Data Analysis Data Export and Presentation 5
  • 7. EMiner offers: EMiner Explore Sampling Modify Model Assess •Sampling •Cluster analysis •Variable Selection •Imputation •Variable Transformation •Decision tree •Neural Networks •Regressions 7
  • 8. 8 Exploration of Analytical Possibilities in SAS Process Flow Techniques and Concepts Contents 8 Getting Started Practical Exercises
  • 9. Process Flow For SAS Applications Database Servers 8563, 8561 Excel, CSV, M. Access , txt files + Enterprise Guide • Data Compilation • Data Analysis • Presentation •Excel files for Reporting Purposes •Html files as graphs, tables etc • SAS Data files to be used for advanced analytics in SAS EMiner 9 Enterprise Miner Cluster Analysis Decision trees Regressions Neural networks Output • Excel files for Reporting Purposes • Html files as graphs, tables etc
  • 10. 10 Exploration of Analytical Possibilities in SAS Process Flow Techniques and Concepts Contents 10 Getting Started Practical Exercises
  • 11. ‘What’ &‘How’ of ANN  Artificial Neural Networks  “Non-linear” Statistical Data Modeling Tool  Models complex relationships between inputs and outputs  Consists of interconnected group of ‘Artificial Neurons’  Uses ‘connectionist’ approach to computation – Multi-layer Perceptron (MLP) most common approach Example: Classification of Good and Bad Credit Risks based on most relevant variables out of occupation, financials, Age, past Banking Record etc by training neural network on historic data
  • 12. ‘What’ & ‘How’ of Regression  Linear and Non-linear Regressions  Consists of dependent variable, independent variables, parameter and random error term  Rely heavily on assumptions for probability distribution of error term  Used for modeling of causal relationships, hypothesis testing and prediction (as of time- series data)  NLMs are logarithmic, exponential functions etc Example: Examining the relationship between the performance of a Channel partner (dealer) with his market share, vintage, geo- distribution etc E
  • 13. ‘What’ & ‘How’ of DT  Decision Trees  Predictive Model with ‘leaves’ and ‘branches’  Leaves mean the cuts or classifications and Branches mean the criteria for those cuts  Maps observations into conclusions based on the target value Example: A two level tree showing best performance in ACL & SAL for West Zone and business profile services E
  • 14. ‘What’ & ‘How’ of Cluster Analysis  K- Means Clustering  Partitioning of data into K clusters  Data point assigned a cluster which has the ‘centre’ or ‘centroid’ nearest to it.  “Iterative” refinement of centroids of a cluster  Convergence when intra –cluster distance minimized and inter cluster distance maximized Example: Dataset has two dimensions - churn and limit utilization in the first MoB. Then if there are two clusters to start with…1st cluster has a centroid (mean of vector points) of 5% and 70% lim utiz, 2nd has 10% and 85% and 3rd has 15% and 99%. Then if new data-point is 17% churn and 95% lim utiz, then it will most likely fall in Cluster 3. (Distance criteria can be selected)
  • 15. BIU Concepts Un-coded Loss forecasting • Prediction of Delinquencies for a time period in future • Use of roll rates and flow rates data • Application of time series tools like ARIMA modeling • Best results with greater number of data-points • Analysis of long-term portfolio delinquency trends • Grouping of data points based on the age in the portfolio • Tracking of bad rate over time for each vintage • Estimation of losses over a period of time • Statistical expression of the credit worthiness • Use of client credit files • Use of tools like logistic regression which give the probability of default 15 Credit Scoring Vintage Analysis
  • 16. 16 Exploration of Analytical Possibilities in SAS Process Flow Techniques and Concepts Contents 16 Getting Started Practical Exercises
  • 18. Basic Functions in SAS EG 18 Importing Data Exporting final datasets Open command can be used also for opening SAS datasets and Projects or where a change in format of variables not required. Import of very big datasets can be done directly to the servers 18
  • 19. Basic Functions in SAS EG 19 Creating a Query Writing a query 19
  • 20. Filter Query Page Joining datasets Adding Tables Creating derived variables Filter data Changing name of output Grouping data Join command needs to be executed with the option “select distinct rows only” or be followed by “Sort” in Data segment of Main toolbar to avoid duplication of entries 20
  • 21. Most common Functionalities in EG •Open Project/Data/ Code •Import Data 21 •Append •Sort •Random •Sample •Summary Statistics •Characterize data •Frequency Tables •Pie Charts •Bar Chart •Line Chart •Anova •Regression Linear/Logisti c •Multivariate Analysis •Time Series Analysis File Data Describe Graph Analyze
  • 23. 23 Getting Started in Eminer - Import Data •Source: Server Eminer/FTPLIB, exported from 8561 Server of EG •Data import is critical •Column headings should not have special characters or >32 characters/ should not start with numbers •Creation of Diagrams •Adjustment of the Role and the level of variables
  • 24. E Miner- Modify Function 24
  • 25. 25 Exploration of Analytical Possibilities in SAS Process Flow Techniques and Concepts Contents 25 Getting Started Practical Exercises
  • 26. Training Module: Credit Scoring For Farm Equipments Use of SAS EG & EMiner
  • 27. Credit Scoring Project  A 4-Step exercise  Acquiring the data - Consolidation (includes addition of variables), Rolling up, etc  Knowing the data - Critical Step  Segregating important variables  Modelling Data Acquisition EG- Append, join, Group •Choice of Performance Indicator •Choice of Independent Variables •Cleaning of dataEG- Random Sample
  • 28. Credit Scoring Project -Knowing the Data Knowing Data EG – characterise data EM- Stat Explore EM- Explore •Removal of outliers (based on Summ. Stats and Domain knowledge) •Missing values • Imputation - mean,mode, Percentage wise Dist. For Categorical Variables through“Impute” Function • Full case analysis - Trade-off is Loss of data • Detection of Outliers & Errors •Data issues and solutions (from MFI Experience) • Need for Oversampling - Adjustments to be made later or use alternative performance indicator • Need to tackle Undercoverage - Reject Data
  • 29. CreditScoringProject Multi Plot Varia ble Select ion • Why use all techniques? • Important variables not left out - Need to create Derived variables • Common variables from all techniques give validation to results • Factor Analysis, Principal Components Analysis can be used to remove redundant variables • All techniques except Cluster Analysis require a ‘target’ variable. • In cluster Analysis, • Standardization of data is a must- -”Internal Standardization” option • Technique more biased of Categorical variables Segregating Imp Variables Cluster Analysis Decision Tree Var selection
  • 30. Practical Observations in EG & EMiner  Running a Query and Running the whole Branch  Refreshing files with the same name and location in the Project  Refreshing the file with the same name but different location  Refreshing the file with a different name and a different location  Creating a code and linking it with adjacent files  “E:/biu” location to be specified while making new Eminer project  Data standardization required in Cluster Analysis (option present in Eminer)      30  
  • 33. Location of Projects and Data 33 8563 Server •SAS Main:Files/BI_RA Folder •SAS Main:Libraries/rmagtrg 8561 Server •SAS Main:Files/ftpdir/sasbilogs 33
  • 34. ACL AND SAL SBR STREAM ON EG 34