SlideShare ist ein Scribd-Unternehmen logo
1 von 15
 R is a language and environment for statistical
computing and graphics
 R provides a wide variety of statistical and
graphical techniques, including linear and
nonlinear modeling, classical statistical tests,
time-series analysis, classification, clustering, and
others.
 R can be considered as a different implementation
of S.
 It compiles and runs on a wide variety of platforms
such as UNIX,Windows and Mac OS.
 An effective data handling and storage facility
 A suite of operators for calculations on arrays
and matrices
 A large, coherent, integrated collection of tools
for data analysis
 Graphical facilities for data analysis and
display either on-screen or on hardcopy
 A well-developed, simple and effective
programming language which includes
conditionals, loops, user-defined recursive
functions and input and output facilities.
 R provides a comprehensive set of statistical
analysis techniques
• Classical statistical tests
• Linear and nonlinear modeling
• Time-series analysis
• Classification & cluster analysis
• Spatial statistics
• Basically any statistical technique you can think of is
part of a contributed package to R
Why Principal Component Analysis used?
 Data Dimension Reduction Technique.
 Principal Component Analysis (PCA) is a powerful tool
during the Analysis, when the data have ‘n’ variables. PCA
finds the combination of each and every variable without
losing the original data.
 PCA are formed some as linear combinations of the data
which is used to preserve the information
 Principal Component Analysis - the extraction of hidden
predictive information from large database organizations,
can identify valuable customers, predict future behaviors,
and enable firms to make proactive, knowledge-driven
decisions.
 There are four students application
 Graduate Admission Office wants to select two graduate students
Who should be selected ?
STUDENT GPA GRE
PROFESSOR
RATING
1. 3.2 1270 38
2. 3.9 1600 42
3. 2.9 1500 22
4. 3.0 1400 32
There are five steps by PCA using R-STATISTICS to select two
best graduate students from rest of the other in the given table.
 Implementing data in R-statistics.
 Calculate the correlation matrix.
 Calculate the eigenvectors and eigen values of the correlation
matrix
 Choose the number of principal components to be retained
 Derive the new data set.
R CODE
> Gpa <- c(3.2,3.9,2.9,3.0)
> Gre <- c(1270,1600,1500,1400)
> Professorrating <- c(38,42,22,32)
> Student <- data. frame(Gpa,Gre,Professorrating)
> Student
Gpa Gre Professorrating
1 3.2 1270 38
2 3.9 1600 42
3 2.9 1500 22
4 3.0 1400 32
>data= cor(Student)
> stud
Gpa Gre Prof.rat
Gpa 1.0000000 0.531991767 0.824316301
Gre 0.5319918 1.000000000 0.009509527
Prof.rat 0.8243163 -0.009509527 1.000000000
 It is used to find the linear relationship between two random
variables
> eigen(stud)
$values
[1] 1.97676210 1.00866512 0.01457279
$vectors
[,1] [,2] [,3]
[1,] 0.7086607 -0.003993348 0.7055382
[2,] 0.3801843 -0.840227900 -0.3866225
[3,] 0.5943568 0.542218710 -0.5939183
>barplot(eigen(stud)$vectors)
pc1
pc2
pc3
>pc1=0.7086607*Gpa+0.3801843*Gre+0.5943568*Professorrating
> pc2=0.003993348*Gpa0.840227900*Gre+0.542218710*Professorrating
> pc3= 0.7055382*Gpa- 0.3866225*Gre - 0.5939183*Professorrating
Student 2 and 3 will be selected if first component (pc1) is used for
calculating the score.
STUDENT GPA GRE
PROFESSOR
RATING SCORE
1. 3.2 1270 38 507.6873
2. 3.9 1600 42 636.0216
3. 2.9 1500 22 585.4074
4. 3.0 1400 32 553.4034
 PCA is limited to re-expressing the data
as a linear combination of its basis
vectors.
• PCA is a non-parametric method –
independent of user and can’t be
configured for specific inputs.
• Principal components are orthogonal.
• Mean and variance are sufficient
Pricipal Component Analysis Using R

Weitere ähnliche Inhalte

Was ist angesagt?

Statistics for data scientists
Statistics for  data scientistsStatistics for  data scientists
Statistics for data scientistsAjay Ohri
 
Exploratory data analysis in R - Data Science Club
Exploratory data analysis in R - Data Science ClubExploratory data analysis in R - Data Science Club
Exploratory data analysis in R - Data Science ClubMartin Bago
 
Linear regression in machine learning
Linear regression in machine learningLinear regression in machine learning
Linear regression in machine learningShajun Nisha
 
Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods Marina Santini
 
Density Based Clustering
Density Based ClusteringDensity Based Clustering
Density Based ClusteringSSA KPI
 
Data warehouse 22 concept hierarchies in data modeling
Data warehouse 22 concept hierarchies in data modelingData warehouse 22 concept hierarchies in data modeling
Data warehouse 22 concept hierarchies in data modelingVaibhav Khanna
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olapSalah Amean
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression TreesHemant Chetwani
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionsaba khan
 
Text Analytics Presentation
Text Analytics PresentationText Analytics Presentation
Text Analytics PresentationSkylar Ritchie
 
Slide3.ppt
Slide3.pptSlide3.ppt
Slide3.pptbutest
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSivagowry Shathesh
 
Introduction to R Graphics with ggplot2
Introduction to R Graphics with ggplot2Introduction to R Graphics with ggplot2
Introduction to R Graphics with ggplot2izahn
 
support vector machine and associative classification
support vector machine and associative classificationsupport vector machine and associative classification
support vector machine and associative classificationrajshreemuthiah
 

Was ist angesagt? (20)

Statistics for data scientists
Statistics for  data scientistsStatistics for  data scientists
Statistics for data scientists
 
Exploratory data analysis in R - Data Science Club
Exploratory data analysis in R - Data Science ClubExploratory data analysis in R - Data Science Club
Exploratory data analysis in R - Data Science Club
 
Clustering
ClusteringClustering
Clustering
 
Linear regression in machine learning
Linear regression in machine learningLinear regression in machine learning
Linear regression in machine learning
 
Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods Lecture 6: Ensemble Methods
Lecture 6: Ensemble Methods
 
Density Based Clustering
Density Based ClusteringDensity Based Clustering
Density Based Clustering
 
Data warehouse 22 concept hierarchies in data modeling
Data warehouse 22 concept hierarchies in data modelingData warehouse 22 concept hierarchies in data modeling
Data warehouse 22 concept hierarchies in data modeling
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
03 preprocessing
03 preprocessing03 preprocessing
03 preprocessing
 
Presentation on K-Means Clustering
Presentation on K-Means ClusteringPresentation on K-Means Clustering
Presentation on K-Means Clustering
 
Text Analytics Presentation
Text Analytics PresentationText Analytics Presentation
Text Analytics Presentation
 
Slide3.ppt
Slide3.pptSlide3.ppt
Slide3.ppt
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 
Hierachical clustering
Hierachical clusteringHierachical clustering
Hierachical clustering
 
Introduction to R Graphics with ggplot2
Introduction to R Graphics with ggplot2Introduction to R Graphics with ggplot2
Introduction to R Graphics with ggplot2
 
support vector machine and associative classification
support vector machine and associative classificationsupport vector machine and associative classification
support vector machine and associative classification
 
Clique
Clique Clique
Clique
 
Sampling Distributions and Estimators
Sampling Distributions and Estimators Sampling Distributions and Estimators
Sampling Distributions and Estimators
 

Andere mochten auch

A Prolog To Java Translator System And Its Application
A Prolog To Java Translator System And Its ApplicationA Prolog To Java Translator System And Its Application
A Prolog To Java Translator System And Its Applicationguestc68147
 
Key jobskills value proposition flyer
Key jobskills value proposition flyerKey jobskills value proposition flyer
Key jobskills value proposition flyerKiran k
 
R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16
R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16
R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16Fatma ÇINAR
 
Surface Area of Triangular Prism 2
Surface Area of Triangular Prism 2Surface Area of Triangular Prism 2
Surface Area of Triangular Prism 2Moonie Kim
 
India Retail Reforms: A saga of uncertainities
India Retail Reforms: A saga of uncertainitiesIndia Retail Reforms: A saga of uncertainities
India Retail Reforms: A saga of uncertainitiesAklanta Kalita
 
Handling Data boxplots
Handling Data boxplotsHandling Data boxplots
Handling Data boxplotsSteve Bishop
 
FDI in Multi-brand Retail (Issues and Challenges)
FDI in Multi-brand Retail (Issues and Challenges)FDI in Multi-brand Retail (Issues and Challenges)
FDI in Multi-brand Retail (Issues and Challenges)Abee Sharma
 
Foreign Direct Investment In India
Foreign Direct Investment In IndiaForeign Direct Investment In India
Foreign Direct Investment In IndiaApurv Gourav
 

Andere mochten auch (10)

A Prolog To Java Translator System And Its Application
A Prolog To Java Translator System And Its ApplicationA Prolog To Java Translator System And Its Application
A Prolog To Java Translator System And Its Application
 
Key jobskills value proposition flyer
Key jobskills value proposition flyerKey jobskills value proposition flyer
Key jobskills value proposition flyer
 
Fdi
FdiFdi
Fdi
 
R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16
R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16
R SOFTWARE #GGPLOT2 #BOXPLOT DATASET APLICATIONS YAEM'16
 
Surface Area of Triangular Prism 2
Surface Area of Triangular Prism 2Surface Area of Triangular Prism 2
Surface Area of Triangular Prism 2
 
India Retail Reforms: A saga of uncertainities
India Retail Reforms: A saga of uncertainitiesIndia Retail Reforms: A saga of uncertainities
India Retail Reforms: A saga of uncertainities
 
Handling Data boxplots
Handling Data boxplotsHandling Data boxplots
Handling Data boxplots
 
FDI in Multi-brand Retail (Issues and Challenges)
FDI in Multi-brand Retail (Issues and Challenges)FDI in Multi-brand Retail (Issues and Challenges)
FDI in Multi-brand Retail (Issues and Challenges)
 
Foreign Direct Investment In India
Foreign Direct Investment In IndiaForeign Direct Investment In India
Foreign Direct Investment In India
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 

Ähnlich wie Pricipal Component Analysis Using R

Mml micro project Building a Basic statistic calculator using r programming ...
Mml micro project Building a Basic statistic calculator  using r programming ...Mml micro project Building a Basic statistic calculator  using r programming ...
Mml micro project Building a Basic statistic calculator using r programming ...SakshamDandnaik
 
Statistical Analysis and Data Analysis using R Programming Language: Efficien...
Statistical Analysis and Data Analysis using R Programming Language: Efficien...Statistical Analysis and Data Analysis using R Programming Language: Efficien...
Statistical Analysis and Data Analysis using R Programming Language: Efficien...BRNSSPublicationHubI
 
microsoft r server for distributed computing
microsoft r server for distributed computingmicrosoft r server for distributed computing
microsoft r server for distributed computingBAINIDA
 
Parallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingParallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingIRJET Journal
 
Sw metrics for regression testing
Sw metrics for regression testingSw metrics for regression testing
Sw metrics for regression testingJyotsna Sharma
 
R Programming - part 1.pdf
R Programming - part 1.pdfR Programming - part 1.pdf
R Programming - part 1.pdfRohanBorgalli
 
A Firefly based improved clustering algorithm
A Firefly based improved clustering algorithmA Firefly based improved clustering algorithm
A Firefly based improved clustering algorithmIRJET Journal
 
statistical computation using R- report
statistical computation using R- reportstatistical computation using R- report
statistical computation using R- reportKamarudheen KV
 
Proceedings of the 2015 Industrial and Systems Engineering Res.docx
Proceedings of the 2015 Industrial and Systems Engineering Res.docxProceedings of the 2015 Industrial and Systems Engineering Res.docx
Proceedings of the 2015 Industrial and Systems Engineering Res.docxwkyra78
 
IRJET- Matrix Multiplication using Strassen’s Method
IRJET-  	  Matrix Multiplication using Strassen’s MethodIRJET-  	  Matrix Multiplication using Strassen’s Method
IRJET- Matrix Multiplication using Strassen’s MethodIRJET Journal
 
Machine Learning, K-means Algorithm Implementation with R
Machine Learning, K-means Algorithm Implementation with RMachine Learning, K-means Algorithm Implementation with R
Machine Learning, K-means Algorithm Implementation with RIRJET Journal
 
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...IRJET Journal
 
METODOLOGIA DEA EN STATA
METODOLOGIA DEA EN STATAMETODOLOGIA DEA EN STATA
METODOLOGIA DEA EN STATALuhSm
 
R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...
R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...
R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...Edureka!
 
Six sigma tools an overview
Six sigma tools  an overviewSix sigma tools  an overview
Six sigma tools an overviewKomal Kamble
 
AIRLINE FARE PRICE PREDICTION
AIRLINE FARE PRICE PREDICTIONAIRLINE FARE PRICE PREDICTION
AIRLINE FARE PRICE PREDICTIONIRJET Journal
 
STARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis driftSTARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis driftPark JunPyo
 

Ähnlich wie Pricipal Component Analysis Using R (20)

Mml micro project Building a Basic statistic calculator using r programming ...
Mml micro project Building a Basic statistic calculator  using r programming ...Mml micro project Building a Basic statistic calculator  using r programming ...
Mml micro project Building a Basic statistic calculator using r programming ...
 
Statistical Analysis and Data Analysis using R Programming Language: Efficien...
Statistical Analysis and Data Analysis using R Programming Language: Efficien...Statistical Analysis and Data Analysis using R Programming Language: Efficien...
Statistical Analysis and Data Analysis using R Programming Language: Efficien...
 
microsoft r server for distributed computing
microsoft r server for distributed computingmicrosoft r server for distributed computing
microsoft r server for distributed computing
 
Se notes
Se notesSe notes
Se notes
 
Parallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingParallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive Indexing
 
Sw metrics for regression testing
Sw metrics for regression testingSw metrics for regression testing
Sw metrics for regression testing
 
R Programming - part 1.pdf
R Programming - part 1.pdfR Programming - part 1.pdf
R Programming - part 1.pdf
 
A Firefly based improved clustering algorithm
A Firefly based improved clustering algorithmA Firefly based improved clustering algorithm
A Firefly based improved clustering algorithm
 
statistical computation using R- report
statistical computation using R- reportstatistical computation using R- report
statistical computation using R- report
 
Proceedings of the 2015 Industrial and Systems Engineering Res.docx
Proceedings of the 2015 Industrial and Systems Engineering Res.docxProceedings of the 2015 Industrial and Systems Engineering Res.docx
Proceedings of the 2015 Industrial and Systems Engineering Res.docx
 
IRJET- Matrix Multiplication using Strassen’s Method
IRJET-  	  Matrix Multiplication using Strassen’s MethodIRJET-  	  Matrix Multiplication using Strassen’s Method
IRJET- Matrix Multiplication using Strassen’s Method
 
Machine Learning, K-means Algorithm Implementation with R
Machine Learning, K-means Algorithm Implementation with RMachine Learning, K-means Algorithm Implementation with R
Machine Learning, K-means Algorithm Implementation with R
 
Sorting_project_2.pdf
Sorting_project_2.pdfSorting_project_2.pdf
Sorting_project_2.pdf
 
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
 
METODOLOGIA DEA EN STATA
METODOLOGIA DEA EN STATAMETODOLOGIA DEA EN STATA
METODOLOGIA DEA EN STATA
 
Lecture_R.ppt
Lecture_R.pptLecture_R.ppt
Lecture_R.ppt
 
R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...
R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...
R Tutorial For Beginners | R Programming Tutorial l R Language For Beginners ...
 
Six sigma tools an overview
Six sigma tools  an overviewSix sigma tools  an overview
Six sigma tools an overview
 
AIRLINE FARE PRICE PREDICTION
AIRLINE FARE PRICE PREDICTIONAIRLINE FARE PRICE PREDICTION
AIRLINE FARE PRICE PREDICTION
 
STARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis driftSTARBUCKS Site Selection Analysis drift
STARBUCKS Site Selection Analysis drift
 

Kürzlich hochgeladen

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 

Kürzlich hochgeladen (20)

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 

Pricipal Component Analysis Using R

  • 1.
  • 2.  R is a language and environment for statistical computing and graphics  R provides a wide variety of statistical and graphical techniques, including linear and nonlinear modeling, classical statistical tests, time-series analysis, classification, clustering, and others.  R can be considered as a different implementation of S.  It compiles and runs on a wide variety of platforms such as UNIX,Windows and Mac OS.
  • 3.  An effective data handling and storage facility  A suite of operators for calculations on arrays and matrices  A large, coherent, integrated collection of tools for data analysis  Graphical facilities for data analysis and display either on-screen or on hardcopy  A well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.
  • 4.  R provides a comprehensive set of statistical analysis techniques • Classical statistical tests • Linear and nonlinear modeling • Time-series analysis • Classification & cluster analysis • Spatial statistics • Basically any statistical technique you can think of is part of a contributed package to R
  • 5. Why Principal Component Analysis used?  Data Dimension Reduction Technique.  Principal Component Analysis (PCA) is a powerful tool during the Analysis, when the data have ‘n’ variables. PCA finds the combination of each and every variable without losing the original data.  PCA are formed some as linear combinations of the data which is used to preserve the information  Principal Component Analysis - the extraction of hidden predictive information from large database organizations, can identify valuable customers, predict future behaviors, and enable firms to make proactive, knowledge-driven decisions.
  • 6.  There are four students application  Graduate Admission Office wants to select two graduate students Who should be selected ? STUDENT GPA GRE PROFESSOR RATING 1. 3.2 1270 38 2. 3.9 1600 42 3. 2.9 1500 22 4. 3.0 1400 32
  • 7. There are five steps by PCA using R-STATISTICS to select two best graduate students from rest of the other in the given table.  Implementing data in R-statistics.  Calculate the correlation matrix.  Calculate the eigenvectors and eigen values of the correlation matrix  Choose the number of principal components to be retained  Derive the new data set.
  • 8. R CODE > Gpa <- c(3.2,3.9,2.9,3.0) > Gre <- c(1270,1600,1500,1400) > Professorrating <- c(38,42,22,32) > Student <- data. frame(Gpa,Gre,Professorrating) > Student Gpa Gre Professorrating 1 3.2 1270 38 2 3.9 1600 42 3 2.9 1500 22 4 3.0 1400 32
  • 9. >data= cor(Student) > stud Gpa Gre Prof.rat Gpa 1.0000000 0.531991767 0.824316301 Gre 0.5319918 1.000000000 0.009509527 Prof.rat 0.8243163 -0.009509527 1.000000000  It is used to find the linear relationship between two random variables
  • 10. > eigen(stud) $values [1] 1.97676210 1.00866512 0.01457279 $vectors [,1] [,2] [,3] [1,] 0.7086607 -0.003993348 0.7055382 [2,] 0.3801843 -0.840227900 -0.3866225 [3,] 0.5943568 0.542218710 -0.5939183 >barplot(eigen(stud)$vectors)
  • 13. Student 2 and 3 will be selected if first component (pc1) is used for calculating the score. STUDENT GPA GRE PROFESSOR RATING SCORE 1. 3.2 1270 38 507.6873 2. 3.9 1600 42 636.0216 3. 2.9 1500 22 585.4074 4. 3.0 1400 32 553.4034
  • 14.  PCA is limited to re-expressing the data as a linear combination of its basis vectors. • PCA is a non-parametric method – independent of user and can’t be configured for specific inputs. • Principal components are orthogonal. • Mean and variance are sufficient