High Dimensional Data Visualization
Presented by Fabian Keller
Seminar: Large Scale Visualization
Advisor: Steffen Koch
University of Stuttgart, Summer Term 2015
Motivation
What do you see?
16.07.2015 Fabian Keller 2
Motivation
I can see…
Agenda
• Introduction
• Dimension Reduction Techniques
PCA / LLE / ISOMAP / t-SNE
• Visualization Techniques
Scatterplots / Parallel Coordinate Plots / Glyphs
• Conclusion
Goal
Of dimensionality reduction
• High Dimensional Data (>>1000 dimensions)
• Reduce Dimensions (for Clustering / Learning / …)
• Extract Meaning
• Visualize and Interact
[cf. Card et al. 1999; dos Santos and Brodlie 2004]
Intrinsic Dimensionality
How many dimensions can we reduce?
2D → 1D, 3D → 1D
→ Intrinsic Dimensionality: 1
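As a rough illustration of the idea on this slide, the sketch below places points on a straight line in 3D and checks that a single direction carries all of the variance. The data, the helper names and the shortcut of supplying the line direction by hand are assumptions made for this example; a real analysis would have to estimate the direction from the data.

```python
# Hypothetical example data: 3D points that all lie on one straight line.
points = [(t, 2.0 * t, -t) for t in range(-5, 6)]

def centered(data):
    """Subtract the per-dimension mean from every point."""
    n, dims = len(data), len(data[0])
    means = [sum(p[d] for p in data) / n for d in range(dims)]
    return [[p[d] - means[d] for d in range(dims)] for p in data]

def variance_along(data, direction):
    """Variance of centered data projected onto a unit direction."""
    projections = [sum(x * u for x, u in zip(p, direction)) for p in data]
    return sum(v * v for v in projections) / len(data)

def total_variance(data):
    """Sum of the per-dimension variances of centered data."""
    return sum(x * x for p in data for x in p) / len(data)

X = centered(points)

# Unit vector along the line (known here because we built the data ourselves).
norm = (1.0 + 4.0 + 1.0) ** 0.5
line_direction = [1.0 / norm, 2.0 / norm, -1.0 / norm]

# Share of the total variance captured by that single direction.
explained = variance_along(X, line_direction) / total_variance(X)
print(f"variance explained by one direction: {explained:.3f}")
```

If one direction explains essentially all of the variance, the three coordinates can be replaced by one without losing information, which is what an intrinsic dimensionality of 1 means for linear structure.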
Dimension Reduction
What techniques are there?
DR Techniques
• Linear
Principal Component Analysis
• Non-Linear, Local
Locally Linear Embedding
• Non-Linear, Global
ISOMAP / t-SNE
Principal Component Analysis (PCA)
Eigen-*
• Linear, Global
• Find “Principal Components”
• Minimize Reconstruction Error
[isomorphismes, 2014]
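The two bullets "find principal components" and "minimize reconstruction error" can be sketched in a few lines. This is a minimal illustration using power iteration on the covariance matrix, not the method used to produce the slide's figure; the sample data and all function names are made up for the example, and a library eigen-solver would normally be used instead.

```python
import math

# Hypothetical 2D sample that roughly follows the line y = x.
data = [(1.0, 1.1), (2.0, 1.9), (3.0, 3.2), (4.0, 3.8), (5.0, 5.0)]

def center(points):
    """Return the per-dimension means and the mean-centered points."""
    n, dims = len(points), len(points[0])
    means = [sum(p[d] for p in points) / n for d in range(dims)]
    return means, [[p[d] - means[d] for d in range(dims)] for p in points]

def covariance(X):
    """Sample covariance matrix of already centered points."""
    n, dims = len(X), len(X[0])
    return [[sum(r[i] * r[j] for r in X) / (n - 1) for j in range(dims)]
            for i in range(dims)]

def first_component(cov, iterations=200):
    """Power iteration: approaches the eigenvector with the largest eigenvalue,
    assuming that eigenvalue is strictly dominant."""
    v = [1.0] * len(cov)
    for _ in range(iterations):
        w = [sum(c * x for c, x in zip(row, v)) for row in cov]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

means, X = center(data)
pc1 = first_component(covariance(X))

# Project onto the first component (2D -> 1D) and map back (1D -> 2D).
scores = [sum(x * u for x, u in zip(row, pc1)) for row in X]
reconstructed = [[s * u + m for u, m in zip(pc1, means)] for s in scores]

# Mean squared reconstruction error; the first principal component is the
# single direction for which this error is smallest.
mse = sum((a - b) ** 2 for p, r in zip(data, reconstructed)
          for a, b in zip(p, r)) / len(data)
print("first component:", pc1, "reconstruction MSE:", mse)
```

For this sample the component points close to the diagonal and the reconstruction error is small, because the data is almost one-dimensional.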
Principal Component Analysis (PCA)
Eigen-Faces
Locally Linear Embedding (LLE)
Assumes the data is locally linear
• Non-Linear, Local
• Select neighbors and approximate linearly
• Map to lower dimension
[Roweis, 2000]
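The "select neighbors and approximate linearly" step can be illustrated for a single point. The sketch below computes the weights that best reconstruct a point from exactly two given neighbors under the constraint that the weights sum to 1. The function name `lle_weights`, the regularization constant and the sample points are assumptions for this example; the second step, finding low-dimensional coordinates that keep these weights, needs an eigen-solver and is left out.

```python
def lle_weights(x, neighbors, reg=1e-3):
    """Weights that reconstruct x from exactly two neighbors and sum to 1."""
    # Differences between the point and each neighbor.
    d = [[xi - ni for xi, ni in zip(x, n)] for n in neighbors]
    # Local Gram matrix C[j][k] = (x - n_j) . (x - n_k).
    c = [[sum(a * b for a, b in zip(dj, dk)) for dk in d] for dj in d]
    # Regularize the diagonal so the 2x2 system stays solvable when the
    # neighbors are collinear with x (the Gram matrix is singular then).
    trace = c[0][0] + c[1][1]
    eps = reg * trace if trace > 0 else reg
    c[0][0] += eps
    c[1][1] += eps
    # Solve C w = (1, 1) with Cramer's rule, then rescale to sum to 1.
    det = c[0][0] * c[1][1] - c[0][1] * c[1][0]
    w0 = (c[1][1] - c[0][1]) / det
    w1 = (c[0][0] - c[1][0]) / det
    total = w0 + w1
    return [w0 / total, w1 / total]

# Hypothetical point lying between two neighbors on a line.
x = (1.0, 0.0)
neighbors = [(0.0, 0.0), (3.0, 0.0)]
w = lle_weights(x, neighbors)

# Weighted combination of the neighbors; should land close to x.
reconstruction = [sum(wj * n[d] for wj, n in zip(w, neighbors))
                  for d in range(len(x))]
print("weights:", w, "reconstruction:", reconstruction)
```

The nearer neighbor receives the larger weight (about 2/3 here). The regularization shifts the result slightly away from the exact values.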
ISOMAP
Isometric feature mapping
• Non-linear, Global
• K-Nearest Neighbors
• Construct neighborhood graph
• Compute shortest paths
[Balasubramanian, 2002]
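The three steps listed above can be sketched as follows. This is a simplified illustration with made-up function names and data. It uses a dense matrix and Floyd-Warshall for the shortest paths, which only suits small inputs, and it stops before the final Isomap step of embedding the resulting distances with classical multidimensional scaling.

```python
import math

def knn_graph(points, k):
    """Symmetric k-nearest-neighbor graph as a dense matrix of edge lengths."""
    n = len(points)
    graph = [[0.0 if i == j else math.inf for j in range(n)] for i in range(n)]
    for i in range(n):
        by_distance = sorted((math.dist(points[i], points[j]), j)
                             for j in range(n) if j != i)
        for dist, j in by_distance[:k]:
            graph[i][j] = graph[j][i] = dist
    return graph

def geodesic_distances(graph):
    """All-pairs shortest paths (Floyd-Warshall) over the neighborhood graph."""
    n = len(graph)
    d = [row[:] for row in graph]
    for m in range(n):
        for i in range(n):
            for j in range(n):
                if d[i][m] + d[m][j] < d[i][j]:
                    d[i][j] = d[i][m] + d[m][j]
    return d

# Hypothetical data: nine points on a half circle of radius 1.
points = [(math.cos(math.pi * i / 8), math.sin(math.pi * i / 8))
          for i in range(9)]

geodesic = geodesic_distances(knn_graph(points, k=2))
straight = math.dist(points[0], points[-1])   # cuts across the half circle
along_arc = geodesic[0][-1]                   # follows the neighborhood graph
print(f"straight line: {straight:.3f}, along the graph: {along_arc:.3f}")
```

The graph distance between the two end points is close to the arc length of the half circle, while the straight-line distance ignores the curved shape of the data. That difference is what Isomap exploits.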
t-SNE
t-Distributed Stochastic Neighbor Embedding
• Non-linear, Global
• Uses Gaussian similarities in the high-dimensional space and Student-t similarities in the embedding
• Preserves the similarities in lower dimensions
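To make "preserving similarities" concrete, the sketch below computes the two similarity distributions and the Kullback-Leibler divergence between them, which is the cost t-SNE minimizes. It makes two simplifying assumptions: the Gaussian bandwidth `sigma` is fixed for all points, where t-SNE tunes it per point from a perplexity setting, and no optimization is run; two hand-made embeddings are only compared. All names and data are hypothetical.

```python
import math

def gaussian_similarities(points, sigma=1.0):
    """Conditional similarities p(j|i) from a Gaussian centered on each point."""
    n = len(points)
    p = []
    for i in range(n):
        row = [0.0 if i == j else
               math.exp(-math.dist(points[i], points[j]) ** 2 / (2 * sigma ** 2))
               for j in range(n)]
        total = sum(row)
        p.append([v / total for v in row])
    return p

def joint(p):
    """Symmetrize the conditionals: p_ij = (p(j|i) + p(i|j)) / (2n)."""
    n = len(p)
    return [[(p[i][j] + p[j][i]) / (2 * n) for j in range(n)] for i in range(n)]

def student_t_similarities(embedding):
    """Joint similarities q_ij from a Student-t kernel (one degree of freedom)."""
    n = len(embedding)
    w = [[0.0 if i == j else
          1.0 / (1.0 + math.dist(embedding[i], embedding[j]) ** 2)
          for j in range(n)] for i in range(n)]
    total = sum(map(sum, w))
    return [[v / total for v in row] for row in w]

def kl_divergence(p, q):
    """KL(P || Q), summed over all pairs with p_ij > 0."""
    return sum(pij * math.log(pij / qij)
               for prow, qrow in zip(p, q)
               for pij, qij in zip(prow, qrow) if pij > 0)

# Hypothetical 3D data with two tight clusters of two points each.
high_dim = [(0.0, 0.0, 0.0), (0.5, 0.0, 0.0), (5.0, 5.0, 5.0), (5.5, 5.0, 5.0)]
P = joint(gaussian_similarities(high_dim))

# Two candidate 1D embeddings: one keeps the clusters, one mixes them up.
good = [(0.0,), (0.5,), (10.0,), (10.5,)]
bad = [(0.0,), (10.0,), (0.5,), (10.5,)]
kl_good = kl_divergence(P, student_t_similarities(good))
kl_bad = kl_divergence(P, student_t_similarities(bad))
print(f"KL for cluster-preserving layout: {kl_good:.3f}, for mixed layout: {kl_bad:.3f}")
```

The layout that keeps neighbors together has the much lower cost. The actual algorithm moves the low-dimensional points by gradient descent to reduce this value.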
2D Scatter Plots
Commonly used
• Easy Perception
• (No) Interaction
• Limited to two dimensions
• Colors?!
2D Scatter Plot Matrices
Show relationships with scatter plots
• Slow perception
• May have interaction
• Does not scale well
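A quick calculation shows why the matrix does not scale: with d dimensions there are d(d-1)/2 distinct pairwise plots. The dimension names below are invented for the example.

```python
from itertools import combinations
from math import comb

# Hypothetical dimension names; every unordered pair gets its own scatter plot.
names = ["height", "weight", "age", "income"]
pairs = list(combinations(names, 2))
print(len(pairs), "plots for", len(names), "dimensions")

# Number of distinct pairwise plots for growing dimensionality.
counts = {d: comb(d, 2) for d in (4, 10, 100, 1000)}
for d, count in counts.items():
    print(f"{d:>5} dimensions -> {count:>7} scatter plots")
```

For the 1000 or more dimensions mentioned in the goal slide, this already means about half a million plots.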
2D Scatter Plot Matrices
Let an algorithm choose the plots
[Zheng, 2014]
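One simple way to let an algorithm pick plots is to score each pair of dimensions and show only the best ones. The sketch below ranks pairs by absolute Pearson correlation. This criterion is a stand-in chosen for illustration and is not the layout method of the cited paper; the column names and values are made up, and constant columns are assumed not to occur.

```python
import math
from itertools import combinations

def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long, non-constant lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

def rank_pairs(columns):
    """All pairs of dimensions, strongest absolute correlation first."""
    scored = [(abs(pearson(columns[a], columns[b])), a, b)
              for a, b in combinations(columns, 2)]
    return sorted(scored, reverse=True)

# Hypothetical data set with three dimensions.
columns = {
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [2.0, 4.0, 6.0, 8.0, 10.5],
    "c": [3.0, 1.0, 4.0, 1.0, 5.0],
}
ranked = rank_pairs(columns)
for score, first, second in ranked:
    print(f"{first} vs {second}: |r| = {score:.3f}")
```

Only the top few entries of `ranked` would then be drawn. Correlation only captures linear relationships, so other scores may suit other data better.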
3D Scatter Plots
Interactive
• Only one additional dimension
• Expensive interaction, useless without!
• Limited benefit compared to 2D scatter plots
[Sedlmair, 2013]
Parallel Coordinate Plot
Display >2 dimensions
Interaction Examples: https://syntagmatic.github.io/parallel-coordinates/
• Noisy
• Slow perception
• Meaning of x-axis?!
[Harvard Business Manager, 2015-07]
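A minimal sketch of how a parallel coordinate plot lays out its data, assuming min-max scaling per axis (a common choice, not stated on the slide). The function name `parallel_coordinates` and the sample rows are made up for illustration.

```python
def parallel_coordinates(rows):
    """Turn each row into a polyline with one (axis_index, scaled_value)
    vertex per dimension; every axis is scaled to the range 0..1."""
    dims = len(rows[0])
    lows = [min(r[d] for r in rows) for d in range(dims)]
    highs = [max(r[d] for r in rows) for d in range(dims)]

    def scale(value, d):
        span = highs[d] - lows[d]
        # A constant dimension has no spread; place it in the middle.
        return 0.5 if span == 0 else (value - lows[d]) / span

    return [[(d, scale(r[d], d)) for d in range(dims)] for r in rows]

# Hypothetical records with three dimensions on very different scales.
rows = [(1.0, 200.0, 0.1), (2.0, 100.0, 0.3), (3.0, 150.0, 0.2)]
lines = parallel_coordinates(rows)
for line in lines:
    print(line)
```

The horizontal position of a vertex is only the index of its dimension. The order of the axes is therefore arbitrary, which is the issue behind the "Meaning of x-axis?!" bullet.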
Glyphs
Encode important information
• Memorable semantics
• Small
• Details through interaction
• Overwhelming?
[Fuchs, 2013]
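As one possible illustration of encoding several values in a small shape, the sketch below computes the outline of a star glyph, where each dimension becomes a ray whose length follows the value. The star glyph is chosen here only as a familiar example and is not necessarily the design shown on the slide; the function name and the values are hypothetical, and the values are assumed to be scaled to the range 0 to 1 already.

```python
import math

def star_glyph(values, radius=1.0):
    """Outline of a star glyph: one vertex per dimension, placed on evenly
    spaced rays around the center at a distance proportional to the value."""
    n = len(values)
    vertices = []
    for i, value in enumerate(values):
        angle = 2 * math.pi * i / n
        vertices.append((radius * value * math.cos(angle),
                         radius * value * math.sin(angle)))
    return vertices

# Hypothetical record with four dimensions, each already scaled to 0..1.
glyph = star_glyph([1.0, 0.5, 0.25, 0.75])
for x, y in glyph:
    print(f"({x:+.2f}, {y:+.2f})")
```

Connecting the vertices in order gives a polygon whose shape can be compared across many small glyphs at a glance.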
Glyphs
Domain-specific clues
[Fuchs, 2014]
Glyphs
Time series data
[Kintzel, 2011]
Conclusion
High Dimensional Data Visualization
• Lots of DR / visualization techniques
• Even more combinations
• Every application needs to be tailored to its specific requirements
“A problem well put is half-solved”
– John Dewey
Thank you for your attention!
Questions?
Literature
• Sedlmair, Michael; Munzner, Tamara; Tory, Melanie (2013): Empirical guidance on scatterplot and dimension reduction technique choices.
• Zheng, Yunzhu; Suematsu, Haruka; Itoh, Takayuki; Fujimaki, Ryohei; Morinaga, Satoshi; Kawahara, Yoshinobu (2014): Scatterplot layout for high-dimensional data visualization.
• Card, S. K., Mackinlay, J. D., and Shneiderman, B., editors. Readings in Information Visualization: Using Vision to Think. Morgan Kaufmann, San Francisco. 1999.
• Fuchs, Johannes, et al. "Evaluation of alternative glyph designs for time series data in a small multiple setting." Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 2013.
• Christopher Kintzel, Johannes Fuchs, and Florian Mansmann. 2011. Monitoring large IP spaces with ClockView.
• Fuchs, Johannes, et al. "Leaf Glyph: Visualizing Multi-Dimensional Data with Environmental Cues." 2014.
• Balasubramanian, Mukund, and Eric L. Schwartz. "The isomap algorithm and topological stability." Science 295.5552 (2002): 7-7.
• Roweis, Sam T.; Saul, Lawrence K. (2000): Nonlinear dimensionality reduction by locally linear embedding.
• dos Santos, S. and Brodlie, K. Gaining understanding of multivariate and multidimensional data through visualization. Computers & Graphics, 28(3):311–325. 2004.
• Harvard Business Manager, 2015-07: Andere Länder, anderer Stil. http://www.harvardbusinessmanager.de/heft/d-135395625.html
• isomorphismes (2014): Making sense of principal component analysis, eigenvectors & eigenvalues. Cross Validated. http://stats.stackexchange.com/a/82427/80011
Example Applications
• Biological / Medical (genes, fMRI)
• Finance (time series)
• Geological (climate, spatial, temporal)
• Big Data Analysis (Netflix Movie Rating Data)
Other DR techniques
Matlab toolbox for dimensionality reduction
• Principal Component Analysis (PCA)
• Probabilistic PCA
• Factor Analysis (FA)
• Classical multidimensional scaling (MDS)
• Sammon mapping
• Linear Discriminant Analysis (LDA)
• Isomap
• Landmark Isomap
• Local Linear Embedding (LLE)
• Laplacian Eigenmaps
• Hessian LLE
• Local Tangent Space Alignment (LTSA)
• Conformal Eigenmaps (extension of LLE)
• Maximum Variance Unfolding (extension of LLE)
• Landmark MVU (LandmarkMVU)
• Fast Maximum Variance Unfolding (FastMVU)
• Kernel PCA
• Generalized Discriminant Analysis (GDA)
• Diffusion maps
• Neighborhood Preserving Embedding (NPE)
• Locality Preserving Projection (LPP)
• Linear Local Tangent Space Alignment (LLTSA)
• Stochastic Proximity Embedding (SPE)
• Deep autoencoders (using denoising autoencoder pretraining)
• Local Linear Coordination (LLC)
• Manifold charting
• Coordinated Factor Analysis (CFA)
• Gaussian Process Latent Variable Model (GPLVM)
• Stochastic Neighbor Embedding (SNE)
• Symmetric SNE
• t-Distributed Stochastic Neighbor Embedding (t-SNE)
• Neighborhood Components Analysis (NCA)
• Maximally Collapsing Metric Learning (MCML)
• Large-Margin Nearest Neighbor (LMNN)
See: http://lvdmaaten.github.io/drtoolbox/
