SlideShare ist ein Scribd-Unternehmen logo
1 von 38
1




MACHINE LEARNING FOR
MEDICAL IMAGING DATA
Yiou (Leo) Li
Background
2



    Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory,
    University of California San Francisco
    •   Developed unsupervised learning method for feature extraction of brain
        imaging data
    •   Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive
        modeling of brain trauma
    •   Designed batch data processing protocol to perform image registration,
        segmentation, band-pass filtering, smoothing, and linear model fitting


    Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal
    processing Laboratory, University of Maryland Baltimore County
    •   Developed the effective degrees of freedom of random process and applied it
        to the model order selection by Information Theoretic Criteria
    •   Developed a linear filtering mechanism in independent component analysis for
        feature enhancement
    •   Analyzed canonical correlation analysis for multiple datasets
Outline
3




     Independent component analysis (ICA) and its
      application to sparse feature extraction from
      multivariate dataset


     Multi-set canonical correlation analysis and its
      application to joint pattern extraction from a group of
      datasets


     Order selection of principal component analysis (PCA)
      and its application to data dimension reduction
PCA vs ICA
4




                  PCA                               ICA
           Linear projection                 Linear projection
             (Orthogonal)
       Uncorrelated components           Independent components
             (non sparse)             (sparse, “long tail” distribution)
      Typically analytical solution     Typically iterative solution
                 (SVD)                    (Iterative optimization)
ICA detects independent factors with
    long tails in multivariate dataset
5
Long tail factors are sparse features in
    data samples
6



                                          Weights of
                                           features
                  Data points (N)

                                    ICA
    Sensors (M)         X            =       A         .         S




                                                           Sparse features



                                     X= AS
ICA model
7


           x1   a11 a12 ... a1M   s1 
          x  a       a 22   a 2M   s 2 
           2    21               
           ...        ...          ... 
                                   
           x M  a M1 a M2   a MM  s M 


                x : Observed variables
                A : Mixing matrix
                s : Latent factors



                  x= As -> s =A-1x
ICA by maximum likelihood estimation
8


    Transformation of multivariate random variable: x = As
                              p(s 1, s2 , ... , sM )
    p(x 1,x 2 , ... , x M )                                 (1)
                                    det(A)
    Statistical independence condition of s:

    p(s 1, s2 , ... , sM )  i 1 p(si )
                                          M
                                                             (2)

    Log likelihood function of x with parameter A:

    log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A)
                                                      -1

                                      i
ICA Application: Sparse feature extraction from
    multivariate dataset
9
Functional MRI experiment
10
Analyze functional MRI data of resting
     state brain
11




                              Sparse features




                    ICA
Feature 1. Primary visual network
12



 +
                                         A




 -
Feature 2. “Default mode network”
13
Feature 3. Attention control network
14
Hierarchical clustering shows link
15
     between features (brain regions)
Predicative modeling of brain trauma
16

                                                 Pattern weights
                                 N

       Healthy
                                 X           =      A        .                 S
       Patients

                                                                               Sparse
                                                                           spatial features


     Subject 1

                             …
                 Subject 2
          16                                                                       Pattern 2
                                                                   Feature 1                   …
                                 Subject M                                         Feature 2

                                                           Y.-O. Li, et al., HBM, 2011
ICA Pattern classification for predictive
     modeling of brain trauma
17




      • 29 healthy + 29 trauma, 10-fold cross-validation


             Classifier                9 patterns                14 patterns
                                   Classification error      Classification error
            Naïve Bayes                0.35+/-0.03               0.32+/-0.03

        K nearest neighbor             0.29+/-0.02               0.30+/-0.03

      Support vector classifier         0.36+/-0.02               0.30 +/-0.02
                                  (c=1, number of SV: 46)   (c=1, number of SV: 20)
Outline
18




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to dimension reduction
Joint pattern extraction requires coherency
     on extracted patterns across datasets
19


               Model:   x k =Aksk , k=1,2,...,M




                                     Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation analysis
20




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation
     analysis
21




          Correlation matrix of [S1,S2, … SM]

                                        Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Application: joint pattern extraction from a
     group of datasets
22




     •   Analyze group functional MRI data from
         simulated driving experiment
Simulated driving experiment
23

     •   Forty subjects, three repeated sessions (120 datasets)
     •   Experiment paradigm:




     •   Behavioral records:
          •   Average speed (AS)
          •   Differential of speed (DS)
          •   Average steering offset (AR)
          •   Differential steering offset (DR)
          •   Differential pedal offset (DP)
          •   Occurrence of yellow line crossing (YLC)
          •   Occurrence of white passenger-side line crossing (WPLC)

                                                    Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step I: M-CCA for joint feature extraction
24




                           Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step II: PCA and behavioral association
25




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Pattern 1: Primary visual function
26




                                                   D = 0:85
                                                   W = 0:42




                        95% CI of behavioral association
Pattern 2: “default mode network”
27




                                                  D = -0.63
                                                  W = -0.39




                      95% CI of behavioral association
Pattern 3: Motor coordination
28




                                                  D = 0.86
                                                  W = 0.15




                       95% CI of behavioral association
Pattern 4: Executive control network
29




                                                 D = 0.64
                                                 W = 0.61




                       95% CI of behavioral association
Cross correlation of Pattern 1
30




                       Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Outline
31




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to data dimension reduction
Decreased reproducibility of independent
     component on high-dimensional dataset
32

          •    Functional MRI with 120 time points
          •    Twenty Monte Carlo trials of ICA algorithm
          •    Clustering the IC estimates
          •    Reproducible ICs: compact and separated clusters




        K=20                           K=40                             K=90

                                                     Y.-O. Li, et al., HBM, 2007
Dimension reduction of high-dimensional
     data by PCA
33


                                       ICA              N
               N


       M       X               =            A   .       S


                                       MxM

                       PCA dimension reduction + ICA


                           .       A    .           S
           X   =   E                                        +      N


                                       K-largest PCs            M-k PCs
Failure of information-theoretic criteria with
     uncorrected degrees of freedom
34



                        AIC, MDL
            ˆ
            k  arg min k {l ( x | k )  g (k )}

                                                                            ( M k)
                                                                    
                                                i k 1 
                                                     M       1/ ( M  k )

                          l(x |  k )  N ln  M             i
                                                                     
                                             
                                              i k 1 i / ( M  k) 
                                                                     

                                           AIC : k (2M  k )  1
                            g ( k )  
                                       MDL : 0.5  ln N (k (2M  k )  1)

                                           Y.-O. Li, et al., HBM, 2007
Estimation of degrees of freedom by
     entropy rate
35


                 Entropy rate of a Gaussian process
                                        1       
                     h( x)  ln 2 e 
                                       4        ln s()d
                                               


            h( x)  ln 2 e     iff x[n] is an i.i.d. random process




       h(x) = 0.40               h(x) = 1.28                  h(x) = 1.41
                                                     Y.-O. Li, et al., HBM, 2007
Application: Order selection of high-
     dimensional dataset
36
Corrected order selection criteria
     significantly improves order selection
37




            Original       With correction on degrees of freedom


                                    Y.-O. Li, et al., HBM, 2007
Summary
38




     •   ICA extracts useful patterns from high dimensional imaging data for
         predictive modeling

     •   M-CCA reveals patterns from several datasets in a coherent order

     •   Dimension reduction by PCA improves the reproducibility of ICA extracted
         patterns


         Exploratory multivariate analysis are promising tools for
         data mining applications

Weitere ähnliche Inhalte

Was ist angesagt?

Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment
Armando Vieira
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
Dr. Jai Sakthi
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating Equipment
Jordan McBain
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentation
Enrico Scapin
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)
Tatsuya Yokota
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
zukun
 
Pres110811
Pres110811Pres110811
Pres110811
shotlub
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filters
Sandip Joardar
 

Was ist angesagt? (17)

7 - Model Assessment and Selection
7 - Model Assessment and Selection7 - Model Assessment and Selection
7 - Model Assessment and Selection
 
Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
 
Eurogen v
Eurogen vEurogen v
Eurogen v
 
Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating Equipment
 
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentation
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
 
IISc Internship Report
IISc Internship ReportIISc Internship Report
IISc Internship Report
 
08 linear classification_2
08 linear classification_208 linear classification_2
08 linear classification_2
 
Pres110811
Pres110811Pres110811
Pres110811
 
Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...
 
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filters
 

Andere mochten auch

Machine Learning
Machine LearningMachine Learning
Machine Learning
butest
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
butest
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysis
Dushyant Bheda
 

Andere mochten auch (20)

Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsArtificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
 
Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Explaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOExplaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICO
 
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2
 
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?
 
Artificial intelligence in medical image processing
Artificial intelligence in medical image processingArtificial intelligence in medical image processing
Artificial intelligence in medical image processing
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
 
Multivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaMultivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeria
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
 
Multivariate adaptive regression splines
Multivariate adaptive regression splinesMultivariate adaptive regression splines
Multivariate adaptive regression splines
 
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLSMULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
 
What determines Sales of a Product?
What determines Sales of a Product?What determines Sales of a Product?
What determines Sales of a Product?
 
Multivariate Data Analysis
Multivariate Data AnalysisMultivariate Data Analysis
Multivariate Data Analysis
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysis
 
Reverse Logistics in Different Industries
Reverse Logistics in Different IndustriesReverse Logistics in Different Industries
Reverse Logistics in Different Industries
 
Audio mining
Audio miningAudio mining
Audio mining
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis Introduction
 

Ähnlich wie Machine learning for medical imaging data

Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
United States Air Force Academy
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
ICAC09
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 

Ähnlich wie Machine learning for medical imaging data (20)

Independent Component Analysis
Independent Component Analysis Independent Component Analysis
Independent Component Analysis
 
Sp18_P1.pptx
Sp18_P1.pptxSp18_P1.pptx
Sp18_P1.pptx
 
Understandig PCA and LDA
Understandig PCA and LDAUnderstandig PCA and LDA
Understandig PCA and LDA
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
 
Presentation on machine learning
Presentation on machine learningPresentation on machine learning
Presentation on machine learning
 
DimensionalityReduction.pptx
DimensionalityReduction.pptxDimensionalityReduction.pptx
DimensionalityReduction.pptx
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
 
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
 
Ph.D. Presentation
Ph.D. PresentationPh.D. Presentation
Ph.D. Presentation
 
Dbm630 lecture09
Dbm630 lecture09Dbm630 lecture09
Dbm630 lecture09
 
Principal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionPrincipal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty Detection
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptx
 
Distributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedDistributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and Related
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdf
 
tutorial.ppt
tutorial.ppttutorial.ppt
tutorial.ppt
 
Spatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understandingSpatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understanding
 
190 195
190 195190 195
190 195
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
 
Sunum
SunumSunum
Sunum
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Machine learning for medical imaging data

  • 1. 1 MACHINE LEARNING FOR MEDICAL IMAGING DATA Yiou (Leo) Li
  • 2. Background 2 Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory, University of California San Francisco • Developed unsupervised learning method for feature extraction of brain imaging data • Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive modeling of brain trauma • Designed batch data processing protocol to perform image registration, segmentation, band-pass filtering, smoothing, and linear model fitting Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal processing Laboratory, University of Maryland Baltimore County • Developed the effective degrees of freedom of random process and applied it to the model order selection by Information Theoretic Criteria • Developed a linear filtering mechanism in independent component analysis for feature enhancement • Analyzed canonical correlation analysis for multiple datasets
  • 3. Outline 3  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 4. PCA vs ICA 4 PCA ICA Linear projection Linear projection (Orthogonal) Uncorrelated components Independent components (non sparse) (sparse, “long tail” distribution) Typically analytical solution Typically iterative solution (SVD) (Iterative optimization)
  • 5. ICA detects independent factors with long tails in multivariate dataset 5
  • 6. Long tail factors are sparse features in data samples 6 Weights of features Data points (N) ICA Sensors (M) X = A . S Sparse features X= AS
  • 7. ICA model 7  x1   a11 a12 ... a1M   s1  x  a a 22 a 2M   s 2   2    21    ...   ...   ...         x M  a M1 a M2 a MM  s M  x : Observed variables A : Mixing matrix s : Latent factors x= As -> s =A-1x
  • 8. ICA by maximum likelihood estimation 8 Transformation of multivariate random variable: x = As p(s 1, s2 , ... , sM ) p(x 1,x 2 , ... , x M )  (1) det(A) Statistical independence condition of s: p(s 1, s2 , ... , sM )  i 1 p(si ) M (2) Log likelihood function of x with parameter A: log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A) -1 i
  • 9. ICA Application: Sparse feature extraction from multivariate dataset 9
  • 11. Analyze functional MRI data of resting state brain 11 Sparse features ICA
  • 12. Feature 1. Primary visual network 12 + A -
  • 13. Feature 2. “Default mode network” 13
  • 14. Feature 3. Attention control network 14
  • 15. Hierarchical clustering shows link 15 between features (brain regions)
  • 16. Predicative modeling of brain trauma 16 Pattern weights N Healthy X = A . S Patients Sparse spatial features Subject 1 … Subject 2 16 Pattern 2 Feature 1 … Subject M Feature 2 Y.-O. Li, et al., HBM, 2011
  • 17. ICA Pattern classification for predictive modeling of brain trauma 17 • 29 healthy + 29 trauma, 10-fold cross-validation Classifier 9 patterns 14 patterns Classification error Classification error Naïve Bayes 0.35+/-0.03 0.32+/-0.03 K nearest neighbor 0.29+/-0.02 0.30+/-0.03 Support vector classifier 0.36+/-0.02 0.30 +/-0.02 (c=1, number of SV: 46) (c=1, number of SV: 20)
  • 18. Outline 18  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to dimension reduction
  • 19. Joint pattern extraction requires coherency on extracted patterns across datasets 19 Model: x k =Aksk , k=1,2,...,M Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 20. Multi-set canonical correlation analysis 20 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 21. Multi-set canonical correlation analysis 21 Correlation matrix of [S1,S2, … SM] Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 22. Application: joint pattern extraction from a group of datasets 22 • Analyze group functional MRI data from simulated driving experiment
  • 23. Simulated driving experiment 23 • Forty subjects, three repeated sessions (120 datasets) • Experiment paradigm: • Behavioral records: • Average speed (AS) • Differential of speed (DS) • Average steering offset (AR) • Differential steering offset (DR) • Differential pedal offset (DP) • Occurrence of yellow line crossing (YLC) • Occurrence of white passenger-side line crossing (WPLC) Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 24. Step I: M-CCA for joint feature extraction 24 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 25. Step II: PCA and behavioral association 25 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 26. Pattern 1: Primary visual function 26 D = 0:85 W = 0:42 95% CI of behavioral association
  • 27. Pattern 2: “default mode network” 27 D = -0.63 W = -0.39 95% CI of behavioral association
  • 28. Pattern 3: Motor coordination 28 D = 0.86 W = 0.15 95% CI of behavioral association
  • 29. Pattern 4: Executive control network 29 D = 0.64 W = 0.61 95% CI of behavioral association
  • 30. Cross correlation of Pattern 1 30 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 31. Outline 31  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 32. Decreased reproducibility of independent component on high-dimensional dataset 32 • Functional MRI with 120 time points • Twenty Monte Carlo trials of ICA algorithm • Clustering the IC estimates • Reproducible ICs: compact and separated clusters K=20 K=40 K=90 Y.-O. Li, et al., HBM, 2007
  • 33. Dimension reduction of high-dimensional data by PCA 33 ICA N N M X = A . S MxM PCA dimension reduction + ICA . A . S X = E + N K-largest PCs M-k PCs
  • 34. Failure of information-theoretic criteria with uncorrected degrees of freedom 34 AIC, MDL ˆ k  arg min k {l ( x | k )  g (k )} ( M k)    i k 1  M 1/ ( M  k ) l(x |  k )  N ln  M i    i k 1 i / ( M  k)    AIC : k (2M  k )  1 g ( k )   MDL : 0.5  ln N (k (2M  k )  1) Y.-O. Li, et al., HBM, 2007
  • 35. Estimation of degrees of freedom by entropy rate 35 Entropy rate of a Gaussian process 1  h( x)  ln 2 e  4   ln s()d  h( x)  ln 2 e iff x[n] is an i.i.d. random process h(x) = 0.40 h(x) = 1.28 h(x) = 1.41 Y.-O. Li, et al., HBM, 2007
  • 36. Application: Order selection of high- dimensional dataset 36
  • 37. Corrected order selection criteria significantly improves order selection 37 Original With correction on degrees of freedom Y.-O. Li, et al., HBM, 2007
  • 38. Summary 38 • ICA extracts useful patterns from high dimensional imaging data for predictive modeling • M-CCA reveals patterns from several datasets in a coherent order • Dimension reduction by PCA improves the reproducibility of ICA extracted patterns Exploratory multivariate analysis are promising tools for data mining applications