SlideShare ist ein Scribd-Unternehmen logo
1 von 1
Downloaden Sie, um offline zu lesen
Connectome Classification: Statistical Graph Theoretic
           Methods for Analysis of MR-Connectome Data
                                 Joshua T. Vogelstein , William R. Gray , John A. Bogovic ,   1                                                                             1,2                                                                                     1
                                          3                 1                 1
                          Susan M. Resnick , Jerry L. Prince , Carey E. Priebe , R. Jacob Vogelstein1,2
                           1                                                                       2
                            Johns Hopkins University, Baltimore, Maryland, Johns Hopkins University Applied Physics Laboratory, Laurel, Maryland
                                                             3
                                                              National Institutes of Health, Bethesda, Maryland



Abstract                                                                                                           Results
• Methods for high-throughput MR connectome inference are available [1]                                            Gender Classifier
• Previous analyses of connectome data relied on classical graph theoretic tools, such as
clustering coefficient                                                                                              • Coherent and incoherent classifiers perform better than chance and the naive Bayes
• We develop a statistical graph theoretic framework to apply to generic connectome                                classifier (coherent classifier is significant with p-value < 0.0001).
classification problems                                                                                             • Best classifier achieves 83% accuracy using 12 signal vertices and 360 signal edges
• Applying the tools to 49 senior individuals from the BLSA data set resulted in connectome                        • Classical graph theoretic tools, such as clustering coeffiecient, number of triangles, etc., do
classification accuracy of up to 85%                                                                                not use vertex labels, which contain useful classification signal.
• Using standard graph theoretic measures, like clustering coefficient, ignores vertex labels, and                  • SOA Machine learning techniques [2] using classical graph theory yield only 75% accuracy
achieves only 75% accuracy even upon using sophisticated multivariate machine learning
methods [2]
• Extensions and further applications aplenty.

                                                                                                                                                                                     incoherent estimator                                                                     coherent estimator
Methods                                                                                                                                                                                                                                                                                                           0.5




                                                                                                                                         misclassification rate




                                                                                                                                                                                                                                           # signal−vertices
                                                                                                                                                                    0.5 L π
                                                                                                                                                                        ˆˆ                                            ˆ
                                                                                                                                                                                                                      L n b = 0. 41                                                   ˆ
Connectome Inference                                                                                                                                                             = 0. 5
                                                                                                                                                                                                                                                                  10
                                                                                                                                                                                                                                                                                      L c o h= 0. 16
                                                                                                                                                                                                                                                                                                                  0.4

• MR Connectome Automated Pipeline (MRCAP) [1] to infer connectomes                                                                                               0.25                                                                                            20                                              0.3
                                                                                                                                                                                             ˆ
                                                                                                                                                                                             L i n c= 0. 27
• Vertices are neuroanatomical gyral regions [3], edges are estimated tracts using FACT [4]
• 49 subjects from the Baltimore Longitudinal Study on Aging; 25 male, 24 female                                                                                                                                                                                  30
                                                                                                                                                                          0 0                 1                   2                  3
                                                                                                                                                                                                                                                                                                                  0.16
                                                                                                                                                                          10           10          10       10                                                          200 400 600 800 1000
                                                                                                                                                                                log size of signal subgraph                                                          size of signal subgraph
                                                                                                                                                                                 some coherent estimators                                                         zoomed in coherent estimator
                                                                                                                                         misclassification rate                                                                                                                                                   0.5
                                                                                                                                                                    0.5




                                                                                                                                                                                                                                           # star−vertices
                                                                                                                                                                                                                                                                  15
                                                                                                                                                                                                                                                                                                                  0.4
                                                                                                                                                                                                                                                                  18
                                                                                                                                                                  0.25                                                                                                                                            0.3
                                                                                                                                                                  0.16                                                                                            21

                                                                                                                                                                          0 0                 1                   2                  3
                                                                                                                                                                                                                                                                                                                  0.16
                                                                                                                                                                          10         10          10       10                                                                 400       500       600
                                                                                                                                                                              log size of signal subgraph                                                                   size of signal subgraph
                                                                                                                                                                           coherent signal subgraph estimate                                                                      coherogram

                                                                                                                                                                                                                                                                                                                  30
                                                                                                                                                                    20                                                                                            20
                                                                                                                                                           vertex




                                                                                                                                                                                                                                                                                                                  20
                                                                                                                                                                    40                                                                                            40
                                                                                                                                                                                                                                                                                                                  10
                                                                                                                                                                    60                                                                                            60
                                                                                                                                                                                                                                                                                                                  0
                                                                                                                                                                                       20             40                   60                                                 0.04 0.14 0.29 0.55
                                                                                                                                                                                                  vertex                                                                              threshold


                                                                                                                   Figure Legend (above): The top two panels depict the relative performances of the
                                                                                                                   incoherent (left) and coherent (right) classifiers as a function of their hyper-parameters. The
                                                                                                                   middle two depict misclassification rate (left) for a few different choices of # of signal vertices
                                                                                                                   and (right) a zoomed in depiction of the top right panel. The bottom left panel shows the
                                                                                                                   estimated signal subgraph, and the bottom right shows the coherogram. Together, these
                                                                                                                   bottom panels suggest that the signal subgraph for these data is neither particularly coherent
                                                                                                                   or incoherent. (below): The figure below visualizes the twelve signal subgraph nodes. Each
                                                                                                                   subplot shows the signal subgraph induced by one of the 12 signal vertices estimated using
                                                                                                                   the coherent classifier. There are 360 edges in the signal subgraph.




                         MRCAP is available at: http://www.nitrc.org/projects/mrcap/




Model
• Joint graph/class model
• Each edge is an independent binary random variable
• A subset of edges comprise the signal subgraph

                          FGY = FG|Y FY
                                 
                              =       Bern(auv ; puv|y )πy
                                        (u,v)∈S
                                            
                                                        Bern(auv ; puv )
                                       (u,v)∈ES

Classifier
• Bayes plug-in classifier is asymptotically optimal
• Robust estimators have better convergence properties than the MLE
                                                                                                                  Synthetic Data Analysis
                    y=
                    ˆ                          Bern(auv ; puv|y )ˆy
                                                          ˆ      π                                                 • Simulations as true to real data as possible suggest model is not wholly unreasonable
                                                                                                                   • Even under true model, we only expect about 50% of the identified edges are true signal
                                     ˆ
                               (u,v)∈S                                                                             edges with 50 samples
                                                                                                                   • With only a few more samples, both misclassification rate and missed-edge rate drop
                                                                                                                   precipitously
Signal Subgraph Estimator                                                                                                                                                       incoherent estimator                                                                           coherent estimator
                                                                                                                                                                  1
                                                                                                                      misclassification rate




• The signal subgraph could be all edges, an incoherent subset, or a coherent subset
                                                                                                                                                                                                                                              # star−vertices




                                                                                                                                                   0.75                                                                                                                                                                 0.7
• We devise a different estimator for the two special cases                                                                                                                                                                                                       10
• For each edge, we compute the significance of the difference between the two clases, using a
                                                                                                                                                              0.5                                                                                                                                                       0.5
Fisherʼs exact test, which is optimal under the model                                                                                                                                                                                                             20
• The incoherent signal subgraph estimator chooses the s most significant edges                                                                     0.25
• The coherent signal subgraph estimator chooses the m most significant vertices, and then the                                                                                                                                                                     30                                                    0.3
s most significant edges incident to those vertices                                                                                                                0                                                                                                                                                     0.18
                                                                                                                                                                    0                    1                    2                   3                                            200     400    600      800 1000
                                                                                                                                                                  10                   10                 10                    10
                                                                                                                                                                            log size of signal subgraph                                                                      size of signal subgraph

                                                                                                                                                                  1                                                                                               0.5
                                                                                                                                                                                                                                         misclassification rate
                                                                                                                                missed−edge rate




                                                                                                                                                                                                                                                                                                                  coh
                                                                                                                                                                                                                                                                  0.4                                             inc
                                                                                                                                                                                                                                                                  0.3                                             nb
                                                                                                                                                              0.5
                                                                                                                                                                                                                                                                  0.2

                                                                                                                                                                                                                                                                  0.1

                                                                                                                                                                  0
                                                                                                                                                                      0         20          40         60             80        100                                     0        20          40        60    80         100
                                                                                                                                                                                     # training samples                                                                               # training samples


                                                                                                                   Assumption Checking
                                                                                                                   • Correlation matrix is significantly correlated, suggesting independent edge assumption is
                                                                                                                   poor (data not shown)


                                                                                                                   Discussion
                                                                                                                   • MRCAP is an effective tool for high-throughput connectome inference
                                                                                                                   •Signal subgraph classifiers significantly improve performance over standard classification
FigureFigure 2: (Top) Gyral labelslabels and associated numeric indicesRef. 5). Connections
          Legend: (Top) Gyral and associated numeric indices (adapted from (adapted from [3]).                     results in both real and synthetic data
        between these regions, as revealed through the DTI tensor data, are quantified in terms of the mean        • Synthetic data suggests a few additional datapoints could yield vastly improved performance
Connections between these regions, as revealed through the DTI tensor data, are quantified in
        fractional anisotropy (FA) of the estimated fibers. (Bottom) Adjacency matrices illustrating connections   • Assumption suggests performance improvements are despite some model inaccuracies, and
terms of the mean regions (vertices) in female(FA)male brains. Each entry in these adjacencyAdjacency
        between gyral fractional anisotropy and of the estimated fibers. (Bottom) matrices                         generalized models might yield further improvements
matrices illustrating connections between gyral gyral region indicated by the row index and terminating
        represents the mean FA of fibers originating in the regions (vertices) in female and male brains.
                                                                                                                   • Standard graph theoretical tools are less effective and do not suggest a signal subgraph
        in the gyral region indicated by the column index, averaged across all subjects from each sex. The
Each entry in these adjacency matrices represents the mean FA of fibers originating in the gyral
        significance of the difference (uncorrected, exact p-values) between female and male brains, computed
region with Fisher’sby the row also shown. In all plots, lighter the gyralimplies higher values.by the column
         indicated exact test, is index and terminating in coloration region indicated Only the lower
index, triangle is shown becausesubjects from each sex.and therefore the adjacency matrices are
        averaged across all these graphs are undirected The significance of the difference
                                                                                                                   References
(uncorrected, exact p-values) assigned to the left hemisphere; 36–70 are assigned to the right
        symmetric. Labels 1–35 are between female and male brains, computed with Fisher’s exact
        hemisphere.                                                                                                [1] Gray et al, submitted and available at: http://www.nitrc.org/projects/mrcap/. .
test, is also shown. In all plots, lighter coloration implies higher values. Only the lower triangle               [2] Drezde et al, 2008.
is shown because these graphs are undirected and therefore the adjacency matrices are                              [3] Desikan et al, 2006.
symmetric. Labels 1–35 are assigned to the left hemisphere; 36–70 are assigned to the right                        [4] Mori,et al. 1999.
hemisphere.

Weitere ähnliche Inhalte

Kürzlich hochgeladen

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 

Kürzlich hochgeladen (20)

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 

Empfohlen

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 

Empfohlen (20)

Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 

Connectome Classification: Statistical Graph Theoretic Methods for Analysis of MR-Connectome Data

  • 1. Connectome Classification: Statistical Graph Theoretic Methods for Analysis of MR-Connectome Data Joshua T. Vogelstein , William R. Gray , John A. Bogovic , 1 1,2 1 3 1 1 Susan M. Resnick , Jerry L. Prince , Carey E. Priebe , R. Jacob Vogelstein1,2 1 2 Johns Hopkins University, Baltimore, Maryland, Johns Hopkins University Applied Physics Laboratory, Laurel, Maryland 3 National Institutes of Health, Bethesda, Maryland Abstract Results • Methods for high-throughput MR connectome inference are available [1] Gender Classifier • Previous analyses of connectome data relied on classical graph theoretic tools, such as clustering coefficient • Coherent and incoherent classifiers perform better than chance and the naive Bayes • We develop a statistical graph theoretic framework to apply to generic connectome classifier (coherent classifier is significant with p-value < 0.0001). classification problems • Best classifier achieves 83% accuracy using 12 signal vertices and 360 signal edges • Applying the tools to 49 senior individuals from the BLSA data set resulted in connectome • Classical graph theoretic tools, such as clustering coeffiecient, number of triangles, etc., do classification accuracy of up to 85% not use vertex labels, which contain useful classification signal. • Using standard graph theoretic measures, like clustering coefficient, ignores vertex labels, and • SOA Machine learning techniques [2] using classical graph theory yield only 75% accuracy achieves only 75% accuracy even upon using sophisticated multivariate machine learning methods [2] • Extensions and further applications aplenty. incoherent estimator coherent estimator Methods 0.5 misclassification rate # signal−vertices 0.5 L π ˆˆ ˆ L n b = 0. 41 ˆ Connectome Inference = 0. 5 10 L c o h= 0. 16 0.4 • MR Connectome Automated Pipeline (MRCAP) [1] to infer connectomes 0.25 20 0.3 ˆ L i n c= 0. 27 • Vertices are neuroanatomical gyral regions [3], edges are estimated tracts using FACT [4] • 49 subjects from the Baltimore Longitudinal Study on Aging; 25 male, 24 female 30 0 0 1 2 3 0.16 10 10 10 10 200 400 600 800 1000 log size of signal subgraph size of signal subgraph some coherent estimators zoomed in coherent estimator misclassification rate 0.5 0.5 # star−vertices 15 0.4 18 0.25 0.3 0.16 21 0 0 1 2 3 0.16 10 10 10 10 400 500 600 log size of signal subgraph size of signal subgraph coherent signal subgraph estimate coherogram 30 20 20 vertex 20 40 40 10 60 60 0 20 40 60 0.04 0.14 0.29 0.55 vertex threshold Figure Legend (above): The top two panels depict the relative performances of the incoherent (left) and coherent (right) classifiers as a function of their hyper-parameters. The middle two depict misclassification rate (left) for a few different choices of # of signal vertices and (right) a zoomed in depiction of the top right panel. The bottom left panel shows the estimated signal subgraph, and the bottom right shows the coherogram. Together, these bottom panels suggest that the signal subgraph for these data is neither particularly coherent or incoherent. (below): The figure below visualizes the twelve signal subgraph nodes. Each subplot shows the signal subgraph induced by one of the 12 signal vertices estimated using the coherent classifier. There are 360 edges in the signal subgraph. MRCAP is available at: http://www.nitrc.org/projects/mrcap/ Model • Joint graph/class model • Each edge is an independent binary random variable • A subset of edges comprise the signal subgraph FGY = FG|Y FY = Bern(auv ; puv|y )πy (u,v)∈S Bern(auv ; puv ) (u,v)∈ES Classifier • Bayes plug-in classifier is asymptotically optimal • Robust estimators have better convergence properties than the MLE Synthetic Data Analysis y= ˆ Bern(auv ; puv|y )ˆy ˆ π • Simulations as true to real data as possible suggest model is not wholly unreasonable • Even under true model, we only expect about 50% of the identified edges are true signal ˆ (u,v)∈S edges with 50 samples • With only a few more samples, both misclassification rate and missed-edge rate drop precipitously Signal Subgraph Estimator incoherent estimator coherent estimator 1 misclassification rate • The signal subgraph could be all edges, an incoherent subset, or a coherent subset # star−vertices 0.75 0.7 • We devise a different estimator for the two special cases 10 • For each edge, we compute the significance of the difference between the two clases, using a 0.5 0.5 Fisherʼs exact test, which is optimal under the model 20 • The incoherent signal subgraph estimator chooses the s most significant edges 0.25 • The coherent signal subgraph estimator chooses the m most significant vertices, and then the 30 0.3 s most significant edges incident to those vertices 0 0.18 0 1 2 3 200 400 600 800 1000 10 10 10 10 log size of signal subgraph size of signal subgraph 1 0.5 misclassification rate missed−edge rate coh 0.4 inc 0.3 nb 0.5 0.2 0.1 0 0 20 40 60 80 100 0 20 40 60 80 100 # training samples # training samples Assumption Checking • Correlation matrix is significantly correlated, suggesting independent edge assumption is poor (data not shown) Discussion • MRCAP is an effective tool for high-throughput connectome inference •Signal subgraph classifiers significantly improve performance over standard classification FigureFigure 2: (Top) Gyral labelslabels and associated numeric indicesRef. 5). Connections Legend: (Top) Gyral and associated numeric indices (adapted from (adapted from [3]). results in both real and synthetic data between these regions, as revealed through the DTI tensor data, are quantified in terms of the mean • Synthetic data suggests a few additional datapoints could yield vastly improved performance Connections between these regions, as revealed through the DTI tensor data, are quantified in fractional anisotropy (FA) of the estimated fibers. (Bottom) Adjacency matrices illustrating connections • Assumption suggests performance improvements are despite some model inaccuracies, and terms of the mean regions (vertices) in female(FA)male brains. Each entry in these adjacencyAdjacency between gyral fractional anisotropy and of the estimated fibers. (Bottom) matrices generalized models might yield further improvements matrices illustrating connections between gyral gyral region indicated by the row index and terminating represents the mean FA of fibers originating in the regions (vertices) in female and male brains. • Standard graph theoretical tools are less effective and do not suggest a signal subgraph in the gyral region indicated by the column index, averaged across all subjects from each sex. The Each entry in these adjacency matrices represents the mean FA of fibers originating in the gyral significance of the difference (uncorrected, exact p-values) between female and male brains, computed region with Fisher’sby the row also shown. In all plots, lighter the gyralimplies higher values.by the column indicated exact test, is index and terminating in coloration region indicated Only the lower index, triangle is shown becausesubjects from each sex.and therefore the adjacency matrices are averaged across all these graphs are undirected The significance of the difference References (uncorrected, exact p-values) assigned to the left hemisphere; 36–70 are assigned to the right symmetric. Labels 1–35 are between female and male brains, computed with Fisher’s exact hemisphere. [1] Gray et al, submitted and available at: http://www.nitrc.org/projects/mrcap/. . test, is also shown. In all plots, lighter coloration implies higher values. Only the lower triangle [2] Drezde et al, 2008. is shown because these graphs are undirected and therefore the adjacency matrices are [3] Desikan et al, 2006. symmetric. Labels 1–35 are assigned to the left hemisphere; 36–70 are assigned to the right [4] Mori,et al. 1999. hemisphere.