SlideShare ist ein Scribd-Unternehmen logo
1 von 40
Downloaden Sie, um offline zu lesen
Medical image analysis and
big data evaluation infrastructures
Henning Müller
HES-SO &
Martinos Center
Overview
• Medical image analysis & retrieval projects
• 3D texture modeling
– Graph models for data analysis
• Big data/data science evaluationinfrastructures
– ImageCLEF
– VISCERAL
– EaaS – Evaluation as a Service
• What comes next?
Henning Müller
• Studies in medical informatics in
Heidelberg, Germany
– Work in Portland, OR, USA
• PhD in image processingin Geneva,
focus on image analysis and retrieval
– Exchange at Monash Univ., Melbourne, Australia
• Prof. at Univ. of Geneva in medicine (2014)
– Medical image analysis and retrieval for decision
support
• Professor at the HES-SO Valais (2007)
– Head of the eHealth unit
• Sabbaticalat the Martinos Center, Boston, MA
Medical imaging is big data!!
• Much imaging datais produced
• Imaging data are very complex
– And getting more complex
• Imaging is essential for
diagnosis and treatment
• Images out of their context
loose most of their sense
– Clinical data are necessary
• Evidence-based medicine&
case-basedreasoning
Medical image retrieval (history)
• MedGIFT project started in 2002
– Global image similarity
• Texture, grey levels
– Teaching files
– Linking text files and
image similarity
• Often data not available
– Medical data hard to get
– Images and text are
connected in cases
• Unrealistic expectations, high quality vs. browsing
– Semantic gap
• Mixing multilingualdata from many resources
and semantic information for medical retrieval
– LinkedLifeData.com
Allan Hanbury, Célia Boyer, Manfred Gschwandtner, Henning Müller, KHRESMOI: Towards
a Multi-Lingual Search and Access System for Biomedical Information, Med-e-Tel, pages
412-416, Luxembourg, 2011.
The informed patient
Integrated interfaces
Prototypes available
• http://everyone.khresmoi.eu/
• http://radiology.khresmoi.eu/
– Ask for login
• http://shangrila.khresmoi.eu/
• http://shambala.khresmoi.eu/
Texture analysis (2D->3D->4D)
• Describe various tissue types
– Brain, lung, …
– Concentration on 3D and 4D data
– Mainly texture descriptors
• Extract visual features/signatures
– Learned, so relation to deep learning
Adrien Depeursinge, Antonio Foncubierta–Rodriguez, Dimitri Van de Ville, and Henning
Müller, Three–dimensional solid texture analysis and retrieval: review and opportunities,
Medical Image Analysis, volume 18, number 1, pages 176-196, 2014.
Database with CT image of
interstitial lung diseases
• 128 cases with CT image series and biopsy
confirmed diagnosis
• Manually annotated regions for tissue classes (1946)
– 6 tissue types of 13 with a larger number of examples
• 159 clinical parameters extracted (sparse)
– Smoking history, age, gender,
hematocrit, …
• Availableafter signing a
license agreement
Learned 3D signatures
• Learn combinations of Riesz wavelets as digital
signatures using SVMs (steerable filters)
– Create signatures to detect small local lesions
and visualize them
Adrien Depeursinge, Antonio Foncubierta–Rodriguez, Dimitri Van de Ville, and Henning
Müller, Rotation–covariant feature learning using steerable Riesz wavelets, IEEE
Transactions on Image Processing, volume 23, number 2, page 898-908, 2014.
Learning Riesz in 3D
• Most medical tissues are naturally 3D
• But modeling gets much more complex
– Vertical planes
– 3-D checkerboard
– 3-D wiggled
checkerboard
Aiding clinical decisions
Using graphs for lung data analysis
• Pulmonary hypertension andpulmonary embolism
– Dual energy CT (DECT)
• Based on a simple lung atlas
– Not based on lobes
• Analyzing relationships between the
lung areas and their perfusion
• Differences in statistical moments
for areas as features
– Can easily be combined with texture
• Good quality for PH (77%) and PE (79%)
– DECT important for PE
Yashin Dicente Cid, Henning Müller, Alexandra Platon, JP Janssens, Frédéric Lador, PA
Poletti, A. Depeursinge, A Lung Graph-Model for Pulmonary Hypertension and Pulmonary
Embolism Detection on DECT images, submitted to MICCAI 2016, Athens Greece, 2016.
Scientific challenges/crowdsourcing
• Most conferences now organize challenge
sessions (MICCAI, ISBI, Grand Challenges, …)
• Public administrations use it increasingly
– NCI uses it: Coding4Cancer
– http://www.challenge.gov/
• Commercial challengeplatforms
– Kaggle, Topcoder, also Netflix challenge
• Open innovation in data science
• Amazon Mechanical Turk for small tasks,
includingmedical image analysis
– But also: https://www.cellslider.net/
• Benchmark on multimodal imageretrieval
– Run since 2003, medical task since 2004
– Part of the Cross Language Evaluation Forum (CLEF)
• Many tasks related to image retrieval
– Image classification
– Image-based retrieval
– Case-based retrieval
– Compound figure separation
– Caption prediction
– …
• Many old databases remain available, imageclef.org
Henning Müller, Paul Clough, Thomas
Deselaers, Barbara Caputo, ImageCLEF –
Experimental evaluation of visual information
retrieval, Springer, 2010.
Challenges with challenges
• Difficult to distribute very big datasets
– Sending around hard disks? Risky, expensive
• Sharing confidentialdata
– Big data is impossible to anonymize automatically
• Quickly changing data sets
– Outdated when a test collection is being created
• Optimizationson the test data are possible
– Manual adaptations, etc.
– Often hard to fully reproduce results
• Groups without large computing are disadvantaged
cloud
Test
Resources available
Test DataTraining Data
Participants Organiser
Participant
Virtual
MachinesRegistration
System
Annotation
Management System
Analysis
System
Annotators
(Radiologists)
Locally Installed
Annotation
Clients
Microsoft
Azure
Cloud
Test Data
Silver corpus (example trachea)
• Executable code of all participants
– Run it on new data, do label fusion
Dice 0.85 Dice 0.71 Dice 0.84 Dice 0.83
Participant segmentations
Dice 0.92
Silver Corpus
Docker vs. Virtual MachinesContainers
Bins/Libs
VM
VS.
Evaluation as a Service (EaaS)
• Moving the algorithms to the data not vice versa
– Required when data are: very large, changing
quickly, confidential (medical, commercial, …)
• Different approaches
– Source code submission, APIs, VMs local or in the
cloud, Docker containers, specific frameworks
• Allows for continuous evaluation, component-
based evaluation, total reproducibility, updates, …
– Workshop March 2015 in Sierre on EaaS
– Workshop November 2015 in Boston on cloud-
based evaluation (http://www.martinos.org/cloudWorkshop/)
Allan Hanbury, Henning Müller, Georg Langs, Marc André Weber, Bjoern H. Menze, and Tomas
Salas Fernandez, Bringing the algorithms to the data: cloud–based benchmarking for medical
image analysis, CLEF conference, Springer Lecture Notes in Computer Science, 2012.
Sharing images, research data
• Very important aspect of research is to have solid
methods, data, large if possible
– If data not available, results can not be reproduced
– If data are small, results may be meaningless
• Many multi-center projects spend most money on
data acquisition, often delayed no time for analysis
– IRB takes long, sometimes restrictions are strange
• Research is international!
• NIH & NCI are great to push data availability
– But data can be made available in an unusable way
Political support for research
infrastructures!
Sustaining biomedical big data
Microsoft Azure
Intels CCC
Institutional support (NCI)
• Using crowdsourcing to link researchers & challenges
Business models for these links
• Manually annotate large data sets for challenges
– Data needs to be available in a secure space
• Have researchers work on data (on infrastructure)
– Deliver code in Docker containers
• Commercialize results and share benefits
• Part of QIN – Quantitative Imaging Network (NCI)
– Cloud-Based Image Biomarker Optimization Platform
• Create challenges for QIN to validate tools
• Use Codalabto run project challenges
– Run code in containers (Docker), well integrated
– Share code blocks across teams, evaluate
combinations
Codalab
• Open Source challenge platform supported by
Microsoft
– Integrated with the Azure cloud infrastructure
• Easy creation of new challenges
• Participant registration, leaderboardof results
CodaLab worksheets
• Running computational experiments
• Execute Docker containers
– Workflow of Docker containers
– Foster collaboration and component reuse
Future of research infrastructures
• Much more centered around data!!
– Nature Scientific Data underlines the importance!
• Data need to be available but in a meaningful way
– Infrastructure needs to be available and way to
evaluate on the data with specific tasks
• More work for data preparation but in line with IRB
– Analysis inside medical institutions
• Code will become even more portable
– Docker helps enormously and develops quickly
• Public private partnerships to be sustainable
• Total reproducibility, long term, sharing tools
• Much higher efficiency
Conclusions
• Medicine is digital medicine
– More data and more complex links (genes, visual,
signals, …)
• Medical data science requires new infrastructures
– Use routine data, not manually extracted, curated
data, curate large scale, accommodate for errors
• Active learning and interactive data curation
– Use large data sets from data warehouses
– Keep data where they are produced
• More “local” computation, so where data are
– Secure aggregation of results
• Sharing infrastructures, data and more
Contact
• More information canbe found at
– http://khresmoi.eu/
– http://visceral.eu/
– http://medgift.hevs.ch/
– http://publications.hevs.ch/
• Contact:
– Henning.mueller@hevs.ch
Medical image analysis and big data evaluation infrastructures

Weitere ähnliche Inhalte

Was ist angesagt?

2014 Medical Imaging
2014 Medical Imaging2014 Medical Imaging
2014 Medical Imaging
Engku Fahmi
 

Was ist angesagt? (20)

Applying deep learning to medical data
Applying deep learning to medical dataApplying deep learning to medical data
Applying deep learning to medical data
 
2014 Medical Imaging
2014 Medical Imaging2014 Medical Imaging
2014 Medical Imaging
 
(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging
 
2015 04-18-wilson cg
2015 04-18-wilson cg2015 04-18-wilson cg
2015 04-18-wilson cg
 
Frankie Rybicki slide set for Deep Learning in Radiology / Medicine
Frankie Rybicki slide set for Deep Learning in Radiology / MedicineFrankie Rybicki slide set for Deep Learning in Radiology / Medicine
Frankie Rybicki slide set for Deep Learning in Radiology / Medicine
 
Machine Learning in Modern Medicine with Erin LeDell at Stanford Med
Machine Learning in Modern Medicine with Erin LeDell at Stanford MedMachine Learning in Modern Medicine with Erin LeDell at Stanford Med
Machine Learning in Modern Medicine with Erin LeDell at Stanford Med
 
Medical Segmentation Decathalon
Medical Segmentation DecathalonMedical Segmentation Decathalon
Medical Segmentation Decathalon
 
IRJET- Classifying Chest Pathology Images using Deep Learning Techniques
IRJET- Classifying Chest Pathology Images using Deep Learning TechniquesIRJET- Classifying Chest Pathology Images using Deep Learning Techniques
IRJET- Classifying Chest Pathology Images using Deep Learning Techniques
 
deep learning applications in medical image analysis brain tumor
deep learning applications in medical image analysis brain tumordeep learning applications in medical image analysis brain tumor
deep learning applications in medical image analysis brain tumor
 
Dli meetup moccia
Dli meetup mocciaDli meetup moccia
Dli meetup moccia
 
The XNAT imaging informatics platform
The XNAT imaging informatics platformThe XNAT imaging informatics platform
The XNAT imaging informatics platform
 
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
 
Artificial Intelligence in Medical Imaging
Artificial Intelligence in Medical ImagingArtificial Intelligence in Medical Imaging
Artificial Intelligence in Medical Imaging
 
5 Reasons Why Radiology Needs Artificial Intelligence
5 Reasons Why Radiology Needs Artificial Intelligence5 Reasons Why Radiology Needs Artificial Intelligence
5 Reasons Why Radiology Needs Artificial Intelligence
 
Artificial Intelligence and Diagnostics
Artificial Intelligence and DiagnosticsArtificial Intelligence and Diagnostics
Artificial Intelligence and Diagnostics
 
Artificial intelligence in radiology
Artificial intelligence in radiologyArtificial intelligence in radiology
Artificial intelligence in radiology
 
Role of artificial intellegence (a.i) in radiology department nitish virmani
Role of artificial intellegence (a.i) in radiology department nitish virmaniRole of artificial intellegence (a.i) in radiology department nitish virmani
Role of artificial intellegence (a.i) in radiology department nitish virmani
 
Sleep quality prediction from wearable data using deep learning
Sleep quality prediction from wearable data using deep learningSleep quality prediction from wearable data using deep learning
Sleep quality prediction from wearable data using deep learning
 
Use Cases Machine Learning for Healthcare
Use Cases Machine Learning for HealthcareUse Cases Machine Learning for Healthcare
Use Cases Machine Learning for Healthcare
 
Social Networks and Collaborative Platforms for Data Sharing in Radiology
Social Networks and Collaborative Platforms for Data Sharing in RadiologySocial Networks and Collaborative Platforms for Data Sharing in Radiology
Social Networks and Collaborative Platforms for Data Sharing in Radiology
 

Andere mochten auch

Andere mochten auch (11)

How to detect soft falls on devices
How to detect soft falls on devicesHow to detect soft falls on devices
How to detect soft falls on devices
 
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
A 3-D Riesz-Covariance Texture Model for the Prediction of Nodule Recurrence ...
 
Quelle(s) valeur(s) pour le leadership stratégique ?
Quelle(s) valeur(s) pour le leadership stratégique ?Quelle(s) valeur(s) pour le leadership stratégique ?
Quelle(s) valeur(s) pour le leadership stratégique ?
 
Nejm Medical Image Challenge
Nejm Medical Image ChallengeNejm Medical Image Challenge
Nejm Medical Image Challenge
 
Anti-GBM Diseases
Anti-GBM DiseasesAnti-GBM Diseases
Anti-GBM Diseases
 
Medical image analysis
Medical image analysisMedical image analysis
Medical image analysis
 
MRCP MOCK EXAM
MRCP MOCK EXAMMRCP MOCK EXAM
MRCP MOCK EXAM
 
Social media research in the health domain (tutorial) - [part 1]
Social media research in the health domain (tutorial) - [part 1]Social media research in the health domain (tutorial) - [part 1]
Social media research in the health domain (tutorial) - [part 1]
 
Medical image analysis
Medical image analysisMedical image analysis
Medical image analysis
 
Picture Medicine
Picture MedicinePicture Medicine
Picture Medicine
 
Internal Medicine Image Challenge MCQs
Internal Medicine Image Challenge MCQsInternal Medicine Image Challenge MCQs
Internal Medicine Image Challenge MCQs
 

Ähnlich wie Medical image analysis and big data evaluation infrastructures

Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013
Thearkvalais
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Nolan Nichols
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
Jordan Engbers
 

Ähnlich wie Medical image analysis and big data evaluation infrastructures (20)

MedGIFT projects in medical imaging
MedGIFT projects in medical imagingMedGIFT projects in medical imaging
MedGIFT projects in medical imaging
 
Share and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next levelShare and Reuse: how data sharing can take your research to the next level
Share and Reuse: how data sharing can take your research to the next level
 
Medical 3D data retrieval
Medical 3D data retrievalMedical 3D data retrieval
Medical 3D data retrieval
 
Information Access to Medical Image Data: from Big Data to Semantics - Academ...
Information Access to Medical Image Data: from Big Data to Semantics - Academ...Information Access to Medical Image Data: from Big Data to Semantics - Academ...
Information Access to Medical Image Data: from Big Data to Semantics - Academ...
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Rdm slides march 2014
Rdm slides march 2014Rdm slides march 2014
Rdm slides march 2014
 
The MedRed Ontology for Representing Clinical Data Acquisition Metadata
The MedRed Ontology for Representing Clinical Data Acquisition MetadataThe MedRed Ontology for Representing Clinical Data Acquisition Metadata
The MedRed Ontology for Representing Clinical Data Acquisition Metadata
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013
 
eHealth unit HES-SO in Sierre
eHealth unit HES-SO in SierreeHealth unit HES-SO in Sierre
eHealth unit HES-SO in Sierre
 
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
Reproducibility in human cognitive neuroimaging: a community-­driven data sha...
 
Meeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human HealthMeeting the Computational Challenges Associated with Human Health
Meeting the Computational Challenges Associated with Human Health
 
PRISM Project Update
PRISM Project UpdatePRISM Project Update
PRISM Project Update
 
Hadoop Enabled Healthcare
Hadoop Enabled HealthcareHadoop Enabled Healthcare
Hadoop Enabled Healthcare
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
 
Introduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia ResearchIntroduction to Big Data and its Potential for Dementia Research
Introduction to Big Data and its Potential for Dementia Research
 
Big Data in Pediatric Critical Care by Mohit Mehra
Big Data in Pediatric Critical Care by Mohit MehraBig Data in Pediatric Critical Care by Mohit Mehra
Big Data in Pediatric Critical Care by Mohit Mehra
 
Standards in health informatics - problem, clinical models and terminology
Standards in health informatics - problem, clinical models and terminologyStandards in health informatics - problem, clinical models and terminology
Standards in health informatics - problem, clinical models and terminology
 
Taming the Big Data Beast - Together
Taming the Big Data Beast - TogetherTaming the Big Data Beast - Together
Taming the Big Data Beast - Together
 

Mehr von Institute of Information Systems (HES-SO)

Solar production prediction based on non linear meteo source adaptation
Solar production prediction based on non linear meteo source adaptationSolar production prediction based on non linear meteo source adaptation
Solar production prediction based on non linear meteo source adaptation
Institute of Information Systems (HES-SO)
 
3D Riesz-wavelet Based Covariance Descriptors for Texture Classi cation of Lu...
3D Riesz-wavelet Based Covariance Descriptors for Texture Classication of Lu...3D Riesz-wavelet Based Covariance Descriptors for Texture Classication of Lu...
3D Riesz-wavelet Based Covariance Descriptors for Texture Classi cation of Lu...
Institute of Information Systems (HES-SO)
 

Mehr von Institute of Information Systems (HES-SO) (20)

MIE20232.pptx
MIE20232.pptxMIE20232.pptx
MIE20232.pptx
 
Classification of noisy free-text prostate cancer pathology reports using nat...
Classification of noisy free-text prostate cancer pathology reports using nat...Classification of noisy free-text prostate cancer pathology reports using nat...
Classification of noisy free-text prostate cancer pathology reports using nat...
 
Machine learning assisted citation screening for Systematic Reviews - Anjani ...
Machine learning assisted citation screening for Systematic Reviews - Anjani ...Machine learning assisted citation screening for Systematic Reviews - Anjani ...
Machine learning assisted citation screening for Systematic Reviews - Anjani ...
 
Exploiting biomedical literature to mine out a large multimodal dataset of ra...
Exploiting biomedical literature to mine out a large multimodal dataset of ra...Exploiting biomedical literature to mine out a large multimodal dataset of ra...
Exploiting biomedical literature to mine out a large multimodal dataset of ra...
 
L'IoT dans les usines. Quels avantages ?
L'IoT dans les usines. Quels avantages ?L'IoT dans les usines. Quels avantages ?
L'IoT dans les usines. Quels avantages ?
 
Studying Public Medical Images from Open Access Literature and Social Network...
Studying Public Medical Images from Open Access Literature and Social Network...Studying Public Medical Images from Open Access Literature and Social Network...
Studying Public Medical Images from Open Access Literature and Social Network...
 
Risques opérationnels et le système de contrôle interne : les limites d’un te...
Risques opérationnels et le système de contrôle interne : les limites d’un te...Risques opérationnels et le système de contrôle interne : les limites d’un te...
Risques opérationnels et le système de contrôle interne : les limites d’un te...
 
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
Le contrôle interne dans les administrations publiques tient-il toutes ses pr...
 
Le système de contrôle interne : Présentation générale, enjeux et méthodes
Le système de contrôle interne : Présentation générale, enjeux et méthodesLe système de contrôle interne : Présentation générale, enjeux et méthodes
Le système de contrôle interne : Présentation générale, enjeux et méthodes
 
Crowdsourcing-based Mobile Application for Wheelchair Accessibility
Crowdsourcing-based Mobile Application for Wheelchair AccessibilityCrowdsourcing-based Mobile Application for Wheelchair Accessibility
Crowdsourcing-based Mobile Application for Wheelchair Accessibility
 
NOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
NOSE: une approche Smart-City pour les zones périphériques et extra-urbainesNOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
NOSE: une approche Smart-City pour les zones périphériques et extra-urbaines
 
FUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSIS
FUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSISFUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSIS
FUNDAMENTALS OF TEXTURE PROCESSING FOR BIOMEDICAL IMAGE ANALYSIS
 
MOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLS
MOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLSMOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLS
MOBILE COLLECTION AND DISSEMINATION OF SENIORS’ SKILLS
 
Enhanced Students Laboratory The GET project
Enhanced Students Laboratory The GET projectEnhanced Students Laboratory The GET project
Enhanced Students Laboratory The GET project
 
Solar production prediction based on non linear meteo source adaptation
Solar production prediction based on non linear meteo source adaptationSolar production prediction based on non linear meteo source adaptation
Solar production prediction based on non linear meteo source adaptation
 
Exploring the New Trends of Chinese Tourists in Switzerland
Exploring the New Trends of Chinese Tourists in SwitzerlandExploring the New Trends of Chinese Tourists in Switzerland
Exploring the New Trends of Chinese Tourists in Switzerland
 
Social Media Data analyzis and Semantics for Tourism Understanding
Social Media Data analyzis and Semantics for Tourism UnderstandingSocial Media Data analyzis and Semantics for Tourism Understanding
Social Media Data analyzis and Semantics for Tourism Understanding
 
Valeurs et management agile
Valeurs et management agileValeurs et management agile
Valeurs et management agile
 
3D Riesz-wavelet Based Covariance Descriptors for Texture Classi cation of Lu...
3D Riesz-wavelet Based Covariance Descriptors for Texture Classication of Lu...3D Riesz-wavelet Based Covariance Descriptors for Texture Classication of Lu...
3D Riesz-wavelet Based Covariance Descriptors for Texture Classi cation of Lu...
 
Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...
Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...
Texture-Based Computational Models of Tissue in Biomedical Images: Initial Ex...
 

Kürzlich hochgeladen

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Kürzlich hochgeladen (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Medical image analysis and big data evaluation infrastructures

  • 1. Medical image analysis and big data evaluation infrastructures Henning Müller HES-SO & Martinos Center
  • 2. Overview • Medical image analysis & retrieval projects • 3D texture modeling – Graph models for data analysis • Big data/data science evaluationinfrastructures – ImageCLEF – VISCERAL – EaaS – Evaluation as a Service • What comes next?
  • 3. Henning Müller • Studies in medical informatics in Heidelberg, Germany – Work in Portland, OR, USA • PhD in image processingin Geneva, focus on image analysis and retrieval – Exchange at Monash Univ., Melbourne, Australia • Prof. at Univ. of Geneva in medicine (2014) – Medical image analysis and retrieval for decision support • Professor at the HES-SO Valais (2007) – Head of the eHealth unit • Sabbaticalat the Martinos Center, Boston, MA
  • 4. Medical imaging is big data!! • Much imaging datais produced • Imaging data are very complex – And getting more complex • Imaging is essential for diagnosis and treatment • Images out of their context loose most of their sense – Clinical data are necessary • Evidence-based medicine& case-basedreasoning
  • 5. Medical image retrieval (history) • MedGIFT project started in 2002 – Global image similarity • Texture, grey levels – Teaching files – Linking text files and image similarity • Often data not available – Medical data hard to get – Images and text are connected in cases • Unrealistic expectations, high quality vs. browsing – Semantic gap
  • 6. • Mixing multilingualdata from many resources and semantic information for medical retrieval – LinkedLifeData.com Allan Hanbury, Célia Boyer, Manfred Gschwandtner, Henning Müller, KHRESMOI: Towards a Multi-Lingual Search and Access System for Biomedical Information, Med-e-Tel, pages 412-416, Luxembourg, 2011.
  • 9. Prototypes available • http://everyone.khresmoi.eu/ • http://radiology.khresmoi.eu/ – Ask for login • http://shangrila.khresmoi.eu/ • http://shambala.khresmoi.eu/
  • 10. Texture analysis (2D->3D->4D) • Describe various tissue types – Brain, lung, … – Concentration on 3D and 4D data – Mainly texture descriptors • Extract visual features/signatures – Learned, so relation to deep learning Adrien Depeursinge, Antonio Foncubierta–Rodriguez, Dimitri Van de Ville, and Henning Müller, Three–dimensional solid texture analysis and retrieval: review and opportunities, Medical Image Analysis, volume 18, number 1, pages 176-196, 2014.
  • 11. Database with CT image of interstitial lung diseases • 128 cases with CT image series and biopsy confirmed diagnosis • Manually annotated regions for tissue classes (1946) – 6 tissue types of 13 with a larger number of examples • 159 clinical parameters extracted (sparse) – Smoking history, age, gender, hematocrit, … • Availableafter signing a license agreement
  • 12. Learned 3D signatures • Learn combinations of Riesz wavelets as digital signatures using SVMs (steerable filters) – Create signatures to detect small local lesions and visualize them Adrien Depeursinge, Antonio Foncubierta–Rodriguez, Dimitri Van de Ville, and Henning Müller, Rotation–covariant feature learning using steerable Riesz wavelets, IEEE Transactions on Image Processing, volume 23, number 2, page 898-908, 2014.
  • 13. Learning Riesz in 3D • Most medical tissues are naturally 3D • But modeling gets much more complex – Vertical planes – 3-D checkerboard – 3-D wiggled checkerboard
  • 15. Using graphs for lung data analysis • Pulmonary hypertension andpulmonary embolism – Dual energy CT (DECT) • Based on a simple lung atlas – Not based on lobes • Analyzing relationships between the lung areas and their perfusion • Differences in statistical moments for areas as features – Can easily be combined with texture • Good quality for PH (77%) and PE (79%) – DECT important for PE Yashin Dicente Cid, Henning Müller, Alexandra Platon, JP Janssens, Frédéric Lador, PA Poletti, A. Depeursinge, A Lung Graph-Model for Pulmonary Hypertension and Pulmonary Embolism Detection on DECT images, submitted to MICCAI 2016, Athens Greece, 2016.
  • 16. Scientific challenges/crowdsourcing • Most conferences now organize challenge sessions (MICCAI, ISBI, Grand Challenges, …) • Public administrations use it increasingly – NCI uses it: Coding4Cancer – http://www.challenge.gov/ • Commercial challengeplatforms – Kaggle, Topcoder, also Netflix challenge • Open innovation in data science • Amazon Mechanical Turk for small tasks, includingmedical image analysis – But also: https://www.cellslider.net/
  • 17. • Benchmark on multimodal imageretrieval – Run since 2003, medical task since 2004 – Part of the Cross Language Evaluation Forum (CLEF) • Many tasks related to image retrieval – Image classification – Image-based retrieval – Case-based retrieval – Compound figure separation – Caption prediction – … • Many old databases remain available, imageclef.org Henning Müller, Paul Clough, Thomas Deselaers, Barbara Caputo, ImageCLEF – Experimental evaluation of visual information retrieval, Springer, 2010.
  • 18. Challenges with challenges • Difficult to distribute very big datasets – Sending around hard disks? Risky, expensive • Sharing confidentialdata – Big data is impossible to anonymize automatically • Quickly changing data sets – Outdated when a test collection is being created • Optimizationson the test data are possible – Manual adaptations, etc. – Often hard to fully reproduce results • Groups without large computing are disadvantaged
  • 19. cloud
  • 20. Test
  • 22.
  • 23. Test DataTraining Data Participants Organiser Participant Virtual MachinesRegistration System Annotation Management System Analysis System Annotators (Radiologists) Locally Installed Annotation Clients Microsoft Azure Cloud Test Data
  • 24. Silver corpus (example trachea) • Executable code of all participants – Run it on new data, do label fusion Dice 0.85 Dice 0.71 Dice 0.84 Dice 0.83 Participant segmentations Dice 0.92 Silver Corpus
  • 25. Docker vs. Virtual MachinesContainers Bins/Libs VM VS.
  • 26. Evaluation as a Service (EaaS) • Moving the algorithms to the data not vice versa – Required when data are: very large, changing quickly, confidential (medical, commercial, …) • Different approaches – Source code submission, APIs, VMs local or in the cloud, Docker containers, specific frameworks • Allows for continuous evaluation, component- based evaluation, total reproducibility, updates, … – Workshop March 2015 in Sierre on EaaS – Workshop November 2015 in Boston on cloud- based evaluation (http://www.martinos.org/cloudWorkshop/) Allan Hanbury, Henning Müller, Georg Langs, Marc André Weber, Bjoern H. Menze, and Tomas Salas Fernandez, Bringing the algorithms to the data: cloud–based benchmarking for medical image analysis, CLEF conference, Springer Lecture Notes in Computer Science, 2012.
  • 27. Sharing images, research data • Very important aspect of research is to have solid methods, data, large if possible – If data not available, results can not be reproduced – If data are small, results may be meaningless • Many multi-center projects spend most money on data acquisition, often delayed no time for analysis – IRB takes long, sometimes restrictions are strange • Research is international! • NIH & NCI are great to push data availability – But data can be made available in an unusable way
  • 28. Political support for research infrastructures!
  • 32. Institutional support (NCI) • Using crowdsourcing to link researchers & challenges
  • 33. Business models for these links • Manually annotate large data sets for challenges – Data needs to be available in a secure space • Have researchers work on data (on infrastructure) – Deliver code in Docker containers • Commercialize results and share benefits
  • 34. • Part of QIN – Quantitative Imaging Network (NCI) – Cloud-Based Image Biomarker Optimization Platform • Create challenges for QIN to validate tools • Use Codalabto run project challenges – Run code in containers (Docker), well integrated – Share code blocks across teams, evaluate combinations
  • 35. Codalab • Open Source challenge platform supported by Microsoft – Integrated with the Azure cloud infrastructure • Easy creation of new challenges • Participant registration, leaderboardof results
  • 36. CodaLab worksheets • Running computational experiments • Execute Docker containers – Workflow of Docker containers – Foster collaboration and component reuse
  • 37. Future of research infrastructures • Much more centered around data!! – Nature Scientific Data underlines the importance! • Data need to be available but in a meaningful way – Infrastructure needs to be available and way to evaluate on the data with specific tasks • More work for data preparation but in line with IRB – Analysis inside medical institutions • Code will become even more portable – Docker helps enormously and develops quickly • Public private partnerships to be sustainable • Total reproducibility, long term, sharing tools • Much higher efficiency
  • 38. Conclusions • Medicine is digital medicine – More data and more complex links (genes, visual, signals, …) • Medical data science requires new infrastructures – Use routine data, not manually extracted, curated data, curate large scale, accommodate for errors • Active learning and interactive data curation – Use large data sets from data warehouses – Keep data where they are produced • More “local” computation, so where data are – Secure aggregation of results • Sharing infrastructures, data and more
  • 39. Contact • More information canbe found at – http://khresmoi.eu/ – http://visceral.eu/ – http://medgift.hevs.ch/ – http://publications.hevs.ch/ • Contact: – Henning.mueller@hevs.ch