SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Downloaden Sie, um offline zu lesen
1
Using Text Mining to Explore 
Concept Complexity in Obesity
through Concept Maps
George Karystianis
School of Computer Science
Supervisors: Goran Nenadic, Iain Buchan
Advisor: Andrea Schalk
2
Motivation
● Complex nature of obesity.
● Wide range of biomedical data sources available.
– implementation of biomedical text/data mining.
● Possible to reveal hidden links between obesity and other
diseases.
● Partial completed knowledge representation models of obesity.
● A systematic approach required for:
– analysis and interpretation of clinical knowledge.
3
Concept Maps
● Knowledge representation models.
● Consisted of:
– nodes (concepts).
– links (relationships between the nodes).
● Aim: gather, understand, explore knowledge.
● Variety of users.
● No explicit detail.
● Implemented primarily in education.
4
Concept Map Example
5
Aim
● To design a framework to build/enhance medical concept maps.
● To improve the understanding of health care concept
complexity.
● Assist medical professionals in the representation, exploration
and validation of their expert knowledge.
● Improvement of the clinical health care.
6
Objectives
● Design and implement methods for health care concept
detection.
● Concept organisation in a concept map form.
● Method generation for concept map updates.
● Build a framework for the design/enhancement/validation of
medical concept maps.
● Methodology evaluation through the health problem of obesity:
– validation of obesity related concepts with current structured obesity
information available.
– identify gaps in clinical knowledge.
7
Research Hypothesis &
Questions
-The analysis required to extract health care concepts.
-The approach to built and enhance a concept map.
-The concept map contribution in the representation/validation of knowledge.
-The text mining results help to understand/explore clinical problems.
Biomedical
Text Mining
Scientific
literature
Concept
map
Improvement of
health care
Framework
8
Obesity
● Worldwide problem.
● Epidemic proportions:
– WHO rates (2005): 1.6 billion overweight, 400 million obese.
● Associations to various diseases.
● Complex risk factors and complications.
● Various aspects.
● Lots of research.
9
10
Biomedical Text Mining
● Extraction of information from unstructured data of biomedical
nature.
● Discovery of new, previously unknown knowledge.
● Performed on documents with complex/specific terminology and
expressions.
● Challenges:
– language ambiguity.
– variation of language expression.
● Various tools and applications (Termine, Whatizit, GATE).
● Adaptation to user's tasks and requirements.
11
What we are looking for?
● Risk Factors
● Causal Factors
● Confounding Factors
● Outcomes
● Complications
● Interventions
● ...
12
Methodology Overview
1. Document retrieval.
2. Term/concept extraction.
3. Feature engineering and Information extraction:
- application of classification/clustering techniques.
4. Concept map design.
13
Evaluation-Obesity Case Study
● Comparison:
– What ?
● biomedical text mining results.
● concept map information.
– How ?
● concepts and relationships.
● New ones.
● Examination/manipulation/validation of new knowledge by experts.
● Enhancement of the concept map.
14
Progress so far (1)
● Corpus collection.
● Application of Automated Term Recognition (ATR).
● C-value method.
● Single word ATR:
– terminological head identification.
– word of a multi-word term that defines the term class.
– example:
● “Childhood diabetes type II”.
● Terminological head: “diabetes”.
15
Progress so far (2)
● Ranking head measures:
– total head frequency,
– single head frequency,
– maximum and average C-value,
– abstract frequency,
– ratio of single head frequency/total head frequency,
– tf*idf (term frequency*inverse document frequency).
16
Results
tf*idf total freq single freq abstract freq word freq max_c aver_c ratio
0
5
10
15
20
25
30
35
40
45
0
10
20
30
40
50
Statistical measure
Numberofkeywords
17
Progress so far (3)
● Pattern extraction from abstracts for:
– risk, confounding and causal factors,
– interventions,
– complications,
– outcomes.
Obesity risk is increased among women with psychiatric disorders
Potential risk factor
18
Example
Potential risk factors Potential interventions Potential complications
19
Future plan
Species identification in obesity corpus (Linneus)
Exploration of single word terms ATR
Calculation of z-score
Integration of single and multi-word terms
Lexical/semantic analysis of the existing concept map
Paper preparation for the extraction of single terms in text
Pattern extraction from manual analysis
Pattern rule design with Minor Third
Feature engineering
Clustering
Classification
Paper preparation for the classification of disease descriptors
Paper preparation for the clustering of health care concepts
Integration of the results
Preparation of the second year interview/report
Design of concept map relationships (exploration)
Application of visual mapping tools
Update of the new concept map
Comparison and validation of knowledge
Exploration of concept complexity in obesity
Paper preparation for the automatic design of clinical concept maps
Produced generic framework of the methodology
Writing the thesis
October 2010 April 2011 November 2011 May 2012
Year 3
Year 2
Date
Year 2 (1/2): Concept extraction
20
Future plan
Species identification in obesity corpus (Linneus)
Exploration of single word terms ATR
Calculation of z-score
Integration of single and multi-word terms
Lexical/semantic analysis of the existing concept map
Paper preparation for the extraction of single terms in text
Pattern extraction from manual analysis
Pattern rule design with Minor Third
Feature engineering
Clustering
Classification
Paper preparation for the classification of disease descriptors
Paper preparation for the clustering of health care concepts
Integration of the results
Preparation of the second year interview/report
Design of concept map relationships (exploration)
Application of visual mapping tools
Update of the new concept map
Comparison and validation of knowledge
Exploration of concept complexity in obesity
Paper preparation for the automatic design of clinical concept maps
Produced generic framework of the methodology
Writing the thesis
October 2010 April 2011 November 2011 May 2012
Year 3
Year 2
Date
Year 2 (2/2): Concept structuring
21
Future plan
Species identification in obesity corpus (Linneus)
Exploration of single word terms ATR
Calculation of z-score
Integration of single and multi-word terms
Lexical/semantic analysis of the existing concept map
Paper preparation for the extraction of single terms in text
Pattern extraction from manual analysis
Pattern rule design with Minor Third
Feature engineering
Clustering
Classification
Paper preparation for the classification of disease descriptors
Paper preparation for the clustering of health care concepts
Integration of the results
Preparation of the second year interview/report
Design of concept map relationships (exploration)
Application of visual mapping tools
Update of the new concept map
Comparison and validation of knowledge
Exploration of concept complexity in obesity
Paper preparation for the automatic design of clinical concept maps
Produced generic framework of the methodology
Writing the thesis
October 2010 April 2011 November 2011 May 2012
Year 3
Year 2
Date
Year 3: Design of the medical concept map
22
Summary
● Framework creation for clinical concept map building and
enhancement.
● Improved understanding of health care concept complexity.
● So far:
– comprehension of literature review.
– methodology design.
– single ATR.
– pattern design.
23
End
Acknowledgements
2. School of Computer Science
University of Manchester
1. Medical Research Council

Weitere ähnliche Inhalte

Ähnlich wie First year present

Data Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White PaperData Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White PaperNicholas Tenhue
 
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...healthcareisi
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19openEHR-Japan
 
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...Maulik Kamdar
 
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docxalinainglis
 
BDCC-06-00004.pdf
BDCC-06-00004.pdfBDCC-06-00004.pdf
BDCC-06-00004.pdfAsiyaKhan63
 
Secinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_makingSecinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_makingNethminiWijesinghe
 
Massey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL SessionMassey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL SessionMartin McMorrow
 
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...Universitat Politècnica de València
 
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...Vlad Manea
 
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Thien Q. Tran
 
Biomedical Informatics
Biomedical InformaticsBiomedical Informatics
Biomedical Informaticsimprovemed
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...IJERA Editor
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...IJERA Editor
 
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm OptimizationCase Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimizationrahulmonikasharma
 
Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Nicola Amoroso
 

Ähnlich wie First year present (20)

Data Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White PaperData Visuallization for Decision Making - Intel White Paper
Data Visuallization for Decision Making - Intel White Paper
 
MVilla IUI 2012 Lisbon
MVilla IUI 2012 LisbonMVilla IUI 2012 Lisbon
MVilla IUI 2012 Lisbon
 
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
Creating Archetypes For Patient Assessment With Nurses To Facilitate Shared P...
 
openEHR template development for COVID-19
openEHR template development for COVID-19openEHR template development for COVID-19
openEHR template development for COVID-19
 
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
Elsevier's Healthcare Knowledge Graph: An Actionable Medical Knowledge Platfo...
 
Mapping innovation missions
Mapping innovation missionsMapping innovation missions
Mapping innovation missions
 
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
52 NURSERESEARCHER 2011, 18, 2issues in researchQualit.docx
 
BDCC-06-00004.pdf
BDCC-06-00004.pdfBDCC-06-00004.pdf
BDCC-06-00004.pdf
 
Secinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_makingSecinaro et al-2021-bmc_medical_informatics_and_decision_making
Secinaro et al-2021-bmc_medical_informatics_and_decision_making
 
Massey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL SessionMassey University PhD Induction July 2018 NCTL Session
Massey University PhD Induction July 2018 NCTL Session
 
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
Link - Opportunities and Challenges for Research on Intelligent Algorithms fo...
 
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
PhD Thesis Defence: From Participation Factors to Co-Calibration of Patient- ...
 
36411
3641136411
36411
 
Medinfor Gesiti Hospitais
Medinfor Gesiti HospitaisMedinfor Gesiti Hospitais
Medinfor Gesiti Hospitais
 
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
 
Biomedical Informatics
Biomedical InformaticsBiomedical Informatics
Biomedical Informatics
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...
 
Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...Identifying Structures in Social Conversations in NSCLC Patients through the ...
Identifying Structures in Social Conversations in NSCLC Patients through the ...
 
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm OptimizationCase Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
Case Retrieval using Bhattacharya Coefficient with Particle Swarm Optimization
 
Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016
 

Kürzlich hochgeladen

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 

Kürzlich hochgeladen (20)

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 

First year present

  • 2. 2 Motivation ● Complex nature of obesity. ● Wide range of biomedical data sources available. – implementation of biomedical text/data mining. ● Possible to reveal hidden links between obesity and other diseases. ● Partial completed knowledge representation models of obesity. ● A systematic approach required for: – analysis and interpretation of clinical knowledge.
  • 3. 3 Concept Maps ● Knowledge representation models. ● Consisted of: – nodes (concepts). – links (relationships between the nodes). ● Aim: gather, understand, explore knowledge. ● Variety of users. ● No explicit detail. ● Implemented primarily in education.
  • 5. 5 Aim ● To design a framework to build/enhance medical concept maps. ● To improve the understanding of health care concept complexity. ● Assist medical professionals in the representation, exploration and validation of their expert knowledge. ● Improvement of the clinical health care.
  • 6. 6 Objectives ● Design and implement methods for health care concept detection. ● Concept organisation in a concept map form. ● Method generation for concept map updates. ● Build a framework for the design/enhancement/validation of medical concept maps. ● Methodology evaluation through the health problem of obesity: – validation of obesity related concepts with current structured obesity information available. – identify gaps in clinical knowledge.
  • 7. 7 Research Hypothesis & Questions -The analysis required to extract health care concepts. -The approach to built and enhance a concept map. -The concept map contribution in the representation/validation of knowledge. -The text mining results help to understand/explore clinical problems. Biomedical Text Mining Scientific literature Concept map Improvement of health care Framework
  • 8. 8 Obesity ● Worldwide problem. ● Epidemic proportions: – WHO rates (2005): 1.6 billion overweight, 400 million obese. ● Associations to various diseases. ● Complex risk factors and complications. ● Various aspects. ● Lots of research.
  • 9. 9
  • 10. 10 Biomedical Text Mining ● Extraction of information from unstructured data of biomedical nature. ● Discovery of new, previously unknown knowledge. ● Performed on documents with complex/specific terminology and expressions. ● Challenges: – language ambiguity. – variation of language expression. ● Various tools and applications (Termine, Whatizit, GATE). ● Adaptation to user's tasks and requirements.
  • 11. 11 What we are looking for? ● Risk Factors ● Causal Factors ● Confounding Factors ● Outcomes ● Complications ● Interventions ● ...
  • 12. 12 Methodology Overview 1. Document retrieval. 2. Term/concept extraction. 3. Feature engineering and Information extraction: - application of classification/clustering techniques. 4. Concept map design.
  • 13. 13 Evaluation-Obesity Case Study ● Comparison: – What ? ● biomedical text mining results. ● concept map information. – How ? ● concepts and relationships. ● New ones. ● Examination/manipulation/validation of new knowledge by experts. ● Enhancement of the concept map.
  • 14. 14 Progress so far (1) ● Corpus collection. ● Application of Automated Term Recognition (ATR). ● C-value method. ● Single word ATR: – terminological head identification. – word of a multi-word term that defines the term class. – example: ● “Childhood diabetes type II”. ● Terminological head: “diabetes”.
  • 15. 15 Progress so far (2) ● Ranking head measures: – total head frequency, – single head frequency, – maximum and average C-value, – abstract frequency, – ratio of single head frequency/total head frequency, – tf*idf (term frequency*inverse document frequency).
  • 16. 16 Results tf*idf total freq single freq abstract freq word freq max_c aver_c ratio 0 5 10 15 20 25 30 35 40 45 0 10 20 30 40 50 Statistical measure Numberofkeywords
  • 17. 17 Progress so far (3) ● Pattern extraction from abstracts for: – risk, confounding and causal factors, – interventions, – complications, – outcomes. Obesity risk is increased among women with psychiatric disorders Potential risk factor
  • 18. 18 Example Potential risk factors Potential interventions Potential complications
  • 19. 19 Future plan Species identification in obesity corpus (Linneus) Exploration of single word terms ATR Calculation of z-score Integration of single and multi-word terms Lexical/semantic analysis of the existing concept map Paper preparation for the extraction of single terms in text Pattern extraction from manual analysis Pattern rule design with Minor Third Feature engineering Clustering Classification Paper preparation for the classification of disease descriptors Paper preparation for the clustering of health care concepts Integration of the results Preparation of the second year interview/report Design of concept map relationships (exploration) Application of visual mapping tools Update of the new concept map Comparison and validation of knowledge Exploration of concept complexity in obesity Paper preparation for the automatic design of clinical concept maps Produced generic framework of the methodology Writing the thesis October 2010 April 2011 November 2011 May 2012 Year 3 Year 2 Date Year 2 (1/2): Concept extraction
  • 20. 20 Future plan Species identification in obesity corpus (Linneus) Exploration of single word terms ATR Calculation of z-score Integration of single and multi-word terms Lexical/semantic analysis of the existing concept map Paper preparation for the extraction of single terms in text Pattern extraction from manual analysis Pattern rule design with Minor Third Feature engineering Clustering Classification Paper preparation for the classification of disease descriptors Paper preparation for the clustering of health care concepts Integration of the results Preparation of the second year interview/report Design of concept map relationships (exploration) Application of visual mapping tools Update of the new concept map Comparison and validation of knowledge Exploration of concept complexity in obesity Paper preparation for the automatic design of clinical concept maps Produced generic framework of the methodology Writing the thesis October 2010 April 2011 November 2011 May 2012 Year 3 Year 2 Date Year 2 (2/2): Concept structuring
  • 21. 21 Future plan Species identification in obesity corpus (Linneus) Exploration of single word terms ATR Calculation of z-score Integration of single and multi-word terms Lexical/semantic analysis of the existing concept map Paper preparation for the extraction of single terms in text Pattern extraction from manual analysis Pattern rule design with Minor Third Feature engineering Clustering Classification Paper preparation for the classification of disease descriptors Paper preparation for the clustering of health care concepts Integration of the results Preparation of the second year interview/report Design of concept map relationships (exploration) Application of visual mapping tools Update of the new concept map Comparison and validation of knowledge Exploration of concept complexity in obesity Paper preparation for the automatic design of clinical concept maps Produced generic framework of the methodology Writing the thesis October 2010 April 2011 November 2011 May 2012 Year 3 Year 2 Date Year 3: Design of the medical concept map
  • 22. 22 Summary ● Framework creation for clinical concept map building and enhancement. ● Improved understanding of health care concept complexity. ● So far: – comprehension of literature review. – methodology design. – single ATR. – pattern design.