SlideShare ist ein Scribd-Unternehmen logo
1 von 57
Downloaden Sie, um offline zu lesen
Spark NLP for Healthcare
Lessons Learned Building Real-World
Healthcare AI Systems
Veysel Kocaman
Sr. Data Scientist
Agenda
▪ Introducing Spark NLP
▪ Problem areas in healthcare
analytics
▪ Solving healthcare related NLP
problems
▪ Case studies
Introducing Spark NLP
● Natural Language Toolkit (NLTK): The complete toolkit
for all NLP techniques.
● TextBlob: Easy to use NLP tools API, built on top of NLTK
and Pattern.
● SpaCy: Industrial strength NLP with Python and Cython.
● Gensim: Topic Modelling for Humans
● Stanford Core NLP: NLP services and packages by
Stanford NLP Group.
● Fasttext: NLP library by Facebook’s AI Research (FAIR)
lab
● ...
● Spark NLP is an open-source natural language
processing library, built on top of Apache Spark and
Spark ML. (initial release: Oct 2017)
○ A single unified solution for all your NLP needs
○ Take advantage of transfer learning and
implementing the latest and greatest SOTA
algorithms and models in NLP research
○ Lack of any NLP library that’s fully supported by
Spark
○ Delivering a mission-critical, enterprise grade NLP
library (used by multiple Fortune 500)
○ Full-time development team (26 new releases in
2018. 30 new releases in 2019.)
https://medium.com/spark-nlp/introduction-to-spark-nlp-foundations-and-basic-components-part-i-c83b7629ed59
Spark NLP Modules (Enterprise and Public)
Introducing Spark NLP
● Python, Java and Scala, R
● ”State of the art” means the best performing academic
peer-reviewed results
● Built on the Spark ML API’s
● Apache 2.0 Licensed
● Active development & support
● Zero code changes to scale a pipeline to any Spark
cluster
● The only open-source NLP library that is natively
distributed
● Spark provides execution planning, caching,
serialization, shuffling
Introducing Spark NLP
Sitting on the shoulders of Spark ML !
● Reusing the Spark ML Pipeline
● Unified NLP & ML pipelines
● End-to-end execution planning
● Serializable
● Distributable
● Reusing NLP Functionality
● TF-IDF calculation
● String distance calculation
● Topic modeling
● Distributed ML algorithms
Word & Sentence Embeddings
Glove
(100, 200, 300)
ELMO
(512, 1024)
BERT
(768d)
Universal Sentence Encoders
(512)
Clinical Word Embeddings
Clinical Glove
(200d)
ICDO Glove
(200d)
Bio BERT Clinical BERT
Pubmed + PMC Fine tuned Pubmed + PMC +
Discharge summaries
PubMed + ICD10
UMLS + MIMIC III
PubMed + PMC
PubMed abstracts and PMC full-text articles
https://www.nlm.nih.gov/bsd/difference.html
Introducing Spark NLP
Pipeline of annotators
Spark NLP Pretrained Pipeline
Spark is like a locomotive racing a
bicycle. The bike will win if the load
is light, it is quicker to accelerate
and more agile, but with a heavy
load the locomotive might take a
while to get up to speed, but it’s
going to be faster in the end.
LightPipelines are Spark ML pipelines converted into a single
machine but multithreaded task, becoming more than 10x times
faster for smaller amounts of data (small is relative, but 50k
sentences is roughly a good maximum).
Spark NLP Light Pipelines
Faster inference in runtime from Spark
NLP pipelines
Spark NLP in Healthcare
Spark NLP in Healthcare
Raw & unstructured dataClean & structured data Healthcare data
● Less than 50% of the structured data and less than 1% of the unstructured data is being leveraged for decision
making in companies (HBR). This is even worse in healthcare.
● NLP is ultra domain specific, so train your own models.
Spark NLP in Healthcare
"(admission): 50.4 kgn Height: 61 Inchn ICP: 7 (1 - 14) mmHgn Total In:n 3,279 mLn 911 mLn PO:n Tube feeding:n 243 mLn 237 mLn IV
Fluid:n 2,827 mLn 624 mLn Blood products:n Total out:n 2,333 mLn 370 mLn Urine:n 2,330 mLn 370 mLn NG:n Stool:n
Drains:n 3 mLn Balance:n 946 mLn 541 mLn Respiratory supportn O2 Delivery Device: Nonen SPO2: 97%n ABG: ///26/n Physical
Examinationn General Appearance: No acute distress, Non communicative due ton language barriern HEENT: PERRL, EOMIn Cardiovascular:
(Rhythm: Regular)n Respiratory / Chest: (Expansion: Symmetric), (Breath Sounds: CTAn bilateral : ), (Sternum: Stable )n Abdominal: Soft, Non-
distended, Non-tender, Bowel sounds presentn Left Extremities: (Edema: Absent), (Temperature: Warm), (Pulse -n Dorsalis pedis: Present),
(Pulse - Posterior tibial: Present)n Right Extremities: (Edema: Absent), (Temperature: Warm), (Pulse -n Dorsalis pedis: Present), (Pulse - Posterior
tibial: Present)n Skin: (Incision: Clean / Dry / Intact)n Neurologic: (Awake / Alert / Oriented: x 2), Follows simple commands,n Moves all
extremities, Limited due to language barriern Labs / Radiologyn 275 K/uLn 9.8 g/dLn 134 mg/dLn 0.4 mg/dLn 26 mEq/Ln 3.5 mEq/Ln 15
mg/dLn 102 mEq/Ln 137 mEq/Ln 30.3 %n 8.8 K/uLn [image002.jpg]n [**2140-7-23**] 03:30 PMn [**2140-7-24**] 02:51 AMn [**2140-7-
24**] 03:03 AMn [**2140-7-24**] 08:13 AMn [**2140-7-24**] 10:07 AMn [**2140-7-25**] 02:45 AMn [**2140-7-26**] 01:15 AMn [**2140-7-27**]
03:09 AMn [**2140-7-27**] 10:58 AMn [**2140-7-28**] 02:58 AMn WBCn 9.7n 10.3n 11.2n 7.7n 7.1n 8.8n Hctn 31.8n 32.6n 34.3n
33.3n 31.4n 30.3n Pltn [**Telephone/Fax (3) 8785**]n Creatininen 0.5n 0.5n 0.5n 0.5n 0.5n 0.5n 0.4n TCO2n 26n 28n 29n
Glucosen 168n 253n 147n 180n 92n 160n 194n 134n Other labs: PT / PTT / INR:11.6/25.8/1.0, CK / CK-MB / Troponinn T:54//<0.01, ALT
/ AST:25/32, Alk-Phos / T bili:87/,n Differential-Neuts:93.0 %, Lymph:5.3 %, Mono:1.0 %, Eos:0.5 %, Lacticn Acid:1.5 mmol/L, Ca:7.9 mg/dL,
Mg:1.8 mg/dL, PO4:2.5 mg/dLn Assessment and Plann AIRWAY, INABILITY TO PROTECT (RISK FOR ASPIRATION, ALTERED GAG, AIRWAYn
CLEARANCE, COUGH), CVA (STROKE, CEREBRAL INFARCTION), HEMORRHAGIC ,n HYPERTENSION, BENIGN, [**Last Name 12**] PROBLEM - ENTER
DESCRIPTION IN COMMENTSn Assessment and Plan: 69 yo F w/ left cerebellar thrombotic stroke,n hemorrhage, transtentorial herniation s/p EVD
placement, surgicaln decompression on [**7-22**], now w/ improved neuro examsn Neurologic: ICP monitor, Pain controlled, s/p crani for
cerebellarn CVA, moves all 4, EVD clamped.
Output from one of the NLP libraries - MIMIC-III dataset
(an openly available dataset developed by the MIT Lab for Computational Physiology)
Spark NLP in Healthcare
Spark NLP in Healthcare
Spark NLP in Healthcare
NLP Library / Feature State of the Art (SOTA) Research
Named Entity Recognition “Entity Recognition from Clinical Texts via Recurrent Neural Network”.
Liu et al., BMC Medical Informatics & Decision Making, July 2017.
Word Embeddings - “How to Train Good Word Embeddings for Biomedical NLP”.
Chiu et al., In Proceedings of BioNLP’16, August 2016.
- “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”.
Devlin et. al. (Google Research), October 2018.
Assertion Status Detection - “Improving Classification of Medical Assertions in Clinical Notes”.
Kim et al., In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics:
Human Language Technologies, 2011.
- “Neural Networks For Negation Scope Detection“
Fancellu et al., In Proceedings of the 54th Annual Meeting of the Association for Computational
Linguistics, 2016.
Entity Resolution “CNN-based ranking for biomedical entity normalization”.
Li et al., BMC Bioinformatics, October 2017.
Clinical Named Entity Recognition
Posology NER
Anatomy NER
PHI NER
Clinical NER
NER
Comparison
with AWS
Medical
Comprehend
Clinical Named Entity Recognition
Clinical Assertion Model
Prescribing sick days due to diagnosis of influenza. Present
41 yo man with CRFs of DM Type II, high cholesterol, smoking history,
family hx, HTN p/w episodes of atypical CP x 1 week, with rest and
exertion.
Conditional
Jane’s RIDT came back clean. Absent
Jane is at risk for flu if she’s not vaccinated. Hypothetical
There was a dense hemianopsia on the left side. Present
“Neural Networks For Negation Scope Detection“
Fancellu et al., In Proceedings of the 54th Annual
Meeting of the Association for Computational
Linguistics, 2016.
scope of negation: given a negative instance, to identify which tokens are affected by negation
Clinical Assertion Model
scope of negation: given a negative instance, to identify which tokens are
affected by negation
Clinical Deidentification Model
* Identifies potential pieces of content with personal information about patients and remove them by replacing with semantic tags.
Entity Resolvers Model
Entity Resolvers Model
Entity Resolvers Model
Customer Case Studies
1. How SelectData uses AI to better
understand home health patients
2. How Roche automated knowledge
extraction from pathology and radiology
reports
3. Improving patient flow forecasting at
Kaiser Permanente
4. How Deep6 accelerates clinical trial
recruitment
SelectData
What is Home Health and upcoming problems ?
Silver Tsunami
● By 2022 more than 25 percent of US workers will be 55 or older
● Nearly 10,000 baby boomers reach retirement age each day
● Home Health is expected to grow by 6.7% next year
Expert Reviewer
● Bureau of Labor Statistics projects that the need for medical coders will
increased by 15% by 2027
● Healthcare Data is used in decision-making
Aging Baby Boomers
● By 2039 the rate of Medicare spending and net interest on national debt will
exceed total projected revenues
● Payment reform focused on reduction in price
SelectData
Problems vs Solutions
TL;DR => we have more people, less qualified workers, and our clients are
receiving less money for the care of that patient.
SelectData
● OCR is difficult, different layouts, different
scales, noise, rotation.
● High number of records and pages.
● Need for cluster processing.
● Cluster processing is difficult.
SelectData Spark OCR
SelectData
● We create a pipeline, composed by annotators.
● The pipeline runs in a cluster.
● We can process many documents in parallel and scale out.
SelectData
SelectData
Document Assembler and Tokenizer
SelectData
Spell Checker
SelectData
SelectData
SelectData
Entity Resolution
Case 2: Roche
Manual curation is extremely time consuming, expensive,
and prone to errors
Manually Curated TCGA Report
Sample Results from Curation
Case 2: Roche
1. Natural Language Processing (NLP):
● High accuracy
● Specialized for medical data
● Minimize time to train new models
● Extensible for new content types
1. Optical Character Recognition (OCR):
● High accuracy
● Retain document structure
(i.e. tables, lists, paragraphs,...)
Requirements for both:
● Scalable (support 10 million pathology reports per
year)
● Compliant with privacy laws
● Integrates easily with AWS services
● Low cost
The NAVIFY team identified two significant needs
Action Plan :
● Initial goal of speeding up review of pathology
reports
● Will then automate extraction of high confidence
entities and relationships
● Will keep increasing automation of NLP over time
Case 2: Roche How Spark NLP helped Roche ?
Case 2: Roche
Lessons Learned
● Extracting text from domain specific PDFs/images is unpredictable
● Quantitative evaluation of OCR is challenging
● Bridging the gap between domain knowledge & NLP requires consensus
● Evidence does not always match with standard terminologies
● Building NLP pipelines - that are generalizable:
○ Static components like tokenization, sentence detection, POS tagging and chunking can be
re-utilized
○ Data sources (hospitals) differ, NLP approach needs to be plug and play
Case 3: Kaiser Permanente
Improving Patient Flow Forecasting
Case 3: Kaiser Permanente
Improving Patient Flow Forecasting
Objectives
Optimize the patient flow models & provide insights,
for real-time decision-making and for strategic planning,
by predicting:
● Bed demand
● 'Safe' staffing levels
● Hospital gridlock
Case 3: Kaiser Permanente
Case 4: Deep6
Feature engineering with Spark NLP to accelerate clinical trial recruitment
(reducing the time that it takes to find a patient for trials)
● Your treatments are > 15 years old
● Cutting edge treatments only
available in clinical trials
● Faster cycles make lifesaving
treatments available sooner
Case 4: Deep6
Case 4: Deep6
Case 4: Deep6
Case 4: Deep6
Case 4: Deep6
Spark NLP resources
Spark NLP Official page
Spark NLP Workshop Repo
JSL Youtube channel
JSL Blogs
Introduction to Spark NLP: Foundations and Basic Components (Part-I)
Introduction to: Spark NLP: Installation and Getting Started (Part-II)
Named Entity Recognition with Bert in Spark NLP
Text Classification in Spark NLP with Bert and Universal Sentence Encoders
Spark NLP 101 : Document Assembler
Spark NLP 101: LightPipeline
https://www.oreilly.com/radar/one-simple-chart-who-is-interested-in-spark-nlp/
https://blog.dominodatalab.com/comparing-the-functionality-of-open-source-natural-language-processing-libraries/
https://databricks.com/blog/2017/10/19/introducing-natural-language-processing-library-apache-spark.html
https://databricks.com/fr/session/apache-spark-nlp-extending-spark-ml-to-deliver-fast-scalable-unified-natural-language-processing
https://medium.com/@saif1988/spark-nlp-walkthrough-powered-by-tensorflow-9965538663fd
https://www.kdnuggets.com/2019/06/spark-nlp-getting-started-with-worlds-most-widely-used-nlp-library-enterprise.html
https://www.forbes.com/sites/forbestechcouncil/2019/09/17/winning-in-health-care-ai-with-small-data/#1b2fc2555664
https://medium.com/hackernoon/mueller-report-for-nerds-spark-meets-nlp-with-tensorflow-and-bert-part-1-32490a8f8f12
https://www.analyticsindiamag.com/5-reasons-why-spark-nlp-is-the-most-widely-used-library-in-enterprises/
https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-training-spark-nlp-and-spacy-pipelines
https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-accuracy-performance-and-scalability
https://www.infoworld.com/article/3031690/analytics/why-you-should-use-spark-for-machine-learning.html
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthcare AI Systems

Weitere ähnliche Inhalte

Was ist angesagt?

Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfDung Hoang
 
AI - Opportunities and Challenges
AI - Opportunities and ChallengesAI - Opportunities and Challenges
AI - Opportunities and ChallengesBert Jan Schrijver
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scaleMaxim Salnikov
 
営業向け ABテストの統計学 データアーティスト
営業向け ABテストの統計学 データアーティスト営業向け ABテストの統計学 データアーティスト
営業向け ABテストの統計学 データアーティストSatoru Yamamoto
 
AI: Built to Scale
AI: Built to ScaleAI: Built to Scale
AI: Built to Scaleaccenture
 
Let's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersLet's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersSteven Van Vaerenbergh
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfPremNaraindas1
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬VINCI Digital - Industrial IoT (IIoT) Strategic Advisory
 
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...SOLTUIONSpeople, THINKubators, THINKathons
 
How Artificial Intelligence is taking over Human Jobs
How Artificial Intelligence is taking over Human JobsHow Artificial Intelligence is taking over Human Jobs
How Artificial Intelligence is taking over Human JobsShradha Jindal
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesDianaGray10
 
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...SOLTUIONSpeople, THINKubators, THINKathons
 
Top 5 advantages of Artificial Intellegence
 Top 5 advantages of Artificial Intellegence Top 5 advantages of Artificial Intellegence
Top 5 advantages of Artificial Intellegenceparadisetechsoftsolutions
 
Unpacking AI for Healthcare
Unpacking AI for HealthcareUnpacking AI for Healthcare
Unpacking AI for HealthcareLumiata
 
leewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdfleewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdfKristiLBurns
 

Was ist angesagt? (20)

Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
 
ChatGPT.pptx
ChatGPT.pptxChatGPT.pptx
ChatGPT.pptx
 
AI - Opportunities and Challenges
AI - Opportunities and ChallengesAI - Opportunities and Challenges
AI - Opportunities and Challenges
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
 
営業向け ABテストの統計学 データアーティスト
営業向け ABテストの統計学 データアーティスト営業向け ABテストの統計学 データアーティスト
営業向け ABテストの統計学 データアーティスト
 
AI Redefines Insurance
AI Redefines InsuranceAI Redefines Insurance
AI Redefines Insurance
 
AI: Built to Scale
AI: Built to ScaleAI: Built to Scale
AI: Built to Scale
 
Let's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersLet's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchers
 
Jordan Wilson - Genius ChatGPT Tactis.pdf
Jordan Wilson - Genius ChatGPT Tactis.pdfJordan Wilson - Genius ChatGPT Tactis.pdf
Jordan Wilson - Genius ChatGPT Tactis.pdf
 
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra ConsultingAI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
 
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...
arbar https://www.slideshare.net/Solutionman/charles-caldwell-improve-your-li...
 
How Artificial Intelligence is taking over Human Jobs
How Artificial Intelligence is taking over Human JobsHow Artificial Intelligence is taking over Human Jobs
How Artificial Intelligence is taking over Human Jobs
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
 
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...
Jordan Wilson - Expert Chats Train ChatGPT to be your employee with the PPP m...
 
Top 5 advantages of Artificial Intellegence
 Top 5 advantages of Artificial Intellegence Top 5 advantages of Artificial Intellegence
Top 5 advantages of Artificial Intellegence
 
Unpacking AI for Healthcare
Unpacking AI for HealthcareUnpacking AI for Healthcare
Unpacking AI for Healthcare
 
Power of power BI
Power of power BI Power of power BI
Power of power BI
 
leewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdfleewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdf
 

Ähnlich wie Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthcare AI Systems

Natural Language Understanding in Healthcare
Natural Language Understanding in HealthcareNatural Language Understanding in Healthcare
Natural Language Understanding in HealthcareDavid Talby
 
Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat...
 Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat... Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat...
Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat...Databricks
 
How can we har­ness the Human Brain Project to max­i­mize its future health a...
How can we har­ness the Human Brain Project to max­i­mize its future health a...How can we har­ness the Human Brain Project to max­i­mize its future health a...
How can we har­ness the Human Brain Project to max­i­mize its future health a...SharpBrains
 
Connected Data for Machine Learning | Paul Groth
Connected Data for Machine Learning | Paul GrothConnected Data for Machine Learning | Paul Groth
Connected Data for Machine Learning | Paul GrothConnected Data World
 
ENCEPHALOGRAPHY PANKAJ.pptx
ENCEPHALOGRAPHY PANKAJ.pptxENCEPHALOGRAPHY PANKAJ.pptx
ENCEPHALOGRAPHY PANKAJ.pptxpreeminentbot
 
Computer-Aided Detection (1).pptx
Computer-Aided Detection (1).pptxComputer-Aided Detection (1).pptx
Computer-Aided Detection (1).pptxMohammedMasliuddin
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...PhD Assistance
 
2011 12 08 - LOINC Introduction
2011 12 08 - LOINC Introduction2011 12 08 - LOINC Introduction
2011 12 08 - LOINC Introductiondvreeman
 
Natural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsNatural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsMMS Holdings
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...PhD Assistance
 
Reference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxReference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxChimezie Ogbuji
 
NC_Fall_14_web
NC_Fall_14_webNC_Fall_14_web
NC_Fall_14_webErica Kube
 
DRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart Fashion
DRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart FashionDRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart Fashion
DRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart FashionCLICKNL
 
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
[DigiHealth 22] Artificial intelligence in medicine - Kristijan SaricDataScienceConferenc1
 
Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...Databricks
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchEuropean Bioinformatics Institute
 
Cao report 2007-2012
Cao report 2007-2012Cao report 2007-2012
Cao report 2007-2012Elif Ceylan
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biologyrobertstevens65
 

Ähnlich wie Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthcare AI Systems (20)

Natural Language Understanding in Healthcare
Natural Language Understanding in HealthcareNatural Language Understanding in Healthcare
Natural Language Understanding in Healthcare
 
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
2019 Triangle Machine Learning Day - Biomedical Image Understanding and EHRs ...
 
Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat...
 Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat... Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat...
Apache Spark NLP: Extending Spark ML to Deliver Fast, Scalable & Unified Nat...
 
How can we har­ness the Human Brain Project to max­i­mize its future health a...
How can we har­ness the Human Brain Project to max­i­mize its future health a...How can we har­ness the Human Brain Project to max­i­mize its future health a...
How can we har­ness the Human Brain Project to max­i­mize its future health a...
 
Connected Data for Machine Learning | Paul Groth
Connected Data for Machine Learning | Paul GrothConnected Data for Machine Learning | Paul Groth
Connected Data for Machine Learning | Paul Groth
 
ENCEPHALOGRAPHY PANKAJ.pptx
ENCEPHALOGRAPHY PANKAJ.pptxENCEPHALOGRAPHY PANKAJ.pptx
ENCEPHALOGRAPHY PANKAJ.pptx
 
Computer-Aided Detection (1).pptx
Computer-Aided Detection (1).pptxComputer-Aided Detection (1).pptx
Computer-Aided Detection (1).pptx
 
Non intrusive-devices
Non intrusive-devicesNon intrusive-devices
Non intrusive-devices
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
 
2011 12 08 - LOINC Introduction
2011 12 08 - LOINC Introduction2011 12 08 - LOINC Introduction
2011 12 08 - LOINC Introduction
 
Natural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsNatural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health Records
 
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
Future of Natural Language Processing - Potential Lists of Topics for PhD stu...
 
Reference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptxReference Domain Ontologies and Large Medical Language Models.pptx
Reference Domain Ontologies and Large Medical Language Models.pptx
 
NC_Fall_14_web
NC_Fall_14_webNC_Fall_14_web
NC_Fall_14_web
 
DRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart Fashion
DRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart FashionDRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart Fashion
DRIVE 2017 | 25 October - THE HUMAN TOUCH - Meaningful Data & Smart Fashion
 
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
 
Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...Automated and Explainable Deep Learning for Clinical Language Understanding a...
Automated and Explainable Deep Learning for Clinical Language Understanding a...
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
Cao report 2007-2012
Cao report 2007-2012Cao report 2007-2012
Cao report 2007-2012
 
The Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in BiologyThe Past, Present and Future of Knowledge in Biology
The Past, Present and Future of Knowledge in Biology
 

Mehr von Databricks

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Databricks
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Databricks
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of HadoopDatabricks
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDatabricks
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceDatabricks
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringDatabricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixDatabricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationDatabricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchDatabricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesDatabricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesDatabricks
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsDatabricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkDatabricks
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkDatabricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesDatabricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkDatabricks
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeDatabricks
 

Mehr von Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Kürzlich hochgeladen

Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 

Kürzlich hochgeladen (20)

Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 

Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthcare AI Systems

  • 1.
  • 2. Spark NLP for Healthcare Lessons Learned Building Real-World Healthcare AI Systems Veysel Kocaman Sr. Data Scientist
  • 3. Agenda ▪ Introducing Spark NLP ▪ Problem areas in healthcare analytics ▪ Solving healthcare related NLP problems ▪ Case studies
  • 4. Introducing Spark NLP ● Natural Language Toolkit (NLTK): The complete toolkit for all NLP techniques. ● TextBlob: Easy to use NLP tools API, built on top of NLTK and Pattern. ● SpaCy: Industrial strength NLP with Python and Cython. ● Gensim: Topic Modelling for Humans ● Stanford Core NLP: NLP services and packages by Stanford NLP Group. ● Fasttext: NLP library by Facebook’s AI Research (FAIR) lab ● ... ● Spark NLP is an open-source natural language processing library, built on top of Apache Spark and Spark ML. (initial release: Oct 2017) ○ A single unified solution for all your NLP needs ○ Take advantage of transfer learning and implementing the latest and greatest SOTA algorithms and models in NLP research ○ Lack of any NLP library that’s fully supported by Spark ○ Delivering a mission-critical, enterprise grade NLP library (used by multiple Fortune 500) ○ Full-time development team (26 new releases in 2018. 30 new releases in 2019.) https://medium.com/spark-nlp/introduction-to-spark-nlp-foundations-and-basic-components-part-i-c83b7629ed59
  • 5. Spark NLP Modules (Enterprise and Public)
  • 6.
  • 7. Introducing Spark NLP ● Python, Java and Scala, R ● ”State of the art” means the best performing academic peer-reviewed results ● Built on the Spark ML API’s ● Apache 2.0 Licensed ● Active development & support ● Zero code changes to scale a pipeline to any Spark cluster ● The only open-source NLP library that is natively distributed ● Spark provides execution planning, caching, serialization, shuffling
  • 9. Sitting on the shoulders of Spark ML ! ● Reusing the Spark ML Pipeline ● Unified NLP & ML pipelines ● End-to-end execution planning ● Serializable ● Distributable ● Reusing NLP Functionality ● TF-IDF calculation ● String distance calculation ● Topic modeling ● Distributed ML algorithms
  • 10. Word & Sentence Embeddings Glove (100, 200, 300) ELMO (512, 1024) BERT (768d) Universal Sentence Encoders (512)
  • 11. Clinical Word Embeddings Clinical Glove (200d) ICDO Glove (200d) Bio BERT Clinical BERT Pubmed + PMC Fine tuned Pubmed + PMC + Discharge summaries PubMed + ICD10 UMLS + MIMIC III PubMed + PMC PubMed abstracts and PMC full-text articles https://www.nlm.nih.gov/bsd/difference.html
  • 14. Spark is like a locomotive racing a bicycle. The bike will win if the load is light, it is quicker to accelerate and more agile, but with a heavy load the locomotive might take a while to get up to speed, but it’s going to be faster in the end. LightPipelines are Spark ML pipelines converted into a single machine but multithreaded task, becoming more than 10x times faster for smaller amounts of data (small is relative, but 50k sentences is roughly a good maximum). Spark NLP Light Pipelines Faster inference in runtime from Spark NLP pipelines
  • 15. Spark NLP in Healthcare
  • 16. Spark NLP in Healthcare Raw & unstructured dataClean & structured data Healthcare data ● Less than 50% of the structured data and less than 1% of the unstructured data is being leveraged for decision making in companies (HBR). This is even worse in healthcare. ● NLP is ultra domain specific, so train your own models.
  • 17. Spark NLP in Healthcare
  • 18.
  • 19. "(admission): 50.4 kgn Height: 61 Inchn ICP: 7 (1 - 14) mmHgn Total In:n 3,279 mLn 911 mLn PO:n Tube feeding:n 243 mLn 237 mLn IV Fluid:n 2,827 mLn 624 mLn Blood products:n Total out:n 2,333 mLn 370 mLn Urine:n 2,330 mLn 370 mLn NG:n Stool:n Drains:n 3 mLn Balance:n 946 mLn 541 mLn Respiratory supportn O2 Delivery Device: Nonen SPO2: 97%n ABG: ///26/n Physical Examinationn General Appearance: No acute distress, Non communicative due ton language barriern HEENT: PERRL, EOMIn Cardiovascular: (Rhythm: Regular)n Respiratory / Chest: (Expansion: Symmetric), (Breath Sounds: CTAn bilateral : ), (Sternum: Stable )n Abdominal: Soft, Non- distended, Non-tender, Bowel sounds presentn Left Extremities: (Edema: Absent), (Temperature: Warm), (Pulse -n Dorsalis pedis: Present), (Pulse - Posterior tibial: Present)n Right Extremities: (Edema: Absent), (Temperature: Warm), (Pulse -n Dorsalis pedis: Present), (Pulse - Posterior tibial: Present)n Skin: (Incision: Clean / Dry / Intact)n Neurologic: (Awake / Alert / Oriented: x 2), Follows simple commands,n Moves all extremities, Limited due to language barriern Labs / Radiologyn 275 K/uLn 9.8 g/dLn 134 mg/dLn 0.4 mg/dLn 26 mEq/Ln 3.5 mEq/Ln 15 mg/dLn 102 mEq/Ln 137 mEq/Ln 30.3 %n 8.8 K/uLn [image002.jpg]n [**2140-7-23**] 03:30 PMn [**2140-7-24**] 02:51 AMn [**2140-7- 24**] 03:03 AMn [**2140-7-24**] 08:13 AMn [**2140-7-24**] 10:07 AMn [**2140-7-25**] 02:45 AMn [**2140-7-26**] 01:15 AMn [**2140-7-27**] 03:09 AMn [**2140-7-27**] 10:58 AMn [**2140-7-28**] 02:58 AMn WBCn 9.7n 10.3n 11.2n 7.7n 7.1n 8.8n Hctn 31.8n 32.6n 34.3n 33.3n 31.4n 30.3n Pltn [**Telephone/Fax (3) 8785**]n Creatininen 0.5n 0.5n 0.5n 0.5n 0.5n 0.5n 0.4n TCO2n 26n 28n 29n Glucosen 168n 253n 147n 180n 92n 160n 194n 134n Other labs: PT / PTT / INR:11.6/25.8/1.0, CK / CK-MB / Troponinn T:54//<0.01, ALT / AST:25/32, Alk-Phos / T bili:87/,n Differential-Neuts:93.0 %, Lymph:5.3 %, Mono:1.0 %, Eos:0.5 %, Lacticn Acid:1.5 mmol/L, Ca:7.9 mg/dL, Mg:1.8 mg/dL, PO4:2.5 mg/dLn Assessment and Plann AIRWAY, INABILITY TO PROTECT (RISK FOR ASPIRATION, ALTERED GAG, AIRWAYn CLEARANCE, COUGH), CVA (STROKE, CEREBRAL INFARCTION), HEMORRHAGIC ,n HYPERTENSION, BENIGN, [**Last Name 12**] PROBLEM - ENTER DESCRIPTION IN COMMENTSn Assessment and Plan: 69 yo F w/ left cerebellar thrombotic stroke,n hemorrhage, transtentorial herniation s/p EVD placement, surgicaln decompression on [**7-22**], now w/ improved neuro examsn Neurologic: ICP monitor, Pain controlled, s/p crani for cerebellarn CVA, moves all 4, EVD clamped. Output from one of the NLP libraries - MIMIC-III dataset (an openly available dataset developed by the MIT Lab for Computational Physiology) Spark NLP in Healthcare
  • 20. Spark NLP in Healthcare
  • 21. Spark NLP in Healthcare NLP Library / Feature State of the Art (SOTA) Research Named Entity Recognition “Entity Recognition from Clinical Texts via Recurrent Neural Network”. Liu et al., BMC Medical Informatics & Decision Making, July 2017. Word Embeddings - “How to Train Good Word Embeddings for Biomedical NLP”. Chiu et al., In Proceedings of BioNLP’16, August 2016. - “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”. Devlin et. al. (Google Research), October 2018. Assertion Status Detection - “Improving Classification of Medical Assertions in Clinical Notes”. Kim et al., In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011. - “Neural Networks For Negation Scope Detection“ Fancellu et al., In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016. Entity Resolution “CNN-based ranking for biomedical entity normalization”. Li et al., BMC Bioinformatics, October 2017.
  • 22.
  • 23. Clinical Named Entity Recognition Posology NER Anatomy NER PHI NER Clinical NER
  • 25. Clinical Assertion Model Prescribing sick days due to diagnosis of influenza. Present 41 yo man with CRFs of DM Type II, high cholesterol, smoking history, family hx, HTN p/w episodes of atypical CP x 1 week, with rest and exertion. Conditional Jane’s RIDT came back clean. Absent Jane is at risk for flu if she’s not vaccinated. Hypothetical There was a dense hemianopsia on the left side. Present “Neural Networks For Negation Scope Detection“ Fancellu et al., In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016. scope of negation: given a negative instance, to identify which tokens are affected by negation
  • 26. Clinical Assertion Model scope of negation: given a negative instance, to identify which tokens are affected by negation
  • 27. Clinical Deidentification Model * Identifies potential pieces of content with personal information about patients and remove them by replacing with semantic tags.
  • 31. Customer Case Studies 1. How SelectData uses AI to better understand home health patients 2. How Roche automated knowledge extraction from pathology and radiology reports 3. Improving patient flow forecasting at Kaiser Permanente 4. How Deep6 accelerates clinical trial recruitment
  • 32. SelectData What is Home Health and upcoming problems ? Silver Tsunami ● By 2022 more than 25 percent of US workers will be 55 or older ● Nearly 10,000 baby boomers reach retirement age each day ● Home Health is expected to grow by 6.7% next year Expert Reviewer ● Bureau of Labor Statistics projects that the need for medical coders will increased by 15% by 2027 ● Healthcare Data is used in decision-making Aging Baby Boomers ● By 2039 the rate of Medicare spending and net interest on national debt will exceed total projected revenues ● Payment reform focused on reduction in price
  • 33. SelectData Problems vs Solutions TL;DR => we have more people, less qualified workers, and our clients are receiving less money for the care of that patient.
  • 34. SelectData ● OCR is difficult, different layouts, different scales, noise, rotation. ● High number of records and pages. ● Need for cluster processing. ● Cluster processing is difficult.
  • 36. SelectData ● We create a pipeline, composed by annotators. ● The pipeline runs in a cluster. ● We can process many documents in parallel and scale out.
  • 43. Case 2: Roche Manual curation is extremely time consuming, expensive, and prone to errors Manually Curated TCGA Report Sample Results from Curation
  • 44. Case 2: Roche 1. Natural Language Processing (NLP): ● High accuracy ● Specialized for medical data ● Minimize time to train new models ● Extensible for new content types 1. Optical Character Recognition (OCR): ● High accuracy ● Retain document structure (i.e. tables, lists, paragraphs,...) Requirements for both: ● Scalable (support 10 million pathology reports per year) ● Compliant with privacy laws ● Integrates easily with AWS services ● Low cost The NAVIFY team identified two significant needs Action Plan : ● Initial goal of speeding up review of pathology reports ● Will then automate extraction of high confidence entities and relationships ● Will keep increasing automation of NLP over time
  • 45. Case 2: Roche How Spark NLP helped Roche ?
  • 46. Case 2: Roche Lessons Learned ● Extracting text from domain specific PDFs/images is unpredictable ● Quantitative evaluation of OCR is challenging ● Bridging the gap between domain knowledge & NLP requires consensus ● Evidence does not always match with standard terminologies ● Building NLP pipelines - that are generalizable: ○ Static components like tokenization, sentence detection, POS tagging and chunking can be re-utilized ○ Data sources (hospitals) differ, NLP approach needs to be plug and play
  • 47. Case 3: Kaiser Permanente Improving Patient Flow Forecasting
  • 48. Case 3: Kaiser Permanente Improving Patient Flow Forecasting Objectives Optimize the patient flow models & provide insights, for real-time decision-making and for strategic planning, by predicting: ● Bed demand ● 'Safe' staffing levels ● Hospital gridlock
  • 49. Case 3: Kaiser Permanente
  • 50. Case 4: Deep6 Feature engineering with Spark NLP to accelerate clinical trial recruitment (reducing the time that it takes to find a patient for trials) ● Your treatments are > 15 years old ● Cutting edge treatments only available in clinical trials ● Faster cycles make lifesaving treatments available sooner
  • 56. Spark NLP resources Spark NLP Official page Spark NLP Workshop Repo JSL Youtube channel JSL Blogs Introduction to Spark NLP: Foundations and Basic Components (Part-I) Introduction to: Spark NLP: Installation and Getting Started (Part-II) Named Entity Recognition with Bert in Spark NLP Text Classification in Spark NLP with Bert and Universal Sentence Encoders Spark NLP 101 : Document Assembler Spark NLP 101: LightPipeline https://www.oreilly.com/radar/one-simple-chart-who-is-interested-in-spark-nlp/ https://blog.dominodatalab.com/comparing-the-functionality-of-open-source-natural-language-processing-libraries/ https://databricks.com/blog/2017/10/19/introducing-natural-language-processing-library-apache-spark.html https://databricks.com/fr/session/apache-spark-nlp-extending-spark-ml-to-deliver-fast-scalable-unified-natural-language-processing https://medium.com/@saif1988/spark-nlp-walkthrough-powered-by-tensorflow-9965538663fd https://www.kdnuggets.com/2019/06/spark-nlp-getting-started-with-worlds-most-widely-used-nlp-library-enterprise.html https://www.forbes.com/sites/forbestechcouncil/2019/09/17/winning-in-health-care-ai-with-small-data/#1b2fc2555664 https://medium.com/hackernoon/mueller-report-for-nerds-spark-meets-nlp-with-tensorflow-and-bert-part-1-32490a8f8f12 https://www.analyticsindiamag.com/5-reasons-why-spark-nlp-is-the-most-widely-used-library-in-enterprises/ https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-training-spark-nlp-and-spacy-pipelines https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-accuracy-performance-and-scalability https://www.infoworld.com/article/3031690/analytics/why-you-should-use-spark-for-machine-learning.html