Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
How Will Knowledge Graphs Improve Clinical Reporting Workflows
1. gsk.com
How will knowledge graphs improve clinical reporting workflows?
Presenters: Alexey Kuznetsov (alexey.k.kuznetsov@gsk.com) & Jorine Putter (jorine.8.putter@gsk.com)
07Dec2022
2. 07 December 2022 2
Our Problem Statement
Tremendous Resource, Multiple Handoffs, Numerous Transformations
Single
Study
SDTM* ADaM TFLs
Submission
5 – 10
studies
99 modules
*From 77 legacy datasets
Clinical
data flow
Trial
design
Collect
data
Review
observed
datasets
Analyse
datasets
Review
results
EDC
Lab data
Randomisation
Others...
Protocol
Metadata
Examples 71 datasets 42 datasets <= 250 outputs
SDTM ADaM TFLs
350 - 710
to integrate
210 – 420
to integrate
<= 250 integrated
outputs
Re-transformations
(Several Standard)
Re-Mappings
(Several Standards)
3. 07 December 2022 3
Imagine a world where anything is possible…
True automation of
standard analyses
Ad-hoc requests
delivered on demand
(Blinded) Analysis
results reviewed in
real-time
Manual effort of
results validation
virtually eliminated
Data visualisations
available in-stream
GDPR and patient
consent
Risk-based
monitoring is
proactive
Google-like Q&A
system for our
trial data
Clinical
application of
preclinical AI
algorithms
4. 07 December 2022 4
From Imagination to Reality
Clinical Knowledge Graph
Let’s move away from isolated data domain silos…
…to ONE contextualised Clinical Knowledge Graph
Exposure
Domain
Subject = Bob
Study Day = 1 Dosage = 40mg
Trial = Trial1
Medical History
Domain Subject = Bob
Event Date = 2000
Term =
Hypertension
Trial = Trial1
Adverse Events
Domain
Subject = Bob
Study Day = 1 Term = Headache
Trial = Trial 1
Demographics
Domain
Sex = M Age = 75
Subject = Bob
Trial = Trial1
Clinical Knowledge Graph
5. 07 December 2022 5
Our Idea
…the Google Translate for our clinical data – helping us translate our complex data landscape to
answer important scientific questions
Clinical
data flow
Trial
design
Collect
data
Review
observed
datasets
Analyse
datasets
Review
results
EDC
Lab data
Randomisation
Others...
Protocol
Metadata
Examples
99 Modules
GSK Design
(1 Standard)
One Connected Data Model
Parallel Processing
SDTM (71)
ADaM (42)
TFLs (<=250)
ISS/ISE
Select Required Standard
ETL
Modules
Parallel Processing
6. 07 December 2022 6
Unique Value
Knowledge Graph
Greater control
over data privacy
Modern graph
analytics &
visualisation
Decoupling
vertical data
pipeline
Accelerated
decision making
7. 07 December 2022 7
Goal: Test feasibility, desirability & sustainability of idea
Phased agile & risk-based approach with predefined success criteria
EXPERIMENT 1
Can we ingest SDTM
data
into CLD MVP?
EXPERIMENT 3
Can we analyse, report
and egress
TLFs from CLD MVP?
EXPERIMENT 2
Can we enrich CLD MVP
model with ADaM
Transformations?
2021 H1 2021 H2 2022 H1 2022 H2
MVP PILOT
Can we use the CLD
MVP to perform QC for
an ongoing trial?
Read-out end of 2022
Continuous learning and iteration
14. 07 December 2022 14
How do we store machine readable derivation metadata as graph
15. 07 December 2022 15
How do we store machine readable derivation metadata as graph
aage = scrndt - brthdtc + 1
16. 07 December 2022 16
How do we store machine readable derivation metadata as graph
aage = scrndt - brthdtc + 1
17. 07 December 2022 17
How do we store machine readable derivation metadata as graph
aage = scrndt - brthdtc + 1
18. 07 December 2022 18
How do we store machine readable derivation metadata as graph
aage = scrndt - brthdtc + 1
19. 07 December 2022 19
How do we store machine readable derivation metadata as graph
aage = scrndt - brthdtc + 1
modular
dependant
orchestration
20. 07 December 2022 20
How do we store machine readable derivation metadata as graph
21. 07 December 2022 21
How do we store machine readable summary statistics as graph
22. 07 December 2022 22
How do we store machine readable summary statistics as graph
23. 07 December 2022 23
How do we store machine readable summary statistics as graph
specification of statistics
24. 07 December 2022 24
How do we store machine readable summary statistics as graph
specification of statistics
the what
25. 07 December 2022 25
How do we store machine readable summary statistics as graph
specification of statistics
the what
the how
26. 07 December 2022 26
How do we store machine readable summary statistics as graph
specification of statistics
the what
the how
qualifiers
27. 07 December 2022 27
How do we store machine readable summary statistics as graph
28. 07 December 2022 28
How do we store machine readable summary statistics as graph
SEX Mean Value
F 32.4
M 34.0
29. 07 December 2022 29
Open source assets released
To be released:
• tab2neo
• neo4cdisc
GSK-Biostatistics/neointerface:
NeoInterface - Neo4j made easy for
Python programmers! (github.com)
read/write
csv, xls, xlsx, xpt,
sas7bdat, rda
dm/ae/lb/...
dm/ae/../custom
30. 07 December 2022 30
Learnings that helped us accelerate our idea
Pre-defined
success criteria
critical in quick
decision making
Prioritise 1 idea, test it,
refine it, test it, refine it…
Focused innovation
challenge can greatly help
test disruptive ideas
Understand pain points &
test ideas to drive
informed innovation
Timeboxed focused
sprints are great to
inform the path ahead
31. 07 December 2022 31
Special thanks to…
Michael Rimler
Samantha Warden
Kirsten Langendorf
Johannes Ulander
Dave Iberson-Hurst
Eleanor Sparling
Rachel Ren
James Sefton
William McDermott
Jonathan Deacon
Benjamin Grinsted
Julian West
It takes a village
to raise an idea…