6. Raw files
10110
01101
Extraction
ABC
123
Text files
Metadata
(who, when, where, etc.)
3 million files
that needed
scanning
Each file takes
10 seconds to
process
That’s about a year
of processing…
35 Amazon
instances did
the job in 1.5
weeks
surely it will
be done soon…
yessss!
15. Import Process
3
Raw files
10110
01101
Extraction
ABC
123
Text files
Metadata
(who, when, where, etc.)
1
Acquire and
process the data
Import using the
model and enrich it
2
Decide on the
initial domain model
Entity
Inter-
mediaryOfficer
Address
SHAREHO
LDER_O
F IN
TER
M
ED
IA
RY_O
F
REGISTERED_AT
REGISTERED_AT
18. Unlock your relationships
… with Neo4j, the world’s leading graph database
Fredrik Johansson
fredrik.johansson@neotechnology.com
David Montag
david.montag@neotechnology.com