
Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure


To automate the boring task of submitting travel expenses, we developed an ML model for classifying receipts. Using AWS EC2, Lambda, S3, SageMaker and Rekognition, we evaluated different ways of training the model and serving predictions, as well as different modelling approaches (classical ML vs. deep learning).

Published in: Technology

  1. 1. Hotel or Taxi? "Sorting hat" for travel expenses with AWS ML infrastructure. BERLIN, 18.OCT 2018 MICHAEL PERLIN
  2. 2. www.innoq.com SERVICES Strategy & technology consulting Digital business models Software architecture & development Digital platforms & infrastructures Knowledge transfer, coaching & trainings Big data & machine learning FACTS ~130 employees Privately owned Vendor-independent OFFICES Monheim Berlin Offenbach Munich Hamburg Zurich CLIENTS Finance Telecommunications Logistics E-Commerce Fortune 500 SMBs Startups
  3. 3. Agenda • The value of machine learning • The problem we've solved • AWS infrastructure for training • How deep learning works • How we run it in production
  4. 4. The value of ML (aka success stories)
  5. 5. Self-driving cars Image from: pxhere.com
  6. 6. Automatic translation Screenshot from: deepl.com
  7. 7. Image classification Images from: wikipedia.org ok danger
  8. 8. The travelling consultant problem
  9. 9. Travel expenses
  10. 10. Travel expenses Screenshot from: haufe.de
  11. 11. Travel expenses
  12. 12. Travel expenses: Can we simplify this? ← Enter data ← Submit scan with the same data
  13. 13. • Export ~5K receipts + data entered • Use ML to extract document class, VAT rate, date, price... • = Save clicking and typing Travel expenses
  14. 14. Training model in AWS infrastructure
  15. 15. Two phases 1. Training your model with available data 2. Using your model for new data (Inference)
  16. 16. Training ? Category Bus Flight Taxi
  17. 17. AWS Rekognition
  18. 18. AWS Rekognition: limited to 50 words
  19. 19. Requirements for training Classical ML • Commodity hardware • Libs • IDE Deep learning • GPU-powered hardware • Libs • IDE
  20. 20. Instances for training
  21. 21. Training environment • EC2 Instance with bare Linux • Install libraries • Configure GPU usage • Install Jupyter • Add self-signed certificates • Go! Option 1
  22. 22. Training environment • EC2 Instance with an AMI from the Marketplace containing pre-installed and pre-configured libraries • Add self-signed certificates • Go! Option 2
  23. 23. Training environment Option 3
  24. 24. How deep learning works
  25. 25. Terms: Artificial Intelligence, Machine Learning, Deep Learning
  26. 26. Training ? Category Bus Flight Taxi
  27. 27. Training ? Category Bus Flight Taxi Day Travelcard Start Date(2) End Ticket(2) ... (50 words) Lufthansa Your Flight(4) Trip Payment Ticket ... (200 words) Heathrow Taxi(3) Services(2) Walton VISA DEBIT ... (100 words)
  28. 28. Training: [the three word lists from slide 27] TF = frequency of "Ticket" in the second document: 3/50 = 0.06; IDF = fraction of documents containing "Ticket": 500/5000 = 0.1; replace "Ticket" in the second vector with TF/IDF = 0.06/0.1 = 0.6
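The TF/IDF arithmetic on this slide can be reproduced in a few lines of Python. Note this is the slides' simplified variant: production TF-IDF (e.g. scikit-learn's TfidfVectorizer) applies a logarithm to the inverse document frequency, which this sketch deliberately omits. The toy documents are constructed only to match the slide's numbers.

```python
# Reproducing the slide's numbers: "Ticket" occurs 3 times in a 50-word
# receipt, and appears in 500 of 5000 receipts overall.
doc = ["Ticket"] * 3 + ["word"] * 47             # 50 words, "Ticket" x3
corpus = [["Ticket"]] * 500 + [["word"]] * 4500  # 5000 receipts

def tf(word, document):
    # term frequency: share of the document's words that are `word`
    return document.count(word) / len(document)

def df(word, documents):
    # document frequency: share of documents containing `word`
    return sum(word in d for d in documents) / len(documents)

score = tf("Ticket", doc) / df("Ticket", corpus)
print(score)  # 0.06 / 0.1 = 0.6 (up to float rounding)
```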
  29. 29. Training: Category ? / Bus / Flight / Taxi; each receipt is now a row of TF/IDF values [matrix of sample values]
  30. 30. Training: the categories are one-hot encoded: Bus = (1 0 0), Flight = (0 1 0), Taxi = (0 0 1)
  31. 31. Training: input matrix → transformation → output matrix
  32. 32. Training: input matrix × matrix with arbitrary values × another matrix with arbitrary values = computed output matrix
  33. 33. Training: computed output matrix vs. true output matrix → error
  34. 34. Training: the matrix values are changed a bit → new computed output matrix
  35. 35. Training: computed output matrix vs. true output matrix → error2
  36. 36. Training: adjust the transformation matrices based on error vs. error2
  37. 37. Training: input matrix × adjusted transformation matrices = new computed output matrix
  38. 38. Training: computed output matrix vs. true output matrix → error3
  39. 39. Training: adjust the transformation matrices based on error2 vs. error3
  40. 40. Training • Iterate over all the data until the error stops shrinking • The result of the adjustments is the trained model • Now it can be deployed into production
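The "change a value a bit, keep the change if the error shrinks" loop from the preceding slides can be sketched in plain Python. This is a random-search toy on made-up matrices, not what frameworks actually do (they compute gradients and use backpropagation), but it shows the same iterate-compare-adjust cycle:

```python
import random

def matmul(A, B):
    # plain-Python matrix product
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

def error(computed, true):
    # sum of squared differences between computed and true output matrices
    return sum((c - t) ** 2
               for crow, trow in zip(computed, true)
               for c, t in zip(crow, trow))

random.seed(0)
X = [[0.1, 0.0, 0.3, 0.0],   # 3 receipts x 4 TF/IDF values (made-up numbers)
     [0.0, 0.6, 0.0, 0.2],
     [0.0, 0.0, 0.9, 0.1]]
Y = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]  # true one-hot categories

# two transformation matrices starting with arbitrary values
W1 = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(4)]
W2 = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(4)]

err0 = error(matmul(matmul(X, W1), W2), Y)
best = err0
for _ in range(2000):
    W = random.choice((W1, W2))
    r, c = random.randrange(len(W)), random.randrange(len(W[0]))
    old = W[r][c]
    W[r][c] += random.uniform(-0.1, 0.1)      # change a value a bit
    e = error(matmul(matmul(X, W1), W2), Y)
    if e < best:
        best = e                              # error shrank: keep the change
    else:
        W[r][c] = old                         # error grew: revert

print(err0, "->", best)  # the error shrinks over the iterations
```

The adjusted matrices W1 and W2 are the "trained model"; saving them is what deploying to production starts from.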
  41. 41. Training: iterate: input matrix × transformation matrices = computed output matrix; compare with the true output matrix; errorX vs. errorX+1
  42. 42. Training: the whole adjustment loop is covered by frameworks!
  43. 43. Training Covered by frameworks!
  44. 44. Training frameworks: network architecture; hyperparameters ("learning rate", ...)
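Why the "learning rate" hyperparameter matters can be seen on a one-parameter toy problem (an illustration, not part of the talk): gradient descent shrinks the error only if each step is small enough, and a too-large rate makes the iterates overshoot and diverge.

```python
def minimize(learning_rate, steps=50):
    # gradient descent on f(w) = (w - 3)^2, whose gradient is 2 * (w - 3)
    w = 0.0
    for _ in range(steps):
        w -= learning_rate * 2 * (w - 3)
    return w

print(minimize(0.1))  # ends close to the minimum at w = 3
print(minimize(1.1))  # step too large: each update overshoots further
```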
  45. 45. Two phases 1. Training your model with available data 2. Using your model for new data (Inference) DONE
  46. 46. How we run it in production
  47. 47. Inference • General approach: load the model saved by training, feed in the input, get the output • Even cross-language works, i.e. a model trained with Python can be used in a Java application • Usually works on commodity hardware
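A minimal sketch of that load-feed-get-output cycle, with hypothetical weights for a tiny linear model serialized as JSON. JSON is used here only to make the cross-language point concrete (any language, e.g. a Java service, can parse it); real deployments typically use the training framework's own portable model format.

```python
import json

# hypothetical per-category weight vectors, serialized as a trained
# model might be (all numbers are made up)
blob = json.dumps({"Bus":    [0.9, 0.1, 0.0],
                   "Flight": [0.1, 0.8, 0.2],
                   "Taxi":   [0.0, 0.2, 0.9]})

model = json.loads(blob)  # in production: read from a file or S3

def predict(vector):
    # score each category as a dot product and return the best one
    scores = {cat: sum(w * x for w, x in zip(ws, vector))
              for cat, ws in model.items()}
    return max(scores, key=scores.get)

print(predict([0.0, 0.1, 0.9]))  # → Taxi
```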
  48. 48. Package, Build and Deploy, Option 1: package inference code, trained model and dependent libs with a web framework into a docker container; deploy to a container scheduler of your choice: EKS, ECS, OpenShift, Giant Swarm...
  49. 49. Package, Build and Deploy, Option 2: zip inference code, trained model and dependent libs; deploy to AWS Lambda
  50. 50. Application flow: receipt scan lands in an S3 bucket; an OCR service on ECS writes content.json; the Lambda reads it, runs inference, and adds metadata ({class: Bus}); the application reads the metadata
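The flow on this slide can be stubbed end-to-end in a few lines. S3 and the metadata store are stand-in dicts and the classifier is a stand-in rule; a real Lambda would use boto3 and take the bucket/key from its triggering event. All names and values here are made up.

```python
import json

# "S3" and the metadata store stubbed as dicts
bucket = {"receipts/0815/content.json":
          '{"words": ["Day", "Travelcard", "Ticket"]}'}
metadata = {}

def classify(words):
    # stand-in for the trained model from the training section
    return "Bus" if "Travelcard" in words else "Taxi"

def handler(event):
    key = event["key"]
    content = json.loads(bucket[key])      # read the OCR output from "S3"
    category = classify(content["words"])  # run inference
    metadata[key] = {"class": category}    # attach the result as metadata
    return metadata[key]

print(handler({"key": "receipts/0815/content.json"}))  # → {'class': 'Bus'}
```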
  51. 51. Takeaway(s)
  52. 52. Deep learning is accessible
  53. 53. Thank you! Questions? www.innoq.com innoQ Deutschland GmbH Krischerstr. 100 40789 Monheim am Rhein Germany +49 2173 3366-0 Ohlauer Str. 43 10999 Berlin Germany Ludwigstr. 180E 63067 Offenbach Germany Kreuzstr. 16 80331 München Germany Gewerbestr. 11 CH-6330 Cham Switzerland +41 41 743 01 11 Albulastr. 55 8048 Zürich Switzerland innoQ Schweiz GmbH Michael Perlin Michael.Perlin@innoq.com +49 178 7818063 @ttzt_mp
