In our presentation, Michaël and I will present our solution to tackle a particular problem. Michaël will focus on explaining the specific requirements of his office, while I will focus on how we solved the problem.
The client wanted to migrate specific data from one visualization platform to another in an automated way. The source platform communicated through a streaming platform of JSON messages, with all new information being added to this stream (about 20k-30k messages daily). Relevant messages needed to be filtered out of this data stream and transformed in a specific way, as defined by an Excel spreadsheet. This spreadsheet went beyond a simple one-to-one transformation: it defined which objects should be migrated and which attributes should be inherited. For instance, it even defined whether an attribute should get its value from an external table based on a certain join condition. This was just one example of the complex mapping we were facing.
Additionally, the spreadsheet needed to be dynamically adjustable, keeping it generic and future-proof, as the source system is in full expansion and offers customizable data models for each of its object classes. This means the Excel mapping sheet for the migration was not yet complete and would be expanded and modified in the future at minimal cost. We solved this problem with a set of five workspaces, all connected to each other through JobSubmitters and automations on FME Server, running on a scheduled basis. The mapping workspace dynamically analyzed which information needed to be migrated, and how, based on the external lookup table. The complex mapping was solved by building custom transformers for the task. In the process, the message queue was also cleaned, and multiple output data products were produced, each fit for the customer's needs, only after running validation tests to allow error handling and appropriate logging.
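The spreadsheet-driven filtering and mapping described above can be sketched in plain Python. Everything here is a hypothetical simplification: the rule table (`MAPPING_RULES`), the message shape, and the attribute names are invented for illustration; the real solution reads the rules from the Excel sheet and runs inside FME workspaces.

```python
import json

# Hypothetical mapping rules, as they might be read from the Excel sheet:
# each row says whether an object class is migrated and how its attributes
# are renamed on the target platform.
MAPPING_RULES = {
    "Valve": {"migrate": True, "attributes": {"diam_mm": "diameter"}},
    "Note":  {"migrate": False, "attributes": {}},
}

def transform(message: str) -> list[dict]:
    """Filter and remap the objects contained in one JSON message."""
    out = []
    for obj in json.loads(message)["objects"]:
        rule = MAPPING_RULES.get(obj["class"])
        if not rule or not rule["migrate"]:
            continue  # class unknown to the sheet, or flagged as not migrated
        mapped = {"class": obj["class"]}
        for src, dst in rule["attributes"].items():
            if src in obj:
                mapped[dst] = obj[src]  # rename attribute per the sheet
        out.append(mapped)
    return out

msg = json.dumps({"objects": [
    {"class": "Valve", "diam_mm": 80},
    {"class": "Note", "text": "ignore me"},
]})
print(transform(msg))  # only the Valve survives, with diam_mm renamed
```

Because the rules live in data rather than in the script, extending the mapping later means editing the sheet, not the workflow.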
5. The Peak of Data Integration 2023
Topic
Using simple spreadsheets to extract specific data from a big data platform.
Advantages?
• User-friendly: separates configuration from scripting
• Transparency: paying attention to functional design
• Future-proof: allows the source and target platforms to scale
• Limited to no adaptations needed to the script
6. Problem
● New business processes shorten the information management cycle
● One source platform replacing x different sources
● Data available through a message queue streaming JSON objects
● About 30k messages/day, highly variable in size
● 1 message contains 1-n objects
● ‘Last one wins’
● Automate ETL to uniformise and shorten data updates
● Allow for efficient future scaling
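The ‘last one wins’ rule above means that when several messages describe the same object, only the newest state matters. A minimal Python sketch, assuming a hypothetical message shape with an `id` per object and messages arriving in chronological order:

```python
def last_one_wins(messages):
    """Keep only the newest state per object id.

    Assumes each message carries a list of objects and that the
    stream is processed in arrival order (hypothetical shape).
    """
    latest = {}
    for msg in messages:
        for obj in msg["objects"]:
            latest[obj["id"]] = obj  # a later message overwrites earlier state
    return latest

stream = [
    {"objects": [{"id": 1, "status": "planned"}]},
    {"objects": [{"id": 1, "status": "built"}, {"id": 2, "status": "planned"}]},
]
result = last_one_wins(stream)  # object 1 ends up as "built"
```

Collapsing the stream this way also keeps the 30k-messages/day volume manageable: downstream steps only see one state per object.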
7. Difficulties
● How to set up a workflow fit for the job?
● Performant and robust in a non-controllable message stream volume
● How to make this generic & future-proof?
● Source and target platforms have different stakeholders and evolve independently
● How to do complex mapping?
● Finding the breakoff point of spreadsheet configuration
8. Difficulties
● How to deploy this on FME Form and different FME Flow environments?
● Creating a portable script on FME Form connecting to the source platform environments
● Scalability?
● Accounting for future scaling in the source as well as target platforms
25. Call to Action
1. Think further than the standard transformers
2. Make your process future-proof by making it as generic as possible
3. SchemaMapper cannot be configured with scripted parameters
4. @Value(@Value(Attribute)) is a handy trick for generic flows
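In FME, @Value(Attribute) reads the value of an attribute; nesting it as @Value(@Value(Attribute)) first reads Attribute to get the *name* of another attribute, then reads that one, so which attribute is read becomes data-driven. A rough Python analogue (the feature dictionary and attribute names are invented for illustration):

```python
feature = {
    "target_field": "diameter",  # the mapping sheet says which field to read
    "diameter": 80,
    "material": "PVC",
}

def value(feature, name):
    """Rough analogue of FME's @Value(): read an attribute by name."""
    return feature[name]

# @Value(@Value(target_field)): the inner lookup yields "diameter",
# the outer lookup yields 80 -- the attribute to read comes from the data.
indirect = value(feature, value(feature, "target_field"))
```

This double indirection is what lets one generic workflow serve object classes whose attribute schemas differ, as long as the lookup table names the right field per class.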