SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Beyond Lambda - how to get from logical to physical
Artur Borycki
Director Technology & Innovations
Simplification & Efficiency
Teradata believe in the principles of self-service,
automation and on-demand resource allocation.
These enable faster, more efficient and more
effective data application development and
operation.
‹#›
What is Lambda Architecture
Background
• Reference architecture for Big Data systems
• Designed by Nathan Marz (Twitter)
• Defined as a system that runs arbitrary functions on
arbitrary data
• “query = function(all data)”
Design Principles
• Human fault-tolerant, Immutability, Computable
Lambda Layers
• Batch - Contains the immutable, constantly growing master dataset.
• Speed - Deals only with new data and compensates for the high latency updates of the
serving layer.
• Serving - Loads and exposes the combined view of data so that they can be queried.
‹#›
Active Executor Lambda Framework
• The way this works is that an immutable sequence of records is captured and fed into a batch system and
a stream processing system in parallel.
• You implement your transformation logic twice, once in the batch system and once in the stream
processing system.
• You stitch together the results from both systems at query time to produce a complete answer.
Overall Architecture – Customer example
‹#›
Lambda alternative – Kappa? (Jay Kreps – Linkedin)
Unlike the Lambda Architecture, in this approach you only do
reprocessing when your processing code changes
1. Use Kafka or some other system that
will let you retain the full log of the data
you want to be able to reprocess and
that allows for multiple subscribers. For
example, if you want to reprocess up to
30 days of data, set your retention in
Kafka to 30 days.
2. When you want to do the reprocessing,
start a second instance of your stream
processing job that starts processing
from the beginning of the retained data,
but direct this output data to a new
output table.
3. When the second job has caught up,
switch the application to read from the
new table.
4. Stop the old version of the job, and
delete the old output table.
Real-time
Maturity
Typical path for a customer
Customers typically go through four stages on their
path to real-time analysis.
The evolution typically starts with trying to visualize
results or reports more frequently. This leads to the
realization that the underlying data is not refreshed
frequently. The next stage of maturity is to capture and
ingest data more quickly. Once data is flowing faster,
customers then try to process the data as it is flowing.
The final stage is to remove any human intervention.
‹#›
Events	
  /	
  
Interactions
Consumer	
   of	
  Information	
  
All	
  dataStreams
Other	
  feeds
Consumer	
   of	
  Information	
  
Discovery
Advance	
  
Analytics
Data	
  binding
Reporting
Beyond Lambda – Omega ;) (Artur vision)
• We need events that require actions
and interactions without much of the
analytics
• We need events that are requiring
action, but also they need to be
enhanced by the analytics in the
ecosystem (based on other information
sources)
• We need events that will be handled
later or they are supporting above cases
‹#›
The Teradata UDA
UNIFIED DATA ARCHITECTURE
Security, Workload Management
Applications
INTEGRATED DATA WAREHOUSE
DATA
PLATFORM
INTEGRATED DISCOVERY PLATFORM
Security, Workload ManagementREAL TIME PROCESSING
TERADATA
PORTFOLIO FOR
HADOOP
TERADATA DATABASE
TERADATA ASTER DATABASE
RESTFULAPI
LISTENINGFRAMEWORK
RESTFULAPI
APPFRAMEWORK
10
1
0
BEST
APP
EVER!!
Data  Service  APIs
Access  Data  on  Teradata,  Aster,  
Hadoop via  API  calls
Logging
Push  and  store  events  about  app  to  
UDA  logging  services
Ingest  /  Streaming  
Stream  data  into  UDA  and  build  
applications  on  near  real-­time  data
Scheduling  /  Orchestration
Scheduling  services  allow  devs to  
build  workflows  and  connect  apps.  
Search  &  Metadata
Expose  search  capabilities  in  your  
app  via  UDA  level  search  services.
WebKit
A  toolbox  of  UI  templates,  
visualizations  and  javascript libraries
Package/Deploy  &  Publish
A  simple  package  and  deployment  
application  to  launch  your  app  in  the  
AppCenter ecosystem
Operate
Leverage  monitoring  &  alerting  
services  to  keep  track  app  health.
Key Services, Libraries & Templates
UDA it a concept but also allows to be Development Platform
Instead of a single monolithic database
1
1
Monolith
A monolithic application
puts all of its
functionality into a
single process and
scales by replicating
the monolith on
multiple servers.
Microservices
A microservices
architecture puts each
element of functionality
into a separate service
and scales by
distributing these
services across
servers.
Decoupled Services
Scale by distributing services and replicating as needed
1
2
Monolithic App
A monolithic application
puts all of its
functionality into a single
process and scales by
replicating the
monolith on multiple
servers.
Microservices
A microservices
architecture puts each
elementoffunctionality
into a separate service
and scales by
distributing these
services across
servers.
Think Microservice, not Monolithic
‹#›
Access and move data between systems through service APIs
1
3
UDA
TD TD
INFRASTRUCTURE
DATA
SERVICES
REST API Call
Send Query
Execute Query
Send Response
Teradata Data Services
QueryGrid – Data Movement QueryGrid – Remote Execution
Foreign Table Select – Pass Thru
SELECT *
FROM FOREIGN TABLE (
select
parse_url(refer,'HOST') as host,
v.key as key,
ts as session_ts,
v.val, count(*) as count
from http_inline LATERAL VIEW
explode(str_to_map(parse_url(refer,'QUERY'),'&','=')) v as key,
val
where parse_url(refer, 'QUERY') is not null
group by parse_url(refer, 'HOST'), v.key, v.val
)@hdp21 hdp_dpi
WHERE
session_ts = current_date;
Push foreign grammar to remote.
Hadoop:permits Hive/Impala query for data reduction on non-partitioned
columns.
Import
SELECT source, session FROM
clickstream@Hadoop_sysd WHERE
session_ts =
ʻ‘2013-01-01ʼ’;
Can be used to:
– “Insert/select” & “create table as” to instantiate data locally.
– Joins always possible with local tables.
Export
INSERTINTO
cust_loc@Hadoop_sysd
SELECTcust_id,cust_zip
FROM cust_data
WHERE
last_update = current_date;
Move Data from Teradata to Hadoop
– And/or other Data Stores
15
The Data Lake – Customer slide
• This is not skating to where the puck is going to be - It’s skating to the puck.
– Your CIO should be sitting you on the bench if you are not doing this already
​Most Data Lakes Today
​Passive cheap storage
•Really only using HDFS
•Limited data governance
•Staging Data
•Archiving Data
•DW offload (cost drivers)
​The Data Lakes we Should be Building
​Active balanced nodes
•Using full Hadoop stack+
•Good data governance
•Good information architecture
•Processing and enhancing data
•Data applications (flexibility drivers)
16
New Architecture Architecture
• Information architecturesare distributed
– Focus on data and business questions, not integrating separate systems
• Application architectures are variable
– Don’t force applications into a single architecture
• Applications are Loosely Coupled
– DW is an application
– BI is an application (or many)
– Data applications are everywhere!
• But let’s be smart about it
– Still need strong information architecture and data management practices
– Still need to reduce complexity and make strategic choices on technology
17
© 2015 Teradata
Feature Store Application
GeneratingAnalytical Features
HDFS, Spark, ElasticSearch
CampaignApplication
Targeting Customers
Teradata
Customer Registry
Application
Registry Model of all Customers
Teradata
BI Application
General Query andAccess to Features
Teradata SQL, Microstrategy, Tableau
Customer Matching
Application
Associating Customers
Spark, Python, Scala, R
Loosely Coupled Data Applications
18
Customer– Framework Overview
B2B Gateway
SFTP SFTP SFTP
Flume job Flume job Flume job
Non-SLA Edge Flume DR Edge Flume SLA Edge Flume
Non-SLA HDFS DR HDFS SLA HDFS
Non-SLA HDFS DR HDFS SLA HDFS
Non-SLA Teradata DR Teradata SLA Teradata
External Systems
Customer Gate
Internal DAP
Platform Edge
Hadoop Raw
Processing
Hadoop Access
Teradata Backup/Restore
DistCP
QueryQridQueryQrid
19 © 2014 Teradata
Customer - Microservices Example
‹#›
INFRASTRUCTURE
QUERY GRID
TD 6xxx TD 1xxx ASTER HADOOP 1 HADOOP 2 LISTENER
Move data between systems & access through service APIs
2
0
App App App App AppApp
Data Pipeline
SERVICE LAYER
21
Customer example – Integration Flow
• User starts a Workflow fromthe UI which has a single Pig Job.
• Azkaban Web requests that the Azkaban Executor start a new Pig
Job.
• Pig Job makes a REST call to the TemplateModule to render the
Pig Template.
• TemplateModule fetches config values from the ConfigModule if
needed by the template. The ConfigModule in turn fetches config
values either fromthe PCF Data Schema or from externalsystems.
• TemplateModule renders the Pig Template and returns a
complete Pig Script.
• Pig Job executes the Pig Script against the Hadoop cluster.
• During the Pig Job executionit makes REST calls to the
EventModule informing about its progress.
• As the Job progress is updated Vertx updates the Azkaban Web UI
in real time.
• When the Pig Job has completedit makes a REST call to the
AuditModule to log its completion. The AuditModule in turn stores
auditing information in the PCF Data Schema.
• Finally the Pig Job returns its executionstatus back to the Azkaban
Executor.
MySQL
Azkaban
Web
Azkaban
Executor
KAFKA
Azkaban Bridge
Service
Config
Service
Template
Service
Teradata
Service
Event
Service
Audit
Service
Pig Job
Hadoop
PCF
Pig
Templ
Pig
Script
JSON
REST
22
Customer – Event Flow
KAFKA
Flume Event
Module
Teradata
Service
JSON
Azkaban Web
Azkaban
Executor
HDFS Event
Module
HBase Event
Module
HCatalog
Event
Module
REST
hdfs.filelanded flume.filelanded hdfs.filelanded hbase.tabl
emodified
Hcatalog.n
ewpartition
Hcatalog.n
ewpartition
23
Customer – Docker services
Azkaban Nginx Services
LogStash
Tessera
/Graphite
Consul
Consul Consul
Ambassadord
Container Third Party Used For
Nginx No Front end web server/proxy for all the other UIs.
Vert.x No Application server.
Azkaban Yes Workflow management for Hadoop, Teradata etc.
Tessera/Graphite No Aggregating and displaying applications and system level metrics
LogStash Yes Aggregating and displaying application and system level logs
Consul Yes Distributed key value store used for Service Descovery
Ambassadord Yes Makes it easier for Docker containers to access services hosted in
other Docker containers
Tap into the power of the platform without duplicating effort
YOUR ANALYTIC APP
MICRO SERVICES FRAMEWORK
ASTER DATA
SERVICES
…TD DATA
SERVICES
HIVE DATA
SERVICES
AUTH
SERVICES
Easily Access UDA
‹#›
Extract, Load & Transform in the Layered Architecture
2
5
Level 0
Aggregation
Business Unit Specific Rollups
Calculation
Key Performance Indicators
Level 3
Level 2
Level 1
Integration
Integrated Model at Lowest
Granularity
Staging
1:1 Source Systems
EXTRACT
LOAD
TRANSFORM
APP CENTER
LISTENER
…
BUSINESS
HEALTH
WORKLOAD
ANALYTICS
MEMBER
SEGMENT
ENGINE
CATEGORY
SALES
DAILY
FINANCIALS
UDA & the LDA
Questions…
27
THANK YOU

Weitere ähnliche Inhalte

Was ist angesagt?

Jump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksJump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksAnyscale
 
The Future of Hadoop: A deeper look at Apache Spark
The Future of Hadoop: A deeper look at Apache SparkThe Future of Hadoop: A deeper look at Apache Spark
The Future of Hadoop: A deeper look at Apache SparkCloudera, Inc.
 
The Microsoft BigData Story
The Microsoft BigData StoryThe Microsoft BigData Story
The Microsoft BigData StoryLynn Langit
 
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010Data Applications and Infrastructure at LinkedIn__HadoopSummit2010
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010Yahoo Developer Network
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseDataWorks Summit
 
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop WarehouseData Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop WarehouseDataWorks Summit
 
Solution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline AcceleratorSolution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline AcceleratorBlueData, Inc.
 
Hadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseHadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseDataWorks Summit
 
Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)
Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)
Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)Spark Summit
 
Querying Druid in SQL with Superset
Querying Druid in SQL with SupersetQuerying Druid in SQL with Superset
Querying Druid in SQL with SupersetDataWorks Summit
 
Streaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache KafkaStreaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache KafkaAttunity
 
Cisco connect toronto 2015 big data sean mc keown
Cisco connect toronto 2015 big data  sean mc keownCisco connect toronto 2015 big data  sean mc keown
Cisco connect toronto 2015 big data sean mc keownCisco Canada
 
Data Platform at Twitter: Enabling Real-time & Batch Analytics at Scale
Data Platform at Twitter: Enabling Real-time & Batch Analytics at ScaleData Platform at Twitter: Enabling Real-time & Batch Analytics at Scale
Data Platform at Twitter: Enabling Real-time & Batch Analytics at ScaleSriram Krishnan
 
Introducing a horizontally scalable, inference-based business Rules Engine fo...
Introducing a horizontally scalable, inference-based business Rules Engine fo...Introducing a horizontally scalable, inference-based business Rules Engine fo...
Introducing a horizontally scalable, inference-based business Rules Engine fo...Cask Data
 
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...Renato Bonomini
 

Was ist angesagt? (20)

Jump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with DatabricksJump Start on Apache Spark 2.2 with Databricks
Jump Start on Apache Spark 2.2 with Databricks
 
Accelerating Data Warehouse Modernization
Accelerating Data Warehouse ModernizationAccelerating Data Warehouse Modernization
Accelerating Data Warehouse Modernization
 
The Future of Hadoop: A deeper look at Apache Spark
The Future of Hadoop: A deeper look at Apache SparkThe Future of Hadoop: A deeper look at Apache Spark
The Future of Hadoop: A deeper look at Apache Spark
 
The Microsoft BigData Story
The Microsoft BigData StoryThe Microsoft BigData Story
The Microsoft BigData Story
 
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010Data Applications and Infrastructure at LinkedIn__HadoopSummit2010
Data Applications and Infrastructure at LinkedIn__HadoopSummit2010
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
 
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop WarehouseData Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
Data Driving Yahoo Mail Growth and Evolution with a 50 PB Hadoop Warehouse
 
Solution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline AcceleratorSolution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline Accelerator
 
Event Driven Architecture
Event Driven ArchitectureEvent Driven Architecture
Event Driven Architecture
 
Continuous Analytics & Optimisation using Apache Spark (Big Data Analytics, L...
Continuous Analytics & Optimisation using Apache Spark (Big Data Analytics, L...Continuous Analytics & Optimisation using Apache Spark (Big Data Analytics, L...
Continuous Analytics & Optimisation using Apache Spark (Big Data Analytics, L...
 
Data streaming fundamentals
Data streaming fundamentalsData streaming fundamentals
Data streaming fundamentals
 
Hadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data WarehouseHadoop and Enterprise Data Warehouse
Hadoop and Enterprise Data Warehouse
 
Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)
Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)
Spark-on-Yarn: The Road Ahead-(Marcelo Vanzin, Cloudera)
 
Querying Druid in SQL with Superset
Querying Druid in SQL with SupersetQuerying Druid in SQL with Superset
Querying Druid in SQL with Superset
 
Lambda-less Stream Processing @Scale in LinkedIn
Lambda-less Stream Processing @Scale in LinkedIn Lambda-less Stream Processing @Scale in LinkedIn
Lambda-less Stream Processing @Scale in LinkedIn
 
Streaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache KafkaStreaming Data Ingest and Processing with Apache Kafka
Streaming Data Ingest and Processing with Apache Kafka
 
Cisco connect toronto 2015 big data sean mc keown
Cisco connect toronto 2015 big data  sean mc keownCisco connect toronto 2015 big data  sean mc keown
Cisco connect toronto 2015 big data sean mc keown
 
Data Platform at Twitter: Enabling Real-time & Batch Analytics at Scale
Data Platform at Twitter: Enabling Real-time & Batch Analytics at ScaleData Platform at Twitter: Enabling Real-time & Batch Analytics at Scale
Data Platform at Twitter: Enabling Real-time & Batch Analytics at Scale
 
Introducing a horizontally scalable, inference-based business Rules Engine fo...
Introducing a horizontally scalable, inference-based business Rules Engine fo...Introducing a horizontally scalable, inference-based business Rules Engine fo...
Introducing a horizontally scalable, inference-based business Rules Engine fo...
 
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
Capacity Management and BigData/Hadoop - Hitchhiker's guide for the Capacity ...
 

Andere mochten auch

Clase 1 imagen pública fundamentos
Clase 1 imagen pública fundamentosClase 1 imagen pública fundamentos
Clase 1 imagen pública fundamentosRodrigo Ga
 
4 презентация офисные стулья
4 презентация офисные стулья4 презентация офисные стулья
4 презентация офисные стульяkolesikmixer
 
18 4 products catalog-2014_gp_sofa
18 4 products catalog-2014_gp_sofa18 4 products catalog-2014_gp_sofa
18 4 products catalog-2014_gp_sofakolesikmixer
 
7 презентация антилия
7 презентация антилия7 презентация антилия
7 презентация антилияkolesikmixer
 
ICI Project 1 Brief
ICI Project 1 BriefICI Project 1 Brief
ICI Project 1 BriefCrystal Chia
 
The Funnel & The Path
The Funnel & The PathThe Funnel & The Path
The Funnel & The PathKyle Kostick
 
ArcGIS_Collector_Field_Guide_201411204_
ArcGIS_Collector_Field_Guide_201411204_ArcGIS_Collector_Field_Guide_201411204_
ArcGIS_Collector_Field_Guide_201411204_Sacha Selim
 
robotics-law-journal_01_july_2015
robotics-law-journal_01_july_2015robotics-law-journal_01_july_2015
robotics-law-journal_01_july_2015Joseph Urli
 
30 Excel Tips in 30 Minutes
30 Excel Tips in 30 Minutes30 Excel Tips in 30 Minutes
30 Excel Tips in 30 MinutesQS-360training
 
Senior software engineer with product experience
Senior software engineer with product experienceSenior software engineer with product experience
Senior software engineer with product experienceSrikant Mukundan
 
9 презентация зонгли
9 презентация зонгли9 презентация зонгли
9 презентация зонглиkolesikmixer
 
Shipping industry-guidance-on-pilot-transfer-arrangements
Shipping industry-guidance-on-pilot-transfer-arrangementsShipping industry-guidance-on-pilot-transfer-arrangements
Shipping industry-guidance-on-pilot-transfer-arrangementsblacks0u1
 
16 презентация ковры
16 презентация ковры16 презентация ковры
16 презентация коврыkolesikmixer
 

Andere mochten auch (18)

Clase 1 imagen pública fundamentos
Clase 1 imagen pública fundamentosClase 1 imagen pública fundamentos
Clase 1 imagen pública fundamentos
 
4 презентация офисные стулья
4 презентация офисные стулья4 презентация офисные стулья
4 презентация офисные стулья
 
18 4 products catalog-2014_gp_sofa
18 4 products catalog-2014_gp_sofa18 4 products catalog-2014_gp_sofa
18 4 products catalog-2014_gp_sofa
 
7 презентация антилия
7 презентация антилия7 презентация антилия
7 презентация антилия
 
ICI Project 1 Brief
ICI Project 1 BriefICI Project 1 Brief
ICI Project 1 Brief
 
project showcase
project showcaseproject showcase
project showcase
 
The Funnel & The Path
The Funnel & The PathThe Funnel & The Path
The Funnel & The Path
 
Service competencies
Service competenciesService competencies
Service competencies
 
ArcGIS_Collector_Field_Guide_201411204_
ArcGIS_Collector_Field_Guide_201411204_ArcGIS_Collector_Field_Guide_201411204_
ArcGIS_Collector_Field_Guide_201411204_
 
robotics-law-journal_01_july_2015
robotics-law-journal_01_july_2015robotics-law-journal_01_july_2015
robotics-law-journal_01_july_2015
 
ColorCuboid_2015
ColorCuboid_2015ColorCuboid_2015
ColorCuboid_2015
 
30 Excel Tips in 30 Minutes
30 Excel Tips in 30 Minutes30 Excel Tips in 30 Minutes
30 Excel Tips in 30 Minutes
 
1002FRCNotebook
1002FRCNotebook1002FRCNotebook
1002FRCNotebook
 
Senior software engineer with product experience
Senior software engineer with product experienceSenior software engineer with product experience
Senior software engineer with product experience
 
Paginas de Web
Paginas de WebPaginas de Web
Paginas de Web
 
9 презентация зонгли
9 презентация зонгли9 презентация зонгли
9 презентация зонгли
 
Shipping industry-guidance-on-pilot-transfer-arrangements
Shipping industry-guidance-on-pilot-transfer-arrangementsShipping industry-guidance-on-pilot-transfer-arrangements
Shipping industry-guidance-on-pilot-transfer-arrangements
 
16 презентация ковры
16 презентация ковры16 презентация ковры
16 презентация ковры
 

Ähnlich wie Artur Borycki - Beyond Lambda - how to get from logical to physical - code.talk.2015

Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Precisely
 
Otimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft AzureOtimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft AzureLuan Moreno Medeiros Maciel
 
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyModernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyAlluxio, Inc.
 
SQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for ImpalaSQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for Impalamarkgrover
 
Big Data_Architecture.pptx
Big Data_Architecture.pptxBig Data_Architecture.pptx
Big Data_Architecture.pptxbetalab
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduCloudera, Inc.
 
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & TableauBig Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & TableauSam Palani
 
Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Apache Apex
 
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Dataconomy Media
 
Spark Summit EU talk by Mike Percy
Spark Summit EU talk by Mike PercySpark Summit EU talk by Mike Percy
Spark Summit EU talk by Mike PercySpark Summit
 
Building Data Pipelines on AWS
Building Data Pipelines on AWSBuilding Data Pipelines on AWS
Building Data Pipelines on AWSrudolf eremyan
 
Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshConfluentInc1
 
Whitepaper tableau for-the-enterprise-0
Whitepaper tableau for-the-enterprise-0Whitepaper tableau for-the-enterprise-0
Whitepaper tableau for-the-enterprise-0alok khobragade
 
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...Codemotion
 
Confluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern AnalyticsConfluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern Analyticsconfluent
 
Apache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query ProcessingApache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query ProcessingBikas Saha
 
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...VMware Tanzu
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Cécile Poyet
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Hortonworks
 

Ähnlich wie Artur Borycki - Beyond Lambda - how to get from logical to physical - code.talk.2015 (20)

Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
 
Real time analytics
Real time analyticsReal time analytics
Real time analytics
 
Otimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft AzureOtimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
Otimizações de Projetos de Big Data, Dw e AI no Microsoft Azure
 
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyModernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
 
SQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for ImpalaSQL Engines for Hadoop - The case for Impala
SQL Engines for Hadoop - The case for Impala
 
Big Data_Architecture.pptx
Big Data_Architecture.pptxBig Data_Architecture.pptx
Big Data_Architecture.pptx
 
Simplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache KuduSimplifying Real-Time Architectures for IoT with Apache Kudu
Simplifying Real-Time Architectures for IoT with Apache Kudu
 
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & TableauBig Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
 
Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex
 
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
 
Spark Summit EU talk by Mike Percy
Spark Summit EU talk by Mike PercySpark Summit EU talk by Mike Percy
Spark Summit EU talk by Mike Percy
 
Building Data Pipelines on AWS
Building Data Pipelines on AWSBuilding Data Pipelines on AWS
Building Data Pipelines on AWS
 
Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data Mesh
 
Whitepaper tableau for-the-enterprise-0
Whitepaper tableau for-the-enterprise-0Whitepaper tableau for-the-enterprise-0
Whitepaper tableau for-the-enterprise-0
 
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
Search on the fly: how to lighten your Big Data - Simona Russo, Auro Rolle - ...
 
Confluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern AnalyticsConfluent & Attunity: Mainframe Data Modern Analytics
Confluent & Attunity: Mainframe Data Modern Analytics
 
Apache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query ProcessingApache Tez : Accelerating Hadoop Query Processing
Apache Tez : Accelerating Hadoop Query Processing
 
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
Delivering the power of data using Spring Cloud DataFlow and DataStax Enterpr...
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
 

Mehr von AboutYouGmbH

Tech talk 01.06.2017
Tech talk 01.06.2017Tech talk 01.06.2017
Tech talk 01.06.2017AboutYouGmbH
 
Retention Strategies in Mobile E-Commerce
Retention Strategies in Mobile E-CommerceRetention Strategies in Mobile E-Commerce
Retention Strategies in Mobile E-CommerceAboutYouGmbH
 
Rethinking Fashion E-Commerce
Rethinking Fashion E-CommerceRethinking Fashion E-Commerce
Rethinking Fashion E-CommerceAboutYouGmbH
 
ABOUT YOU get on board
ABOUT YOU get on boardABOUT YOU get on board
ABOUT YOU get on boardAboutYouGmbH
 
Niels Leenheer - Weird browsers - code.talks 2015
Niels Leenheer - Weird browsers - code.talks 2015Niels Leenheer - Weird browsers - code.talks 2015
Niels Leenheer - Weird browsers - code.talks 2015AboutYouGmbH
 
Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....
Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....
Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....AboutYouGmbH
 
Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...
Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...
Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...AboutYouGmbH
 
Lars Jankowfsky - Learn or Die - code.talks 2015
Lars Jankowfsky - Learn or Die - code.talks 2015Lars Jankowfsky - Learn or Die - code.talks 2015
Lars Jankowfsky - Learn or Die - code.talks 2015AboutYouGmbH
 
Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...
Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...
Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...AboutYouGmbH
 
Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...
Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...
Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...AboutYouGmbH
 
Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...
Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...
Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...AboutYouGmbH
 
Kai Voigt - Big Data zum Anfassen - code.talks 2015
Kai Voigt - Big Data zum Anfassen - code.talks 2015Kai Voigt - Big Data zum Anfassen - code.talks 2015
Kai Voigt - Big Data zum Anfassen - code.talks 2015AboutYouGmbH
 
Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...
Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...
Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...AboutYouGmbH
 
Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...
Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...
Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...AboutYouGmbH
 
Wolfram Kriesing - EcmaScript6 for real - code.talks 2015
Wolfram Kriesing - EcmaScript6 for real - code.talks 2015Wolfram Kriesing - EcmaScript6 for real - code.talks 2015
Wolfram Kriesing - EcmaScript6 for real - code.talks 2015AboutYouGmbH
 
Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c...
 Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c... Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c...
Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c...AboutYouGmbH
 
Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015
 Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015 Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015
Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015AboutYouGmbH
 
Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ...
 Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ... Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ...
Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ...AboutYouGmbH
 
Bernhard Wick - appserver.io - code.talks 2015
 Bernhard Wick - appserver.io - code.talks 2015 Bernhard Wick - appserver.io - code.talks 2015
Bernhard Wick - appserver.io - code.talks 2015AboutYouGmbH
 
Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal...
 Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal... Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal...
Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal...AboutYouGmbH
 

Mehr von AboutYouGmbH (20)

Tech talk 01.06.2017
Tech talk 01.06.2017Tech talk 01.06.2017
Tech talk 01.06.2017
 
Retention Strategies in Mobile E-Commerce
Retention Strategies in Mobile E-CommerceRetention Strategies in Mobile E-Commerce
Retention Strategies in Mobile E-Commerce
 
Rethinking Fashion E-Commerce
Rethinking Fashion E-CommerceRethinking Fashion E-Commerce
Rethinking Fashion E-Commerce
 
ABOUT YOU get on board
ABOUT YOU get on boardABOUT YOU get on board
ABOUT YOU get on board
 
Niels Leenheer - Weird browsers - code.talks 2015
Niels Leenheer - Weird browsers - code.talks 2015Niels Leenheer - Weird browsers - code.talks 2015
Niels Leenheer - Weird browsers - code.talks 2015
 
Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....
Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....
Dennis Benkert & Matthias Lübken - Patterns in a containerized world? - code....
 
Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...
Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...
Stefan Richter - Writing simple, readable and robust code: Examples in Java, ...
 
Lars Jankowfsky - Learn or Die - code.talks 2015
Lars Jankowfsky - Learn or Die - code.talks 2015Lars Jankowfsky - Learn or Die - code.talks 2015
Lars Jankowfsky - Learn or Die - code.talks 2015
 
Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...
Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...
Dr. Jeremias Rößler - Wenn Affen Testen - Das Ende der Bananensoftware - code...
 
Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...
Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...
Zeljko Kvesic - Scrum in verteilten Teams / Agil über die Landesgrenzen - cod...
 
Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...
Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...
Uwe Friedrichsen - CRDT und mehr - über extreme Verfügbarkeit und selbstheile...
 
Kai Voigt - Big Data zum Anfassen - code.talks 2015
Kai Voigt - Big Data zum Anfassen - code.talks 2015Kai Voigt - Big Data zum Anfassen - code.talks 2015
Kai Voigt - Big Data zum Anfassen - code.talks 2015
 
Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...
Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...
Dr. Andreas Lattner - Aufsetzen skalierbarer Prognose- und Analysedienste mit...
 
Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...
Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...
Marcel Hild - Spryker (e)commerce framework als Alternative zu traditioneller...
 
Wolfram Kriesing - EcmaScript6 for real - code.talks 2015
Wolfram Kriesing - EcmaScript6 for real - code.talks 2015Wolfram Kriesing - EcmaScript6 for real - code.talks 2015
Wolfram Kriesing - EcmaScript6 for real - code.talks 2015
 
Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c...
 Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c... Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c...
Stefanie Grewenig & Johannes Thönes - Internet ausdrucken mit JavaScript - c...
 
Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015
 Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015 Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015
Alex Korotkikh - From 0 to N: Lessons Learned - code.talks 2015
 
Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ...
 Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ... Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ...
Christian Haider & Helge Nowak - Mehr Demokratie durch Haushaltstransparenz ...
 
Bernhard Wick - appserver.io - code.talks 2015
 Bernhard Wick - appserver.io - code.talks 2015 Bernhard Wick - appserver.io - code.talks 2015
Bernhard Wick - appserver.io - code.talks 2015
 
Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal...
 Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal... Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal...
Moritz Siuts & Robert von Massow - Data Pipeline mit Apache Kafka - code.tal...
 

Kürzlich hochgeladen

20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdfMatthew Sinclair
 
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrStory Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrHenryBriggs2
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC
 
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime NagercoilNagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoilmeghakumariji156
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdfMatthew Sinclair
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC
 
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...gajnagarg
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsMonica Sydney
 
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime BalliaBallia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Balliameghakumariji156
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.krishnachandrapal52
 
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac RoomVip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Roommeghakumariji156
 
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制pxcywzqs
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdfMatthew Sinclair
 
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiAbu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiMonica Sydney
 
Call girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsCall girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsMonica Sydney
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirtrahman018755
 
Mira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call GirlsMira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call GirlsPriya Reddy
 
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...kumargunjan9515
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfJOHNBEBONYAP1
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理F
 

Kürzlich hochgeladen (20)

20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf
 
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrStory Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
Story Board.pptxrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
 
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime NagercoilNagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
Nagercoil Escorts Service Girl ^ 9332606886, WhatsApp Anytime Nagercoil
 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Dindigul [ 7014168258 ] Call Me For Genuine Models ...
 
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi EscortsIndian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
Indian Escort in Abu DHabi 0508644382 Abu Dhabi Escorts
 
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime BalliaBallia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
Ballia Escorts Service Girl ^ 9332606886, WhatsApp Anytime Ballia
 
Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.Meaning of On page SEO & its process in detail.
Meaning of On page SEO & its process in detail.
 
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac RoomVip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
 
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu DhabiAbu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
Abu Dhabi Escorts Service 0508644382 Escorts in Abu Dhabi
 
Call girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girlsCall girls Service in Ajman 0505086370 Ajman call girls
Call girls Service in Ajman 0505086370 Ajman call girls
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirt
 
Mira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call GirlsMira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
Mira Road Housewife Call Girls 07506202331, Nalasopara Call Girls
 
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...Local Call Girls in Seoni  9332606886 HOT & SEXY Models beautiful and charmin...
Local Call Girls in Seoni 9332606886 HOT & SEXY Models beautiful and charmin...
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理
 

Artur Borycki - Beyond Lambda - how to get from logical to physical - code.talk.2015

  • 1. Beyond Lambda - how to get from logical to physical Artur Borycki Director Technology & Innovations
  • 2. Simplification & Efficiency Teradata believe in the principles of self-service, automation and on-demand resource allocation. These enable faster, more efficient and more effective data application development and operation.
  • 3. ‹#› What is Lambda Architecture Background • Reference architecture for Big Data systems • Designed by Nathan Marz (Twitter) • Defined as a system that runs arbitrary functions on arbitrary data • “query = function(all data)” Design Principles • Human fault-tolerant, Immutability, Computable Lambda Layers • Batch - Contains the immutable, constantly growing master dataset. • Speed - Deals only with new data and compensates for the high latency updates of the serving layer. • Serving - Loads and exposes the combined view of data so that they can be queried.
  • 4. ‹#› Active Executor Lambda Framework • The way this works is that an immutable sequence of records is captured and fed into a batch system and a stream processing system in parallel. • You implement your transformation logic twice, once in the batch system and once in the stream processing system. • You stitch together the results from both systems at query time to produce a complete answer.
  • 5. Overall Architecture – Customer example
  • 6. ‹#› Lambda alternative – Kappa? (Jay Kreps – Linkedin) Unlike the Lambda Architecture, in this approach you only do reprocessing when your processing code changes 1. Use Kafka or some other system that will let you retain the full log of the data you want to be able to reprocess and that allows for multiple subscribers. For example, if you want to reprocess up to 30 days of data, set your retention in Kafka to 30 days. 2. When you want to do the reprocessing, start a second instance of your stream processing job that starts processing from the beginning of the retained data, but direct this output data to a new output table. 3. When the second job has caught up, switch the application to read from the new table. 4. Stop the old version of the job, and delete the old output table.
  • 7. Real-time Maturity Typical path for a customer Customers typically go through four stages on their path to real-time analysis. The evolution typically starts with trying to visualize results or reports more frequently. This leads to the realization that the underlying data is not refreshed frequently. The next stage of maturity is to capture and ingest data more quickly. Once data is flowing faster, customers then try to process the data as it is flowing. The final stage is to remove any human intervention.
  • 8. ‹#› Events  /   Interactions Consumer   of  Information   All  dataStreams Other  feeds Consumer   of  Information   Discovery Advance   Analytics Data  binding Reporting Beyond Lambda – Omega ;) (Artur vision) • We need events that require actions and interactions without much of the analytics • We need events that are requiring action, but also they need to be enhanced by the analytics in the ecosystem (based on other information sources) • We need events that will be handled later or they are supporting above cases
  • 9. ‹#› The Teradata UDA UNIFIED DATA ARCHITECTURE Security, Workload Management Applications INTEGRATED DATA WAREHOUSE DATA PLATFORM INTEGRATED DISCOVERY PLATFORM Security, Workload ManagementREAL TIME PROCESSING TERADATA PORTFOLIO FOR HADOOP TERADATA DATABASE TERADATA ASTER DATABASE RESTFULAPI LISTENINGFRAMEWORK RESTFULAPI APPFRAMEWORK
  • 10. 10 1 0 BEST APP EVER!! Data  Service  APIs Access  Data  on  Teradata,  Aster,   Hadoop via  API  calls Logging Push  and  store  events  about  app  to   UDA  logging  services Ingest  /  Streaming   Stream  data  into  UDA  and  build   applications  on  near  real-­time  data Scheduling  /  Orchestration Scheduling  services  allow  devs to   build  workflows  and  connect  apps.   Search  &  Metadata Expose  search  capabilities  in  your   app  via  UDA  level  search  services. WebKit A  toolbox  of  UI  templates,   visualizations  and  javascript libraries Package/Deploy  &  Publish A  simple  package  and  deployment   application  to  launch  your  app  in  the   AppCenter ecosystem Operate Leverage  monitoring  &  alerting   services  to  keep  track  app  health. Key Services, Libraries & Templates UDA it a concept but also allows to be Development Platform
  • 11. Instead of a single monolithic database 1 1 Monolith A monolithic application puts all of its functionality into a single process and scales by replicating the monolith on multiple servers. Microservices A microservices architecture puts each element of functionality into a separate service and scales by distributing these services across servers. Decoupled Services
  • 12. Scale by distributing services and replicating as needed 1 2 Monolithic App A monolithic application puts all of its functionality into a single process and scales by replicating the monolith on multiple servers. Microservices A microservices architecture puts each elementoffunctionality into a separate service and scales by distributing these services across servers. Think Microservice, not Monolithic
  • 13. ‹#› Access and move data between systems through service APIs 1 3 UDA TD TD INFRASTRUCTURE DATA SERVICES REST API Call Send Query Execute Query Send Response Teradata Data Services
  • 14. QueryGrid – Data Movement QueryGrid – Remote Execution Foreign Table Select – Pass Thru SELECT * FROM FOREIGN TABLE ( select parse_url(refer,'HOST') as host, v.key as key, ts as session_ts, v.val, count(*) as count from http_inline LATERAL VIEW explode(str_to_map(parse_url(refer,'QUERY'),'&','=')) v as key, val where parse_url(refer, 'QUERY') is not null group by parse_url(refer, 'HOST'), v.key, v.val )@hdp21 hdp_dpi WHERE session_ts = current_date; Push foreign grammar to remote. Hadoop:permits Hive/Impala query for data reduction on non-partitioned columns. Import SELECT source, session FROM clickstream@Hadoop_sysd WHERE session_ts = ʻ‘2013-01-01ʼ’; Can be used to: – “Insert/select” & “create table as” to instantiate data locally. – Joins always possible with local tables. Export INSERTINTO cust_loc@Hadoop_sysd SELECTcust_id,cust_zip FROM cust_data WHERE last_update = current_date; Move Data from Teradata to Hadoop – And/or other Data Stores
  • 15. 15 The Data Lake – Customer slide • This is not skating to where the puck is going to be - It’s skating to the puck. – Your CIO should be sitting you on the bench if you are not doing this already ​Most Data Lakes Today ​Passive cheap storage •Really only using HDFS •Limited data governance •Staging Data •Archiving Data •DW offload (cost drivers) ​The Data Lakes we Should be Building ​Active balanced nodes •Using full Hadoop stack+ •Good data governance •Good information architecture •Processing and enhancing data •Data applications (flexibility drivers)
  • 16. 16 New Architecture Architecture • Information architecturesare distributed – Focus on data and business questions, not integrating separate systems • Application architectures are variable – Don’t force applications into a single architecture • Applications are Loosely Coupled – DW is an application – BI is an application (or many) – Data applications are everywhere! • But let’s be smart about it – Still need strong information architecture and data management practices – Still need to reduce complexity and make strategic choices on technology
  • 17. 17 © 2015 Teradata Feature Store Application GeneratingAnalytical Features HDFS, Spark, ElasticSearch CampaignApplication Targeting Customers Teradata Customer Registry Application Registry Model of all Customers Teradata BI Application General Query andAccess to Features Teradata SQL, Microstrategy, Tableau Customer Matching Application Associating Customers Spark, Python, Scala, R Loosely Coupled Data Applications
  • 18. 18 Customer– Framework Overview B2B Gateway SFTP SFTP SFTP Flume job Flume job Flume job Non-SLA Edge Flume DR Edge Flume SLA Edge Flume Non-SLA HDFS DR HDFS SLA HDFS Non-SLA HDFS DR HDFS SLA HDFS Non-SLA Teradata DR Teradata SLA Teradata External Systems Customer Gate Internal DAP Platform Edge Hadoop Raw Processing Hadoop Access Teradata Backup/Restore DistCP QueryQridQueryQrid
  • 19. 19 © 2014 Teradata Customer - Microservices Example
  • 20. ‹#› INFRASTRUCTURE QUERY GRID TD 6xxx TD 1xxx ASTER HADOOP 1 HADOOP 2 LISTENER Move data between systems & access through service APIs 2 0 App App App App AppApp Data Pipeline SERVICE LAYER
  • 21. 21 Customer example – Integration Flow • User starts a Workflow fromthe UI which has a single Pig Job. • Azkaban Web requests that the Azkaban Executor start a new Pig Job. • Pig Job makes a REST call to the TemplateModule to render the Pig Template. • TemplateModule fetches config values from the ConfigModule if needed by the template. The ConfigModule in turn fetches config values either fromthe PCF Data Schema or from externalsystems. • TemplateModule renders the Pig Template and returns a complete Pig Script. • Pig Job executes the Pig Script against the Hadoop cluster. • During the Pig Job executionit makes REST calls to the EventModule informing about its progress. • As the Job progress is updated Vertx updates the Azkaban Web UI in real time. • When the Pig Job has completedit makes a REST call to the AuditModule to log its completion. The AuditModule in turn stores auditing information in the PCF Data Schema. • Finally the Pig Job returns its executionstatus back to the Azkaban Executor. MySQL Azkaban Web Azkaban Executor KAFKA Azkaban Bridge Service Config Service Template Service Teradata Service Event Service Audit Service Pig Job Hadoop PCF Pig Templ Pig Script JSON REST
  • 22. 22 Customer – Event Flow KAFKA Flume Event Module Teradata Service JSON Azkaban Web Azkaban Executor HDFS Event Module HBase Event Module HCatalog Event Module REST hdfs.filelanded flume.filelanded hdfs.filelanded hbase.tabl emodified Hcatalog.n ewpartition Hcatalog.n ewpartition
  • 23. 23 Customer – Docker services Azkaban Nginx Services LogStash Tessera /Graphite Consul Consul Consul Ambassadord Container Third Party Used For Nginx No Front end web server/proxy for all the other UIs. Vert.x No Application server. Azkaban Yes Workflow management for Hadoop, Teradata etc. Tessera/Graphite No Aggregating and displaying applications and system level metrics LogStash Yes Aggregating and displaying application and system level logs Consul Yes Distributed key value store used for Service Descovery Ambassadord Yes Makes it easier for Docker containers to access services hosted in other Docker containers
  • 24. Tap into the power of the platform without duplicating effort YOUR ANALYTIC APP MICRO SERVICES FRAMEWORK ASTER DATA SERVICES …TD DATA SERVICES HIVE DATA SERVICES AUTH SERVICES Easily Access UDA
  • 25. ‹#› Extract, Load & Transform in the Layered Architecture 2 5 Level 0 Aggregation Business Unit Specific Rollups Calculation Key Performance Indicators Level 3 Level 2 Level 1 Integration Integrated Model at Lowest Granularity Staging 1:1 Source Systems EXTRACT LOAD TRANSFORM APP CENTER LISTENER … BUSINESS HEALTH WORKLOAD ANALYTICS MEMBER SEGMENT ENGINE CATEGORY SALES DAILY FINANCIALS UDA & the LDA