SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Apache TrafodionTM (incubating)
Enterprise-Class Transactional
SQL-on-Hadoop DBMS
trafodion.incubator.apache.org
1
The information contained herein is subject to change without notice.2
Brief Bio
2
Rohit Jain
CTO, Esgyn
rohit.jain@esgyn.com
Rohit Jain is the CTO at Esgyn working on Apache TrafodionTM,
currently in incubation. Trafodion is a transactional SQL-on-HBase
RDBMS. Rohit worked for Tandem, Compaq, and Hewlett-Packard for
the last 28 of his 39 years in application and database development.
He has worked as an application developer, solutions architect,
consultant, software engineer, database architect, development and QA
manager, Product Manager, and Chief Technologist. His experience
spans Online Transaction Processing, Operational Data Stores, Data
Marts, Enterprise Data Warehouses, Business Intelligence, and
Advanced Analytics, on distributed massively parallel systems.
The information contained herein is subject to change without notice.3
Apache TrafodionTM
Rides the unstoppable Apache HadoopTM wave!
Transforms how companies store, process, and share big data
Affordable performance, elastic scalability, availability
Open source project - downloadable for free
Apache TrafodionTM is currently undergoing Incubation at the
Apache Software Foundation
Eliminates vendor lock-in and licensing fees
Leverages community development resources and speed
Schema flexibility and multi-structured data
Capturing and storing all data for all business functions
Full-function ANSI SQL with JDBC/ODBC access
Leverages existing SQL skills, tools, & apps for productivity
Distributed ACID transaction protection
Data consistency across multiple rows, tables, SQL statements
Targeted for operational workloads!
Optimized for real-time transaction processing applications,
operational reporting, and Operational Data Stores (ODS),
needing sub-second response times at high levels of concurrency
Data federation: Trafodion/HBase/Hive tables
Enables multiple data model deployment with schema flexibility
Open source project to develop operational SQL-on-Hadoop database engine
+ Transactional SQLApache HBaseTM
The information contained herein is subject to change without notice.4
Hadoop Cluster
Switch Switch
Operational Workloads come to Apache HadoopTM
Shared Disk SAN
Shared Cache
Operational Business Intelligence Analytics
Data movement
& duplication
Data movement
& duplication Column store for fast analytics
Operational Business Intelligence Analytics
Enterprise
Resource
Planning
Customer
Relationship
Management
Supply Chain
Management
Financial
Resource
Management
Manufacturing
Resource
Planning
Human
Resource
Management
ORC Files
• Complement
• Offload
• Transform
• Modernize
• Offload
The information contained herein is subject to change without notice.5
Banking
NonStop Mission Critical OLTP system
Hadoop Cluster
Switch Switch
Commercial
& Consumer
Banking
Transactions
Change Data
Capture Streaming
real-time updates
Operational Data Store
Daily
transactional
Data
Multiple years of
transactions & statements
• Transform
• Modernize
• Offload
Change Data
Capture
IBM Mainframe
Monthly
transactional
Data
The information contained herein is subject to change without notice.6
Telco
Billing &
Revenue Mgt
Mediation
Fulfillment
Intelligent Network (IN), Home Location Register (HLR), Mobile Switching Center (MSC), SMS Center (SMSC),
and network elements for other value added services like Push-to-talk (PTT), Ring Back Tone (RBT)
SMSCIN HLR HRBT ICS PTT MDSP MMSC
For closed
loop analytics
Trafodion for transactions to operational reporting
AudioSocial Media ImagesEmailVideo DocumentsTexts
Unstructured dataSemi-structured data
• Transform
• Modernize
• Offload
7
Online Retail …
• Integration of structured, semi-
structured, and unstructured support
• Integration of operational, historical, &
external (Big) data along common
master data for better insights
Item id Description Cost Price …Structured
Type Display Size Resolution Brand Model 3D …
…ISBN Author Publish Date Format Dept
TV
Book
…
Semi- structured
SELECT all TVs WHERE Price > 2000 and
Type = ‘Plasma’ and Display Size > ‘50’
and customer sentiment is very positive
Unstructured Image …
Review …
Open distributed
HDFS structures
HBase & HiveFree at
last!
Capture data directly into
open file structures
Accessible for reporting &
analytics with no latency
8
Online Retail …
• Create album
• Upload / Import pictures into album
• Create a project / photo book
• Share album / project with family / friends
Asset Management
• Print Calendars, Cards, …
• Order prints, mugs, linen,
jewelry, cases, covers,
cards, teddy bears, …
Shopping
Trafodion
OLTP on
Hadoop
Versus RDBMS & NoSQL
• High concurrency low
latency workloads
• Limitless elastic scale
• Very low TCO
9
Online Retail …
Trafodion
Create album
INSERT into Trafodion table ALBUM
(cust_id, album_id, album_name, …)
Upload pictures
Pictures loaded into HDFS by app
BEGIN WORK
INSERT list of pictures uploaded into
Trafodion table PIC
(cust_id, album_id, pic_id, pic_date, …)
INSERT picture attributes from camera into
HBase table PIC_ATTR as col-value pairs
for each of the pictures using pic_id
END WORK
Transaction
Tag pictures
BEGIN WORK
INSERT custom tags for each tagged picture into
HBase table PIC_ATTR as col-value pairs
END WORK
Share pictures
INSERT into Trafodion table REL
(cust_id, rel_with_cust_id, rel-type, …)
BEGIN WORK
INSERT list of pictures shared into
Trafodion table SHARED_PIC
(pic_id, rel_with_cust_id)
END WORK
Order photo mug & jewelry
BEGIN WORK
INSERT into ORDER
(cust_id, order_no, order_date, order_total, …)
INSERT into ORDER_DETAIL
all items that are part of the order
(cust_id, order_no, item_id, pic_id, qty, amt, …)
END WORK
Search for pictures
SELECT pictures taken with my “Sony DSC-
RX100M2” camera in the last 6 months from my
“Travel” album with a tag “Emma” on it.
Backend operational workloads
Order tracking, supply chain, inventory control,
…
Versus RDBMS & NoSQL
• Rich ANSI SQL RDBMS features
• Full ACID transactional support
• Integration of structured, semi-
structured, & unstructured data
IDOL could be used to analyze
the pictures in HDFS to
automatically create tags to be
stored in HBase PIC_ATTR
OLTP
OLTP
OLTP
ODS
ODS
10
Online Retail
Trafodion
Reporting &
Analytics via Spark
Analytics in Spark to generate
recommendation model
Web
app
Using model &
customer score /
attributes, and
recent purchase
history make
recommendationsRohit, consider a
blanket for your
granddaughter at
50% off with her
image imprinted on it
50%
BI reporting
• Sales growth by
product, region, demo
• Growth in customers,
pictures, storage, …
• Growth in sharing
• …
Analytics
• Items bought together –
market basket analysis
• Promotion success
customer classification
• …
Versus RDBMS & NoSQL
• Data captured in an open file system with open APIs
• Is available with no latency for reporting & analysis
• Via a huge open source & proprietary Hadoop eco-system
Spark
OLTP
BI
Analytics
The information contained herein is subject to change without notice.11
Why Apache TrafodionTM?
1. Time, Money, and Talent
• 20+ years of investment
• $300+ million invested
• Database developers grew up on
– Shared nothing Massively Parallel Architecture
– With a single system image across clusters
The architecture of today decades ago
Ingredients for a world class RDBMS
The information contained herein is subject to change without notice.12
Why Apache TrafodionTM?
2. World Class Optimizer
• Rule-driven and cost-based optimizer
• Based on Cascades & Large Scope Rules
• Parallel and non-parallel plans
• Equal-height histogram statistics
• Pushes down
– Filters e.g. row selection (start-stop key)
– Coprocessors e.g. aggregates
– Multi-Dimensional Access (MDAM)
• Multiple join and aggregation strategies
• Optimized inner, left, right, outer joins
• Subquery un-nesting
Ingredients for a world class RDBMS
Rules
Costs
Cascades II +
Large Scope
Rules
Optimizer
Source Code
C++
Compiler
Trafodion
Optimizer
The information contained herein is subject to change without notice.13
Node 1 Node 2 Node n
Client Application
HDFS
HBase HBase HBaseFilters
HDFS HDFS HDFS HDFS
Ethernet
Coprocessors
Why Apache TrafodionTM?
3. World Class Parallel Data Flow
Execution Engine
• Data Flow pipeline parallel architecture
• Support for co-located joins
• Repartitioning when necessary
• Inner child and outer child broadcasts
• Adaptive Segmentation
• Skew buster
Ingredients for a world class RDBMS
Master
ESP ESP ESP ESP ESP
ESP ESP ESP ESP ESP
Master
Multi-
fragment
Supports salting of data across region servers
The information contained herein is subject to change without notice.14
Why Apache TrafodionTM?
4. World Class Distributed
Transaction Management system
Ingredients for a world class RDBMS
The information contained herein is subject to change without notice.15
Apache TrafodionTM innovation built upon
Apache HadoopTM ecosystem
Leverages Hadoop for core modules
• Hadoop distribution neutral
• Inherited scalability and availability
Differentiation
• Comprehensive ANSI SQL language support
• Relational schema abstraction
• Mature SQL technology with compile and run time
workload optimizations
• Automatic query parallelism
• Distributed transaction protection
• Robust data integrity and security enforcement
• Seamless access and integration of Trafodion,
native-HBase, and Hive tables
The information contained herein is subject to change without notice.16
YCSB operation speeds that approach Apache HBaseTM
Performance
With max
variance at
10.8%
0 128 256 384 512 640 768 896 1,024
Throughput(OPS)
Concurrency (Streams)
YCSB Singleton5050 (Workload A)
Traf 1.1 HBase
The information contained herein is subject to change without notice.17
YCSB and Order Entry scale linearly!
Performance
Transactional
Order Entry
Throughput
YCSB
Selects Updates
50/50
Throughput
Throughput
Throughput
The information contained herein is subject to change without notice.18
Minimum distributed transaction management overhead
Performance
Order Entry: multi-statement transactional workload
• 5 transaction types (New Orders, Payments, Order
Status, Deliver, and Stock Level checks
• On average has about 20 statements per transaction
0 128 256 384 512 640 768 896 1,024
Throughput(TPM)
Concurrency (Streams)
OrderEntry
Traf 1.1 Autcommit
With max
variance at 11.3%
The information contained herein is subject to change without notice.19
Evolution of Trafodion
Incubated as open source project by HP Labs and HP IT
Released as open source under the Apache License, Version 2 in June 2014
First “production ready” 1.0 release in January 2015
Follow-on 1.1 release in April 2015
• Includes significant enhancements in performance, manageability, security, high
availability, and usability
• Throughput at scale reaches Apache HBaseTM and DTM overhead goals
• More than 2x OLTP improvement with proven linear scalability
Project Trafodion entered Apache Incubator in May 2015
Build an open source community around Apache TrafodionTM
The information contained herein is subject to change without notice.20
Community-led software development
Contribute to Apache TrafodionTM
Become a contributor – add a new feature, fix a bug, translate
documentation, more
Discuss your changes on the dev mailing list
Create a JIRA issue unless there already is one
Setup your development environment
Prepare a patch containing your changes
Submit the patch
See trafodion.incubator.apache.org for more information
The information contained herein is subject to change without notice.21
Esgyn Corporation
• New independent company spun out from HP to
build a business on supporting products that include
Apache TrafodionTM
• Global company with offices in Milpitas, USA
(Silicon Valley) and Shanghai, China
• Significant proof of concept (PoC) activity and
successes
• Will be offering an Esgyn Enterprise version which
includes Apache TrafodionTM
• 24x7 enterprise support subscription
• Consulting and implementation services
Q & A
Thank you
Trafodion.incubator.apache.org

Weitere ähnliche Inhalte

Was ist angesagt?

BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...
BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...
BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...Big Data Montreal
 
The DAP - Where YARN, HBase, Kafka and Spark go to Production
The DAP - Where YARN, HBase, Kafka and Spark go to ProductionThe DAP - Where YARN, HBase, Kafka and Spark go to Production
The DAP - Where YARN, HBase, Kafka and Spark go to ProductionDataWorks Summit/Hadoop Summit
 
How can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedHow can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedDouglas Bernardini
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewDataWorks Summit/Hadoop Summit
 
IBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love It
IBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love ItIBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love It
IBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love ItIBM Analytics
 
Integration of SAP HANA with Hadoop
Integration of SAP HANA with HadoopIntegration of SAP HANA with Hadoop
Integration of SAP HANA with HadoopRamkumar Rajendran
 
Harnessing Big Data in Real-Time
Harnessing Big Data in Real-TimeHarnessing Big Data in Real-Time
Harnessing Big Data in Real-TimeDataWorks Summit
 
Hadoop crash course workshop at Hadoop Summit
Hadoop crash course workshop at Hadoop SummitHadoop crash course workshop at Hadoop Summit
Hadoop crash course workshop at Hadoop SummitDataWorks Summit
 
Big Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeNBig Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeNDataWorks Summit
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course WorkshopDataWorks Summit
 
Luo june27 1150am_room230_a_v2
Luo june27 1150am_room230_a_v2Luo june27 1150am_room230_a_v2
Luo june27 1150am_room230_a_v2DataWorks Summit
 
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicBig Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicDataWorks Summit
 
Ingesting Data at Blazing Speed Using Apache Orc
Ingesting Data at Blazing Speed Using Apache OrcIngesting Data at Blazing Speed Using Apache Orc
Ingesting Data at Blazing Speed Using Apache OrcDataWorks Summit
 
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...Hortonworks
 
Implementing a Data Lake with Enterprise Grade Data Governance
Implementing a Data Lake with Enterprise Grade Data GovernanceImplementing a Data Lake with Enterprise Grade Data Governance
Implementing a Data Lake with Enterprise Grade Data GovernanceHortonworks
 
Leveraging SAP HANA with Apache Hadoop and SAP Analytics
Leveraging SAP HANA with Apache Hadoop and SAP AnalyticsLeveraging SAP HANA with Apache Hadoop and SAP Analytics
Leveraging SAP HANA with Apache Hadoop and SAP AnalyticsMethod360
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudDataWorks Summit/Hadoop Summit
 
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaExtend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaDataWorks Summit/Hadoop Summit
 
Pivotal HAWQ 소개
Pivotal HAWQ 소개Pivotal HAWQ 소개
Pivotal HAWQ 소개Seungdon Choi
 

Was ist angesagt? (20)

BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...
BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...
BDM39: HP Vertica BI: Sub-second big data analytics your users and developers...
 
The DAP - Where YARN, HBase, Kafka and Spark go to Production
The DAP - Where YARN, HBase, Kafka and Spark go to ProductionThe DAP - Where YARN, HBase, Kafka and Spark go to Production
The DAP - Where YARN, HBase, Kafka and Spark go to Production
 
How can Hadoop & SAP be integrated
How can Hadoop & SAP be integratedHow can Hadoop & SAP be integrated
How can Hadoop & SAP be integrated
 
SAP HORTONWORKS
SAP HORTONWORKSSAP HORTONWORKS
SAP HORTONWORKS
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
IBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love It
IBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love ItIBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love It
IBM InfoSphere BigInsights for Hadoop: 10 Reasons to Love It
 
Integration of SAP HANA with Hadoop
Integration of SAP HANA with HadoopIntegration of SAP HANA with Hadoop
Integration of SAP HANA with Hadoop
 
Harnessing Big Data in Real-Time
Harnessing Big Data in Real-TimeHarnessing Big Data in Real-Time
Harnessing Big Data in Real-Time
 
Hadoop crash course workshop at Hadoop Summit
Hadoop crash course workshop at Hadoop SummitHadoop crash course workshop at Hadoop Summit
Hadoop crash course workshop at Hadoop Summit
 
Big Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeNBig Data Simplified - Is all about Ab'strakSHeN
Big Data Simplified - Is all about Ab'strakSHeN
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course Workshop
 
Luo june27 1150am_room230_a_v2
Luo june27 1150am_room230_a_v2Luo june27 1150am_room230_a_v2
Luo june27 1150am_room230_a_v2
 
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo ClinicBig Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
 
Ingesting Data at Blazing Speed Using Apache Orc
Ingesting Data at Blazing Speed Using Apache OrcIngesting Data at Blazing Speed Using Apache Orc
Ingesting Data at Blazing Speed Using Apache Orc
 
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
 
Implementing a Data Lake with Enterprise Grade Data Governance
Implementing a Data Lake with Enterprise Grade Data GovernanceImplementing a Data Lake with Enterprise Grade Data Governance
Implementing a Data Lake with Enterprise Grade Data Governance
 
Leveraging SAP HANA with Apache Hadoop and SAP Analytics
Leveraging SAP HANA with Apache Hadoop and SAP AnalyticsLeveraging SAP HANA with Apache Hadoop and SAP Analytics
Leveraging SAP HANA with Apache Hadoop and SAP Analytics
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
 
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaExtend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
 
Pivotal HAWQ 소개
Pivotal HAWQ 소개Pivotal HAWQ 소개
Pivotal HAWQ 소개
 

Ähnlich wie Trafodion overview

Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & HadoopBlackvard
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Vantara
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data AnalyticsAttunity
 
Summer Shorts: Big Data Integration
Summer Shorts: Big Data IntegrationSummer Shorts: Big Data Integration
Summer Shorts: Big Data Integrationibi
 
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid WarehouseUsing the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid WarehouseRizaldy Ignacio
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsDataWorks Summit
 
Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016Joan Novino
 
Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Hortonworks
 
Hadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter PointHadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter PointInside Analysis
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to HadoopPOSSCON
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016StampedeCon
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Innovative Management Services
 
Building a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise HadoopBuilding a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise HadoopSlim Baltagi
 
Eric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers ConferenceEric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers ConferenceHortonworks
 
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio..."Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...Dataconomy Media
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Precisely
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionJames Serra
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Hortonworks
 

Ähnlich wie Trafodion overview (20)

Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & Hadoop
 
OOP 2014
OOP 2014OOP 2014
OOP 2014
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
 
Summer Shorts: Big Data Integration
Summer Shorts: Big Data IntegrationSummer Shorts: Big Data Integration
Summer Shorts: Big Data Integration
 
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid WarehouseUsing the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
 
Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016Azure Cafe Marketplace with Hortonworks March 31 2016
Azure Cafe Marketplace with Hortonworks March 31 2016
 
Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014
 
Hadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter PointHadoop and the Data Warehouse: Point/Counter Point
Hadoop and the Data Warehouse: Point/Counter Point
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
Building a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise HadoopBuilding a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise Hadoop
 
Eric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers ConferenceEric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers Conference
 
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio..."Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solutionDifferentiate Big Data vs Data Warehouse use cases for a cloud solution
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
 

Kürzlich hochgeladen

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 

Kürzlich hochgeladen (20)

BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 

Trafodion overview

  • 1. Apache TrafodionTM (incubating) Enterprise-Class Transactional SQL-on-Hadoop DBMS trafodion.incubator.apache.org 1
  • 2. The information contained herein is subject to change without notice.2 Brief Bio 2 Rohit Jain CTO, Esgyn rohit.jain@esgyn.com Rohit Jain is the CTO at Esgyn working on Apache TrafodionTM, currently in incubation. Trafodion is a transactional SQL-on-HBase RDBMS. Rohit worked for Tandem, Compaq, and Hewlett-Packard for the last 28 of his 39 years in application and database development. He has worked as an application developer, solutions architect, consultant, software engineer, database architect, development and QA manager, Product Manager, and Chief Technologist. His experience spans Online Transaction Processing, Operational Data Stores, Data Marts, Enterprise Data Warehouses, Business Intelligence, and Advanced Analytics, on distributed massively parallel systems.
  • 3. The information contained herein is subject to change without notice.3 Apache TrafodionTM Rides the unstoppable Apache HadoopTM wave! Transforms how companies store, process, and share big data Affordable performance, elastic scalability, availability Open source project - downloadable for free Apache TrafodionTM is currently undergoing Incubation at the Apache Software Foundation Eliminates vendor lock-in and licensing fees Leverages community development resources and speed Schema flexibility and multi-structured data Capturing and storing all data for all business functions Full-function ANSI SQL with JDBC/ODBC access Leverages existing SQL skills, tools, & apps for productivity Distributed ACID transaction protection Data consistency across multiple rows, tables, SQL statements Targeted for operational workloads! Optimized for real-time transaction processing applications, operational reporting, and Operational Data Stores (ODS), needing sub-second response times at high levels of concurrency Data federation: Trafodion/HBase/Hive tables Enables multiple data model deployment with schema flexibility Open source project to develop operational SQL-on-Hadoop database engine + Transactional SQLApache HBaseTM
  • 4. The information contained herein is subject to change without notice.4 Hadoop Cluster Switch Switch Operational Workloads come to Apache HadoopTM Shared Disk SAN Shared Cache Operational Business Intelligence Analytics Data movement & duplication Data movement & duplication Column store for fast analytics Operational Business Intelligence Analytics Enterprise Resource Planning Customer Relationship Management Supply Chain Management Financial Resource Management Manufacturing Resource Planning Human Resource Management ORC Files • Complement • Offload • Transform • Modernize • Offload
  • 5. The information contained herein is subject to change without notice.5 Banking NonStop Mission Critical OLTP system Hadoop Cluster Switch Switch Commercial & Consumer Banking Transactions Change Data Capture Streaming real-time updates Operational Data Store Daily transactional Data Multiple years of transactions & statements • Transform • Modernize • Offload Change Data Capture IBM Mainframe Monthly transactional Data
  • 6. The information contained herein is subject to change without notice.6 Telco Billing & Revenue Mgt Mediation Fulfillment Intelligent Network (IN), Home Location Register (HLR), Mobile Switching Center (MSC), SMS Center (SMSC), and network elements for other value added services like Push-to-talk (PTT), Ring Back Tone (RBT) SMSCIN HLR HRBT ICS PTT MDSP MMSC For closed loop analytics Trafodion for transactions to operational reporting AudioSocial Media ImagesEmailVideo DocumentsTexts Unstructured dataSemi-structured data • Transform • Modernize • Offload
  • 7. 7 Online Retail … • Integration of structured, semi- structured, and unstructured support • Integration of operational, historical, & external (Big) data along common master data for better insights Item id Description Cost Price …Structured Type Display Size Resolution Brand Model 3D … …ISBN Author Publish Date Format Dept TV Book … Semi- structured SELECT all TVs WHERE Price > 2000 and Type = ‘Plasma’ and Display Size > ‘50’ and customer sentiment is very positive Unstructured Image … Review … Open distributed HDFS structures HBase & HiveFree at last! Capture data directly into open file structures Accessible for reporting & analytics with no latency
  • 8. 8 Online Retail … • Create album • Upload / Import pictures into album • Create a project / photo book • Share album / project with family / friends Asset Management • Print Calendars, Cards, … • Order prints, mugs, linen, jewelry, cases, covers, cards, teddy bears, … Shopping Trafodion OLTP on Hadoop Versus RDBMS & NoSQL • High concurrency low latency workloads • Limitless elastic scale • Very low TCO
  • 9. 9 Online Retail … Trafodion Create album INSERT into Trafodion table ALBUM (cust_id, album_id, album_name, …) Upload pictures Pictures loaded into HDFS by app BEGIN WORK INSERT list of pictures uploaded into Trafodion table PIC (cust_id, album_id, pic_id, pic_date, …) INSERT picture attributes from camera into HBase table PIC_ATTR as col-value pairs for each of the pictures using pic_id END WORK Transaction Tag pictures BEGIN WORK INSERT custom tags for each tagged picture into HBase table PIC_ATTR as col-value pairs END WORK Share pictures INSERT into Trafodion table REL (cust_id, rel_with_cust_id, rel-type, …) BEGIN WORK INSERT list of pictures shared into Trafodion table SHARED_PIC (pic_id, rel_with_cust_id) END WORK Order photo mug & jewelry BEGIN WORK INSERT into ORDER (cust_id, order_no, order_date, order_total, …) INSERT into ORDER_DETAIL all items that are part of the order (cust_id, order_no, item_id, pic_id, qty, amt, …) END WORK Search for pictures SELECT pictures taken with my “Sony DSC- RX100M2” camera in the last 6 months from my “Travel” album with a tag “Emma” on it. Backend operational workloads Order tracking, supply chain, inventory control, … Versus RDBMS & NoSQL • Rich ANSI SQL RDBMS features • Full ACID transactional support • Integration of structured, semi- structured, & unstructured data IDOL could be used to analyze the pictures in HDFS to automatically create tags to be stored in HBase PIC_ATTR OLTP OLTP OLTP ODS ODS
  • 10. 10 Online Retail Trafodion Reporting & Analytics via Spark Analytics in Spark to generate recommendation model Web app Using model & customer score / attributes, and recent purchase history make recommendationsRohit, consider a blanket for your granddaughter at 50% off with her image imprinted on it 50% BI reporting • Sales growth by product, region, demo • Growth in customers, pictures, storage, … • Growth in sharing • … Analytics • Items bought together – market basket analysis • Promotion success customer classification • … Versus RDBMS & NoSQL • Data captured in an open file system with open APIs • Is available with no latency for reporting & analysis • Via a huge open source & proprietary Hadoop eco-system Spark OLTP BI Analytics
  • 11. The information contained herein is subject to change without notice.11 Why Apache TrafodionTM? 1. Time, Money, and Talent • 20+ years of investment • $300+ million invested • Database developers grew up on – Shared nothing Massively Parallel Architecture – With a single system image across clusters The architecture of today decades ago Ingredients for a world class RDBMS
  • 12. The information contained herein is subject to change without notice.12 Why Apache TrafodionTM? 2. World Class Optimizer • Rule-driven and cost-based optimizer • Based on Cascades & Large Scope Rules • Parallel and non-parallel plans • Equal-height histogram statistics • Pushes down – Filters e.g. row selection (start-stop key) – Coprocessors e.g. aggregates – Multi-Dimensional Access (MDAM) • Multiple join and aggregation strategies • Optimized inner, left, right, outer joins • Subquery un-nesting Ingredients for a world class RDBMS Rules Costs Cascades II + Large Scope Rules Optimizer Source Code C++ Compiler Trafodion Optimizer
  • 13. The information contained herein is subject to change without notice.13 Node 1 Node 2 Node n Client Application HDFS HBase HBase HBaseFilters HDFS HDFS HDFS HDFS Ethernet Coprocessors Why Apache TrafodionTM? 3. World Class Parallel Data Flow Execution Engine • Data Flow pipeline parallel architecture • Support for co-located joins • Repartitioning when necessary • Inner child and outer child broadcasts • Adaptive Segmentation • Skew buster Ingredients for a world class RDBMS Master ESP ESP ESP ESP ESP ESP ESP ESP ESP ESP Master Multi- fragment Supports salting of data across region servers
  • 14. The information contained herein is subject to change without notice.14 Why Apache TrafodionTM? 4. World Class Distributed Transaction Management system Ingredients for a world class RDBMS
  • 15. The information contained herein is subject to change without notice.15 Apache TrafodionTM innovation built upon Apache HadoopTM ecosystem Leverages Hadoop for core modules • Hadoop distribution neutral • Inherited scalability and availability Differentiation • Comprehensive ANSI SQL language support • Relational schema abstraction • Mature SQL technology with compile and run time workload optimizations • Automatic query parallelism • Distributed transaction protection • Robust data integrity and security enforcement • Seamless access and integration of Trafodion, native-HBase, and Hive tables
  • 16. The information contained herein is subject to change without notice.16 YCSB operation speeds that approach Apache HBaseTM Performance With max variance at 10.8% 0 128 256 384 512 640 768 896 1,024 Throughput(OPS) Concurrency (Streams) YCSB Singleton5050 (Workload A) Traf 1.1 HBase
  • 17. The information contained herein is subject to change without notice.17 YCSB and Order Entry scale linearly! Performance Transactional Order Entry Throughput YCSB Selects Updates 50/50 Throughput Throughput Throughput
  • 18. The information contained herein is subject to change without notice.18 Minimum distributed transaction management overhead Performance Order Entry: multi-statement transactional workload • 5 transaction types (New Orders, Payments, Order Status, Deliver, and Stock Level checks • On average has about 20 statements per transaction 0 128 256 384 512 640 768 896 1,024 Throughput(TPM) Concurrency (Streams) OrderEntry Traf 1.1 Autcommit With max variance at 11.3%
  • 19. The information contained herein is subject to change without notice.19 Evolution of Trafodion Incubated as open source project by HP Labs and HP IT Released as open source under the Apache License, Version 2 in June 2014 First “production ready” 1.0 release in January 2015 Follow-on 1.1 release in April 2015 • Includes significant enhancements in performance, manageability, security, high availability, and usability • Throughput at scale reaches Apache HBaseTM and DTM overhead goals • More than 2x OLTP improvement with proven linear scalability Project Trafodion entered Apache Incubator in May 2015 Build an open source community around Apache TrafodionTM
  • 20. The information contained herein is subject to change without notice.20 Community-led software development Contribute to Apache TrafodionTM Become a contributor – add a new feature, fix a bug, translate documentation, more Discuss your changes on the dev mailing list Create a JIRA issue unless there already is one Setup your development environment Prepare a patch containing your changes Submit the patch See trafodion.incubator.apache.org for more information
  • 21. The information contained herein is subject to change without notice.21 Esgyn Corporation • New independent company spun out from HP to build a business on supporting products that include Apache TrafodionTM • Global company with offices in Milpitas, USA (Silicon Valley) and Shanghai, China • Significant proof of concept (PoC) activity and successes • Will be offering an Esgyn Enterprise version which includes Apache TrafodionTM • 24x7 enterprise support subscription • Consulting and implementation services
  • 22. Q & A