SlideShare ist ein Scribd-Unternehmen logo
1 von 23
Mainframe + Hadoop
Bridging the Gap Between Big Iron and Big Data
Matt Brandwein, Director, Product Marketing, Cloudera
Jorge A. Lopez, Director, Product Marketing, Syncsort

(

+

)
Mainframes | A Critical Source of Big Data

71%

30 Billion

Fortune 500
Top 25
World Banks

Bus. Transactions / day

9

of World’s
Top Insurers

23 of Top 25 US
Retailers

2
How Mainframes Work
Copy Application Analysis - TCB Time & Count
70,000

60.0

60,000

50.0

50,000
40.0
40,000
30.0
30,000
20.0
20,000
10.0

10,000
0

0.0

Time of Day

$
$ $
$ $ $
$ $ $ $ $

$
$
$
$
$
$
$

• Every month you pay based on CPU
Expensive
utilization, commonly measured in MIPS
Processing • Any MIPS reduction = Instant OpEx
savings

Number of
COPY
Applications per
Interval

TCB Time (mins)

• Storage can reach up to $100K/TB
Expensive • Tape technology is often present
Storage • Uses compressed data types (EBCDIC,
packed decimal)

$15.7 Million
Typical MIPS costs for the “average” $10B organization
3
The Hadoop Opportunity

Produce POWERFUL INSIGHTS by
combining behavioral data with granular
mainframe data
Increase BUSINESS AGILITY & REDUCE
COSTS by offloading mainframe data &
batch processing to HADOOP
Image: quirkyjoe.com

4
Perception: Just Call the Mainframe Guy…

Images: http://monkeestv.tripod.com/BatMonkee/

5
Reality: Far Away… So Close!

Reality
SMS
Compression

SMS
Compression

SMS
Compression

DB Tables,
Flat Files

DB Tables,
Flat Files

DB
Tables, Flat
Files

Filtering ,
Reformatting

Copy, Sort,
Join,
Aggregation

EBCDIC to
ASCII

Call MF Guy

Filtering ,
Reformatting

Copy, Sort,
Join,
Aggregation

EBCDIC to
ASCII

Call MF Guy

Filtering ,
Reformatting

Copy, Sort,
Join,
Aggregation

EBCDIC to
ASCII

Cobol
copybooks

Cobol
copybooks

Cobol
copybooks

Every Change = Time, Cost
Image: bottletales.com

6
Suits & Hoodies – The Most Unlikely Duo
Integration • Connectivity
• Data conversion (EBCDIC vs ASCII)
Gaps
Expertise
Gaps

• COBOL appeared in 1959, Hadoop in
2005
• Mainframe & Hadoop skills shortage

Security
Gaps

• Hosts mission critical sensitive data
• Very difficult to install new software
on the MF

Costs
Gaps

Suits & Hoodies idea: Merv Adrian, Gartner Research.

• Mainframe data is (expensive) Big
Data
• Even FTP costs CPU cycles (MIPS)

7
Bridging the Gap Between Big Iron and Big Data

+
A Smarter Approach to BIG
Mainframe Data!

Syncsort DMX-h ETL Edition

Connect

Translate

Process

CLOUDERA

THE PLATFORM FOR BIG DATA
CDH

Cloudera
Manager

Brings batch & realtime compute to
storage

-

Cloudera
Navigator

Works with
all types
of data

-

Cloudera
Support
Changes the
economics of data
management

 Zero-MIPS Connectivity
 Painless Integration & Translation
 Mainframe-like Performance &
Reliability
 Massively Affordable Scalability
 Support for Offloading Batch Cobol &
JCL Processing to Hadoop
 Iron-clad Security
 Decades of Proven Mainframe Expertise
 Easy Deployment, Monitoring & Admin
8
Cloudera
The Standard for Apache Hadoop™ in the Enterprise
CLOUDERA UNIVERSITY
ADMINISTRATOR TRAINING
DEVELOPER TRAINING

Processing

Interactive

Interactive

SQL

Search

Resource Management

Storage

ANALYST TRAINING

Analytics

DATA SCIENCE TRAINING

Metadata and Security

Batch

CLOUDERA
SUPPORT

CERTIFICATION PROGRAMS

PROFESSIONAL SERVICES
CLOUDERA
MANAGER

USE CASE DISCOVERY
NEW HADOOP DEPLOYMENT
PROOF-OF-CONCEPT
PRODUCTION PILOTS

Integration

9

CLOUDERA
NAVIGATOR

PROCESS & TEAM DEVELOPMENT
DEPLOYMENT CERTIFICATION
Why Cloudera?

 Assurance – Remove Risk
 Expertise – Maximize Value

Meets Enterprise Requirements
Interactive SQL

Data Access

We Deliver Customer Success
with Hadoop in the Enterprise

 Customers: Over 50% of the Fortune 50 and 65% of the Fortune
500 plus top US intelligence and defense agencies
 Partners: 700+ in hardware, software, and services

 Education: 15,000+ trained annually; developers, admins,
analysts, data scientists
 Community: Founders and top supporters of the Hadoop open
source ecosystem working for you

10

Enterprise Capabilities

 Influence – Build the Future

✔

Interactive Search

✔

SAS, R Integration

✔

Resource Management

✔

Security

✔

Highly Availability

✔

Disaster Recovery

✔

Audit and Lineage

✔

Online Upgrades

✔

Change Mgmt & Rollback

✔
Why Syncsort?
For 40 years we have been helping companies solve their big data
issues…even before they knew the name Big Data!
Integrating Big Data…
Smarter!

Our customers are achieving the
impossible, every day!

• 50% of all mainframes run Syncsort
• 1,500 Mainframe Customers: Most
used & trusted 3rd party mainframe
software
• Speed leader for ETL & Sort
• A history of innovation
• 25+ Issued & Pending Patents

• Large global customer base
• 15,000+ deployments in 68 countries

• First-to-market, fully integrated
approach to Hadoop ETL

Key Partners

11
Smart Contributions to Improve Hadoop
Augmenting Critical Batch
Processing Capabilities
JIRA

Description

2452

Allow External Sorter Plugin for MR

4808

Allow Reduce-side merge to be pluggable

4809

Make classes required for 2454 public

4812

Create reduce input merger plug-in

4842

Shuffle race can hang reducer

…and more!!
Plugin Shipping on CDH 4.2 and later

12
The Smarter Approach to Hadoop ETL… and Mainframe
 Connect – One tool to connect all your data
 Translate - Best in class mainframe data
access with seamless data translation &
COBOL Copybooks support
 Process – Hadoop ETL without coding.
Develop, test & debug locally in Windows;
deploy on Hadoop

PLUS…

Connect

Translate

Process

 Enterprise-grade security
 Smarter deployment, monitoring &
administration
 Disruptive cost-structure
 Decades of Mainframe expertise

13
Integration - Why is Working with Mainframe Data So Hard?

File Definitions
(Metadata) in
Cobol Copybooks

EBCDIC
A B Y Z

A

6 bytes
Non-Viewable, Saves space

B Y

Z

x’41’ x’42’ x’59’ x’5A’

x’C1’ x’C2’ x’E8’ x’E9’

Packed Decimal
x’12843154976C’

ASCII

Conversion

Conversion

Numeric
12,843,154,976
14 bytes
Viewable, WYSIWYG
14
Cloudera + Syncsort: Smarter Connectivity… Also for Mainframe
Because Mainframe Is Big Data Too!

Connect

• Read files directly from mainframe
• No software required on mainframe
• Already installed on 50% of mainframes

Translate

• Parse & transform: packed decimal,
EBCDIC/ASCII, multi-format
• No coding required

Load & • Load directly to HDFS
• Offload batch data processing
Process • Find more insights
15
Iron-clad Security. Zero Pain. Zero Mainframe Footprint

Mainframe Provides Iron-clad Security…
So, Why compromise?
Other solutions

Cloudera + Syncsort

Requires you to install unproven, untested software
on Your Mainframe

Adopt their own security
model and constantly sync
with your own

LDAP

• Nothing to install on
mainframe
• Painless support for
Kerberos & LDAP
• User-level security using
authentication protocol

• Secure data loads & extracts
• Secure job execution
16
The Economics of Data
Cost of managing 1TB of data
$20,000 – $100,000

$15,000 – $80,000

$250 – $2,000

Mainframe

EDW

Hadoop

But there’s more…
Scalability
Performance
Reliability
Agility
Skills Supply

17
Smarter Deployment, Monitoring & Administration…
…through Cloudera Manager

1
Monitor
2
Diagnose
3
Integrate
4
Manage

Easily deploy, configure & optimize clusters

Maintain a central view of all activity

Easily identify and resolve issues

Cloudera Cluster

Unleash Hadoop’s Potential

Use Cloudera Manager with existing tools

+

18
Get a 360° View of Your Cluster, Including DMX-h Logs
View service health
& performance
Get host-level
snapshots
Monitor & diagnose
workloads
Gather, view & search
Hadoop & DMX-h logs

…And more!!

+

19
Understanding Mainframe Data at Major US Bank
Before: Manual Effort

Weeks

After: DMX-h + CDH

?

86-page copybook

Customer hit a wall after months of manual
effort migrating Mainframe data
• Difficult to find data errors. No Mainframe
application logic that matches Copybook
• Large and complex Copybooks
• Depends on Mainframe team to provide data
• Very manual-intensive ; inadequate
documentation
• Not scalable. Only a few Java + Mainframe
experts could do the work

4 hrs

86-page copybook

(

+

)

• Easy to validate Copybooks and find data errors
• Ability to pull data directly from Mainframe
without relying on Mainframe team
• No coding. No scripting. Easier to document,
maintain & reuse
• Enables developers with a broader set of skills
to build complex migration jobs.
20
Three Quick Takeaways
1. Bring Suits & Hoodies together early
in the process
Build cross-organizational teams
Understand mutual concerns
Identify critical data & applications

2. Clearly define business & IT objectives
Reduce costs
Uncover new insights

3. Create a roadmap that gradually
builds the skills of your organization
Copy  Migrate  Offload
Image Source: http://archrecord.construction.com/news/2012/03/A-Stitch-in-the-Urban-Fabric.asp

21
Bridging the Gap Between Big Iron and Big Data

+
A Smarter Approach to BIG
Mainframe Data!

Syncsort DMX-h ETL Edition

Connect

Translate

Process

CLOUDERA

THE PLATFORM FOR BIG DATA
CDH

Cloudera
Manager

Brings batch & realtime compute to
storage

-

Cloudera
Navigator

Works with
all types
of data

-

Cloudera
Support
Changes the
economics of data
management

 Zero-MIPS Connectivity
 Painless Integration & Translation
 Mainframe-like Performance &
Reliability
 Massively Affordable Scalability
 Support for Offloading Batch Cobol &
JCL Processing to Hadoop
 Iron-clad Security
 Decades of Proven Mainframe Expertise
 Easy Deployment, Monitoring & Admin
22
Test Drive DMX-h:
Bridge the Gap Between
Big Iron & Big Data!
• Self-contained image
• Use case accelerators for
• mainframe, Hadoop and more!

(

+
Try it FREE at: syncsort.com/try
Stop by booth #304
Get a demo | Get a shirt

)

Weitere ähnliche Inhalte

Was ist angesagt?

How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...
How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...
How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...Gustav Lundström
 
Mainframe Optimization with Modern Systems
Mainframe Optimization with Modern SystemsMainframe Optimization with Modern Systems
Mainframe Optimization with Modern SystemsModern Systems
 
IBM World of Watson 2016 - DB2 Analytics Accelerator on Cloud
IBM World of Watson 2016 - DB2 Analytics Accelerator on CloudIBM World of Watson 2016 - DB2 Analytics Accelerator on Cloud
IBM World of Watson 2016 - DB2 Analytics Accelerator on CloudDaniel Martin
 
Mainframe Fine Tuning - Fabio Massimo Ottaviani
Mainframe Fine Tuning - Fabio Massimo OttavianiMainframe Fine Tuning - Fabio Massimo Ottaviani
Mainframe Fine Tuning - Fabio Massimo OttavianiNRB
 
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflowsCloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflowsYong Feng
 
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools UpdateDB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools UpdateBaha Majid
 
2014 01-23-eranea-apalia-private-cloud
2014 01-23-eranea-apalia-private-cloud2014 01-23-eranea-apalia-private-cloud
2014 01-23-eranea-apalia-private-cloudDidier Durand
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Precisely
 
Commonwealth Bank of Australia's Private Cloud Implementation
Commonwealth Bank of Australia's Private Cloud ImplementationCommonwealth Bank of Australia's Private Cloud Implementation
Commonwealth Bank of Australia's Private Cloud ImplementationVishal Sharma
 
Co-Design Architecture for Exascale
Co-Design Architecture for ExascaleCo-Design Architecture for Exascale
Co-Design Architecture for Exascaleinside-BigData.com
 
OpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORALOpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORALinside-BigData.com
 
POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...
POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...
POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...Paula Koziol
 
Huawei Powers Efficient and Scalable HPC
Huawei Powers Efficient and Scalable HPCHuawei Powers Efficient and Scalable HPC
Huawei Powers Efficient and Scalable HPCinside-BigData.com
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Karl Roche
 
Initiative Based Technology Consulting Case Studies
Initiative Based Technology Consulting Case StudiesInitiative Based Technology Consulting Case Studies
Initiative Based Technology Consulting Case Studieschanderdw
 
A Passion for Manufacturing
A Passion for ManufacturingA Passion for Manufacturing
A Passion for ManufacturingWebseology
 
Ibm integrated analytics system
Ibm integrated analytics systemIbm integrated analytics system
Ibm integrated analytics systemModusOptimum
 
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics AcceleratorEDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics AcceleratorDaniel Martin
 
M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...
M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...
M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...MariaDB plc
 
How to Solve Real-Time Data Problems
How to Solve Real-Time Data ProblemsHow to Solve Real-Time Data Problems
How to Solve Real-Time Data ProblemsIBM Power Systems
 

Was ist angesagt? (20)

How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...
How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...
How to combine Db2 on Z, IBM Db2 Analytics Accelerator and IBM Machine Learni...
 
Mainframe Optimization with Modern Systems
Mainframe Optimization with Modern SystemsMainframe Optimization with Modern Systems
Mainframe Optimization with Modern Systems
 
IBM World of Watson 2016 - DB2 Analytics Accelerator on Cloud
IBM World of Watson 2016 - DB2 Analytics Accelerator on CloudIBM World of Watson 2016 - DB2 Analytics Accelerator on Cloud
IBM World of Watson 2016 - DB2 Analytics Accelerator on Cloud
 
Mainframe Fine Tuning - Fabio Massimo Ottaviani
Mainframe Fine Tuning - Fabio Massimo OttavianiMainframe Fine Tuning - Fabio Massimo Ottaviani
Mainframe Fine Tuning - Fabio Massimo Ottaviani
 
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflowsCloud nativecomputingtechnologysupportinghpc cognitiveworkflows
Cloud nativecomputingtechnologysupportinghpc cognitiveworkflows
 
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools UpdateDB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
DB2 Real-Time Analytics Meeting Wayne, PA 2015 - IDAA & DB2 Tools Update
 
2014 01-23-eranea-apalia-private-cloud
2014 01-23-eranea-apalia-private-cloud2014 01-23-eranea-apalia-private-cloud
2014 01-23-eranea-apalia-private-cloud
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
 
Commonwealth Bank of Australia's Private Cloud Implementation
Commonwealth Bank of Australia's Private Cloud ImplementationCommonwealth Bank of Australia's Private Cloud Implementation
Commonwealth Bank of Australia's Private Cloud Implementation
 
Co-Design Architecture for Exascale
Co-Design Architecture for ExascaleCo-Design Architecture for Exascale
Co-Design Architecture for Exascale
 
OpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORALOpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORAL
 
POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...
POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...
POWER8 the x86 Server Farm - IBM Business Partners use POWER8 to Lower Client...
 
Huawei Powers Efficient and Scalable HPC
Huawei Powers Efficient and Scalable HPCHuawei Powers Efficient and Scalable HPC
Huawei Powers Efficient and Scalable HPC
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
 
Initiative Based Technology Consulting Case Studies
Initiative Based Technology Consulting Case StudiesInitiative Based Technology Consulting Case Studies
Initiative Based Technology Consulting Case Studies
 
A Passion for Manufacturing
A Passion for ManufacturingA Passion for Manufacturing
A Passion for Manufacturing
 
Ibm integrated analytics system
Ibm integrated analytics systemIbm integrated analytics system
Ibm integrated analytics system
 
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics AcceleratorEDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator
 
M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...
M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...
M|18 Intel and MariaDB: Strategic Collaboration to Enhance MariaDB Functional...
 
How to Solve Real-Time Data Problems
How to Solve Real-Time Data ProblemsHow to Solve Real-Time Data Problems
How to Solve Real-Time Data Problems
 

Ähnlich wie How to Leverage Mainframe Data with Hadoop: Bridging the Gap Between Big Iron & Big Data

Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Steven Totman
 
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Precisely
 
Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...
Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...
Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...actualtechmedia
 
Streaming IBM i to Kafka for Next-Gen Use Cases
Streaming IBM i to Kafka for Next-Gen Use CasesStreaming IBM i to Kafka for Next-Gen Use Cases
Streaming IBM i to Kafka for Next-Gen Use CasesPrecisely
 
flexpod_hadoop_cloudera
flexpod_hadoop_clouderaflexpod_hadoop_cloudera
flexpod_hadoop_clouderaPrem Jain
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Precisely
 
Solving enterprise challenges through scale out storage & big compute final
Solving enterprise challenges through scale out storage & big compute finalSolving enterprise challenges through scale out storage & big compute final
Solving enterprise challenges through scale out storage & big compute finalAvere Systems
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...DataWorks Summit
 
The Last Frontier- Virtualization, Hybrid Management and the Cloud
The Last Frontier-  Virtualization, Hybrid Management and the CloudThe Last Frontier-  Virtualization, Hybrid Management and the Cloud
The Last Frontier- Virtualization, Hybrid Management and the CloudKellyn Pot'Vin-Gorman
 
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy:  A Simple, Scalable Solution for Getting Started with HadoopBig Data Made Easy:  A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with HadoopPrecisely
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationDatabricks
 
ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...
ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...
ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...epamspb
 
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantageFueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantagePrecisely
 
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise ConsciousnessData Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise ConsciousnessAnant Corporation
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB
 
Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...
Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...
Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...Precisely
 
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...Precisely
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data AnalyticsAttunity
 
Unblocking Innovation for Digital Transformation
Unblocking Innovation for Digital TransformationUnblocking Innovation for Digital Transformation
Unblocking Innovation for Digital TransformationAmazon Web Services
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudJames Serra
 

Ähnlich wie How to Leverage Mainframe Data with Hadoop: Bridging the Gap Between Big Iron & Big Data (20)

Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
Hadoop Summit Amsterdam 2013 - Making Hadoop Ready for Prime Time - Syncsort ...
 
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
Cloudera + Syncsort: Fuel Business Insights, Analytics, and Next Generation T...
 
Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...
Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...
Conquering Disaster Recovery Challenges and Out-of-Control Data with the Hybr...
 
Streaming IBM i to Kafka for Next-Gen Use Cases
Streaming IBM i to Kafka for Next-Gen Use CasesStreaming IBM i to Kafka for Next-Gen Use Cases
Streaming IBM i to Kafka for Next-Gen Use Cases
 
flexpod_hadoop_cloudera
flexpod_hadoop_clouderaflexpod_hadoop_cloudera
flexpod_hadoop_cloudera
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
 
Solving enterprise challenges through scale out storage & big compute final
Solving enterprise challenges through scale out storage & big compute finalSolving enterprise challenges through scale out storage & big compute final
Solving enterprise challenges through scale out storage & big compute final
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
 
The Last Frontier- Virtualization, Hybrid Management and the Cloud
The Last Frontier-  Virtualization, Hybrid Management and the CloudThe Last Frontier-  Virtualization, Hybrid Management and the Cloud
The Last Frontier- Virtualization, Hybrid Management and the Cloud
 
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy:  A Simple, Scalable Solution for Getting Started with HadoopBig Data Made Easy:  A Simple, Scalable Solution for Getting Started with Hadoop
Big Data Made Easy: A Simple, Scalable Solution for Getting Started with Hadoop
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop Migration
 
ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...
ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...
ITsubbotnik Spring 2017: Dmitriy Yatsyuk "Готовое комплексное инфраструктурно...
 
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantageFueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
 
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise ConsciousnessData Engineer's Lunch #60: Series - Developing Enterprise Consciousness
Data Engineer's Lunch #60: Series - Developing Enterprise Consciousness
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual Workshop
 
Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...
Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...
Liberate Your Data: Integrate Data From Traditional On-Prem Systems to Next-G...
 
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
 
Unblocking Innovation for Digital Transformation
Unblocking Innovation for Digital TransformationUnblocking Innovation for Digital Transformation
Unblocking Innovation for Digital Transformation
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloud
 

Mehr von Precisely

Crucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdfCrucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdfPrecisely
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Precisely
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Precisely
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Precisely
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fPrecisely
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsPrecisely
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarPrecisely
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPPrecisely
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenPrecisely
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsPrecisely
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyPrecisely
 
Effective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowEffective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowPrecisely
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellencePrecisely
 
5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation ManagementPrecisely
 
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowUnlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowPrecisely
 
Navigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckNavigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckPrecisely
 
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformanceMainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformancePrecisely
 
Preventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations ManagementPreventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations ManagementPrecisely
 
Migrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and ConsMigrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and ConsPrecisely
 

Mehr von Precisely (20)

Crucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdfCrucial Considerations for AI-ready Data.pdf
Crucial Considerations for AI-ready Data.pdf
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10Justifying Capacity Managment Webinar 4/10
Justifying Capacity Managment Webinar 4/10
 
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
Automate Studio Training: Materials Maintenance Tips for Efficiency and Ease ...
 
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
Leveraging Mainframe Data in Near Real Time to Unleash Innovation With Cloud:...
 
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3fTestjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
Testjrjnejrvnorno4rno3nrfnfjnrfnournfou3nfou3f
 
Data Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity TrendsData Innovation Summit: Data Integrity Trends
Data Innovation Summit: Data Integrity Trends
 
AI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity WebinarAI You Can Trust - Ensuring Success with Data Integrity Webinar
AI You Can Trust - Ensuring Success with Data Integrity Webinar
 
Optimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAPOptimisez la fonction financière en automatisant vos processus SAP
Optimisez la fonction financière en automatisant vos processus SAP
 
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige InvestitionenSAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
SAPS/4HANA Migration - Transformation-Management + nachhaltige Investitionen
 
Automatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIsAutomatisierte SAP Prozesse mit Hilfe von APIs
Automatisierte SAP Prozesse mit Hilfe von APIs
 
Moving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and PreciselyMoving IBM i Applications to the Cloud with AWS and Precisely
Moving IBM i Applications to the Cloud with AWS and Precisely
 
Effective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to KnowEffective Security Monitoring for IBM i: What You Need to Know
Effective Security Monitoring for IBM i: What You Need to Know
 
Automate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center ExcellenceAutomate Your Master Data Processes for Shared Service Center Excellence
Automate Your Master Data Processes for Shared Service Center Excellence
 
5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management5 Keys to Improved IT Operation Management
5 Keys to Improved IT Operation Management
 
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter TomorrowUnlock Efficiency With Your Address Data Today For a Smarter Tomorrow
Unlock Efficiency With Your Address Data Today For a Smarter Tomorrow
 
Navigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar DeckNavigating Cloud Trends in 2024 Webinar Deck
Navigating Cloud Trends in 2024 Webinar Deck
 
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak PerformanceMainframe Sort Operations: Gaining the Insights You Need for Peak Performance
Mainframe Sort Operations: Gaining the Insights You Need for Peak Performance
 
Preventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations ManagementPreventing Downtime with Better IT Operations Management
Preventing Downtime with Better IT Operations Management
 
Migrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and ConsMigrating IBM i Systems to the Cloud: Exploring the Pros and Cons
Migrating IBM i Systems to the Cloud: Exploring the Pros and Cons
 

Kürzlich hochgeladen

(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 

Kürzlich hochgeladen (20)

(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 

How to Leverage Mainframe Data with Hadoop: Bridging the Gap Between Big Iron & Big Data

  • 1. Mainframe + Hadoop Bridging the Gap Between Big Iron and Big Data Matt Brandwein, Director, Product Marketing, Cloudera Jorge A. Lopez, Director, Product Marketing, Syncsort ( + )
  • 2. Mainframes | A Critical Source of Big Data 71% 30 Billion Fortune 500 Top 25 World Banks Bus. Transactions / day 9 of World’s Top Insurers 23 of Top 25 US Retailers 2
  • 3. How Mainframes Work Copy Application Analysis - TCB Time & Count 70,000 60.0 60,000 50.0 50,000 40.0 40,000 30.0 30,000 20.0 20,000 10.0 10,000 0 0.0 Time of Day $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ • Every month you pay based on CPU Expensive utilization, commonly measured in MIPS Processing • Any MIPS reduction = Instant OpEx savings Number of COPY Applications per Interval TCB Time (mins) • Storage can reach up to $100K/TB Expensive • Tape technology is often present Storage • Uses compressed data types (EBCDIC, packed decimal) $15.7 Million Typical MIPS costs for the “average” $10B organization 3
  • 4. The Hadoop Opportunity Produce POWERFUL INSIGHTS by combining behavioral data with granular mainframe data Increase BUSINESS AGILITY & REDUCE COSTS by offloading mainframe data & batch processing to HADOOP Image: quirkyjoe.com 4
  • 5. Perception: Just Call the Mainframe Guy… Images: http://monkeestv.tripod.com/BatMonkee/ 5
  • 6. Reality: Far Away… So Close! Reality SMS Compression SMS Compression SMS Compression DB Tables, Flat Files DB Tables, Flat Files DB Tables, Flat Files Filtering , Reformatting Copy, Sort, Join, Aggregation EBCDIC to ASCII Call MF Guy Filtering , Reformatting Copy, Sort, Join, Aggregation EBCDIC to ASCII Call MF Guy Filtering , Reformatting Copy, Sort, Join, Aggregation EBCDIC to ASCII Cobol copybooks Cobol copybooks Cobol copybooks Every Change = Time, Cost Image: bottletales.com 6
  • 7. Suits & Hoodies – The Most Unlikely Duo Integration • Connectivity • Data conversion (EBCDIC vs ASCII) Gaps Expertise Gaps • COBOL appeared in 1959, Hadoop in 2005 • Mainframe & Hadoop skills shortage Security Gaps • Hosts mission critical sensitive data • Very difficult to install new software on the MF Costs Gaps Suits & Hoodies idea: Merv Adrian, Gartner Research. • Mainframe data is (expensive) Big Data • Even FTP costs CPU cycles (MIPS) 7
  • 8. Bridging the Gap Between Big Iron and Big Data + A Smarter Approach to BIG Mainframe Data! Syncsort DMX-h ETL Edition Connect Translate Process CLOUDERA THE PLATFORM FOR BIG DATA CDH Cloudera Manager Brings batch & realtime compute to storage - Cloudera Navigator Works with all types of data - Cloudera Support Changes the economics of data management  Zero-MIPS Connectivity  Painless Integration & Translation  Mainframe-like Performance & Reliability  Massively Affordable Scalability  Support for Offloading Batch Cobol & JCL Processing to Hadoop  Iron-clad Security  Decades of Proven Mainframe Expertise  Easy Deployment, Monitoring & Admin 8
  • 9. Cloudera The Standard for Apache Hadoop™ in the Enterprise CLOUDERA UNIVERSITY ADMINISTRATOR TRAINING DEVELOPER TRAINING Processing Interactive Interactive SQL Search Resource Management Storage ANALYST TRAINING Analytics DATA SCIENCE TRAINING Metadata and Security Batch CLOUDERA SUPPORT CERTIFICATION PROGRAMS PROFESSIONAL SERVICES CLOUDERA MANAGER USE CASE DISCOVERY NEW HADOOP DEPLOYMENT PROOF-OF-CONCEPT PRODUCTION PILOTS Integration 9 CLOUDERA NAVIGATOR PROCESS & TEAM DEVELOPMENT DEPLOYMENT CERTIFICATION
  • 10. Why Cloudera?  Assurance – Remove Risk  Expertise – Maximize Value Meets Enterprise Requirements Interactive SQL Data Access We Deliver Customer Success with Hadoop in the Enterprise  Customers: Over 50% of the Fortune 50 and 65% of the Fortune 500 plus top US intelligence and defense agencies  Partners: 700+ in hardware, software, and services  Education: 15,000+ trained annually; developers, admins, analysts, data scientists  Community: Founders and top supporters of the Hadoop open source ecosystem working for you 10 Enterprise Capabilities  Influence – Build the Future ✔ Interactive Search ✔ SAS, R Integration ✔ Resource Management ✔ Security ✔ Highly Availability ✔ Disaster Recovery ✔ Audit and Lineage ✔ Online Upgrades ✔ Change Mgmt & Rollback ✔
  • 11. Why Syncsort? For 40 years we have been helping companies solve their big data issues…even before they knew the name Big Data! Integrating Big Data… Smarter! Our customers are achieving the impossible, every day! • 50% of all mainframes run Syncsort • 1,500 Mainframe Customers: Most used & trusted 3rd party mainframe software • Speed leader for ETL & Sort • A history of innovation • 25+ Issued & Pending Patents • Large global customer base • 15,000+ deployments in 68 countries • First-to-market, fully integrated approach to Hadoop ETL Key Partners 11
  • 12. Smart Contributions to Improve Hadoop Augmenting Critical Batch Processing Capabilities JIRA Description 2452 Allow External Sorter Plugin for MR 4808 Allow Reduce-side merge to be pluggable 4809 Make classes required for 2454 public 4812 Create reduce input merger plug-in 4842 Shuffle race can hang reducer …and more!! Plugin Shipping on CDH 4.2 and later 12
  • 13. The Smarter Approach to Hadoop ETL… and Mainframe  Connect – One tool to connect all your data  Translate - Best in class mainframe data access with seamless data translation & COBOL Copybooks support  Process – Hadoop ETL without coding. Develop, test & debug locally in Windows; deploy on Hadoop PLUS… Connect Translate Process  Enterprise-grade security  Smarter deployment, monitoring & administration  Disruptive cost-structure  Decades of Mainframe expertise 13
  • 14. Integration - Why is Working with Mainframe Data So Hard? File Definitions (Metadata) in Cobol Copybooks EBCDIC A B Y Z A 6 bytes Non-Viewable, Saves space B Y Z x’41’ x’42’ x’59’ x’5A’ x’C1’ x’C2’ x’E8’ x’E9’ Packed Decimal x’12843154976C’ ASCII Conversion Conversion Numeric 12,843,154,976 14 bytes Viewable, WYSIWYG 14
  • 15. Cloudera + Syncsort: Smarter Connectivity… Also for Mainframe Because Mainframe Is Big Data Too! Connect • Read files directly from mainframe • No software required on mainframe • Already installed on 50% of mainframes Translate • Parse & transform: packed decimal, EBCDIC/ASCII, multi-format • No coding required Load & • Load directly to HDFS • Offload batch data processing Process • Find more insights 15
  • 16. Iron-clad Security. Zero Pain. Zero Mainframe Footprint Mainframe Provides Iron-clad Security… So, Why compromise? Other solutions Cloudera + Syncsort Requires you to install unproven, untested software on Your Mainframe Adopt their own security model and constantly sync with your own LDAP • Nothing to install on mainframe • Painless support for Kerberos & LDAP • User-level security using authentication protocol • Secure data loads & extracts • Secure job execution 16
  • 17. The Economics of Data Cost of managing 1TB of data $20,000 – $100,000 $15,000 – $80,000 $250 – $2,000 Mainframe EDW Hadoop But there’s more… Scalability Performance Reliability Agility Skills Supply 17
  • 18. Smarter Deployment, Monitoring & Administration… …through Cloudera Manager 1 Monitor 2 Diagnose 3 Integrate 4 Manage Easily deploy, configure & optimize clusters Maintain a central view of all activity Easily identify and resolve issues Cloudera Cluster Unleash Hadoop’s Potential Use Cloudera Manager with existing tools + 18
  • 19. Get a 360° View of Your Cluster, Including DMX-h Logs View service health & performance Get host-level snapshots Monitor & diagnose workloads Gather, view & search Hadoop & DMX-h logs …And more!! + 19
  • 20. Understanding Mainframe Data at Major US Bank Before: Manual Effort Weeks After: DMX-h + CDH ? 86-page copybook Customer hit a wall after months of manual effort migrating Mainframe data • Difficult to find data errors. No Mainframe application logic that matches Copybook • Large and complex Copybooks • Depends on Mainframe team to provide data • Very manual-intensive ; inadequate documentation • Not scalable. Only a few Java + Mainframe experts could do the work 4 hrs 86-page copybook ( + ) • Easy to validate Copybooks and find data errors • Ability to pull data directly from Mainframe without relying on Mainframe team • No coding. No scripting. Easier to document, maintain & reuse • Enables developers with a broader set of skills to build complex migration jobs. 20
  • 21. Three Quick Takeaways 1. Bring Suits & Hoodies together early in the process Build cross-organizational teams Understand mutual concerns Identify critical data & applications 2. Clearly define business & IT objectives Reduce costs Uncover new insights 3. Create a roadmap that gradually builds the skills of your organization Copy  Migrate  Offload Image Source: http://archrecord.construction.com/news/2012/03/A-Stitch-in-the-Urban-Fabric.asp 21
  • 22. Bridging the Gap Between Big Iron and Big Data + A Smarter Approach to BIG Mainframe Data! Syncsort DMX-h ETL Edition Connect Translate Process CLOUDERA THE PLATFORM FOR BIG DATA CDH Cloudera Manager Brings batch & realtime compute to storage - Cloudera Navigator Works with all types of data - Cloudera Support Changes the economics of data management  Zero-MIPS Connectivity  Painless Integration & Translation  Mainframe-like Performance & Reliability  Massively Affordable Scalability  Support for Offloading Batch Cobol & JCL Processing to Hadoop  Iron-clad Security  Decades of Proven Mainframe Expertise  Easy Deployment, Monitoring & Admin 22
  • 23. Test Drive DMX-h: Bridge the Gap Between Big Iron & Big Data! • Self-contained image • Use case accelerators for • mainframe, Hadoop and more! ( + Try it FREE at: syncsort.com/try Stop by booth #304 Get a demo | Get a shirt )