SlideShare ist ein Scribd-Unternehmen logo
1 von 28
IBM & IDUG 2019 Data Tech Summit
#Db2World #IDUGDb2 #IBMDb2
Bharath Nunepalli, Sr. Db2 DBA
HCA
10/2/19, 2:20 PM
How and why to archive & purge application data?
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Agenda
About HCA
Inc.
Our ERP
Environment
Why we
needed a
data archive
strategy
What is
Optim
archive?
How did we
achieve data
archive and
purge?
Limitations in
using Optim
archive tool
Q&A
2
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
HCA Inc. – Some facts about us:
3
- HCA is named one of the world’s most ethical companies for
nine years in a row
- 184 hospitals and approximately 2,000 sites of care,
including surgery centers, freestanding ERs, urgent care
centers, and physician clinics in 21 states and the United
Kingdom.
- Ranked 63rd in Fortune 500
- 249,000 employees
38,000 active physicians
90,000 nurses
5,300 IT employees
- 28 million patient encounters per year
8.6 million emergency visits per year
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
4
ERP
System
Financials
Payroll
Supply
Chain
Resource
Planning
HR
Enterprise Resource Planning (ERP)
Environment:
- 120+ databases and different swim lanes
supporting ERP development and maintenance
- 1000+ Tablespaces, 2,800 Tables & 7,500 IX
per DB
- Largest table has 1.5+ billion rows and
7 Indexes
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Why we needed a data archive strategy
5
App usage
growth
Retention
policies
Tiresome
DBA tasks
Vendor
limitations
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
What is Optim archive?
IBMInfoSphere Optim Data Growth Solution for z/OSprovides everything you need to
create and manage archives of relationally intact data from databases with any number
of tables and relationships. Using the archiving features in Optim Data Growth Solution
for z/OS, you can:
• Isolate historical data from current activity and safely remove it to a secure archive.
• Access archived historical data easily, using familiar tools and interfaces.
• Restore archived data to its original business context when it requires additional
processing.
• Build repetitive process which can be executed whenever needed.
6
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Prod operational
database
Archive
File
1
2
1
Mainframes
ODBC/JDBC
Reporting Tools
ODM
Optim
Example1
2
7
1
2
Archive
Purge
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Prod operational
database
Archive
File
Archive
Database
1
Mainframes
Optim
ODM
23
3
2
1
FTP file
4
ODBC/JDBC
Reporting Tools
Example 2
8
Archive
Restore
1
2
3 Purge
4
Create
FTP file
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Prod operational
database
Archive
File
Archive
Database
DB2LUW/Oracle/SQLServer
ODBC/JDBC
Reporting Tools
Mainframes
Optim
ODM
1
1 2
2
3
Example 3
3
9
Archive
Restore
1
2
3 Purge
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
How did we achieve data archive and purge?
Choosing the suitable archive
path
Creating Access Definitions (AD)
and relationships
Build and execute JCLs
10
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
1. Choosing the suitable archive path
Prod
operational
database
Archive
database
1
1 2
2
3
3
Optim
4
Archive File
Mainframes
11
4
1 Archive
2 Restore
3 Purge
4 Reorg
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
2. Creating Access Definitions (AD) and relationships
An Access Definition describes the data to be extracted from the source database.
The components of an Access Definition include the following:
- A list of tables from which the data is extracted.
- Selection criteria (WHERE clause in SQL query).
- The list of relationships to be traversed.
12
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
13
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
14
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
15
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
16
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
17
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
18
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
19
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
SELECT * FROM creator.EMPLOYEE;
SELECT *
FROM creator.HRHISTORY A
INNER JOIN
creator.EMPLOYEE B ON
A.COMPANY = B.COMPANY AND
A.EMPLOYEE = B.EMPLOYEE
WHERE YEAR(A.DATE_STAMP)<= archive_year;
20
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
3. Building and executing JCLs
21
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
22
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
23
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
24
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
25
Just some stats to wow you!!!
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Limitations in using Optim archive tool
26
Prod
operational
database
Archive
database
2
2 3
3
4
4 Optim
5
Db2 table
1
SQL
query
1
2
Archive
File
RFE#OPTIM-I-126
5
1
Run SQL
query
2 Archive
3 Restore
4 Purge
5 Reorg
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Special Thanks to
Greg Czaja (greg.czaja@unicomsi.com)
27
IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
28

Weitere ähnliche Inhalte

Was ist angesagt?

Martin Brooks Green It Workshop Final
Martin Brooks Green It Workshop FinalMartin Brooks Green It Workshop Final
Martin Brooks Green It Workshop FinalBill St. Arnaud
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
Setup a Data Science Pipeline in a Highly Regulated Environment
Setup a Data Science Pipeline in a Highly Regulated EnvironmentSetup a Data Science Pipeline in a Highly Regulated Environment
Setup a Data Science Pipeline in a Highly Regulated EnvironmentOlaf Hein
 
The use of Asset Management and BIM to Generate Information Requirements #COM...
The use of Asset Management and BIM to Generate Information Requirements #COM...The use of Asset Management and BIM to Generate Information Requirements #COM...
The use of Asset Management and BIM to Generate Information Requirements #COM...Comit Projects Ltd
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
A Digital Future for the Infrastructure Industry #COMIT2019
A Digital Future for the Infrastructure Industry #COMIT2019A Digital Future for the Infrastructure Industry #COMIT2019
A Digital Future for the Infrastructure Industry #COMIT2019Comit Projects Ltd
 
How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...
How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...
How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...SamanthaBerlant
 
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...Tyler Wishnoff
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)ijgca
 
Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...
Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...
Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...Olexiy Lyzun
 
Harel Kodesh, Vice President, Predix and CTO, GE Digital
Harel Kodesh, Vice President, Predix and CTO, GE DigitalHarel Kodesh, Vice President, Predix and CTO, GE Digital
Harel Kodesh, Vice President, Predix and CTO, GE DigitalMIT Enterprise Forum Cambridge
 
Ibm cloud object storage industry workloads
Ibm cloud object storage industry   workloadsIbm cloud object storage industry   workloads
Ibm cloud object storage industry workloadsDiego Alberto Tamayo
 
Cyber infrastructure in engineering design
Cyber infrastructure in engineering designCyber infrastructure in engineering design
Cyber infrastructure in engineering designAmogh Mundhekar
 

Was ist angesagt? (19)

Martin Brooks Green It Workshop Final
Martin Brooks Green It Workshop FinalMartin Brooks Green It Workshop Final
Martin Brooks Green It Workshop Final
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
Setup a Data Science Pipeline in a Highly Regulated Environment
Setup a Data Science Pipeline in a Highly Regulated EnvironmentSetup a Data Science Pipeline in a Highly Regulated Environment
Setup a Data Science Pipeline in a Highly Regulated Environment
 
The use of Asset Management and BIM to Generate Information Requirements #COM...
The use of Asset Management and BIM to Generate Information Requirements #COM...The use of Asset Management and BIM to Generate Information Requirements #COM...
The use of Asset Management and BIM to Generate Information Requirements #COM...
 
Industrial Internet
Industrial InternetIndustrial Internet
Industrial Internet
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
A Digital Future for the Infrastructure Industry #COMIT2019
A Digital Future for the Infrastructure Industry #COMIT2019A Digital Future for the Infrastructure Industry #COMIT2019
A Digital Future for the Infrastructure Industry #COMIT2019
 
How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...
How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...
How to Guarantee Exact Count Distinct Queries with Sub-Second Latency on Mass...
 
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
 
International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)International Journal of Grid Computing & Applications (IJGCA)
International Journal of Grid Computing & Applications (IJGCA)
 
Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...
Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...
Clobbi CEO Dmitry Shapovalov Keynote @CRU 2019 Brussels "Practical case-studi...
 
Harel Kodesh, Vice President, Predix and CTO, GE Digital
Harel Kodesh, Vice President, Predix and CTO, GE DigitalHarel Kodesh, Vice President, Predix and CTO, GE Digital
Harel Kodesh, Vice President, Predix and CTO, GE Digital
 
Ibm cloud object storage industry workloads
Ibm cloud object storage industry   workloadsIbm cloud object storage industry   workloads
Ibm cloud object storage industry workloads
 
Cyber infrastructure in engineering design
Cyber infrastructure in engineering designCyber infrastructure in engineering design
Cyber infrastructure in engineering design
 

Ähnlich wie InfoSphere Optim archive for archive/purge of application data

The Data Lake of The University of Queensland : Building the Foundations for ...
The Data Lake of The University of Queensland : Building the Foundations for ...The Data Lake of The University of Queensland : Building the Foundations for ...
The Data Lake of The University of Queensland : Building the Foundations for ...Amazon Web Services
 
SF Big Analytics Meetup - Exact Count Distinct with Apache Kylin
SF Big Analytics Meetup - Exact Count Distinct with Apache KylinSF Big Analytics Meetup - Exact Count Distinct with Apache Kylin
SF Big Analytics Meetup - Exact Count Distinct with Apache KylinSamanthaBerlant
 
IBM Storage for AI and Big Data
IBM Storage for AI and Big DataIBM Storage for AI and Big Data
IBM Storage for AI and Big DataTony Pearson
 
S110646 storage-for-ai-jburg-v1909c
S110646 storage-for-ai-jburg-v1909cS110646 storage-for-ai-jburg-v1909c
S110646 storage-for-ai-jburg-v1909cTony Pearson
 
2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement Company
2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement Company2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement Company
2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement CompanyPyData Piraeus
 
Swoc21 Feb08 Amig
Swoc21 Feb08 AmigSwoc21 Feb08 Amig
Swoc21 Feb08 Amiglatha_only
 
AI Foundations: Simpler Technologies, Smarter Business
AI Foundations: Simpler Technologies, Smarter BusinessAI Foundations: Simpler Technologies, Smarter Business
AI Foundations: Simpler Technologies, Smarter BusinessTIBCO_Software
 
How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...Antje Barth
 
Mailchimp to the Edge - Establishing Akamai Best Practices at Mailchimp
Mailchimp to the Edge - Establishing Akamai Best Practices at MailchimpMailchimp to the Edge - Establishing Akamai Best Practices at Mailchimp
Mailchimp to the Edge - Establishing Akamai Best Practices at MailchimpBob Strecansky
 
Sensor Data Management & Analytics: Advanced Process Control
Sensor Data Management & Analytics: Advanced Process ControlSensor Data Management & Analytics: Advanced Process Control
Sensor Data Management & Analytics: Advanced Process ControlTIBCO_Software
 
G107980 top-it-trends-atlanta-v1904b
G107980 top-it-trends-atlanta-v1904bG107980 top-it-trends-atlanta-v1904b
G107980 top-it-trends-atlanta-v1904bTony Pearson
 
Program Guide - Big Events
Program Guide - Big EventsProgram Guide - Big Events
Program Guide - Big EventsSapna Nauhria
 
Big Data London Meetup on Customer Experience
Big Data London Meetup on Customer ExperienceBig Data London Meetup on Customer Experience
Big Data London Meetup on Customer ExperienceChristos Hadjinikolis
 
Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...
Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...
Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...DataWorks Summit
 
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformDriven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformArne Roßmann
 
Microservices, Containers, Kubernetes, Kafka, Kanban
Microservices, Containers, Kubernetes, Kafka, KanbanMicroservices, Containers, Kubernetes, Kafka, Kanban
Microservices, Containers, Kubernetes, Kafka, KanbanAraf Karsh Hamid
 
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...PT Datacomm Diangraha
 
Data Center of the Future: Designing a modernized, high performance computing...
Data Center of the Future: Designing a modernized, high performance computing...Data Center of the Future: Designing a modernized, high performance computing...
Data Center of the Future: Designing a modernized, high performance computing...Capgemini
 
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)DataWorks Summit
 
Greenplum: Driving the future of Data Warehousing and Analytics
Greenplum: Driving the future of Data Warehousing and AnalyticsGreenplum: Driving the future of Data Warehousing and Analytics
Greenplum: Driving the future of Data Warehousing and Analyticseaiti
 

Ähnlich wie InfoSphere Optim archive for archive/purge of application data (20)

The Data Lake of The University of Queensland : Building the Foundations for ...
The Data Lake of The University of Queensland : Building the Foundations for ...The Data Lake of The University of Queensland : Building the Foundations for ...
The Data Lake of The University of Queensland : Building the Foundations for ...
 
SF Big Analytics Meetup - Exact Count Distinct with Apache Kylin
SF Big Analytics Meetup - Exact Count Distinct with Apache KylinSF Big Analytics Meetup - Exact Count Distinct with Apache Kylin
SF Big Analytics Meetup - Exact Count Distinct with Apache Kylin
 
IBM Storage for AI and Big Data
IBM Storage for AI and Big DataIBM Storage for AI and Big Data
IBM Storage for AI and Big Data
 
S110646 storage-for-ai-jburg-v1909c
S110646 storage-for-ai-jburg-v1909cS110646 storage-for-ai-jburg-v1909c
S110646 storage-for-ai-jburg-v1909c
 
2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement Company
2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement Company2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement Company
2nd PyData Piraeus meetup - Data Science Initiatives in Titan Cement Company
 
Swoc21 Feb08 Amig
Swoc21 Feb08 AmigSwoc21 Feb08 Amig
Swoc21 Feb08 Amig
 
AI Foundations: Simpler Technologies, Smarter Business
AI Foundations: Simpler Technologies, Smarter BusinessAI Foundations: Simpler Technologies, Smarter Business
AI Foundations: Simpler Technologies, Smarter Business
 
How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...How to build containerized architectures for deep learning - Data Festival 20...
How to build containerized architectures for deep learning - Data Festival 20...
 
Mailchimp to the Edge - Establishing Akamai Best Practices at Mailchimp
Mailchimp to the Edge - Establishing Akamai Best Practices at MailchimpMailchimp to the Edge - Establishing Akamai Best Practices at Mailchimp
Mailchimp to the Edge - Establishing Akamai Best Practices at Mailchimp
 
Sensor Data Management & Analytics: Advanced Process Control
Sensor Data Management & Analytics: Advanced Process ControlSensor Data Management & Analytics: Advanced Process Control
Sensor Data Management & Analytics: Advanced Process Control
 
G107980 top-it-trends-atlanta-v1904b
G107980 top-it-trends-atlanta-v1904bG107980 top-it-trends-atlanta-v1904b
G107980 top-it-trends-atlanta-v1904b
 
Program Guide - Big Events
Program Guide - Big EventsProgram Guide - Big Events
Program Guide - Big Events
 
Big Data London Meetup on Customer Experience
Big Data London Meetup on Customer ExperienceBig Data London Meetup on Customer Experience
Big Data London Meetup on Customer Experience
 
Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...
Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...
Avoiding Log Data Overload in a CI/CD System While Streaming 190 Billion Even...
 
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics PlatformDriven by data - Why we need a Modern Enterprise Data Analytics Platform
Driven by data - Why we need a Modern Enterprise Data Analytics Platform
 
Microservices, Containers, Kubernetes, Kafka, Kanban
Microservices, Containers, Kubernetes, Kafka, KanbanMicroservices, Containers, Kubernetes, Kafka, Kanban
Microservices, Containers, Kubernetes, Kafka, Kanban
 
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...Seminar Accelerating Business Using Microservices Architecture in Digital Age...
Seminar Accelerating Business Using Microservices Architecture in Digital Age...
 
Data Center of the Future: Designing a modernized, high performance computing...
Data Center of the Future: Designing a modernized, high performance computing...Data Center of the Future: Designing a modernized, high performance computing...
Data Center of the Future: Designing a modernized, high performance computing...
 
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
 
Greenplum: Driving the future of Data Warehousing and Analytics
Greenplum: Driving the future of Data Warehousing and AnalyticsGreenplum: Driving the future of Data Warehousing and Analytics
Greenplum: Driving the future of Data Warehousing and Analytics
 

Kürzlich hochgeladen

Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 

Kürzlich hochgeladen (20)

Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 

InfoSphere Optim archive for archive/purge of application data

  • 1. IBM & IDUG 2019 Data Tech Summit #Db2World #IDUGDb2 #IBMDb2 Bharath Nunepalli, Sr. Db2 DBA HCA 10/2/19, 2:20 PM How and why to archive & purge application data?
  • 2. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 Agenda About HCA Inc. Our ERP Environment Why we needed a data archive strategy What is Optim archive? How did we achieve data archive and purge? Limitations in using Optim archive tool Q&A 2
  • 3. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 HCA Inc. – Some facts about us: 3 - HCA is named one of the world’s most ethical companies for nine years in a row - 184 hospitals and approximately 2,000 sites of care, including surgery centers, freestanding ERs, urgent care centers, and physician clinics in 21 states and the United Kingdom. - Ranked 63rd in Fortune 500 - 249,000 employees 38,000 active physicians 90,000 nurses 5,300 IT employees - 28 million patient encounters per year 8.6 million emergency visits per year
  • 4. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 4 ERP System Financials Payroll Supply Chain Resource Planning HR Enterprise Resource Planning (ERP) Environment: - 120+ databases and different swim lanes supporting ERP development and maintenance - 1000+ Tablespaces, 2,800 Tables & 7,500 IX per DB - Largest table has 1.5+ billion rows and 7 Indexes
  • 5. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 Why we needed a data archive strategy 5 App usage growth Retention policies Tiresome DBA tasks Vendor limitations
  • 6. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 What is Optim archive? IBMInfoSphere Optim Data Growth Solution for z/OSprovides everything you need to create and manage archives of relationally intact data from databases with any number of tables and relationships. Using the archiving features in Optim Data Growth Solution for z/OS, you can: • Isolate historical data from current activity and safely remove it to a secure archive. • Access archived historical data easily, using familiar tools and interfaces. • Restore archived data to its original business context when it requires additional processing. • Build repetitive process which can be executed whenever needed. 6
  • 7. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 Prod operational database Archive File 1 2 1 Mainframes ODBC/JDBC Reporting Tools ODM Optim Example1 2 7 1 2 Archive Purge
  • 8. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 Prod operational database Archive File Archive Database 1 Mainframes Optim ODM 23 3 2 1 FTP file 4 ODBC/JDBC Reporting Tools Example 2 8 Archive Restore 1 2 3 Purge 4 Create FTP file
  • 9. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 Prod operational database Archive File Archive Database DB2LUW/Oracle/SQLServer ODBC/JDBC Reporting Tools Mainframes Optim ODM 1 1 2 2 3 Example 3 3 9 Archive Restore 1 2 3 Purge
  • 10. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 How did we achieve data archive and purge? Choosing the suitable archive path Creating Access Definitions (AD) and relationships Build and execute JCLs 10
  • 11. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 1. Choosing the suitable archive path Prod operational database Archive database 1 1 2 2 3 3 Optim 4 Archive File Mainframes 11 4 1 Archive 2 Restore 3 Purge 4 Reorg
  • 12. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 2. Creating Access Definitions (AD) and relationships An Access Definition describes the data to be extracted from the source database. The components of an Access Definition include the following: - A list of tables from which the data is extracted. - Selection criteria (WHERE clause in SQL query). - The list of relationships to be traversed. 12
  • 13. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 13
  • 14. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 14
  • 15. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 15
  • 16. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 16
  • 17. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 17
  • 18. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 18
  • 19. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 19
  • 20. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 SELECT * FROM creator.EMPLOYEE; SELECT * FROM creator.HRHISTORY A INNER JOIN creator.EMPLOYEE B ON A.COMPANY = B.COMPANY AND A.EMPLOYEE = B.EMPLOYEE WHERE YEAR(A.DATE_STAMP)<= archive_year; 20
  • 21. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 3. Building and executing JCLs 21
  • 22. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 22
  • 23. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 23
  • 24. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 24
  • 25. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 25 Just some stats to wow you!!!
  • 26. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 Limitations in using Optim archive tool 26 Prod operational database Archive database 2 2 3 3 4 4 Optim 5 Db2 table 1 SQL query 1 2 Archive File RFE#OPTIM-I-126 5 1 Run SQL query 2 Archive 3 Restore 4 Purge 5 Reorg
  • 27. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 Special Thanks to Greg Czaja (greg.czaja@unicomsi.com) 27
  • 28. IBM & IDUG Data Tech Summit Silicon Valley Lab | October 2-4, 2019 28