Bharath Nunepalli from HCA presented on how and why HCA implemented an application data archiving and purging strategy using IBM InfoSphere Optim Data Growth Solution for z/OS. HCA needed a strategy due to application usage growth, retention policies, and database maintenance tasks. Optim archive allows isolating historical data from current systems and accessing it through familiar tools. HCA achieved archiving by choosing archive paths, creating access definitions and relationships, and building and executing JCL jobs. Limitations of Optim archive include inability to directly query archived Db2 tables.
Advanced Machine Learning for Business Professionals
InfoSphere Optim archive for archive/purge of application data
1. IBM & IDUG 2019 Data Tech Summit
#Db2World #IDUGDb2 #IBMDb2
Bharath Nunepalli, Sr. Db2 DBA
HCA
10/2/19, 2:20 PM
How and why to archive & purge application data?
2. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Agenda
About HCA
Inc.
Our ERP
Environment
Why we
needed a
data archive
strategy
What is
Optim
archive?
How did we
achieve data
archive and
purge?
Limitations in
using Optim
archive tool
Q&A
2
3. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
HCA Inc. – Some facts about us:
3
- HCA is named one of the world’s most ethical companies for
nine years in a row
- 184 hospitals and approximately 2,000 sites of care,
including surgery centers, freestanding ERs, urgent care
centers, and physician clinics in 21 states and the United
Kingdom.
- Ranked 63rd in Fortune 500
- 249,000 employees
38,000 active physicians
90,000 nurses
5,300 IT employees
- 28 million patient encounters per year
8.6 million emergency visits per year
4. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
4
ERP
System
Financials
Payroll
Supply
Chain
Resource
Planning
HR
Enterprise Resource Planning (ERP)
Environment:
- 120+ databases and different swim lanes
supporting ERP development and maintenance
- 1000+ Tablespaces, 2,800 Tables & 7,500 IX
per DB
- Largest table has 1.5+ billion rows and
7 Indexes
5. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Why we needed a data archive strategy
5
App usage
growth
Retention
policies
Tiresome
DBA tasks
Vendor
limitations
6. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
What is Optim archive?
IBMInfoSphere Optim Data Growth Solution for z/OSprovides everything you need to
create and manage archives of relationally intact data from databases with any number
of tables and relationships. Using the archiving features in Optim Data Growth Solution
for z/OS, you can:
• Isolate historical data from current activity and safely remove it to a secure archive.
• Access archived historical data easily, using familiar tools and interfaces.
• Restore archived data to its original business context when it requires additional
processing.
• Build repetitive process which can be executed whenever needed.
6
7. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Prod operational
database
Archive
File
1
2
1
Mainframes
ODBC/JDBC
Reporting Tools
ODM
Optim
Example1
2
7
1
2
Archive
Purge
8. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Prod operational
database
Archive
File
Archive
Database
1
Mainframes
Optim
ODM
23
3
2
1
FTP file
4
ODBC/JDBC
Reporting Tools
Example 2
8
Archive
Restore
1
2
3 Purge
4
Create
FTP file
9. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Prod operational
database
Archive
File
Archive
Database
DB2LUW/Oracle/SQLServer
ODBC/JDBC
Reporting Tools
Mainframes
Optim
ODM
1
1 2
2
3
Example 3
3
9
Archive
Restore
1
2
3 Purge
10. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
How did we achieve data archive and purge?
Choosing the suitable archive
path
Creating Access Definitions (AD)
and relationships
Build and execute JCLs
10
11. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
1. Choosing the suitable archive path
Prod
operational
database
Archive
database
1
1 2
2
3
3
Optim
4
Archive File
Mainframes
11
4
1 Archive
2 Restore
3 Purge
4 Reorg
12. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
2. Creating Access Definitions (AD) and relationships
An Access Definition describes the data to be extracted from the source database.
The components of an Access Definition include the following:
- A list of tables from which the data is extracted.
- Selection criteria (WHERE clause in SQL query).
- The list of relationships to be traversed.
12
13. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
13
14. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
14
15. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
15
16. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
16
17. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
17
18. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
18
19. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
19
20. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
SELECT * FROM creator.EMPLOYEE;
SELECT *
FROM creator.HRHISTORY A
INNER JOIN
creator.EMPLOYEE B ON
A.COMPANY = B.COMPANY AND
A.EMPLOYEE = B.EMPLOYEE
WHERE YEAR(A.DATE_STAMP)<= archive_year;
20
21. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
3. Building and executing JCLs
21
22. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
22
23. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
23
24. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
24
25. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
25
Just some stats to wow you!!!
26. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Limitations in using Optim archive tool
26
Prod
operational
database
Archive
database
2
2 3
3
4
4 Optim
5
Db2 table
1
SQL
query
1
2
Archive
File
RFE#OPTIM-I-126
5
1
Run SQL
query
2 Archive
3 Restore
4 Purge
5 Reorg
27. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
Special Thanks to
Greg Czaja (greg.czaja@unicomsi.com)
27
28. IBM & IDUG Data Tech Summit
Silicon Valley Lab | October 2-4, 2019
28