This document provides an overview and demo of Cloudera Enterprise BDR and Cloudera Navigator. Cloudera Enterprise BDR simplifies backup and disaster recovery for Hadoop by allowing users to centrally configure and monitor replication policies across services like HDFS, HBase and Hive. Cloudera Navigator provides centralized data auditing and access management for Cloudera Enterprise, allowing users to view permissions, audit data access and export audit logs for integration.
4. Why You Need Cloudera Enterprise BDR
1 Cloudera Enterprise is a Mission-Critical Part
of the Data Management Infrastructure
Stores valuable data & runs important workloads
Business continuity is a MUST HAVE
2 Managing Business Continuity for Hadoop
is Complex
Different services that store data – HDFS, HBase, Hive
Backup & disaster recovery is configured separately for each
Processes are manual
4
5. Cloudera Enterprise BDR
Simplified Management of Backup & DR Policies
Central Configuration
Define backup and disaster recover policies and
apply across services
Monitoring & Alerting
Track progress of replication jobs and get notified SITE A SITE B
when data is out of sync
HIVE HIVE
HDFS HDFS
Performance & Reliability NODES NODES
High performance, CDH-optimized replication using
MapReduce (via DistCP)
5
6. Cloudera Enterprise BDR
Version 1.0
CLOUDERA ENTERPRISE
CLOUDERA MANAGER
SELECT CONFIGURE SYNCHRONIZE MONITOR
DISASTER RECOVERY MODULE
CDH
HDFS DISTRIBUTED REPLICATION HIVE METASTORE REPLICATION
HIGH PERFORMANCE REPLICATION THE ONLY DISASTER RECOVERY SOLUTION
USING MAPREDUCE FOR METADATA
HDFS HIVE
6
7. Management Capabilities
Cloudera Enterprise BDR Version 1.0
SELECT Select subset of data or tables to be replicated
CONFIGURE Configure schedule and options for data replication
SYNCHRONIZE Perform synchronization using appropriate tools
MONITOR Report progress, track errors, generate alerts
7
8. Platform Enhancements
CDH 4.2
1 Distributed Copy
Hardened, production-ready DistCP across clusters
Kerberos integration
Cross-cluster HA and federation
Full API access through Cloudera Manager
Detailed error and progress reporting
2 Metastore Replication
SQL import/export between two different metastores
Fix file paths and other cluster-specific information
3 HBase
HBase snapshots v1 (not supported in Cloudera Enterprise BDR 1.0)
8
9. Benefits of Cloudera Enterprise BDR
Centrally manage backup & DR workflows
Reduce Complexity
Simple setup via an intuitive user interface
Simplify processes to meet or exceed SLAs
& Recovery Time Objectives (RTOs)
Maximize Efficiency
Optimize system performance
& network impact through scheduling
Eliminate error-prone manual processes
Reduce Risk & Exposure Get notified when issues occur
The only solution for metadata replication (Hive)
9
10. Cloudera Enterprise BDR
Optional Add-On for Business Continuity
• Backup & DR Management w/Cloudera Manager
• 8x5 or 24x7 Support
• Optional Upgrade from INGEST STORE EXPLORE PROCESS ANALYZE SERVE
Enterprise Core
• Available Now
MANAGEMENT CLOUDERA MANAGER (Sold with Support)
SOFTWARE, DATA
MANAGEMENT &
TECHNICAL SUPPORT CORE BDR
(SUBSCRIPTION)
CDH
100% OPEN SOURCE
OS
OPEN SOURCE PROJECTS
10
12. Why You Need Cloudera Navigator
1 Lots of Data Landing in Cloudera Enterprise
Huge quantities
Many different sources – structured & unstructured
Varying levels of sensitivity
2 Many Users Working with the Data
Administrators & compliance officers
Analysts & data scientists
Business users
3 Need to Effectively Control & Consume Data
Get visibility & control over the environment
Discover, explore and consume data
12
13. Cloudera Navigator
Data Management Suite for Cloudera Enterprise
Audit & Access Management
Ensuring appropriate permissions & auditing
CLOUDERA NAVIGATOR
on data access
Audit &
Discovery & Lifecycle
Access Lineage
Exploration Mgmt.
Discovery & Exploration Mgmt
Discover what data is available and what it Enterprise Metadata Repository
looks like Business metadata
Lineage metadata
Operational metadata
Lineage
Tracing data back to its original source CDH
Lifecycle Management
HDFS HBASE HIVE
Migration of data based on policies
13
14. Cloudera Navigator 1.0
Data Audit & Access Management
Verify Permissions
View which users and groups have access to
files and directories
IAM / LDAP SYSTEM
Audit Configuration
Configuration of audit tracking for CLOUDERA NAVIGATOR 1.0
HDFS, HBase and Hive ACCESS AUDIT LOG
HDFS
SERVICE SERVICE
VIEW PERMISSIONS AUDIT LOG CONFIG
Audit Dashboard AUDIT LOG
COLLECTION
HBASE
Simple, queryable interface to view data access
Information Export 3rd PARTY SIEM / GRC SYSTEM
HIVE
Export audit information for integration with
SIEM tools
14
15. Benefits of Cloudera Navigator 1.0
Store sensitive data
Control Maintain full audit history
The first & only centralized audit tool for Hadoop
Verify access permissions to files & directories
Visibility Report on data access by user and type
View permissions for LDAP/IAM users
Integration Export audit data for integration with 3rd party SIEM tools
15
16. Cloudera Navigator 1.0
Data Management Suite for Cloudera Enterprise
• Centralized Audit Management & Access Control
• 8x5 or 24x7 Support
INGEST STORE EXPLORE PROCESS ANALYZE SERVE
• Add-On to Enterprise
Core
• Available Now MANAGEMENT
SOFTWARE, DATA CLOUDERA NAVIGATOR
MANAGEMENT & AUDIT &
TECHNICAL SUPPORT ACCESS
(SUBSCRIPTION)
CLOUDERA MANAGER
CORE
CDH
100% OPEN SOURCE
OS
OPEN SOURCE PROJECTS
16
Lots of data landing in Cloudera EnterpriseHuge quantitiesMany different sourcesMany different structuresVarying levels of sensitivityDifferent users are working with the same dataAdministrators – how do I ensure the right data is accessible by the right users and applications?Compliance officers – how do I report on who has been accessing the data?Analysts – how do I find out what data is available, where it came from and what it looks like?Need an easy way to empower users and administrators to be effective