SlideShare a Scribd company logo
1 of 40
<Insert Picture Here>
Oracle Big Data Appliance and Solutions
Downlload this slide
http://ouo.io/OtuIkv
The following is intended to outline our general product
direction. It is intended for information purposes only, and
may not be incorporated into any contract. It is not a
commitment to deliver any material, code, or functionality,
and should not be relied upon in making purchasing
decisions.
The development, release, and timing of any features or
functionality described for Oracle’s products remain at the
sole discretion of Oracle.
Case: On-line Ads and Content
NoSQL
DB
Expert
System
Real-time: Determine
best ad to place
on page for this user
Input into
Lookup user
profile
Add user
if not present
Web
logs
HDFS
Profiles
NoSQL DB
High scale
data reductions BI and
Analytics
Billing
Predictions
on browsing
Actual
ads
served
Low
Latency
Batch
Agenda
• Big Data Technology
• Oracle Big Data Appliance
• Big Data Applications
• Summary
• Q&A
<Insert Picture Here>
Big Data Technology
• Deep Analytics
• Agile Development
• Massive Scalability
• Real Time Results
• High Throughput
• In-Place Preparation
• All Data Sources/Structures
• Low, predictable Latency
• High Transaction Volume
• Flexible Data Structures
Big Data: Infrastructure Requirements
Acquire Organize Analyze
Divided Solution Spectrum
Acquire AnalyzeOrganize
MapReduce
Solutions
DBMS
(DW)
DBMS
(OLTP)
Advanced
Analytics
Distributed
File Systems
Transaction
(Key-Value)
Stores
ETL
NoSQL
Flexible
Specialized
Developer
Centric
SQL
Trusted
Secure
Administered
Dynamic
Schema
Data
Variety
Schema
8 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Insert Information Protection Policy Classification from Slide 8
Oracle Integrated Software Solution Stack
Acquire AnalyzeOrganize
Oracle
Database
(DW)
Oracle
Database
(OLTP)
In-DB
Analytics
“R”
Mining
Text
Graph
Spatial
Oracle
BI EE
Oracle NoSQL
DB
HDFS Hadoop
Oracle
Data Integrator
Oracle Loader
for Hadoop
Dynamic
Schema
Data
Variety
Schema
9 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Insert Information Protection Policy Classification from Slide 8
Oracle Engineered Solutions
Oracle
Database
(DW)
Oracle
Database
(OLTP)
In-DB
Analytics
“R”
Mining
Text
Graph
Spatial
Oracle
BI EE
Oracle NoSQL
DB
HDFS Hadoop
Oracle
Data Integrator
Oracle Loader
for Hadoop
Big Data Appliance
• Hadoop
• NoSQL Database
• Oracle Loader for hadoop
• Oracle Data Integrator
Oracle Exadata
• OLTP & DW
• Data Mining & Oracle R
• Semantics
• Spatial
Exalytics
• Speed of
Thought
Analytics
Acquire AnalyzeOrganize
Dynamic
Schema
Data
Variety
Schema
Big Data Appliance
Batch Usage Model
Oracle
Big Data Appliance
Oracle
Exadata
InfiniBand
Acquire Organize Analyze
Oracle
Exalytics
InfiniBand
11 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Insert Information Protection Policy Classification from Slide 8
Why build a Hadoop Appliance?
• Time to Build?
• Required Expertise?
• Cost and Difficulty Maintaining?
•18 Sun X4270 M2 Servers
– 48 GB memory per node = 864 GB memory
– 12 Intel cores per node = 216 cores
– 24 TB storage per node = 432 TB storage
•40 Gb p/sec InfiniBand
•10 Gb p/sec Ethernet
Oracle Big Data Appliance Hardware
Big Data Appliance
Cluster of industry standard servers for Hadoop and NoSQL Database
• Focus on Scalability and Availability at low cost
Compute and Storage
• 18 High-performance low-cost
servers acting as Hadoop
nodes
• 24 TB Capacity per node
• 2 6-core CPUs per node
• Hadoop triple replication
• NoSQL Database triple
replication
10GigE Network
• 8 10GigE ports
• Datacenter connectivity
InfiniBand Network
• Redundant 40Gb/s switches
• IB connectivity to Exadata
Scale Out to Infinity
Scale out by connecting racks
to each other using Infiniband
• Expand up to eight racks without
additional switches
• Scale beyond eight racks by adding
an additional switch
•Oracle Linux 5.6
•Java Hotspot VM
•Apache Hadoop Distribution v0.20.x
•R Distribution
•Oracle NoSQL Database Enterprise
Edition
•Oracle Data Integrator Application
Adapter for Hadoop
•Oracle Loader for Hadoop
Oracle Big Data Appliance Software
Why Open-Source Apache Hadoop?
• Fast evolution in critical features
• Built by the Hadoop experts in the community
• Practical instead of esoteric
• Focus on what is needed for large clusters
• Proven at very large scale
• In production at all the large consumers of Hadoop
• Extremely stable in those environments
• Well-understood by practitioners
Software Layout
• Node 1:
• M: Name Node, Balancer & HBase Master
• S: HDFS Data Node, NoSQL DB Storage Node
• Node 2:
• M: Secondary Name Node, Management,
Zookeeper, MySQL Slave
• S: HDFS Data Node, NoSQL DB Storage Node
• Node 3:
• M: JobTracker, MySQL Master, ODI Agent,
Hive Server
• S: HDFS Data Node, NoSQL DB Storage Node
• Node 4 – 18:
• S: HDFS Data Nodes, Task Tracker, HBase
Region Server, NoSQL DB Storage Nodes
• Your MapReduce runs here!
Big Data Appliance
Big Data for the Enterprise
• Optimized and Complete
• Everything you need to store and integrate
your lower information density data
• Integrated with Oracle Exadata
• Analyze all your data
• Easy to Deploy
• Risk Free, Quick Installation and Setup
• Single Vendor Support
• Full Oracle support for the entire system and
software set
<Insert Picture Here>
Oracle NoSQL Database
Key-Value Store Workloads
• Large dynamic schema based data repositories
• Data capture
• Web applications
• Online retail
• Sensor/statistics/network capture/Mobile Devices
• Data services
• Scalable authentication
• Real-time communication (MMS, SMS, routing)
• Personalization / Localization
• Social Networks
Oracle NoSQL DB
A distributed, scalable key-value database
• Simple Data Model
• Key-value pair with major+sub-key paradigm
• Read/insert/update/delete operations
• Scalability
• Dynamic data partitioning and distribution
• Optimized data access via intelligent driver
• High availability
• One or more replicas
• Disaster recovery through location of replicas
• Resilient to partition master failures
• No single point of failure
• Transparent load balancing
• Reads from master or replicas
• Driver is network topology & latency aware
Storage Nodes
Data Center A
Storage Nodes
Data Center B
NoSQLDB Driver
Application
NoSQLDB Driver
Application
• Operation result
• New Partition Map
• RepNodeStorageTable
information
Resolving a Request
Hash Major Key to determine
Partition id
Use Partition Map to map Partition
id to a Rep Group
Use State Table to determine eligible
Storage Node(s) within Rep Group
Use Load Balancer to select best
eligible Rep Node
Contact Rep Node directly
Client
Operation + Key[M,m] + Value + Transaction Policy
ACID Transactions
Transaction Policy
Write Durability
• Configurable per-operation,
application can set defaults
• Write Transaction Durability consists
of both
a) Sync policy (on Master and
Replica)
• Sync – force to disk
• Write No Sync – force to OS
buffer
• No Sync – write to local log buffer,
flush when convenient
b) Replica Acknowledgement Policy
• All
• Simple Majority
• None
Transaction Policy
Read Consistency
• Configurable per-operation,
application can set defaults
• Read Consistency specified as
Absolute, Time-based, Version or
None
• Absolute  Read from the master
• Time-based  Read from any
replica that is within <time-
interval> of master or better
• Version  Read from any replica
that is current with <transaction-
token> or higher
• None  Read from any replica
Oracle NoSQL DB Differentiation
• Commercial Grade Software and Support
• General-purpose
• Reliable – Based on proven Berkeley DB JE HA
• Easy to install and configure
• Scalable throughput, bounded latency
• Simple Programming and Operational Model
• Simple Major + Sub key and Value data structure
• ACID transactions
• Configurable consistency & durability
• Easy Management
• Web-based console, API accessible
• Manages and Monitors: Topology; Load; Performance; Events; Alerts
• Completes Oracle large scale data storage offerings
Try NoSQL Database on OTN
Oracle NoSQL Database:
• Community Edition is available as a software
only distribution
• Enterprise Edition is available as a separately
licensable product or as part of Big Data Appliance
<Insert Picture Here>
Oracle Loader for Hadoop
27 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Oracle Loader for Hadoop Features
• Load data into a partitioned or non-partitioned table
– Single level, composite or interval partitioned table
– Support for scalar datatypes of Oracle Database
– Load into Oracle Database 11g Release 2
• Runs as a Hadoop job and supports standard options
• Pre-partitions and sorts data on Hadoop
• Online and offline load modes
28 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Oracle Loader for Hadoop
SHUFFLE
/SORT
SHUFFLE
/SORT
MAP
MAP
MAP
MAP
SHUFFLE
/SORT
REDUCE
REDUCE
SHUFFLE
/SORT
SHUFFLE
/SORT
REDUCE
REDUCE
REDUCE
INPUT
2
INPUT
1
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
REDUCE
ORACLE LOADER FOR HADOOP
29 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Oracle Loader for Hadoop: Online Option
SHUFFLE
/SORT
SHUFFLE
/SORT
REDUCE
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
ORACLE LOADER FOR HADOOP Connect to the database
from reducer nodes, load
into database partitions in
parallel
Read target table metadata
from the database
Perform partitioning,
sorting, and data
conversion
30 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Oracle Loader for Hadoop: Offline Option
SHUFFLE
/SORT
SHUFFLE
/SORT
REDUCE
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
ORACLE LOADER FOR HADOOP
Read target table metadata
from the database
Perform partitioning,
sorting, and data
conversion
Write from reducer nodes to
Oracle Data Pump files
Import into the database in
parallel using external table
mechanism
31 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Oracle Loader for Hadoop Advantages
• Offload database server processing to Hadoop:
– Convert input data to final database format
– Compute table partition for row
– Sort rows by primary key within a table partition
• Generate binary datapump files
• Balance partition groups across reducers
32 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Input and Output Formats
Input Formats
• Delimited text
• Hive tables
– Managed and external tables
– Native and non-native tables
• Write your own input format
Output Formats
Online Mode
• Load directly from Hadoop nodes to
Oracle database
– JDBC
– Parallel direct path
Offline Mode
• Datapump format
– Create binary files for external tables
– Import data into the database from the
external table with a SQL statement
• CSV, delimited text
– Load through SQL*Loader or external
table mechanism
33 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Selection Output Option for Use Case
Oracle Loader for Hadoop
Output Option
Use Case Characteristics
Online load with JDBC The simplest use case for non
partitioned tables
Online load with Direct Path Fast online load for partitioned
tables
Offline load with datapump files Fastest load method for external
tables
On Oracle Big Data Appliance
Direct HDFS
Leave data on HDFS
Parallel access from database
Import into database when
needed
34 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Invoking Oracle Loader for Hadoop
• Command line
$ hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader
-libjars <library jar files>
-D <configuration properties>
$HADOOP_HOME/bin/hadoop jar oraloader.jar oracle.hadoop.loader.oraLoader
-libjars avro-1.4.1.jar, commons-math-2.2.jar
-conf connection.xml
-D mapreduce.inputformat.class=oracle.hadoop.loader.lib.input.DelimitedTextInputFormat
-D mapreduce.outputformat.class=oracle.hadoop.loader.lib.output.JDBCOutputFormat
36 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
Automate Usage of Oracle Loader for Hadoop
• ODI has knowledge modules to
– Generate data transformation code to run on Hive/Hadoop
– Invoke Oracle Loader for Hadoop
• Use the drag-and-drop interface in ODI to
– Include invocation of Oracle Loader for Hadoop in any ODI
packaged flow
Oracle Data Integrator (ODI)
37 Copyright © 2011, Oracle and/or its affiliates. All rights
reserved.
<Insert Picture Here>
Summary
Big Data Appliance
Big Data for the Enterprise
• Optimized and Complete
• Everything you need to store and integrate your lower
information density data
• Integrated with Oracle Exadata
• Analyze all your data
• Easy to Deploy
• Risk Free, Quick Installation and Setup
• Single Vendor Support
• Full Oracle support for the entire system and software
set
Big Data Appliance and Exadata
Big Data for the Enterprise
NoSQL DB

HDFS

Hadoop

RDBMS 
Questions

More Related Content

What's hot

Démarche mise en place de référentiel d'architecture
Démarche mise en place de référentiel d'architectureDémarche mise en place de référentiel d'architecture
Démarche mise en place de référentiel d'architecture
Mouhsine LAKHDISSI
 
Solution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation SolutionsSolution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation Solutions
Alan McSweeney
 
Integrated Project Management And Solution Delivery Process
Integrated Project Management And Solution Delivery ProcessIntegrated Project Management And Solution Delivery Process
Integrated Project Management And Solution Delivery Process
Alan McSweeney
 

What's hot (20)

Slides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudSlides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-Cloud
 
Applying reference models with archi mate
Applying reference models with archi mateApplying reference models with archi mate
Applying reference models with archi mate
 
OCI_Icons.pptx
OCI_Icons.pptxOCI_Icons.pptx
OCI_Icons.pptx
 
Aligning The Business Model to Technology Landscapes Enterprise Systems Arch...
Aligning The Business Model to  Technology Landscapes Enterprise Systems Arch...Aligning The Business Model to  Technology Landscapes Enterprise Systems Arch...
Aligning The Business Model to Technology Landscapes Enterprise Systems Arch...
 
Developing ssas cube
Developing ssas cubeDeveloping ssas cube
Developing ssas cube
 
Oracle SQL Developer for SQL Server?
Oracle SQL Developer for SQL Server?Oracle SQL Developer for SQL Server?
Oracle SQL Developer for SQL Server?
 
Démarche mise en place de référentiel d'architecture
Démarche mise en place de référentiel d'architectureDémarche mise en place de référentiel d'architecture
Démarche mise en place de référentiel d'architecture
 
Solution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation SolutionsSolution Architecture And (Robotic) Process Automation Solutions
Solution Architecture And (Robotic) Process Automation Solutions
 
Operational Model Design
Operational Model DesignOperational Model Design
Operational Model Design
 
Master Data Management : quels outils ? quelles bonnes pratiques ?
Master Data Management : quels outils ? quelles bonnes pratiques ?Master Data Management : quels outils ? quelles bonnes pratiques ?
Master Data Management : quels outils ? quelles bonnes pratiques ?
 
Integrated Project Management And Solution Delivery Process
Integrated Project Management And Solution Delivery ProcessIntegrated Project Management And Solution Delivery Process
Integrated Project Management And Solution Delivery Process
 
Oracle Cloud Infrastructure Overview Deck.pptx
Oracle Cloud Infrastructure Overview Deck.pptxOracle Cloud Infrastructure Overview Deck.pptx
Oracle Cloud Infrastructure Overview Deck.pptx
 
Teradata sql-tuning-top-10
Teradata sql-tuning-top-10Teradata sql-tuning-top-10
Teradata sql-tuning-top-10
 
TOGAF Introduction
TOGAF IntroductionTOGAF Introduction
TOGAF Introduction
 
Zachman Framework graphics v3.0
Zachman Framework graphics v3.0Zachman Framework graphics v3.0
Zachman Framework graphics v3.0
 
IBM MQ Disaster Recovery
IBM MQ Disaster RecoveryIBM MQ Disaster Recovery
IBM MQ Disaster Recovery
 
IoT Dynatrace
IoT Dynatrace IoT Dynatrace
IoT Dynatrace
 
Autonomous Data Warehouse
Autonomous Data WarehouseAutonomous Data Warehouse
Autonomous Data Warehouse
 
Enterprise Architecture Management (EAM) I Best Practices I NuggetHub
Enterprise Architecture Management (EAM) I Best Practices I NuggetHubEnterprise Architecture Management (EAM) I Best Practices I NuggetHub
Enterprise Architecture Management (EAM) I Best Practices I NuggetHub
 
En rhel-deploy-oracle-rac-database-12c-rhel-7
En rhel-deploy-oracle-rac-database-12c-rhel-7En rhel-deploy-oracle-rac-database-12c-rhel-7
En rhel-deploy-oracle-rac-database-12c-rhel-7
 

Viewers also liked

Viewers also liked (6)

A7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloudA7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloud
 
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use caseBDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
 
Extending Hortonworks with Oracle's Big Data Platform
Extending Hortonworks with Oracle's Big Data PlatformExtending Hortonworks with Oracle's Big Data Platform
Extending Hortonworks with Oracle's Big Data Platform
 
IlOUG Tech Days 2016 - Big Data for Oracle Developers - Towards Spark, Real-T...
IlOUG Tech Days 2016 - Big Data for Oracle Developers - Towards Spark, Real-T...IlOUG Tech Days 2016 - Big Data for Oracle Developers - Towards Spark, Real-T...
IlOUG Tech Days 2016 - Big Data for Oracle Developers - Towards Spark, Real-T...
 
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
IlOUG Tech Days 2016 - Unlock the Value in your Data Reservoir using Oracle B...
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 

Similar to Oracle big data appliance and solutions

Similar to Oracle big data appliance and solutions (20)

Presentation big dataappliance-overview_oow_v3
Presentation   big dataappliance-overview_oow_v3Presentation   big dataappliance-overview_oow_v3
Presentation big dataappliance-overview_oow_v3
 
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the Organization
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2Whats new in Oracle Database 12c release 12.1.0.2
Whats new in Oracle Database 12c release 12.1.0.2
 
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid WarehouseUsing the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
Using the Power of Big SQL 3.0 to Build a Big Data-Ready Hybrid Warehouse
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
 
Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster
 
Practical introduction to hadoop
Practical introduction to hadoopPractical introduction to hadoop
Practical introduction to hadoop
 
Optimize with Open Source
Optimize with Open SourceOptimize with Open Source
Optimize with Open Source
 
Novinky v Oracle Database 18c
Novinky v Oracle Database 18cNovinky v Oracle Database 18c
Novinky v Oracle Database 18c
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
 
A3 transforming data_management_in_the_cloud
A3 transforming data_management_in_the_cloudA3 transforming data_management_in_the_cloud
A3 transforming data_management_in_the_cloud
 
TDC2016SP - Trilha NoSQL
TDC2016SP - Trilha NoSQLTDC2016SP - Trilha NoSQL
TDC2016SP - Trilha NoSQL
 
Oracle Database Appliance (ODA) X6-2 Portfolio Overview
Oracle Database Appliance (ODA) X6-2 Portfolio OverviewOracle Database Appliance (ODA) X6-2 Portfolio Overview
Oracle Database Appliance (ODA) X6-2 Portfolio Overview
 
Scaling db infra_pay_pal
Scaling db infra_pay_palScaling db infra_pay_pal
Scaling db infra_pay_pal
 
New Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference ArchitecturesNew Ceph capabilities and Reference Architectures
New Ceph capabilities and Reference Architectures
 
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
Software Defined Storage, Big Data and Ceph - What Is all the Fuss About?
 
Big Data training
Big Data trainingBig Data training
Big Data training
 
Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture Delivering Apache Hadoop for the Modern Data Architecture
Delivering Apache Hadoop for the Modern Data Architecture
 

More from solarisyougood

More from solarisyougood (20)

Emc vipr srm workshop
Emc vipr srm workshopEmc vipr srm workshop
Emc vipr srm workshop
 
Emc recoverpoint technical
Emc recoverpoint technicalEmc recoverpoint technical
Emc recoverpoint technical
 
Emc vmax3 technical deep workshop
Emc vmax3 technical deep workshopEmc vmax3 technical deep workshop
Emc vmax3 technical deep workshop
 
EMC Atmos for service providers
EMC Atmos for service providersEMC Atmos for service providers
EMC Atmos for service providers
 
Cisco prime network 4.1 technical overview
Cisco prime network 4.1 technical overviewCisco prime network 4.1 technical overview
Cisco prime network 4.1 technical overview
 
Designing your xen desktop 7.5 environment with training guide
Designing your xen desktop 7.5 environment with training guideDesigning your xen desktop 7.5 environment with training guide
Designing your xen desktop 7.5 environment with training guide
 
Ibm aix technical deep dive workshop advanced administration and problem dete...
Ibm aix technical deep dive workshop advanced administration and problem dete...Ibm aix technical deep dive workshop advanced administration and problem dete...
Ibm aix technical deep dive workshop advanced administration and problem dete...
 
Ibm power ha v7 technical deep dive workshop
Ibm power ha v7 technical deep dive workshopIbm power ha v7 technical deep dive workshop
Ibm power ha v7 technical deep dive workshop
 
Power8 hardware technical deep dive workshop
Power8 hardware technical deep dive workshopPower8 hardware technical deep dive workshop
Power8 hardware technical deep dive workshop
 
Power systems virtualization with power kvm
Power systems virtualization with power kvmPower systems virtualization with power kvm
Power systems virtualization with power kvm
 
Power vc for powervm deep dive tips &amp; tricks
Power vc for powervm deep dive tips &amp; tricksPower vc for powervm deep dive tips &amp; tricks
Power vc for powervm deep dive tips &amp; tricks
 
Emc data domain technical deep dive workshop
Emc data domain  technical deep dive workshopEmc data domain  technical deep dive workshop
Emc data domain technical deep dive workshop
 
Ibm flash system v9000 technical deep dive workshop
Ibm flash system v9000 technical deep dive workshopIbm flash system v9000 technical deep dive workshop
Ibm flash system v9000 technical deep dive workshop
 
Emc vnx2 technical deep dive workshop
Emc vnx2 technical deep dive workshopEmc vnx2 technical deep dive workshop
Emc vnx2 technical deep dive workshop
 
Emc isilon technical deep dive workshop
Emc isilon technical deep dive workshopEmc isilon technical deep dive workshop
Emc isilon technical deep dive workshop
 
Emc ecs 2 technical deep dive workshop
Emc ecs 2 technical deep dive workshopEmc ecs 2 technical deep dive workshop
Emc ecs 2 technical deep dive workshop
 
Emc vplex deep dive
Emc vplex deep diveEmc vplex deep dive
Emc vplex deep dive
 
Cisco mds 9148 s training workshop
Cisco mds 9148 s training workshopCisco mds 9148 s training workshop
Cisco mds 9148 s training workshop
 
Cisco cloud computing deploying openstack
Cisco cloud computing deploying openstackCisco cloud computing deploying openstack
Cisco cloud computing deploying openstack
 
Se training storage grid webscale technical overview
Se training   storage grid webscale technical overviewSe training   storage grid webscale technical overview
Se training storage grid webscale technical overview
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

Oracle big data appliance and solutions

  • 1. <Insert Picture Here> Oracle Big Data Appliance and Solutions Downlload this slide http://ouo.io/OtuIkv
  • 2. The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remain at the sole discretion of Oracle.
  • 3. Case: On-line Ads and Content NoSQL DB Expert System Real-time: Determine best ad to place on page for this user Input into Lookup user profile Add user if not present Web logs HDFS Profiles NoSQL DB High scale data reductions BI and Analytics Billing Predictions on browsing Actual ads served Low Latency Batch
  • 4. Agenda • Big Data Technology • Oracle Big Data Appliance • Big Data Applications • Summary • Q&A
  • 5. <Insert Picture Here> Big Data Technology
  • 6. • Deep Analytics • Agile Development • Massive Scalability • Real Time Results • High Throughput • In-Place Preparation • All Data Sources/Structures • Low, predictable Latency • High Transaction Volume • Flexible Data Structures Big Data: Infrastructure Requirements Acquire Organize Analyze
  • 7. Divided Solution Spectrum Acquire AnalyzeOrganize MapReduce Solutions DBMS (DW) DBMS (OLTP) Advanced Analytics Distributed File Systems Transaction (Key-Value) Stores ETL NoSQL Flexible Specialized Developer Centric SQL Trusted Secure Administered Dynamic Schema Data Variety Schema
  • 8. 8 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 Oracle Integrated Software Solution Stack Acquire AnalyzeOrganize Oracle Database (DW) Oracle Database (OLTP) In-DB Analytics “R” Mining Text Graph Spatial Oracle BI EE Oracle NoSQL DB HDFS Hadoop Oracle Data Integrator Oracle Loader for Hadoop Dynamic Schema Data Variety Schema
  • 9. 9 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 Oracle Engineered Solutions Oracle Database (DW) Oracle Database (OLTP) In-DB Analytics “R” Mining Text Graph Spatial Oracle BI EE Oracle NoSQL DB HDFS Hadoop Oracle Data Integrator Oracle Loader for Hadoop Big Data Appliance • Hadoop • NoSQL Database • Oracle Loader for hadoop • Oracle Data Integrator Oracle Exadata • OLTP & DW • Data Mining & Oracle R • Semantics • Spatial Exalytics • Speed of Thought Analytics Acquire AnalyzeOrganize Dynamic Schema Data Variety Schema
  • 10. Big Data Appliance Batch Usage Model Oracle Big Data Appliance Oracle Exadata InfiniBand Acquire Organize Analyze Oracle Exalytics InfiniBand
  • 11. 11 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 Why build a Hadoop Appliance? • Time to Build? • Required Expertise? • Cost and Difficulty Maintaining?
  • 12. •18 Sun X4270 M2 Servers – 48 GB memory per node = 864 GB memory – 12 Intel cores per node = 216 cores – 24 TB storage per node = 432 TB storage •40 Gb p/sec InfiniBand •10 Gb p/sec Ethernet Oracle Big Data Appliance Hardware
  • 13. Big Data Appliance Cluster of industry standard servers for Hadoop and NoSQL Database • Focus on Scalability and Availability at low cost Compute and Storage • 18 High-performance low-cost servers acting as Hadoop nodes • 24 TB Capacity per node • 2 6-core CPUs per node • Hadoop triple replication • NoSQL Database triple replication 10GigE Network • 8 10GigE ports • Datacenter connectivity InfiniBand Network • Redundant 40Gb/s switches • IB connectivity to Exadata
  • 14. Scale Out to Infinity Scale out by connecting racks to each other using Infiniband • Expand up to eight racks without additional switches • Scale beyond eight racks by adding an additional switch
  • 15. •Oracle Linux 5.6 •Java Hotspot VM •Apache Hadoop Distribution v0.20.x •R Distribution •Oracle NoSQL Database Enterprise Edition •Oracle Data Integrator Application Adapter for Hadoop •Oracle Loader for Hadoop Oracle Big Data Appliance Software
  • 16. Why Open-Source Apache Hadoop? • Fast evolution in critical features • Built by the Hadoop experts in the community • Practical instead of esoteric • Focus on what is needed for large clusters • Proven at very large scale • In production at all the large consumers of Hadoop • Extremely stable in those environments • Well-understood by practitioners
  • 17. Software Layout • Node 1: • M: Name Node, Balancer & HBase Master • S: HDFS Data Node, NoSQL DB Storage Node • Node 2: • M: Secondary Name Node, Management, Zookeeper, MySQL Slave • S: HDFS Data Node, NoSQL DB Storage Node • Node 3: • M: JobTracker, MySQL Master, ODI Agent, Hive Server • S: HDFS Data Node, NoSQL DB Storage Node • Node 4 – 18: • S: HDFS Data Nodes, Task Tracker, HBase Region Server, NoSQL DB Storage Nodes • Your MapReduce runs here!
  • 18. Big Data Appliance Big Data for the Enterprise • Optimized and Complete • Everything you need to store and integrate your lower information density data • Integrated with Oracle Exadata • Analyze all your data • Easy to Deploy • Risk Free, Quick Installation and Setup • Single Vendor Support • Full Oracle support for the entire system and software set
  • 20. Key-Value Store Workloads • Large dynamic schema based data repositories • Data capture • Web applications • Online retail • Sensor/statistics/network capture/Mobile Devices • Data services • Scalable authentication • Real-time communication (MMS, SMS, routing) • Personalization / Localization • Social Networks
  • 21. Oracle NoSQL DB A distributed, scalable key-value database • Simple Data Model • Key-value pair with major+sub-key paradigm • Read/insert/update/delete operations • Scalability • Dynamic data partitioning and distribution • Optimized data access via intelligent driver • High availability • One or more replicas • Disaster recovery through location of replicas • Resilient to partition master failures • No single point of failure • Transparent load balancing • Reads from master or replicas • Driver is network topology & latency aware Storage Nodes Data Center A Storage Nodes Data Center B NoSQLDB Driver Application NoSQLDB Driver Application
  • 22. • Operation result • New Partition Map • RepNodeStorageTable information Resolving a Request Hash Major Key to determine Partition id Use Partition Map to map Partition id to a Rep Group Use State Table to determine eligible Storage Node(s) within Rep Group Use Load Balancer to select best eligible Rep Node Contact Rep Node directly Client Operation + Key[M,m] + Value + Transaction Policy
  • 23. ACID Transactions Transaction Policy Write Durability • Configurable per-operation, application can set defaults • Write Transaction Durability consists of both a) Sync policy (on Master and Replica) • Sync – force to disk • Write No Sync – force to OS buffer • No Sync – write to local log buffer, flush when convenient b) Replica Acknowledgement Policy • All • Simple Majority • None Transaction Policy Read Consistency • Configurable per-operation, application can set defaults • Read Consistency specified as Absolute, Time-based, Version or None • Absolute  Read from the master • Time-based  Read from any replica that is within <time- interval> of master or better • Version  Read from any replica that is current with <transaction- token> or higher • None  Read from any replica
  • 24. Oracle NoSQL DB Differentiation • Commercial Grade Software and Support • General-purpose • Reliable – Based on proven Berkeley DB JE HA • Easy to install and configure • Scalable throughput, bounded latency • Simple Programming and Operational Model • Simple Major + Sub key and Value data structure • ACID transactions • Configurable consistency & durability • Easy Management • Web-based console, API accessible • Manages and Monitors: Topology; Load; Performance; Events; Alerts • Completes Oracle large scale data storage offerings
  • 25. Try NoSQL Database on OTN Oracle NoSQL Database: • Community Edition is available as a software only distribution • Enterprise Edition is available as a separately licensable product or as part of Big Data Appliance
  • 26. <Insert Picture Here> Oracle Loader for Hadoop
  • 27. 27 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Oracle Loader for Hadoop Features • Load data into a partitioned or non-partitioned table – Single level, composite or interval partitioned table – Support for scalar datatypes of Oracle Database – Load into Oracle Database 11g Release 2 • Runs as a Hadoop job and supports standard options • Pre-partitions and sorts data on Hadoop • Online and offline load modes
  • 28. 28 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Oracle Loader for Hadoop SHUFFLE /SORT SHUFFLE /SORT MAP MAP MAP MAP SHUFFLE /SORT REDUCE REDUCE SHUFFLE /SORT SHUFFLE /SORT REDUCE REDUCE REDUCE INPUT 2 INPUT 1 MAP MAP MAP MAP MAP REDUCE REDUCE REDUCE MAP MAP MAP MAP MAP MAP REDUCE REDUCE MAP MAP MAP MAP MAP REDUCE REDUCE REDUCE ORACLE LOADER FOR HADOOP
  • 29. 29 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Oracle Loader for Hadoop: Online Option SHUFFLE /SORT SHUFFLE /SORT REDUCE REDUCE REDUCE MAP MAP MAP MAP MAP MAP REDUCE REDUCE ORACLE LOADER FOR HADOOP Connect to the database from reducer nodes, load into database partitions in parallel Read target table metadata from the database Perform partitioning, sorting, and data conversion
  • 30. 30 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Oracle Loader for Hadoop: Offline Option SHUFFLE /SORT SHUFFLE /SORT REDUCE REDUCE REDUCE MAP MAP MAP MAP MAP MAP REDUCE REDUCE ORACLE LOADER FOR HADOOP Read target table metadata from the database Perform partitioning, sorting, and data conversion Write from reducer nodes to Oracle Data Pump files Import into the database in parallel using external table mechanism
  • 31. 31 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Oracle Loader for Hadoop Advantages • Offload database server processing to Hadoop: – Convert input data to final database format – Compute table partition for row – Sort rows by primary key within a table partition • Generate binary datapump files • Balance partition groups across reducers
  • 32. 32 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Input and Output Formats Input Formats • Delimited text • Hive tables – Managed and external tables – Native and non-native tables • Write your own input format Output Formats Online Mode • Load directly from Hadoop nodes to Oracle database – JDBC – Parallel direct path Offline Mode • Datapump format – Create binary files for external tables – Import data into the database from the external table with a SQL statement • CSV, delimited text – Load through SQL*Loader or external table mechanism
  • 33. 33 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Selection Output Option for Use Case Oracle Loader for Hadoop Output Option Use Case Characteristics Online load with JDBC The simplest use case for non partitioned tables Online load with Direct Path Fast online load for partitioned tables Offline load with datapump files Fastest load method for external tables On Oracle Big Data Appliance Direct HDFS Leave data on HDFS Parallel access from database Import into database when needed
  • 34. 34 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Invoking Oracle Loader for Hadoop • Command line $ hadoop jar oraloader.jar oracle.hadoop.loader.OraLoader -libjars <library jar files> -D <configuration properties> $HADOOP_HOME/bin/hadoop jar oraloader.jar oracle.hadoop.loader.oraLoader -libjars avro-1.4.1.jar, commons-math-2.2.jar -conf connection.xml -D mapreduce.inputformat.class=oracle.hadoop.loader.lib.input.DelimitedTextInputFormat -D mapreduce.outputformat.class=oracle.hadoop.loader.lib.output.JDBCOutputFormat
  • 35. 36 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Automate Usage of Oracle Loader for Hadoop • ODI has knowledge modules to – Generate data transformation code to run on Hive/Hadoop – Invoke Oracle Loader for Hadoop • Use the drag-and-drop interface in ODI to – Include invocation of Oracle Loader for Hadoop in any ODI packaged flow Oracle Data Integrator (ODI)
  • 36. 37 Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
  • 38. Big Data Appliance Big Data for the Enterprise • Optimized and Complete • Everything you need to store and integrate your lower information density data • Integrated with Oracle Exadata • Analyze all your data • Easy to Deploy • Risk Free, Quick Installation and Setup • Single Vendor Support • Full Oracle support for the entire system and software set
  • 39. Big Data Appliance and Exadata Big Data for the Enterprise NoSQL DB  HDFS  Hadoop  RDBMS 