ACCELERATING BIG DATA: IOMEMORY, CISCO UCS AND NOSQL
Ashok Joshi, Senior Director – Oracle NoSQL development, Oracle
Sumeet Bansal, Principal Solutions Architect, Fusion-io
AGENDA


▸ Big Data overview
▸ Oracle NoSQL Database overview
▸ Real-time big data management – a business
  perspective
▸ NoSQL testing with YCSB
▸ The Fusion-io value



                         October 1, 2012       2
VOLUME, VELOCITY, VARIETY, VALUE

Big Data characteristics

▸ Terabytes, Petabytes
▸ Multiple sources for data
▸ Text, images, XML, JSON,
  sensor readings…
▸ Not “master” data, but
  important for business
▸ “Real-time” needs



WHO USES BIG DATA?

Web Services
• Clickstream Analysis
• Abuse Prevention

Big Energy
• Granular rate plans
• Grid management

Telco
• Churn reduction
• Network optimization

Storage
• Cost efficient
• Analytics-ready
• Data Store
• Scalable
• Distributed

Financial Services
• Customer loyalty
• Risk
• Trading
• Fraud
• Compliance
• Credit scoring

Government
• Regulatory compliance
• Environmental monitoring
• Cyber security

Large-scale E-commerce
• Recommendation engines
• Cross-channel analytics
• Golden path to purchase

BIG DATA BUSINESS BENEFITS


Sector                           Impact    Effect
US HEALTH CARE                   $300 B    Increase industry value per year
US RETAIL                        60+%      Increase net margin
MANUFACTURING                    -50%      Decrease dev., assembly costs
GLOBAL PERSONAL LOCATION DATA    $100 B    Increase service provider revenue
EUROPE PUBLIC SECTOR ADMIN       €250 B    Increase industry value per year




 “In a big data world, a competitor that fails to
  sufficiently develop its capabilities will be left behind.”


NOSQL DATABASE ARCHITECTURE

Highlights
▸ Available; scalable; fast
▸ Simple administration, key-value data model; transaction
  support
▸ Transparent load balancing;
  elastic
▸ Commercial grade software
  and support
▸ Integrated with related Oracle
  technologies

[Diagram: applications, each with a NoSQL Database driver, connected to storage nodes]
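
The key-value model and driver-side routing described on this slide can be illustrated with a toy sketch. This is not the Oracle NoSQL Database API: the class `ShardedKVStore`, its methods, and the hash-based placement are hypothetical names and assumptions chosen only to show how a driver might send each key to one shard without the application knowing where the data lives.

```python
import hashlib


class ShardedKVStore:
    """Toy in-memory key-value store that routes each key to one shard by hash.

    Hypothetical illustration only; each dict stands in for one shard
    (in a real deployment, a replicated group of storage nodes).
    """

    def __init__(self, num_shards):
        self.shards = [{} for _ in range(num_shards)]

    def _shard_for(self, key):
        # A stable hash means the same key always maps to the same shard.
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % len(self.shards)

    def put(self, key, value):
        self.shards[self._shard_for(key)][key] = value

    def get(self, key):
        # Returns None when the key has never been stored.
        return self.shards[self._shard_for(key)].get(key)


store = ShardedKVStore(num_shards=4)
store.put("user/42/profile", "Ada")
profile = store.get("user/42/profile")
```

The application only calls `put` and `get`; adding shards changes the routing inside the driver, which is the sense in which load balancing is transparent in this sketch.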

NOSQL DB AND CISCO UCS COLLABORATION
WHY DOES IT MATTER



▸ Many components: network, processors, memory,
  software, storage – tested, tuned and optimized
▸ Business can focus on core competency and
  leveraging benefits of big data




NOSQL DB AND FUSION-IO COLLABORATION
WHY DOES IT MATTER



▸ Speed (latency) is critical
  • Amazon study:
    Every 100-millisecond increase in latency costs 1% in sales
    http://highscalability.com/latency-everywhere-and-it-costs-you-sales-how-crush-it


▸ ioMemory enables consistent, extremely low latency and
  extreme throughput
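
To make the Amazon figure concrete, here is a rough back-of-the-envelope sketch. It assumes the 1% per 100 ms relationship extends linearly to other latency values, which the slide does not claim; the function names and the sales figure are hypothetical.

```python
def estimated_sales_loss(added_latency_ms, loss_per_100ms=0.01):
    """Fraction of sales lost for a latency increase, under an assumed linear model."""
    return (added_latency_ms / 100.0) * loss_per_100ms


def lost_revenue(annual_sales, added_latency_ms):
    """Revenue lost per year for a given latency increase (same linear assumption)."""
    return annual_sales * estimated_sales_loss(added_latency_ms)


# Hypothetical business with 1,000,000 in annual sales and 250 ms of added latency.
loss = lost_revenue(1_000_000, 250)
```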




ORACLE NOSQL DB, CISCO UCS, FUSION-IO




▸ Commercial grade solution and support
▸ Tested, tuned, optimized for real-time data
  management
▸ For equivalent performance, much lower CapEx
  and OpEx compared to commodity (DIY) solutions




SYSTEM UNDER TEST



▸ YCSB (Yahoo! Cloud
  Serving Benchmark)
  • 10 client machines
    generate load
  • Mixed workload
    (5% updates/95% reads)
▸ 15 UCS C240 M3 Rack
  Servers
▸ 30 Fusion-io ioDrive2 devices
▸ 2TB of data
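
A mixed 95% read / 5% update workload of this kind is normally described to YCSB in a properties file. The sketch below generates one. The property names are standard YCSB CoreWorkload settings, but the record and operation counts and the zipfian request distribution are assumptions, since the slides do not give the actual settings used in the test. Newer YCSB releases use the package name `site.ycsb` instead of `com.yahoo.ycsb`.

```python
def ycsb_properties(record_count, operation_count, read=0.95, update=0.05):
    """Build the text of a YCSB workload file for a read/update mix."""
    props = {
        "workload": "com.yahoo.ycsb.workloads.CoreWorkload",
        "recordcount": record_count,
        "operationcount": operation_count,
        "readproportion": read,
        "updateproportion": update,
        "scanproportion": 0,
        "insertproportion": 0,
        # Assumed: the slides do not state the request distribution.
        "requestdistribution": "zipfian",
    }
    return "\n".join(f"{key}={value}" for key, value in props.items())


# Hypothetical sizes, for illustration only.
workload_text = ycsb_properties(record_count=1_000_000, operation_count=10_000_000)
```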




CONFIGURATION DETAILS


[Diagram: client machine (YCSB driver); UCS C240 M3 servers, labeled 1 and 2 in each group; NoSQL Database on ioDrive2]

Note: only two shards are shown in the illustration
PERFORMANCE TEST RESULTS


 Number of shards                                                    2          4           8          10

 Mixed workload (95 read/5 write) throughput (ops/sec)         302,152    558,569   1,028,868   1,244,550

 Read latency (milliseconds)                                      0.76       0.79        0.85        0.88

 Mixed workload update latency (milliseconds)                     3.08       3.82        4.29        4.47
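
One way to read these numbers is to look at throughput per shard and at how close the scaling is to linear. The sketch below does that arithmetic using only the figures from the table; taking the 2-shard run as the baseline is a choice made here, not something the slides specify.

```python
# Throughput (ops/sec) by number of shards, copied from the results table.
results = {2: 302_152, 4: 558_569, 8: 1_028_868, 10: 1_244_550}


def per_shard_throughput(results):
    """Average ops/sec handled by each shard."""
    return {shards: ops / shards for shards, ops in results.items()}


def scaling_efficiency(results, baseline=2):
    """Per-shard throughput relative to the baseline run (1.0 = perfectly linear)."""
    base = results[baseline] / baseline
    return {shards: (ops / shards) / base for shards, ops in results.items()}


per_shard = per_shard_throughput(results)
efficiency = scaling_efficiency(results)

for shards in sorted(results):
    print(f"{shards:>2} shards: {per_shard[shards]:>10,.0f} ops/sec per shard, "
          f"{efficiency[shards]:.0%} of linear")
```

Under these figures each shard sustains roughly 124K to 151K ops/sec, and the 10-shard run keeps about 82% of the per-shard throughput of the 2-shard run.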




PUTTING PERFORMANCE IN CONTEXT

Highlights

▸ For example, Twitter: ~150K
  API calls/sec
▸ We can achieve that
  performance on a single
  UCS C240 server using
  two ioDrive2s
▸ Plenty of capacity to handle
  fluctuating demand without
  compromising performance

http://blog.programmableweb.com/2011/05/25/who-belongs-to-the-api-billionaires-club/
CUT-THROUGH ARCHITECTURE AND VSL
FOR EXTREME THROUGHPUT AND LOW LATENCY

▸ Sophisticated architecture
  • maximum performance
▸ Intelligent software
  • advanced features

[Diagram, left: software stack of Applications/Databases, File System, Kernel, Virtual Storage Layer (VSL), ioMemory]
[Diagram, right: Host with CPU and cores, Virtual Storage Layer (VSL), and DRAM / Memory / Operating System and Application Memory holding ioMemory Virtualization Tables; commands and data transfers cross PCIe to the ioDrive, which contains the ioMemory Data-Path Controller, wide channels, and banks]
SOFTWARE DEVELOPMENT KIT ADVANTAGES
FOR GREATER PERFORMANCE OPTIMIZATION

[Diagram: application source code reaches storage through three kinds of interface: conventional access (Simple Block, Network File), direct access I/O (Transactional Block, Native File, Logging, Key-Value Pair), and memory access (Auto-Commit Memory™)]

Traditional Storage: Proprietary Storage OS over Storage Media
Software Defined Storage: Native Flash Translation Layer over Storage Media
TYPES OF IOMEMORY

For Cisco UCS B-Series Blade Servers
▸ Mezzanine Card: 365 GB, 785 GB

For Cisco UCS C-Series Rack Servers
▸ 365 GB, 785 GB, 1.2 TB
▸ 2.4 TB
BIG DATA ANALYTICS - HADOOP




COMPLETE BIG DATA SOLUTION


▸ Right Partners – Cisco and Fusion-io
▸ Multiple technologies for a comprehensive big data
  solution – NoSQL, map-reduce, relational
▸ Tested, integrated, optimized, commercially
  supported solution delivered by leaders
▸ Cost-effective, reliable, ready for the enterprise




THANK YOU
