SlideShare a Scribd company logo
1 of 26
Download to read offline
Couchbase 2012



     Couchbase Server and IBM BigInsights:
                          One + One = Three
     Steve Beier
     Program Director, Big Data Applications & Solutions, IBM
     Dipti Borkar
     Director, Product Management, Couchbase




                                                                © 2012 IBM Corporation
2 kinds of database management system




                           OLTP	
  




                         Analy+cs	
  




2                                       © 2012 IBM Corporation
2 kinds of database management system




                              OLTP	
  




                            Analy+cs	
  




3                                           © 2012 IBM Corporation
2 kinds of database management system




                              OLTP	
  




                            Analy+cs	
  




4                                           © 2012 IBM Corporation
2 kinds of database management system




                                            Big	
  Users	
  




                                            Big	
  Data	
  




5                                                       © 2012 IBM Corporation
2 kinds of database management system



                                       Simple,	
  fast,	
  elas+c	
  
                                       NoSQL	
  database	
  
                                       with	
  sub-­‐
                                       millisecond	
  
                                       performance	
  at	
  
                                       scale	
  

                                       Map-­‐reduce	
  against	
  
                                       huge	
  datasets	
  to	
  
                                       cook	
  up	
  insights	
  
                                       and	
  answers	
  


6                                                                 © 2012 IBM Corporation
Ad and offer targeting




                                         Ad Targeting                               40	
  milliseconds	
  to	
  
                                                                                    pick	
  the	
  right	
  
                                                                                    offer	
  

                                                                                    profiles,	
  
              raw	
  event	
  data	
  




                                                                                    campaigns	
  /	
  
                                                                                    offers,	
  
                                                        ac:onable	
  insights	
  
                                                                                    cooked	
  insights	
  
                                                                                    raw	
  event	
  data	
  
                                                                                    cooked	
  insights	
  
7                                                                                                              © 2012 IBM Corporation
Content Recommendation Targeting




                                               content
               3	
                          oriented site
            targeted	
  
        recommenda:ons	
  
                                                      1	
  
                                                   events	
     relational
                                                                database


                                    2	
  
                             user	
  profiles	
  




8                                                                            © 2012 IBM Corporation
sqoop

sqoop == sql RDBMS + hadoop

    • a data transfer tool for Hadoop
    • for moving data from non-Hadoop datasources (like
      relational databases, NoSQL) into/out-of Hadoop

Couchbase provides Cloudera Certified sqoop
connector




9                                              © 2012 IBM Corporation
Ad Targeting




                                      Ad Targeting
                                       Platform


                                                       Logs
                                                         Logs
                                                           Logs
       Couchbase Server Cluster                              Logs
                                        sqoop export           Logs


                                                                      flume
                                                                       flow
                                  sqoop import

                                                                         Hadoop Cluster




10                                                                                © 2012 IBM Corporation
Content Driven Site


     In order to keep up with changing needs on
     richer, more targeted content that is delivered
     to larger and larger audiences very quickly,        Content Driven
     data behind content driven sites is shifting to       Web Site
     Couchbase.




                Couchbase Server Cluster                                                        Original RDBMS


                                                                  Logs
                                                                    Logs
                                                                      Logs
                                                                        Logs
                                                                          Logs      Hadoop excels at complex analytics which
                                                                                    may involve multiple steps of processing
                                                                                    which incorporate a number of different data
                                                                                    sources.
                                                                            flume
                                                                             flow                        sqoop import
                                          sqoop export
               sqoop import

                                                                       Hadoop Cluster




11                                                                                                                  © 2012 IBM Corporation
Couchbase à Hadoop



$ sqoop import 
      –-connect http://couchbase-01:8091/pools                  --
table DUMP
$ sqoop import 
      –-connect http://couchbase-01:8091/pools                  --
table BACKFILL_5




12                                                 © 2012 IBM Corporation
Couchbase à Hadoop



$ sqoop import 
      –-connect http://couchbase-01:8091/pools                     --
table DUMP
$ sqoop import 
      –-connect http://couchbase-01:8091/pools                     --
table BACKFILL_5

For import, table must be either:
    •  DUMP: All items currently in Couchbase
    •  BACKFILL_n: All item mutations for n minutes




13                                                    © 2012 IBM Corporation
Hadoop à Couchbase



$ sqoop export 
      --connect http://couchbase-01:8091/pools 
      --table REQUIRED_BUT_IGNORED 
      -–export-dir HDFS_DIRECTORY_TO_EXPORT




14                                                 © 2012 IBM Corporation
sqoop Versions



sqoop 1.4.2

Cloudera CDH3
   •  Ubuntu 10.10 – 11.10; later versions missing package needed for CDH3

Cloudera CDH4 update 1 needed
   •  sqoop bug fix in Cloudera CDH4u1 required




15                                                                           © 2012 IBM Corporation
Couchbase sqoop - Resources



http://www.couchbase.com/develop/connectors/hadoop

http://www.couchbase.com/docs/hadoop-plugin/

https://github.com/couchbase/couchbase-hadoop-plugin

http://www.ibm.com/developerworks/opensource/library/ba-hadoop-couchbase/ba-
    hadoop-couchbase-pdf.pdf




16                                                                 © 2012 IBM Corporation
Big Data platform: Bring Together a Large Volume and Variety of Data
to Find New Insights
                                                     T-Mobile
                §  Analyzing a variety of data at
                    enormous volumes"                Multi-channel customer
                                                     experience analysis
                §  Insights on streaming data"
                §  Large volume structured,
                    semi-structure and               UOIT
                    unstructured data analysis"      Detect life-threatening
                                                     conditions in time to intervene


                                                     Vestas
                                                     Predict weather patterns to plan
                                                     optimal wind turbine usage

                Big Data Platform
                                                     Dublin City Council
                •  Variety
                                                     Optimization and monitoring of
                •  Velocity                          public transportations
                •  Volume
                                                     Brocade
                                                     Identify network security
                                                     intrusions
17                                                                   © 2012 IBM Corporation
                                                                        © 2011 IBM Corporation
Green Energy: Vestas Wind Systems A/S
                                                      Volume

      §  Weather and geographic data
        analysis for wind turbine and wind
        farm site planning
      §  Deployed IBM Big Data to store,
        manage and to analyze location-
        specific data
      §  Analyzing 2.8 petabytes of public
        and private weather data for each
        geographic location
      §  Reduced by 97% - from weeks to
        hours – the modeling time for wind
        forecasting information




18                                            © 2012 IBM Corporation
IBM Watson Demonstrated the Power of Big Data Analytics
                                                                              Variety




    Can we design a computing system that rivals a human’s ability to answer
   questions posed in natural language, interpreting meaning and context and
retrieving, analyzing and understanding vast amounts of information in real-time?
19                                                                   © 2012 IBM Corporation
Big Data Analytics in Smarter Hospitals
                                                                              Velocity

     Big Data enabled doctors from University of Ontario to apply neonatal infant
              monitoring to predict infection in ICU 24 hours in advance




                                                                IBM Data Baby
                                                                 youtube.com


20                                                                    © 2012 IBM Corporation
Asian telco reduces
          billing costs and
          improves customer
          satisfaction.
          Capabilities:
               Stream Computing
               Analytic Accelerators

          Real-time mediation and analysis of
            6B CDRs per day

          Data processing time reduced       from
            12 hrs to 1 sec

          Hardware cost reduced to 1/8th
          Proactively address issues
     21
           (e.g. dropped calls) impacting customer
                                    © 2012 IBM Corporation
21         satisfaction.
Telecommunications – Analyze in real time
§  A Telco processing Call Detail Records                                                500K/sec, 6B+ IPDRs analyzed
     –  6 Billion CDRs per day                                                            per day on more than 4 PBs/yr.
     –  Deduplicating data over 7 days                                                    sustaining 1GBps.
     –  Processing latency reduced from 12 hours to a few seconds

§  A Telco implementing a solution to access and analyze call, internet usage and texting detail
    records (xDRs) in real-time
     –  91% reduction in time to merge data
     –  93% reduction in storage requirements
     –  85% reduction in servers used

§  A Telco requiring a solution to analyze up to 25M messages per second. At these volumes, in-
    motion analysis is the only option
     –  “Streams handled at least an order of magnitude more events per second on the same hardware than competitors.” (Telco’s
        Chief Architect)
     –  Even at these volumes, Streams provided near linear scalability




22                                                                                                             © 2012 IBM Corporation
Big Data is an integral part of an enterprise data platform
 §    Manage Big Data from the instant it enters the enterprise
 §    High fidelity – no changes to original format
 §    Available for new uses, analyses, and integrations                                                Business Analytic
                                                                                                         Applications (e.g. Cognos,
                                                                                                         SPSS) and Solutions	

                          Big Data Applications	

                                                                             Operational Data Store	





                                    Big Data Platform
                  IBM Big Data Solutions          Client and Partner Solutions                                       Warehouse and
                                                                                                                     Appliances	

              Big Data User Environment
                   Developers              End Users               Admin.




                      Big Data Enterprise Engine                                                  Traditional data sources
                     Streaming                              Internet-scale
                     analytics                                 analytics


                                                                                                      Govern:
               Source data (Web, sensors, logs, media, etc. )	

                 Quality, Lifecycle Management, Security, Privacy
23                                                                                                                 © 2012 IBM Corporation
IBM’s Big Data Platform
                          Bringing Big Data to the Enterprise
                                                                                                              Data
                  IBM Big Data Solutions            Client and Partner Solutions                            Warehouse
                                                                                                             InfoSphere
                                                                                                             Warehouse

                                                                                                            Warehouse
                                                                                                            Appliances
                            Big Data User Environments                                                         Netezza

                  Developers               End Users             Administrators                             Master Data
                                                                                                              Mgmt
                                                                                                           InfoSphere MDM




                                                                                   INTEGRATION
AGENTS




                                                                                                             Database


                            Big Data Enterprise Engines                                                      DB2, Informix

                                                                                                             Content
                                                                                                             Analytics
                                                                                                                ECM




                                                                                   Information Server
                                                                                                             Business
                                                                                                             Analytics
                  Streaming Analytics                 Internet Scale Analytics
                                                                                                           Cognos & SPSS


                                                                                                             Marketing
                    Open Source Foundational Components
                                                                                                                Unica


                    Hadoop       HBase        Pig      Lucene     Jaql   Hive                              Data Growth
                                                                                                           Management
                                                                                                           InfoSphere Optim
24 24                                                                                             © 2012 IBM Corporation
IBM Big Data Platform Tools


Business Users

Data Scientists

Business Analysts

Developers

Administrators



        •    Determine product sentiment, intent, customer segmentation
        •    Execute reusable Apps to classify users, predict sales, and forecast trends
        •    Create spreadsheets and dashboards Analyzing big data
        •    Productive environment for executing analysis (cluster, rank, score with R, ML, Text)
        •    Create reusable analytic Apps without programming
        •    Dynamic open dashboard
25                                                                                   © 2012 IBM Corporation
THANK YOU

     sbeier@us.ibm.com
     dipti@couchbase.com




26                         © 2012 IBM Corporation

More Related Content

What's hot

HugeTable:Application-Oriented Structure Data Storage System
HugeTable:Application-Oriented Structure Data Storage SystemHugeTable:Application-Oriented Structure Data Storage System
HugeTable:Application-Oriented Structure Data Storage Systemqlw5
 
Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10keirdo1
 
Cloudera Sessions - Clinic 1 - Getting Started With Hadoop
Cloudera Sessions - Clinic 1 - Getting Started With HadoopCloudera Sessions - Clinic 1 - Getting Started With Hadoop
Cloudera Sessions - Clinic 1 - Getting Started With HadoopCloudera, Inc.
 
The 25 Most Promising Open Source Projects
The 25 Most Promising Open Source ProjectsThe 25 Most Promising Open Source Projects
The 25 Most Promising Open Source Projectsaf83
 
Impact of in-memory technology and SAP HANA on your business, IT, and career
Impact of in-memory technology and SAP HANA on your business, IT, and careerImpact of in-memory technology and SAP HANA on your business, IT, and career
Impact of in-memory technology and SAP HANA on your business, IT, and careerVitaliy Rudnytskiy
 
Impact of in-memory technology and SAP HANA (2012 Update)
Impact of in-memory technology and SAP HANA (2012 Update)Impact of in-memory technology and SAP HANA (2012 Update)
Impact of in-memory technology and SAP HANA (2012 Update)Vitaliy Rudnytskiy
 
information-broadcasting-in-sap-bw-35
 information-broadcasting-in-sap-bw-35 information-broadcasting-in-sap-bw-35
information-broadcasting-in-sap-bw-35Phani Kumar
 
Hadoop Summit 2012 | Integrating Hadoop Into the Enterprise
Hadoop Summit 2012 | Integrating Hadoop Into the EnterpriseHadoop Summit 2012 | Integrating Hadoop Into the Enterprise
Hadoop Summit 2012 | Integrating Hadoop Into the EnterpriseCloudera, Inc.
 
HCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of ServersHCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of ServersHCL Technologies
 
Top 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridTop 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridScaleOut Software
 
Vision - The Agile Data Center
Vision - The Agile Data CenterVision - The Agile Data Center
Vision - The Agile Data Centerincommoninc
 
Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...
Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...
Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...Verbella CMG
 
Sap sap so h 2013
Sap sap so h 2013Sap sap so h 2013
Sap sap so h 2013deepersnet
 

What's hot (18)

HugeTable:Application-Oriented Structure Data Storage System
HugeTable:Application-Oriented Structure Data Storage SystemHugeTable:Application-Oriented Structure Data Storage System
HugeTable:Application-Oriented Structure Data Storage System
 
Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10
 
Cloudera Sessions - Clinic 1 - Getting Started With Hadoop
Cloudera Sessions - Clinic 1 - Getting Started With HadoopCloudera Sessions - Clinic 1 - Getting Started With Hadoop
Cloudera Sessions - Clinic 1 - Getting Started With Hadoop
 
The 25 Most Promising Open Source Projects
The 25 Most Promising Open Source ProjectsThe 25 Most Promising Open Source Projects
The 25 Most Promising Open Source Projects
 
Impact of in-memory technology and SAP HANA on your business, IT, and career
Impact of in-memory technology and SAP HANA on your business, IT, and careerImpact of in-memory technology and SAP HANA on your business, IT, and career
Impact of in-memory technology and SAP HANA on your business, IT, and career
 
Introduction to h base
Introduction to h baseIntroduction to h base
Introduction to h base
 
Impact of in-memory technology and SAP HANA (2012 Update)
Impact of in-memory technology and SAP HANA (2012 Update)Impact of in-memory technology and SAP HANA (2012 Update)
Impact of in-memory technology and SAP HANA (2012 Update)
 
Treasure Data and Heroku
Treasure Data and HerokuTreasure Data and Heroku
Treasure Data and Heroku
 
Denbury Resources Case Study
Denbury Resources Case StudyDenbury Resources Case Study
Denbury Resources Case Study
 
information-broadcasting-in-sap-bw-35
 information-broadcasting-in-sap-bw-35 information-broadcasting-in-sap-bw-35
information-broadcasting-in-sap-bw-35
 
Hadoop Summit 2012 | Integrating Hadoop Into the Enterprise
Hadoop Summit 2012 | Integrating Hadoop Into the EnterpriseHadoop Summit 2012 | Integrating Hadoop Into the Enterprise
Hadoop Summit 2012 | Integrating Hadoop Into the Enterprise
 
HCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of ServersHCLT Whitepaper: Thermal Design and Management of Servers
HCLT Whitepaper: Thermal Design and Management of Servers
 
Top 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridTop 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data Grid
 
Zh tw cloud computing era
Zh tw cloud computing eraZh tw cloud computing era
Zh tw cloud computing era
 
Vision - The Agile Data Center
Vision - The Agile Data CenterVision - The Agile Data Center
Vision - The Agile Data Center
 
Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...
Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...
Document Imaging Tools and Strategies to Accelerate Your Accounts Payable Act...
 
Sap sap so h 2013
Sap sap so h 2013Sap sap so h 2013
Sap sap so h 2013
 
Oracle Data Warehouse
Oracle Data WarehouseOracle Data Warehouse
Oracle Data Warehouse
 

Viewers also liked

Go simple-fast-elastic-with-couchbase-server-borkar
Go simple-fast-elastic-with-couchbase-server-borkarGo simple-fast-elastic-with-couchbase-server-borkar
Go simple-fast-elastic-with-couchbase-server-borkarDipti Borkar
 
EMC Starter Kit - IBM BigInsights - EMC Isilon
EMC Starter Kit - IBM BigInsights - EMC IsilonEMC Starter Kit - IBM BigInsights - EMC Isilon
EMC Starter Kit - IBM BigInsights - EMC IsilonBoni Bruno
 
Openshift/Kubernetes integration with Apache YARN
Openshift/Kubernetes integration with Apache YARNOpenshift/Kubernetes integration with Apache YARN
Openshift/Kubernetes integration with Apache YARNverbal1714
 
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachSlides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachDATAVERSITY
 
Dynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDPDynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDPHortonworks
 
Ibm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ashIbm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ashAshutosh Mate
 

Viewers also liked (7)

Go simple-fast-elastic-with-couchbase-server-borkar
Go simple-fast-elastic-with-couchbase-server-borkarGo simple-fast-elastic-with-couchbase-server-borkar
Go simple-fast-elastic-with-couchbase-server-borkar
 
EMC Starter Kit - IBM BigInsights - EMC Isilon
EMC Starter Kit - IBM BigInsights - EMC IsilonEMC Starter Kit - IBM BigInsights - EMC Isilon
EMC Starter Kit - IBM BigInsights - EMC Isilon
 
Openshift/Kubernetes integration with Apache YARN
Openshift/Kubernetes integration with Apache YARNOpenshift/Kubernetes integration with Apache YARN
Openshift/Kubernetes integration with Apache YARN
 
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical ApproachSlides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
 
Ansible + Hadoop
Ansible + HadoopAnsible + Hadoop
Ansible + Hadoop
 
Dynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDPDynamic Column Masking and Row-Level Filtering in HDP
Dynamic Column Masking and Row-Level Filtering in HDP
 
Ibm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ashIbm spectrum scale_backup_n_archive_v03_ash
Ibm spectrum scale_backup_n_archive_v03_ash
 

Similar to Couchbase Server and IBM BigInsights: One + One = Three

Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Cloudera, Inc.
 
Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Etu Solution
 
Leveraging System z to Turn Information Into Insight
Leveraging System z to Turn Information Into InsightLeveraging System z to Turn Information Into Insight
Leveraging System z to Turn Information Into Insightdkang
 
Cast Iron Overview Webinar 6.13
Cast Iron Overview Webinar 6.13Cast Iron Overview Webinar 6.13
Cast Iron Overview Webinar 6.13gaborvodics
 
InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2
InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2
InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2Calpont Corporation
 
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopCloudera, Inc.
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...Amr Awadallah
 
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Cloudera, Inc.
 
Cast Iron Overview Webinar 6.13.12 Final(Jb)
Cast Iron Overview Webinar 6.13.12 Final(Jb)Cast Iron Overview Webinar 6.13.12 Final(Jb)
Cast Iron Overview Webinar 6.13.12 Final(Jb)Carolyn Crowe
 
ALM Integration in a Web 2.0 World
ALM Integration in a Web 2.0 WorldALM Integration in a Web 2.0 World
ALM Integration in a Web 2.0 Worldoslc
 
Integrating Hadoop Into the Enterprise – Hadoop Summit 2012
Integrating Hadoop Into the Enterprise – Hadoop Summit 2012Integrating Hadoop Into the Enterprise – Hadoop Summit 2012
Integrating Hadoop Into the Enterprise – Hadoop Summit 2012Jonathan Seidman
 
Business continuity with SAP on IBM i
Business continuity with SAP on IBM iBusiness continuity with SAP on IBM i
Business continuity with SAP on IBM iCOMMON Europe
 
IBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användareIBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användareIBM Sverige
 
Cloud Computing: Making IT Simple
Cloud Computing: Making IT SimpleCloud Computing: Making IT Simple
Cloud Computing: Making IT SimpleBob Rhubart
 
DB2 Web Query whats new
DB2 Web Query whats newDB2 Web Query whats new
DB2 Web Query whats newCOMMON Europe
 
Cloud Computing - Making IT Simple
 Cloud Computing - Making IT Simple Cloud Computing - Making IT Simple
Cloud Computing - Making IT SimpleBob Rhubart
 
Hw09 Data Processing In The Enterprise
Hw09   Data Processing In The EnterpriseHw09   Data Processing In The Enterprise
Hw09 Data Processing In The EnterpriseCloudera, Inc.
 

Similar to Couchbase Server and IBM BigInsights: One + One = Three (20)

Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
 
Accelerate Return on Data
Accelerate Return on DataAccelerate Return on Data
Accelerate Return on Data
 
Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案Big Data 視覺化分析解決方案
Big Data 視覺化分析解決方案
 
Leveraging System z to Turn Information Into Insight
Leveraging System z to Turn Information Into InsightLeveraging System z to Turn Information Into Insight
Leveraging System z to Turn Information Into Insight
 
Cast Iron Overview Webinar 6.13
Cast Iron Overview Webinar 6.13Cast Iron Overview Webinar 6.13
Cast Iron Overview Webinar 6.13
 
InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2
InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2
InfiniDB 3 - Speeding Big Data Analytics in Amazon EC2
 
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
 
Yahoo & Hadoop
Yahoo & HadoopYahoo & Hadoop
Yahoo & Hadoop
 
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
 
Cast Iron Overview Webinar 6.13.12 Final(Jb)
Cast Iron Overview Webinar 6.13.12 Final(Jb)Cast Iron Overview Webinar 6.13.12 Final(Jb)
Cast Iron Overview Webinar 6.13.12 Final(Jb)
 
ALM Integration in a Web 2.0 World
ALM Integration in a Web 2.0 WorldALM Integration in a Web 2.0 World
ALM Integration in a Web 2.0 World
 
Integrating Hadoop Into the Enterprise – Hadoop Summit 2012
Integrating Hadoop Into the Enterprise – Hadoop Summit 2012Integrating Hadoop Into the Enterprise – Hadoop Summit 2012
Integrating Hadoop Into the Enterprise – Hadoop Summit 2012
 
Business continuity with SAP on IBM i
Business continuity with SAP on IBM iBusiness continuity with SAP on IBM i
Business continuity with SAP on IBM i
 
IBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användareIBM Cognos - IBM informations-integration för IBM Cognos användare
IBM Cognos - IBM informations-integration för IBM Cognos användare
 
Cloud Computing: Making IT Simple
Cloud Computing: Making IT SimpleCloud Computing: Making IT Simple
Cloud Computing: Making IT Simple
 
DB2 Web Query whats new
DB2 Web Query whats newDB2 Web Query whats new
DB2 Web Query whats new
 
Cloud Computing - Making IT Simple
 Cloud Computing - Making IT Simple Cloud Computing - Making IT Simple
Cloud Computing - Making IT Simple
 
Hw09 Data Processing In The Enterprise
Hw09   Data Processing In The EnterpriseHw09   Data Processing In The Enterprise
Hw09 Data Processing In The Enterprise
 
IBM Cloud Strategy
IBM Cloud StrategyIBM Cloud Strategy
IBM Cloud Strategy
 

More from Dipti Borkar

Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...Dipti Borkar
 
Revolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement DatabaseRevolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement DatabaseDipti Borkar
 
How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014Dipti Borkar
 
Introduction to couchbase
Introduction to couchbaseIntroduction to couchbase
Introduction to couchbaseDipti Borkar
 
How companies-use-no sql-and-couchbase-10152013
How companies-use-no sql-and-couchbase-10152013How companies-use-no sql-and-couchbase-10152013
How companies-use-no sql-and-couchbase-10152013Dipti Borkar
 
Characteristics of no sql databases
Characteristics of no sql databasesCharacteristics of no sql databases
Characteristics of no sql databasesDipti Borkar
 
How companies use NoSQL and Couchbase - NoSQL Now 2013
How companies use NoSQL and Couchbase - NoSQL Now 2013How companies use NoSQL and Couchbase - NoSQL Now 2013
How companies use NoSQL and Couchbase - NoSQL Now 2013Dipti Borkar
 
How companies use NoSQL and Couchbase
How companies use NoSQL and CouchbaseHow companies use NoSQL and Couchbase
How companies use NoSQL and CouchbaseDipti Borkar
 
Launch webinar-introducing couchbase server 2.0-01202013
Launch webinar-introducing couchbase server 2.0-01202013Launch webinar-introducing couchbase server 2.0-01202013
Launch webinar-introducing couchbase server 2.0-01202013Dipti Borkar
 
Part 2 of the webinar - Which freaking database should I use?
Part 2 of the webinar - Which freaking database should I use?Part 2 of the webinar - Which freaking database should I use?
Part 2 of the webinar - Which freaking database should I use?Dipti Borkar
 
Couchbase Server 2.0 - XDCR - Deep dive
Couchbase Server 2.0 - XDCR - Deep diveCouchbase Server 2.0 - XDCR - Deep dive
Couchbase Server 2.0 - XDCR - Deep diveDipti Borkar
 
Couchbase Server 2.0 - Indexing and Querying - Deep dive
Couchbase Server 2.0 - Indexing and Querying - Deep diveCouchbase Server 2.0 - Indexing and Querying - Deep dive
Couchbase Server 2.0 - Indexing and Querying - Deep diveDipti Borkar
 
Introduction to Couchbase Server 2.0
Introduction to Couchbase Server 2.0Introduction to Couchbase Server 2.0
Introduction to Couchbase Server 2.0Dipti Borkar
 
Transition from relational to NoSQL Philly DAMA Day
Transition from relational to NoSQL Philly DAMA DayTransition from relational to NoSQL Philly DAMA Day
Transition from relational to NoSQL Philly DAMA DayDipti Borkar
 
Introduction to NoSQL and Couchbase
Introduction to NoSQL and CouchbaseIntroduction to NoSQL and Couchbase
Introduction to NoSQL and CouchbaseDipti Borkar
 
Navigating the Transition from relational to NoSQL - CloudCon Expo 2012
Navigating the Transition from relational to NoSQL - CloudCon Expo 2012Navigating the Transition from relational to NoSQL - CloudCon Expo 2012
Navigating the Transition from relational to NoSQL - CloudCon Expo 2012Dipti Borkar
 
Introduction to Couchbase Server 2.0 - CouchConf SF - Tour and Demo
Introduction to Couchbase Server 2.0 - CouchConf SF - Tour and DemoIntroduction to Couchbase Server 2.0 - CouchConf SF - Tour and Demo
Introduction to Couchbase Server 2.0 - CouchConf SF - Tour and DemoDipti Borkar
 

More from Dipti Borkar (18)

Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
 
Couchbase 101
Couchbase 101 Couchbase 101
Couchbase 101
 
Revolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement DatabaseRevolutionizing the customer experience - Hello Engagement Database
Revolutionizing the customer experience - Hello Engagement Database
 
How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014
 
Introduction to couchbase
Introduction to couchbaseIntroduction to couchbase
Introduction to couchbase
 
How companies-use-no sql-and-couchbase-10152013
How companies-use-no sql-and-couchbase-10152013How companies-use-no sql-and-couchbase-10152013
How companies-use-no sql-and-couchbase-10152013
 
Characteristics of no sql databases
Characteristics of no sql databasesCharacteristics of no sql databases
Characteristics of no sql databases
 
How companies use NoSQL and Couchbase - NoSQL Now 2013
How companies use NoSQL and Couchbase - NoSQL Now 2013How companies use NoSQL and Couchbase - NoSQL Now 2013
How companies use NoSQL and Couchbase - NoSQL Now 2013
 
How companies use NoSQL and Couchbase
How companies use NoSQL and CouchbaseHow companies use NoSQL and Couchbase
How companies use NoSQL and Couchbase
 
Launch webinar-introducing couchbase server 2.0-01202013
Launch webinar-introducing couchbase server 2.0-01202013Launch webinar-introducing couchbase server 2.0-01202013
Launch webinar-introducing couchbase server 2.0-01202013
 
Part 2 of the webinar - Which freaking database should I use?
Part 2 of the webinar - Which freaking database should I use?Part 2 of the webinar - Which freaking database should I use?
Part 2 of the webinar - Which freaking database should I use?
 
Couchbase Server 2.0 - XDCR - Deep dive
Couchbase Server 2.0 - XDCR - Deep diveCouchbase Server 2.0 - XDCR - Deep dive
Couchbase Server 2.0 - XDCR - Deep dive
 
Couchbase Server 2.0 - Indexing and Querying - Deep dive
Couchbase Server 2.0 - Indexing and Querying - Deep diveCouchbase Server 2.0 - Indexing and Querying - Deep dive
Couchbase Server 2.0 - Indexing and Querying - Deep dive
 
Introduction to Couchbase Server 2.0
Introduction to Couchbase Server 2.0Introduction to Couchbase Server 2.0
Introduction to Couchbase Server 2.0
 
Transition from relational to NoSQL Philly DAMA Day
Transition from relational to NoSQL Philly DAMA DayTransition from relational to NoSQL Philly DAMA Day
Transition from relational to NoSQL Philly DAMA Day
 
Introduction to NoSQL and Couchbase
Introduction to NoSQL and CouchbaseIntroduction to NoSQL and Couchbase
Introduction to NoSQL and Couchbase
 
Navigating the Transition from relational to NoSQL - CloudCon Expo 2012
Navigating the Transition from relational to NoSQL - CloudCon Expo 2012Navigating the Transition from relational to NoSQL - CloudCon Expo 2012
Navigating the Transition from relational to NoSQL - CloudCon Expo 2012
 
Introduction to Couchbase Server 2.0 - CouchConf SF - Tour and Demo
Introduction to Couchbase Server 2.0 - CouchConf SF - Tour and DemoIntroduction to Couchbase Server 2.0 - CouchConf SF - Tour and Demo
Introduction to Couchbase Server 2.0 - CouchConf SF - Tour and Demo
 

Recently uploaded

Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 

Recently uploaded (20)

Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 

Couchbase Server and IBM BigInsights: One + One = Three

  • 1. Couchbase 2012 Couchbase Server and IBM BigInsights: One + One = Three Steve Beier Program Director, Big Data Applications & Solutions, IBM Dipti Borkar Director, Product Management, Couchbase © 2012 IBM Corporation
  • 2. 2 kinds of database management system OLTP   Analy+cs   2 © 2012 IBM Corporation
  • 3. 2 kinds of database management system OLTP   Analy+cs   3 © 2012 IBM Corporation
  • 4. 2 kinds of database management system OLTP   Analy+cs   4 © 2012 IBM Corporation
  • 5. 2 kinds of database management system Big  Users   Big  Data   5 © 2012 IBM Corporation
  • 6. 2 kinds of database management system Simple,  fast,  elas+c   NoSQL  database   with  sub-­‐ millisecond   performance  at   scale   Map-­‐reduce  against   huge  datasets  to   cook  up  insights   and  answers   6 © 2012 IBM Corporation
  • 7. Ad and offer targeting Ad Targeting 40  milliseconds  to   pick  the  right   offer   profiles,   raw  event  data   campaigns  /   offers,   ac:onable  insights   cooked  insights   raw  event  data   cooked  insights   7 © 2012 IBM Corporation
  • 8. Content Recommendation Targeting content 3   oriented site targeted   recommenda:ons   1   events   relational database 2   user  profiles   8 © 2012 IBM Corporation
  • 9. sqoop sqoop == sql RDBMS + hadoop • a data transfer tool for Hadoop • for moving data from non-Hadoop datasources (like relational databases, NoSQL) into/out-of Hadoop Couchbase provides Cloudera Certified sqoop connector 9 © 2012 IBM Corporation
  • 10. Ad Targeting Ad Targeting Platform Logs Logs Logs Couchbase Server Cluster Logs sqoop export Logs flume flow sqoop import Hadoop Cluster 10 © 2012 IBM Corporation
  • 11. Content Driven Site In order to keep up with changing needs on richer, more targeted content that is delivered to larger and larger audiences very quickly, Content Driven data behind content driven sites is shifting to Web Site Couchbase. Couchbase Server Cluster Original RDBMS Logs Logs Logs Logs Logs Hadoop excels at complex analytics which may involve multiple steps of processing which incorporate a number of different data sources. flume flow sqoop import sqoop export sqoop import Hadoop Cluster 11 © 2012 IBM Corporation
  • 12. Couchbase à Hadoop $ sqoop import –-connect http://couchbase-01:8091/pools -- table DUMP $ sqoop import –-connect http://couchbase-01:8091/pools -- table BACKFILL_5 12 © 2012 IBM Corporation
  • 13. Couchbase à Hadoop $ sqoop import –-connect http://couchbase-01:8091/pools -- table DUMP $ sqoop import –-connect http://couchbase-01:8091/pools -- table BACKFILL_5 For import, table must be either: •  DUMP: All items currently in Couchbase •  BACKFILL_n: All item mutations for n minutes 13 © 2012 IBM Corporation
  • 14. Hadoop à Couchbase $ sqoop export --connect http://couchbase-01:8091/pools --table REQUIRED_BUT_IGNORED -–export-dir HDFS_DIRECTORY_TO_EXPORT 14 © 2012 IBM Corporation
  • 15. sqoop Versions sqoop 1.4.2 Cloudera CDH3 •  Ubuntu 10.10 – 11.10; later versions missing package needed for CDH3 Cloudera CDH4 update 1 needed •  sqoop bug fix in Cloudera CDH4u1 required 15 © 2012 IBM Corporation
  • 16. Couchbase sqoop - Resources http://www.couchbase.com/develop/connectors/hadoop http://www.couchbase.com/docs/hadoop-plugin/ https://github.com/couchbase/couchbase-hadoop-plugin http://www.ibm.com/developerworks/opensource/library/ba-hadoop-couchbase/ba- hadoop-couchbase-pdf.pdf 16 © 2012 IBM Corporation
  • 17. Big Data platform: Bring Together a Large Volume and Variety of Data to Find New Insights T-Mobile §  Analyzing a variety of data at enormous volumes" Multi-channel customer experience analysis §  Insights on streaming data" §  Large volume structured, semi-structure and UOIT unstructured data analysis" Detect life-threatening conditions in time to intervene Vestas Predict weather patterns to plan optimal wind turbine usage Big Data Platform Dublin City Council •  Variety Optimization and monitoring of •  Velocity public transportations •  Volume Brocade Identify network security intrusions 17 © 2012 IBM Corporation © 2011 IBM Corporation
  • 18. Green Energy: Vestas Wind Systems A/S Volume §  Weather and geographic data analysis for wind turbine and wind farm site planning §  Deployed IBM Big Data to store, manage and to analyze location- specific data §  Analyzing 2.8 petabytes of public and private weather data for each geographic location §  Reduced by 97% - from weeks to hours – the modeling time for wind forecasting information 18 © 2012 IBM Corporation
  • 19. IBM Watson Demonstrated the Power of Big Data Analytics Variety Can we design a computing system that rivals a human’s ability to answer questions posed in natural language, interpreting meaning and context and retrieving, analyzing and understanding vast amounts of information in real-time? 19 © 2012 IBM Corporation
  • 20. Big Data Analytics in Smarter Hospitals Velocity Big Data enabled doctors from University of Ontario to apply neonatal infant monitoring to predict infection in ICU 24 hours in advance IBM Data Baby youtube.com 20 © 2012 IBM Corporation
  • 21. Asian telco reduces billing costs and improves customer satisfaction. Capabilities: Stream Computing Analytic Accelerators Real-time mediation and analysis of 6B CDRs per day Data processing time reduced from 12 hrs to 1 sec Hardware cost reduced to 1/8th Proactively address issues 21 (e.g. dropped calls) impacting customer © 2012 IBM Corporation 21 satisfaction.
  • 22. Telecommunications – Analyze in real time §  A Telco processing Call Detail Records 500K/sec, 6B+ IPDRs analyzed –  6 Billion CDRs per day per day on more than 4 PBs/yr. –  Deduplicating data over 7 days sustaining 1GBps. –  Processing latency reduced from 12 hours to a few seconds §  A Telco implementing a solution to access and analyze call, internet usage and texting detail records (xDRs) in real-time –  91% reduction in time to merge data –  93% reduction in storage requirements –  85% reduction in servers used §  A Telco requiring a solution to analyze up to 25M messages per second. At these volumes, in- motion analysis is the only option –  “Streams handled at least an order of magnitude more events per second on the same hardware than competitors.” (Telco’s Chief Architect) –  Even at these volumes, Streams provided near linear scalability 22 © 2012 IBM Corporation
  • 23. Big Data is an integral part of an enterprise data platform §  Manage Big Data from the instant it enters the enterprise §  High fidelity – no changes to original format §  Available for new uses, analyses, and integrations Business Analytic Applications (e.g. Cognos, SPSS) and Solutions Big Data Applications Operational Data Store Big Data Platform IBM Big Data Solutions Client and Partner Solutions Warehouse and Appliances Big Data User Environment Developers End Users Admin. Big Data Enterprise Engine Traditional data sources Streaming Internet-scale analytics analytics Govern: Source data (Web, sensors, logs, media, etc. ) Quality, Lifecycle Management, Security, Privacy 23 © 2012 IBM Corporation
  • 24. IBM’s Big Data Platform Bringing Big Data to the Enterprise Data IBM Big Data Solutions Client and Partner Solutions Warehouse InfoSphere Warehouse Warehouse Appliances Big Data User Environments Netezza Developers End Users Administrators Master Data Mgmt InfoSphere MDM INTEGRATION AGENTS Database Big Data Enterprise Engines DB2, Informix Content Analytics ECM Information Server Business Analytics Streaming Analytics Internet Scale Analytics Cognos & SPSS Marketing Open Source Foundational Components Unica Hadoop HBase Pig Lucene Jaql Hive Data Growth Management InfoSphere Optim 24 24 © 2012 IBM Corporation
  • 25. IBM Big Data Platform Tools Business Users Data Scientists Business Analysts Developers Administrators •  Determine product sentiment, intent, customer segmentation •  Execute reusable Apps to classify users, predict sales, and forecast trends •  Create spreadsheets and dashboards Analyzing big data •  Productive environment for executing analysis (cluster, rank, score with R, ML, Text) •  Create reusable analytic Apps without programming •  Dynamic open dashboard 25 © 2012 IBM Corporation
  • 26. THANK YOU sbeier@us.ibm.com dipti@couchbase.com 26 © 2012 IBM Corporation