SlideShare ist ein Scribd-Unternehmen logo
1 von 58
Downloaden Sie, um offline zu lesen
The Briefing Room
Welcome




                       Host:
                       Eric Kavanagh
                       eric.kavanagh@bloorgroup.com




Twitter Tag: #briefr                                  The Briefing Room
Mission


  !   Reveal the essential characteristics of enterprise software,
      good and bad

  !   Provide a forum for detailed analysis of today s innovative
      technologies

  !   Give vendors a chance to explain their product to savvy
      analysts

  !   Allow audience members to pose serious questions... and get
      answers!




Twitter Tag: #briefr                                   The Briefing Room
JANUARY: Big Data



 February: Analytics

 March: Open Source

 April: Intelligence



Twitter Tag: #briefr   The Briefing Room
Big Data




Twitter Tag: #briefr
                              NEW SOURCES	
                       New Insights
                           NEW	
  Challenges	
  

The Briefing Room




                                                   Copyrighted property. May not be copied or downloaded without permission from 123RF Limited.
Analyst: Robin Bloor




                         Robin Bloor is
                       Chief Analyst at
                       The Bloor Group


                          robin.bloor@bloorgroup.com




Twitter Tag: #briefr                      The Briefing Room
Teradata Aster

    !   Teradata is known for its data analytics solutions with a
        focus on integrated data warehousing, big data analytics
        and business applications

    !   It offers a broad suite of technology platforms and
        solutions; data management applications; and data mining
        capabilities

    !   Teradata Aster is its MapReduce platform to handle big data
        analytics on multi-structured data




Twitter Tag: #briefr                                   The Briefing Room
Steve Wooledge




       Steve is Senior Director
       of Product Marketing for
          Teradata Aster and
       has 10 years of industry
             experience.




Twitter Tag: #briefr              The Briefing Room
Bringing Big Data into the Light:
Teradata Big Analytics Appliance
Steve Wooledge – Sr. Director, Product Marketing, Teradata Aster

January 2013
TOPICS




WHAT IS DIFFERENT ABOUT BIG DATA ANALYTICS?

MAKING BIG ANALYTICS & DISCOVERY FAST AND EASY

TERADATA ASTER BIG ANALYTICS APPLIANCE




Confidential and proprietary. Copyright © 2012 Teradata Corporation.
10       Confidential and proprietary. Copyright © 2012 Teradata Corporation.
What is Different about
                          Big Analytics and Discovery?




Confidential and proprietary. Copyright © 2012 Teradata Corporation.
The Lytro and Big Data




12   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
“Interactive, Living Pictures”




13   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
See Your Business in High-Definition
Big Analytics & Discovery Unlocks Hidden Value


                                                                Classic BI
                                                  Structured & Repeatable Analysis




 Business determines what                                                                IT structures the data to
      questions to ask                                                                    answer those questions

                                                                                         “Capture only what’s
                                                                                              needed”




  IT delivers a platform for                          Big Data Analytics
    storing, refining, and                      Multi-structured & Iterative Analysis   Business explores data for
 analyzing all data sources                                                             questions worth answering

     “Capture in case it’s
          needed”
14     Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Iterative Analytics Accelerates Discovery


                                                                              Analytical
                                                                                Idea




     Operational DB
        or EDW
                                                     Operationalize
                                                      or Move On
                                                                          5x                   Zero-ETL Data
                                                                                              Load/Integration


                                                                          Faster
                                                                 Discovery Process
                                                                    with Aster -
                                                              Evaluate vs. Days
                                                                  Hours
                                                                Results                SQL and non-SQL
                                                                                           Analysis



15     Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Need for a Unified Data Architecture for New Insights
Enabling Any User for Any Data Type from Data Capture to Analysis




                    Java, C/C++, Python, R, SAS, SQL, Excel, BI, Visualization


                                                                              Reporting and Execution
               Discover and Explore
                                                                                 in the Enterprise


                                            Capture, Store and Refine


     Audio/                                                          Web &       Machine
                   Images             Docs            Text                                 CRM   SCM   ERP
     Video                                                           Social       Logs



16    Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Big Data Comes with BIG HEADACHES


     “     Even free software like Hadoop is causing
     companies to spend more money…Many CIOs believe
     data is inexpensive because storage has become
     inexpensive. But data is inherently messy—it can be
     wrong, it can be duplicative, and it can be irrelevant—
     which means it requires handling, which is where the
     real expenses come in.

                                             ”
       “        Through 2015, 85% of Fortune 500 organizations will
        be unable to exploit big data for competitive advantage.

Source: The Wall Street Journal. “CIOs’ Big Problem with Big Data”. Aug 2012
Source: Gartner. “Information Innovation: Innovation Key Initiative Overview”. April 2012
                                                                                            ”
17       Confidential and proprietary. Copyright © 2013 Teradata Corporation.
UNIFIED DATA ARCHITECTURE
                   Data Scientists                 Quants                Customers / Partners        Front-Line Workers
                         Engineers          Business Analysts                  Executives            Operational Systems




                        LANGUAGES         MATH & STATS         DATA MINING        BUSINESS INTELLIGENCE   APPLICATIONS




     Big Data Analytics                    DISCOVERY                                          INTEGRATED
                                           PLATFORM                                         DATA WAREHOUSE




                                                                                                    Big Data Management
                                                                CAPTURE | STORE | REFINE




        AUDIO & VIDEO        IMAGES             TEXT          WEB & SOCIAL      MACHINE LOGS       CRM           SCM       ERP




18   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
TERADATA UNIFIED DATA ARCHITECTURE
                   Data Scientists                 Quants                Customers / Partners        Front-Line Workers
                         Engineers          Business Analysts                  Executives            Operational Systems




                        LANGUAGES         MATH & STATS         DATA MINING        BUSINESS INTELLIGENCE   APPLICATIONS




                                           DISCOVERY                                          INTEGRATED
                                           PLATFORM                                         DATA WAREHOUSE




                                                                CAPTURE | STORE | REFINE




        AUDIO & VIDEO        IMAGES             TEXT          WEB & SOCIAL      MACHINE LOGS       CRM           SCM       ERP




19   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
TERADATA UNIFIED DATA ARCHITECTURE
                     Data Scientists                Quants                Customers / Partners        Front-Line Workers
                           Engineers          Business Analysts                 Executives            Operational Systems




     VIEWPOINT            LANGUAGES         MATH & STATS        DATA MINING        BUSINESS INTELLIGENCE      APPLICATIONS      SUPPORT




                                            DISCOVERY              Aster Teradata              INTEGRATED
                                            PLATFORM                 Connector               DATA WAREHOUSE




                     Aster Connector for                 SQL-H                              SQL-H          Teradata Connector
                           Hadoop                                                                              for Hadoop


             Aster Loader                                                                                           Teradata Loader
                                                                 CAPTURE | STORE | REFINE




20     ConfidentialVIDEOproprietary. Copyright © 2013 Teradata Corporation.
          AUDIO & and           IMAGES             TEXT         WEB & SOCIAL     MACHINE LOGS       CRM             SCM         ERP
Shift from a Single Platform to an Ecosystem




      “Big Data requirements are solved by a range of
      platforms including analytical databases, discovery
      platforms and NoSQL solutions beyond Hadoop.”

     Source: “Big Data Comes of Age”. EMA and 9sight Consulting. Nov 2012.

21      Confidential and proprietary. Copyright © 2013 Teradata Corporation.
How Does Big Analytics and Discovery
                Add Business Value?




Confidential and proprietary. Copyright © 2012 Teradata Corporation.
Customer Behavior Analysis

                 BI Tools                                     Database Tools                  Monitoring Tools




   EMAIL                                      ONLINE             STORE VISION
                     BRANCH                                                     CALL CENTER    CUSTOMER       CUSTOMER
CORRESPOND-                                  BANKING               PLATFORM
                   TELLER DATA                                                     DATA       PROFILE DATA   SURVEY DATA
 ENCE DATA                                     DATA                  DATA



 23   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Events Preceding Account Closure




24   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Events Preceding Account Closure




SELECT * FROM npath (
  ON (
     SELECT …
     WHERE u.event_description IN (
       SELECT aper.event FROM attrition_paths_event_rank aper
                                                                               Interactive Analytics
       ORDER BY aper.count DESC LIMIT 10)

  …
     )
                                                                                Reducing the “Noise”
                                                                                 to find the “Signal”
  PATTERN ('(OTHER|EVENT){1,20}$')
  SYMBOLS (…) RESULT (…)
  )
) n;




25      Confidential and proprietary. Copyright © 2013 Teradata Corporation.
How Do We Make
              Big Analytics & Discovery Possible?




Confidential and proprietary. Copyright © 2012 Teradata Corporation.
Key Requirements of a Discovery Platform



                   Highly Efficient & Performant Big Data Platform
        1          That Allows Quick Iterations


                   Hybrid Capabilities that supports SQL, statistics,
        2          and new MapReduce analytics


                   Significant Out-of-the-Box Analytical Functions
        3          that Minimize Development



        Democratize Big Data & Maximize Enterprise Adoption

27   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Teradata Aster Big Analytics Appliance
First Deeply Integrated SQL, MapReduce and Hadoop Appliance
                                                           UNIQUE FEATURES

 1.  Integrated, modular Aster Database and 100% Open-Source
     Hortonworks HDP
 2.  First and only ANSI SQL & HCatalog integration via SQL-H™
 3.  Industry’s only ANSI-standard SQL & MapReduce integration
     via SQL-MapReduce®
 4.  Industry’s most manageable & supportable Apache Hadoop
     appliance via Teradata Viewpoint™ & TVI™
 5.  Most complete MapReduce App Portfolio with 70+ pre-built
     MapReduce functions
 6.  Fully engineered and supported by Teradata, with Level-4
     support by Hortonworks world-class Hadoop team

     Benefits
     •  Leverage existing investments in standard BI, ETL tools & people with SQL skills
     •  Industry’s highest performance platform for Big Analytics
     •  Lowest TCO (technology + people), highest ROI, and fastest time to value


28     Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Teradata Aster Analytics Portfolio
The App Store of Big Data


                   PATH ANALYSIS                                            TEXT ANALYSIS
                   Discover Patterns in Rows of                             Derive Patterns and Extract
                   Sequential Data                                          Features in Textual Data




                   STATISTICAL ANALYSIS
                   High-Performance Processing                              SEGMENTATION
                   of Common Statistical                                    Discover Natural Groupings
                   Calculations                                             of Data Points




                   MARKETING                                                DATA
                   ANALYTICS                                                TRANSFORMATION
                   Analyze Customer
                                                                            Transform Data for More
                   Interactions to Optimize
                                                                            Advanced Analysis
                   Marketing Decisions


29   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Unified Big Data Analytics Architecture
Integrated Analytics and Navigation
                                                                            BI Tools, SQL, ETL


                                          BIG ANALYTICS
                                                                             TERADATA IDW
                                            APPLIANCE
                                     Unified Big Analytics
                                         Architecture

                                     Behavior          Sentiments


Multi-
                                      Discovery
Structured                            Platform
Data

                                                         Facebook
Unstructured
                                                          Twitter
Data
                                                          Pinterest

                                                           Social
                                     Revenue
                                                           Media

                                            Iterative
                                                                            Operationalized      Best Decision
                                          Information
                                                                              Analytics            Possible
                                           Discovery


30   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Teradata Aster Big Analytics Appliance
Solution Value Add

       NEW                                                                                          •  Single vendor for lowest TCO
                                          SQL BI             Analytic                      Hadoop
                                                                                                    •  Common system management tools
                                           Tools             SQL Apps                       Tools
     Troubleshooting, and Support
        Common Management,




                                             Aster MapReduce                                        •  Supports standard BI and ETL tools
                                           Portfolio of Functions                                   •  Use Hadoop tools like Hive and Pig
                                                                                      Hive, Pig,
                                                         SQL-
                                          SQL          MapReduce                          …
                                                                                                    •  Analytics Library w/ 70+ functions
                                    NEW             SQL-H                                           •  SQL interface to MapReduce and
                                                                                                       Hadoop

                                                                                                    •  Pre-tuned HDFS and MapReduce
                                                                                                       parameters for Big Data workloads
                                           Aster Database                                           •  Store and manage data in Apache
                                                                                                       Hadoop or Aster Database

                                     NEW InfiniBand (40 GB/s) Interconnect Fabric                   •  Processing, storage, and networking
                                                                                                       designed for Big Data workloads
                                                   Big Analytics Appliance Hardware                 •  40 GB/s InfiniBand network
                                            NEW



31                  Confidential and proprietary. Copyright © 2013 Teradata Corporation.
ESG Benchmark Report Summary
Third Party Validation of Aster and Hadoop “Fit”
                       Scope
                       •  Identical hardware for Aster and Hadoop
                       •  Clickstream, sentiment, and traditional retail data
                       •  Compare “time to insight” and “time to develop”



                                                                  RESULTS
 Discovery Process:                        Analytics:                   Development:          Loading:         Transforms:
        Aster                      Aster 35x Faster                             Aster         Hadoop            Hadoop
      5x Faster                    (range: 4–416x)                            3x Faster     1.8x Faster       1.3x Faster



     Hadoop MapReduce                                               32 Hours                   Aster 5x Faster
                                                                                               Discovery Cycle-Time
Aster SQL-MapReduce                             6 Hours                                         (Development + Execution Time)



                                   FULL REPORT AVAILABLE AT                   www.asterdata.com/esg

32     Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Comparing Advanced Analytic Development
and Execution
Example: Determine Spikes In Hourly Pageviews
                    Apache Hadoop                                                                 Teradata Aster
  “By using SQL-MapReduce,                                                        “This is also why the execution
   Asterand find allMRfewerpageviews/hr to
   1      takespages <100 steps pagename
      •  Write Java     job to group records by
                                                                                   time•  in Aster nPath is SQL as regular expressions
                                                                                   1   •  Input parameters in much faster.”
                                                                                           Use
                                                                                               Aster
   developbyanalytics” hour fields
      •  Sort    the yy/mm/dd and
                                                                                           •  Single Pass of the data
             •  Java reduce phase to place all same-keyed
                records into temporary arrays                                              •  SQL handles group-by, counts, sorts
      2                                                                          Execute
                                                                                           •  MapReduce perform regular pattern matching
             •  Compute counts for low/high/low hourly page
                views                                                                         over a sequence of rows

  “Rather than using MapReduce “Map•  or Reduce relational table data
  3+
        •  Create custom partitioner
                                                             3
                                                                    Outputs written to requires
        •  Create custom grouping comparator
   processing for each step in the shuffling andtools to visualize results
        •  Create custom key comparator
                                                                 •  Use SQL or BI
                                                                                    produces higher
   analysis, SQL is used in place                            latency than SQL”
Execute •  Execute each Mapper and Reducer
   of a•  Map (or Reduce) phase
           Multiple passes of data

   and•  MapReduce is used only in
           Save output to flat files making it unstructured,
   stepsDB interfaces (e.g. ODBC/JDBC)
   5
            that cannot be
        •  No relational semantics and preventing use of

   expressed in with other tools (e.g., SSH/FTP)
        •  Retrieve results SQL.”


            Development Time: 4 hours
Source: Enterprise Strategy Group, Lab Validation Report, September 2012
                                                                                    Development Time: 1 hour (4x faster)
           Execution Time: 149 seconds                                             Execution Time: 3 seconds (50x faster)


 33       Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Teradata Aster Big Analytics Appliance—
                Key Innovations




Confidential and proprietary. Copyright © 2012 Teradata Corporation.
Aster SQL-H™
A Business User’s Bridge to Analyze Hadoop Data

                                                                                                                              NEW
Aster SQL-H Gives Analysts and
Data Scientists a Better Way to                                                                     Aster: SQL-H
Analyze Data Stored in Hadoop
•  Allow standard ANSI SQL access to                                                                  Hadoop
   Hadoop data                                                                                          MR


•  Leverage existing BI tool and enable




                                                                                   Data Filtering
   self service




                                                                            Data
                                                                                                        Hive       HCatalog

•  Enable 50+ prebuilt SQL-MapReduce
   Apps and IDE
                                                                                                        Pig




                                                                                   Hadoop Layer: HDFS




35   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Hortonworks Data Platform
Enterprise-Ready Hadoop

                                            The ONLY 100% open source
                                            data platform for Hadoop

       •  Tightly aligned with core Apache code lines
       •  All code committed back to open source
       •  Engineered integration with Teradata Viewpoint and Ambari
       •  HCatalog - centralized metadata services for easy data sharing
       •  Dependable full stack high availability
       •  Capacity scheduler for better multi-tenancy
       •  Intuitive graphical data integration tools

36   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Teradata Viewpoint Integration
Easier, Faster, and Better System Management

Common Management Console for
Aster, Teradata and Apache Hadoop

Aster-Specific                       Query Portlets
Portlets                             •  Query Monitor
•  Aster Node
   Monitoring                        Admin Portlets
•  Aster Completed                   •  Teradata System
   Processes                         •  Roles Manager

Trend/                               Other Portlets
                                     •  System Health
Visualization                        •  Canary queries
Portlets                             •  Aster Alerting
•  Capacity
   Heat Map
•  Metrics Graph
•  Metrics Analysis




37    Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Teradata Vital Infrastructure (TVI)
Integrated hardware & software solution for systems
  management
     PROACTIVE RELIABILITY, AVAILABILITY, AND MANAGEABILITY

1U server virtualizes system and cabinet management software
Server Management VMS
•  Cabinet Management Interface Controller (CMIC)
•  Service Work Station (SWS)
•  Automatically installed on base/first cabinet

VMS allows full                    Eliminates need                          Supports       TVI Support for
rack solutions                     for expansion                            Teradata       Aster and
without additional                 racks, reducing                          hardware and   Hadoop
cabinet for                        customers’ floor                         Aster/Hadoop
traditional SWS                    space & energy                           software
                                   costs




 62–70%                                   of Incidents Discovered through TVI

38   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
How Can You Get Started?
                                   Aster Express




Confidential and proprietary. Copyright © 2012 Teradata Corporation.
Making it easy to try Aster Big Analytics Solutions
Aster Express, Aster Live, Aster Big Analytics Appliance




                                                                            Aster Live
Aster Express                                                                                Aster
                                                                                         Big Analytics
                                                                                          Appliance




40   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Aster Express Tutorials Make it Easy to Start
www.asterdata.com/asterexpress




41   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Teradata Aster Big Analytics Appliance Summary
Bring Big Data to Life with Big Analytics & Discovery


INDUSTRY’S FIRST UNIFIED BIG ANALYTICS APPLIANCE


UNIFIED INTERFACES FOR ITERATIVE SQL AND MAPREDUCE ANALYTICS


TERADATA-TRUSTED RELIABILITY, AVAILABILITY & MANAGEABILITY


EASY TO DEPLOY, MANAGE & USE




      Get Started Now! asterdata.com/AsterExpress


42   Confidential and proprietary. Copyright © 2013 Teradata Corporation.
When to Use Which?
The best approach by workload and data type
Processing as a Function of Schema Requirements and Stage of Data Pipeline

                                                                    “Simple math
                                               Data Pre-
                       Low Cost                                        at scale”         Joins,       Analytics
                                              Processing,
                     Storage and                                     (Score, filter,    Unions,     (Iterative and   Reporting
                                               Refining,
                     Fast Loading                                     sort, avg.,      Aggregates    data mining)
                                               Cleansing
                                                                       count...)


                                                 Financial Analysis, Ad-Hoc/OLAP
       Stable          Teradata/                Enterprise-Wide BITeradata
                                                Teradata  Teradata
                                                                     and Reporting
                                                                             Teradata                                 Teradata
      Schema            Hadoop
                                                         Spatial/Temporal
                                                         Active Execution

                                                 Interactive Data Discovery                                            Aster
  Evolving                                  Aster /     Aster /                                                       (SQL +
   Schema
                         Hadoop           Web Clickstream, Set-Top Box Analysis
                                            Hadoop     Hadoop     Aster    Aster                                       Aster
                                                                                                                     MapReduce
                                                   CDRs, Sensor Logs, JSON                                           Analytics)



                                             Social Feeds, Text, Image Processing                                      Aster
   Format,
No Schema
                        Hadoop
                        Hadoop                 Audio/Video Storage and Refining
                                               Hadoop
                                               Hadoop   Hadoop     Aster
                                                                   Aster    Aster                                       Aster
                                                                                                                     (MapReduce
                                                                                                                      Analytics)
                                              Storage and Batch Transformations

 44      Confidential and proprietary. Copyright © 2013 Teradata Corporation.
When to Use Which?
The best approach by workload and data type
Processing as a Function of Schema Requirements and Stage of Data Pipeline

                                                                    “Simple math
                                               Data Pre-
                       Low Cost                                        at scale”         Joins,       Analytics
                                              Processing,
                     Storage and                                     (Score, filter,    Unions,     (Iterative and   Reporting
                                               Refining,
                     Fast Loading                                     sort, avg.,      Aggregates    data mining)
                                               Cleansing
                                                                       count...)




       Stable          Teradata/
                        Hadoop                  Teradata                Teradata       Teradata       Teradata        Teradata
      Schema




                                                                                                                       Aster
  Evolving                                       Aster /                 Aster /                                      (SQL +
                         Hadoop                  Hadoop                  Hadoop          Aster          Aster          Aster
   Schema                                                                                                            MapReduce
                                                                                                                     Analytics)




                                                                                                                       Aster
   Format,
                        Hadoop
                        Hadoop                  Hadoop
                                                Hadoop                   Hadoop          Aster
                                                                                         Aster         Aster            Aster
                                                                                                                     (MapReduce
No Schema                                                                                                             Analytics)




 45      Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Ease of Development and Reuse
Analytic Foundation : 70+ out-of-the-box modules
Modules                                          Business-ready SQL-MapReduce Functions
                                                 •  nPath: complex sequential analysis for time series analysis and
                                                    behavioral pattern analysis
Path Analysis                                    •  Sessionization: identifies sessions from time series data in a single
Discover patterns in rows of                        pass over the data
sequential data                                  •  Attribution: operator to help ad networks and websites to distribute
                                                    “credit”

                                                 •  Histogram: function to provide capability of generating
                                                 •  Decision Trees: Native implementation of parallel random forests.
                                                 •  Approximate percentiles and distinct counts: calculate
Statistical                                         percentiles and counts within specific variance

Analysis                                         •  Correlation: calculation that characterizes the strength of the
                                                    relation between different columns
High-performance processing of                   •  Regression: performs linear or logistic regression between an output
common statistical calculations                     variable and a set of input variables
                                                 •  Averages: calculate moving, weighted, exponential or volume-
                                                    weighted averages over a window of data


Relational                                       •  Graph analysis: finds shortest path from a distinct node to all other

Analysis                                            nodes in a graph
                                                 •  Tokenization: splits strings into individual words to assist text
Discover important relationships                    processing
among Confidential and proprietary. Copyright © 2013 Teradata Corporation.
46     data
Ease of Development and Reuse
Analytic Foundation : 50+ out-of-the-box modules
Modules                                           SQL-MapReduce Analytic Functions

                                                  •  Text Processing: counts occurrences of words, identifies roots, &
Text Analysis                                        tracks relative positions of words & multi-word phrases
                                                  •  Text Partition: analyzes text data over multiple rows
Derive patterns in textual data
                                                  •  Levenshtein Distance: computes the distance between two words


                                                  •  k-Means: clusters data into a specified number of groupings
                                                  •  Canopy: partitions data into overlapping subsets within which k-
Cluster                                              means is performed

Analysis                                          •  Minhash: buckets highly-dimensional items for cluster analysis
                                                  •  Basket analysis: creates configurable groupings of related items
Discover natural groupings of data                   from transaction records in single pass
points
                                                  •  Collaborative Filter: predicts the interests of a user by collecting
                                                     interest information from many users


Data                                              •  Unpack: extracts nested data for further analysis
                                                  •  Pack: compress multi-column data into a single column
Transformation                                    •  Antiselect: returns all columns except for specified column
Transform data for more advanced                  •  Multicase: case statement that supports row match for multiple
analysis                                             cases

47    Confidential and proprietary. Copyright © 2013 Teradata Corporation.
Perceptions & Questions




                       Analyst:
                       Robin Bloor


Twitter Tag: #briefr                 The Briefing Room
The Bloor Group
Big Data Is About Analytics
       DATA AIN’T WHAT IT USED TO BE
             Machine generated data (logs)
                       Web data
                  Social media data
                 Public data services
                   Supply chain data
                 Real-time data flows




        THE ANALOGY OF STRIP-MINING IS
     RELEVANT BECAUSE THE SCALE OF DATA
    ANALYTICS HAS EXPANDED DRAMATICALLY

                                             The Bloor Group
The Data Analytics Issue




                           The Bloor Group
What Hadoop Is NOT
     A MULTIUSER HIGHLY
        TUNED ENGINE

        AN ANALYTICS
         PLATFORM

         A SOLUTION

    But it IS:
      A USEFUL, FLEXIBLE
     AND VERY ECONOMIC
      DATA STORE – WITH
            PLUG-INS       The Bloor Group
About Data Analytics
  It is all about TIME TO INSIGHT – as long as that
                 is followed by action

     Fast time to insight requires FLEXIBLE
  management of high performance data flows -
       for the benefit of the data analyst

       The data analyst needs to be able to
               MARSHAL the data

   Then maybe, just maybe, he will deserve the
            title of DATA SCIENTIST
                                          The Bloor Group
Clearly the Teradata Aster Big Analytics Appliance is a
powerful data flow engine, so:

   !   How does Aster Data achieve its performance
     lift with MapReduce?

   !   How is it most usually deployed?

   !   Can it do data cleansing in flight?

   !   Can it perform analytic tasks?



                                             The Bloor Group
!   Why an appliance? What is gained and what is
  sacrificed?

!   Which sectors/businesses do you expect to be
  able to make best use of this technology?

!   Which companies/products do you regard as
  competitors (either direct or near)?

!   Which companies/products do you partner with?

!   How does the appliance fit in the cloud?


                                          The Bloor Group
Twitter Tag: #briefr   The Briefing Room
Upcoming Topics



   This month: Big Data

   February: Analytics

   March: Open Source

   April: Intelligence
   www.insideanalysis.com




Twitter Tag: #briefr        The Briefing Room
Thank You
                        for Your
                       Attention


Twitter Tag: #briefr               The Briefing Room

Weitere ähnliche Inhalte

Was ist angesagt?

No Time-Outs: How to Empower Round-the-Clock Analytics
No Time-Outs: How to Empower Round-the-Clock AnalyticsNo Time-Outs: How to Empower Round-the-Clock Analytics
No Time-Outs: How to Empower Round-the-Clock AnalyticsInside Analysis
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)Ajay Ohri
 
Implementing Big Data at the Speed of Business
Implementing Big Data at the Speed of BusinessImplementing Big Data at the Speed of Business
Implementing Big Data at the Speed of BusinessDataWorks Summit
 
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopPowering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopHortonworks
 
HPE IDOL 10 (Intelligent Data Operating Layer)
HPE IDOL 10 (Intelligent Data Operating Layer)HPE IDOL 10 (Intelligent Data Operating Layer)
HPE IDOL 10 (Intelligent Data Operating Layer)Andrey Karpov
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Calpont Corporation
 
How the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI drivenHow the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI drivenSteven Nooijen
 
IP&A109 Next-Generation Analytics Architecture for the Year 2020
IP&A109 Next-Generation Analytics Architecture for the Year 2020IP&A109 Next-Generation Analytics Architecture for the Year 2020
IP&A109 Next-Generation Analytics Architecture for the Year 2020Anjan Roy, PMP
 
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data European Data Forum
 
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your businessAcunu
 
Big data meets big analytics
Big data meets big analyticsBig data meets big analytics
Big data meets big analyticsDeepak Ramanathan
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forumbigdatawf
 
A New Day for Oracle Analytics
A New Day for Oracle AnalyticsA New Day for Oracle Analytics
A New Day for Oracle AnalyticsRich Clayton
 
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Harald Erb
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...Jürgen Ambrosi
 
Big data ibm keynote d advani presentation
Big data ibm keynote d advani presentationBig data ibm keynote d advani presentation
Big data ibm keynote d advani presentationMassTLC
 
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingCodemotion
 
Semantic AI Cortex EIP Overview
Semantic AI Cortex EIP OverviewSemantic AI Cortex EIP Overview
Semantic AI Cortex EIP OverviewSemantic AI
 
Inteligencia artificial - Quebrando el paradigma de la amnesia empresarial
Inteligencia artificial - Quebrando el paradigma de la amnesia empresarialInteligencia artificial - Quebrando el paradigma de la amnesia empresarial
Inteligencia artificial - Quebrando el paradigma de la amnesia empresarialMarcos Quezada
 

Was ist angesagt? (20)

No Time-Outs: How to Empower Round-the-Clock Analytics
No Time-Outs: How to Empower Round-the-Clock AnalyticsNo Time-Outs: How to Empower Round-the-Clock Analytics
No Time-Outs: How to Empower Round-the-Clock Analytics
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
 
Implementing Big Data at the Speed of Business
Implementing Big Data at the Speed of BusinessImplementing Big Data at the Speed of Business
Implementing Big Data at the Speed of Business
 
Powering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache HadoopPowering Next Generation Data Architecture With Apache Hadoop
Powering Next Generation Data Architecture With Apache Hadoop
 
HPE IDOL 10 (Intelligent Data Operating Layer)
HPE IDOL 10 (Intelligent Data Operating Layer)HPE IDOL 10 (Intelligent Data Operating Layer)
HPE IDOL 10 (Intelligent Data Operating Layer)
 
BI outsourcing and emerging trends 2012
BI outsourcing and emerging trends 2012BI outsourcing and emerging trends 2012
BI outsourcing and emerging trends 2012
 
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012Analytic Platforms in the Real World with 451Research and Calpont_July 2012
Analytic Platforms in the Real World with 451Research and Calpont_July 2012
 
How the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI drivenHow the Analytics Translator can make your organisation more AI driven
How the Analytics Translator can make your organisation more AI driven
 
IP&A109 Next-Generation Analytics Architecture for the Year 2020
IP&A109 Next-Generation Analytics Architecture for the Year 2020IP&A109 Next-Generation Analytics Architecture for the Year 2020
IP&A109 Next-Generation Analytics Architecture for the Year 2020
 
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
 
Exploring Big Data value for your business
Exploring Big Data value for your businessExploring Big Data value for your business
Exploring Big Data value for your business
 
Big data meets big analytics
Big data meets big analyticsBig data meets big analytics
Big data meets big analytics
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forum
 
A New Day for Oracle Analytics
A New Day for Oracle AnalyticsA New Day for Oracle Analytics
A New Day for Oracle Analytics
 
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
 
Big data ibm keynote d advani presentation
Big data ibm keynote d advani presentationBig data ibm keynote d advani presentation
Big data ibm keynote d advani presentation
 
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision MakingFast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
Fast Data Mining: Real Time Knowledge Discovery for Predictive Decision Making
 
Semantic AI Cortex EIP Overview
Semantic AI Cortex EIP OverviewSemantic AI Cortex EIP Overview
Semantic AI Cortex EIP Overview
 
Inteligencia artificial - Quebrando el paradigma de la amnesia empresarial
Inteligencia artificial - Quebrando el paradigma de la amnesia empresarialInteligencia artificial - Quebrando el paradigma de la amnesia empresarial
Inteligencia artificial - Quebrando el paradigma de la amnesia empresarial
 

Ähnlich wie Left Brain, Right Brain: How to Unify Enterprise Analytics

Simplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the BusinessSimplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the BusinessTeradata Aster
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureInside Analysis
 
Exploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & FutureExploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & FutureAgilisium Consulting
 
The New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front LinesThe New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front LinesInside Analysis
 
Teradata Big Data London Seminar
Teradata Big Data London SeminarTeradata Big Data London Seminar
Teradata Big Data London SeminarHortonworks
 
Innovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle RInnovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle RCapgemini
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyArcadia Data
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureCaserta
 
Unified big data architecture
Unified big data architectureUnified big data architecture
Unified big data architectureDataWorks Summit
 
When Worlds Collide: Intelligence, Analytics and Operations
When Worlds Collide: Intelligence, Analytics and OperationsWhen Worlds Collide: Intelligence, Analytics and Operations
When Worlds Collide: Intelligence, Analytics and OperationsInside Analysis
 
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data FunnelA Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data FunnelInside Analysis
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta
 
What Data Do You Have and Where is It?
What Data Do You Have and Where is It? What Data Do You Have and Where is It?
What Data Do You Have and Where is It? Caserta
 
Big data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunilBig data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunilSujit Ghosh
 
Empowering the Business with Agile Analytics
Empowering the Business with Agile AnalyticsEmpowering the Business with Agile Analytics
Empowering the Business with Agile AnalyticsInside Analysis
 
Connecting the Dots with Data Mashups
Connecting the Dots with Data MashupsConnecting the Dots with Data Mashups
Connecting the Dots with Data MashupsInside Analysis
 
Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big DataJean-Marc Desvaux
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeCaserta
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)mark madsen
 

Ähnlich wie Left Brain, Right Brain: How to Unify Enterprise Analytics (20)

Simplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the BusinessSimplifying Big Data Analytics for the Business
Simplifying Big Data Analytics for the Business
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information Architecture
 
Exploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & FutureExploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & Future
 
The New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front LinesThe New Normal: Predictive Power on the Front Lines
The New Normal: Predictive Power on the Front Lines
 
Teradata Big Data London Seminar
Teradata Big Data London SeminarTeradata Big Data London Seminar
Teradata Big Data London Seminar
 
Innovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle RInnovate Analytics with Oracle Data Mining & Oracle R
Innovate Analytics with Oracle Data Mining & Oracle R
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics Strategy
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic Architecture
 
Unified big data architecture
Unified big data architectureUnified big data architecture
Unified big data architecture
 
When Worlds Collide: Intelligence, Analytics and Operations
When Worlds Collide: Intelligence, Analytics and OperationsWhen Worlds Collide: Intelligence, Analytics and Operations
When Worlds Collide: Intelligence, Analytics and Operations
 
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data FunnelA Strategic View of Enterprise Reporting and Analytics: The Data Funnel
A Strategic View of Enterprise Reporting and Analytics: The Data Funnel
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
 
What Data Do You Have and Where is It?
What Data Do You Have and Where is It? What Data Do You Have and Where is It?
What Data Do You Have and Where is It?
 
Informatics technologies in an evolving r & d landscape
Informatics technologies in an evolving r & d landscapeInformatics technologies in an evolving r & d landscape
Informatics technologies in an evolving r & d landscape
 
Big data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunilBig data webinar may23 nrit by sunil
Big data webinar may23 nrit by sunil
 
Empowering the Business with Agile Analytics
Empowering the Business with Agile AnalyticsEmpowering the Business with Agile Analytics
Empowering the Business with Agile Analytics
 
Connecting the Dots with Data Mashups
Connecting the Dots with Data MashupsConnecting the Dots with Data Mashups
Connecting the Dots with Data Mashups
 
Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big Data
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 

Mehr von Inside Analysis

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIInside Analysis
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationInside Analysis
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownInside Analysis
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security Inside Analysis
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeInside Analysis
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataInside Analysis
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionInside Analysis
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsInside Analysis
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingInside Analysis
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLInside Analysis
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelInside Analysis
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureInside Analysis
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskInside Analysis
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataInside Analysis
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseInside Analysis
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldInside Analysis
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave DuggalInside Analysis
 

Mehr von Inside Analysis (20)

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On Time
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop Adoption
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of Everything
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global Level
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your Architecture
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the Risk
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big Data
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data Warehouse
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave Duggal
 
Modus Operandi
Modus OperandiModus Operandi
Modus Operandi
 

Kürzlich hochgeladen

2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 

Kürzlich hochgeladen (20)

2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 

Left Brain, Right Brain: How to Unify Enterprise Analytics

  • 2. Welcome Host: Eric Kavanagh eric.kavanagh@bloorgroup.com Twitter Tag: #briefr The Briefing Room
  • 3. Mission !   Reveal the essential characteristics of enterprise software, good and bad !   Provide a forum for detailed analysis of today s innovative technologies !   Give vendors a chance to explain their product to savvy analysts !   Allow audience members to pose serious questions... and get answers! Twitter Tag: #briefr The Briefing Room
  • 4. JANUARY: Big Data February: Analytics March: Open Source April: Intelligence Twitter Tag: #briefr The Briefing Room
  • 5. Big Data Twitter Tag: #briefr NEW SOURCES New Insights NEW  Challenges   The Briefing Room Copyrighted property. May not be copied or downloaded without permission from 123RF Limited.
  • 6. Analyst: Robin Bloor  Robin Bloor is Chief Analyst at The Bloor Group robin.bloor@bloorgroup.com Twitter Tag: #briefr The Briefing Room
  • 7. Teradata Aster !   Teradata is known for its data analytics solutions with a focus on integrated data warehousing, big data analytics and business applications !   It offers a broad suite of technology platforms and solutions; data management applications; and data mining capabilities !   Teradata Aster is its MapReduce platform to handle big data analytics on multi-structured data Twitter Tag: #briefr The Briefing Room
  • 8. Steve Wooledge Steve is Senior Director of Product Marketing for Teradata Aster and has 10 years of industry experience. Twitter Tag: #briefr The Briefing Room
  • 9. Bringing Big Data into the Light: Teradata Big Analytics Appliance Steve Wooledge – Sr. Director, Product Marketing, Teradata Aster January 2013
  • 10. TOPICS WHAT IS DIFFERENT ABOUT BIG DATA ANALYTICS? MAKING BIG ANALYTICS & DISCOVERY FAST AND EASY TERADATA ASTER BIG ANALYTICS APPLIANCE Confidential and proprietary. Copyright © 2012 Teradata Corporation. 10 Confidential and proprietary. Copyright © 2012 Teradata Corporation.
  • 11. What is Different about Big Analytics and Discovery? Confidential and proprietary. Copyright © 2012 Teradata Corporation.
  • 12. The Lytro and Big Data 12 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 13. “Interactive, Living Pictures” 13 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 14. See Your Business in High-Definition Big Analytics & Discovery Unlocks Hidden Value Classic BI Structured & Repeatable Analysis Business determines what IT structures the data to questions to ask answer those questions “Capture only what’s needed” IT delivers a platform for Big Data Analytics storing, refining, and Multi-structured & Iterative Analysis Business explores data for analyzing all data sources questions worth answering “Capture in case it’s needed” 14 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 15. Iterative Analytics Accelerates Discovery Analytical Idea Operational DB or EDW Operationalize or Move On 5x Zero-ETL Data Load/Integration Faster Discovery Process with Aster - Evaluate vs. Days Hours Results SQL and non-SQL Analysis 15 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 16. Need for a Unified Data Architecture for New Insights Enabling Any User for Any Data Type from Data Capture to Analysis Java, C/C++, Python, R, SAS, SQL, Excel, BI, Visualization Reporting and Execution Discover and Explore in the Enterprise Capture, Store and Refine Audio/ Web & Machine Images Docs Text CRM SCM ERP Video Social Logs 16 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 17. Big Data Comes with BIG HEADACHES “ Even free software like Hadoop is causing companies to spend more money…Many CIOs believe data is inexpensive because storage has become inexpensive. But data is inherently messy—it can be wrong, it can be duplicative, and it can be irrelevant— which means it requires handling, which is where the real expenses come in. ” “ Through 2015, 85% of Fortune 500 organizations will be unable to exploit big data for competitive advantage. Source: The Wall Street Journal. “CIOs’ Big Problem with Big Data”. Aug 2012 Source: Gartner. “Information Innovation: Innovation Key Initiative Overview”. April 2012 ” 17 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 18. UNIFIED DATA ARCHITECTURE Data Scientists Quants Customers / Partners Front-Line Workers Engineers Business Analysts Executives Operational Systems LANGUAGES MATH & STATS DATA MINING BUSINESS INTELLIGENCE APPLICATIONS Big Data Analytics DISCOVERY INTEGRATED PLATFORM DATA WAREHOUSE Big Data Management CAPTURE | STORE | REFINE AUDIO & VIDEO IMAGES TEXT WEB & SOCIAL MACHINE LOGS CRM SCM ERP 18 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 19. TERADATA UNIFIED DATA ARCHITECTURE Data Scientists Quants Customers / Partners Front-Line Workers Engineers Business Analysts Executives Operational Systems LANGUAGES MATH & STATS DATA MINING BUSINESS INTELLIGENCE APPLICATIONS DISCOVERY INTEGRATED PLATFORM DATA WAREHOUSE CAPTURE | STORE | REFINE AUDIO & VIDEO IMAGES TEXT WEB & SOCIAL MACHINE LOGS CRM SCM ERP 19 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 20. TERADATA UNIFIED DATA ARCHITECTURE Data Scientists Quants Customers / Partners Front-Line Workers Engineers Business Analysts Executives Operational Systems VIEWPOINT LANGUAGES MATH & STATS DATA MINING BUSINESS INTELLIGENCE APPLICATIONS SUPPORT DISCOVERY Aster Teradata INTEGRATED PLATFORM Connector DATA WAREHOUSE Aster Connector for SQL-H SQL-H Teradata Connector Hadoop for Hadoop Aster Loader Teradata Loader CAPTURE | STORE | REFINE 20 ConfidentialVIDEOproprietary. Copyright © 2013 Teradata Corporation. AUDIO & and IMAGES TEXT WEB & SOCIAL MACHINE LOGS CRM SCM ERP
  • 21. Shift from a Single Platform to an Ecosystem “Big Data requirements are solved by a range of platforms including analytical databases, discovery platforms and NoSQL solutions beyond Hadoop.” Source: “Big Data Comes of Age”. EMA and 9sight Consulting. Nov 2012. 21 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 22. How Does Big Analytics and Discovery Add Business Value? Confidential and proprietary. Copyright © 2012 Teradata Corporation.
  • 23. Customer Behavior Analysis BI Tools Database Tools Monitoring Tools EMAIL ONLINE STORE VISION BRANCH CALL CENTER CUSTOMER CUSTOMER CORRESPOND- BANKING PLATFORM TELLER DATA DATA PROFILE DATA SURVEY DATA ENCE DATA DATA DATA 23 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 24. Events Preceding Account Closure 24 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 25. Events Preceding Account Closure SELECT * FROM npath ( ON ( SELECT … WHERE u.event_description IN ( SELECT aper.event FROM attrition_paths_event_rank aper Interactive Analytics ORDER BY aper.count DESC LIMIT 10) … ) Reducing the “Noise” to find the “Signal” PATTERN ('(OTHER|EVENT){1,20}$') SYMBOLS (…) RESULT (…) ) ) n; 25 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 26. How Do We Make Big Analytics & Discovery Possible? Confidential and proprietary. Copyright © 2012 Teradata Corporation.
  • 27. Key Requirements of a Discovery Platform Highly Efficient & Performant Big Data Platform 1 That Allows Quick Iterations Hybrid Capabilities that supports SQL, statistics, 2 and new MapReduce analytics Significant Out-of-the-Box Analytical Functions 3 that Minimize Development Democratize Big Data & Maximize Enterprise Adoption 27 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 28. Teradata Aster Big Analytics Appliance First Deeply Integrated SQL, MapReduce and Hadoop Appliance UNIQUE FEATURES 1.  Integrated, modular Aster Database and 100% Open-Source Hortonworks HDP 2.  First and only ANSI SQL & HCatalog integration via SQL-H™ 3.  Industry’s only ANSI-standard SQL & MapReduce integration via SQL-MapReduce® 4.  Industry’s most manageable & supportable Apache Hadoop appliance via Teradata Viewpoint™ & TVI™ 5.  Most complete MapReduce App Portfolio with 70+ pre-built MapReduce functions 6.  Fully engineered and supported by Teradata, with Level-4 support by Hortonworks world-class Hadoop team Benefits •  Leverage existing investments in standard BI, ETL tools & people with SQL skills •  Industry’s highest performance platform for Big Analytics •  Lowest TCO (technology + people), highest ROI, and fastest time to value 28 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 29. Teradata Aster Analytics Portfolio The App Store of Big Data PATH ANALYSIS TEXT ANALYSIS Discover Patterns in Rows of Derive Patterns and Extract Sequential Data Features in Textual Data STATISTICAL ANALYSIS High-Performance Processing SEGMENTATION of Common Statistical Discover Natural Groupings Calculations of Data Points MARKETING DATA ANALYTICS TRANSFORMATION Analyze Customer Transform Data for More Interactions to Optimize Advanced Analysis Marketing Decisions 29 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 30. Unified Big Data Analytics Architecture Integrated Analytics and Navigation BI Tools, SQL, ETL BIG ANALYTICS TERADATA IDW APPLIANCE Unified Big Analytics Architecture Behavior Sentiments Multi- Discovery Structured Platform Data Facebook Unstructured Twitter Data Pinterest Social Revenue Media Iterative Operationalized Best Decision Information Analytics Possible Discovery 30 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 31. Teradata Aster Big Analytics Appliance Solution Value Add NEW •  Single vendor for lowest TCO SQL BI Analytic Hadoop •  Common system management tools Tools SQL Apps Tools Troubleshooting, and Support Common Management, Aster MapReduce •  Supports standard BI and ETL tools Portfolio of Functions •  Use Hadoop tools like Hive and Pig Hive, Pig, SQL- SQL MapReduce … •  Analytics Library w/ 70+ functions NEW SQL-H •  SQL interface to MapReduce and Hadoop •  Pre-tuned HDFS and MapReduce parameters for Big Data workloads Aster Database •  Store and manage data in Apache Hadoop or Aster Database NEW InfiniBand (40 GB/s) Interconnect Fabric •  Processing, storage, and networking designed for Big Data workloads Big Analytics Appliance Hardware •  40 GB/s InfiniBand network NEW 31 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 32. ESG Benchmark Report Summary Third Party Validation of Aster and Hadoop “Fit” Scope •  Identical hardware for Aster and Hadoop •  Clickstream, sentiment, and traditional retail data •  Compare “time to insight” and “time to develop” RESULTS Discovery Process: Analytics: Development: Loading: Transforms: Aster Aster 35x Faster Aster Hadoop Hadoop 5x Faster (range: 4–416x) 3x Faster 1.8x Faster 1.3x Faster Hadoop MapReduce 32 Hours Aster 5x Faster Discovery Cycle-Time Aster SQL-MapReduce 6 Hours (Development + Execution Time) FULL REPORT AVAILABLE AT www.asterdata.com/esg 32 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 33. Comparing Advanced Analytic Development and Execution Example: Determine Spikes In Hourly Pageviews Apache Hadoop Teradata Aster “By using SQL-MapReduce, “This is also why the execution Asterand find allMRfewerpageviews/hr to 1 takespages <100 steps pagename •  Write Java job to group records by time•  in Aster nPath is SQL as regular expressions 1 •  Input parameters in much faster.” Use Aster developbyanalytics” hour fields •  Sort the yy/mm/dd and •  Single Pass of the data •  Java reduce phase to place all same-keyed records into temporary arrays •  SQL handles group-by, counts, sorts 2 Execute •  MapReduce perform regular pattern matching •  Compute counts for low/high/low hourly page views over a sequence of rows “Rather than using MapReduce “Map•  or Reduce relational table data 3+ •  Create custom partitioner 3 Outputs written to requires •  Create custom grouping comparator processing for each step in the shuffling andtools to visualize results •  Create custom key comparator •  Use SQL or BI produces higher analysis, SQL is used in place latency than SQL” Execute •  Execute each Mapper and Reducer of a•  Map (or Reduce) phase Multiple passes of data and•  MapReduce is used only in Save output to flat files making it unstructured, stepsDB interfaces (e.g. ODBC/JDBC) 5 that cannot be •  No relational semantics and preventing use of expressed in with other tools (e.g., SSH/FTP) •  Retrieve results SQL.” Development Time: 4 hours Source: Enterprise Strategy Group, Lab Validation Report, September 2012 Development Time: 1 hour (4x faster) Execution Time: 149 seconds Execution Time: 3 seconds (50x faster) 33 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 34. Teradata Aster Big Analytics Appliance— Key Innovations Confidential and proprietary. Copyright © 2012 Teradata Corporation.
  • 35. Aster SQL-H™ A Business User’s Bridge to Analyze Hadoop Data NEW Aster SQL-H Gives Analysts and Data Scientists a Better Way to Aster: SQL-H Analyze Data Stored in Hadoop •  Allow standard ANSI SQL access to Hadoop Hadoop data MR •  Leverage existing BI tool and enable Data Filtering self service Data Hive HCatalog •  Enable 50+ prebuilt SQL-MapReduce Apps and IDE Pig Hadoop Layer: HDFS 35 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 36. Hortonworks Data Platform Enterprise-Ready Hadoop The ONLY 100% open source data platform for Hadoop •  Tightly aligned with core Apache code lines •  All code committed back to open source •  Engineered integration with Teradata Viewpoint and Ambari •  HCatalog - centralized metadata services for easy data sharing •  Dependable full stack high availability •  Capacity scheduler for better multi-tenancy •  Intuitive graphical data integration tools 36 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 37. Teradata Viewpoint Integration Easier, Faster, and Better System Management Common Management Console for Aster, Teradata and Apache Hadoop Aster-Specific Query Portlets Portlets •  Query Monitor •  Aster Node Monitoring Admin Portlets •  Aster Completed •  Teradata System Processes •  Roles Manager Trend/ Other Portlets •  System Health Visualization •  Canary queries Portlets •  Aster Alerting •  Capacity Heat Map •  Metrics Graph •  Metrics Analysis 37 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 38. Teradata Vital Infrastructure (TVI) Integrated hardware & software solution for systems management PROACTIVE RELIABILITY, AVAILABILITY, AND MANAGEABILITY 1U server virtualizes system and cabinet management software Server Management VMS •  Cabinet Management Interface Controller (CMIC) •  Service Work Station (SWS) •  Automatically installed on base/first cabinet VMS allows full Eliminates need Supports TVI Support for rack solutions for expansion Teradata Aster and without additional racks, reducing hardware and Hadoop cabinet for customers’ floor Aster/Hadoop traditional SWS space & energy software costs 62–70% of Incidents Discovered through TVI 38 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 39. How Can You Get Started? Aster Express Confidential and proprietary. Copyright © 2012 Teradata Corporation.
  • 40. Making it easy to try Aster Big Analytics Solutions Aster Express, Aster Live, Aster Big Analytics Appliance Aster Live Aster Express Aster Big Analytics Appliance 40 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 41. Aster Express Tutorials Make it Easy to Start www.asterdata.com/asterexpress 41 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 42. Teradata Aster Big Analytics Appliance Summary Bring Big Data to Life with Big Analytics & Discovery INDUSTRY’S FIRST UNIFIED BIG ANALYTICS APPLIANCE UNIFIED INTERFACES FOR ITERATIVE SQL AND MAPREDUCE ANALYTICS TERADATA-TRUSTED RELIABILITY, AVAILABILITY & MANAGEABILITY EASY TO DEPLOY, MANAGE & USE Get Started Now! asterdata.com/AsterExpress 42 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 43.
  • 44. When to Use Which? The best approach by workload and data type Processing as a Function of Schema Requirements and Stage of Data Pipeline “Simple math Data Pre- Low Cost at scale” Joins, Analytics Processing, Storage and (Score, filter, Unions, (Iterative and Reporting Refining, Fast Loading sort, avg., Aggregates data mining) Cleansing count...) Financial Analysis, Ad-Hoc/OLAP Stable Teradata/ Enterprise-Wide BITeradata Teradata Teradata and Reporting Teradata Teradata Schema Hadoop Spatial/Temporal Active Execution Interactive Data Discovery Aster Evolving Aster / Aster / (SQL + Schema Hadoop Web Clickstream, Set-Top Box Analysis Hadoop Hadoop Aster Aster Aster MapReduce CDRs, Sensor Logs, JSON Analytics) Social Feeds, Text, Image Processing Aster Format, No Schema Hadoop Hadoop Audio/Video Storage and Refining Hadoop Hadoop Hadoop Aster Aster Aster Aster (MapReduce Analytics) Storage and Batch Transformations 44 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 45. When to Use Which? The best approach by workload and data type Processing as a Function of Schema Requirements and Stage of Data Pipeline “Simple math Data Pre- Low Cost at scale” Joins, Analytics Processing, Storage and (Score, filter, Unions, (Iterative and Reporting Refining, Fast Loading sort, avg., Aggregates data mining) Cleansing count...) Stable Teradata/ Hadoop Teradata Teradata Teradata Teradata Teradata Schema Aster Evolving Aster / Aster / (SQL + Hadoop Hadoop Hadoop Aster Aster Aster Schema MapReduce Analytics) Aster Format, Hadoop Hadoop Hadoop Hadoop Hadoop Aster Aster Aster Aster (MapReduce No Schema Analytics) 45 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 46. Ease of Development and Reuse Analytic Foundation : 70+ out-of-the-box modules Modules Business-ready SQL-MapReduce Functions •  nPath: complex sequential analysis for time series analysis and behavioral pattern analysis Path Analysis •  Sessionization: identifies sessions from time series data in a single Discover patterns in rows of pass over the data sequential data •  Attribution: operator to help ad networks and websites to distribute “credit” •  Histogram: function to provide capability of generating •  Decision Trees: Native implementation of parallel random forests. •  Approximate percentiles and distinct counts: calculate Statistical percentiles and counts within specific variance Analysis •  Correlation: calculation that characterizes the strength of the relation between different columns High-performance processing of •  Regression: performs linear or logistic regression between an output common statistical calculations variable and a set of input variables •  Averages: calculate moving, weighted, exponential or volume- weighted averages over a window of data Relational •  Graph analysis: finds shortest path from a distinct node to all other Analysis nodes in a graph •  Tokenization: splits strings into individual words to assist text Discover important relationships processing among Confidential and proprietary. Copyright © 2013 Teradata Corporation. 46 data
  • 47. Ease of Development and Reuse Analytic Foundation : 50+ out-of-the-box modules Modules SQL-MapReduce Analytic Functions •  Text Processing: counts occurrences of words, identifies roots, & Text Analysis tracks relative positions of words & multi-word phrases •  Text Partition: analyzes text data over multiple rows Derive patterns in textual data •  Levenshtein Distance: computes the distance between two words •  k-Means: clusters data into a specified number of groupings •  Canopy: partitions data into overlapping subsets within which k- Cluster means is performed Analysis •  Minhash: buckets highly-dimensional items for cluster analysis •  Basket analysis: creates configurable groupings of related items Discover natural groupings of data from transaction records in single pass points •  Collaborative Filter: predicts the interests of a user by collecting interest information from many users Data •  Unpack: extracts nested data for further analysis •  Pack: compress multi-column data into a single column Transformation •  Antiselect: returns all columns except for specified column Transform data for more advanced •  Multicase: case statement that supports row match for multiple analysis cases 47 Confidential and proprietary. Copyright © 2013 Teradata Corporation.
  • 48. Perceptions & Questions Analyst: Robin Bloor Twitter Tag: #briefr The Briefing Room
  • 50. Big Data Is About Analytics DATA AIN’T WHAT IT USED TO BE Machine generated data (logs) Web data Social media data Public data services Supply chain data Real-time data flows THE ANALOGY OF STRIP-MINING IS RELEVANT BECAUSE THE SCALE OF DATA ANALYTICS HAS EXPANDED DRAMATICALLY The Bloor Group
  • 51. The Data Analytics Issue The Bloor Group
  • 52. What Hadoop Is NOT A MULTIUSER HIGHLY TUNED ENGINE AN ANALYTICS PLATFORM A SOLUTION But it IS: A USEFUL, FLEXIBLE AND VERY ECONOMIC DATA STORE – WITH PLUG-INS The Bloor Group
  • 53. About Data Analytics It is all about TIME TO INSIGHT – as long as that is followed by action Fast time to insight requires FLEXIBLE management of high performance data flows - for the benefit of the data analyst The data analyst needs to be able to MARSHAL the data Then maybe, just maybe, he will deserve the title of DATA SCIENTIST The Bloor Group
  • 54. Clearly the Teradata Aster Big Analytics Appliance is a powerful data flow engine, so: !   How does Aster Data achieve its performance lift with MapReduce? !   How is it most usually deployed? !   Can it do data cleansing in flight? !   Can it perform analytic tasks? The Bloor Group
  • 55. !   Why an appliance? What is gained and what is sacrificed? !   Which sectors/businesses do you expect to be able to make best use of this technology? !   Which companies/products do you regard as competitors (either direct or near)? !   Which companies/products do you partner with? !   How does the appliance fit in the cloud? The Bloor Group
  • 56. Twitter Tag: #briefr The Briefing Room
  • 57. Upcoming Topics This month: Big Data February: Analytics March: Open Source April: Intelligence www.insideanalysis.com Twitter Tag: #briefr The Briefing Room
  • 58. Thank You for Your Attention Twitter Tag: #briefr The Briefing Room