SlideShare ist ein Scribd-Unternehmen logo
1 von 28
Modernizing Business Processes with Big Data:
Real-World Use Cases for Production
Christoph Streubert & Amit Satoor
April 2017
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 2PUBLIC
Agenda
• Use-Cases
• What is Vora
• Architecture
Utilities rise to the
smart meter
challenge The mass of information from smart
meters is leading utility suppliers to
reconsider how they use their data
• Smart meters generate TBs of data/month
• Regulatory requirement to retain data for 10 years
• Forecasting energy usage
• Benefits of integrating data
• Meter data could help fraud detection, predict
maintenance requirements and eventually lead to
smart grids which respond intelligently to variations
in supply and demand
Agriculture takes advantage of
Precision Farming
Top Line Revenue
Growth and Lower
Costs
• Run Reports in Minutes
versus a Day or Two Days
• Improved and Scalable
Architecture Lowering
Costs
• Accurate Weather
Forecasting Leads to an
Increase in Production
Business
Challenge
SAP HANA with SAP Vora
• Migrate DW to the SAP HANA
• Leverage in-DB Machine Learning for predictive analytics
• Hadoop and Vora for low cost storage and compute of
unstructured data
Technical
Enablers
Cost Effectiveness & Improve Product Yield
• Increase in costs and lost revenue due to forecasting
challenges
• Sugar production requires accurate timing
• Managing strategic acquisitions and multiple farms
Benefits Improve Speed and Accuracy of Weather Data
• Leverage IOT data
• OCR parsing of satellite imagery data
• Focus on automation and improvement of forecasting process
• Improve the UX presentation and options
Business Benefits
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 5PUBLIC
Agenda
• Use-Cases
• What is Vora
• Architecture
Data Storage
Scalable and unified storage
across data types and sources
Data Compute
Data processing and analysis,
discovery, enhancement, and
governance, making data usable
Data Consumption
Data-driven insight connected to action
Unifying
the Data
Landscape
Integrating across
storage, compute
and consumption
CIO Imperatives & Challenges
Common Lesson
Big Data journey is incomplete
without business transformation
1.
2.
3.
4.
Big Data Journey
53%:
difficulty
integrating with
other enterprise
systems
49% can’t apply external data quickly enough
to enable context-based decision making
59% Only few analysts
with specialized training can
analyze big data
Harvard Business Reivew Analytic Services in Sep 2015
Need lower skill, production support and
performance optimization costs
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 7PUBLIC
Draft
SAP Vora
SAP Vora is an enterprise-ready, easy-to-use in-
memory distributed computing solution to help
organizations uncover actionable insights from big data.
Builds upon
Apache Spark
Seamless Integration
with SAP HANA
Runs on
Hadoop
SAP
Vora
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 8PUBLIC
Distributed Computing for the Digital Enterprise
Hortonworks Data Platform
Spark
Distributed Transaction Log
Disk-to-Memory Accelerator
Data Modeler
OLAP Time Series Graph Doc Store
SAP Vora
OPEN CONSUM PTION
Data Science, Predictive, Business Intelligence, Visualization Apps
Insights from
one single solution
In-memory distributed computing engines:
OLAP, Time Series, Graph, JSON/Doc
Disk-to-memory accelerator
Enterprise-ready
Production-ready, integrated solution
Integration with SAP HANA
Easier to use
Intuitive web interface
One SQL entry point
Open consumption
OLAP on
Hadoop for 360º
view of data Creating business scenarios views:
• Data Browser for viewing and exporting data
• SQL Editor for writing and running SQL scripts
• Modeler to visually create data models with
intuitive web interface
Time series data
analysis across
Big Data
-30
-20
-10
0
10
Temperature °C
Halifax Waterloo
Trend | Cyclical | Seasonal | Random | Exception
Efficiently analyze time series data in
distributed environments:
• Interactive access to standard time series
analysis functions using the well-known SQL
language
• Efficient compression allowing analysis of more
data using less memory
• Build time series models visually using Vora
Data Modeler
Graph engine
to uncover
connected
data
relationships
Native graph processing for:
• Interactive analysis of graphs using graph extension for SQL
• Supports directed and undirected graphs
• Algorithms for pattern matching, shortest path, and connected
components
1:Actor
NAME=‘Brad Pitt’ 4:MOVIE
TITLE=‘Mr. & Mrs. Smith’
YEAR=2005
RATING:6.5
7:DIRECTOR
NAME=‘Doug Liman’
1:Actor
NAME=‘Angelina Jolie’
3:Actor
NAME=‘Shah Rukh Khan’
5:MOVIE
TITLE=‘Kal Ho Naa Ho’
YEAR=2003
RATING=8.1
4:MOVIE
TITLE=‘My Name is Khan’
YEAR=2010
RATING-8.90
Plays in
Plays in
Director
Flexible
storage with
document
store
Support for collection of documents with
different structures:
• Interactive analysis of schema-less JSON data using the
well-known SQL language
• Capability to flexibly add or remove fields from any JSON docs
Document #1
Key: Value
Document #2
{Key: Value, Key:
Value}
Document #3
{Key: Value}
Document #4
Key: {Key: Value,
Key: Value|
Collection
Collection
Collection
Document Store
Big Data is complex
It gets more complicated as you scale
Introducing: SAP Cloud Platform Big Data Services
Fully Managed Big Data Cloud offering for Production Use
Data Centers optimized for Hadoop
Automated Operations Center
Unified Control Plane
Workbench
Business
Analytics
Search &
Discovery
Data
Exploration
Data Science
& Modeling
Custom
Applications
DataTransfer
Portal
ProactiveHelpdesk
SAP Vora
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 15PUBLIC
Agenda
• Use-Cases
• What is Vora
• Architecture
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 16PUBLIC
“Big Data” Style
 Opportunity Oriented
 Bottom-up Experimentation
 Immediate use and gratification
 Tool proliferation
 “World of Hadoop”
 Hackathons
 Better business
 Open Source
Suit vs. Hoddie
Traditional IM
 Requirements based
 Top-down design
 Integration and re-use
 Technology Consolidation
 World of EDW, CRM, ERP, ECM
 Competence Centers
 Better decisions
 Commercial Software
SAP
Vora
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 17PUBLIC
SAP centric viewCOMPUTEConsumeDataStore
GBs - TBs TBs - 10s of TBs 10s of TBs - PBs
In-
Memory
System of Record
HANA / BW/4HANA
Data
Tiering
In-
Memory
Structured data
for fast analytics
Less frequently
accessed,
structured data
Raw data:
semi-structured,
unstructured,
streaming data etc.
Data Lake
On-Premises In the Cloud
Hadoop and Spark
SAP Vora
Next Generation
Data Warehouse
= SCPBDS
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 18PUBLIC
Hadoop/Spark centric viewCOMPUTEIngestSourcesConsumeDataStore
GBs - TBs TBs - 10s of TBs 100s of TBs - PBs
Smart Data Streaming Data Services
Log Data Sensors Machine Data
In-
Memory
System of Record
HANA / BW/4HANA
Data
Tiering
In-
Memory
Structured data
for fast analytics
Less frequently
accessed,
structured data
Raw data:
semi-structured,
unstructured,
streaming data etc.
Data Lake
On-Premises In the Cloud
Hadoop and Spark
SAP Vora
Next Generation
Data Warehouse
etc.
etc.
= SCPBDS
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 19PUBLIC
InfrastructureFunctions&datatypes&toolsDataaccess
methods
Vora Value in the Hadoop world (what we mean by )
Graph
function
TimeSeries
function
JSONDocu
Store
OLAP
functions/
relational
modeling
Cypher, SQL
browser
SQL editor
Dedicated
infrastructure
Data model Data model Data model Data model
JavaScript
shell
Dedicated
infrastructure
Dedicated
infrastructure
Dedicated
infrastructure
SQL editor
Complex federation
etc.
etc.
etc.
Security Security Security Security
Distributed Transaction Log
Disk-to-Memory Accelerator
Vora Tools
(Data Browser, SQL Editor, OLAP Modeler)
Graph
JSONDoc
Store
Relational
e.g.hierarchies,
curr.conversion
Time
Series
Future
text,spatial,
video,etc.
Integrated Engines
Data Exchange
Security
Shared Hadoop Infrastructure
SAP HANA Vora
etc.
complex custom code
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 20PUBLIC
Architecture
• Delivers an in-memory relational engine that
processes Hadoop data stored in HDFS, S3,
parquet, ORC
• Combines multiple processing engines like
Time Series, Graph and Document Store
• Uses Dlog for storing metadata catalog from
Vora and HANA data sources
• Leverages SparkSQL to build native
VoraSQLcontext to provide distributed
processing of relational and other workloads
• Connects via Spark Thriftserver to provide web
based modeling UI to build “olap” models on
Hadoop data
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 21PUBLIC
Vora Cluster Manager – nodes and services assignment
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 22PUBLIC
Integration with Hadoop Management Tools
• Admin tools like Apache Ambari, are
used to administer and monitor the
Hadoop landscape
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 23PUBLIC
Native HANA
Integration
(no need for Spark
adapter)
Improved Usability of
Vora Modeler
Four New Engines:
Disk, Graph, Doc Store,
Time Series
Improved Vora Core:
Stability & enhancement
Key Capabilities of SAP Vora
New with Vora 1.3
Vora Vora Vora
Vora Vora Vora
Vora Vora Vora
Native „Data store“
in Hadoop
Multiple Engines
Relational
(OLAP)
In-
Memor
Time
Series
Doc Store
Graphs
Intuitive Tools
Tight HANA
Integration
0.1sec
∞
HANA
Hadoo
p
New
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 24PUBLIC
New in 1.4
• New Product Name "SAP Vora“
• Installation Package
• Supported Platforms (HDP2.5)
• Next-generation in-memory engine featuring SAP
Vora's native distribution technology
• Additional functions in engines
• Avro support
• Data preview
© 2017 SAP AG or an SAP affiliate company. All rights reserved. 25
Telco Customer 360 – Value Accelerators
How can you get started with SAP Vora?
1. Blog: https://blogs.sap.com/2016/12/19/a-
look-at-the-sap-hana-vora-1.3-new-analysis-
engines/
2. Developer Community:
https://www.sap.com/developer/topics/hana-
vora.html
Download
and Install
Access From
the Cloud
1. Access from the SAP Cloud Appliance Library
https://www.sap.com/developer/topics/hana-
vora.html
2. Enter credentials
3. Get up and running in the cloud
* Free SAP Vora Developer Edition plus infrastructure cost
© 2017 SAP SE or an SAP affiliate company. All rights reserved. 27PUBLIC
More Information!
Technical documentation
https://help.sap.com/viewer/p/SAP_VORA
Developer downloads
https://www.sap.com/developer/topics/vora.html#freetria
Helpful links
https://blogs.sap.com/2017/03/30/useful-sap-vora-links/
CTA
© 2017 SAP SE or an SAP affiliate company. All rights reserved.
Thank you
Christoph Streubert @cstreubert christoph.Streubert@sap.com
Amit Satoor @asatoor amit.Satoor@sap.com

Weitere ähnliche Inhalte

Was ist angesagt?

Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesDataWorks Summit
 
Format Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and ParquetFormat Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and ParquetDataWorks Summit
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionDataWorks Summit/Hadoop Summit
 
Efficient Data Formats for Analytics with Parquet and Arrow
Efficient Data Formats for Analytics with Parquet and ArrowEfficient Data Formats for Analytics with Parquet and Arrow
Efficient Data Formats for Analytics with Parquet and ArrowDataWorks Summit/Hadoop Summit
 
Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...DataWorks Summit/Hadoop Summit
 
Combining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkCombining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkDataWorks Summit/Hadoop Summit
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseDataWorks Summit
 
The Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral ProcessingThe Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral ProcessingDataWorks Summit
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...DataWorks Summit
 
Big Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the ExpertsBig Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the ExpertsDataWorks Summit/Hadoop Summit
 
Hadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the expertsHadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the expertsDataWorks Summit/Hadoop Summit
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewDataWorks Summit/Hadoop Summit
 
Visualizing Big Data in Realtime
Visualizing Big Data in RealtimeVisualizing Big Data in Realtime
Visualizing Big Data in RealtimeDataWorks Summit
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesDataWorks Summit
 
Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...DataWorks Summit
 

Was ist angesagt? (20)

Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
 
Format Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and ParquetFormat Wars: from VHS and Beta to Avro and Parquet
Format Wars: from VHS and Beta to Avro and Parquet
 
Log I am your father
Log I am your fatherLog I am your father
Log I am your father
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in Production
 
Deep Learning using Spark and DL4J for fun and profit
Deep Learning using Spark and DL4J for fun and profitDeep Learning using Spark and DL4J for fun and profit
Deep Learning using Spark and DL4J for fun and profit
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
Efficient Data Formats for Analytics with Parquet and Arrow
Efficient Data Formats for Analytics with Parquet and ArrowEfficient Data Formats for Analytics with Parquet and Arrow
Efficient Data Formats for Analytics with Parquet and Arrow
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
 
Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...Using SparkR to Scale Data Science Applications in Production. Lessons from t...
Using SparkR to Scale Data Science Applications in Production. Lessons from t...
 
Combining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache SparkCombining Machine Learning frameworks with Apache Spark
Combining Machine Learning frameworks with Apache Spark
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
 
The Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral ProcessingThe Unbearable Lightness of Ephemeral Processing
The Unbearable Lightness of Ephemeral Processing
 
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
 
Big Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the ExpertsBig Data in the Cloud - The What, Why and How from the Experts
Big Data in the Cloud - The What, Why and How from the Experts
 
Hadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the expertsHadoop in the Cloud - The what, why and how from the experts
Hadoop in the Cloud - The what, why and how from the experts
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
Visualizing Big Data in Realtime
Visualizing Big Data in RealtimeVisualizing Big Data in Realtime
Visualizing Big Data in Realtime
 
Securing Hadoop in an Enterprise Context
Securing Hadoop in an Enterprise ContextSecuring Hadoop in an Enterprise Context
Securing Hadoop in an Enterprise Context
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
 
Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...Adding structure to your streaming pipelines: moving from Spark streaming to ...
Adding structure to your streaming pipelines: moving from Spark streaming to ...
 

Ähnlich wie Modernizing Business Processes with Big Data: Real-World Use Cases for Production

SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707Henrique Pinto
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudDataWorks Summit/Hadoop Summit
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudDataWorks Summit
 
SAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSUSE Italy
 
What's New in SAP Replication Server 15.7.1 SP100
What's New in SAP Replication Server 15.7.1 SP100What's New in SAP Replication Server 15.7.1 SP100
What's New in SAP Replication Server 15.7.1 SP100Dobler Consulting
 
MDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & AnalyticsMDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & AnalyticsMDS ap
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsjdijcks
 
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsData & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsSonata Software
 
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology PlatformAccelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology PlatformSAP Technology
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Group
 
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataPentaho
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSAWS User Group Kochi
 
Business intelligence in the era of big data
Business intelligence in the era of big dataBusiness intelligence in the era of big data
Business intelligence in the era of big dataJC Raveneau
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiWhither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiFelicia Haggarty
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceSalesforce Developers
 
What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10SAP Technology
 

Ähnlich wie Modernizing Business Processes with Big Data: Real-World Use Cases for Production (20)

SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
 
SAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service Platform
 
SAP Vora CodeJam
SAP Vora CodeJamSAP Vora CodeJam
SAP Vora CodeJam
 
What's New in SAP Replication Server 15.7.1 SP100
What's New in SAP Replication Server 15.7.1 SP100What's New in SAP Replication Server 15.7.1 SP100
What's New in SAP Replication Server 15.7.1 SP100
 
MDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & AnalyticsMDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
MDS ap_OEM Product Portfolio Intorduction to the DT & Analytics
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
Data & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft PlatformsData & Analytics with CIS & Microsoft Platforms
Data & Analytics with CIS & Microsoft Platforms
 
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology PlatformAccelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
Accelerate your journey to SAP S/4HANA with SAP’s Business Technology Platform
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
 
Business intelligence in the era of big data
Business intelligence in the era of big dataBusiness intelligence in the era of big data
Business intelligence in the era of big data
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiWhither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
 
HANA a PoV
HANA a PoVHANA a PoV
HANA a PoV
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to Salesforce
 
What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10What's Planned for SAP HANA SPS10
What's Planned for SAP HANA SPS10
 

Mehr von DataWorks Summit/Hadoop Summit

Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerDataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformDataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...DataWorks Summit/Hadoop Summit
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLDataWorks Summit/Hadoop Summit
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...DataWorks Summit/Hadoop Summit
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesDataWorks Summit/Hadoop Summit
 

Mehr von DataWorks Summit/Hadoop Summit (20)

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
 

Kürzlich hochgeladen

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Kürzlich hochgeladen (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Modernizing Business Processes with Big Data: Real-World Use Cases for Production

  • 1. Modernizing Business Processes with Big Data: Real-World Use Cases for Production Christoph Streubert & Amit Satoor April 2017
  • 2. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 2PUBLIC Agenda • Use-Cases • What is Vora • Architecture
  • 3. Utilities rise to the smart meter challenge The mass of information from smart meters is leading utility suppliers to reconsider how they use their data • Smart meters generate TBs of data/month • Regulatory requirement to retain data for 10 years • Forecasting energy usage • Benefits of integrating data • Meter data could help fraud detection, predict maintenance requirements and eventually lead to smart grids which respond intelligently to variations in supply and demand
  • 4. Agriculture takes advantage of Precision Farming Top Line Revenue Growth and Lower Costs • Run Reports in Minutes versus a Day or Two Days • Improved and Scalable Architecture Lowering Costs • Accurate Weather Forecasting Leads to an Increase in Production Business Challenge SAP HANA with SAP Vora • Migrate DW to the SAP HANA • Leverage in-DB Machine Learning for predictive analytics • Hadoop and Vora for low cost storage and compute of unstructured data Technical Enablers Cost Effectiveness & Improve Product Yield • Increase in costs and lost revenue due to forecasting challenges • Sugar production requires accurate timing • Managing strategic acquisitions and multiple farms Benefits Improve Speed and Accuracy of Weather Data • Leverage IOT data • OCR parsing of satellite imagery data • Focus on automation and improvement of forecasting process • Improve the UX presentation and options Business Benefits
  • 5. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 5PUBLIC Agenda • Use-Cases • What is Vora • Architecture
  • 6. Data Storage Scalable and unified storage across data types and sources Data Compute Data processing and analysis, discovery, enhancement, and governance, making data usable Data Consumption Data-driven insight connected to action Unifying the Data Landscape Integrating across storage, compute and consumption CIO Imperatives & Challenges Common Lesson Big Data journey is incomplete without business transformation 1. 2. 3. 4. Big Data Journey 53%: difficulty integrating with other enterprise systems 49% can’t apply external data quickly enough to enable context-based decision making 59% Only few analysts with specialized training can analyze big data Harvard Business Reivew Analytic Services in Sep 2015 Need lower skill, production support and performance optimization costs
  • 7. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 7PUBLIC Draft SAP Vora SAP Vora is an enterprise-ready, easy-to-use in- memory distributed computing solution to help organizations uncover actionable insights from big data. Builds upon Apache Spark Seamless Integration with SAP HANA Runs on Hadoop SAP Vora
  • 8. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 8PUBLIC Distributed Computing for the Digital Enterprise Hortonworks Data Platform Spark Distributed Transaction Log Disk-to-Memory Accelerator Data Modeler OLAP Time Series Graph Doc Store SAP Vora OPEN CONSUM PTION Data Science, Predictive, Business Intelligence, Visualization Apps Insights from one single solution In-memory distributed computing engines: OLAP, Time Series, Graph, JSON/Doc Disk-to-memory accelerator Enterprise-ready Production-ready, integrated solution Integration with SAP HANA Easier to use Intuitive web interface One SQL entry point Open consumption
  • 9. OLAP on Hadoop for 360º view of data Creating business scenarios views: • Data Browser for viewing and exporting data • SQL Editor for writing and running SQL scripts • Modeler to visually create data models with intuitive web interface
  • 10. Time series data analysis across Big Data -30 -20 -10 0 10 Temperature °C Halifax Waterloo Trend | Cyclical | Seasonal | Random | Exception Efficiently analyze time series data in distributed environments: • Interactive access to standard time series analysis functions using the well-known SQL language • Efficient compression allowing analysis of more data using less memory • Build time series models visually using Vora Data Modeler
  • 11. Graph engine to uncover connected data relationships Native graph processing for: • Interactive analysis of graphs using graph extension for SQL • Supports directed and undirected graphs • Algorithms for pattern matching, shortest path, and connected components 1:Actor NAME=‘Brad Pitt’ 4:MOVIE TITLE=‘Mr. & Mrs. Smith’ YEAR=2005 RATING:6.5 7:DIRECTOR NAME=‘Doug Liman’ 1:Actor NAME=‘Angelina Jolie’ 3:Actor NAME=‘Shah Rukh Khan’ 5:MOVIE TITLE=‘Kal Ho Naa Ho’ YEAR=2003 RATING=8.1 4:MOVIE TITLE=‘My Name is Khan’ YEAR=2010 RATING-8.90 Plays in Plays in Director
  • 12. Flexible storage with document store Support for collection of documents with different structures: • Interactive analysis of schema-less JSON data using the well-known SQL language • Capability to flexibly add or remove fields from any JSON docs Document #1 Key: Value Document #2 {Key: Value, Key: Value} Document #3 {Key: Value} Document #4 Key: {Key: Value, Key: Value| Collection Collection Collection Document Store
  • 13. Big Data is complex It gets more complicated as you scale
  • 14. Introducing: SAP Cloud Platform Big Data Services Fully Managed Big Data Cloud offering for Production Use Data Centers optimized for Hadoop Automated Operations Center Unified Control Plane Workbench Business Analytics Search & Discovery Data Exploration Data Science & Modeling Custom Applications DataTransfer Portal ProactiveHelpdesk SAP Vora
  • 15. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 15PUBLIC Agenda • Use-Cases • What is Vora • Architecture
  • 16. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 16PUBLIC “Big Data” Style  Opportunity Oriented  Bottom-up Experimentation  Immediate use and gratification  Tool proliferation  “World of Hadoop”  Hackathons  Better business  Open Source Suit vs. Hoddie Traditional IM  Requirements based  Top-down design  Integration and re-use  Technology Consolidation  World of EDW, CRM, ERP, ECM  Competence Centers  Better decisions  Commercial Software SAP Vora
  • 17. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 17PUBLIC SAP centric viewCOMPUTEConsumeDataStore GBs - TBs TBs - 10s of TBs 10s of TBs - PBs In- Memory System of Record HANA / BW/4HANA Data Tiering In- Memory Structured data for fast analytics Less frequently accessed, structured data Raw data: semi-structured, unstructured, streaming data etc. Data Lake On-Premises In the Cloud Hadoop and Spark SAP Vora Next Generation Data Warehouse = SCPBDS
  • 18. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 18PUBLIC Hadoop/Spark centric viewCOMPUTEIngestSourcesConsumeDataStore GBs - TBs TBs - 10s of TBs 100s of TBs - PBs Smart Data Streaming Data Services Log Data Sensors Machine Data In- Memory System of Record HANA / BW/4HANA Data Tiering In- Memory Structured data for fast analytics Less frequently accessed, structured data Raw data: semi-structured, unstructured, streaming data etc. Data Lake On-Premises In the Cloud Hadoop and Spark SAP Vora Next Generation Data Warehouse etc. etc. = SCPBDS
  • 19. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 19PUBLIC InfrastructureFunctions&datatypes&toolsDataaccess methods Vora Value in the Hadoop world (what we mean by ) Graph function TimeSeries function JSONDocu Store OLAP functions/ relational modeling Cypher, SQL browser SQL editor Dedicated infrastructure Data model Data model Data model Data model JavaScript shell Dedicated infrastructure Dedicated infrastructure Dedicated infrastructure SQL editor Complex federation etc. etc. etc. Security Security Security Security Distributed Transaction Log Disk-to-Memory Accelerator Vora Tools (Data Browser, SQL Editor, OLAP Modeler) Graph JSONDoc Store Relational e.g.hierarchies, curr.conversion Time Series Future text,spatial, video,etc. Integrated Engines Data Exchange Security Shared Hadoop Infrastructure SAP HANA Vora etc. complex custom code
  • 20. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 20PUBLIC Architecture • Delivers an in-memory relational engine that processes Hadoop data stored in HDFS, S3, parquet, ORC • Combines multiple processing engines like Time Series, Graph and Document Store • Uses Dlog for storing metadata catalog from Vora and HANA data sources • Leverages SparkSQL to build native VoraSQLcontext to provide distributed processing of relational and other workloads • Connects via Spark Thriftserver to provide web based modeling UI to build “olap” models on Hadoop data
  • 21. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 21PUBLIC Vora Cluster Manager – nodes and services assignment
  • 22. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 22PUBLIC Integration with Hadoop Management Tools • Admin tools like Apache Ambari, are used to administer and monitor the Hadoop landscape
  • 23. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 23PUBLIC Native HANA Integration (no need for Spark adapter) Improved Usability of Vora Modeler Four New Engines: Disk, Graph, Doc Store, Time Series Improved Vora Core: Stability & enhancement Key Capabilities of SAP Vora New with Vora 1.3 Vora Vora Vora Vora Vora Vora Vora Vora Vora Native „Data store“ in Hadoop Multiple Engines Relational (OLAP) In- Memor Time Series Doc Store Graphs Intuitive Tools Tight HANA Integration 0.1sec ∞ HANA Hadoo p New
  • 24. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 24PUBLIC New in 1.4 • New Product Name "SAP Vora“ • Installation Package • Supported Platforms (HDP2.5) • Next-generation in-memory engine featuring SAP Vora's native distribution technology • Additional functions in engines • Avro support • Data preview
  • 25. © 2017 SAP AG or an SAP affiliate company. All rights reserved. 25 Telco Customer 360 – Value Accelerators
  • 26. How can you get started with SAP Vora? 1. Blog: https://blogs.sap.com/2016/12/19/a- look-at-the-sap-hana-vora-1.3-new-analysis- engines/ 2. Developer Community: https://www.sap.com/developer/topics/hana- vora.html Download and Install Access From the Cloud 1. Access from the SAP Cloud Appliance Library https://www.sap.com/developer/topics/hana- vora.html 2. Enter credentials 3. Get up and running in the cloud * Free SAP Vora Developer Edition plus infrastructure cost
  • 27. © 2017 SAP SE or an SAP affiliate company. All rights reserved. 27PUBLIC More Information! Technical documentation https://help.sap.com/viewer/p/SAP_VORA Developer downloads https://www.sap.com/developer/topics/vora.html#freetria Helpful links https://blogs.sap.com/2017/03/30/useful-sap-vora-links/ CTA
  • 28. © 2017 SAP SE or an SAP affiliate company. All rights reserved. Thank you Christoph Streubert @cstreubert christoph.Streubert@sap.com Amit Satoor @asatoor amit.Satoor@sap.com