Hadoop running on Retail Business
1
Author: Douglas Bernardini
Onofre Profile
2
• Onofre: CVS's operations in Brazil.
• Pharmacy network: 50 stores.
• 2,100 employees
• 1 distribution center
• 37% of sales through e-commerce
• 25% through mobile/tablet
• Call center: 201 positions
• No omni-channel process.
IT perspective
3
• SAP/ECC IS Retail as the central component
• SAP/BObjects: limited per-user licenses
  • Financial team only
• Legacy POS system: Cobol
  • Okidata/Itautec
• Legacy e-commerce: Vanroy
  • Customized .NET solution
• 100% internal datacenter operation
  • No outsourcing
  • No cloud services
SAP/System landscape
4
Case: Sales Performance Info
5
• No mobile access to sales reports: desktop only.
• No friendly, summarized dashboard
• Delay of 1 day or more: the sales figures available today are yesterday's.
• Slow performance: more than 1 minute per report
• E-commerce
  • No sales results by region
  • No complete conversion-rate report
• Main physical store needs
  • No report of sales lost to stock-outs
Project ‘WEB Pharma’
6
• Objectives
  • Build a user-friendly dashboard with the main retail business decision information.
  • Be mobile! Users must be able to use the dashboard remotely from internet-connected devices.
  • Summarize e-commerce and physical store sales together
  • All reports must be delivered in less than 10 s
• Strategy
  • Export legacy data to an external cloud data server (do not use the internal datacenter).
  • Data streaming must process sales data from the last hour.
• Premises
  • 100% secure connection (SOX compliance)
  • Low CAPEX & limited budget
  • 3-month deadline.
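Two of the objectives above, summarizing e-commerce and physical store sales together and working on the last hour of sales, can be illustrated with a minimal Python sketch. The CSV column names (`sold_at`, `channel`, `amount`), the sample rows, and the function name are assumptions made for illustration; the slides do not describe the real export layout.

```python
import csv
import io
from collections import defaultdict
from datetime import datetime, timedelta


def summarize_last_hour(csv_texts, now):
    """Sum sales per channel for rows timestamped within the hour before `now`."""
    cutoff = now - timedelta(hours=1)
    totals = defaultdict(float)
    for text in csv_texts:
        for row in csv.DictReader(io.StringIO(text)):
            sold_at = datetime.fromisoformat(row["sold_at"])
            if cutoff < sold_at <= now:
                totals[row["channel"]] += float(row["amount"])
    summary = dict(totals)
    summary["all"] = sum(totals.values())  # combined figure across channels
    return summary


# Hypothetical exports: one from the physical stores, one from the web shop.
STORE_CSV = (
    "sold_at,channel,amount\n"
    "2013-06-01T10:15:00,store,50.00\n"
    "2013-06-01T08:00:00,store,20.00\n"  # older than one hour, ignored
)
WEB_CSV = "sold_at,channel,amount\n2013-06-01T10:40:00,web,30.50\n"

summary = summarize_last_hour([STORE_CSV, WEB_CSV], datetime(2013, 6, 1, 11, 0))
print(summary)
```

In the project itself this aggregation would run on the Hadoop side; the sketch only shows the windowing and the combined total.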
Big Data Architecture
7
[Architecture diagram. Labels recoverable from the slide:]
• Sources: Brick & Mortar Store (Cobol, Okidata) and E-commerce (WEB) (Vanroy, .NET)
• Connection labels: .csv (two), SSH (two)
• Data Pipe
• Data Integrator: Apache Flume
• CDH3: MapReduce, HDFS, HBase, HiveSQL
• Workflow Scheduling: Apache Oozie
• Sqoop, MySQL/S3
• User Interface: Tableau Connector, Tableau Online, D3 Visualization
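The MapReduce stage of this architecture can be pictured with a small, single-process Python sketch of the map, shuffle, and reduce steps applied to CSV sales lines. It is only an illustration of the programming model, not the project's job code; the `store_id,amount` line layout and all function names are hypothetical.

```python
from collections import defaultdict


def map_sale(line):
    """Map step: turn one 'store_id,amount' CSV line into a (key, value) pair."""
    store_id, amount = line.strip().split(",")
    return store_id, float(amount)


def shuffle(pairs):
    """Shuffle step: group all values that share the same key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_sales(amounts):
    """Reduce step: total the amounts collected for one key."""
    return sum(amounts)


def run_job(lines):
    grouped = shuffle(map_sale(line) for line in lines)
    return {store_id: reduce_sales(amounts) for store_id, amounts in grouped.items()}


totals = run_job(["s1,10.0", "s2,5.0", "s1,2.5"])
print(totals)
```

On a real cluster the map and reduce functions run in parallel on different nodes and the framework performs the shuffle; here everything runs in one process.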
BI x Big Data: Comparison
8
Aspect | Business Intelligence | Big Data
Volume | Terabytes | Petabytes
Velocity | Batch, real-time, near real-time | Streams
Data source | Internal | External
Value | Single source of truth | Statistical and hypothetical
Variety | Single sources | Probabilistic and multi-factor
Data sharpness | Consistent and reliable | Better to be roughly right than precisely wrong
Frequency | Millions of records per minute | Billions of documents per second
Master data | Important part of results | Not necessary
Server sizing | Evolution planned; could be done internally. Elastic cloud considered an alternative. | Storage/memory growing faster than ever. Elastic cloud is crucial.
BI x Big Data: Comparison
9
Aspect | Business Intelligence | Big Data
Main business objective | Business monitoring, internal insights and process optimization | Data monetization, business metamorphosis and new opportunities
Object of analysis | Current business process | Non-existent business process
Data source | Internal | External
Approach | Reactive: what happened, and what can we do about it? | Predictive: what will happen tomorrow, and how can we be prepared?
Mindset | Examine the data and find the root causes of problems, proposing process optimization | How can we make some REAL money with this data?
Data sharpness | Consistent and reliable | Better to be roughly right than precisely wrong
Scope | 2 or 3 departments | Entire company, across departments
Business model | Pre-existing benchmark | No benchmark
View modeling | Preconceived KPIs, already pre-formatted | No clear idea of the exact objective and business needs
Why AWS?
10
• Ready to GO cloud services;
• Scalable;
• Cost-Effective;
In this project
• Ready Secure Internet connection (SSH)
• S3: Simple web services interface
• EC2: Linux CentOS ready to go template.
• Cloudera Partner
• Pipeline: Reliably process and move data between different AWS compute and storage services
Server Highlights
11
• 21.5 TB of historic data (3 years)
• Risk: poor data-transfer network
• AWS Import/Export Snowball
Data
• Estimated data transfer: 140 MB per data package
• 200 packages/day: 28 GB/day
PRD Server config
• Red Hat 6.4, 256 GB of RAM
• Processor: 4 x 12 cores, 5 GHz
• 2x420 storage (10G)
Users
• 350 users
• 50 stores
• 40MB/day each
Network Bandwidth
• Inbound:
• 5Gb
• Outbound:
• 10Gb
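The data-transfer figures on this slide can be checked with a few lines of Python. The daily volume follows directly from the slide (140 MB x 200 packages). The second calculation is a rough estimate of how long the 21.5 TB of historic data would take over a network link; the 100 Mbit/s link speed is a purely hypothetical value chosen for illustration, and decimal units (1 GB = 1000 MB) are assumed.

```python
PACKAGE_MB = 140        # from the slide: size of one data package
PACKAGES_PER_DAY = 200  # from the slide
HISTORIC_TB = 21.5      # from the slide: 3 years of historic data

# Daily volume in GB, assuming decimal units (1 GB = 1000 MB).
daily_gb = PACKAGE_MB * PACKAGES_PER_DAY / 1000


def transfer_days(size_tb, link_mbit_per_s):
    """Days needed to move `size_tb` over a link, ignoring protocol overhead."""
    size_megabits = size_tb * 1_000_000 * 8  # TB -> MB -> megabits
    seconds = size_megabits / link_mbit_per_s
    return seconds / 86_400


print(daily_gb)                                   # 28.0
print(round(transfer_days(HISTORIC_TB, 100), 1))  # 19.9 (hypothetical 100 Mbit/s link)
```

Under that assumed link speed the historic load alone would occupy the line for roughly three weeks of the three-month project, which is consistent with the slide listing a poor network as a risk next to a physical import service.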
Hadoop Highlights
12
• Objective: fast response for end users
• Master nodes: 01 (*)
• Slave nodes: 07
• Sqoop: Hadoop-native connector to MySQL
• Hue: SQL-like UI for DBAs, used for data validation.
• Oozie: scheduler system to manage Hadoop jobs.
  • Jobs are triggered by time (frequency) and data availability.
• Hive: querying large datasets.
  • SQL-like language: HiveQL.
(*) Modified after go-live
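The slide describes Oozie jobs as triggered by time (frequency) and data availability. The sketch below illustrates that rule in plain Python. It is not Oozie's API (Oozie coordinators are defined in XML), and the function name, the file paths, and the one-hour frequency are hypothetical.

```python
from datetime import datetime, timedelta


def should_run(now, last_run, frequency, required_inputs, available_inputs):
    """Return True when the interval has elapsed and every required input exists."""
    time_ready = last_run is None or (now - last_run) >= frequency
    data_ready = set(required_inputs) <= set(available_inputs)
    return time_ready and data_ready


# Hypothetical example: an hourly job that needs both CSV exports to be present.
required = ["/landing/store_sales.csv", "/landing/web_sales.csv"]
last_run = datetime(2013, 6, 1, 10, 0)
now = datetime(2013, 6, 1, 11, 0)

ready = should_run(now, last_run, timedelta(hours=1), required, required)
waiting = should_run(now, last_run, timedelta(hours=1), required, required[:1])
print(ready, waiting)
```

The first call has both conditions satisfied; the second is held back because one of the required inputs has not arrived yet.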
Why Cloudera?
13
• Stable Hadoop distribution
• Simple admin: Cloudera Manager
• Integrated
In this project
• Tableau ready-to-go connector
• CDH3: Open source (cost-effective)
• Fast installation
• Fast tuning
Why Tableau?
14
• User-friendly, with a strong impact on user satisfaction
• Mobile ready-to-go application
• Easy to install as an Android app.
In this project
• Cost-effective solution
• Lowest price per end user.
• Retail ready-to-go template.
• Brazilian localization done.
• BC in Retail
Why Not SAP?
15
• High cost of user licenses (project demands 350 new users)
• SAP BusinessObjects retail template: poor fit
• Huge investment in customized reporting
• Hardware processing contention with financial users
• Impact on month-end closing reports.
• High investment in hardware to get the expected performance
• 2013: No AWS instance ready for SAP/BOBJ
• SAP HANA not mature yet.
• Lack of consultants
• No business case (retail) running in Brazil
Project Methodology
16
• BI projects: intensive validation with REAL data
• Key users must really believe in the new indicators (expectations).
• Intense deliverable schedule: early delivery to allow validation
• Minimum project scope: 10 reports
  • 07 standard: Tableau
  • 03 customized: D3 visualization
• 01 dashboard
  • Tableau
• Project implementation strategy: PoC
• Consistent validation: 02 stores & 10 users
• Testing in the real environment: consistent issues log (performance)
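A performance issues log of the kind mentioned above could be kept with a small timing helper like the following Python sketch. The 10-second limit comes from the project objectives; the function names, the report names, and the injectable `clock` parameter (used here so the example is deterministic) are assumptions for illustration, not the project's actual tooling.

```python
import time

TARGET_SECONDS = 10.0  # project objective: every report delivered in under 10 s


def time_report(name, build_report, clock=time.perf_counter):
    """Run one report builder and record how long it took."""
    start = clock()
    build_report()
    elapsed = clock() - start
    return {"report": name, "seconds": elapsed, "within_target": elapsed < TARGET_SECONDS}


def issues_log(results):
    """Keep only the reports that missed the response-time target."""
    return [r for r in results if not r["within_target"]]


# Deterministic demo: a fake clock stands in for real elapsed time.
ticks = iter([0.0, 0.4, 0.0, 12.0])
fake_clock = lambda: next(ticks)

results = [
    time_report("sales_by_region", lambda: None, fake_clock),  # hypothetical report
    time_report("conversion_rate", lambda: None, fake_clock),  # hypothetical report
]
issues = issues_log(results)
print([r["report"] for r in issues])
```

With the default `time.perf_counter` clock the same helper measures real report runs during a PoC.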
Project Schedule
17
[Gantt chart: activities by phase over a 3-month duration (months 01, 02, 03), with PoC and Go-Live milestones.]
• AWS: S3, EC2 & Data Pipes installation
• Cloudera (Hadoop): CDH3 installation; Flume & Hive set-up
• Integrations: CSV data entry; Tableau connector; Sqoop set-up
• Visualization: indicators design; Tableau configuration; D3 configuration
• Testing & QA: load historic data; final devs validation; PoC (02 stores); adjustments & tuning
• GO: final PRD delivery; assisted operation
Project Results
18
• Response time: 0.4 s
• High user adoption.
• Data visualization triggered several business initiatives
• 2nd wave approved, with 02 additional dashboards and 32 new reports.
• WEB reports demonstrate the structuring of the omni-channel process & new business needs
douglas.bernardini@d2-data.com
Questions?
19

Weitere ähnliche Inhalte

Was ist angesagt?

Spot Buy – A New Sales Channel for Suppliers to Reach SAP Customers
Spot Buy – A New Sales Channel for Suppliers to Reach SAP CustomersSpot Buy – A New Sales Channel for Suppliers to Reach SAP Customers
Spot Buy – A New Sales Channel for Suppliers to Reach SAP CustomersSAP Ariba
 
Retail Big Data and Analytics
Retail Big Data and AnalyticsRetail Big Data and Analytics
Retail Big Data and AnalyticsCloudera, Inc.
 
Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623
Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623
Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623SAP Ariba Live 2018
 
Customer Deep Dive: Supplier Innovations - 56585
Customer Deep Dive: Supplier Innovations - 56585Customer Deep Dive: Supplier Innovations - 56585
Customer Deep Dive: Supplier Innovations - 56585SAP Ariba Live 2018
 
Towards connected planning for Supply Chain
Towards connected planning for Supply Chain Towards connected planning for Supply Chain
Towards connected planning for Supply Chain Bluecrux
 
New Approach to Supply Chain Analytics
New Approach to Supply Chain AnalyticsNew Approach to Supply Chain Analytics
New Approach to Supply Chain Analyticsdemando
 
Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566
Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566
Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566SAP Ariba Live 2018
 
Supply Chain Analytics with Simulation
Supply Chain Analytics with SimulationSupply Chain Analytics with Simulation
Supply Chain Analytics with SimulationSteve Haekler
 
Financial Services Procurement in a Digital World - 56608
Financial Services Procurement in a Digital World - 56608Financial Services Procurement in a Digital World - 56608
Financial Services Procurement in a Digital World - 56608SAP Ariba Live 2018
 
The Future of Design-to-Deliver Supply Chains - 56560
The Future of Design-to-Deliver Supply Chains - 56560The Future of Design-to-Deliver Supply Chains - 56560
The Future of Design-to-Deliver Supply Chains - 56560SAP Ariba Live 2018
 
Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556
Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556
Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556SAP Ariba Live 2018
 
Automated Product Ratings and Review dashboard
Automated Product Ratings and Review dashboardAutomated Product Ratings and Review dashboard
Automated Product Ratings and Review dashboardBalaji Katakam
 
IT Solutions for Retail
IT Solutions for RetailIT Solutions for Retail
IT Solutions for RetailScienceSoft
 
Digitalized Supply-Chain Collaboration - 56547
Digitalized Supply-Chain Collaboration - 56547Digitalized Supply-Chain Collaboration - 56547
Digitalized Supply-Chain Collaboration - 56547SAP Ariba Live 2018
 
Raise Your Game: Meeting the Changing Expectations in MRO Procurement
Raise Your Game: Meeting the Changing Expectations in MRO ProcurementRaise Your Game: Meeting the Changing Expectations in MRO Procurement
Raise Your Game: Meeting the Changing Expectations in MRO ProcurementOpusCapita
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesYellowfin
 

Was ist angesagt? (20)

Spot Buy – A New Sales Channel for Suppliers to Reach SAP Customers
Spot Buy – A New Sales Channel for Suppliers to Reach SAP CustomersSpot Buy – A New Sales Channel for Suppliers to Reach SAP Customers
Spot Buy – A New Sales Channel for Suppliers to Reach SAP Customers
 
Retail Big Data and Analytics
Retail Big Data and AnalyticsRetail Big Data and Analytics
Retail Big Data and Analytics
 
Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623
Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623
Achieving Procurement 2.0 with Guided Buying: The Royal Philips Story - 56623
 
Customer Deep Dive: Supplier Innovations - 56585
Customer Deep Dive: Supplier Innovations - 56585Customer Deep Dive: Supplier Innovations - 56585
Customer Deep Dive: Supplier Innovations - 56585
 
Towards connected planning for Supply Chain
Towards connected planning for Supply Chain Towards connected planning for Supply Chain
Towards connected planning for Supply Chain
 
Supply chain analytics
Supply chain analyticsSupply chain analytics
Supply chain analytics
 
New Approach to Supply Chain Analytics
New Approach to Supply Chain AnalyticsNew Approach to Supply Chain Analytics
New Approach to Supply Chain Analytics
 
Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566
Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566
Aligning Procurement and Payables to Strengthen Your Supply Chain - 56566
 
Supply Chain Analytics with Simulation
Supply Chain Analytics with SimulationSupply Chain Analytics with Simulation
Supply Chain Analytics with Simulation
 
Financial Services Procurement in a Digital World - 56608
Financial Services Procurement in a Digital World - 56608Financial Services Procurement in a Digital World - 56608
Financial Services Procurement in a Digital World - 56608
 
The Future of Design-to-Deliver Supply Chains - 56560
The Future of Design-to-Deliver Supply Chains - 56560The Future of Design-to-Deliver Supply Chains - 56560
The Future of Design-to-Deliver Supply Chains - 56560
 
SAP Ariba Overview Roca
SAP Ariba Overview RocaSAP Ariba Overview Roca
SAP Ariba Overview Roca
 
Delivering the Future to the Shelf Edge
Delivering the Future to the Shelf EdgeDelivering the Future to the Shelf Edge
Delivering the Future to the Shelf Edge
 
Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556
Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556
Collaborating with Direct Spend Suppliers in the Life Sciences Industry - 56556
 
Automated Product Ratings and Review dashboard
Automated Product Ratings and Review dashboardAutomated Product Ratings and Review dashboard
Automated Product Ratings and Review dashboard
 
IT Solutions for Retail
IT Solutions for RetailIT Solutions for Retail
IT Solutions for Retail
 
Digitalized Supply-Chain Collaboration - 56547
Digitalized Supply-Chain Collaboration - 56547Digitalized Supply-Chain Collaboration - 56547
Digitalized Supply-Chain Collaboration - 56547
 
Raise Your Game: Meeting the Changing Expectations in MRO Procurement
Raise Your Game: Meeting the Changing Expectations in MRO ProcurementRaise Your Game: Meeting the Changing Expectations in MRO Procurement
Raise Your Game: Meeting the Changing Expectations in MRO Procurement
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer Stories
 
Retail & CPG
Retail & CPGRetail & CPG
Retail & CPG
 

Andere mochten auch

BIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in RetailBIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in RetailSkillspeed
 
Real-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with HadoopReal-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with HadoopDataWorks Summit
 
Market your content as you would a product
Market your content as you would a productMarket your content as you would a product
Market your content as you would a productSocial Intelligence
 
TCI 2014 The case of Medellin Health City
TCI 2014 The case of Medellin Health CityTCI 2014 The case of Medellin Health City
TCI 2014 The case of Medellin Health CityTCI Network
 
Hortonworks hadoop big data_retail__white_paper
Hortonworks hadoop big data_retail__white_paperHortonworks hadoop big data_retail__white_paper
Hortonworks hadoop big data_retail__white_paperShyam Babu
 
Cloud Computing: Hadoop
Cloud Computing: HadoopCloud Computing: Hadoop
Cloud Computing: Hadoopdarugar
 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks
 
10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems Webinar10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems WebinarCloudera, Inc.
 
Surprising failure factors when implementing eCommerce and Omnichannel eBusiness
Surprising failure factors when implementing eCommerce and Omnichannel eBusinessSurprising failure factors when implementing eCommerce and Omnichannel eBusiness
Surprising failure factors when implementing eCommerce and Omnichannel eBusinessDivante
 
Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)Divante
 
Omnichannel Customer Experience
Omnichannel Customer ExperienceOmnichannel Customer Experience
Omnichannel Customer ExperienceDivante
 
Practical Problem Solving with Apache Hadoop & Pig
Practical Problem Solving with Apache Hadoop & PigPractical Problem Solving with Apache Hadoop & Pig
Practical Problem Solving with Apache Hadoop & PigMilind Bhandarkar
 
The Technical SEO Renaissance
The Technical SEO RenaissanceThe Technical SEO Renaissance
The Technical SEO RenaissanceMichael King
 
End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...
End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...
End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...Mark Rittman
 
Building a Scalable Web Crawler with Hadoop
Building a Scalable Web Crawler with HadoopBuilding a Scalable Web Crawler with Hadoop
Building a Scalable Web Crawler with HadoopHadoop User Group
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecasesudhakara st
 
Big Data & Hadoop Tutorial
Big Data & Hadoop TutorialBig Data & Hadoop Tutorial
Big Data & Hadoop TutorialEdureka!
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?sudhakara st
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 

Andere mochten auch (20)

BIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in RetailBIG Data & Hadoop Applications in Retail
BIG Data & Hadoop Applications in Retail
 
Real-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with HadoopReal-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with Hadoop
 
Market your content as you would a product
Market your content as you would a productMarket your content as you would a product
Market your content as you would a product
 
TCI 2014 The case of Medellin Health City
TCI 2014 The case of Medellin Health CityTCI 2014 The case of Medellin Health City
TCI 2014 The case of Medellin Health City
 
Hortonworks hadoop big data_retail__white_paper
Hortonworks hadoop big data_retail__white_paperHortonworks hadoop big data_retail__white_paper
Hortonworks hadoop big data_retail__white_paper
 
Cloud Computing: Hadoop
Cloud Computing: HadoopCloud Computing: Hadoop
Cloud Computing: Hadoop
 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
 
10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems Webinar10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems Webinar
 
Surprising failure factors when implementing eCommerce and Omnichannel eBusiness
Surprising failure factors when implementing eCommerce and Omnichannel eBusinessSurprising failure factors when implementing eCommerce and Omnichannel eBusiness
Surprising failure factors when implementing eCommerce and Omnichannel eBusiness
 
Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)Magento scalability from the trenches (Meet Magento Sweden 2016)
Magento scalability from the trenches (Meet Magento Sweden 2016)
 
Omnichannel Customer Experience
Omnichannel Customer ExperienceOmnichannel Customer Experience
Omnichannel Customer Experience
 
Practical Problem Solving with Apache Hadoop & Pig
Practical Problem Solving with Apache Hadoop & PigPractical Problem Solving with Apache Hadoop & Pig
Practical Problem Solving with Apache Hadoop & Pig
 
The Technical SEO Renaissance
The Technical SEO RenaissanceThe Technical SEO Renaissance
The Technical SEO Renaissance
 
End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...
End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...
End to-end hadoop development using OBIEE, ODI, Oracle Big Data SQL and Oracl...
 
Building a Scalable Web Crawler with Hadoop
Building a Scalable Web Crawler with HadoopBuilding a Scalable Web Crawler with Hadoop
Building a Scalable Web Crawler with Hadoop
 
Hadoop project design and a usecase
Hadoop project design and  a usecaseHadoop project design and  a usecase
Hadoop project design and a usecase
 
Big Data & Hadoop Tutorial
Big Data & Hadoop TutorialBig Data & Hadoop Tutorial
Big Data & Hadoop Tutorial
 
Hadoop introduction , Why and What is Hadoop ?
Hadoop introduction , Why and What is  Hadoop ?Hadoop introduction , Why and What is  Hadoop ?
Hadoop introduction , Why and What is Hadoop ?
 
Big data and Hadoop
Big data and HadoopBig data and Hadoop
Big data and Hadoop
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 

Ähnlich wie Hadoop on retail

apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...
apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...
apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...apidays
 
OpenWorld: 4 Real-world Cloud Migration Case Studies
OpenWorld: 4 Real-world Cloud Migration Case StudiesOpenWorld: 4 Real-world Cloud Migration Case Studies
OpenWorld: 4 Real-world Cloud Migration Case StudiesDatavail
 
How Schneider Electric Transformed Front-office Operations With Real-time Dat...
How Schneider Electric Transformed Front-office Operations With Real-time Dat...How Schneider Electric Transformed Front-office Operations With Real-time Dat...
How Schneider Electric Transformed Front-office Operations With Real-time Dat...Informatica Cloud
 
Case Study: Lessons from Newell Rubbermaid's SAP HANA Proof of Concept
Case Study: Lessons from Newell Rubbermaid's SAP HANA Proof of ConceptCase Study: Lessons from Newell Rubbermaid's SAP HANA Proof of Concept
Case Study: Lessons from Newell Rubbermaid's SAP HANA Proof of ConceptSAPinsider Events
 
Migration to Oracle ERP Cloud: A must read winning recipe for all
Migration to Oracle ERP Cloud: A must read winning recipe for allMigration to Oracle ERP Cloud: A must read winning recipe for all
Migration to Oracle ERP Cloud: A must read winning recipe for allJim Pang
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsLooker
 
SAP migration and integration success
SAP migration and integration successSAP migration and integration success
SAP migration and integration successVivian Yang Shic
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
CDS Overview (May 2015)
CDS Overview (May 2015)CDS Overview (May 2015)
CDS Overview (May 2015)Karim Lalji
 
Assessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-ModelAssessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-ModelDATAVERSITY
 
Microsoft Dynamics 365 IA - Copilot/ Fabric
Microsoft Dynamics 365 IA - Copilot/ FabricMicrosoft Dynamics 365 IA - Copilot/ Fabric
Microsoft Dynamics 365 IA - Copilot/ FabricJuan Fabian
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Cloudera, Inc.
 
Social Solutions Apricot 360: Client Case Management Software
Social Solutions Apricot 360: Client Case Management SoftwareSocial Solutions Apricot 360: Client Case Management Software
Social Solutions Apricot 360: Client Case Management SoftwareJeffrey Haguewood
 
Pratibha Chaudhary 6 years Exp SAP ABAP
Pratibha Chaudhary 6 years Exp SAP ABAPPratibha Chaudhary 6 years Exp SAP ABAP
Pratibha Chaudhary 6 years Exp SAP ABAPpratibha44
 
How to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSIHow to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSIDenodo
 
Jan 2017 Investment Recommendation for Tableau
Jan 2017 Investment Recommendation for TableauJan 2017 Investment Recommendation for Tableau
Jan 2017 Investment Recommendation for Tableaupaulchenuva
 
How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...NRB
 
Bulletproof Your QAD ERP to Cloud | JK Tech Webinar
Bulletproof Your QAD ERP to Cloud | JK Tech WebinarBulletproof Your QAD ERP to Cloud | JK Tech Webinar
Bulletproof Your QAD ERP to Cloud | JK Tech WebinarJK Tech
 

Ähnlich wie Hadoop on retail (20)

apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...
apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...
apidays LIVE LONDON - Old meets New - Managing transactions on the edge of th...
 
OpenWorld: 4 Real-world Cloud Migration Case Studies
OpenWorld: 4 Real-world Cloud Migration Case StudiesOpenWorld: 4 Real-world Cloud Migration Case Studies
OpenWorld: 4 Real-world Cloud Migration Case Studies
 
How Schneider Electric Transformed Front-office Operations With Real-time Dat...
How Schneider Electric Transformed Front-office Operations With Real-time Dat...How Schneider Electric Transformed Front-office Operations With Real-time Dat...
How Schneider Electric Transformed Front-office Operations With Real-time Dat...
 
Case Study: Lessons from Newell Rubbermaid's SAP HANA Proof of Concept
Case Study: Lessons from Newell Rubbermaid's SAP HANA Proof of ConceptCase Study: Lessons from Newell Rubbermaid's SAP HANA Proof of Concept
Case Study: Lessons from Newell Rubbermaid's SAP HANA Proof of Concept
 
Migration to Oracle ERP Cloud: A must read winning recipe for all
Migration to Oracle ERP Cloud: A must read winning recipe for allMigration to Oracle ERP Cloud: A must read winning recipe for all
Migration to Oracle ERP Cloud: A must read winning recipe for all
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
 
SAP migration and integration success
SAP migration and integration successSAP migration and integration success
SAP migration and integration success
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
CDS Overview (May 2015)
CDS Overview (May 2015)CDS Overview (May 2015)
CDS Overview (May 2015)
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
Assessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-ModelAssessing New Database Capabilities – Multi-Model
Assessing New Database Capabilities – Multi-Model
 
Microsoft Dynamics 365 IA - Copilot/ Fabric
Microsoft Dynamics 365 IA - Copilot/ FabricMicrosoft Dynamics 365 IA - Copilot/ Fabric
Microsoft Dynamics 365 IA - Copilot/ Fabric
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
Social Solutions Apricot 360: Client Case Management Software
Social Solutions Apricot 360: Client Case Management SoftwareSocial Solutions Apricot 360: Client Case Management Software
Social Solutions Apricot 360: Client Case Management Software
 
Pratibha Chaudhary 6 years Exp SAP ABAP
Pratibha Chaudhary 6 years Exp SAP ABAPPratibha Chaudhary 6 years Exp SAP ABAP
Pratibha Chaudhary 6 years Exp SAP ABAP
 
How to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSIHow to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSI
 
Jan 2017 Investment Recommendation for Tableau
Jan 2017 Investment Recommendation for TableauJan 2017 Investment Recommendation for Tableau
Jan 2017 Investment Recommendation for Tableau
 
The HANA effect
The HANA effect The HANA effect
The HANA effect
 
How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...
 
Bulletproof Your QAD ERP to Cloud | JK Tech Webinar
Bulletproof Your QAD ERP to Cloud | JK Tech WebinarBulletproof Your QAD ERP to Cloud | JK Tech Webinar
Bulletproof Your QAD ERP to Cloud | JK Tech Webinar
 

Hadoop on retail

  • 1. Hadoop running on Retail Business. Author: Douglas Bernardini
  • 2. Onofre Profile • Onofre: CVS's operation in Brazil. • Pharmacy network: 50 stores. • 2,100 employees • 01 distribution center • 37% of sales through e-commerce • 25% through mobile/tablet • Call center: 201 positions • No omni-channel process.
  • 3. IT perspective • SAP/ECC IS Retail as the central component • SAP/BObjects: limited per-user licences • Finance team only • Legacy POS system: COBOL • Okidata/Itautec • Legacy e-commerce: Vanroy • Customized .NET solution • 100% internal datacenter operation • No outsourcing • No cloud services
  • 5. Case: Sales Performance Info • No mobile access to sales reports: desktop only. • No friendly, summarized dashboard • +1 day delay: today's reports show only yesterday's sales. • Slow performance: more than 1 minute per report • E-commerce • No sales results by region • No complete conversion-rate report • Main physical store needs • No report of sales lost to stock-outs
  • 6. Project ‘WEB Pharma’ • Objectives • Build a user-friendly dashboard with the main retail business decision information. • Be mobile! Users must be able to use the dashboard remotely from internet-connected devices. • Summarize e-commerce and physical-store sales together • All reports must be delivered in less than 10 s • Strategy • Export legacy data to an external cloud data server (no use of the internal datacenter). • Data streaming must process sales data from the last hour. • Premises • 100% secure connection (SOX compliance) • Low CAPEX and limited budget • 03 months deadline.
  • 7. Big Data Architecture (diagram; labels only) • Sources: Brick & Mortar Store (Cobol, Okidata); E-commerce (WEB) (Vanroy, .NET) • Transfer: .csv, SSH • Data Pipe • Data Integrator: Apache Flume • Workflow Scheduling: Apache Oozie • CDH3: MapReduce, HDFS, HBase, HiveSQL • Sqoop • MySQL/S3 • User Interface: Tableau Connector, Tableau Online, D3 Visualization
  • 8. BI x Big Data: Comparison (Business Intelligence | Big Data) • Volume: Terabytes | Petabytes • Velocity: Batch, Real-Time, Near RT | Streams • Data Source: Internal | External • Value: One single source of truth | Statistical and hypothetical • Variety: Single sources | Probabilistic and multi-factor • Data sharpness: Consistent and reliable | Better to be roughly right than precisely wrong • Frequency: Millions of records per minute | Billions of documents per second • Master Data: Important part of results | Not necessary • Servers Sizing: Evolution planned. Could be done internally. Elastic Cloud considered an alternative. | Storage/memory growing faster than ever. Elastic Cloud is crucial.
  • 9. BI x Big Data: Comparison (Business Intelligence | Big Data) • Main Business Objective: Business monitoring, internal insights and process optimization | Data monetization, business metamorphosis and new opportunities • Object of analysis: Current business process | Non-existent business process • Data Source: Internal | External • Approach: Reactive. What happened, and what can we do about it? | Predictive. What will happen tomorrow, and how can we be prepared? • Mindset: Examine the data and find the root causes of problems, proposing process optimization | How can we make some REAL money with this data? • Data sharpness: Consistent and reliable | Better to be roughly right than precisely wrong • Scope: 02 or 03 departments | Entire company, across departments • Business Model: Pre-existing benchmark | No benchmark • View Modeling: Preconceived, KPIs already pre-formatted | No clear idea of the exact objective and business needs
  • 10. Why AWS? • Ready-to-go cloud services • Scalable • Cost-effective • In this project • Ready secure internet connection (SSH) • S3: simple web services interface • EC2: ready-to-go Linux CentOS template • Cloudera partner • Pipeline: reliably process and move data between different AWS compute and storage services
  • 11. Server Highlights • 21.5 TB of historic data (03 years) • Risk: poor data-transfer network • AWS Import/Export Snowball • Data • Data transfer estimate: 140 MB per data package • 200 packages/day: 28 GB/day • PRD server config • Red Hat 6.4, 256 GB of RAM • Processor: 4 x 12 cores, 5 GHz • 2x420 storage (10G) • Users • 350 users • 50 stores • 40 MB/day each • Network bandwidth • Inbound: 5 Gb • Outbound: 10 Gb
  • 12. Hadoop Highlights • Objective: fast response for end users • Master nodes: 01 (*) • Slave nodes: 07 • Sqoop: Hadoop-native connector to MySQL • Hue: SQL-like UI for DBAs, used for data validation • Oozie: scheduler system to manage Hadoop jobs • Triggered by time (frequency) and data availability • Hive: querying large datasets • SQL-like language: HiveQL • (*) Modified after go-live
  • 13. Why Cloudera? • Stable Hadoop distribution • Simple administration: Cloudera Manager • Integrated • In this project • Ready-to-go Tableau connector • CDH3: open source (cost-effective) • Fast installation • Fast tuning
  • 14. Why Tableau? • User-friendly, with a high impact on user satisfaction • Ready-to-go mobile application • Easy to install on Android. • In this project • Cost-effective solution • Lowest price per end user • Ready-to-go retail template • Brazilian localization done • BC in Retail
  • 15. Why Not SAP? • High user-licence cost (project demands 350 new users) • SAP/Business Objects retail template with low adherence • Huge investment in customized reporting • Hardware processing contention with financial users • Impact on monthly closing results reporting • High investment in hardware instances to get the expected performance • 2013: no AWS instance ready for SAP/BOBJ • SAP/HANA not mature yet • Lack of consultants • No business case (retail) running in Brazil
  • 16. Project Methodology • BI projects: intensive validation with REAL data • Key users must really believe in the new indicators (expectations). • Intense deliverable schedule: early delivery for validation • Minimum project scope: 10 reports • 07 standard: Tableau • 03 customized: D3 visualization • 01 dashboard • Tableau • Project implementation strategy: PoC • Consistent validation: 02 stores & 10 users • Testing in a real environment: consistent issues log (performance)
  • 17. Project Schedule (Gantt chart; duration in months: 01, 02, 03; milestones: PoC, Go-Live) • Activities: AWS S3, EC2 & Data Pipes installation; Cloudera (Hadoop) CDH3 installation; Flume & Hive set-up; Integrations; CSV data entry; Tableau connector; Sqoop set-up; Visualization; Indicators design; Tableau configuration; D3 configuration; Testing & QA; Load historic data; Final devs; Validation PoC (02 stores); Adjustments & tuning; GO; Final PRD delivery; Assisted operation
  • 18. Project Results • Response time: 0.4 s • High adherence from users • Data visualization triggered several business initiatives • 2nd wave approved, with 02 additional dashboards and 32 new reports • WEB reports demonstrated the structuring of an omni-channel process and new business needs
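
The data-transfer estimate on slide 11 (140 MB per package, 200 packages per day, 28 GB/day) can be checked with a short calculation. The sketch below is only an illustration: the function names are hypothetical, decimal units (1 GB = 1000 MB, 1 TB = 1000 GB) are an assumption, and the comparison against the 21.5 TB of historic data is not a figure from the slides, just a rough sense of scale for why a bulk-import service is listed next to the network risk.

```python
# Figures taken from slide 11 (Server Highlights).
PACKAGE_SIZE_MB = 140      # size of one data package
PACKAGES_PER_DAY = 200     # packages sent per day
HISTORIC_DATA_TB = 21.5    # three years of historic data

# Assumption: decimal units (1 GB = 1000 MB, 1 TB = 1000 GB).
MB_PER_GB = 1000
GB_PER_TB = 1000


def daily_volume_gb(package_mb: float, packages_per_day: int) -> float:
    """Daily transfer volume in GB for a given package size and count."""
    return package_mb * packages_per_day / MB_PER_GB


def days_at_daily_rate(historic_tb: float, daily_gb: float) -> float:
    """Days needed to move the historic data at the daily package rate.

    Hypothetical comparison, not a number stated in the slides.
    """
    return historic_tb * GB_PER_TB / daily_gb


daily_gb = daily_volume_gb(PACKAGE_SIZE_MB, PACKAGES_PER_DAY)
historic_days = days_at_daily_rate(HISTORIC_DATA_TB, daily_gb)

print(f"Daily volume: {daily_gb:.0f} GB/day")
print(f"Historic load at that rate: about {historic_days:.0f} days")
```

Under these assumptions the daily figure matches the 28 GB/day on the slide, and the historic load would take far longer than the 03-month project deadline if it were sent at the same package rate.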