SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Big Data : From HindSight To InSight To
ForeSight
-Delivering Data Driven Business Insights
Adopt
MarketInnovate
Sunil S Ranka
Director – Big Data and Advance Analytics
Key Topics
 About Jade
 About Me
 What is Big Data
 Meaning of HindSight to Insight to Foresight
 How to make progression
 Business Impact
 Real World Example
 Next Steps
Technology Projects750+
200+ Customers
Referenceable
100%
98%Customer
Retention
500IT professional
worldwide
Services High-Tech Manufacturing Energy Social Media & Entertainment
5 Global Delivery
Centers
8 Offices
Worldwide
Atlanta
Pune
Noida
San Jose
Los Angeles
London
Hyderabad
San Diego
Global Delivery Model Serving Many Industries
Strategic
Partnerships
Salesforce.com
Sales, Service, Marketing,
force.com
Testing
Tools/Frameworks
QC, QTP, Selenium, LoadRunner,
JIRA Bugzilla, JUnit, TestNG
Microsoft
Dynamics, SharePoint,
Office 365, Lync, BI
Custom Development
Java, .Net, J2EE, Product
Engineering, Open Source
Technologies
Integration
Oracle SOA, Tibco, Weblogic,
Oracle Cloud Platform, ICS, JCS,
Mulesoft, Dell Boomi
Infrastructure
Management
IBM AIX, HP-UX, RHEL,
OEL Linux, Windows Server
Cloud Financials, Projects, SCM,
HCM and EBS Financials,
Procurement, Value Chain, CRM,
Demantra, Agile, GRC
Oracle EBS Suite
ServiceNow
IT Service Automation Applications,
CreateNow Development Suite,
Orchestration, Discovery
Big Data & Analytics
Hadoop, KNIME, R, Tableau, Hadoop
Jade Global Clientele (Representative list)
Dilbert On Big Data
During a Data Analytics Session
About Me
• Venture Partner : Investing and Advisor with early stage startups focusing on Data.
• Director – Big Data and Advance Analytics
• Oracle ACE (Business Intelligence with Proficiency in Big Data)
• Extensively worked with fortune 500 leaders.
• Held positions of Head Of Product Development, Architect, etc.
• http://sranka.wordpress.com, sunil_ranka
• Featured Tech writer for IT Next Magazine.
• Speaking engagements at following conferences :
• COLLABORATE ( 2009, 2010 , 2011 ,2012, 2013,2015)
• BIWA SIG TechCast Series (2010 , 2011 , 2012, 2013,2014,2016),
• NorCal OAUG-2010 at Santa Clara Convention Center, CA
• Session speaker at NoCouG in San Francisco
• Oracle Open World ( 2009 , 2010 , 2012)
My Tag Line :: “Superior BI is the antidote to Business Failure”
Why Data Is Important
Data is the new Oil. Data is just like crude. It’s
valuable, but if unrefined it cannot really be used.
– Clive Humby, DunnHumby
11
We have for the first time an economy based on
a key resource [Information] that is not only renewable,
but self-generating. Running out of it is not a problem,
but drowning in it is.
– John Naisbitt
Big Data and Analytics is Helping
Smarter Revenue
Management
Smarter Healtcare
Analytics
$16Billion
Reduced
Improper Payment
Smarter Crime
Prevention
Helps detect life
threatening conditions
up to 24 hours sooner
30%
Cut
serious crime
by
Tax Agency
* Courtesy - IBM
Big Data Definition
No single standard definition…
“Big Data” is data whose scale, diversity, and complexity require new
architecture, techniques, algorithms, and analytics to manage it and
extract value and hidden knowledge from it…
What is Big Data
Big data Represents new data features created by today’s Data Driven Organization for Decision
Making
volume
Variety
Velocity
Value
Data At Scale
Terabyte To Petabyte of Data
Data In Many Forms
Structured, unstructured, text, Media
Data In Motion
Analysis of stream data to make decision in real time
Data with Insight
Deriving valuable insight from the data
Characteristicsofbigdata
Harnessing Big Data
 OLTP: Online Transaction Processing (DBMSs)
 OLAP: Online Analytical Processing (Data Warehousing)
 RTAP: Real-Time Analytics Processing (Big Data Architecture & technology)
15
Who’s Generating Big Data
Social media and networks
(all of us are generating data)
Scientific instruments
(collecting all sorts of data)
Mobile devices
(tracking all objects all the time)
Sensor technology and networks
(measuring all kinds of data)
 The progress and innovation is no longer hindered by the ability to
collect data
 But, by the ability to manage, analyze, summarize, visualize, and discover
knowledge from the collected data in a timely manner and in a scalable
fashion
16
The Model Has Changed…
 The Model of Generating/Consuming Data has
Changed
Old Model: Few companies are generating data, all others are consuming data
New Model: all of us are generating data, and all of us are consuming data
17
Hindsight to Insight to Foresight
Essentials for Analytics Strategy
Where are we ?
• What business decisions do we not have sufficient
information ?
Where do we want to be ?
• What is our vision for information accessibility and
usage?
• What should the high-level BI roadmap of initiatives look
like ?
What capabilities will get us there ?
• What capabilities are required to make information
available and useful ?
What are the dependencies ?
• What metrics should we use to manage the
implementation and fulfill BI business goals ?
• How should we design the processes, applications, and
organization to fulfill our BI vision?
• What toolsets should I use to fulfill our BI vision?
Definitions
Hindsight ( What happed ?)
Understanding of a situation or event only after it has happened
or developed – What lesson can be learnt ?
Insight ( Why it is happening?)
The capacity to gain an accurate and deep intuitive
understanding of a person or thing. – What matters now ?
Foresight ( What will happen ?)
The ability to predict or the action of predicting what will happen
or be needed in the future. –What matters next ?
Challenges in Handling Big Data
 The Bottleneck is in technology
 New architecture, algorithms, techniques are needed
 Also in technical skills
 Experts in using the new technology and dealing with
big data
21
What’s driving Big Data
- Ad-hoc querying and reporting
- Data mining techniques
- Structured data, typical sources
- Small to mid-size datasets
- Optimizations and predictive analytics
- Complex statistical analysis
- All types of data, and many sources
- Very large datasets
- More of a real-time
22
Value of Big Data Analytics
 Big data is more real-time in
nature than traditional DW
applications
 Traditional DW architectures (e.g.
Exadata, Teradata) are not well-
suited for big data apps
 Shared nothing, massively
parallel processing, scale out
architectures are well-suited for
big data apps
23
Analytics Maturity Pyramid
No Reporting
Struggling to get basic information
Reactive Analytics
Concerned with current Issues
What Happened ?
Diagnostic Analytics
Hindsight
Why it Happened ?
Predictive Analytics
Insight
What will Happened?
Prescriptive Analytics
Foresight
What should I do ?
Hindsight
insight
foresight
God Bless You!!
How it changes ?
Traditional Reporting Business Intelligence Analytics and Big Data
Push Pull Predictive
Fixed Format Interactive/Slice and Dice Interactive and business driven
Typically transacation oriented Applies to all business functions, “Front
Office” and “Back Office”
Applies to mostly front office CRM and
product development
Mostly internal data silos Still mostly internal and structure data,
but bringing together more data silos
Combines internal and significant
external data, often unstruture and
large data
Implemented post transactional system
implementations
Implemented post transactional system
but tighly coupled with transaction
system
Implemented as a business capability,
with dedicated analytics team.
Technology is not a differentiator More technology differentiator, but
leaning more on packaged solution
Many specialist and highly
differentiating cutting edge
technologies
“rear view mirror”(what happened ?) Still “rear view mirror”,but looking at
what happened and why ?
Forward Looking
Business Impact
Business Value
BI Analytics
Big Data
Analytics
BI Big Data and BI
Proactive
Reactive
Data Size
Analytics
Capability
• The lower left quadrant represents traditional business
• In the upper left quadrant, you have traditional analytic
processing technologies performing more complex
assessments.
• The lower right quadrant represents the use of big data
technologies to expedite hindsight reporting.
• The upper right quadrant is the sweet spot – big data
analytics – the combination of big data technologies with
predictive and hybrid analytics.
How To Make The Progression
Big Data Needs Diversified Skill Sets
Math and
Operations Research
Expertise
Develop analytic algorithms
Visualization
Expertise
Interpret data sets,
determine correlations and
present in meaningful ways
Tool Developers
Mask complexity and
analytics to lower skills
boundaries
Industry Vertical
Domain Expertise
Develop hypothesis, identify
relevant business issues,
ask the right questions
Data Experts
Data architecture, management,
governance, policy
Decision Making
Executive and
Management
Apply information to solve
business issues
"By 2015, big data demand will reach 4.4 million jobs globally, but only one-third of those jobs will be filled."
Source: Gartner "Gartner's Top Predictions for IT Organizations and Users, 2013 and Beyond: Balancing Economics, Risk, Opportunity and Innovation" 19 Oct 2012
What Technologies We Have
Big Data Technology
31
Where Does Big Data Fit In
Technologies Needed
Analytics Cloud/OnPrem
Data Cloud/OnPrem
Hive Metastore
Elastic Cloud HDFS
Infinite Compute
Hadoop/Spark
Ingest Transform Analyze
External
Dashboards
Internal
Dashboards
Tableau Excel R Zeppelin
Web interface for distributed users
Data set definition
Social metadata dictionary
Export Web interface to dash-
boarding, query, and
data dictionary
Integrated ingestion,
transformation, and
query application for
business analysts
World-class, elastic
Big Data infrastructure
Hybrid Analytics Cloud/On Premises
Analytics Cloud/OnPrem
Analytics Cloud/OnPrem
Hive Metastore
Elastic Cloud HDFS
Infinite Compute
Hadoop/Spark
External
Dashboards
Internal
Dashboards
Tableau Excel R Zeppelin
Web interface for distributed users
Data set definition
Social metadata dictionary
Export Web interface to dash-
boarding, query, and data
dictionary
Integrated ingestion,
transformation, and query
application for business
analysts
World-class, elastic
Big Data infrastructure
Build reports
and
dashboards
Build outgoing
connectors
Ingest Transform Analyze
Business
Analytics, data
science
training
Write ETL and
perform data
engineering
Build
connectors
Hybrid Analytics Cloud/OnPrem
Jade Experience
On-Premise DW to Cloud DW Migration for Leading Web Search Company
 Data base Availability : Most of the time ParaAccell DB database was down
due to Node Failure / Disk Failure. This was highly impacting on data load
process and relevant data is not available when it is required.
 Performance Issue : The query response time was low, which was impacting on
Reports performance.
 Cost / Support Issue : The overall maintenance cost for PADB was higher and
support from vendor was not up to the mark.
Jade Solution:
 Snowflake is cloud based DWH, which has cut down hardware cost
 Zero maintenance cost on hardware and software
 Required ongoing maintenance of database will be handled by Snowflake.
 Implemented Unified framework for Data Load
Business Problem:
Solution Architecture:
Value Addition:
Before Upgrade
After Upgrade
Analyze
Pilot
Migration
Migration
Roadmap
Migrate
 Analyze the current PADB environment and current Inventory
 Analyze the dependencies
 Identify pilot candidate for Migration
 Migration Analysis for pilot candidate
 Migration of pilot candidate
 Migration Validation of pilot candidate
 Build Migration of all objects.
 Include the learning from pilot migration
 Migrate Data from PADB to Snowflake
 Validate the migrated objects
 Point upstream and downstream application to snowflake
Decommissi
on / Other
 Implement Data Security and include required new changes
 Build unified framework to monitor the Data load and performance
 Decommissioned PADB schema
*** Received TDWI-Best Practices Award for 2016 under Emerging Technologies and Methods Category .
Segmenting Customer Base for Target Marketing
Business Problem :
 Identifying segment of customers who could respond to Target Marketing
Jade Solution:
 Performed clustering analysis in order
to group similar customers together
based on the numerical variables such
as Revenue, IT budget, # of
employees, # of IT employees &
manager count
 Since variables are measured on
different scales, the data is
standardized
 The distance matrix between each of
observations is calculated using the
clustering functions in the R package
 The predictive model that is created is
validated using silhouette plot
Smart Sense – Automation of Infra / App Monitoring and Management​
Business Problem:
 Manual monitoring of IT operations
decreasing efficiency and increasing costs.
 Applications downtime causing business loss
and customer trust​
 Lack of automated solutions for repeatable
problems causing organizations to re-vent
the wheel
 False alarms and unwanted alerts taking
away the focus on fixing the core problems
at hand​
Jade Solution:
 Smart Sense to automate Infra / App
monitoring and management thus freeing
the resources to focus on value added tasks​
 Self learning system that adopts to dynamic
environment thus alerting only for critical
business scenarios​
 Analyze to determine anomalistic behavior.
Solution Features
Issue Detection & Correlation with real-time events and
alert streams
Continuous Learning and Self-Optimizing​
Predictive Maintenance
Harnessing Predictive Analytics for Increased Cash Flow​
ERP
 Invoice base amount​
 Payment Term​
 ERP​
 No of total invoice paid​
 No of total outstanding invoices  Random Forest Trees​
 Association Rule​
 Cluster Analysis​
INCREASED
CASH FLOW​
Analysis Across
“Order-to-Cash” to
Improve DSO​
Business Problem:
 Accounts receivable (AR) can be a source of financial difficulty for firms when they
are not efficiently managed and are underperforming​
 Lack of systems and process that Identifies high risk invoices and customers for
better collection strategies​
 Better financial contingency measures if systems can predict with high accuracy if an
invoice will be paid on time or not and can provide estimates of the magnitude of
the delay​
Jade Solution:
 Jade’s AR Analytics improves Cash Flow by:
 Reducing Days Sales Outstanding (DSO)
 Freeing up cash​
 Enabling customized collection actions tailored for each invoice or customer through
predictive models​
 Effective management of AR would have positive impact on the financial performance of the firm​
 Provides solution for reducing outstanding receivables through improvements in the collections
strategy.​
Closing Thoughts
 today is all about engaging with analytics, in the future, it will be about
engaging the analytics battle with the help of technology.
 The leader will be decided by whoever has been more effective in
harnessing all this data to make better decisions
 Using the right technology to funnel, process and analyze all this data
for valuable actionable insights and decisions helping effective
business outcomes
How We Can Help

Weitere ähnliche Inhalte

Was ist angesagt?

The BA Role in Data Projects
The BA Role in  Data ProjectsThe BA Role in  Data Projects
The BA Role in Data Projects
IIBA UK Chapter
 

Was ist angesagt? (20)

The BA Role in Data Projects
The BA Role in  Data ProjectsThe BA Role in  Data Projects
The BA Role in Data Projects
 
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Looking for Disruptive Business Models in Higher Education
Looking for Disruptive Business Models in Higher EducationLooking for Disruptive Business Models in Higher Education
Looking for Disruptive Business Models in Higher Education
 
Storytelling with data and data visualization
Storytelling with data and data visualizationStorytelling with data and data visualization
Storytelling with data and data visualization
 
Creando la estrategia de gestiĂłn de datos para tu organizaciĂłn
Creando la estrategia de gestiĂłn de datos para tu organizaciĂłnCreando la estrategia de gestiĂłn de datos para tu organizaciĂłn
Creando la estrategia de gestiĂłn de datos para tu organizaciĂłn
 
Self-Service Analytics Framework - Connected Brains 2018
Self-Service Analytics Framework - Connected Brains 2018Self-Service Analytics Framework - Connected Brains 2018
Self-Service Analytics Framework - Connected Brains 2018
 
Advanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITIAdvanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITI
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
 
The need for Business design to underpin strategic and operational agility
The need for Business design to underpin strategic and operational agility The need for Business design to underpin strategic and operational agility
The need for Business design to underpin strategic and operational agility
 
Three Big Data Case Studies
Three Big Data Case StudiesThree Big Data Case Studies
Three Big Data Case Studies
 
Digital transformation with microsoft data and ai
Digital transformation with microsoft data and ai Digital transformation with microsoft data and ai
Digital transformation with microsoft data and ai
 
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data DictionaryRWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
RWDG Slides: Governing Your Data Catalog, Business Glossary, and Data Dictionary
 
Visualisation & Storytelling in Data Science & Analytics
Visualisation & Storytelling in Data Science & AnalyticsVisualisation & Storytelling in Data Science & Analytics
Visualisation & Storytelling in Data Science & Analytics
 
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
Building a Data Driven Culture and AI Revolution With Gregory Little | Curren...
 
Tableau ppt
Tableau pptTableau ppt
Tableau ppt
 
Banking Sector and Business Intelligence
Banking Sector and Business IntelligenceBanking Sector and Business Intelligence
Banking Sector and Business Intelligence
 
Data Strategy
Data StrategyData Strategy
Data Strategy
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Digital transformation in banking - PiServe
Digital transformation in banking - PiServeDigital transformation in banking - PiServe
Digital transformation in banking - PiServe
 

Ă„hnlich wie Big Data : From HindSight to Insight to Foresight

Big Data - Bridging Technology and Humans
Big Data - Bridging Technology and HumansBig Data - Bridging Technology and Humans
Big Data - Bridging Technology and Humans
Mark Laurance
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
Bala Iyer
 
Ch1-Introduction to Business Intelligence.pptx
Ch1-Introduction to Business Intelligence.pptxCh1-Introduction to Business Intelligence.pptx
Ch1-Introduction to Business Intelligence.pptx
sommaikhantong
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics Capability
Bala Iyer
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
Raul Chong
 
Big data destruction of bus. models
Big data destruction of bus. modelsBig data destruction of bus. models
Big data destruction of bus. models
Edgar Revilla Lavado
 

Ă„hnlich wie Big Data : From HindSight to Insight to Foresight (20)

Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Big Data - Bridging Technology and Humans
Big Data - Bridging Technology and HumansBig Data - Bridging Technology and Humans
Big Data - Bridging Technology and Humans
 
Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
Big Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesBig Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped Opportunities
 
Big data and your career final
Big data and your career finalBig data and your career final
Big data and your career final
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
 
BIG DATA.pptx
BIG DATA.pptxBIG DATA.pptx
BIG DATA.pptx
 
The value of our data
The value of our dataThe value of our data
The value of our data
 
Ch1-Introduction to Business Intelligence.pptx
Ch1-Introduction to Business Intelligence.pptxCh1-Introduction to Business Intelligence.pptx
Ch1-Introduction to Business Intelligence.pptx
 
Bardess Moderated - Analytics and Business Intelligence - Society of Informat...
Bardess Moderated - Analytics and Business Intelligence - Society of Informat...Bardess Moderated - Analytics and Business Intelligence - Society of Informat...
Bardess Moderated - Analytics and Business Intelligence - Society of Informat...
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics Capability
 
BIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptxBIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptx
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to know
 
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201... It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
 
Big data
Big dataBig data
Big data
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Big data destruction of bus. models
Big data destruction of bus. modelsBig data destruction of bus. models
Big data destruction of bus. models
 
big data analytics pgpmx2015
big data analytics pgpmx2015big data analytics pgpmx2015
big data analytics pgpmx2015
 
Analytics for actuaries cia
Analytics for actuaries ciaAnalytics for actuaries cia
Analytics for actuaries cia
 

KĂĽrzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
Christopher Logan Kennedy
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

KĂĽrzlich hochgeladen (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 

Big Data : From HindSight to Insight to Foresight

  • 1. Big Data : From HindSight To InSight To ForeSight -Delivering Data Driven Business Insights Adopt MarketInnovate Sunil S Ranka Director – Big Data and Advance Analytics
  • 2. Key Topics  About Jade  About Me  What is Big Data  Meaning of HindSight to Insight to Foresight  How to make progression  Business Impact  Real World Example  Next Steps
  • 4. Services High-Tech Manufacturing Energy Social Media & Entertainment 5 Global Delivery Centers 8 Offices Worldwide Atlanta Pune Noida San Jose Los Angeles London Hyderabad San Diego Global Delivery Model Serving Many Industries
  • 5. Strategic Partnerships Salesforce.com Sales, Service, Marketing, force.com Testing Tools/Frameworks QC, QTP, Selenium, LoadRunner, JIRA Bugzilla, JUnit, TestNG Microsoft Dynamics, SharePoint, Office 365, Lync, BI Custom Development Java, .Net, J2EE, Product Engineering, Open Source Technologies Integration Oracle SOA, Tibco, Weblogic, Oracle Cloud Platform, ICS, JCS, Mulesoft, Dell Boomi Infrastructure Management IBM AIX, HP-UX, RHEL, OEL Linux, Windows Server Cloud Financials, Projects, SCM, HCM and EBS Financials, Procurement, Value Chain, CRM, Demantra, Agile, GRC Oracle EBS Suite ServiceNow IT Service Automation Applications, CreateNow Development Suite, Orchestration, Discovery Big Data & Analytics Hadoop, KNIME, R, Tableau, Hadoop
  • 6. Jade Global Clientele (Representative list)
  • 8. During a Data Analytics Session
  • 9. About Me • Venture Partner : Investing and Advisor with early stage startups focusing on Data. • Director – Big Data and Advance Analytics • Oracle ACE (Business Intelligence with Proficiency in Big Data) • Extensively worked with fortune 500 leaders. • Held positions of Head Of Product Development, Architect, etc. • http://sranka.wordpress.com, sunil_ranka • Featured Tech writer for IT Next Magazine. • Speaking engagements at following conferences : • COLLABORATE ( 2009, 2010 , 2011 ,2012, 2013,2015) • BIWA SIG TechCast Series (2010 , 2011 , 2012, 2013,2014,2016), • NorCal OAUG-2010 at Santa Clara Convention Center, CA • Session speaker at NoCouG in San Francisco • Oracle Open World ( 2009 , 2010 , 2012) My Tag Line :: “Superior BI is the antidote to Business Failure”
  • 10. Why Data Is Important
  • 11. Data is the new Oil. Data is just like crude. It’s valuable, but if unrefined it cannot really be used. – Clive Humby, DunnHumby 11 We have for the first time an economy based on a key resource [Information] that is not only renewable, but self-generating. Running out of it is not a problem, but drowning in it is. – John Naisbitt
  • 12. Big Data and Analytics is Helping Smarter Revenue Management Smarter Healtcare Analytics $16Billion Reduced Improper Payment Smarter Crime Prevention Helps detect life threatening conditions up to 24 hours sooner 30% Cut serious crime by Tax Agency * Courtesy - IBM
  • 13. Big Data Definition No single standard definition… “Big Data” is data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it…
  • 14. What is Big Data Big data Represents new data features created by today’s Data Driven Organization for Decision Making volume Variety Velocity Value Data At Scale Terabyte To Petabyte of Data Data In Many Forms Structured, unstructured, text, Media Data In Motion Analysis of stream data to make decision in real time Data with Insight Deriving valuable insight from the data Characteristicsofbigdata
  • 15. Harnessing Big Data  OLTP: Online Transaction Processing (DBMSs)  OLAP: Online Analytical Processing (Data Warehousing)  RTAP: Real-Time Analytics Processing (Big Data Architecture & technology) 15
  • 16. Who’s Generating Big Data Social media and networks (all of us are generating data) Scientific instruments (collecting all sorts of data) Mobile devices (tracking all objects all the time) Sensor technology and networks (measuring all kinds of data)  The progress and innovation is no longer hindered by the ability to collect data  But, by the ability to manage, analyze, summarize, visualize, and discover knowledge from the collected data in a timely manner and in a scalable fashion 16
  • 17. The Model Has Changed…  The Model of Generating/Consuming Data has Changed Old Model: Few companies are generating data, all others are consuming data New Model: all of us are generating data, and all of us are consuming data 17
  • 18. Hindsight to Insight to Foresight
  • 19. Essentials for Analytics Strategy Where are we ? • What business decisions do we not have sufficient information ? Where do we want to be ? • What is our vision for information accessibility and usage? • What should the high-level BI roadmap of initiatives look like ? What capabilities will get us there ? • What capabilities are required to make information available and useful ? What are the dependencies ? • What metrics should we use to manage the implementation and fulfill BI business goals ? • How should we design the processes, applications, and organization to fulfill our BI vision? • What toolsets should I use to fulfill our BI vision?
  • 20. Definitions Hindsight ( What happed ?) Understanding of a situation or event only after it has happened or developed – What lesson can be learnt ? Insight ( Why it is happening?) The capacity to gain an accurate and deep intuitive understanding of a person or thing. – What matters now ? Foresight ( What will happen ?) The ability to predict or the action of predicting what will happen or be needed in the future. –What matters next ?
  • 21. Challenges in Handling Big Data  The Bottleneck is in technology  New architecture, algorithms, techniques are needed  Also in technical skills  Experts in using the new technology and dealing with big data 21
  • 22. What’s driving Big Data - Ad-hoc querying and reporting - Data mining techniques - Structured data, typical sources - Small to mid-size datasets - Optimizations and predictive analytics - Complex statistical analysis - All types of data, and many sources - Very large datasets - More of a real-time 22
  • 23. Value of Big Data Analytics  Big data is more real-time in nature than traditional DW applications  Traditional DW architectures (e.g. Exadata, Teradata) are not well- suited for big data apps  Shared nothing, massively parallel processing, scale out architectures are well-suited for big data apps 23
  • 24. Analytics Maturity Pyramid No Reporting Struggling to get basic information Reactive Analytics Concerned with current Issues What Happened ? Diagnostic Analytics Hindsight Why it Happened ? Predictive Analytics Insight What will Happened? Prescriptive Analytics Foresight What should I do ? Hindsight insight foresight God Bless You!!
  • 25. How it changes ? Traditional Reporting Business Intelligence Analytics and Big Data Push Pull Predictive Fixed Format Interactive/Slice and Dice Interactive and business driven Typically transacation oriented Applies to all business functions, “Front Office” and “Back Office” Applies to mostly front office CRM and product development Mostly internal data silos Still mostly internal and structure data, but bringing together more data silos Combines internal and significant external data, often unstruture and large data Implemented post transactional system implementations Implemented post transactional system but tighly coupled with transaction system Implemented as a business capability, with dedicated analytics team. Technology is not a differentiator More technology differentiator, but leaning more on packaged solution Many specialist and highly differentiating cutting edge technologies “rear view mirror”(what happened ?) Still “rear view mirror”,but looking at what happened and why ? Forward Looking
  • 27. Business Value BI Analytics Big Data Analytics BI Big Data and BI Proactive Reactive Data Size Analytics Capability • The lower left quadrant represents traditional business • In the upper left quadrant, you have traditional analytic processing technologies performing more complex assessments. • The lower right quadrant represents the use of big data technologies to expedite hindsight reporting. • The upper right quadrant is the sweet spot – big data analytics – the combination of big data technologies with predictive and hybrid analytics.
  • 28. How To Make The Progression
  • 29. Big Data Needs Diversified Skill Sets Math and Operations Research Expertise Develop analytic algorithms Visualization Expertise Interpret data sets, determine correlations and present in meaningful ways Tool Developers Mask complexity and analytics to lower skills boundaries Industry Vertical Domain Expertise Develop hypothesis, identify relevant business issues, ask the right questions Data Experts Data architecture, management, governance, policy Decision Making Executive and Management Apply information to solve business issues "By 2015, big data demand will reach 4.4 million jobs globally, but only one-third of those jobs will be filled." Source: Gartner "Gartner's Top Predictions for IT Organizations and Users, 2013 and Beyond: Balancing Economics, Risk, Opportunity and Innovation" 19 Oct 2012
  • 32. Where Does Big Data Fit In
  • 34. Analytics Cloud/OnPrem Data Cloud/OnPrem Hive Metastore Elastic Cloud HDFS Infinite Compute Hadoop/Spark Ingest Transform Analyze External Dashboards Internal Dashboards Tableau Excel R Zeppelin Web interface for distributed users Data set definition Social metadata dictionary Export Web interface to dash- boarding, query, and data dictionary Integrated ingestion, transformation, and query application for business analysts World-class, elastic Big Data infrastructure Hybrid Analytics Cloud/On Premises
  • 35. Analytics Cloud/OnPrem Analytics Cloud/OnPrem Hive Metastore Elastic Cloud HDFS Infinite Compute Hadoop/Spark External Dashboards Internal Dashboards Tableau Excel R Zeppelin Web interface for distributed users Data set definition Social metadata dictionary Export Web interface to dash- boarding, query, and data dictionary Integrated ingestion, transformation, and query application for business analysts World-class, elastic Big Data infrastructure Build reports and dashboards Build outgoing connectors Ingest Transform Analyze Business Analytics, data science training Write ETL and perform data engineering Build connectors Hybrid Analytics Cloud/OnPrem
  • 37. On-Premise DW to Cloud DW Migration for Leading Web Search Company  Data base Availability : Most of the time ParaAccell DB database was down due to Node Failure / Disk Failure. This was highly impacting on data load process and relevant data is not available when it is required.  Performance Issue : The query response time was low, which was impacting on Reports performance.  Cost / Support Issue : The overall maintenance cost for PADB was higher and support from vendor was not up to the mark. Jade Solution:  Snowflake is cloud based DWH, which has cut down hardware cost  Zero maintenance cost on hardware and software  Required ongoing maintenance of database will be handled by Snowflake.  Implemented Unified framework for Data Load Business Problem: Solution Architecture: Value Addition: Before Upgrade After Upgrade Analyze Pilot Migration Migration Roadmap Migrate  Analyze the current PADB environment and current Inventory  Analyze the dependencies  Identify pilot candidate for Migration  Migration Analysis for pilot candidate  Migration of pilot candidate  Migration Validation of pilot candidate  Build Migration of all objects.  Include the learning from pilot migration  Migrate Data from PADB to Snowflake  Validate the migrated objects  Point upstream and downstream application to snowflake Decommissi on / Other  Implement Data Security and include required new changes  Build unified framework to monitor the Data load and performance  Decommissioned PADB schema *** Received TDWI-Best Practices Award for 2016 under Emerging Technologies and Methods Category .
  • 38. Segmenting Customer Base for Target Marketing Business Problem :  Identifying segment of customers who could respond to Target Marketing Jade Solution:  Performed clustering analysis in order to group similar customers together based on the numerical variables such as Revenue, IT budget, # of employees, # of IT employees & manager count  Since variables are measured on different scales, the data is standardized  The distance matrix between each of observations is calculated using the clustering functions in the R package  The predictive model that is created is validated using silhouette plot
  • 39. Smart Sense – Automation of Infra / App Monitoring and Management​ Business Problem:  Manual monitoring of IT operations decreasing efficiency and increasing costs.  Applications downtime causing business loss and customer trust​  Lack of automated solutions for repeatable problems causing organizations to re-vent the wheel  False alarms and unwanted alerts taking away the focus on fixing the core problems at hand​ Jade Solution:  Smart Sense to automate Infra / App monitoring and management thus freeing the resources to focus on value added tasks​  Self learning system that adopts to dynamic environment thus alerting only for critical business scenarios​  Analyze to determine anomalistic behavior. Solution Features Issue Detection & Correlation with real-time events and alert streams Continuous Learning and Self-Optimizing​ Predictive Maintenance
  • 40. Harnessing Predictive Analytics for Increased Cash Flow​ ERP  Invoice base amount​  Payment Term​  ERP​  No of total invoice paid​  No of total outstanding invoices  Random Forest Trees​  Association Rule​  Cluster Analysis​ INCREASED CASH FLOW​ Analysis Across “Order-to-Cash” to Improve DSO​ Business Problem:  Accounts receivable (AR) can be a source of financial difficulty for firms when they are not efficiently managed and are underperforming​  Lack of systems and process that Identifies high risk invoices and customers for better collection strategies​  Better financial contingency measures if systems can predict with high accuracy if an invoice will be paid on time or not and can provide estimates of the magnitude of the delay​ Jade Solution:  Jade’s AR Analytics improves Cash Flow by:  Reducing Days Sales Outstanding (DSO)  Freeing up cash​  Enabling customized collection actions tailored for each invoice or customer through predictive models​  Effective management of AR would have positive impact on the financial performance of the firm​  Provides solution for reducing outstanding receivables through improvements in the collections strategy.​
  • 41. Closing Thoughts  today is all about engaging with analytics, in the future, it will be about engaging the analytics battle with the help of technology.  The leader will be decided by whoever has been more effective in harnessing all this data to make better decisions  Using the right technology to funnel, process and analyze all this data for valuable actionable insights and decisions helping effective business outcomes
  • 42. How We Can Help

Hinweis der Redaktion

  1. Oil which is the fuel for modern economy for centuries. However, Oil in its raw form has little value. It needs to be refined and separated into a large number of consumer products, from petrol and kerosene to asphalt and chemical reagents used to make plastics and pharmaceuticals. It is also used in manufacturing a wide variety of materials. Big Data is just like oil, in it’s raw form it provide no value to enterprise, until it is processed and valuable and actionable business insights are “distilled”. Just like the technology that made available 100 years ago to discover oil and process it to consumable products. Big Data technology is going to transform and revolutionize the way enterprise get and use.