SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
CARDINALITY
innovative analytics:
- per customer
- per service
- network wide
CONFIDENTIALITY STATEMENT - THIS DOCUMENT AND THE INFORMATION IN IT ARE PROVIDED IN THE STRICTEST COMMERCIAL CONFIDENCE, FOR THE SOLE
PURPOSE OF EVALUATING CARDINALITY LTD AS A SUPPLIER, AND SHALL NOT BE DISCLOSED TO ANY THIRD PARTY OR USED FOR ANY OTHER PURPOSE WITHOUT THE
EXPRESS WRITTEN PERMISSION OF CARDINALITY LTD
Copyright © 2016 Cardinality Ltd., All Rights Reserved. No part of this work, which is
protected by copyright, may be reproduced, stored, transmitted, or disseminated in any form or
by any means without prior written permission from Cardinality Ltd.
Cardinality Overview
q  Cardinality	
  are	
  a	
  group	
  of	
  wireless	
  telecommunica5ons	
  engineers,	
  
architects,	
  data	
  scien5sts,	
  and	
  so8ware	
  developers.	
  	
  
q  The	
  largest	
  Big	
  Data	
  Hadoop	
  Solu5on	
  deployed	
  in	
  any	
  telecommunica5ons	
  
operator	
  in	
  Europe.	
  
q  Cardinality	
  have	
  extensive	
  radio	
  design,	
  op5misa5on	
  and	
  opera5onal	
  
support	
  system	
  (OSS)	
  experience	
  
q  Leveraging	
  innova5ve	
  methodologies	
  and	
  pro-­‐ac5ve	
  approach	
  
Cardinality is….
q  Innova5ve	
  Start-­‐Up,	
  with	
  referenceable	
  customers	
  &	
  sound	
  financial	
  backing	
  
q  Founders	
  have	
  real	
  world	
  Big	
  Data	
  experience	
  with	
  Mobile	
  Network	
  Operators	
  (MNOs)	
  
q  Using	
  proven	
  open	
  source	
  architectures,	
  which	
  has	
  evolved	
  based	
  on	
  analy5cs	
  experience	
  
gained	
  from	
  over	
  4	
  years	
  using	
  Hadoop	
  clusters	
  in	
  mul5ple	
  operators	
  
q  Tier	
  One	
  MNO	
  Big	
  Data	
  contact	
  won	
  in	
  summer	
  2015	
  (via	
  open	
  RFP,	
  against	
  large	
  /	
  established	
  
“big	
  data”	
  companies)	
  
q  Ini5al	
  volumes:	
  	
  
	
  
	
  
	
  
q  Big	
  Data	
  is	
  tainted	
  with	
  hype	
  –	
  Cardinality	
  iden5fy,	
  target,	
  real	
  business	
  problems	
  &	
  solve	
  them	
  
ETL	
  Throughput	
   100K	
  events	
  per	
  sec	
  (per	
  Feed)	
  
Big	
  Data	
  Storage	
   10-­‐12	
  billion	
  complex	
  rows/day	
  (40	
  TB	
  uncompressed	
  
per	
  day)	
  
Cache	
   ~200	
  million	
  complex	
  rows	
  (100GB)	
  
CARDINALITY’S Approach
Market Approach
.
Collect Data from all the network elements
MonetiseStore Visualise
Data Lake: Collect Everything
AnalyseCorrelate
This is a sample text.
Insert your desired
text here. 	
  
here. 	
   This is a sample text.
Insert your desired
text here. 	
  
Data Accessibility
Data Transfer
Data Storage
Data Correlation
Either (real)time
data is not
available or there
is license cost
q too costly to transfer the
data from source to DW
q too late by the time data
reaches the DW
q Complex Correlation
needed
q Batch processing does
not complete in tine
q Not real time
q Existing DW doesn’t
have enough space to
store all data sets
q It’s too expensive to
store all the data sets in
existing DW
Common Issues
 
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Data Lake
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
ETL	
  Engine	
  
Analysis	
  Engine	
  
Data Lake
Cardinality Perception	
  
Visualisa5on	
  
Unconnected	
  &	
  Un-­‐enriched	
  	
  
data	
  silo	
  approach	
  repeated	
  in	
  
	
  New	
  Data	
  Lakes	
  create	
  unusable	
  big	
  data	
  sets	
  
	
  
Data	
  Science	
  driven	
  enriched	
  and	
  connected	
  data	
  sets	
  
	
  imported	
  in	
  Data	
  Lakes	
  using	
  	
  
Cardinality’s	
  Percep5on	
  Plaborm	
  can	
  be	
  easily	
  accessed	
  by	
  
users	
  and	
  machines	
  	
  
Data Silos Vs. Enriched Data Sets
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Tradi&onal  Approach  
DESIGN	
  
Big Data
Strategy
Data Lake
Big Data
Analytics
Architecture
CREATE	
  
BUILD	
  
DELIVER	
  
Cardinality
Perception
How can we help?
Full	
  stack	
  deployment	
  of	
  
Cardinality’s	
  Percep5on	
  plaborm	
  
including	
  a	
  Hadoop	
  based	
  data	
  lake.	
  
Greenfield	
  
Deployment	
  of	
  Cardinality’s	
  	
  	
  
Percep5on	
  plaborm	
  around	
  the	
  
exis5ng	
  	
  Hadoop	
  cluster	
  	
  	
  	
  
Exis5ng	
  Hadoop	
  Cluster	
  
Offerings	
  
Deployment	
  of	
  Cardinality	
  use	
  cases	
  
and	
  data	
  visualisa5on,	
  using	
  
exis5ng	
  Hadoop	
  Cluster	
  and	
  ETL	
  
engine	
  
Use	
  Cases/Data	
  Analy5cs	
   Big	
  Data	
  Consul5ng	
  
Leverage	
  Cardinality’s	
  	
  exper5se	
  in	
  
crea5ng	
  	
  organisa5on	
  	
  wide	
  
strategies,	
  designing	
  big	
  data	
  
architectures	
  and	
  end-­‐2-­‐end	
  	
  data	
  	
  
analysis/use	
  cases	
  
Works with existing hadoop cluster
and flexible options incl. full stack
Open Approach to Infrastructure
Cloud	
  
(Public,	
  Private,	
  or	
  Hybrid)	
  
Appliance	
  
Model	
  
(Pre-­‐Built)	
  
Data	
  Centre	
  
✔	
   ✔	
   ✔	
  
Components Deployed
• ETL Engine
• Data Lake: New Hadoop Cluster
• Magnitude: Cache, SQL, NoSQL
• Marvel: Data Analysis/Use
Cases
• Scene: Visualisation
Data	
  Collec*on	
  
(SFTP,	
  TCP,	
  
WebSocket)	
  
Data/Event	
  
Stream	
  
Data	
  Parser,	
  	
  
Enrichment	
  &	
  
Analysis	
  
DATALAKE&STORAGE
Batch	
  Analysis	
  Spark	
  Jobs	
  
APPLICATIONS
Enterprise
Applications
Interactive Web &
Mobile Applications
BI / Reporting, Ad
Hoc Analysis
Statistical Analysis
DATASOURCE
Probes CDRs Control
Plane
Sensor
Data
Geo-
location
Data
Network
Data
User Plane
Full Stack Deployment
Deployed Components
•  ETL Engine
•  Magnitude: Cache, SQL,
NoSQL
•  Data Analysis/Use Cases
•  Visualisation
Data	
  Collec*on	
  
(SFTP,	
  TCP,	
  
WebSocket)	
  
Data/Event	
  
Stream	
  
Data	
  Parser,	
  	
  
Enrichment	
  &	
  
Analysis	
  
DATALAKE&STORAGE
Batch	
  Analysis	
  Spark	
  Jobs	
  
APPLICATIONS
Enterprise
Applications
Interactive Web &
Mobile Applications
BI / Reporting, Ad
Hoc Analysis
Statistical Analysis
DATASOURCE
Probes CDRs Control
Plane
Sensor
Data
Geo-
location
Data
Network
Data
User Plane
Exis5ng	
  Hadoop	
  
Cluster	
  
Existing Hadoop Cluster
Components Deployed
•  Magnitude: Cache, SQL,
NoSQL
•  Marvel: Data Analysis/
Cardinality use cases
•  Scene: Visualisation
Cardinality’s	
  Data	
  
PipeLine	
  (ETL)	
  
DATALAKE&STORAGE
Batch	
  Analysis	
  Spark	
  Jobs	
  
APPLICATIONS
Enterprise
Applications
Interactive Web &
Mobile Applications
BI / Reporting, Ad
Hoc Analysis
Statistical Analysis
DATASOURCE
Probes CDRs Control
Plane
Sensor
Data
Geo-
location
Data
Network
Data
User Plane
Exis*ng	
  ETL	
  
Exis5ng	
  Hadoop	
  
Cluster	
  
Use Case / Data Analysis
Cardinality Perception
MME	
  Mul5-­‐Vendor	
  Probes	
  
Data	
  Proxy	
  
Firewall	
  
Scene	
  
Voice,	
  SMS,	
  Data	
  
(Mediated	
  &	
  Billed	
  
CDRs)	
  
Marvel	
  
(Data	
  Analysis)	
  
Other	
  Data	
  
Sources	
  
Data	
  Pipeline	
  Data	
  Fetcher	
   Data	
  Parser	
  
Magnitude	
  
(Data	
  Storage)	
  
HDFS	
  
NO	
  SQL	
  
SQL	
  
Cache	
  
Management	
  
Server	
  
	
  
Manage
ment	
  
Cluster	
  
Manager	
  
Component	
  
Repository	
  
Map	
  Reduce	
  	
  
Impala	
  (SQL)	
  
Spark	
  
•  Enterprise Ready (Scale, Robustness,
Secure, Performance) Platform
•  Utilising Enterprise Grade Open
Source Components
•  Vendor Independent, Best in Class
component selection
•  Each component is wrapped for simple
upgrades and version control
•  Architected using Micro Services to
keep platform AGILE
Security	
  
API	
  Open	
  Access	
  (Resbul	
  &	
  SOAP)	
  	
  
Access	
  
Subscriber	
  Driven	
  
Coverage	
  
Sta5s5cal	
  
Repor5ng	
  
Data	
  Visualisa5on	
  
Real Time
Decision
EngineData	
  Feeds	
  
File
Sources
ETLEngine
API’s
Machine
Learning
Data
Hadoop
RealTimePrivacyFilter
(MSISDN,IPaddress,UAAgent,Domainsetc.)
Streaming
Sources
Process	
  Feeds	
  
API
Interface
File
Handler
Stream
Monitor
API
Monitor
Encryption
&Enrichment
Data
Link
Data
Feed
File
Drop
Administration & Monitoring
Analysis Layer
Presentation Layer	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Technology Overview
Behavioral Analytics
q Anomaly Detection Tools
q Recommender Tools
Spatiotemporal Analytics
q Road Network Algorithms
q Spatiotemporal Data Mining
q Spatiotemporal Indexing
Cognitive Networks
q Makovian & Bayesian
Networks
q Deep neural networks
q K-mean Clustering
q Graph analysis
Cognitive Analytics
q Visual Sentiment Analysis
q Emotion Analysis
Example Algorithms
Data Sources
This is a sample text.
Insert your desired text
here. 	
  
This is a sample text.
Insert your desired text
here. 	
  
SAMPLE TEXT
This is a sample text.
Insert your desired text
here. 	
  
This is a sample text.
Insert your desired text
here. 	
  
This is a sample text.
Insert your desired text
here. 	
  
Data Sources
User Plane Data
•  DPI
•  Proxy
•  Probes
Control Plane Data
•  MME/GGSN
•  Probes
•  PCRF
Other Network Data
•  Subscriber Location
•  Probes
•  Network Performance
CDRs
•  Voice
•  SMS
•  Data
•  Roaming
Reference Data
•  Tariff Data
•  Cell Site Reference
•  Demographics
Other Data Sources
•  Marketing Promotions
•  Customer Feedback
DATA	
  
Network
q Radio Analytics
q Congestion Analytics
q Video Analytics
q Location Based Analysis
q Network Performance
Business & Finance
q Cell Profitability
q Tariff Profitability
q Future Network Investment
Analysis
Customer Services
q Near RT customer issues
q Churn Prediction & Prevention
q Customer Loyalty
Marketing
q Customer Segmentation
q Segment of One
q Device & Tariff Upsell
q New Tariffs
q Content Upsell
q OTT impact
360° use of Big Data
Network	
  
Conges5on	
  
Hotspot	
  Analysis	
  
Heat	
  Maps	
  
Web	
  &	
  Video	
  Analy5cs	
  
Performance	
  &	
  Capacity	
  
Analy5cs	
  
Profitability	
  
Marke5ng	
  
User	
  Segmenta5on	
  
Sales	
  &	
  Upsell	
  Targe5ng	
  
Tariff	
  Proposi5ons	
  
Handset	
  &	
  
Consump5on	
  Analy5cs	
  
OTT	
  Service	
  Package	
  
Analysis	
  
Social	
  Media	
  Impact	
  
Customer	
  
Services	
  
Churn	
  &	
  Reten5on	
  
Proac5ve	
  Subscriber	
  
Engagement	
  
Billshock	
  &	
  Selfcare	
  
Event	
  Monitoring	
  
(fes5vals,	
  spor5ng	
  
events)	
  
Finance	
  &	
  
Fraud	
  
Fraud	
  Predic5on	
  
Customer	
  Profitability	
  
Targeted	
  Investment	
  
Analysis	
  
Board	
  
Repor5ng	
  
KPI	
  Metrics	
  
Dashboards	
  
Compliance	
  
Security	
  Monitoring	
  
Criminal	
  Ac5vity	
  
Data	
  Privacy	
  
Freedom	
  of	
  Informa5on	
  
Requests	
  
Compliance	
  Reports	
  
IT	
  
Data	
  Warehouse	
  
Bill	
  Verifica5on	
  
Business	
  Intelligence	
  
Example use cases
Africa MNO
Simple Use cases
&
POC
Example Use Case Value
•  Device	
  Analy5cs	
  –	
  Ability	
  to	
  iden5fy	
  and	
  monitor	
  “rouge”	
  device	
  in	
  the	
  network,	
  
and	
  inves5gate	
  device	
  upsell	
  opportuni5es	
  
•  Data	
  &	
  Voice	
  QoE	
  –	
  Ability	
  to	
  monitor	
  QoE	
  across	
  the	
  network,	
  iden5fica5on	
  of	
  
poten5al	
  op5miza5ons,	
  and	
  to	
  standardise	
  published	
  QoE	
  metrics	
  
•  Subscriber	
  Ac5ve	
  States	
  –	
  Ability	
  to	
  iden5fy	
  the	
  subscriber	
  migra5on	
  and	
  
ac5va5on	
  paherns	
  based	
  on	
  the	
  prepaid	
  datasets(top-­‐up	
  &	
  ac5va5ons).	
  
Use Case: Device AnalyticsDevice Analytics
Type of Devices
•  Smartphones
•  Feature phones]
•  IoT
Manufacturer Distribution
•  Samsung
•  Apple
•  HTC etc.
Operating System Distribution
•  Android
•  iOS
•  Other
Rogue Device Identification
•  Non Standard Devices
•  Poor Voice Experience
•  Poor Data Experience
•  Anomaly Detection
Device Market Performance
Device Up Sell Target
Application QoE
•  Facebook, YouTube etc.
•  Feature phones]
•  IoT
Data QoE
•  Latency
•  Speed
•  Traffic Analysis
Operating System Distribution
•  Android
•  iOS
•  Other
Rogue Device Identification
•  Non Standard Devices
•  Poor Voice Experience
•  Poor Data Experience
•  Anomaly Detection
Voice Experience
•  Call Drop Analysis
•  Location based Call Experience
•  Rogue Device Analysis
Use Case: Data & Voice quality of
experience
Prepaid Patterns
•  Top-up distribution
•  Up-Sell opportunities
Subscriber Activation Pattern
Distribution
•  Promotion to activation co-relation
•  Network Activation stats
Network Experience Co-
relations
•  Churn to network experience
•  Promotion to NT to Churn
Prepaid Promotion
Analysis
•  Competitor Promotion co-
relation
•  Subscriber churn analysis
Use Case: Subscriber Active
States
Data Discovery Workshop
Deployment of Cardinality’s
Perception Platform in the
cloud
Transfer/Load encrypted data
in Perception
Reports and Actionable
analytics
Proof-of-Concept (Cloud)
www.cardinality.xyz
info@cardinality.xyz

Weitere ähnliche Inhalte

Was ist angesagt?

Compute-based sizing and system dashboard
Compute-based sizing and system dashboardCompute-based sizing and system dashboard
Compute-based sizing and system dashboard
DataWorks Summit
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
DataWorks Summit
 

Was ist angesagt? (20)

Paris FOD Meetup #5 Hortonworks Presentation
Paris FOD Meetup #5 Hortonworks PresentationParis FOD Meetup #5 Hortonworks Presentation
Paris FOD Meetup #5 Hortonworks Presentation
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant Presentation
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
 
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
 
Multi-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLASMulti-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLAS
 
Log I am your father
Log I am your fatherLog I am your father
Log I am your father
 
GeoWave: Open Source Geospatial/Temporal/N-dimensional Indexing for Accumulo,...
GeoWave: Open Source Geospatial/Temporal/N-dimensional Indexing for Accumulo,...GeoWave: Open Source Geospatial/Temporal/N-dimensional Indexing for Accumulo,...
GeoWave: Open Source Geospatial/Temporal/N-dimensional Indexing for Accumulo,...
 
Synchronicity of a distributed financial system
Synchronicity of a distributed financial systemSynchronicity of a distributed financial system
Synchronicity of a distributed financial system
 
Compute-based sizing and system dashboard
Compute-based sizing and system dashboardCompute-based sizing and system dashboard
Compute-based sizing and system dashboard
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
 
Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...Designing data pipelines for analytics and machine learning in industrial set...
Designing data pipelines for analytics and machine learning in industrial set...
 
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & TrifactaExtend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
Extend Governance in Hadoop with Atlas Ecosystem: Waterline, Attivo & Trifacta
 
Operating a secure big data platform in a multi-cloud environment
Operating a secure big data platform in a multi-cloud environmentOperating a secure big data platform in a multi-cloud environment
Operating a secure big data platform in a multi-cloud environment
 
High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark
 
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
 
Bridging the gap: achieving fast data synchronization from SAP HANA by levera...
Bridging the gap: achieving fast data synchronization from SAP HANA by levera...Bridging the gap: achieving fast data synchronization from SAP HANA by levera...
Bridging the gap: achieving fast data synchronization from SAP HANA by levera...
 
Pouring the Foundation: Data Management in the Energy Industry
Pouring the Foundation: Data Management in the Energy IndustryPouring the Foundation: Data Management in the Energy Industry
Pouring the Foundation: Data Management in the Energy Industry
 
Pivotal Real Time Data Stream Analytics
Pivotal Real Time Data Stream AnalyticsPivotal Real Time Data Stream Analytics
Pivotal Real Time Data Stream Analytics
 
Data-In-Motion Unleashed
Data-In-Motion UnleashedData-In-Motion Unleashed
Data-In-Motion Unleashed
 

Andere mochten auch (6)

Pooja 160210125116.vpg (1)
Pooja 160210125116.vpg (1)Pooja 160210125116.vpg (1)
Pooja 160210125116.vpg (1)
 
3Com 10001874 REVAB
3Com 10001874 REVAB3Com 10001874 REVAB
3Com 10001874 REVAB
 
Reportage Gael Maison avril 2016 l
Reportage Gael Maison avril 2016 lReportage Gael Maison avril 2016 l
Reportage Gael Maison avril 2016 l
 
El agua
El aguaEl agua
El agua
 
How CIS of Chicago Can Help Enhance Your School-based Program
How CIS of Chicago Can Help Enhance Your School-based ProgramHow CIS of Chicago Can Help Enhance Your School-based Program
How CIS of Chicago Can Help Enhance Your School-based Program
 
Navigating Chicago Public Schools (CPS)
Navigating Chicago Public Schools (CPS)Navigating Chicago Public Schools (CPS)
Navigating Chicago Public Schools (CPS)
 

Ähnlich wie Cardinality-HL-Overview

Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
Provectus
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
StampedeCon
 

Ähnlich wie Cardinality-HL-Overview (20)

Intelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff PollockIntelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff Pollock
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data Integration2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data Integration
 
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin MotgiWhither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi
 
Equinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journeyEquinix Big Data Platform and Cassandra - A view into the journey
Equinix Big Data Platform and Cassandra - A view into the journey
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
 
Exploring the Wider World of Big Data
Exploring the Wider World of Big DataExploring the Wider World of Big Data
Exploring the Wider World of Big Data
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantage
 
Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014Teradata - Presentation at Hortonworks Booth - Strata 2014
Teradata - Presentation at Hortonworks Booth - Strata 2014
 
Wasp2 - IoT and Streaming Platform
Wasp2 - IoT and Streaming PlatformWasp2 - IoT and Streaming Platform
Wasp2 - IoT and Streaming Platform
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Big data Question bank.pdf
Big data Question bank.pdfBig data Question bank.pdf
Big data Question bank.pdf
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiry
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
 
4AA6-4492ENW
4AA6-4492ENW4AA6-4492ENW
4AA6-4492ENW
 
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the CloudBring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
 
Actian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL EditionActian Analytics Platform - Hadoop SQL Edition
Actian Analytics Platform - Hadoop SQL Edition
 

Cardinality-HL-Overview

  • 1. CARDINALITY innovative analytics: - per customer - per service - network wide CONFIDENTIALITY STATEMENT - THIS DOCUMENT AND THE INFORMATION IN IT ARE PROVIDED IN THE STRICTEST COMMERCIAL CONFIDENCE, FOR THE SOLE PURPOSE OF EVALUATING CARDINALITY LTD AS A SUPPLIER, AND SHALL NOT BE DISCLOSED TO ANY THIRD PARTY OR USED FOR ANY OTHER PURPOSE WITHOUT THE EXPRESS WRITTEN PERMISSION OF CARDINALITY LTD Copyright © 2016 Cardinality Ltd., All Rights Reserved. No part of this work, which is protected by copyright, may be reproduced, stored, transmitted, or disseminated in any form or by any means without prior written permission from Cardinality Ltd.
  • 2. Cardinality Overview q  Cardinality  are  a  group  of  wireless  telecommunica5ons  engineers,   architects,  data  scien5sts,  and  so8ware  developers.     q  The  largest  Big  Data  Hadoop  Solu5on  deployed  in  any  telecommunica5ons   operator  in  Europe.   q  Cardinality  have  extensive  radio  design,  op5misa5on  and  opera5onal   support  system  (OSS)  experience   q  Leveraging  innova5ve  methodologies  and  pro-­‐ac5ve  approach  
  • 3. Cardinality is…. q  Innova5ve  Start-­‐Up,  with  referenceable  customers  &  sound  financial  backing   q  Founders  have  real  world  Big  Data  experience  with  Mobile  Network  Operators  (MNOs)   q  Using  proven  open  source  architectures,  which  has  evolved  based  on  analy5cs  experience   gained  from  over  4  years  using  Hadoop  clusters  in  mul5ple  operators   q  Tier  One  MNO  Big  Data  contact  won  in  summer  2015  (via  open  RFP,  against  large  /  established   “big  data”  companies)   q  Ini5al  volumes:           q  Big  Data  is  tainted  with  hype  –  Cardinality  iden5fy,  target,  real  business  problems  &  solve  them   ETL  Throughput   100K  events  per  sec  (per  Feed)   Big  Data  Storage   10-­‐12  billion  complex  rows/day  (40  TB  uncompressed   per  day)   Cache   ~200  million  complex  rows  (100GB)  
  • 5. . Collect Data from all the network elements MonetiseStore Visualise Data Lake: Collect Everything AnalyseCorrelate
  • 6. This is a sample text. Insert your desired text here.   here.   This is a sample text. Insert your desired text here.   Data Accessibility Data Transfer Data Storage Data Correlation Either (real)time data is not available or there is license cost q too costly to transfer the data from source to DW q too late by the time data reaches the DW q Complex Correlation needed q Batch processing does not complete in tine q Not real time q Existing DW doesn’t have enough space to store all data sets q It’s too expensive to store all the data sets in existing DW Common Issues
  • 7.                               Data Lake                           ETL  Engine   Analysis  Engine   Data Lake Cardinality Perception   Visualisa5on   Unconnected  &  Un-­‐enriched     data  silo  approach  repeated  in    New  Data  Lakes  create  unusable  big  data  sets     Data  Science  driven  enriched  and  connected  data  sets    imported  in  Data  Lakes  using     Cardinality’s  Percep5on  Plaborm  can  be  easily  accessed  by   users  and  machines     Data Silos Vs. Enriched Data Sets               Tradi&onal  Approach  
  • 8. DESIGN   Big Data Strategy Data Lake Big Data Analytics Architecture CREATE   BUILD   DELIVER   Cardinality Perception How can we help?
  • 9. Full  stack  deployment  of   Cardinality’s  Percep5on  plaborm   including  a  Hadoop  based  data  lake.   Greenfield   Deployment  of  Cardinality’s       Percep5on  plaborm  around  the   exis5ng    Hadoop  cluster         Exis5ng  Hadoop  Cluster   Offerings   Deployment  of  Cardinality  use  cases   and  data  visualisa5on,  using   exis5ng  Hadoop  Cluster  and  ETL   engine   Use  Cases/Data  Analy5cs   Big  Data  Consul5ng   Leverage  Cardinality’s    exper5se  in   crea5ng    organisa5on    wide   strategies,  designing  big  data   architectures  and  end-­‐2-­‐end    data     analysis/use  cases   Works with existing hadoop cluster and flexible options incl. full stack
  • 10. Open Approach to Infrastructure Cloud   (Public,  Private,  or  Hybrid)   Appliance   Model   (Pre-­‐Built)   Data  Centre   ✔   ✔   ✔  
  • 11. Components Deployed • ETL Engine • Data Lake: New Hadoop Cluster • Magnitude: Cache, SQL, NoSQL • Marvel: Data Analysis/Use Cases • Scene: Visualisation Data  Collec*on   (SFTP,  TCP,   WebSocket)   Data/Event   Stream   Data  Parser,     Enrichment  &   Analysis   DATALAKE&STORAGE Batch  Analysis  Spark  Jobs   APPLICATIONS Enterprise Applications Interactive Web & Mobile Applications BI / Reporting, Ad Hoc Analysis Statistical Analysis DATASOURCE Probes CDRs Control Plane Sensor Data Geo- location Data Network Data User Plane Full Stack Deployment
  • 12. Deployed Components •  ETL Engine •  Magnitude: Cache, SQL, NoSQL •  Data Analysis/Use Cases •  Visualisation Data  Collec*on   (SFTP,  TCP,   WebSocket)   Data/Event   Stream   Data  Parser,     Enrichment  &   Analysis   DATALAKE&STORAGE Batch  Analysis  Spark  Jobs   APPLICATIONS Enterprise Applications Interactive Web & Mobile Applications BI / Reporting, Ad Hoc Analysis Statistical Analysis DATASOURCE Probes CDRs Control Plane Sensor Data Geo- location Data Network Data User Plane Exis5ng  Hadoop   Cluster   Existing Hadoop Cluster
  • 13. Components Deployed •  Magnitude: Cache, SQL, NoSQL •  Marvel: Data Analysis/ Cardinality use cases •  Scene: Visualisation Cardinality’s  Data   PipeLine  (ETL)   DATALAKE&STORAGE Batch  Analysis  Spark  Jobs   APPLICATIONS Enterprise Applications Interactive Web & Mobile Applications BI / Reporting, Ad Hoc Analysis Statistical Analysis DATASOURCE Probes CDRs Control Plane Sensor Data Geo- location Data Network Data User Plane Exis*ng  ETL   Exis5ng  Hadoop   Cluster   Use Case / Data Analysis
  • 14. Cardinality Perception MME  Mul5-­‐Vendor  Probes   Data  Proxy   Firewall   Scene   Voice,  SMS,  Data   (Mediated  &  Billed   CDRs)   Marvel   (Data  Analysis)   Other  Data   Sources   Data  Pipeline  Data  Fetcher   Data  Parser   Magnitude   (Data  Storage)   HDFS   NO  SQL   SQL   Cache   Management   Server     Manage ment   Cluster   Manager   Component   Repository   Map  Reduce     Impala  (SQL)   Spark   •  Enterprise Ready (Scale, Robustness, Secure, Performance) Platform •  Utilising Enterprise Grade Open Source Components •  Vendor Independent, Best in Class component selection •  Each component is wrapped for simple upgrades and version control •  Architected using Micro Services to keep platform AGILE Security   API  Open  Access  (Resbul  &  SOAP)     Access   Subscriber  Driven   Coverage   Sta5s5cal   Repor5ng   Data  Visualisa5on  
  • 15. Real Time Decision EngineData  Feeds   File Sources ETLEngine API’s Machine Learning Data Hadoop RealTimePrivacyFilter (MSISDN,IPaddress,UAAgent,Domainsetc.) Streaming Sources Process  Feeds   API Interface File Handler Stream Monitor API Monitor Encryption &Enrichment Data Link Data Feed File Drop Administration & Monitoring Analysis Layer Presentation Layer                           Technology Overview
  • 16. Behavioral Analytics q Anomaly Detection Tools q Recommender Tools Spatiotemporal Analytics q Road Network Algorithms q Spatiotemporal Data Mining q Spatiotemporal Indexing Cognitive Networks q Makovian & Bayesian Networks q Deep neural networks q K-mean Clustering q Graph analysis Cognitive Analytics q Visual Sentiment Analysis q Emotion Analysis Example Algorithms
  • 17. Data Sources This is a sample text. Insert your desired text here.   This is a sample text. Insert your desired text here.   SAMPLE TEXT This is a sample text. Insert your desired text here.   This is a sample text. Insert your desired text here.   This is a sample text. Insert your desired text here.   Data Sources User Plane Data •  DPI •  Proxy •  Probes Control Plane Data •  MME/GGSN •  Probes •  PCRF Other Network Data •  Subscriber Location •  Probes •  Network Performance CDRs •  Voice •  SMS •  Data •  Roaming Reference Data •  Tariff Data •  Cell Site Reference •  Demographics Other Data Sources •  Marketing Promotions •  Customer Feedback
  • 18. DATA   Network q Radio Analytics q Congestion Analytics q Video Analytics q Location Based Analysis q Network Performance Business & Finance q Cell Profitability q Tariff Profitability q Future Network Investment Analysis Customer Services q Near RT customer issues q Churn Prediction & Prevention q Customer Loyalty Marketing q Customer Segmentation q Segment of One q Device & Tariff Upsell q New Tariffs q Content Upsell q OTT impact 360° use of Big Data
  • 19. Network   Conges5on   Hotspot  Analysis   Heat  Maps   Web  &  Video  Analy5cs   Performance  &  Capacity   Analy5cs   Profitability   Marke5ng   User  Segmenta5on   Sales  &  Upsell  Targe5ng   Tariff  Proposi5ons   Handset  &   Consump5on  Analy5cs   OTT  Service  Package   Analysis   Social  Media  Impact   Customer   Services   Churn  &  Reten5on   Proac5ve  Subscriber   Engagement   Billshock  &  Selfcare   Event  Monitoring   (fes5vals,  spor5ng   events)   Finance  &   Fraud   Fraud  Predic5on   Customer  Profitability   Targeted  Investment   Analysis   Board   Repor5ng   KPI  Metrics   Dashboards   Compliance   Security  Monitoring   Criminal  Ac5vity   Data  Privacy   Freedom  of  Informa5on   Requests   Compliance  Reports   IT   Data  Warehouse   Bill  Verifica5on   Business  Intelligence   Example use cases
  • 20. Africa MNO Simple Use cases & POC
  • 21. Example Use Case Value •  Device  Analy5cs  –  Ability  to  iden5fy  and  monitor  “rouge”  device  in  the  network,   and  inves5gate  device  upsell  opportuni5es   •  Data  &  Voice  QoE  –  Ability  to  monitor  QoE  across  the  network,  iden5fica5on  of   poten5al  op5miza5ons,  and  to  standardise  published  QoE  metrics   •  Subscriber  Ac5ve  States  –  Ability  to  iden5fy  the  subscriber  migra5on  and   ac5va5on  paherns  based  on  the  prepaid  datasets(top-­‐up  &  ac5va5ons).  
  • 22. Use Case: Device AnalyticsDevice Analytics Type of Devices •  Smartphones •  Feature phones] •  IoT Manufacturer Distribution •  Samsung •  Apple •  HTC etc. Operating System Distribution •  Android •  iOS •  Other Rogue Device Identification •  Non Standard Devices •  Poor Voice Experience •  Poor Data Experience •  Anomaly Detection Device Market Performance Device Up Sell Target
  • 23. Application QoE •  Facebook, YouTube etc. •  Feature phones] •  IoT Data QoE •  Latency •  Speed •  Traffic Analysis Operating System Distribution •  Android •  iOS •  Other Rogue Device Identification •  Non Standard Devices •  Poor Voice Experience •  Poor Data Experience •  Anomaly Detection Voice Experience •  Call Drop Analysis •  Location based Call Experience •  Rogue Device Analysis Use Case: Data & Voice quality of experience
  • 24. Prepaid Patterns •  Top-up distribution •  Up-Sell opportunities Subscriber Activation Pattern Distribution •  Promotion to activation co-relation •  Network Activation stats Network Experience Co- relations •  Churn to network experience •  Promotion to NT to Churn Prepaid Promotion Analysis •  Competitor Promotion co- relation •  Subscriber churn analysis Use Case: Subscriber Active States
  • 25. Data Discovery Workshop Deployment of Cardinality’s Perception Platform in the cloud Transfer/Load encrypted data in Perception Reports and Actionable analytics Proof-of-Concept (Cloud)