SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Driving Network and Marketing Investments at O2 by Focusing on Improving the
Customer Experience
Ankur Agarwal
Network Data & Analytics Lead
Ajay Kaushik
Data Platform Design Lead
Evolution of O2 Network Data Analytics Hub
About the Speakers today
2
Ankur Agarwal
Ankur currently drives the data strategy and roadmap of Network Data & Analytics in
O2. Prior to this role, Ankur led the BI and Big data design at O2 where he owned the
data architecture, design and validation of data deliveries.
Before joining O2, Ankur was leading the Presales and Solutions function for Data and
Analytics practice at TCS where he has helped a number of clients in UK and EU on
their data initiatives.
Ajay Kaushik
Ajay is a Platform Design Lead in Telefónica’s Big Data Analytics team with a diverse
background in systems engineering and platform design. He has wide ranging
experience in the digital and network domain evangelising DevOps.
O2 UK - Telefónica
3
O2 is a mobile network operator and the principal commercial brand of Telefónica UK Limited, which is part
of the global telecommunications group Telefónica S.A, headquartered in Spain and operating in Europe,
and North, Central and South America.
O2 was awarded Best Network Coverage in 2019 by uSwitch, for a second year in a row, and with over 32 million
connections to the network, it runs 2G, 3G and 4G services across the UK, as well as operating its nationwide O2
Wifi service.
The company is the network of choice for mobile virtual network operators such as giffgaff, Sky Mobile and Lyca
Mobile as well as managing a 50:50 joint venture with Tesco for Tesco Mobile.
O2 has around 6,700 employees and over 450 retail stores and sponsors England Rugby, The O2 and 19 O2
Academy music venues across the UK. Through a comprehensive sustainability strategy O2 is also creating work
experience opportunities for 16-24 year olds via its GoThinkBig platform, enabling customers to reduce their impact
on the environment by recycling their old devices through O2 Recycle and, in partnership with the NSPCC, helping
parents to keep their children safe online.
O2 is the only mobile operator in the 2018 Social Mobility Employer Index and was named as one of the best places
to work in the 2019 Glassdoor Employee’s Choice Award.
Background
4
Specific focus was to be
given to keep the
maintenance cost ultra low
O2 needed a data platform to ingest
network events for measuring &
predicting customer network
experience
There was lots of manual
intervention required co-
relate the data from a
number of data sources
Very large volume of structured and
semi structured data had to be
ingested in real time without
overhead on network
Complex enrichment
required to co-relate the data
in the platform during
ingestion
Requirement for consistent
transformation rules and
data governance & quality
monitoring
Key Architectural Decision for building the platform
5
Build vs Buy
Open Source or
not
Transform & Enrich
Data while in
motion or at rest
Start small with a
prototype & evolveOn-premise vs
Cloud
To Hadoop or not
HDP
Data Sources Data Ingestion
Streaming and mini batches
Data Storage & Analytics Data Presentation Data Consumption
Features
KPIs
M/L Models
Hive
Data feeds
Tableau Extracts
Master data
Aggregated views
Web
Mobility
Probes
CRM
DPI
GSMA
Surveys
Alarms
Tickets
Micro services
based ETL
Engine
HDF
API
High Level Architecture
MongoDB ODS
30TB RAM, 4500 VCores, 2.5 PB
20TB RAM,
3800 VCores,
650 TB
30B+ Daily Events
Technology Selection and Considerations
7
Data Loading / Storage /
Lineage
• Exploiting the capability of
HDF and HDP to meet the
business requirements.
• Easily expandable for
future Use Cases.
• Deployed on standard
commodity hardware.
ETL
• Micro services based
ETL platform
• Complex enrichment
capability during
ingestion
Reporting
• O2 selected tool for
Discovery &
Visualisation.
• Hosted in-premise &
(coming up in) Cloud.
Data Science Toolkit
• Open Source tools
chosen having
continuous
contribution from
developers
• Deployed on GPU
machines
Platform Evolution
8
42 Nodes 200 Nodes 260 Nodes 320 Nodes
2014 2015-16 2017 2018 2019
Micro Service Based
ETL
Spark Adoption for
performance
Introduced Self
Service
Migration to
Supported HDP
Data Governance Tools
Policy Based Security
E2E Lineage
Data Encryption
Analytics
Data
Management
Platform
Capability
Cluster Size
Virtual Drive
Test
Experience
Customer
Segmentation
(Personalization)
CSI
Predictor
(NCX)
NCX Predictor
(Voice)
Data APIs
ML
Capability
NLP
NCX What-if
Hybrid Cloud
Analytics APIs
Hybrid
CSI Predictor (NCX)
Weblogs
IPRF
Magnet
Arcanum
GSMA
User Catalogue
Fanbase
Web Analytics
Mobility
Signalling
Calls
Feature
Table (FT)
Day 7
Feature Table
Aggregated
(FT)
Feature Table
Aggregated Scored
(NCX)
MME
Feature
Table (FT)
Day 1
ML
CRM
Hive
NCX driving focus on Network & Marketing
investments
• Marketing - NCX as a driver of customer communication
• Always on marketing campaign highlighting customers who have had an x
improvement in their NCX score
• Use individual NCX as a post-disruption targeting mechanism
• Experience / improvement reinforcement message to people who we know
have their experience improved
• Customer Service
• Use NCX to identify if there is a tipping point when customers churn or complain
• Networks
• Driving end to end network performance
• Impact on customer experience due to network roll out and changes
• Strategic network forecasting
• Using NCX to prioritise capacity, coverage and technology investment
10
O2 Network
Data &
Analytics
Platform
O2 Labs
Marketing
D&A
TEF
Research &
Innovation
Digital
Network
Ops &
Performance
Revenue
Assurance
GiffGaff/Sky
Smart
Metering
Data, Platform & Analytics capabilities consumed
by all the spokes
Analytics products like automated Anomaly
Detections are developed jointly between
Netpulse & TEF Research & Innovation Teams
Collaboration between Netpulse, O2 labs and
Marketing D&A team for jointly evolve more re-
usable analytics product like NLP – initially built by
Netpulse.
Extending the analytics development capability to
Digital team to enhance Smartsteps & Smartcities
products using customer insight generated from
Network data.
And now we are.. Network Data & Analytics Hub
Considerations and Lessons Learnt
12
Build the Datalake on specific Business Value
Always build a Datalake on defined Use Cases that have business value from
day-1, this will ensure that the lake won’t turn into a very expensive resource with
no financial return for the business.
Utilise the experts – Hortonworks
1) Helped correctly size environment (Nifi, Data Nodes, Edge Nodes)
2) Installed all software and setup initial environment.
3) Part of the core team, answering queries and responding to technical tickets.
4) Provided subject matter experts, architectural guidance, design and security
knowledge.
Considerations and Lessons Learnt
13
Our approach to Hybrid Environment (Cloud and on-premise)
• Quick time to market for capacity expansion
• Avoid huge Cloud cost by keeping the hot skeleton site active and bursting it
on demand – leveraging cloud elasticity
• Determine any data security and residency requirements.
• Plug in readily available cognitive services APIs in Cloud in the Analytics
pipeline to rapidly experiment the model.
• Option to explore the alternative architecture options with bucket storage
with auto scaling compute & APIs
• Cold/Warm Storage capability
Any Questions?
14

Weitere ähnliche Inhalte

Was ist angesagt?

The Inside Story: How OPC UA and DDS Can Work Together in Industrial Systems
The Inside Story: How OPC UA and DDS Can Work Together in Industrial SystemsThe Inside Story: How OPC UA and DDS Can Work Together in Industrial Systems
The Inside Story: How OPC UA and DDS Can Work Together in Industrial SystemsReal-Time Innovations (RTI)
 
DO-178B/ED-12B Presentation
DO-178B/ED-12B PresentationDO-178B/ED-12B Presentation
DO-178B/ED-12B PresentationAnkit Singh
 
Learnable Image Encryption
Learnable Image EncryptionLearnable Image Encryption
Learnable Image EncryptionMasayuki Tanaka
 
Marketing Your Open Source Project
Marketing Your Open Source ProjectMarketing Your Open Source Project
Marketing Your Open Source Projectdeirdrestraughan
 
ISO/IEC 42010 Recommended Practice for Architectural description
ISO/IEC 42010 Recommended Practice for Architectural descriptionISO/IEC 42010 Recommended Practice for Architectural description
ISO/IEC 42010 Recommended Practice for Architectural descriptionHongseok Lee
 
Point Cloud and its applications
Point Cloud and its applicationsPoint Cloud and its applications
Point Cloud and its applicationsLeonis Wong
 
Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...
Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...
Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...Jordi Cabot
 
Enterprise cloud strategy - PDF
Enterprise cloud strategy - PDFEnterprise cloud strategy - PDF
Enterprise cloud strategy - PDFHoangVanDai
 
Introducing Eclipse MoDisco
Introducing Eclipse MoDiscoIntroducing Eclipse MoDisco
Introducing Eclipse MoDiscoHugo Bruneliere
 
Software Engineering - chp2- requirements specification
Software Engineering - chp2- requirements specificationSoftware Engineering - chp2- requirements specification
Software Engineering - chp2- requirements specificationLilia Sfaxi
 
5th Qatar BIM User Day, Understanding stakeholder roles in BIM
5th Qatar BIM User Day, Understanding stakeholder roles in BIM5th Qatar BIM User Day, Understanding stakeholder roles in BIM
5th Qatar BIM User Day, Understanding stakeholder roles in BIMBIM User Day
 
Quality Assurance of FME Scripts
Quality Assurance of FME ScriptsQuality Assurance of FME Scripts
Quality Assurance of FME ScriptsSafe Software
 
Architecting AI Applications
Architecting AI ApplicationsArchitecting AI Applications
Architecting AI ApplicationsMikio L. Braun
 
Seminar on Chaos Based Cryptography
Seminar on Chaos Based CryptographySeminar on Chaos Based Cryptography
Seminar on Chaos Based CryptographyMuhammad Hamid
 
Message Authentication and Hash Function.pdf
Message Authentication and Hash Function.pdfMessage Authentication and Hash Function.pdf
Message Authentication and Hash Function.pdfsunil sharma
 

Was ist angesagt? (16)

The Inside Story: How OPC UA and DDS Can Work Together in Industrial Systems
The Inside Story: How OPC UA and DDS Can Work Together in Industrial SystemsThe Inside Story: How OPC UA and DDS Can Work Together in Industrial Systems
The Inside Story: How OPC UA and DDS Can Work Together in Industrial Systems
 
DO-178B/ED-12B Presentation
DO-178B/ED-12B PresentationDO-178B/ED-12B Presentation
DO-178B/ED-12B Presentation
 
Learnable Image Encryption
Learnable Image EncryptionLearnable Image Encryption
Learnable Image Encryption
 
Marketing Your Open Source Project
Marketing Your Open Source ProjectMarketing Your Open Source Project
Marketing Your Open Source Project
 
ISO/IEC 42010 Recommended Practice for Architectural description
ISO/IEC 42010 Recommended Practice for Architectural descriptionISO/IEC 42010 Recommended Practice for Architectural description
ISO/IEC 42010 Recommended Practice for Architectural description
 
Point Cloud and its applications
Point Cloud and its applicationsPoint Cloud and its applications
Point Cloud and its applications
 
Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...
Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...
Model-Driven Software Engineering in Practice - Chapter 6 - Modeling Language...
 
Enterprise cloud strategy - PDF
Enterprise cloud strategy - PDFEnterprise cloud strategy - PDF
Enterprise cloud strategy - PDF
 
Introducing Eclipse MoDisco
Introducing Eclipse MoDiscoIntroducing Eclipse MoDisco
Introducing Eclipse MoDisco
 
Virtual Design and Construction
Virtual Design and ConstructionVirtual Design and Construction
Virtual Design and Construction
 
Software Engineering - chp2- requirements specification
Software Engineering - chp2- requirements specificationSoftware Engineering - chp2- requirements specification
Software Engineering - chp2- requirements specification
 
5th Qatar BIM User Day, Understanding stakeholder roles in BIM
5th Qatar BIM User Day, Understanding stakeholder roles in BIM5th Qatar BIM User Day, Understanding stakeholder roles in BIM
5th Qatar BIM User Day, Understanding stakeholder roles in BIM
 
Quality Assurance of FME Scripts
Quality Assurance of FME ScriptsQuality Assurance of FME Scripts
Quality Assurance of FME Scripts
 
Architecting AI Applications
Architecting AI ApplicationsArchitecting AI Applications
Architecting AI Applications
 
Seminar on Chaos Based Cryptography
Seminar on Chaos Based CryptographySeminar on Chaos Based Cryptography
Seminar on Chaos Based Cryptography
 
Message Authentication and Hash Function.pdf
Message Authentication and Hash Function.pdfMessage Authentication and Hash Function.pdf
Message Authentication and Hash Function.pdf
 

Ähnlich wie Driving Network and Marketing Investments at O2 by Focusing on Improving the Customer Experience

From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...SoftServe
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnectaDigital
 
Bas van Dorst - Microsoft
Bas van Dorst - MicrosoftBas van Dorst - Microsoft
Bas van Dorst - MicrosoftDutch Power
 
It Consulting & Services - Black Basil Technologies
It Consulting & Services  - Black Basil TechnologiesIt Consulting & Services  - Black Basil Technologies
It Consulting & Services - Black Basil TechnologiesBlack Basil Technologies
 
Digital Reinvention by NRB
Digital Reinvention by NRBDigital Reinvention by NRB
Digital Reinvention by NRBWilliam Poos
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...Denodo
 
SpeedyCloud Services Introduction Vol-5
SpeedyCloud Services Introduction Vol-5SpeedyCloud Services Introduction Vol-5
SpeedyCloud Services Introduction Vol-5Zaighum Malik 赞谋
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches DataWorks Summit
 
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...DataWorks Summit
 
TSSG Innovation Breakfast Seminar, Dublin - June 4th
TSSG Innovation Breakfast Seminar, Dublin - June 4thTSSG Innovation Breakfast Seminar, Dublin - June 4th
TSSG Innovation Breakfast Seminar, Dublin - June 4thWalton Institute
 
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...WalmartLabs
 
Cubitic: Predictive Analytics
Cubitic: Predictive AnalyticsCubitic: Predictive Analytics
Cubitic: Predictive Analyticshuguk
 
The Proof is in the Pudding
The Proof is in the PuddingThe Proof is in the Pudding
The Proof is in the PuddingDenodo
 
Apidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital One
Apidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital OneApidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital One
Apidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital Oneapidays
 
What is the future of data strategy?
What is the future of data strategy?What is the future of data strategy?
What is the future of data strategy?Denodo
 
¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...
¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...
¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...Denodo
 
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at ScaleInfrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at ScaleRobb Boyd
 

Ähnlich wie Driving Network and Marketing Investments at O2 by Focusing on Improving the Customer Experience (20)

From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
From Business Idea to Successful Delivery by Serhiy Haziyev & Olha Hrytsay, S...
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud Platform
 
Bas van Dorst - Microsoft
Bas van Dorst - MicrosoftBas van Dorst - Microsoft
Bas van Dorst - Microsoft
 
It Consulting & Services - Black Basil Technologies
It Consulting & Services  - Black Basil TechnologiesIt Consulting & Services  - Black Basil Technologies
It Consulting & Services - Black Basil Technologies
 
Digital Reinvention by NRB
Digital Reinvention by NRBDigital Reinvention by NRB
Digital Reinvention by NRB
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
 
SpeedyCloud Services Introduction Vol-5
SpeedyCloud Services Introduction Vol-5SpeedyCloud Services Introduction Vol-5
SpeedyCloud Services Introduction Vol-5
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches
 
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
O2’s Financial Data Hub: going beyond IFRS compliance to support digital tran...
 
Cloud asia 2012
Cloud asia 2012Cloud asia 2012
Cloud asia 2012
 
Extelligence sro
Extelligence sroExtelligence sro
Extelligence sro
 
TSSG Innovation Breakfast Seminar, Dublin - June 4th
TSSG Innovation Breakfast Seminar, Dublin - June 4thTSSG Innovation Breakfast Seminar, Dublin - June 4th
TSSG Innovation Breakfast Seminar, Dublin - June 4th
 
Rushabh_Doshi_1_
Rushabh_Doshi_1_Rushabh_Doshi_1_
Rushabh_Doshi_1_
 
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...
How We Do DevOps at Walmart: OneOps OSS Application Lifecycle Management Plat...
 
Cubitic: Predictive Analytics
Cubitic: Predictive AnalyticsCubitic: Predictive Analytics
Cubitic: Predictive Analytics
 
The Proof is in the Pudding
The Proof is in the PuddingThe Proof is in the Pudding
The Proof is in the Pudding
 
Apidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital One
Apidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital OneApidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital One
Apidays Paris 2023 - Building APIs At Scale, Ado Trakic, Capital One
 
What is the future of data strategy?
What is the future of data strategy?What is the future of data strategy?
What is the future of data strategy?
 
¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...
¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...
¿Cómo las manufacturas están evolucionando hacia la Industria 4.0 con la virt...
 
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at ScaleInfrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
Infrastructure Solutions for Deploying AI/ML/DL Workloads at Scale
 

Mehr von DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Mehr von DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Kürzlich hochgeladen

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 

Kürzlich hochgeladen (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Driving Network and Marketing Investments at O2 by Focusing on Improving the Customer Experience

  • 1. Driving Network and Marketing Investments at O2 by Focusing on Improving the Customer Experience Ankur Agarwal Network Data & Analytics Lead Ajay Kaushik Data Platform Design Lead Evolution of O2 Network Data Analytics Hub
  • 2. About the Speakers today 2 Ankur Agarwal Ankur currently drives the data strategy and roadmap of Network Data & Analytics in O2. Prior to this role, Ankur led the BI and Big data design at O2 where he owned the data architecture, design and validation of data deliveries. Before joining O2, Ankur was leading the Presales and Solutions function for Data and Analytics practice at TCS where he has helped a number of clients in UK and EU on their data initiatives. Ajay Kaushik Ajay is a Platform Design Lead in Telefónica’s Big Data Analytics team with a diverse background in systems engineering and platform design. He has wide ranging experience in the digital and network domain evangelising DevOps.
  • 3. O2 UK - Telefónica 3 O2 is a mobile network operator and the principal commercial brand of Telefónica UK Limited, which is part of the global telecommunications group Telefónica S.A, headquartered in Spain and operating in Europe, and North, Central and South America. O2 was awarded Best Network Coverage in 2019 by uSwitch, for a second year in a row, and with over 32 million connections to the network, it runs 2G, 3G and 4G services across the UK, as well as operating its nationwide O2 Wifi service. The company is the network of choice for mobile virtual network operators such as giffgaff, Sky Mobile and Lyca Mobile as well as managing a 50:50 joint venture with Tesco for Tesco Mobile. O2 has around 6,700 employees and over 450 retail stores and sponsors England Rugby, The O2 and 19 O2 Academy music venues across the UK. Through a comprehensive sustainability strategy O2 is also creating work experience opportunities for 16-24 year olds via its GoThinkBig platform, enabling customers to reduce their impact on the environment by recycling their old devices through O2 Recycle and, in partnership with the NSPCC, helping parents to keep their children safe online. O2 is the only mobile operator in the 2018 Social Mobility Employer Index and was named as one of the best places to work in the 2019 Glassdoor Employee’s Choice Award.
  • 4. Background 4 Specific focus was to be given to keep the maintenance cost ultra low O2 needed a data platform to ingest network events for measuring & predicting customer network experience There was lots of manual intervention required co- relate the data from a number of data sources Very large volume of structured and semi structured data had to be ingested in real time without overhead on network Complex enrichment required to co-relate the data in the platform during ingestion Requirement for consistent transformation rules and data governance & quality monitoring
  • 5. Key Architectural Decision for building the platform 5 Build vs Buy Open Source or not Transform & Enrich Data while in motion or at rest Start small with a prototype & evolveOn-premise vs Cloud To Hadoop or not
  • 6. HDP Data Sources Data Ingestion Streaming and mini batches Data Storage & Analytics Data Presentation Data Consumption Features KPIs M/L Models Hive Data feeds Tableau Extracts Master data Aggregated views Web Mobility Probes CRM DPI GSMA Surveys Alarms Tickets Micro services based ETL Engine HDF API High Level Architecture MongoDB ODS 30TB RAM, 4500 VCores, 2.5 PB 20TB RAM, 3800 VCores, 650 TB 30B+ Daily Events
  • 7. Technology Selection and Considerations 7 Data Loading / Storage / Lineage • Exploiting the capability of HDF and HDP to meet the business requirements. • Easily expandable for future Use Cases. • Deployed on standard commodity hardware. ETL • Micro services based ETL platform • Complex enrichment capability during ingestion Reporting • O2 selected tool for Discovery & Visualisation. • Hosted in-premise & (coming up in) Cloud. Data Science Toolkit • Open Source tools chosen having continuous contribution from developers • Deployed on GPU machines
  • 8. Platform Evolution 8 42 Nodes 200 Nodes 260 Nodes 320 Nodes 2014 2015-16 2017 2018 2019 Micro Service Based ETL Spark Adoption for performance Introduced Self Service Migration to Supported HDP Data Governance Tools Policy Based Security E2E Lineage Data Encryption Analytics Data Management Platform Capability Cluster Size Virtual Drive Test Experience Customer Segmentation (Personalization) CSI Predictor (NCX) NCX Predictor (Voice) Data APIs ML Capability NLP NCX What-if Hybrid Cloud Analytics APIs Hybrid
  • 9. CSI Predictor (NCX) Weblogs IPRF Magnet Arcanum GSMA User Catalogue Fanbase Web Analytics Mobility Signalling Calls Feature Table (FT) Day 7 Feature Table Aggregated (FT) Feature Table Aggregated Scored (NCX) MME Feature Table (FT) Day 1 ML CRM Hive
  • 10. NCX driving focus on Network & Marketing investments • Marketing - NCX as a driver of customer communication • Always on marketing campaign highlighting customers who have had an x improvement in their NCX score • Use individual NCX as a post-disruption targeting mechanism • Experience / improvement reinforcement message to people who we know have their experience improved • Customer Service • Use NCX to identify if there is a tipping point when customers churn or complain • Networks • Driving end to end network performance • Impact on customer experience due to network roll out and changes • Strategic network forecasting • Using NCX to prioritise capacity, coverage and technology investment 10
  • 11. O2 Network Data & Analytics Platform O2 Labs Marketing D&A TEF Research & Innovation Digital Network Ops & Performance Revenue Assurance GiffGaff/Sky Smart Metering Data, Platform & Analytics capabilities consumed by all the spokes Analytics products like automated Anomaly Detections are developed jointly between Netpulse & TEF Research & Innovation Teams Collaboration between Netpulse, O2 labs and Marketing D&A team for jointly evolve more re- usable analytics product like NLP – initially built by Netpulse. Extending the analytics development capability to Digital team to enhance Smartsteps & Smartcities products using customer insight generated from Network data. And now we are.. Network Data & Analytics Hub
  • 12. Considerations and Lessons Learnt 12 Build the Datalake on specific Business Value Always build a Datalake on defined Use Cases that have business value from day-1, this will ensure that the lake won’t turn into a very expensive resource with no financial return for the business. Utilise the experts – Hortonworks 1) Helped correctly size environment (Nifi, Data Nodes, Edge Nodes) 2) Installed all software and setup initial environment. 3) Part of the core team, answering queries and responding to technical tickets. 4) Provided subject matter experts, architectural guidance, design and security knowledge.
  • 13. Considerations and Lessons Learnt 13 Our approach to Hybrid Environment (Cloud and on-premise) • Quick time to market for capacity expansion • Avoid huge Cloud cost by keeping the hot skeleton site active and bursting it on demand – leveraging cloud elasticity • Determine any data security and residency requirements. • Plug in readily available cognitive services APIs in Cloud in the Analytics pipeline to rapidly experiment the model. • Option to explore the alternative architecture options with bucket storage with auto scaling compute & APIs • Cold/Warm Storage capability

Hinweis der Redaktion

  1. Explain the need of a data and analytics platform to ingest and analyze the network events. Various team used to gather and co-related data from various systems which was time consuming and was not giving the insight when it was needed. Platform was to be scaled to handle 30 billion events a day streaming continuously from the network and co-relate those in real time to measure and monitor the customer experience. A number of metrics to be created to assess the impact of those with actual customer experience. Instead of various team co-relating data with various definition, this platform was to create consistent transformation/enrichment rules to provide co-herant analysis.
  2. For building the data platform, here are some of the key decisions which were taken. There are good lesson learnt from some of them - explained in the slides later. Open Source or not – Decision was taken to go open source in line with our strategy. Transform & Enrich Data while in motion or at rest – To avoid re-processing huge data set again which would have added to additional latency. Proof of concept was conducted to compare the non functional for enrichment while data in motion or at rest. To Hadoop or not – Decision was taken to go community Hadoop against any MPP relational databases available at that time Onpremise vs Cloud – Decision was taken to go on-premise to leverage in-house build capability. Cloud was not the strategy at that time due to uncertainty of cost for large data platform and sensitive data. Build vs Buy – No scalable technology available to handle the scale without burning our pockets. Compatibility with Hadoop was not proven for many of them. Data in motion co-relation a question mark for such a volume. Decision was to build and not buy. Prototype and Evolve – Instead of going big bang and setting up massive data lake, smaller use cases were chosen for quick ROI and those were evolved. If you ask us today, how many decision would we like to revert - may be half of them
  3. This slide describes the architecture on which types of source data is currently used in the platform and various types of formats that data is produced by the network. Point to note here is that specific focus was given not to create overhead on the network to integrate the data for this platform and therefore standard network logs were used. A very performant modular micro services based ingestion layer (ETL engine) was built to consume, parse, transform/enrich and load the data into Hadoop cluster (HDFS). Additional a more dynamic and agile HDF technology was deployed for simplified ETL which could be written by technical analysts instead of expensive developers. In the data storage layer, there are 3 pipelines – a) which generate master data from events e.g. how many users use O2 network include MVNO and in-roamers b) KPI generation for network and customer reporting c) feature generation which measure various performance parameter which are then fed into machine learning models to predict the customer experience. Data is then exposed by Hive and Tableau technologies to end user tools like Zepplin, Tableau, Ambari and data feeds. We are also building APIs which will take the insight from the platform directly to customer and agent facing channels to help them handle/orchestrate the customer interaction appropriately. Stats provided to show the size and design of the platform to process and store 30B+ events daily.
  4. This slide shows our tool of choice of each layer mentioned in the previous slide.
  5. This slide shows our journey from 2014 across various dimensions of data & analytics – Cluster size, platform capability, Data management & Analytics. 2014 – 2016 – we started small and delivered initial use case called virtual drive test analytics tool which provide more accurate and timely view of network experience at network element level at every hour. Previously, O2 used to use Drive test results from various agencies coming to us parodically. Power of the data in the platform also enabled Web & Location Analytics to support Smartsteps, revenue assurance, Weve & Personalization use case and that’s when we started getting ROI. Major break thru happened in 2017 with the launch of NCX – which started driving network & marketing investment mentioned in slide 9 2019 will be the year to integrate the NCX with customer journey in channels to drive CX.
  6. This slide shows more technical data flow and data science process which is used to predict the NCX.
  7. Work with Network Performance team and the regional teams to drive performance E2E. Regional teams identify customers groups/regions with a poor NCX score < 40 then identify if there is a known issue in the area or a new radio issue.
  8. Platform and the team continued to grow and there was a increasing interest from various parts of the business to utilize the analytics from networks data. We now act as the hub and server various spokes in the business for their data & analytics needs. Where needed we collaborate with other analytics team to jointly develop analytics products which have much wider use cases. Data from the platform helps MVNOs detect anomaly in customer network profile which is notified to MVNOs to take appropriate actions. Platform will soon be opened up to MVNOs for reporting.