SlideShare a Scribd company logo
1 of 21
Impetus Technologies Inc. 
Big Data Technologies for Social 
© 2014 1 Impetus Technologies 
Media Analytics 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=4 
8
Outline 
• Social Media Analytics- Need and Benefits 
• Effective convergence of disparate data sources 
• Big Data technologies to enable Social Analytics 
• Our recommended approach 
• Industry relevant use cases 
© 2014 2 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Social Analytics 
Recommendation 
Engine 
© 2014 3 Impetus Technologies 
Reports and 
Statistics 
Data visualization Sentiment Analysis 
via Interactive 
Interface 
Social Media Sources 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Business Intelligence & Product Research 
 Customer Analysis 
 Identifies users from different geographies, 
locations 
 Tracks users activities to determine usage 
patterns 
 Feature Analysis 
 Track the usage of various social features 
 Product Growth Analysis 
 Track customer feedback on products 
 Target the right customers 
 Recommendation Engine 
 Related products and customers 
 Third Party Data Analysis 
 Analysis of customers on third party sites 
© 2014 4 Impetus Technologies 
Social Analytics provides smarter 
ways of data tracking, powerful 
analytics and metrics for informed 
decision making 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
How it Helps? 
Outcome Based Approach 
• Customer retention 
• Brand building and recall (harvests/ address sentiment) 
• Simplifies customer service 
• Reduces operational cost 
• Builds up the customer base 
• Understands customer’s opinions and addresses their 
needs 
• Competition benchmarking 
• Proactive on demographic changes 
© 2014 5 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Convergence of Data Sources 
Data Sources 
© 2014 6 Impetus Technologies 
Website Traffic Analysis 
(On-site web analytics) 
Internal CSR Logs, Customer 
Queries 
Automated Agent discussions 
Complaints and Resolutions 
Employee Insights 
External Data Sources 
(Off-site web analytics) 
Industry Reports 
Market Research 
Social Media 
Social Media Analytics 
Social Media Analytics effectively converges on-site, social media and third party data 
to extract useful information 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Technical Tenets of Social Media Analytics 
Data Sources 
© 2014 7 Impetus Technologies 
Website Traffic Analysis 
(On-site web analytics) 
Internal CSR Logs, Customer 
Queries 
Automated Agent discussions 
Complaints and Resolutions 
Employee Insights 
External Data Sources 
(Off-site web analytics) 
Industry Reports 
Market Research 
Social Media 
Social Media Analytics 
Clustering Classification Sequential classification 
Entity extraction Event extraction Communication graph 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Why Big Data for Social Analytics? 
• Large data volumes in the order TBs and PBs 
• Complex unstructured data from social sources 
• Deeper insights into customers and trends 
• Storing images, videos 
• The bottom-line - $/TB 
© 2014 8 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
Technologies 
• Data collection - Social media data 
– Live feeds 
– Historical bulk data 
• NLP (NLTK is a good option) 
• Data preparation/ Mashup 
– M/R, PIG, Hive, Oozie, R, Sqoop 
• Classification/ Clustering (Mahout) 
• Recommendation (Mahout) 
• Loopback/ Feed output to live applications 
• Analytical reporting and deep mining 
© 2014 9 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
• Collecting Twitter Feed (Streaming feed) using filter fire 
hose 
– Tweets for keywords 
– Based on brand, product, category, industry, product 
segment, special offers and marketing buzz words 
– Streaming API and HBASE based sink for high writes 
• Collect/create training data 
– Standalone Tweets for individual keywords 
© 2014 10 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
• Creating or classifying text data and demographics 
• Quantitative analytics 
• Ascertaining daily trend 
• General tweets v/s product-specific tweets 
• Tweets targeted at competitors v/s own product 
• Location based trends (for available data sets) 
• Identifying and categorizing the output 
• Sentiment analysis of own product - Good, Neutral, 
Bad 
• Use training data for classification - Mahout/NLTK 
• Run trained models on Tweet data - Mahout/NLTK 
© 2014 11 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Our Recommended Approach 
• Mash up Analytics from RDBMS with Social media 
analytics 
• Using customer data to recommend new/related 
products 
• Preparing mock customer data for Social ID mapping 
• Running recommendations (item or user based) using 
Mahout 
• Analytical Reporting 
• Demonstrates drill down reports on data generated by 
Mahout 
• Reports over Hive/MySQL using a traditional Reporting 
product or framework 
© 2014 12 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
© 2014 13 Impetus Technologies 
iLaDaP 
Impetus Large Data Analytics Platform
iLaDaP- Technology Stack 
• Scalable data store 
– Hadoop HDFS 
– Hbase 
• Connectors (In/Out) 
– Flume 
– Sqoop 
– Messaging queue 
– ESB- Apache Camel 
• Analytics and ETL 
– Mahout for NL and text mining 
• Classification/ Clustering 
• Recommendation 
– Oozie for complex ETL and workflow 
– JDBC/ODBC compliant Analytics tools – Intellicus, Jasper etc. 
© 2014 14 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Case Study- Financial Services 
The Client 
– Leading financial services company 
Key Challenge 
– Recommend products based on User profile/location 
– Recommend alternate products based Social Media feedback 
Impetus Solution 
• Proposed iLaDaP based solution 
• Sentiment Analysis using Naïve Bayesian algorithm for 
classification/sentiment analysis 
• Clustering using k-means algorithm of Mahout 
• Apache Mahout based recommendation engine 
Benefits Realised 
• Better product recommendations 
© 2014 15 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Case Study- Online Retailer 
The Client 
– Leading online product retailer 
Key Challenge 
• Recommendation engine 
• Cross product customer analysis 
• Provide ‘Big Picture’ across business units 
Impetus Solution 
• Proposed iLaDaP based solution 
• Clustering using k-means algorithm of Mahout 
• Apache Mahout based recommendation engine 
Benefits Realised 
• True centralized business overview across product and business lines 
© 2014 16 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
Summing Up 
• Using Big Data technologies for Social Analytics needs a 
well-thought of strategy 
• Open source yields better results for social media data 
• Hadoop based Big Data Analytics is a scalable and cost 
effective option. 
• Selecting the right tools is the key to build a successful 
Social Analytics EDW using Big Data 
• Easy extension of the existing Data Warehouse and 
Analytics infrastructure is possible to leverage existing 
investments 
© 2014 17 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
© 2014 18 Impetus Technologies 
About Impetus
• Strategic partners for software product engineering and 
R&D 
• Thought leaders in cutting-edge technologies 
• Mature processes and practices that are methodical, yet 
flexible 
• Diverse domain expertise 
© 2014 19 Impetus Technologies 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48
© 2014 20 Impetus Technologies 
Q & A
© 2014 21 Impetus Technologies 
Thank You 
Write to us at inquiry@impetus.com 
Follow us on Twitter @impetustech 
Recorded version available at 
http://www.impetus.com/webinar_registration?event=archived&eid=48

More Related Content

Similar to Big Data Technologies for Social Media Analytics- Impetus Webinar

The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
StampedeCon
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your business
NG DATA
 
Why Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics StrategyWhy Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics Strategy
AIPMM Administration
 

Similar to Big Data Technologies for Social Media Analytics- Impetus Webinar (20)

The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
 
Webinar: Take Your Brandwatch Data Anywhere
Webinar: Take Your Brandwatch Data AnywhereWebinar: Take Your Brandwatch Data Anywhere
Webinar: Take Your Brandwatch Data Anywhere
 
The Importance of an Analytics Platform
The Importance of an Analytics PlatformThe Importance of an Analytics Platform
The Importance of an Analytics Platform
 
Come fare business con i big data in concreto
Come fare business con i big data in concretoCome fare business con i big data in concreto
Come fare business con i big data in concreto
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your business
 
First Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring PentahoFirst Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring Pentaho
 
Why Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics StrategyWhy Your Product Needs A Data & Analytics Strategy
Why Your Product Needs A Data & Analytics Strategy
 
Mar-Tech Oversight
Mar-Tech OversightMar-Tech Oversight
Mar-Tech Oversight
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Extending BI with Big Data Analytics
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data Analytics
 
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
Direct Materials Sourcing and Procurement Strategies – Accelerating Savings a...
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdfHyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
HyperHack 2023 Global Presentation - AMER Enablement_070623.pdf
 
Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBM
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
 
Tips and Tricks for Beginning Cognos Report Studio Authors
Tips and Tricks for Beginning Cognos Report Studio AuthorsTips and Tricks for Beginning Cognos Report Studio Authors
Tips and Tricks for Beginning Cognos Report Studio Authors
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics Cloud
 

More from Impetus Technologies

Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trends
Impetus Technologies
 

More from Impetus Technologies (20)

Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
Data Warehouse Modernization Webinar Series- Critical Trends, Implementation ...
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarFuture-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
 
Building Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus WebinarBuilding Real-time Streaming Apps in Minutes- Impetus Webinar
Building Real-time Streaming Apps in Minutes- Impetus Webinar
 
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
 
Impetus White Paper- Handling Data Corruption in Elasticsearch
Impetus White Paper- Handling  Data Corruption  in ElasticsearchImpetus White Paper- Handling  Data Corruption  in Elasticsearch
Impetus White Paper- Handling Data Corruption in Elasticsearch
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix WebinarReal-world Applications of Streaming Analytics- StreamAnalytix Webinar
Real-world Applications of Streaming Analytics- StreamAnalytix Webinar
 
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
Real-time Streaming Analytics for Enterprises based on Apache Storm - Impetus...
 
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
Accelerating Hadoop Solution Lifecycle and Improving ROI- Impetus On-demand W...
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...SPARK USE CASE-  Distributed Reinforcement Learning for Electricity Market Bi...
SPARK USE CASE- Distributed Reinforcement Learning for Electricity Market Bi...
 
Enterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus WebcastEnterprise Ready Android and Manageability- Impetus Webcast
Enterprise Ready Android and Manageability- Impetus Webcast
 
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
 
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
 
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
Maturity of Mobile Test Automation: Approaches and Future Trends- Impetus Web...
 
Big Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLabBig Data Analytics with Storm, Spark and GraphLab
Big Data Analytics with Storm, Spark and GraphLab
 
Webinar maturity of mobile test automation- approaches and future trends
Webinar  maturity of mobile test automation- approaches and future trendsWebinar  maturity of mobile test automation- approaches and future trends
Webinar maturity of mobile test automation- approaches and future trends
 
Next generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph labNext generation analytics with yarn, spark and graph lab
Next generation analytics with yarn, spark and graph lab
 
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
The Shared Elephant - Hadoop as a Shared Service for Multiple Departments – I...
 
Performance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus WebcastPerformance Testing of Big Data Applications - Impetus Webcast
Performance Testing of Big Data Applications - Impetus Webcast
 

Recently uploaded

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Big Data Technologies for Social Media Analytics- Impetus Webinar

  • 1. Impetus Technologies Inc. Big Data Technologies for Social © 2014 1 Impetus Technologies Media Analytics Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=4 8
  • 2. Outline • Social Media Analytics- Need and Benefits • Effective convergence of disparate data sources • Big Data technologies to enable Social Analytics • Our recommended approach • Industry relevant use cases © 2014 2 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 3. Social Analytics Recommendation Engine © 2014 3 Impetus Technologies Reports and Statistics Data visualization Sentiment Analysis via Interactive Interface Social Media Sources Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 4. Business Intelligence & Product Research  Customer Analysis  Identifies users from different geographies, locations  Tracks users activities to determine usage patterns  Feature Analysis  Track the usage of various social features  Product Growth Analysis  Track customer feedback on products  Target the right customers  Recommendation Engine  Related products and customers  Third Party Data Analysis  Analysis of customers on third party sites © 2014 4 Impetus Technologies Social Analytics provides smarter ways of data tracking, powerful analytics and metrics for informed decision making Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 5. How it Helps? Outcome Based Approach • Customer retention • Brand building and recall (harvests/ address sentiment) • Simplifies customer service • Reduces operational cost • Builds up the customer base • Understands customer’s opinions and addresses their needs • Competition benchmarking • Proactive on demographic changes © 2014 5 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 6. Convergence of Data Sources Data Sources © 2014 6 Impetus Technologies Website Traffic Analysis (On-site web analytics) Internal CSR Logs, Customer Queries Automated Agent discussions Complaints and Resolutions Employee Insights External Data Sources (Off-site web analytics) Industry Reports Market Research Social Media Social Media Analytics Social Media Analytics effectively converges on-site, social media and third party data to extract useful information Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 7. Technical Tenets of Social Media Analytics Data Sources © 2014 7 Impetus Technologies Website Traffic Analysis (On-site web analytics) Internal CSR Logs, Customer Queries Automated Agent discussions Complaints and Resolutions Employee Insights External Data Sources (Off-site web analytics) Industry Reports Market Research Social Media Social Media Analytics Clustering Classification Sequential classification Entity extraction Event extraction Communication graph Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 8. Why Big Data for Social Analytics? • Large data volumes in the order TBs and PBs • Complex unstructured data from social sources • Deeper insights into customers and trends • Storing images, videos • The bottom-line - $/TB © 2014 8 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 9. Our Recommended Approach Technologies • Data collection - Social media data – Live feeds – Historical bulk data • NLP (NLTK is a good option) • Data preparation/ Mashup – M/R, PIG, Hive, Oozie, R, Sqoop • Classification/ Clustering (Mahout) • Recommendation (Mahout) • Loopback/ Feed output to live applications • Analytical reporting and deep mining © 2014 9 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 10. Our Recommended Approach • Collecting Twitter Feed (Streaming feed) using filter fire hose – Tweets for keywords – Based on brand, product, category, industry, product segment, special offers and marketing buzz words – Streaming API and HBASE based sink for high writes • Collect/create training data – Standalone Tweets for individual keywords © 2014 10 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 11. Our Recommended Approach • Creating or classifying text data and demographics • Quantitative analytics • Ascertaining daily trend • General tweets v/s product-specific tweets • Tweets targeted at competitors v/s own product • Location based trends (for available data sets) • Identifying and categorizing the output • Sentiment analysis of own product - Good, Neutral, Bad • Use training data for classification - Mahout/NLTK • Run trained models on Tweet data - Mahout/NLTK © 2014 11 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 12. Our Recommended Approach • Mash up Analytics from RDBMS with Social media analytics • Using customer data to recommend new/related products • Preparing mock customer data for Social ID mapping • Running recommendations (item or user based) using Mahout • Analytical Reporting • Demonstrates drill down reports on data generated by Mahout • Reports over Hive/MySQL using a traditional Reporting product or framework © 2014 12 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 13. © 2014 13 Impetus Technologies iLaDaP Impetus Large Data Analytics Platform
  • 14. iLaDaP- Technology Stack • Scalable data store – Hadoop HDFS – Hbase • Connectors (In/Out) – Flume – Sqoop – Messaging queue – ESB- Apache Camel • Analytics and ETL – Mahout for NL and text mining • Classification/ Clustering • Recommendation – Oozie for complex ETL and workflow – JDBC/ODBC compliant Analytics tools – Intellicus, Jasper etc. © 2014 14 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 15. Case Study- Financial Services The Client – Leading financial services company Key Challenge – Recommend products based on User profile/location – Recommend alternate products based Social Media feedback Impetus Solution • Proposed iLaDaP based solution • Sentiment Analysis using Naïve Bayesian algorithm for classification/sentiment analysis • Clustering using k-means algorithm of Mahout • Apache Mahout based recommendation engine Benefits Realised • Better product recommendations © 2014 15 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 16. Case Study- Online Retailer The Client – Leading online product retailer Key Challenge • Recommendation engine • Cross product customer analysis • Provide ‘Big Picture’ across business units Impetus Solution • Proposed iLaDaP based solution • Clustering using k-means algorithm of Mahout • Apache Mahout based recommendation engine Benefits Realised • True centralized business overview across product and business lines © 2014 16 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 17. Summing Up • Using Big Data technologies for Social Analytics needs a well-thought of strategy • Open source yields better results for social media data • Hadoop based Big Data Analytics is a scalable and cost effective option. • Selecting the right tools is the key to build a successful Social Analytics EDW using Big Data • Easy extension of the existing Data Warehouse and Analytics infrastructure is possible to leverage existing investments © 2014 17 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 18. © 2014 18 Impetus Technologies About Impetus
  • 19. • Strategic partners for software product engineering and R&D • Thought leaders in cutting-edge technologies • Mature processes and practices that are methodical, yet flexible • Diverse domain expertise © 2014 19 Impetus Technologies Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48
  • 20. © 2014 20 Impetus Technologies Q & A
  • 21. © 2014 21 Impetus Technologies Thank You Write to us at inquiry@impetus.com Follow us on Twitter @impetustech Recorded version available at http://www.impetus.com/webinar_registration?event=archived&eid=48