SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Downloaden Sie, um offline zu lesen
Jigsaw Academy and ValueLabs
Sentiment Analysis Competition
A Snapshot of the Winning Submissions
Business Objective
The Company wants to see if we can use the Sentiment Score
derived from the comments section in the feedback form to
predict the recommended score
● To find out sentiment score using comments given by clients.
● To build a model to predict Recommended score (RECOM) using the derived
sentiment score
Analysis by Rajesh Peruri
● Mine the "Comments" section and arrive at a sentiment score
 Association Matrix to Capture the correlation between words of interest
 Clustering Dendograms to Interpret the Sentiment of the various QPs
based on word clusters
 Derive an Linear regression Model using training dataset which
captures the dependency of RECOM on other QPs
Methodology
Analysis
Attributes Mean Rating Sentiment Sentiment Sentiment Score Sentiment Sentiment
(Original Data) (Original Data) (Dendrogram) (SA algorithm) (SA algorithm) (LRM 2)
>= mean(RECOM)
RECOM 3.32 + + 0.37 + + 0.28
OTD 3.17 - - - 0.97 -
FLEX 3.44 + + + 1.7 +
QUAL 2.98 - - - 0.57 -
COMMIT 3.44 + + + 3.85 +
PROCESS 3.19 - - - 0.21 -
TRUST 3.49 + + + 1.28 +
RESPONSE 3.42 + + + 0.1 +
PARTNER 3.37 + + + 2.3 +
ESCAL/ISSUE 3.34 + - - 0.18 -
MINDSHARE
3.22 - - + 0.48 +
Sentiment Table (+ is Positive, - is Negative)
Results
Recommend(RECOM)
• Positive rating from the original dataset
• Positive cumulative weighted sentiment,
when SA was performed on the
“Comments”
• Positive sentiment using LRM
On Time Delivery(OTD)
• Low rating from the original dataset.
• Negative sentiment = -0.92, when SA
was performed on the “Comments”
• Word clusters from Dendrogram
• Need quality, delivery
Quality(QUAL)
• Low rating from the original dataset.
• Negative sentiment = -0.57, when SA was
performed on the “Comments”
• Word clusters from Dendrogram
• Need quality, delivery
• Improve process and testing
• Require detail understanding
Commitment(COMMIT)
• High rating from the original dataset.
• Positive sentiment = 3.85, when SA was
performed on the “Comments”
• Word clusters from Dendrogram
• Provide consistency, meet expectation
• Always commit really
• Happy service
Results
• Year 2011 has been the worst
 Low rating from the original dataset.
 Low sentiment, when SA was performed on the “Comments” section from 2011
 Need to investigate what went wrong
• Year 2013 has been the best
 High rating from the original dataset.
 High sentiment, when SA was performed on the “Comments” section from 2013
 Need to investigate what went right
• These countries have given low rating. Investigate.
 MAL, ME, ROW
 UK, USA sometimes
• These countries have given high rating(3/4), in general
 IND, AUS, UK/Europe, USA
• The following LRM has been deduced
 recom ~ (0.15*otd) + (0.15*flex) + (0.19*qual) + (0.11*proce) + (0.12*respons) + (0.11*partn) + (0.14*escal) + (0.12*minds)
Analysis by Rajanikar
● Overall Sentiment: Positive, Sentiment Score of 2 approx.
● Sentiment Analysis of Comments
 Extracted all positive and negative words
 Identified positive and negative words in each comment
 Used dictionary to assign sentiment score to words
 Identified negations and adjusted word sentiment
 Polarity of word's sentiment was reversed
 Identified suggestions and recommendations
 Sentiment score calculated for each comment
Methodology
Model
Model Used: Sentiment Score Density as only Predictor Variable
 RECOM = 9.848 + 3.848*(Score Density)
 Here, Normalized Score (Score Density is used for the modeling)
 80% data-points taken for training the model and 20% testing
Results:
 60% of cases were classified correctly
 However, if only polarity is considered 86% of cases were classified correctly
 Out of cases of incorrect classification, 4 cases have negative sentiment score- This
shows that model is not good in classifying cases with negative sentiments
Word Cloud Analysis
Most Frequent Positive Polarity words:
Excellent, Good, Like & Support
Most Frequent Negative Polarity words:
Issues, Issue
Cluster Dendogram
Conclusion
● Comparing sentiment score directly with Recommended Score- Although, only in
35% of cases Normalized Score matches perfectly with recommended Score, 88%
of cases were classified correctly in terms of polarity
● Therefore, sentiment score itself gives better result than using regression model
to predict recommended score.
Analysis by Priyadarshini
Methodology
Jeffrey Breen Sentiment
Algorithm was implemented.
It estimates the sentiment by
assigning an integer score by
subtracting the number of
occurrences of negative words
from that of the positive
words.
Analysis
• The histogram and ggplot of the sentiment scores for all the comments spanning across the year 2009
to 2013 has been plotted.
• The plot is right tailed and we can infer that comments are more on the positive note and there are
very few negative comments.
Analysis
Plot of year-wise score distribution :Based on the plot we
can infer that there are more positive comments for the
company and minimal negative comments.
Sentiment Score : The total positive count, negative count and
Sentiment Score percentage for each year from 2009 to 2013
has been tabulated below
Ratings Score : The average rating for each record in the dataset is
calculated using the formula:
Score=(OTD+FLEX+QUAL+COMMIT+PROCE+TRU.REL+
RESPONS+PARTN+ESCAL+MINDS+OVERAL+RECOM)/12
Results
• From the table we can infer that sentiment score
obtained by using the Jeffrey Breen algorithm
almost coincide with the ratings score.
• Therefore we can conclude that the sentiment score
derived from the comments useful in predicting the
recommended score.
• The straight line represents a simple linear regression between the
rating score and the sentiment scores.
• According to the plot, we can infer that the sentiment score are
pretty close to our ratings score

Weitere ähnliche Inhalte

Andere mochten auch

Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Parinds...
 Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation  - Parinds... Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation  - Parinds...
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Parinds...Jigsaw Academy
 
Jigsaw Academy Digital India Contest - Andhra Pradesh & Karnataka
Jigsaw Academy Digital India Contest - Andhra Pradesh & KarnatakaJigsaw Academy Digital India Contest - Andhra Pradesh & Karnataka
Jigsaw Academy Digital India Contest - Andhra Pradesh & KarnatakaJigsaw Academy
 
Jigsaw Academy Digital India Contest - Uttarakhand & Himachal Pradesh
Jigsaw Academy Digital India Contest - Uttarakhand & Himachal PradeshJigsaw Academy Digital India Contest - Uttarakhand & Himachal Pradesh
Jigsaw Academy Digital India Contest - Uttarakhand & Himachal PradeshJigsaw Academy
 
CUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK Project
 
Outreach campaign status module
Outreach campaign status moduleOutreach campaign status module
Outreach campaign status moduleeCairn Inc.
 
Deep Dive - Consumer Sentiment Rating & Analysis White Paper
Deep Dive - Consumer Sentiment Rating & Analysis White PaperDeep Dive - Consumer Sentiment Rating & Analysis White Paper
Deep Dive - Consumer Sentiment Rating & Analysis White PaperJon LeMire
 
Sentiment analysis module
Sentiment analysis moduleSentiment analysis module
Sentiment analysis moduleeCairn Inc.
 
Proposal final
Proposal finalProposal final
Proposal finalMido Razaz
 
Sentiment analytics
Sentiment analytics Sentiment analytics
Sentiment analytics Kamalika Some
 
Jigsaw Academy Digital India Contest - Kerala & Tamil Nadu
Jigsaw Academy Digital India Contest - Kerala & Tamil NaduJigsaw Academy Digital India Contest - Kerala & Tamil Nadu
Jigsaw Academy Digital India Contest - Kerala & Tamil NaduJigsaw Academy
 
Aspect-level sentiment analysis of customer reviews using Double Propagation
Aspect-level sentiment analysis of customer reviews using Double PropagationAspect-level sentiment analysis of customer reviews using Double Propagation
Aspect-level sentiment analysis of customer reviews using Double PropagationHardik Dalal
 
Psychographic Marketing | What You Show Know
Psychographic Marketing | What You Show KnowPsychographic Marketing | What You Show Know
Psychographic Marketing | What You Show KnowGet A Clue Marketing Show
 
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Yelp Data Challenge - Discovering Latent Factors using Ratings and ReviewsYelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Yelp Data Challenge - Discovering Latent Factors using Ratings and ReviewsTharindu Mathew
 
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM ConferenceJoshua Tree Internet Media, LLC
 
Snapchat Group Snaps Proposal
Snapchat Group Snaps ProposalSnapchat Group Snaps Proposal
Snapchat Group Snaps ProposalRyan Cunningham
 
Apache Giraph: Large-scale graph processing done better
Apache Giraph: Large-scale graph processing done betterApache Giraph: Large-scale graph processing done better
Apache Giraph: Large-scale graph processing done better🧑‍💻 Manuel Coppotelli
 
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets🧑‍💻 Manuel Coppotelli
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis worksCJ Jenkins
 

Andere mochten auch (20)

Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Parinds...
 Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation  - Parinds... Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation  - Parinds...
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Parinds...
 
Jigsaw Academy Digital India Contest - Andhra Pradesh & Karnataka
Jigsaw Academy Digital India Contest - Andhra Pradesh & KarnatakaJigsaw Academy Digital India Contest - Andhra Pradesh & Karnataka
Jigsaw Academy Digital India Contest - Andhra Pradesh & Karnataka
 
Jigsaw Academy Digital India Contest - Uttarakhand & Himachal Pradesh
Jigsaw Academy Digital India Contest - Uttarakhand & Himachal PradeshJigsaw Academy Digital India Contest - Uttarakhand & Himachal Pradesh
Jigsaw Academy Digital India Contest - Uttarakhand & Himachal Pradesh
 
CUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK research on social aspects
CUbRIK research on social aspects
 
Outreach campaign status module
Outreach campaign status moduleOutreach campaign status module
Outreach campaign status module
 
Deep Dive - Consumer Sentiment Rating & Analysis White Paper
Deep Dive - Consumer Sentiment Rating & Analysis White PaperDeep Dive - Consumer Sentiment Rating & Analysis White Paper
Deep Dive - Consumer Sentiment Rating & Analysis White Paper
 
Sentiment analysis module
Sentiment analysis moduleSentiment analysis module
Sentiment analysis module
 
Proposal final
Proposal finalProposal final
Proposal final
 
Sentiment analytics
Sentiment analytics Sentiment analytics
Sentiment analytics
 
Jigsaw Academy Digital India Contest - Kerala & Tamil Nadu
Jigsaw Academy Digital India Contest - Kerala & Tamil NaduJigsaw Academy Digital India Contest - Kerala & Tamil Nadu
Jigsaw Academy Digital India Contest - Kerala & Tamil Nadu
 
Aspect-level sentiment analysis of customer reviews using Double Propagation
Aspect-level sentiment analysis of customer reviews using Double PropagationAspect-level sentiment analysis of customer reviews using Double Propagation
Aspect-level sentiment analysis of customer reviews using Double Propagation
 
Psychographic Marketing | What You Show Know
Psychographic Marketing | What You Show KnowPsychographic Marketing | What You Show Know
Psychographic Marketing | What You Show Know
 
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Yelp Data Challenge - Discovering Latent Factors using Ratings and ReviewsYelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
 
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference
 
Snapchat Group Snaps Proposal
Snapchat Group Snaps ProposalSnapchat Group Snaps Proposal
Snapchat Group Snaps Proposal
 
Apache Giraph: Large-scale graph processing done better
Apache Giraph: Large-scale graph processing done betterApache Giraph: Large-scale graph processing done better
Apache Giraph: Large-scale graph processing done better
 
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and TweetsSentiCheNews - Sentiment Analysis on Newspapers and Tweets
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
 
Yelp Project
Yelp ProjectYelp Project
Yelp Project
 
Yelp final
Yelp finalYelp final
Yelp final
 
How Sentiment Analysis works
How Sentiment Analysis worksHow Sentiment Analysis works
How Sentiment Analysis works
 

Ähnlich wie Snapshot of winning submissions- Jigsaw Academy ValueLabs Sentiment Analysis feb2015

presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdfcaa28steve
 
Optimizing marketing campaigns using experimental designs
Optimizing marketing campaigns using experimental designsOptimizing marketing campaigns using experimental designs
Optimizing marketing campaigns using experimental designsPankaj Sharma
 
Customer Satisfaction Data - Multiple Linear Regression Model.pdf
Customer Satisfaction Data -  Multiple Linear Regression Model.pdfCustomer Satisfaction Data -  Multiple Linear Regression Model.pdf
Customer Satisfaction Data - Multiple Linear Regression Model.pdfruwanp2000
 
Nova Southeastern University H. Wayne Huizenga School of Bus.docx
Nova Southeastern University H. Wayne Huizenga School of Bus.docxNova Southeastern University H. Wayne Huizenga School of Bus.docx
Nova Southeastern University H. Wayne Huizenga School of Bus.docxhenrymartin15260
 
Textual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsTextual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsYousef Fadila
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET Journal
 
OPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNET
OPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNETOPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNET
OPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNETijfcstjournal
 
Predicting Cab Booking Cancellations- Data Mining Project
Predicting Cab Booking Cancellations- Data Mining ProjectPredicting Cab Booking Cancellations- Data Mining Project
Predicting Cab Booking Cancellations- Data Mining Projectraj
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Md. Main Uddin Rony
 
Sentiment analysis presentation
Sentiment analysis presentationSentiment analysis presentation
Sentiment analysis presentationGunjanSrivastava23
 
measurment scaling & businesses data.pdf
measurment scaling & businesses data.pdfmeasurment scaling & businesses data.pdf
measurment scaling & businesses data.pdfmaheebhavinshah
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceAmit Sharma
 
How to choose the right machine learning algorithm for your project
How to choose the right machine learning algorithm for your projectHow to choose the right machine learning algorithm for your project
How to choose the right machine learning algorithm for your projectFathimaZajelLiyakath
 
Operational Risk Management: Standard Requirements
Operational Risk Management: Standard RequirementsOperational Risk Management: Standard Requirements
Operational Risk Management: Standard RequirementsGerardus Blokdyk
 
Behind The Scenes Data Science Coolblue 2018-03-22
Behind The Scenes Data Science Coolblue 2018-03-22Behind The Scenes Data Science Coolblue 2018-03-22
Behind The Scenes Data Science Coolblue 2018-03-22Matthias Schuurmans
 
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONSCOMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONSijnlc
 

Ähnlich wie Snapshot of winning submissions- Jigsaw Academy ValueLabs Sentiment Analysis feb2015 (20)

presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdf
 
Optimizing marketing campaigns using experimental designs
Optimizing marketing campaigns using experimental designsOptimizing marketing campaigns using experimental designs
Optimizing marketing campaigns using experimental designs
 
Customer Satisfaction Data - Multiple Linear Regression Model.pdf
Customer Satisfaction Data -  Multiple Linear Regression Model.pdfCustomer Satisfaction Data -  Multiple Linear Regression Model.pdf
Customer Satisfaction Data - Multiple Linear Regression Model.pdf
 
Nova Southeastern University H. Wayne Huizenga School of Bus.docx
Nova Southeastern University H. Wayne Huizenga School of Bus.docxNova Southeastern University H. Wayne Huizenga School of Bus.docx
Nova Southeastern University H. Wayne Huizenga School of Bus.docx
 
Textual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsTextual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie Reviews
 
Final.Version
Final.VersionFinal.Version
Final.Version
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
 
Mr4 ms10
Mr4 ms10Mr4 ms10
Mr4 ms10
 
OPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNET
OPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNETOPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNET
OPTIMIZATION OF CROSS DOMAIN SENTIMENT ANALYSIS USING SENTIWORDNET
 
Predicting Cab Booking Cancellations- Data Mining Project
Predicting Cab Booking Cancellations- Data Mining ProjectPredicting Cab Booking Cancellations- Data Mining Project
Predicting Cab Booking Cancellations- Data Mining Project
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
Sentiment analysis presentation
Sentiment analysis presentationSentiment analysis presentation
Sentiment analysis presentation
 
measurment scaling & businesses data.pdf
measurment scaling & businesses data.pdfmeasurment scaling & businesses data.pdf
measurment scaling & businesses data.pdf
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
How to choose the right machine learning algorithm for your project
How to choose the right machine learning algorithm for your projectHow to choose the right machine learning algorithm for your project
How to choose the right machine learning algorithm for your project
 
Checkpoint comparison
Checkpoint comparisonCheckpoint comparison
Checkpoint comparison
 
Operational Risk Management: Standard Requirements
Operational Risk Management: Standard RequirementsOperational Risk Management: Standard Requirements
Operational Risk Management: Standard Requirements
 
Behind The Scenes Data Science Coolblue 2018-03-22
Behind The Scenes Data Science Coolblue 2018-03-22Behind The Scenes Data Science Coolblue 2018-03-22
Behind The Scenes Data Science Coolblue 2018-03-22
 
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONSCOMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
 
Py data19 final
Py data19   finalPy data19   final
Py data19 final
 

Mehr von Jigsaw Academy

Jigsaw Academy Pexitics Student Projects
Jigsaw Academy Pexitics Student ProjectsJigsaw Academy Pexitics Student Projects
Jigsaw Academy Pexitics Student ProjectsJigsaw Academy
 
Taximan Challenge on Data Visualization - Vicky Crasto
Taximan Challenge on Data Visualization - Vicky CrastoTaximan Challenge on Data Visualization - Vicky Crasto
Taximan Challenge on Data Visualization - Vicky CrastoJigsaw Academy
 
Jigsaw Academy Cafe Great Contest - Winning Presentations
Jigsaw Academy Cafe Great Contest - Winning PresentationsJigsaw Academy Cafe Great Contest - Winning Presentations
Jigsaw Academy Cafe Great Contest - Winning PresentationsJigsaw Academy
 
Topic 1 Introduction to web analytics
Topic  1   Introduction to web analytics Topic  1   Introduction to web analytics
Topic 1 Introduction to web analytics Jigsaw Academy
 
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An...
 Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An... Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An...
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An...Jigsaw Academy
 
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A...
 Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A... Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A...
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A...Jigsaw Academy
 
Advanced certication in financial analytics e-brochure
Advanced certication in financial analytics e-brochureAdvanced certication in financial analytics e-brochure
Advanced certication in financial analytics e-brochureJigsaw Academy
 

Mehr von Jigsaw Academy (8)

Jigsaw Academy Pexitics Student Projects
Jigsaw Academy Pexitics Student ProjectsJigsaw Academy Pexitics Student Projects
Jigsaw Academy Pexitics Student Projects
 
Taximan Challenge on Data Visualization - Vicky Crasto
Taximan Challenge on Data Visualization - Vicky CrastoTaximan Challenge on Data Visualization - Vicky Crasto
Taximan Challenge on Data Visualization - Vicky Crasto
 
Jigsaw Academy Cafe Great Contest - Winning Presentations
Jigsaw Academy Cafe Great Contest - Winning PresentationsJigsaw Academy Cafe Great Contest - Winning Presentations
Jigsaw Academy Cafe Great Contest - Winning Presentations
 
The Jigsaw Team
The Jigsaw TeamThe Jigsaw Team
The Jigsaw Team
 
Topic 1 Introduction to web analytics
Topic  1   Introduction to web analytics Topic  1   Introduction to web analytics
Topic 1 Introduction to web analytics
 
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An...
 Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An... Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An...
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Shyam An...
 
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A...
 Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A... Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A...
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Balaji A...
 
Advanced certication in financial analytics e-brochure
Advanced certication in financial analytics e-brochureAdvanced certication in financial analytics e-brochure
Advanced certication in financial analytics e-brochure
 

Kürzlich hochgeladen

Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...ttt fff
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excelysmaelreyes
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 

Kürzlich hochgeladen (20)

Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excel
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 

Snapshot of winning submissions- Jigsaw Academy ValueLabs Sentiment Analysis feb2015

  • 1. Jigsaw Academy and ValueLabs Sentiment Analysis Competition A Snapshot of the Winning Submissions
  • 2. Business Objective The Company wants to see if we can use the Sentiment Score derived from the comments section in the feedback form to predict the recommended score ● To find out sentiment score using comments given by clients. ● To build a model to predict Recommended score (RECOM) using the derived sentiment score
  • 4. ● Mine the "Comments" section and arrive at a sentiment score  Association Matrix to Capture the correlation between words of interest  Clustering Dendograms to Interpret the Sentiment of the various QPs based on word clusters  Derive an Linear regression Model using training dataset which captures the dependency of RECOM on other QPs Methodology
  • 5. Analysis Attributes Mean Rating Sentiment Sentiment Sentiment Score Sentiment Sentiment (Original Data) (Original Data) (Dendrogram) (SA algorithm) (SA algorithm) (LRM 2) >= mean(RECOM) RECOM 3.32 + + 0.37 + + 0.28 OTD 3.17 - - - 0.97 - FLEX 3.44 + + + 1.7 + QUAL 2.98 - - - 0.57 - COMMIT 3.44 + + + 3.85 + PROCESS 3.19 - - - 0.21 - TRUST 3.49 + + + 1.28 + RESPONSE 3.42 + + + 0.1 + PARTNER 3.37 + + + 2.3 + ESCAL/ISSUE 3.34 + - - 0.18 - MINDSHARE 3.22 - - + 0.48 + Sentiment Table (+ is Positive, - is Negative)
  • 6. Results Recommend(RECOM) • Positive rating from the original dataset • Positive cumulative weighted sentiment, when SA was performed on the “Comments” • Positive sentiment using LRM On Time Delivery(OTD) • Low rating from the original dataset. • Negative sentiment = -0.92, when SA was performed on the “Comments” • Word clusters from Dendrogram • Need quality, delivery Quality(QUAL) • Low rating from the original dataset. • Negative sentiment = -0.57, when SA was performed on the “Comments” • Word clusters from Dendrogram • Need quality, delivery • Improve process and testing • Require detail understanding Commitment(COMMIT) • High rating from the original dataset. • Positive sentiment = 3.85, when SA was performed on the “Comments” • Word clusters from Dendrogram • Provide consistency, meet expectation • Always commit really • Happy service
  • 7. Results • Year 2011 has been the worst  Low rating from the original dataset.  Low sentiment, when SA was performed on the “Comments” section from 2011  Need to investigate what went wrong • Year 2013 has been the best  High rating from the original dataset.  High sentiment, when SA was performed on the “Comments” section from 2013  Need to investigate what went right • These countries have given low rating. Investigate.  MAL, ME, ROW  UK, USA sometimes • These countries have given high rating(3/4), in general  IND, AUS, UK/Europe, USA • The following LRM has been deduced  recom ~ (0.15*otd) + (0.15*flex) + (0.19*qual) + (0.11*proce) + (0.12*respons) + (0.11*partn) + (0.14*escal) + (0.12*minds)
  • 9. ● Overall Sentiment: Positive, Sentiment Score of 2 approx. ● Sentiment Analysis of Comments  Extracted all positive and negative words  Identified positive and negative words in each comment  Used dictionary to assign sentiment score to words  Identified negations and adjusted word sentiment  Polarity of word's sentiment was reversed  Identified suggestions and recommendations  Sentiment score calculated for each comment Methodology
  • 10. Model Model Used: Sentiment Score Density as only Predictor Variable  RECOM = 9.848 + 3.848*(Score Density)  Here, Normalized Score (Score Density is used for the modeling)  80% data-points taken for training the model and 20% testing Results:  60% of cases were classified correctly  However, if only polarity is considered 86% of cases were classified correctly  Out of cases of incorrect classification, 4 cases have negative sentiment score- This shows that model is not good in classifying cases with negative sentiments
  • 11. Word Cloud Analysis Most Frequent Positive Polarity words: Excellent, Good, Like & Support Most Frequent Negative Polarity words: Issues, Issue
  • 13. Conclusion ● Comparing sentiment score directly with Recommended Score- Although, only in 35% of cases Normalized Score matches perfectly with recommended Score, 88% of cases were classified correctly in terms of polarity ● Therefore, sentiment score itself gives better result than using regression model to predict recommended score.
  • 15. Methodology Jeffrey Breen Sentiment Algorithm was implemented. It estimates the sentiment by assigning an integer score by subtracting the number of occurrences of negative words from that of the positive words.
  • 16. Analysis • The histogram and ggplot of the sentiment scores for all the comments spanning across the year 2009 to 2013 has been plotted. • The plot is right tailed and we can infer that comments are more on the positive note and there are very few negative comments.
  • 17. Analysis Plot of year-wise score distribution :Based on the plot we can infer that there are more positive comments for the company and minimal negative comments. Sentiment Score : The total positive count, negative count and Sentiment Score percentage for each year from 2009 to 2013 has been tabulated below Ratings Score : The average rating for each record in the dataset is calculated using the formula: Score=(OTD+FLEX+QUAL+COMMIT+PROCE+TRU.REL+ RESPONS+PARTN+ESCAL+MINDS+OVERAL+RECOM)/12
  • 18. Results • From the table we can infer that sentiment score obtained by using the Jeffrey Breen algorithm almost coincide with the ratings score. • Therefore we can conclude that the sentiment score derived from the comments useful in predicting the recommended score. • The straight line represents a simple linear regression between the rating score and the sentiment scores. • According to the plot, we can infer that the sentiment score are pretty close to our ratings score