Insights from Data: Overcoming Objections

A DATA SCIENCE COMPANY
We handle terabyte-size data via non-traditional analytics and visualise it in real-time.
Gramener visualises
your data
Gramener transforms your data into concise dashboards
that make your business problem & solution visually obvious.
We help you find insights quickly, based on cognitive research,
and our visualisations guide you towards actionable decisions.

S ANAND, GRAMENER
HOW YOU CAN GET
INSIGHTS FROM DATA
OVERCOMING COMMON OBJECTIONS ON READINESS

DATA
ANALYSIS VISUALS EXPLORATION
IS
EVERYWHERE

DATA
IS
EVERYWHERE
COMMON COMPLAINT #1
WE CAN’T DRILL INTO RAW DATA

IMPACT OF THE BUDGET ON STOCK PRICES

INDIA’S BUDGET: FORECASTING & PLANNING

DATA
IS
EVERYWHERE
COMMON COMPLAINT #2
WE ALREADY USE CHARTS

TIMES NOW COVERAGE HAD
80%+ VIEWERSHIP

DATA
IS
EVERYWHERE
COMMON COMPLAINT #3
NOT INTEGRATED IN WORKFLOW

Portfolio Performance Visual
Worldwide: $288.0mn
A: Accelerate: $68.9mn
B: Build: $77.2mn
C: Cut down: $141.9mn
The visualization shows market opportunities
across countries to identify areas of focus.
The chart is built as an interactive app that
presents the key findings while letting users
click through and drill down to a custom view
across 4 different levels.

DATA
IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE THE TOOLS

Billing fraud at an energy utility
An energy utility (with over 50 million subscribers) had 10 years' worth of
customer billing data available. Most fraud detection software failed to
load the data, and sampled data revealed little or no insight.
This plot shows the frequency of all meter readings from Apr-2010 to
Mar-2011. It is a simple histogram (or frequency distribution) of usage levels.
Each bar represents the number of customers with a specific bill amount
(in units, or kWh). An unusually large number of readings are aligned with
the slab boundaries.
Tariffs are based on the usage slab. Someone with 101 units is billed in
full at a higher tariff than someone with 100 units. So people have a
strong incentive to stay at or within a slab boundary.
This can happen in one of two ways. First, people may be monitoring their
usage very carefully, and turn off their lights and fans the instant their
usage hits the slab boundary.
Or, more realistically, there’s probably some level of corruption
involved, where customers pay a small sum to the meter reading staff
to ensure that the reading stays exactly at the slab boundary, giving them
the advantage of a lower price.
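The check described on this slide can be sketched in a few lines. This is a minimal illustration, not the utility's actual method: the function name `boundary_spikes`, the `ratio` threshold, the slab boundaries and the sample readings are all assumptions made for the example.

```python
from collections import Counter

def boundary_spikes(readings, boundaries, ratio=3.0):
    """Return slab boundaries whose reading count is unusually high.

    A boundary is flagged when its count is at least `ratio` times the
    average count of its two immediate neighbours (boundary - 1, boundary + 1).
    The threshold is an illustrative assumption, not a calibrated rule.
    """
    counts = Counter(readings)
    flagged = []
    for b in boundaries:
        neighbours = (counts[b - 1] + counts[b + 1]) / 2
        # Treat empty neighbours as 1 so sparse data does not flag everything.
        if counts[b] >= ratio * max(neighbours, 1):
            flagged.append(b)
    return flagged

# Made-up readings (units, kWh): a pile-up at 100, nothing unusual at 200.
sample = [98, 99, 99, 100, 100, 100, 100, 100, 100, 100, 101, 102, 199, 200, 201]
print(boundary_spikes(sample, boundaries=[100, 200]))
```

Counting every reading, rather than sampling, is the point: the spike only shows up when the full frequency distribution is available.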

This is a dataset (1975 – 1990) that has
been around for several years, and has
been studied extensively. Yet, a
visualization can reveal patterns that
are neither obvious nor well known.
For example,
• Are birthdays uniformly distributed?
• Do doctors or parents exercise the C-section option to move dates?
• Is there any day of the month that has unusually high or low births?
• Are there any months with relatively high or low births?
Colour scale: more births to fewer births, on average, for each day of the year (from 1975 to 1990)
LET’S LOOK AT 15 YEARS OF US BIRTH DATA
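The quantity behind this calendar view, average births for each day of the year, can be computed roughly as follows. This is a sketch under assumptions: the `(date, births)` record layout and the numbers in `records` are made up for illustration and are not taken from the dataset.

```python
from collections import defaultdict
from datetime import date

def average_births_by_day(records):
    """Average births for each (month, day) across all years in `records`.

    `records` is an iterable of (date, births) pairs (an assumed layout).
    """
    totals = defaultdict(int)
    years = defaultdict(set)
    for day, births in records:
        key = (day.month, day.day)
        totals[key] += births
        years[key].add(day.year)
    return {key: totals[key] / len(years[key]) for key in totals}

# Made-up counts for two calendar days over two years.
records = [
    (date(1975, 7, 4), 8000), (date(1976, 7, 4), 8200),
    (date(1975, 7, 5), 10000), (date(1976, 7, 5), 10400),
]
averages = average_births_by_day(records)
overall = sum(averages.values()) / len(averages)
for key, value in sorted(averages.items()):
    # Days above the overall mean would be shaded as "more births".
    print(key, value, "more" if value > overall else "fewer")
```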

THE PATTERN IN INDIA IS QUITE DIFFERENT
This is a birth date dataset that’s
obtained from school admission data
for over 10 million children. When we
compare this with births in the US, we
see none of the same patterns.
For example,
• Is there an aversion to the 13th or is there a local cultural nuance?
• Are holidays avoided for births?
• Which months have a higher propensity for births, and why?
• Are there any patterns not found in the US data?
Colour scale: more births to fewer births, on average, for each day of the year (from 2007 to 2013)

THIS ADVERSELY IMPACTS CHILDREN’S MARKS
It’s a well-established fact that older
children tend to do better at school in
most activities. Since many children
have had their birth dates brought
forward, these younger children suffer.
Children “born” on the 1st, 5th, 10th, 15th etc. of the month tend to score
lower marks on average.
• Are holidays avoided for births?
• Which months have a higher propensity for births, and why?
• Are there any patterns not found in the US data?
Colour scale: higher marks to lower marks, on average, for children born on a given day of the year (from 2007 to 2013)
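The comparison on this slide amounts to a difference of group means. A minimal sketch, assuming `(day_of_month, marks)` pairs as input: the function name `marks_gap` and the sample values are hypothetical, and `round_days` lists only the days the slide names explicitly, so extend it as needed.

```python
from statistics import mean

def marks_gap(students, round_days=(1, 5, 10, 15)):
    """Mean marks on `round_days` minus mean marks on all other days.

    `students` is an iterable of (day_of_month, marks) pairs (assumed layout).
    A negative result means children recorded on round days score lower.
    """
    on_round = [m for d, m in students if d in round_days]
    others = [m for d, m in students if d not in round_days]
    return mean(on_round) - mean(others)

# Made-up marks, for illustration only.
students = [(1, 60), (5, 62), (10, 58), (3, 70), (17, 66), (22, 68)]
print(marks_gap(students))
```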

DEPLOY
MODERN
TOOLS
ANALYSIS IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE THE TOOLS
COMMON COMPLAINT #2
WE DON’T GET INSIGHTS
R
SAS
EXCEL
PYTHON
DATABASES
ML SERVICES

68% correlation
between AUD & EUR
Plot of 6 month daily
AUD - EUR values
Block of correlated
currencies
… clustered
hierarchically
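The two steps named on this slide, pairwise correlation followed by hierarchical clustering, can be sketched with the standard library alone. This is an illustration under assumptions: average linkage is one common choice and may not be what produced the chart, the extra currency codes are included only to make a second block, and all values are made up.

```python
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def cluster_order(series):
    """Order series names so that correlated ones sit next to each other.

    Average-linkage agglomerative clustering: repeatedly merge the two
    clusters with the highest mean pairwise correlation.
    """
    corr = {frozenset(p): pearson(series[p[0]], series[p[1]])
            for p in combinations(series, 2)}
    clusters = [[name] for name in series]

    def similarity(a, b):
        pairs = [corr[frozenset((i, j))] for i in a for j in b]
        return sum(pairs) / len(pairs)

    while len(clusters) > 1:
        a, b = max(combinations(clusters, 2), key=lambda ab: similarity(*ab))
        clusters.remove(a)
        clusters.remove(b)
        clusters.append(a + b)
    return clusters[0]

# Made-up daily values; not real exchange rates.
series = {
    "AUD": [1.0, 1.1, 1.3, 1.2, 1.4],
    "EUR": [2.0, 2.2, 2.5, 2.4, 2.9],
    "JPY": [5.0, 4.8, 4.5, 4.7, 4.1],
    "CHF": [5.1, 4.9, 4.4, 4.6, 4.2],
}
order = cluster_order(series)
print(order)
```

Drawing the correlation matrix with rows and columns in this order is what makes blocks of correlated currencies appear along the diagonal.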

RESTAURANT: PRODUCT SALES
CORRELATION

DATA
IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE DATA

“We have internal
information. Getting
information from outside is
our challenge. There’s no way
of doing that.”
– Senior Editor,
Leading Media Company

AUGMENT YOUR
DATA
SOURCES
DATA IS
EVERYWHERE
COMMON COMPLAINT #1
WE DON’T HAVE DATA
COMMON COMPLAINT #2
THE DATA ISN’T STRUCTURED
CRM DATA
SALES DATA
PRICING DATA
CALL RECORDS
WEB LOG DATA
VENDOR INVOICES
SOCIAL MEDIA DATA
CLICKTHROUGH DATA
COMPETITOR RESEARCH
CUSTOMER TRANSACTIONS
…
CENSUS DATA
E-COMMERCE PRICES
COMMODITY PRICES
STOCK MARKET DATA
FINANCIAL REPORTING
SOCIAL MEDIA DATA
MOBILE PENETRATION
AADHAR DATA
COURT CASE BRIEFS
SHAPE FILES
…

Recruiting top-quality developers is always a problem. We decided to use an
algorithmic approach and pulled out the social network of developers on
GitHub (a social network for open source code).
In this visualisation, each circle is a person. The size of the circle
represents the number of followers. Larger circles have more
followers (but not in proportion: it’s a log scale).
The circle’s colour represents the city the
programmers live in. This visual is a slice showing the
tale of two cities: Bangalore and Singapore.
Two people are connected if one
follows the other. This leads to a
clustering of people in the form of a
network.
Here, you can see that Bangalore and
Singapore are reasonably well
connected cities. Bangalore has more
developers, but Singapore has more
popular ones (larger circles).
However, interactions between
Bangalore and Singapore are few and
far between, except for a few people
across both cities, like:
Sudar, Yahoo!
Anand C, Consultant
Kiran, Hasgeek
Anand S, Gramener
Mugunth, Steinlogic
Honcheng, buUuk
Sau Sheong, HP Labs
Lim Chee Aung
… etc.
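Legend: circle colour = city (Bangalore, Singapore); circle size = followers (samples shown: 1 follower, 100 followers); a line = A follows B (or) B follows A.
Chart labels: Most followed in Bangalore; Most followed in Singapore; Ciju Cherian; Lin Junjie; Amudhi Sebastian.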
There are, of course, a number of smaller
independent circles – people who are not connected
to others in the same city. (They may be connected to
people in other cities.)
Apart from this, there are a few small networks of
connected people – often people within the same
company or start-up – who form a community of their
own.
THE SOCIAL TALE OF TWO CITIES: BANGALORE & SINGAPORE
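The "few and far between" observation can be checked by counting links within and across cities. A minimal sketch, assuming follow data as `(follower, followed)` pairs plus a city lookup: the function name `cross_city_links` and the people `p1` to `p4` are hypothetical, and no GitHub API calls are shown.

```python
def cross_city_links(follows, city_of):
    """Split follow links into within-city and cross-city, and list bridges.

    `follows` holds (follower, followed) pairs; direction is ignored, matching
    the slide's rule that two people are connected if one follows the other.
    Returns (within_count, across_count, sorted names on cross-city links).
    """
    links = {frozenset(pair) for pair in follows if pair[0] != pair[1]}
    within, across = 0, 0
    bridges = set()
    for link in links:
        a, b = tuple(link)
        if city_of[a] == city_of[b]:
            within += 1
        else:
            across += 1
            bridges.update(link)
    return within, across, sorted(bridges)

# Hypothetical people and links, for illustration only.
city_of = {"p1": "Bangalore", "p2": "Bangalore", "p3": "Singapore", "p4": "Singapore"}
follows = [("p1", "p2"), ("p2", "p1"), ("p3", "p4"), ("p2", "p3")]
print(cross_city_links(follows, city_of))
```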

Tata Teleservices
Tata Consultancy Services
Tata Business Support Services
Tata Global Beverages
Tata Infotech (merged)
Tata Toyo Radiator
Honeywell Automation India
Tata Communications
A G C Networks
Tata Technologies
Tata Projects
Tata Power
Tata Finance
Idea Cellular
Tata Motors
Tata Sons
Tata Steel
Tayo Rolls
Tata Securities
Tata Coffee
Tata Investment Corp
A J Engineer
H H Malgham
H K Sethna
Keshub Mahindra
Ravi Kant
Russi Mody
Sujit Gupta
A S Bam
Amal Ganguli
D B Engineer
D N Ghosh
M N Bhagwat
N N Kampani
U M Rao
B Muthuraman
Ishaat Hussain
J J Irani
N A Palkhivala
N A Soonawala
R Gopalakrishnan
Ratan Tata
S Ramadorai
S Ramakrishnan
DIRECTORSHIPS AT THE TATAS
Every person who was a Director at the Tata Group
is shown here as an orange circle. The size of the circle
is based on the number of directorship positions held
over their lifetime.
Every company in the Tata Group is shown
here as a blue circle. The size of the circle is
based on the number of directors the
company has had over time.
Every directorship relation is shown by a
line. If a person has held a directorship
position at a company, the two are
connected by a line.
The group appears to be divided into
two clusters based on the network of
directorship roles.
Prominent leaders
bridge the groups
Second group of companies
First group of companies
Some directors are
mainly associated with
the first group of
companies
Some directors are
mainly associated with
the second group of
companies
We’ve used network diagrams to detect terrorism, corporate fraud, product
affinities and behavioural customer segmentation
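The circle sizes and the "bridging" directors in a directorship network like the one above can be derived from a plain list of (director, company) pairs. This is a sketch under assumptions: the names are placeholders, and the grouping of companies is passed in as `group_of` rather than detected by clustering, which the original analysis may have done differently.

```python
from collections import defaultdict

def node_sizes(directorships):
    """Count distinct companies per director and distinct directors per company."""
    companies_of = defaultdict(set)
    directors_of = defaultdict(set)
    for director, company in directorships:
        companies_of[director].add(company)
        directors_of[company].add(director)
    return ({d: len(c) for d, c in companies_of.items()},
            {c: len(d) for c, d in directors_of.items()})

def bridging_directors(directorships, group_of):
    """Directors who have held positions in more than one group of companies."""
    groups = defaultdict(set)
    for director, company in directorships:
        groups[director].add(group_of[company])
    return sorted(d for d, g in groups.items() if len(g) > 1)

# Placeholder names, for illustration only.
directorships = [
    ("Director A", "Company X"), ("Director A", "Company Y"),
    ("Director B", "Company X"),
    ("Director C", "Company Y"), ("Director C", "Company Z"),
]
group_of = {"Company X": "first", "Company Y": "second", "Company Z": "second"}
director_size, company_size = node_sizes(directorships)
print(director_size, company_size)
print(bridging_directors(directorships, group_of))
```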

WHAT DO FINANCIAL ANALYSTS ASK IBM VS
MSFT?

How does the Mahabharata, one of the largest epics with 1.8
million words, lend itself to text analytics?
Can this ‘unstructured data’ be processed to extract
analytical insights?
What does sentiment analysis of this tome convey?
Is there a better way to explore relations between
characters?
How can closeness of characters be analysed & visualized?
VISUALISING THE
MAHABHARATA
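One simple way to approach the "closeness of characters" question is to count how often two names appear in the same sentence. This is a minimal sketch of that idea, not the method used for the visual: the function name `cooccurrence`, the sentence-splitting rule and the sample text are assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations
import re

def cooccurrence(text, characters):
    """Count how often each pair of characters is named in the same sentence.

    Pairs are stored as alphabetically sorted tuples of names.
    """
    pairs = Counter()
    for sentence in re.split(r"[.!?]+", text):
        words = set(re.findall(r"[A-Za-z]+", sentence))
        present = sorted(c for c in characters if c in words)
        pairs.update(combinations(present, 2))
    return pairs

# Made-up sample text, not a quotation from the epic.
text = ("Arjuna spoke to Krishna. Krishna advised Arjuna before the battle. "
        "Bhima fought Duryodhana.")
counts = cooccurrence(text, ["Arjuna", "Krishna", "Bhima", "Duryodhana"])
print(counts.most_common(1))
```

The resulting pair counts can serve as edge weights in a character network, so that frequently co-mentioned characters are drawn closer together.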

DATA IS
EVERYWHERE
EXTRACT THE
META DATA
AUGMENT YOUR
DATA
SOURCES
COMMON COMPLAINT #2
THE DATA ISN’T STRUCTURED
COMMON
WHO, WHAT, WHEN, WHERE
TEXT
TEXT KEYWORDS
SENTIMENT
IMAGE
VISUAL RECOGNITION
AUDIO / CALLS
TRANSCRIPTS
MOOD ANALYSIS

THE CAPABILITIES ARE
IN YOUR REACH TODAY
EXPLORE THE ART OF DATA
S ANAND s.anand@gramener.com
CEO, GRAMENER 9741552552
