SlideShare ist ein Scribd-Unternehmen logo
1 von 55
Downloaden Sie, um offline zu lesen
Workshop

 A
Fine‐Grained
Analysis
of
User‐Generated

    Content
to
Support
Decision
Making


                        Marcirio
Silveira
Chaves

                          h/p://mchaves.wikidot.com



                      Informa<on
Systems
Research
Group

         Business
and
Informa<on
Technology
Research
Centre
(BITREC)

Ins<tute
for
Scien<fic
and
Technological
Research
of
Universidade
Atlân<ca
(ISTR)

User‐Generated
Content
(UGC)

•  As
known
as
                                     •  Can
be
expressed
throught

      –  User‐Generated
Data
                              –  Opinions

      –  User‐Created
Content
                             –  Reviews

      –  User‐Contributed
Data
                            –  Comments

      –  Consumer‐Generated
                               –  Posts

         Media

      –  …


   • 
Notes:

        • 
All
the
examples
described
in
this
workshop
are
real
data.

        • 
Some
papers
men<oned
here
are
under
review.

        • 
Color
legend:

             • 
Examples

             • 
Posi<ve
feature

             • 
Nega<ve
feature


Apr‐18‐12
                    Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
      2

Example
of
UGC

•  An
opinion
posted
in
Facebook
                                 Dec‐10‐2011,
12:30
pm



      –  “would
highly
recommend
Infinity
Motorcycles,

         Southampton
for
all
motorbiking
gear.
Very

         reasonable
people.
Earlier
they
gave
me
a
full

         money
back
for
a
unused
(aer
explaining
why
it

         was
unused)
ladies
motorbike
jacket
(no
defects

         whasoever)
and
today
the
zipper
on
my
new

         jacket
was
broken
and
they
gave
me
a
brand
new

         one
(no
ques<ons
asked,
no
receipt
business
and

         no
fuss
created).
Five
Star
service.”


      –  This
user
had
226
friends.

Apr‐18‐12
            Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                            3

Some
sta<s<cs
about
UGC

•  More
than
50%
of
all
internet
visits
are
now
to

   UGC/social
media
sites.
•  More
than
75%
of
<me
spent
on
the
internet

   is
"social”.

•  Facebook
now
captures
as
much
<me
spent

   on
the
internet
as
Google,
Yahoo,
and
AOL.

•  More
than
80%
of
consumers
are
influenced

   by
Social
MarkeJng.



             Source: http://www.bbrisco.com/2010/05/social.html 
Apr‐18‐12
               Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   4

Main
Objec<ves
of
this
Workshop

•  In‐depth
analysis
of
UGC


•  Use
UGC
to
support
decision
making


•  Study
a
domain
ontology
to
support
Ar<ficial

   Intelligence
tasks


•  Address
approaches
for
sen<ment
analysis


•  From
theory
to
prac<ce:
Hands‐on
Session

Apr‐18‐12
            Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   5

Outline

Part
1
                                    Part
2


•  Workshop
Context
             •  Sen<ment
Analysis/Opinion

•  User‐Generated
Content
          Mining

   (UGC)
                        •  Polarity
Recognizer
in

•  Characterisa<on
of
UGC
          Portuguese
(PIRPO)

•  Knowledge
Engineering
‐
      •  Informa<on
Visualisa<on

   Ontology
Development

•  Hands‐on
Session
(Individual

   Task):
Dealing
with
UGC





Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   6

Context
Workshop





     A
framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web.



    
Chaves,
Marcirio
Silveira;
Trojahn,
Cássia
and
Pedron,
Cris<ane
Drebes.
     A
Framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web:
A
Hotel
Sector
Approach.
In:

     Customer
Rela<onship
Management
and
the
Social
and
Seman<c
Web:
Enabling
Cliens
Conexus.
Colomo‐Palacios,

     R.;
Varajão,
J.
and
Soto‐Acosta,
P.
(Eds.).
p.
141‐157,
Hershey,
PA:
IGI
Global,
2012.
ISBN:
978‐161‐35‐0044‐6


Apr‐18‐12
                             Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                                   7

An
Fine‐grained
Analysis
of
UGC

•  Overall
opinion
about
a
topic
is
only
a
part
of
the

   informaJon
of
interest.


•  Document‐level
senJment
classificaJon
fails
to
detect

   sen<ment
about
individual
aspects
of
the
topic.
In

   reality,
for
example,
though
one
could
be
generally

   happy
about
his
car,
he
might
be
dissaJsfied
by
the

   engine
noise.


•  To
the
manufacturers,
these
individual
weaknesses
and

   strengths
are
equally
important
to
know,
or
even
more

   valuable
than
the
overall
sa<sfac<on
level
of
customers.

   (Tang
et
al.
2009)



Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   8

UGC



      An
opinion
is
simply
a
posiJve
or
negaJve

         senJment,
view,
aPtude,
emoJon,
or

      appraisal
about
an
enJty
or
an
aspect
of
the

       enJty
(Hu
and
Liu,
2004;
Liu,
2006)
from
an

      opinion
holder
(Bethard
et
al.,
2004;
Kim
and

            Hovy,
2004;
Wiebe
et
al.,
2005).




Apr‐18‐12
         Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   9


Characterisa<on
of
UGC

•  Opinion’s
Characterisa<on

      –  I
use
and
extend
the
defini<on
proposed
by
(Ding

         et
al.,
2008;
Liu,
2010;
Mar<n
and
White,
2005)
to

         analyse
the
sentences
of
reviews.



      –  Let
the
review
be
r.



      –  In
the
most
general
case,
r
is
characterised
as
a
set

         of
the
following
elements
{O,F,SO,H,S,A,R,I,SG},

         where:



Apr‐18‐12
             Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   10

Characterisa<on
of
UGC

•  Opinion’s
Characterisa<on

      –  O:
Object

      –  F:
Feature

      –  SO:
Seman<c‐Orienta<on

      –  H:
Holder


      –  S:
Source

      –  A:
A%tude

      –  SG:
Sugges.on


      –  R:
Recommenda.on


      –  I:
Inten.on


Apr‐18‐12
          Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   11

Characterisa<on
of
UGC

1  ‐
Object
(O)

     – 
An
object
is
a
product
(e.g.
movie
and
book)
or
a

     service
(e.g.
hotel
and
restaurant)
under
review

     which
is
composed
by
features.

     – 
Objects
are
also
called
enJJes.

2
‐
Feature
(F)

      –  A
feature
is
a
component
or
part
of
an
object.


             •  actor
and
photography
are
features
on
a
movie.

             •  pool
and
staff
are
features
on
a
hotel.


      –  Features
are
also
called
aXributes
or
facets.


      –  A
feature
can
be
men<oned
explicitly
or
implicitly

         in
a
review
(Ding
et
al.
2008).


Apr‐18‐12
                 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   12

Characterisa<on
of
UGC

2.1
‐
Explicit
Feature
(F)

      –  If
a
feature
f
appears
in
review
r,
it
is
called
an

         explicit
feature
in
r.


      –  The
hotel
is
located
very
near
the
center
city.

             •  loca<on
is
an
explicit
feature.

2.2
‐
Implicit
Feature
(F):


      –  If
f
does
not
appear
in
r
but
is
implied,
it
is
called

         an
implicit
feature
in
r.


      –  Hotel
is
far
from
public
transporta<on.

             •  loca<on
is
an
implicit
feature.




Apr‐18‐12
                   Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   13

Characterisa<on
of
UGC

3
‐
Sentence‐OrientaJon
(SO)

      –  A
review
consists
of
a
sequence
of
sentences

         r=〈
s1,
s2,
…,
sm〉(Ding
et
al.,
2008).



      –  A
sentence
can
be
evaluated
as
the
following

         perspec<ves:





Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   14

Characterisa<on
of
UGC

3.1
ObjecJvity


      –  An
objec<ve
sentence
contains
or
menJon
facts.

             •  This
hotel
is
far
from
the
airport,
ca.
15km.

      –  A
subjec<ve
sentence
does
not
menJon
any
fact.

             •  The
parking
could
be
free.



3.2
Polarity

      –  It
describes
the
orientaJon
present
in
a
sentence

         (i.e.
posiJve,
negaJve,
neutral
and
irrelevant).




Apr‐18‐12
                  Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   15

Characterisa<on
of
UGC

3.3
Intensity
(strength
of
the
polarity)

      –  It
refers
to
the
strength
of
the
private
state
that
is

         being
expressed,
in
other
words,
how
strong
is
an

         emo<on
or
a
convic<on
of
belief
(Wilson,
2008).


      –  It
describes
how
intense
it
was
the
experience
using

         a
product
or
service:

             •  very
posiJve,
posiJve,
neutral,
negaJve
and
very

                negaJve.


             •  Very
kindly
staff.
refers
to
a
very
posi<ve
impression
on

                the
staff
service.




Apr‐18‐12
                  Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   16

Characterisa<on
of
UGC

4
‐
Opinion
Holder
(H)

      –  The
holder
of
a
par<cular
opinion
is
the
person
or
the

         organisaJon
that
holds
the
opinion
(Ding
et
al.,
2008).


      –  A
holder
is
iden<fied
with
demographic
characterisJcs

         (e.g.
name,
city
and
country).


      –  Sites
such
as
tripadvisor.com
and
booking.com
classify

         holders
as
types
including:

             •    families
with
older
children

             •    families
with
young
children


             •    mature
couples

             •    groups
of
friends


             •    solo
travellers


             •    young
couples


Apr‐18‐12
                      Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   17

Characterisa<on
of
UGC

5
–
Source

      –  An
informa<on
source
is
a
web
site
which
provides

         a
set
of
reviews.


             •  tripadvisor.com


             •  booking.com

             •  amazon.com




•    A:
A%tude

•    SG:
Sugges.on


•    R:
Recommenda.on


•    I:
Inten.on

Apr‐18‐12
                  Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   18

Outline

Part
1
                                    Part
2


•  Workshop
Context
             •  Sen<ment
Analysis/Opinion

•  User‐Generated
Content
          Mining

   (UGC)
                        •  Polarity
Recognizer
in

•  Characterisa<on
of
UGC
          Portuguese
(PIRPO)

•  Knowledge
Engineering
‐
      •  Informa<on
Visualisa<on

   Ontology
Development

•  Hands‐on
Session
(Individual

   Task):
Dealing
with
UGC





Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   19

Limita<ons
for
represen<ng
knowledge
in
the

              accommoda<on
sector





                                                              language?



Apr‐18‐12
        Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
        20

More
limita<ons

•  Actually,
web
agents
are
unable
to
answer

   ques<ons
such
as:

      –  What
are
the
hotels
with
longer
indoor
swimming

         pool
Jme
table
in
Roma?


      –  What
are
the
hotels
with
the
cheapest
breakfast

         in
Lisbon?


      –  What
are
the
cheapest
hotels
with
family
suite

         room
with
sea
view
in
Barcelona?




Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   21

Knowledge
Engineering

•  Ontology
as
a
support
to
evaluate
UGC

      –  Set
of
concepts
to
a
specific
domain

      –  Human
and
machine
readable

      –  Support
to
fine‐grained
analysis
of
the
instances

         (e.g.
reviews)


      –  Hontology
(H
stands
for
hotel,
hostal
and
hostel)


             •  A
robust,
coherent
and
mul<lingual
representa<on
of

                the
accommoda<on
sector.






Apr‐18‐12
                 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   22

Context
Workshop





     A
framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web.



    
Chaves,
Marcirio
Silveira;
Trojahn,
Cássia
and
Pedron,
Cris<ane
Drebes.
     A
Framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web:
A
Hotel
Sector
Approach.
In:

     Customer
Rela<onship
Management
and
the
Social
and
Seman<c
Web:
Enabling
Cliens
Conexus.
Colomo‐Palacios,

     R.;
Varajão,
J.
and
Soto‐Acosta,
P.
(Eds.).
p.
141‐157,
Hershey,
PA:
IGI
Global,
2012.
ISBN:
978‐161‐35‐0044‐6


Apr‐18‐12
                             Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                                  23

Knowledge
Engineering

 •  Development
Methodology






       –      Iden<fy
exis<ng
ontologies
on
related
domains

       –      Select
the
main
concepts
and
proper<es

       –      Organize
concepts
and
proper<es
hierarchically
into
categories

       –      Translate
the
ontology
(manual)

       –      Expand
concepts
and
proper<es
based
on
comments

       –      Translate
the
new
concepts
and
proper<es
(manual)

       –      Generate
the
ontology
in
several
formats


Chaves,
M.
S.
and
Trojahn,
C.
Towards
a
MulJlingual
Ontology
for
Ontology‐driven
Content
Mining
in
Social
Web
Sites.

Proc.
of
the
ISWC
2010
Workshops,
Volume
I,
1st
InternaJonal
Workshop
on
Cross‐Cultural
and
Cross‐Lingual
Aspects

of
the
SemanJc
Web.
Shanghai,
China,
November
7th,
2010.

 Apr‐18‐12
                          Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                            24

Knowledge
Engineering

•  Hontology

      –  A
mulJlingual
ontology
for
the
accommodaJon

         sector.



•  Demo
Protégé





    
Chaves,
M.
S.;
Freitas,
L.
A.
and
Vieira,
R.
(2012).
Hontology:
A
mulJlingual
ontology
for
the

     accommodaJon
sector.
4th
InternaJonal
Conference
on
Knowledge
Engineering
and
Ontology

     Development,
Barcelona,
Spain,
4‐7
October.
(SubmiXed)


Apr‐18‐12
                        Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                            25

Knowledge
Engineering

                                Metrics
                     Value

                  Number
of
Concepts
                           285

                  Number
of
Object
Proper<es
                    10

                  Number
of
Data
Proper<es
                      31

                            Concept
Axioms

  Preliminary
    Subconcept
axioms
                            270

  Hontology
      Equivalent
concepts
axioms
                     4

                  Disjoint
concepts
axioms
                      93

   Sta<s<cs

           Object
Property
Axioms

                  Func<onal
object
property
axioms
               6

                  Object
property
domain
axioms
                 11

                  Object
property
range
axioms
                   8

                         Data
Property
Axioms

                  Func<onal
data
property
axioms
                12

                  Object
data
domain
axioms
                     17

                  Object
data
range
axioms
                       1

Apr‐18‐12
       Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
       26

Hands‐on
Session

•  The
aim
of
this
hands‐on
session
is
to
allow
you
thinking

   in‐depth
about
UGC
on
the
context
of
the
accommoda<on

   sector.

•  You
are
going
to
receive
a
set
of
4
or
5
reviews
about

   accommoda<ons
and
should
evaluate
each
one
according

   to
the
following
parameters:

    –  Features
present
in
the
review
(see
the
concepts
of

       Hontology)

    –  Intensity
(Strength
of
the
Polarity):
(very
nega<ve,

       nega<ve,
neutral,
posi<ve,
very
posi<ve)

•  Notes:

    –  Evaluate
one
feature
per
line.

    –  Please,
save
your
sheet
in
another
file
and
send
to

       mschaves@gmail.com.
Subject:
UB:GX

    –  X
=
number
of
the
group.


Apr‐18‐12
          Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   27

Outline

Part
1
                                    Part
2


•  Workshop
Context
             •  Sen<ment
Analysis/Opinion

•  User‐Generated
Content
          Mining

   (UGC)
                        •  Polarity
Recognizer
in

•  Characterisa<on
of
UGC
          Portuguese
(PIRPO)

•  Knowledge
Engineering
‐
      •  Informa<on
Visualisa<on

   Ontology
Development

•  Hands‐on
Session
(Individual

   Task):
Dealing
with
UGC





Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   28

Sen<ment
Analysis


•  Analysis
and
automaJc
extracJon
of
SemanJc

   OrientaJon


•  SemanJc
orientaJon
refers
to
the
polarity
and

   strength
of
words,
phrases,
or
texts.


•  Approaches

      –  Lexicon‐based

             •  Dic<onaries
of
words
annotated
with
the
word´s
seman<c

                orienta<on,
or
polarity.


             •  A
manually
built
dicJonary
provides
a
solid
foundaJon
for
a

                lexicon‐based
approach
(Taboada
et.
al.,
2011).


      –  StaJsJcal
or
Machine‐learning

             •  Supervised
classifica<on


Apr‐18‐12
                   Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
    29

Sen<ment
Analysis


•  Lexicon‐based
Approach

      –  Sen<ment‐bearing
words:
a
list
of
nouns,
verbs,

         adjecJves
and
adverbs
(Chesley
et
al.,
2006)


          •  use
verbs
and
adjec<ves
to
classify
English

             opinionated
blog
texts.


      –  List
of
conjuncJons
and
connecJves
(Liu,
2010).


      –  Use
of
auxiliary
verbs
to
get
features
and
opinion‐
         oriented
words
about
products
from
texts
(Khan
et

         al.,
2010).






Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   30

Sen<ment
Analysis


•  Seed
words


      –  are
a
small
set
of
words
with
strong
negaJve
or

         posiJve
associa<ons,
such
as
excellent
or
abysmal.


      –  In
principle,
a
posi<ve
adjec<ve
should
occur
more

         frequently
alongside
the
posi<ve
seed
words,
and

         thus
will
obtain
a
posi<ve
score,
whereas
nega<ve

         adjec<ves
will
occur
most
oen
in
the
vicinity
of

         nega<ve
seed
words,
thus
obtaining
a
nega<ve

         score
(Taboada
et.
al.
2011).


             •  This
restaurant
has
a
bad
and
expensive
food.





Apr‐18‐12
                 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   31

Sen<ment
Analysis


•  Part‐of‐Speech
(PoS)

      –  In
order
to
evaluate
a
sentence
in
a
review,
we

         should
consider
the
parts‐of‐speech
men<oned

         such
as
adjecJves,
adverbs
and
verbs.


      –  Adjec<ves
are
classified
as:

             •  posi<ve
(good,
excellent
and
clean),


             •  nega<ve
(awful,
boring
and
terrible),


             •  neutral
(regular
and
indifferent)
and


             •  dual,
which
can
express
posi<ve
and
nega<ve
opinion

                (small,
long).


      –  In
some
approaches
nouns
are
represented
by

         concepts
of
a
domain
ontology
and
mapped
as

         features.


Apr‐18‐12
                 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   32

Sen<ment
Analysis


•  ConjuncJon
and
ConnecJve
(CC)

      –  Connec<ves
are
words
that
help
iden<fying

         addiJonal
adjecJve
opinion
words
and
their

         orientaJons.


      –  One
of
the
constraints
is
about
conjunc<on
(i.e.
and),

         which
says
that
conjoined
adjec<ves
usually
have
the

         same
orienta<on
(Liu,
2010).


             •  This
room
is
beau<ful
and
spacious.

                –  if
beau<ful
is
known
to
be
posi<ve,
it
can
be
inferred
that
spacious

                   is
also
posi<ve.


      –  HeurisJc:

             •  People
usually
express
the
same
opinion
on
both
sides
of

                a
conjuncJon.


Apr‐18‐12
                   Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
           33

Sen<ment
Analysis

•  ConjuncJon
and
ConnecJve
(CC)

      –  Rules
or
constraints
are
also
designed
for
other

         connec<ves
(e.g.
or,
but,
either‐or,
and
neither‐nor).


             •  This
hotel
is
beau<ful
but
difficult
to
get
there.

                –  The
occurrence
aer
the
connec<ve
but
is
an
indicator
of
a

                   nega<ve
opinion.





Apr‐18‐12
                   Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
           34

Sen<ment
Analysis

•  Strength
of
the
PolaJry
or
Intensity
or

   IntensificaJon

      –  Amplifiers
(very,
a
lot)
increase
the
seman<c
intensity

         of
a
neighboring
lexical
item;

      –  AXenuators/Downtoners
(a
li/le,
slightly)
decrease
it.




•  Some
approaches
have
implemented
intensifiers

   using
simple
addiJon
and
subtracJon

      –  if
a
posi<ve
adjec<ve
has
an
SO
value
of
2:

             •  an
amplified
adjec<ve
would
have
an
SO
value
of
3,
and


             •  a
downtoned
adjec<ve
an
SO
value
of
1.


Apr‐18‐12
                 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   35

Sen<ment
Analysis

•  NegaJon

      –  The
obvious
approach
to
nega<on
is
simply
to

         reverse
the
polarity
of
the
lexical
item
next
to
a

         negator,
changing
good
(+3)
into
not
good
(−3).


      –  Not,
none,
nobody,
never,
and
nothing,
and
other

         words,
such
as
without
or
lack.





Apr‐18‐12
            Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   36

Polarity
Recognizer
in
Portuguese
(PIRPO)


•  Polarity
Recognizer
in
Portuguese
to
classify
senJment
in

   online
reviews.



•  PIRPO
was
built
from
the
ground
to
Portuguese
for

   recognising
the
polarity
of
the
user
opinion
on

   accommoda<on
reviews.



•  Each
review
is
analysed
according
to
concepts
from
a

   domain
ontology.



•  We
decompose
the
review
in
sentences
in
order
to
assign
a

   polarity
to
each
concept
of
the
ontology
in
the
sentence.



Chaves,
M.
S.,
Freitas,
L.,
Souza,
M.
and
Vieira,
R.
PIRPO:
An
Algorithm
to
deal
with
Polarity
in
Portuguese
Online

Reviews
from
the
AccommodaJon
Sector.
17th
InternaJonal
conference
on
ApplicaJons
of
Natural
Language

Processing
to
InformaJon
Systems
(NLDB),
Groningen,
The
Netherlands,
26‐28
June
2012.


Apr‐18‐12
                           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                               37

PIRPO
Informa<on
Architecture





Apr‐18‐12
           Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   38

PIRPO

•  Reviews

      –  Full
dataset:
1500
reviews
from
January
2010
to

         April
2011
in
Portuguese,
English
and
Spanish,

         from
which
180
in
Portuguese.




•  Ontology
Concepts

      –  The
concepts
used
to
classify
the
reviews
are

         provided
by
Hontology,
which
in
its
current

         version,
has
110
concepts.




Apr‐18‐12
            Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   39

PIRPO

•  List
of
adjecJves:
It
is
composed
by
sen<ment‐
   bearing
words.


      –  This
list
of
polar
adjecJves
in
Portuguese


             •  contains
30.322
entries.


             •  is
composed
by
the
name
of
the
adjecJve
and
a
polarity

                which
can
assign
one
of
three
values:
+1,
‐1
and
0.


             •  These
values
corresponding
to
the
posiJve,
negaJve
and

                neutral
senses
of
the
adjec<ve.


      –  PIRPO
uses
this
list
to
calculate
the
semanJc

         orientaJon
of
the
concepts
found
in
the
sentences.





Apr‐18‐12
                 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   40

PIRPO
Algorithm





Apr‐18‐12
    Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   41

PIRPO
Measure
Evalua<on

•  Precision

                {relevantConcepts} ∩ {retrievedConcepts}
             P=
                          {retrievedConcepts}

•  Recall

                {relevantConcepts} ∩ {retrievedConcepts}
      €      R=
                          {relevantConcepts}

•  F‐score
(harmonic
mean
of
precision
and
recall)

    €              P×R
             F =2×
                   P+R

Apr‐18‐12
         Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   42

PIRPO
Preliminary
Results





Apr‐18‐12
        Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   43

PIRPO:
Discussion
on
the
Results

•  PIRPO
reached
a
be/er

   recall
for
concepts
with

   posi<ve
polarity,
while

   mixed
polarity
had
a

   higher
precision.


•  The
low
F‐score
can
be

   mainly
due
to
the

   algorithm
has
assigned

   a
polarity
to
a
specific

   concept
of
the

   ontology,
while
the

   human
classified
the

   review
as
a
whole.


Apr‐18‐12
            Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   44

Outline

Part
1
                                     Part
2


•  Workshop
Context
                        •  Knowledge
Engineering
‐

•  User‐Generated
Content
                     Modelling
UGC

   (UGC)
                                   •  Sen<ment
Analysis/Opinion

•  Characterisa<on
of
UGC
                     Mining

•  Knowledge
Engineering
‐
                 •  Polarity
Recognizer
in

   Ontology
Development
                       Portuguese
(PIRPO)

•  Hands‐on
Session
(Individual
            •  Informa<on
Visualisa<on

   Task):
Dealing
with
UGC





Apr‐18‐12
            Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
        45

Context
Workshop





     A
framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web.



    
Chaves,
Marcirio
Silveira;
Trojahn,
Cássia
and
Pedron,
Cris<ane
Drebes.
     A
Framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web:
A
Hotel
Sector
Approach.
In:

     Customer
Rela<onship
Management
and
the
Social
and
Seman<c
Web:
Enabling
Cliens
Conexus.
Colomo‐Palacios,

     R.;
Varajão,
J.
and
Soto‐Acosta,
P.
(Eds.).
p.
141‐157,
Hershey,
PA:
IGI
Global,
2012.
ISBN:
978‐161‐35‐0044‐6


Apr‐18‐12
                             Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                                  46

Informa<on
Visualisa<on


•  What
is
the
visual
model
of
the
poten<al
end‐user?

•  How
should
we
properly
map
and
render:

      –  the
most
valued
accommoda<on
features?

      –  the
percep<on
of
the
quality
offered
by
the
hotel?

      –  the
correla<on
between
the
guest’s
profile
and
the

         mostly
relevant
features?

      –  the
intensity
of
the
posi<vity
or
nega<vity
of
the

         features?



•  Does
the
use
of
advanced
visual
techniques
(such
as

   tree
oriented)
to
map
the
results
will
help
the

   accommoda<on
managers
and
guests
to
have
a

   be/er
insight
of
the
data?

Apr‐18‐12
             Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   47

Exploring
Informa<on
Visualisa<on


•  In
the
next
figures


      –  The
color
was
used
to
map
the
polarity
and
the

         strength
of
the
polarity
values
on
the
CO.





      –  The
size
was
used
to
map
the
frequency
that
the

         CO
is
men<oned
in
the
reviews.






Apr‐18‐12
             Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   48

Exploring
Informa<on
Visualisa<on






     Result
of
the
applica<on
of
Bubble
Tree
visualisaJon
of
the

      rela<on
among
concepts
of
the
ontology,
polarity
(le)
and

                     strength
of
the
polarity
(right).


•    Carvalho,
E.;
Chaves,
M.
S.,
2012.
Exploring
User‐Generated
Data
VisualizaJon
in
the
AccommodaJon

     Sector.
16th
InternaJonal
Conference
InformaJon
VisualisaJon,
IEEE.
(SubmiXed)


Apr‐18‐12
                          Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                             49

Exploring
Informa<on
Visualisa<on






    Results
using
Treemap
visualisaJon
of
the
rela<on
among
type
of

            customer,
concepts
of
the
ontology
and
polarity.



Apr‐18‐12
              Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
      50

Ques<onnaire
(in
Spanish)

•  You
are
going
to
receive
a
ques<onnaire
about

   informa<on
visualisa<on
using
UGC
in
the
context
of

   the
accommoda<on
sector.


•  Please,
click
here
h/p://kwiksurveys.com?u=Infovises

   to
answer
it.






Apr‐18‐12
        Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   51

Final
Remarks

•  In‐depth
analysis
of
UGC
can
be
used
as
input

   to
improve
decision
making.

•  It
is
<me
to
think
about
new
models
to
store

   UGC
data.

•  It
is
necessary
the
building
from
the
ground
of

   new
algorithms
to
deal
with
UGC
for

   languages
other
than
English.

•  InformaJon
visualisaJon
of
UGC
is
in
its

   infancy
state.



Apr‐18‐12
       Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   52

Main
References

•    S.
Bethard,
H.
Yu,
A.
Thornton,
V.
Hatzivassiloglou,
and
D.
Jurafsky,
2004.
Automa<c
extrac<on
of
opinion
proposi<ons
and

     their
holders.
in
Proceedings
of
the
AAAI
Spring
Symposium
on
Exploring
A%tude
and
Affect
in
Text.

•    Chesley,
P.;
Vincent,
B.;
Xu,
L.
and
Srihari
R.,
2006.
Using
verbs
and
adjec<ves
to
automa<cally
classify
blog
sen<ment.
in

     AAAI
Symposium
on
Computa<onal
Approaches
to
Analysing
Weblogs
(AAAI‐CAAW),
27–29.


•    Ding,
X.,
Liu,
B.,
and
Yu,
P.
S.,
2008.
A
holis<c
lexicon‐based
approach
to
opinion
mining.
Proceedings
of
the
Conference
on

     Web
Search
and
Web
Data
Mining
(WSDM).

•    M.
Hu
and
B.
Liu,
2004.
Mining
opinion
features
in
customer
reviews.
In
Proceedings
of
AAAI,
pp.
755–760.

•    S.‐M.
Kim
and
E.
Hovy,
2004.
Determining
the
sen<ment
of
opinions.
In
Proceedings
of
the
Interna.onal
Conference
on

     Computa.onal
Linguis.cs
(COLING),
2004.

•    Liu,
Bing,
2010.
Sen<ment
Analysis
and
Subjec<vity.
In
Handbook
of
Natural
Language
Processing,
Second
Edi<on,
Eds:
N.

     Indurkhya
and
F.
J.
Damerau),
CRC
Press,
Taylor
and
Francis
Group,
Boca
Raton,
FL.
Chapter
28.



•    Mar<n,
J.R.
and
White,
P.
R.
R.,
2005.
The
Language
of
Evalua<on,
Appraisal
in
English,
Palgrave
Macmillan,
London
&

     New
York.

•    Taboada,
M.,
Brooke,
J.,
Tofiloski,
M.,
Voll,
K.D.,
Stede,
M.,
2011.
Lexicon‐based
methods
for
sen<ment
analysis.

     Computa<onal
Linguis<cs
37(2),
267–307.

•    Tang,
H.,
Tan,
S.,
Cheng,
X.,
2009.
A
survey
on
sen<ment
detec<on
of
reviews.
Expert
Systems
with
Applica<ons
36(7),

     10760
–
10773.

•    Whitelaw,
C.;
Garg,
N.
and
Argamon,
S.,
2005.
Using
appraisal
groups
for
sen<ment
analysis.
In
Proceedings
of
the
14th

     ACM
interna<onal
conference
on
Informa<on
and
knowledge
management
(CIKM
'05).
ACM,
New
York,
NY,
USA,
625‐631.

•    Wilson,
T.,
2008.
Fine‐Grained
Subjec<vity
Analysis.
PhD
Disserta<on,
Intelligent
Systems
Program,
University
of

     Pi/sburgh.

•    Wilson,
T.,
Wiebe,
J.,
Hoffmann,
P.,
2009.
Recognizing
contextual
polarity:
An
explora<on
of
features
for
phrase‐level

     sen<ment
analysis.
Computa<onal
Linguis<cs
35,
399–433.

•    Y.
Wu,
F.
Wei,
S.
Liu,
N.
Au,
W.
Cui,
H.
Zhou,
and
H.
Qu,
2010.
OpinionSeer:
Interac<ve
Visualisa<on
of
Hotel
Customer

     Feedback.
IEEE
Transac<ons
on
Visualiza<on
and
Computer
Graphics,
6,
1109‐1118.
Nov‐Dec.



Apr‐18‐12
                               Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
                                      53

Open‐source
sen<ment‐analysis
tools

•  Python
NLTK
(Natural
Language
Toolkit)

–  h/p://www.nltk.org
and
h/p://text‐processing.com/demo/sen<ment



•  R,
TM
(text
mining)
module
h/p://cran.r‐project.org/web/packages/tm/index.html



•  RapidMiner
h/p://rapid‐i.com/content/view/184/196/



•  GATE,
the
General
Architecture
for
Text
Engineering
h/p://gate.ac.uk/sen<ment



•  UIMA‐plug‐in
annotators
for
sen<ment
—
Apache
UIMA
is
the

   Unstructured
Informa<on
Management
Architecture,
h/p://uima.apache.org/




•  SenJment
classifiers
for
the
WEKA
data‐mining
workbench,

  h/p://www.cs.waikato.ac.nz/ml/weka/.




•  Stanford
NLP
tools
‐
h/p://www‐nlp.stanford.edu/soware
maximum‐entropy

   classifica<on
approach
for
sen<ment.

Apr‐18‐12
                    Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
           54

Thank
you
very
much
for
your

              a/en<on!!


              Ques<ons



Apr‐18‐12
    Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
   55


Weitere ähnliche Inhalte

Andere mochten auch

Database management system chapter15
Database management system chapter15Database management system chapter15
Database management system chapter15Md. Mahedi Mahfuj
 
Database management system chapter12
Database management system chapter12Database management system chapter12
Database management system chapter12Md. Mahedi Mahfuj
 
Tutorial on Parallel Computing and Message Passing Model - C5
Tutorial on Parallel Computing and Message Passing Model - C5Tutorial on Parallel Computing and Message Passing Model - C5
Tutorial on Parallel Computing and Message Passing Model - C5Marcirio Chaves
 
Collective Communications in MPI
 Collective Communications in MPI Collective Communications in MPI
Collective Communications in MPIHanif Durad
 
Tutorial on Parallel Computing and Message Passing Model - C4
Tutorial on Parallel Computing and Message Passing Model - C4Tutorial on Parallel Computing and Message Passing Model - C4
Tutorial on Parallel Computing and Message Passing Model - C4Marcirio Chaves
 
Tutorial on Parallel Computing and Message Passing Model - C2
Tutorial on Parallel Computing and Message Passing Model - C2Tutorial on Parallel Computing and Message Passing Model - C2
Tutorial on Parallel Computing and Message Passing Model - C2Marcirio Chaves
 
Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...
Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...
Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...Marcirio Chaves
 
Point-to-Point Communicationsin MPI
Point-to-Point Communicationsin MPIPoint-to-Point Communicationsin MPI
Point-to-Point Communicationsin MPIHanif Durad
 
Strategies in job search process
Strategies in job search processStrategies in job search process
Strategies in job search processMd. Mahedi Mahfuj
 
WEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENT
WEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENTWEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENT
WEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENTMarcirio Chaves
 

Andere mochten auch (20)

Matrix multiplication graph
Matrix multiplication graphMatrix multiplication graph
Matrix multiplication graph
 
Parallel computing(1)
Parallel computing(1)Parallel computing(1)
Parallel computing(1)
 
Observer pattern
Observer patternObserver pattern
Observer pattern
 
Job search_resume
Job search_resumeJob search_resume
Job search_resume
 
Database management system chapter15
Database management system chapter15Database management system chapter15
Database management system chapter15
 
Database management system chapter12
Database management system chapter12Database management system chapter12
Database management system chapter12
 
Job search_interview
Job search_interviewJob search_interview
Job search_interview
 
Tutorial on Parallel Computing and Message Passing Model - C5
Tutorial on Parallel Computing and Message Passing Model - C5Tutorial on Parallel Computing and Message Passing Model - C5
Tutorial on Parallel Computing and Message Passing Model - C5
 
Collective Communications in MPI
 Collective Communications in MPI Collective Communications in MPI
Collective Communications in MPI
 
Tutorial on Parallel Computing and Message Passing Model - C4
Tutorial on Parallel Computing and Message Passing Model - C4Tutorial on Parallel Computing and Message Passing Model - C4
Tutorial on Parallel Computing and Message Passing Model - C4
 
Tutorial on Parallel Computing and Message Passing Model - C2
Tutorial on Parallel Computing and Message Passing Model - C2Tutorial on Parallel Computing and Message Passing Model - C2
Tutorial on Parallel Computing and Message Passing Model - C2
 
Chapter 1 pc
Chapter 1 pcChapter 1 pc
Chapter 1 pc
 
Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...
Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...
Lessons Learned Model for Projects Supported by Web 2.0 Tools: a Mixed Method...
 
Lecture11
Lecture11Lecture11
Lecture11
 
Point-to-Point Communicationsin MPI
Point-to-Point Communicationsin MPIPoint-to-Point Communicationsin MPI
Point-to-Point Communicationsin MPI
 
Lecture12
Lecture12Lecture12
Lecture12
 
Strategies in job search process
Strategies in job search processStrategies in job search process
Strategies in job search process
 
WEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENT
WEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENTWEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENT
WEB 2.0 TECHNOLOGIES TO SUPPORT LESSONS LEARNED IN PROJECT MANAGEMENT
 
Chapter 5 pc
Chapter 5 pcChapter 5 pc
Chapter 5 pc
 
Parallel computing(2)
Parallel computing(2)Parallel computing(2)
Parallel computing(2)
 

Ähnlich wie A Fine-Grained Analysis of User-Generated Content to Support Decision Making

Is The Sky Falling? Segmented Risk Identification Questions
Is The Sky Falling? Segmented Risk Identification QuestionsIs The Sky Falling? Segmented Risk Identification Questions
Is The Sky Falling? Segmented Risk Identification QuestionsChristopher Gentry, PMP, ITILv3
 
Charles Rygula: Value Beyond Words
Charles Rygula: Value Beyond WordsCharles Rygula: Value Beyond Words
Charles Rygula: Value Beyond WordsJack Molisani
 
Review on Opinion Targets and Opinion Words Extraction Techniques from Online...
Review on Opinion Targets and Opinion Words Extraction Techniques from Online...Review on Opinion Targets and Opinion Words Extraction Techniques from Online...
Review on Opinion Targets and Opinion Words Extraction Techniques from Online...IRJET Journal
 
CUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective CallCUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective CallSmart Chicago Collaborative
 
Bazley understanding online audiences vsg conf march 2016 for uploading
Bazley understanding online audiences vsg conf march 2016 for uploadingBazley understanding online audiences vsg conf march 2016 for uploading
Bazley understanding online audiences vsg conf march 2016 for uploadingMartin Bazley
 
Data-Driven Design for User Experience
Data-Driven Design for User Experience Data-Driven Design for User Experience
Data-Driven Design for User Experience Emi Kwon
 
Giving Personas a Life of Their Own
Giving Personas a Life of Their OwnGiving Personas a Life of Their Own
Giving Personas a Life of Their OwnUXPA International
 
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...John Makridis
 
How to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical LeadHow to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical LeadProduct School
 
Human Factors at the Grid Edge
Human Factors at the Grid EdgeHuman Factors at the Grid Edge
Human Factors at the Grid EdgeCori Faklaris
 
Knowledge Extraction from Social Media
Knowledge Extraction from Social MediaKnowledge Extraction from Social Media
Knowledge Extraction from Social MediaSeth Grimes
 
Sgd research-damr-19-may-2018
Sgd research-damr-19-may-2018Sgd research-damr-19-may-2018
Sgd research-damr-19-may-2018Sanjeev Deshmukh
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonSocietyConsulting
 
Clinical Trial Performance Metrics Conference Dec 2016
Clinical Trial Performance Metrics Conference Dec 2016Clinical Trial Performance Metrics Conference Dec 2016
Clinical Trial Performance Metrics Conference Dec 2016Mike Fitzpatrick
 
Crafting Infrastructures
Crafting InfrastructuresCrafting Infrastructures
Crafting InfrastructuresLuca Galli
 
Requirements and Drupal: Planning for Successful Projects
Requirements and Drupal: Planning for Successful ProjectsRequirements and Drupal: Planning for Successful Projects
Requirements and Drupal: Planning for Successful ProjectsNavigationArts
 
Community Capability Model Framework Checklist Tool - Demo & Review
Community Capability Model Framework Checklist Tool - Demo & ReviewCommunity Capability Model Framework Checklist Tool - Demo & Review
Community Capability Model Framework Checklist Tool - Demo & ReviewManjulaPatel
 
The State of Digital Identity Wallets – Darrell O’Donnell
The State of Digital Identity Wallets – Darrell O’DonnellThe State of Digital Identity Wallets – Darrell O’Donnell
The State of Digital Identity Wallets – Darrell O’DonnellSSIMeetup
 
Session 3 qualitative market reserach for entrepreneurs market
Session 3   qualitative market reserach for entrepreneurs marketSession 3   qualitative market reserach for entrepreneurs market
Session 3 qualitative market reserach for entrepreneurs marketAngela Ferrara
 

Ähnlich wie A Fine-Grained Analysis of User-Generated Content to Support Decision Making (20)

Is The Sky Falling? Segmented Risk Identification Questions
Is The Sky Falling? Segmented Risk Identification QuestionsIs The Sky Falling? Segmented Risk Identification Questions
Is The Sky Falling? Segmented Risk Identification Questions
 
Charles Rygula: Value Beyond Words
Charles Rygula: Value Beyond WordsCharles Rygula: Value Beyond Words
Charles Rygula: Value Beyond Words
 
Review on Opinion Targets and Opinion Words Extraction Techniques from Online...
Review on Opinion Targets and Opinion Words Extraction Techniques from Online...Review on Opinion Targets and Opinion Words Extraction Techniques from Online...
Review on Opinion Targets and Opinion Words Extraction Techniques from Online...
 
CUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective CallCUTGroup Detroit Slides for CUTGroup Collective Call
CUTGroup Detroit Slides for CUTGroup Collective Call
 
Bazley understanding online audiences vsg conf march 2016 for uploading
Bazley understanding online audiences vsg conf march 2016 for uploadingBazley understanding online audiences vsg conf march 2016 for uploading
Bazley understanding online audiences vsg conf march 2016 for uploading
 
Data-Driven Design for User Experience
Data-Driven Design for User Experience Data-Driven Design for User Experience
Data-Driven Design for User Experience
 
Giving Personas a Life of Their Own
Giving Personas a Life of Their OwnGiving Personas a Life of Their Own
Giving Personas a Life of Their Own
 
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
Big Data Analytics and Knowledge Discovery through Location-Based Social Netw...
 
How to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical LeadHow to Deliver Data Insights by Fmr Google Sr Analytical Lead
How to Deliver Data Insights by Fmr Google Sr Analytical Lead
 
Human Factors at the Grid Edge
Human Factors at the Grid EdgeHuman Factors at the Grid Edge
Human Factors at the Grid Edge
 
Beyond User Research
Beyond User ResearchBeyond User Research
Beyond User Research
 
Knowledge Extraction from Social Media
Knowledge Extraction from Social MediaKnowledge Extraction from Social Media
Knowledge Extraction from Social Media
 
Sgd research-damr-19-may-2018
Sgd research-damr-19-may-2018Sgd research-damr-19-may-2018
Sgd research-damr-19-may-2018
 
Big Data Meetup by Chad Richeson
Big Data Meetup by Chad RichesonBig Data Meetup by Chad Richeson
Big Data Meetup by Chad Richeson
 
Clinical Trial Performance Metrics Conference Dec 2016
Clinical Trial Performance Metrics Conference Dec 2016Clinical Trial Performance Metrics Conference Dec 2016
Clinical Trial Performance Metrics Conference Dec 2016
 
Crafting Infrastructures
Crafting InfrastructuresCrafting Infrastructures
Crafting Infrastructures
 
Requirements and Drupal: Planning for Successful Projects
Requirements and Drupal: Planning for Successful ProjectsRequirements and Drupal: Planning for Successful Projects
Requirements and Drupal: Planning for Successful Projects
 
Community Capability Model Framework Checklist Tool - Demo & Review
Community Capability Model Framework Checklist Tool - Demo & ReviewCommunity Capability Model Framework Checklist Tool - Demo & Review
Community Capability Model Framework Checklist Tool - Demo & Review
 
The State of Digital Identity Wallets – Darrell O’Donnell
The State of Digital Identity Wallets – Darrell O’DonnellThe State of Digital Identity Wallets – Darrell O’Donnell
The State of Digital Identity Wallets – Darrell O’Donnell
 
Session 3 qualitative market reserach for entrepreneurs market
Session 3   qualitative market reserach for entrepreneurs marketSession 3   qualitative market reserach for entrepreneurs market
Session 3 qualitative market reserach for entrepreneurs market
 

Mehr von Marcirio Chaves

A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...
A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...
A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...Marcirio Chaves
 
Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...
Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...
Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...Marcirio Chaves
 
Tutorial on Parallel Computing and Message Passing Model - C3
Tutorial on Parallel Computing and Message Passing Model - C3Tutorial on Parallel Computing and Message Passing Model - C3
Tutorial on Parallel Computing and Message Passing Model - C3Marcirio Chaves
 
Tutorial on Parallel Computing and Message Passing Model - C1
Tutorial on Parallel Computing and Message Passing Model - C1Tutorial on Parallel Computing and Message Passing Model - C1
Tutorial on Parallel Computing and Message Passing Model - C1Marcirio Chaves
 
Simpósio Brasileiro de Banco de Dados 2005
Simpósio Brasileiro de Banco de Dados 2005Simpósio Brasileiro de Banco de Dados 2005
Simpósio Brasileiro de Banco de Dados 2005Marcirio Chaves
 
defesa dissertação mestrado
defesa dissertação mestradodefesa dissertação mestrado
defesa dissertação mestradoMarcirio Chaves
 

Mehr von Marcirio Chaves (6)

A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...
A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...
A Look at Risks in IT Projects: A Case Study during the Merger Period in the ...
 
Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...
Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...
Rethinking Lessons Learned in the PMBoK Process Groups: A Model based on Peop...
 
Tutorial on Parallel Computing and Message Passing Model - C3
Tutorial on Parallel Computing and Message Passing Model - C3Tutorial on Parallel Computing and Message Passing Model - C3
Tutorial on Parallel Computing and Message Passing Model - C3
 
Tutorial on Parallel Computing and Message Passing Model - C1
Tutorial on Parallel Computing and Message Passing Model - C1Tutorial on Parallel Computing and Message Passing Model - C1
Tutorial on Parallel Computing and Message Passing Model - C1
 
Simpósio Brasileiro de Banco de Dados 2005
Simpósio Brasileiro de Banco de Dados 2005Simpósio Brasileiro de Banco de Dados 2005
Simpósio Brasileiro de Banco de Dados 2005
 
defesa dissertação mestrado
defesa dissertação mestradodefesa dissertação mestrado
defesa dissertação mestrado
 

Kürzlich hochgeladen

4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Food processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsFood processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsManeerUddin
 

Kürzlich hochgeladen (20)

4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Food processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsFood processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture hons
 

A Fine-Grained Analysis of User-Generated Content to Support Decision Making

  • 1. Workshop
 A
Fine‐Grained
Analysis
of
User‐Generated
 Content
to
Support
Decision
Making
 Marcirio
Silveira
Chaves
 h/p://mchaves.wikidot.com

 Informa<on
Systems
Research
Group
 Business
and
Informa<on
Technology
Research
Centre
(BITREC)
 Ins<tute
for
Scien<fic
and
Technological
Research
of
Universidade
Atlân<ca
(ISTR)

  • 2. User‐Generated
Content
(UGC)
 •  As
known
as
 •  Can
be
expressed
throught
 –  User‐Generated
Data
 –  Opinions
 –  User‐Created
Content
 –  Reviews
 –  User‐Contributed
Data
 –  Comments
 –  Consumer‐Generated
 –  Posts
 Media
 –  …
 • 
Notes:
 • 
All
the
examples
described
in
this
workshop
are
real
data.
 • 
Some
papers
men<oned
here
are
under
review.
 • 
Color
legend:
 • 
Examples
 • 
Posi<ve
feature
 • 
Nega<ve
feature
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 2

  • 3. Example
of
UGC
 •  An
opinion
posted
in
Facebook
 Dec‐10‐2011,
12:30
pm
 –  “would
highly
recommend
Infinity
Motorcycles,
 Southampton
for
all
motorbiking
gear.
Very
 reasonable
people.
Earlier
they
gave
me
a
full
 money
back
for
a
unused
(aer
explaining
why
it
 was
unused)
ladies
motorbike
jacket
(no
defects
 whasoever)
and
today
the
zipper
on
my
new
 jacket
was
broken
and
they
gave
me
a
brand
new
 one
(no
ques<ons
asked,
no
receipt
business
and
 no
fuss
created).
Five
Star
service.”
 –  This
user
had
226
friends.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 3

  • 4. Some
sta<s<cs
about
UGC
 •  More
than
50%
of
all
internet
visits
are
now
to
 UGC/social
media
sites. •  More
than
75%
of
<me
spent
on
the
internet
 is
"social”.
 •  Facebook
now
captures
as
much
<me
spent
 on
the
internet
as
Google,
Yahoo,
and
AOL.
 •  More
than
80%
of
consumers
are
influenced
 by
Social
MarkeJng.
 Source: http://www.bbrisco.com/2010/05/social.html Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 4

  • 5. Main
Objec<ves
of
this
Workshop
 •  In‐depth
analysis
of
UGC
 •  Use
UGC
to
support
decision
making
 •  Study
a
domain
ontology
to
support
Ar<ficial
 Intelligence
tasks
 •  Address
approaches
for
sen<ment
analysis
 •  From
theory
to
prac<ce:
Hands‐on
Session
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 5

  • 6. Outline
 Part
1
 Part
2
 •  Workshop
Context
 •  Sen<ment
Analysis/Opinion
 •  User‐Generated
Content
 Mining
 (UGC)
 •  Polarity
Recognizer
in
 •  Characterisa<on
of
UGC
 Portuguese
(PIRPO)
 •  Knowledge
Engineering
‐
 •  Informa<on
Visualisa<on
 Ontology
Development
 •  Hands‐on
Session
(Individual
 Task):
Dealing
with
UGC
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 6

  • 7. Context
Workshop
 A
framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web.
 
Chaves,
Marcirio
Silveira;
Trojahn,
Cássia
and
Pedron,
Cris<ane
Drebes. A
Framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web:
A
Hotel
Sector
Approach.
In:
 Customer
Rela<onship
Management
and
the
Social
and
Seman<c
Web:
Enabling
Cliens
Conexus.
Colomo‐Palacios,
 R.;
Varajão,
J.
and
Soto‐Acosta,
P.
(Eds.).
p.
141‐157,
Hershey,
PA:
IGI
Global,
2012.
ISBN:
978‐161‐35‐0044‐6
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 7

  • 8. An
Fine‐grained
Analysis
of
UGC
 •  Overall
opinion
about
a
topic
is
only
a
part
of
the
 informaJon
of
interest.

 •  Document‐level
senJment
classificaJon
fails
to
detect
 sen<ment
about
individual
aspects
of
the
topic.
In
 reality,
for
example,
though
one
could
be
generally
 happy
about
his
car,
he
might
be
dissaJsfied
by
the
 engine
noise.

 •  To
the
manufacturers,
these
individual
weaknesses
and
 strengths
are
equally
important
to
know,
or
even
more
 valuable
than
the
overall
sa<sfac<on
level
of
customers.
 (Tang
et
al.
2009)
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 8

  • 9. UGC
 An
opinion
is
simply
a
posiJve
or
negaJve
 senJment,
view,
aPtude,
emoJon,
or
 appraisal
about
an
enJty
or
an
aspect
of
the
 enJty
(Hu
and
Liu,
2004;
Liu,
2006)
from
an
 opinion
holder
(Bethard
et
al.,
2004;
Kim
and
 Hovy,
2004;
Wiebe
et
al.,
2005).
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 9

  • 10. 
Characterisa<on
of
UGC
 •  Opinion’s
Characterisa<on
 –  I
use
and
extend
the
defini<on
proposed
by
(Ding
 et
al.,
2008;
Liu,
2010;
Mar<n
and
White,
2005)
to
 analyse
the
sentences
of
reviews.

 –  Let
the
review
be
r.

 –  In
the
most
general
case,
r
is
characterised
as
a
set
 of
the
following
elements
{O,F,SO,H,S,A,R,I,SG},
 where:
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 10

  • 11. Characterisa<on
of
UGC
 •  Opinion’s
Characterisa<on
 –  O:
Object
 –  F:
Feature
 –  SO:
Seman<c‐Orienta<on
 –  H:
Holder

 –  S:
Source
 –  A:
A%tude
 –  SG:
Sugges.on

 –  R:
Recommenda.on

 –  I:
Inten.on
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 11

  • 12. Characterisa<on
of
UGC
 1  ‐
Object
(O)
 – 
An
object
is
a
product
(e.g.
movie
and
book)
or
a
 service
(e.g.
hotel
and
restaurant)
under
review
 which
is
composed
by
features.
 – 
Objects
are
also
called
enJJes.
 2
‐
Feature
(F)
 –  A
feature
is
a
component
or
part
of
an
object.

 •  actor
and
photography
are
features
on
a
movie.
 •  pool
and
staff
are
features
on
a
hotel.

 –  Features
are
also
called
aXributes
or
facets.

 –  A
feature
can
be
men<oned
explicitly
or
implicitly
 in
a
review
(Ding
et
al.
2008).
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 12

  • 13. Characterisa<on
of
UGC
 2.1
‐
Explicit
Feature
(F)
 –  If
a
feature
f
appears
in
review
r,
it
is
called
an
 explicit
feature
in
r.

 –  The
hotel
is
located
very
near
the
center
city.
 •  loca<on
is
an
explicit
feature.
 2.2
‐
Implicit
Feature
(F):

 –  If
f
does
not
appear
in
r
but
is
implied,
it
is
called
 an
implicit
feature
in
r.

 –  Hotel
is
far
from
public
transporta<on.
 •  loca<on
is
an
implicit
feature.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 13

  • 14. Characterisa<on
of
UGC
 3
‐
Sentence‐OrientaJon
(SO)
 –  A
review
consists
of
a
sequence
of
sentences
 r=〈
s1,
s2,
…,
sm〉(Ding
et
al.,
2008).

 –  A
sentence
can
be
evaluated
as
the
following
 perspec<ves:
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 14

  • 15. Characterisa<on
of
UGC
 3.1
ObjecJvity

 –  An
objec<ve
sentence
contains
or
menJon
facts.
 •  This
hotel
is
far
from
the
airport,
ca.
15km.
 –  A
subjec<ve
sentence
does
not
menJon
any
fact.
 •  The
parking
could
be
free.
 3.2
Polarity
 –  It
describes
the
orientaJon
present
in
a
sentence
 (i.e.
posiJve,
negaJve,
neutral
and
irrelevant).
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 15

  • 16. Characterisa<on
of
UGC
 3.3
Intensity
(strength
of
the
polarity)
 –  It
refers
to
the
strength
of
the
private
state
that
is
 being
expressed,
in
other
words,
how
strong
is
an
 emo<on
or
a
convic<on
of
belief
(Wilson,
2008).
 –  It
describes
how
intense
it
was
the
experience
using
 a
product
or
service:
 •  very
posiJve,
posiJve,
neutral,
negaJve
and
very
 negaJve.

 •  Very
kindly
staff.
refers
to
a
very
posi<ve
impression
on
 the
staff
service.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 16

  • 17. Characterisa<on
of
UGC
 4
‐
Opinion
Holder
(H)
 –  The
holder
of
a
par<cular
opinion
is
the
person
or
the
 organisaJon
that
holds
the
opinion
(Ding
et
al.,
2008).

 –  A
holder
is
iden<fied
with
demographic
characterisJcs
 (e.g.
name,
city
and
country).

 –  Sites
such
as
tripadvisor.com
and
booking.com
classify
 holders
as
types
including:
 •  families
with
older
children
 •  families
with
young
children

 •  mature
couples
 •  groups
of
friends

 •  solo
travellers

 •  young
couples
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 17

  • 18. Characterisa<on
of
UGC
 5
–
Source
 –  An
informa<on
source
is
a
web
site
which
provides
 a
set
of
reviews.

 •  tripadvisor.com

 •  booking.com
 •  amazon.com

 •  A:
A%tude
 •  SG:
Sugges.on

 •  R:
Recommenda.on

 •  I:
Inten.on
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 18

  • 19. Outline
 Part
1
 Part
2
 •  Workshop
Context
 •  Sen<ment
Analysis/Opinion
 •  User‐Generated
Content
 Mining
 (UGC)
 •  Polarity
Recognizer
in
 •  Characterisa<on
of
UGC
 Portuguese
(PIRPO)
 •  Knowledge
Engineering
‐
 •  Informa<on
Visualisa<on
 Ontology
Development
 •  Hands‐on
Session
(Individual
 Task):
Dealing
with
UGC
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 19

  • 20. Limita<ons
for
represen<ng
knowledge
in
the
 accommoda<on
sector
 language?
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 20

  • 21. More
limita<ons
 •  Actually,
web
agents
are
unable
to
answer
 ques<ons
such
as:
 –  What
are
the
hotels
with
longer
indoor
swimming
 pool
Jme
table
in
Roma?
 –  What
are
the
hotels
with
the
cheapest
breakfast
 in
Lisbon?
 –  What
are
the
cheapest
hotels
with
family
suite
 room
with
sea
view
in
Barcelona?

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 21

  • 22. Knowledge
Engineering
 •  Ontology
as
a
support
to
evaluate
UGC
 –  Set
of
concepts
to
a
specific
domain
 –  Human
and
machine
readable
 –  Support
to
fine‐grained
analysis
of
the
instances
 (e.g.
reviews)
 –  Hontology
(H
stands
for
hotel,
hostal
and
hostel)

 •  A
robust,
coherent
and
mul<lingual
representa<on
of
 the
accommoda<on
sector.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 22

  • 23. Context
Workshop
 A
framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web.
 
Chaves,
Marcirio
Silveira;
Trojahn,
Cássia
and
Pedron,
Cris<ane
Drebes. A
Framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web:
A
Hotel
Sector
Approach.
In:
 Customer
Rela<onship
Management
and
the
Social
and
Seman<c
Web:
Enabling
Cliens
Conexus.
Colomo‐Palacios,
 R.;
Varajão,
J.
and
Soto‐Acosta,
P.
(Eds.).
p.
141‐157,
Hershey,
PA:
IGI
Global,
2012.
ISBN:
978‐161‐35‐0044‐6
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 23

  • 24. Knowledge
Engineering
 •  Development
Methodology

 –  Iden<fy
exis<ng
ontologies
on
related
domains
 –  Select
the
main
concepts
and
proper<es
 –  Organize
concepts
and
proper<es
hierarchically
into
categories
 –  Translate
the
ontology
(manual)
 –  Expand
concepts
and
proper<es
based
on
comments
 –  Translate
the
new
concepts
and
proper<es
(manual)
 –  Generate
the
ontology
in
several
formats
 Chaves,
M.
S.
and
Trojahn,
C.
Towards
a
MulJlingual
Ontology
for
Ontology‐driven
Content
Mining
in
Social
Web
Sites.
 Proc.
of
the
ISWC
2010
Workshops,
Volume
I,
1st
InternaJonal
Workshop
on
Cross‐Cultural
and
Cross‐Lingual
Aspects
 of
the
SemanJc
Web.
Shanghai,
China,
November
7th,
2010.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 24

  • 25. Knowledge
Engineering
 •  Hontology
 –  A
mulJlingual
ontology
for
the
accommodaJon
 sector.

 •  Demo
Protégé
 
Chaves,
M.
S.;
Freitas,
L.
A.
and
Vieira,
R.
(2012).
Hontology:
A
mulJlingual
ontology
for
the
 accommodaJon
sector.
4th
InternaJonal
Conference
on
Knowledge
Engineering
and
Ontology
 Development,
Barcelona,
Spain,
4‐7
October.
(SubmiXed)
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 25

  • 26. Knowledge
Engineering
 Metrics
 Value
 Number
of
Concepts
 285
 Number
of
Object
Proper<es
 10
 Number
of
Data
Proper<es
 31
 Concept
Axioms
 Preliminary
 Subconcept
axioms
 270
 Hontology
 Equivalent
concepts
axioms
 4
 Disjoint
concepts
axioms
 93
 Sta<s<cs

 Object
Property
Axioms
 Func<onal
object
property
axioms
 6
 Object
property
domain
axioms
 11
 Object
property
range
axioms
 8
 Data
Property
Axioms
 Func<onal
data
property
axioms
 12
 Object
data
domain
axioms
 17
 Object
data
range
axioms
 1
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 26

  • 27. Hands‐on
Session
 •  The
aim
of
this
hands‐on
session
is
to
allow
you
thinking
 in‐depth
about
UGC
on
the
context
of
the
accommoda<on
 sector.
 •  You
are
going
to
receive
a
set
of
4
or
5
reviews
about
 accommoda<ons
and
should
evaluate
each
one
according
 to
the
following
parameters:
 –  Features
present
in
the
review
(see
the
concepts
of
 Hontology)
 –  Intensity
(Strength
of
the
Polarity):
(very
nega<ve,
 nega<ve,
neutral,
posi<ve,
very
posi<ve)
 •  Notes:
 –  Evaluate
one
feature
per
line.
 –  Please,
save
your
sheet
in
another
file
and
send
to
 mschaves@gmail.com.
Subject:
UB:GX
 –  X
=
number
of
the
group.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 27

  • 28. Outline
 Part
1
 Part
2
 •  Workshop
Context
 •  Sen<ment
Analysis/Opinion
 •  User‐Generated
Content
 Mining
 (UGC)
 •  Polarity
Recognizer
in
 •  Characterisa<on
of
UGC
 Portuguese
(PIRPO)
 •  Knowledge
Engineering
‐
 •  Informa<on
Visualisa<on
 Ontology
Development
 •  Hands‐on
Session
(Individual
 Task):
Dealing
with
UGC
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 28

  • 29. Sen<ment
Analysis

 •  Analysis
and
automaJc
extracJon
of
SemanJc
 OrientaJon
 •  SemanJc
orientaJon
refers
to
the
polarity
and
 strength
of
words,
phrases,
or
texts.
 •  Approaches
 –  Lexicon‐based
 •  Dic<onaries
of
words
annotated
with
the
word´s
seman<c
 orienta<on,
or
polarity.

 •  A
manually
built
dicJonary
provides
a
solid
foundaJon
for
a
 lexicon‐based
approach
(Taboada
et.
al.,
2011).

 –  StaJsJcal
or
Machine‐learning
 •  Supervised
classifica<on
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 29

  • 30. Sen<ment
Analysis

 •  Lexicon‐based
Approach
 –  Sen<ment‐bearing
words:
a
list
of
nouns,
verbs,
 adjecJves
and
adverbs
(Chesley
et
al.,
2006)

 •  use
verbs
and
adjec<ves
to
classify
English
 opinionated
blog
texts.

 –  List
of
conjuncJons
and
connecJves
(Liu,
2010).

 –  Use
of
auxiliary
verbs
to
get
features
and
opinion‐ oriented
words
about
products
from
texts
(Khan
et
 al.,
2010).

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 30

  • 31. Sen<ment
Analysis

 •  Seed
words

 –  are
a
small
set
of
words
with
strong
negaJve
or
 posiJve
associa<ons,
such
as
excellent
or
abysmal.

 –  In
principle,
a
posi<ve
adjec<ve
should
occur
more
 frequently
alongside
the
posi<ve
seed
words,
and
 thus
will
obtain
a
posi<ve
score,
whereas
nega<ve
 adjec<ves
will
occur
most
oen
in
the
vicinity
of
 nega<ve
seed
words,
thus
obtaining
a
nega<ve
 score
(Taboada
et.
al.
2011).

 •  This
restaurant
has
a
bad
and
expensive
food.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 31

  • 32. Sen<ment
Analysis

 •  Part‐of‐Speech
(PoS)
 –  In
order
to
evaluate
a
sentence
in
a
review,
we
 should
consider
the
parts‐of‐speech
men<oned
 such
as
adjecJves,
adverbs
and
verbs.

 –  Adjec<ves
are
classified
as:
 •  posi<ve
(good,
excellent
and
clean),

 •  nega<ve
(awful,
boring
and
terrible),

 •  neutral
(regular
and
indifferent)
and

 •  dual,
which
can
express
posi<ve
and
nega<ve
opinion
 (small,
long).

 –  In
some
approaches
nouns
are
represented
by
 concepts
of
a
domain
ontology
and
mapped
as
 features.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 32

  • 33. Sen<ment
Analysis

 •  ConjuncJon
and
ConnecJve
(CC)
 –  Connec<ves
are
words
that
help
iden<fying
 addiJonal
adjecJve
opinion
words
and
their
 orientaJons.

 –  One
of
the
constraints
is
about
conjunc<on
(i.e.
and),
 which
says
that
conjoined
adjec<ves
usually
have
the
 same
orienta<on
(Liu,
2010).

 •  This
room
is
beau<ful
and
spacious.
 –  if
beau<ful
is
known
to
be
posi<ve,
it
can
be
inferred
that
spacious
 is
also
posi<ve.

 –  HeurisJc:
 •  People
usually
express
the
same
opinion
on
both
sides
of
 a
conjuncJon.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 33

  • 34. Sen<ment
Analysis
 •  ConjuncJon
and
ConnecJve
(CC)
 –  Rules
or
constraints
are
also
designed
for
other
 connec<ves
(e.g.
or,
but,
either‐or,
and
neither‐nor).

 •  This
hotel
is
beau<ful
but
difficult
to
get
there.
 –  The
occurrence
aer
the
connec<ve
but
is
an
indicator
of
a
 nega<ve
opinion.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 34

  • 35. Sen<ment
Analysis
 •  Strength
of
the
PolaJry
or
Intensity
or
 IntensificaJon
 –  Amplifiers
(very,
a
lot)
increase
the
seman<c
intensity
 of
a
neighboring
lexical
item;
 –  AXenuators/Downtoners
(a
li/le,
slightly)
decrease
it.

 •  Some
approaches
have
implemented
intensifiers
 using
simple
addiJon
and
subtracJon
 –  if
a
posi<ve
adjec<ve
has
an
SO
value
of
2:
 •  an
amplified
adjec<ve
would
have
an
SO
value
of
3,
and

 •  a
downtoned
adjec<ve
an
SO
value
of
1.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 35

  • 36. Sen<ment
Analysis
 •  NegaJon
 –  The
obvious
approach
to
nega<on
is
simply
to
 reverse
the
polarity
of
the
lexical
item
next
to
a
 negator,
changing
good
(+3)
into
not
good
(−3).
 –  Not,
none,
nobody,
never,
and
nothing,
and
other
 words,
such
as
without
or
lack.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 36

  • 37. Polarity
Recognizer
in
Portuguese
(PIRPO)

 •  Polarity
Recognizer
in
Portuguese
to
classify
senJment
in
 online
reviews.

 •  PIRPO
was
built
from
the
ground
to
Portuguese
for
 recognising
the
polarity
of
the
user
opinion
on
 accommoda<on
reviews.

 •  Each
review
is
analysed
according
to
concepts
from
a
 domain
ontology.

 •  We
decompose
the
review
in
sentences
in
order
to
assign
a
 polarity
to
each
concept
of
the
ontology
in
the
sentence.
 Chaves,
M.
S.,
Freitas,
L.,
Souza,
M.
and
Vieira,
R.
PIRPO:
An
Algorithm
to
deal
with
Polarity
in
Portuguese
Online
 Reviews
from
the
AccommodaJon
Sector.
17th
InternaJonal
conference
on
ApplicaJons
of
Natural
Language
 Processing
to
InformaJon
Systems
(NLDB),
Groningen,
The
Netherlands,
26‐28
June
2012.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 37

  • 38. PIRPO
Informa<on
Architecture
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 38

  • 39. PIRPO
 •  Reviews
 –  Full
dataset:
1500
reviews
from
January
2010
to
 April
2011
in
Portuguese,
English
and
Spanish,
 from
which
180
in
Portuguese.

 •  Ontology
Concepts
 –  The
concepts
used
to
classify
the
reviews
are
 provided
by
Hontology,
which
in
its
current
 version,
has
110
concepts.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 39

  • 40. PIRPO
 •  List
of
adjecJves:
It
is
composed
by
sen<ment‐ bearing
words.

 –  This
list
of
polar
adjecJves
in
Portuguese

 •  contains
30.322
entries.

 •  is
composed
by
the
name
of
the
adjecJve
and
a
polarity
 which
can
assign
one
of
three
values:
+1,
‐1
and
0.

 •  These
values
corresponding
to
the
posiJve,
negaJve
and
 neutral
senses
of
the
adjec<ve.

 –  PIRPO
uses
this
list
to
calculate
the
semanJc
 orientaJon
of
the
concepts
found
in
the
sentences.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 40

  • 41. PIRPO
Algorithm
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 41

  • 42. PIRPO
Measure
Evalua<on
 •  Precision
 {relevantConcepts} ∩ {retrievedConcepts} P= {retrievedConcepts} •  Recall
 {relevantConcepts} ∩ {retrievedConcepts} € R= {relevantConcepts} •  F‐score
(harmonic
mean
of
precision
and
recall)
 € P×R F =2× P+R Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 42

  • 43. PIRPO
Preliminary
Results
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 43

  • 44. PIRPO:
Discussion
on
the
Results
 •  PIRPO
reached
a
be/er
 recall
for
concepts
with
 posi<ve
polarity,
while
 mixed
polarity
had
a
 higher
precision.

 •  The
low
F‐score
can
be
 mainly
due
to
the
 algorithm
has
assigned
 a
polarity
to
a
specific
 concept
of
the
 ontology,
while
the
 human
classified
the
 review
as
a
whole.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 44

  • 45. Outline
 Part
1
 Part
2
 •  Workshop
Context
 •  Knowledge
Engineering
‐
 •  User‐Generated
Content
 Modelling
UGC
 (UGC)
 •  Sen<ment
Analysis/Opinion
 •  Characterisa<on
of
UGC
 Mining
 •  Knowledge
Engineering
‐
 •  Polarity
Recognizer
in
 Ontology
Development
 Portuguese
(PIRPO)
 •  Hands‐on
Session
(Individual
 •  Informa<on
Visualisa<on
 Task):
Dealing
with
UGC
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 45

  • 46. Context
Workshop
 A
framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web.
 
Chaves,
Marcirio
Silveira;
Trojahn,
Cássia
and
Pedron,
Cris<ane
Drebes. A
Framework
for
Customer
Knowledge
Management
based
on
Social
Seman<c
Web:
A
Hotel
Sector
Approach.
In:
 Customer
Rela<onship
Management
and
the
Social
and
Seman<c
Web:
Enabling
Cliens
Conexus.
Colomo‐Palacios,
 R.;
Varajão,
J.
and
Soto‐Acosta,
P.
(Eds.).
p.
141‐157,
Hershey,
PA:
IGI
Global,
2012.
ISBN:
978‐161‐35‐0044‐6
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 46

  • 47. Informa<on
Visualisa<on

 •  What
is
the
visual
model
of
the
poten<al
end‐user?
 •  How
should
we
properly
map
and
render:
 –  the
most
valued
accommoda<on
features?
 –  the
percep<on
of
the
quality
offered
by
the
hotel?
 –  the
correla<on
between
the
guest’s
profile
and
the
 mostly
relevant
features?
 –  the
intensity
of
the
posi<vity
or
nega<vity
of
the
 features?


 •  Does
the
use
of
advanced
visual
techniques
(such
as
 tree
oriented)
to
map
the
results
will
help
the
 accommoda<on
managers
and
guests
to
have
a
 be/er
insight
of
the
data?
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 47

  • 48. Exploring
Informa<on
Visualisa<on

 •  In
the
next
figures
 –  The
color
was
used
to
map
the
polarity
and
the
 strength
of
the
polarity
values
on
the
CO.

 –  The
size
was
used
to
map
the
frequency
that
the
 CO
is
men<oned
in
the
reviews.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 48

  • 49. Exploring
Informa<on
Visualisa<on

 Result
of
the
applica<on
of
Bubble
Tree
visualisaJon
of
the
 rela<on
among
concepts
of
the
ontology,
polarity
(le)
and
 strength
of
the
polarity
(right).
 •  Carvalho,
E.;
Chaves,
M.
S.,
2012.
Exploring
User‐Generated
Data
VisualizaJon
in
the
AccommodaJon
 Sector.
16th
InternaJonal
Conference
InformaJon
VisualisaJon,
IEEE.
(SubmiXed)
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 49

  • 50. Exploring
Informa<on
Visualisa<on

 Results
using
Treemap
visualisaJon
of
the
rela<on
among
type
of
 customer,
concepts
of
the
ontology
and
polarity.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 50

  • 51. Ques<onnaire
(in
Spanish)
 •  You
are
going
to
receive
a
ques<onnaire
about
 informa<on
visualisa<on
using
UGC
in
the
context
of
 the
accommoda<on
sector.
 •  Please,
click
here
h/p://kwiksurveys.com?u=Infovises
 to
answer
it.

 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 51

  • 52. Final
Remarks
 •  In‐depth
analysis
of
UGC
can
be
used
as
input
 to
improve
decision
making.
 •  It
is
<me
to
think
about
new
models
to
store
 UGC
data.
 •  It
is
necessary
the
building
from
the
ground
of
 new
algorithms
to
deal
with
UGC
for
 languages
other
than
English.
 •  InformaJon
visualisaJon
of
UGC
is
in
its
 infancy
state.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 52

  • 53. Main
References
 •  S.
Bethard,
H.
Yu,
A.
Thornton,
V.
Hatzivassiloglou,
and
D.
Jurafsky,
2004.
Automa<c
extrac<on
of
opinion
proposi<ons
and
 their
holders.
in
Proceedings
of
the
AAAI
Spring
Symposium
on
Exploring
A%tude
and
Affect
in
Text.
 •  Chesley,
P.;
Vincent,
B.;
Xu,
L.
and
Srihari
R.,
2006.
Using
verbs
and
adjec<ves
to
automa<cally
classify
blog
sen<ment.
in
 AAAI
Symposium
on
Computa<onal
Approaches
to
Analysing
Weblogs
(AAAI‐CAAW),
27–29.

 •  Ding,
X.,
Liu,
B.,
and
Yu,
P.
S.,
2008.
A
holis<c
lexicon‐based
approach
to
opinion
mining.
Proceedings
of
the
Conference
on
 Web
Search
and
Web
Data
Mining
(WSDM).
 •  M.
Hu
and
B.
Liu,
2004.
Mining
opinion
features
in
customer
reviews.
In
Proceedings
of
AAAI,
pp.
755–760.
 •  S.‐M.
Kim
and
E.
Hovy,
2004.
Determining
the
sen<ment
of
opinions.
In
Proceedings
of
the
Interna.onal
Conference
on
 Computa.onal
Linguis.cs
(COLING),
2004.
 •  Liu,
Bing,
2010.
Sen<ment
Analysis
and
Subjec<vity.
In
Handbook
of
Natural
Language
Processing,
Second
Edi<on,
Eds:
N.
 Indurkhya
and
F.
J.
Damerau),
CRC
Press,
Taylor
and
Francis
Group,
Boca
Raton,
FL.
Chapter
28.


 •  Mar<n,
J.R.
and
White,
P.
R.
R.,
2005.
The
Language
of
Evalua<on,
Appraisal
in
English,
Palgrave
Macmillan,
London
&
 New
York.
 •  Taboada,
M.,
Brooke,
J.,
Tofiloski,
M.,
Voll,
K.D.,
Stede,
M.,
2011.
Lexicon‐based
methods
for
sen<ment
analysis.
 Computa<onal
Linguis<cs
37(2),
267–307.
 •  Tang,
H.,
Tan,
S.,
Cheng,
X.,
2009.
A
survey
on
sen<ment
detec<on
of
reviews.
Expert
Systems
with
Applica<ons
36(7),
 10760
–
10773.
 •  Whitelaw,
C.;
Garg,
N.
and
Argamon,
S.,
2005.
Using
appraisal
groups
for
sen<ment
analysis.
In
Proceedings
of
the
14th
 ACM
interna<onal
conference
on
Informa<on
and
knowledge
management
(CIKM
'05).
ACM,
New
York,
NY,
USA,
625‐631.
 •  Wilson,
T.,
2008.
Fine‐Grained
Subjec<vity
Analysis.
PhD
Disserta<on,
Intelligent
Systems
Program,
University
of
 Pi/sburgh.
 •  Wilson,
T.,
Wiebe,
J.,
Hoffmann,
P.,
2009.
Recognizing
contextual
polarity:
An
explora<on
of
features
for
phrase‐level
 sen<ment
analysis.
Computa<onal
Linguis<cs
35,
399–433.
 •  Y.
Wu,
F.
Wei,
S.
Liu,
N.
Au,
W.
Cui,
H.
Zhou,
and
H.
Qu,
2010.
OpinionSeer:
Interac<ve
Visualisa<on
of
Hotel
Customer
 Feedback.
IEEE
Transac<ons
on
Visualiza<on
and
Computer
Graphics,
6,
1109‐1118.
Nov‐Dec.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 53

  • 54. Open‐source
sen<ment‐analysis
tools
 •  Python
NLTK
(Natural
Language
Toolkit)
 –  h/p://www.nltk.org
and
h/p://text‐processing.com/demo/sen<ment
 •  R,
TM
(text
mining)
module
h/p://cran.r‐project.org/web/packages/tm/index.html

 •  RapidMiner
h/p://rapid‐i.com/content/view/184/196/

 •  GATE,
the
General
Architecture
for
Text
Engineering
h/p://gate.ac.uk/sen<ment

 •  UIMA‐plug‐in
annotators
for
sen<ment
—
Apache
UIMA
is
the
 Unstructured
Informa<on
Management
Architecture,
h/p://uima.apache.org/


 •  SenJment
classifiers
for
the
WEKA
data‐mining
workbench,
 h/p://www.cs.waikato.ac.nz/ml/weka/.

 •  Stanford
NLP
tools
‐
h/p://www‐nlp.stanford.edu/soware
maximum‐entropy
 classifica<on
approach
for
sen<ment.
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 54

  • 55. Thank
you
very
much
for
your
 a/en<on!!
 Ques<ons
 Apr‐18‐12
 Marcirio
Chaves
‐
marcirioc@uatlan<ca.pt
 55