SlideShare ist ein Scribd-Unternehmen logo
1 von 60
Downloaden Sie, um offline zu lesen
Seman&c	
  Analysis	
  in	
  Language	
  Technology	
  
http://stp.lingfil.uu.se/~santinim/sais/2016/sais_2016.htm 



Word Senses


Marina	
  San(ni	
  
san$nim@stp.lingfil.uu.se	
  
	
  
Department	
  of	
  Linguis(cs	
  and	
  Philology	
  
Uppsala	
  University,	
  Uppsala,	
  Sweden	
  
	
  
Spring	
  2016	
  
	
  
	
  
Previous	
  Lecture:	
  Sen$ment	
  Analysis	
  
•  Affec(ve	
  meaning	
  is	
  a	
  kind	
  of	
  connota(onal	
  meaning	
  	
  
•  The	
  importance	
  of	
  sen(ment	
  lexicons	
  
•  Methods	
  for	
  the	
  automa(c	
  expansion	
  of	
  manually-­‐
annotated	
  lexicons	
  
•  A	
  baseline	
  algorithm:	
  Naive	
  Bayes	
  
•  Prac(cal	
  ac(vity:	
  ML-­‐based	
  sen(ment	
  classifier:	
  
movie	
  reviews,	
  product	
  reviews,	
  restaurant	
  reviews…	
  
•  Results…	
  uhm…	
  	
  somehow	
  biassed	
  (short	
  text	
  vs	
  
long	
  texts);	
  never	
  a	
  full	
  posi(ve	
  polarity;	
  etc.	
  
2	
   Lecture	
  4:	
  Word	
  Senses	
  
How	
  is	
  *meaning*	
  handled	
  in	
  Seman$c-­‐Based	
  LT-­‐Applica$ons?	
  
•  Seman(c	
  Role	
  Labelling/Predicate-­‐Argument	
  Structure	
  	
  
•  Main	
  trend:	
  
•  crea(on	
  of	
  annotated	
  resources	
  (PropBank,	
  FrameNet,	
  etc.);	
  	
  	
  
•  use	
  of	
  supervised	
  machine	
  learning:	
  classifiers	
  are	
  trained	
  on	
  annotated	
  resources,	
  such	
  as	
  PropBank	
  and	
  
FrameNet.	
  	
  
•  Sen(ment	
  Analysis	
  	
  
•  Main	
  trends:	
  
•  iden(fica(on	
  of	
  sen(ment-­‐bearing	
  features	
  or	
  iden(fica(on	
  of	
  representa(ve	
  features	
  for	
  the	
  problem	
  
•  use	
  of	
  supervervised	
  learning	
  à	
  cf	
  the	
  results	
  of	
  NLTK	
  classifier	
  
•  Word	
  sense	
  disambigua(on	
  (???)	
  
•  Informa(on	
  extrac(on	
  (???)	
  
•  Ques(on	
  Answering	
  (???)	
  
•  Ontologies	
  (???)	
  
Lecture	
  4:	
  Word	
  Senses	
  3	
  
Reminder:	
  Glossary	
  Entries	
  
•  Which	
  concepts	
  are	
  the	
  most	
  salient	
  in	
  Lect	
  3?	
  
•  Update	
  your	
  Glossary…	
  
Lecture	
  4:	
  Word	
  Senses	
  4	
  
Previous	
  lecture:	
  end	
  
5	
   Lecture	
  4:	
  Word	
  Senses	
  
Word	
  Senses	
  
•  Master	
  Students	
  à	
  NLP	
  course	
  
•  Bachelor	
  Students	
  à	
  Seman(cs	
  
6	
   Lecture	
  4:	
  Word	
  Senses	
  
From	
  Lect	
  2:	
  PropBank	
  &	
  Selec$onal	
  Restric$ons	
  
•  PropBank	
  is	
  organized	
  by	
  word	
  
senses	
  :	
  word	
  senses	
  are	
  different	
  
aspects	
  of	
  meaning	
  of	
  a	
  word	
  
•  Selec(onal	
  restric(ons…	
  we	
  can	
  use	
  
seman(c	
  constraints	
  to	
  
disambiguate	
  senses.	
  Ex:	
  eat,	
  
serve….	
  bear	
  me	
  some	
  pa(ence…	
  
7	
   Lecture	
  4:	
  Word	
  Senses	
  
Acknowledgements
Most	
  slides	
  borrowed	
  from:	
  
Dan	
  Jurafsky	
  and	
  James	
  H.	
  Mar(n	
  
Some	
  slides	
  borrowed	
  from	
  D.	
  Jurafsky	
  and	
  C.	
  Manning	
  	
  and	
  D.	
  Radev	
  (Coursera)	
  
	
  
J&M(2015,	
  draf):	
  hgps://web.stanford.edu/~jurafsky/slp3/	
  	
  
	
  
	
  	
  	
  
Outline	
  
•  Word	
  Meaning	
  
•  WordNet	
  and	
  Other	
  Lexical	
  Resources	
  
•  Selec(onal	
  Restric(ons	
  
9	
   Lecture	
  4:	
  Word	
  Senses	
  
Logic:	
  meaning	
  representa$on:	
  uppercase	
  words!	
  
•  Constant	
  
•  Variables	
  
•  Predicates	
  
•  Boolean	
  connec(ves	
  
•  Quan(fiers	
  
•  Brackets	
  and	
  comma	
  to	
  group	
  the	
  symbols	
  together	
  
•  Ex:	
  A	
  woman	
  crosses	
  Sunset	
  Boulevard	
  
Lecture	
  4:	
  Word	
  Senses	
  10	
  
Formal	
  Seman(cs	
  
Defini$ons	
  
•  Lexical	
  seman$cs	
  is	
  the	
  study	
  of	
  the	
  meaning	
  of	
  words	
  and	
  the	
  systema(c	
  
meaning-­‐related	
  connec(ons	
  between	
  words.	
  	
  	
  
•  A	
  word	
  sense	
  is	
  the	
  locus	
  of	
  word	
  meaning;	
  defini(ons	
  and	
  meaning	
  rela(ons	
  
are	
  defined	
  at	
  the	
  level	
  of	
  the	
  word	
  sense	
  rather	
  than	
  wordforms.	
  	
  
•  Homonymy	
  is	
  the	
  rela(on	
  between	
  unrelated	
  senses	
  that	
  share	
  a	
  form.	
  	
  
•  Polysemy	
  is	
  the	
  rela(on	
  between	
  related	
  senses	
  that	
  share	
  a	
  form.	
  
•  Synonymy	
  holds	
  between	
  different	
  words	
  with	
  the	
  same	
  meaning.	
  
•  Hyponymy	
  and	
  hypernymy	
  rela(ons	
  hold	
  between	
  words	
  that	
  are	
  in	
  a	
  class	
  
inclusion	
  rela(onship.	
  	
  
•  Meronymy	
  type	
  of	
  hierarchy	
  that	
  deals	
  with	
  part–whole	
  rela(onships.	
  
•  WordNet	
  is	
  a	
  large	
  database	
  of	
  lexical	
  rela(ons	
  for	
  English	
  
11	
  
	
  
Lecture	
  4:	
  Word	
  Senses	
  
Word Meaning and
Similarity
Word	
  Senses	
  and	
  
Word	
  Rela(ons	
  
Reminder:	
  lemma	
  and	
  wordform	
  
•  A	
  lemma	
  or	
  cita$on	
  form	
  
•  Same	
  stem,	
  part	
  of	
  speech,	
  rough	
  seman(cs	
  
•  A	
  wordform	
  
•  The	
  “inflected”	
  word	
  as	
  it	
  appears	
  in	
  text	
  
Wordform	
   Lemma	
  
banks	
   bank	
  
sung	
   sing	
  
duermes	
   dormir	
   Lecture	
  4:	
  Word	
  Senses	
  13	
  
Cf.	
  token/type	
  ra(o:	
  
crude	
  measure	
  of	
  
lexical	
  densi(y:	
  	
  
	
  
If a text is 1,000 words long,
it is said to have 1,000
"tokens". But a lot of these
words will be repeated, and
there may be only say 400
different words in the text.
"Types", therefore, are the
different words.
The ratio between types and
tokens in this example would
be 40%. (source: wordsmith
tools)	
  
Lemmas	
  have	
  senses	
  
•  One	
  lemma	
  “bank”	
  can	
  have	
  many	
  meanings:	
  
•  …a bank can hold the investments in a custodial
account…!
•  “…as agriculture burgeons on the east bank the
river will shrink even more”	
  
•  Sense	
  (or	
  word	
  sense)	
  
•  A	
  discrete	
  representa(on	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  of	
  an	
  aspect	
  of	
  a	
  word’s	
  meaning.	
  
•  The	
  lemma	
  bank	
  here	
  has	
  two	
  senses	
  
1!
2!
Sense	
  1:	
  
Sense	
  2:	
  
Lecture	
  4:	
  Word	
  Senses	
  
14	
  
Other	
  examples?	
  
Lecture	
  4:	
  Word	
  Senses	
  15	
  
Homonymy	
  
Homonyms:	
  words	
  that	
  share	
  a	
  form	
  but	
  have	
  
unrelated,	
  dis(nct	
  meanings:	
  
•  bank1:	
  financial	
  ins(tu(on,	
  	
  	
  	
  bank2:	
  	
  sloping	
  land	
  
•  bat1:	
  club	
  for	
  hiqng	
  a	
  ball,	
  	
  	
  	
  bat2:	
  	
  nocturnal	
  flying	
  mammal	
  
1.  Homographs	
  (bank/bank,	
  bat/bat)	
  
2.  Homophones:	
  
1.  Write	
  and	
  right	
  
2.  Piece	
  and	
  peace	
  
Lecture	
  4:	
  Word	
  Senses	
  16	
  
Homonymy	
  causes	
  problems	
  for	
  NLP	
  
applica$ons	
  
•  Informa(on	
  retrieval	
  
•  “bat care”!
•  Machine	
  Transla(on	
  
•  bat:	
  	
  murciélago	
  	
  (animal)	
  or	
  	
  bate	
  (for	
  baseball)	
  
•  Text-­‐to-­‐Speech	
  
•  bass	
  (stringed	
  instrument)	
  vs.	
  bass	
  (fish)	
  
•  There	
  would	
  be	
  no	
  ambiguity	
  for	
  Speech	
  to	
  Text:	
  why?	
  
Lecture	
  4:	
  Word	
  Senses	
  17	
  
Polysemy	
  
•  1.	
  The	
  bank	
  was	
  constructed	
  in	
  1875	
  out	
  of	
  local	
  red	
  brick.	
  
•  2.	
  I	
  withdrew	
  the	
  money	
  from	
  the	
  bank	
  	
  
•  Are	
  those	
  the	
  same	
  sense?	
  
•  Sense	
  2:	
  “A	
  financial	
  ins(tu(on”	
  
•  Sense	
  1:	
  “The	
  building	
  belonging	
  to	
  a	
  financial	
  ins(tu(on”	
  
•  A	
  polysemous	
  word	
  has	
  related	
  meanings	
  
•  Most	
  non-­‐rare	
  words	
  have	
  mul(ple	
  meanings	
  
Lecture	
  4:	
  Word	
  Senses	
  18	
  
Polysemy	
  
19	
   Lecture	
  4:	
  Word	
  Senses	
  
•  Lots	
  of	
  types	
  of	
  polysemy	
  are	
  systema(c	
  
•  School, university, hospital!
•  All	
  can	
  mean	
  the	
  ins(tu(on	
  or	
  the	
  building.	
  
•  A	
  systema(c	
  rela(onship:	
  
•  Building	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Organiza(on	
  
•  Other	
  such	
  kinds	
  of	
  systema(c	
  polysemy:	
  	
  
Author	
  (Jane Austen wrote Emma)	
  	
  	
  	
  	
  
	
  Works	
  of	
  Author	
  (I love Jane Austen)	
  
Tree	
  (Plums have beautiful blossoms) !
!Fruit	
  (I ate a preserved plum)!
Metonymy	
  or	
  Systema$c	
  Polysemy:	
  	
  
A	
  systema$c	
  rela$onship	
  between	
  senses	
  
Lecture	
  4:	
  Word	
  Senses	
  20	
  
How	
  do	
  we	
  know	
  when	
  a	
  word	
  has	
  more	
  
than	
  one	
  sense?	
  
Lecture	
  4:	
  Word	
  Senses	
  21	
  
How	
  do	
  we	
  know	
  when	
  a	
  word	
  has	
  more	
  
than	
  one	
  sense?	
  
•  The	
  “zeugma”	
  test:	
  Two	
  senses	
  of	
  serve?	
  
•  Which flights serve breakfast?!
•  Does Lufthansa serve Philadelphia?!
•  ?Does	
  Lufhansa	
  serve	
  breakfast	
  and	
  San	
  Jose?	
  
•  Since	
  this	
  conjunc(on	
  sounds	
  weird,	
  	
  
•  we	
  say	
  that	
  these	
  are	
  two	
  different	
  senses	
  of	
  “serve”	
  
Lecture	
  4:	
  Word	
  Senses	
  22	
  
Synonyms	
  
•  Word	
  that	
  have	
  the	
  same	
  meaning	
  in	
  some	
  or	
  all	
  contexts.	
  
•  filbert	
  /	
  hazelnut	
  
•  couch	
  /	
  sofa	
  
•  big	
  /	
  large	
  
•  automobile	
  /	
  car	
  
•  vomit	
  /	
  throw	
  up	
  
•  Water	
  /	
  H20	
  
•  Two	
  lexemes	
  are	
  synonyms	
  	
  
•  if	
  they	
  can	
  be	
  subs(tuted	
  for	
  each	
  other	
  in	
  all	
  situa(ons	
  
•  If	
  so	
  they	
  have	
  the	
  same	
  proposi$onal	
  meaning	
  
Lecture	
  4:	
  Word	
  Senses	
  23	
  
Synonyms	
  
•  But	
  there	
  are	
  few	
  (or	
  no)	
  examples	
  of	
  perfect	
  synonymy.	
  
•  Even	
  if	
  many	
  aspects	
  of	
  meaning	
  are	
  iden(cal	
  
•  S(ll	
  may	
  not	
  preserve	
  the	
  acceptability	
  based	
  on	
  no(ons	
  of	
  politeness,	
  
slang,	
  register,	
  genre,	
  etc.	
  
•  Example:	
  
•  Water/H20	
  
•  Big/large	
  
•  Brave/courageous 	
   	
   	
   	
  high	
  brow:	
  la(nate	
  words	
  
Lecture	
  4:	
  Word	
  Senses	
  24	
  
Synonymy	
  is	
  a	
  rela$on	
  	
  
between	
  senses	
  rather	
  than	
  words	
  
•  Consider	
  the	
  words	
  big	
  and	
  large	
  
•  Are	
  they	
  synonyms?	
  
•  How	
  big	
  is	
  that	
  plane?	
  
•  Would	
  I	
  be	
  flying	
  on	
  a	
  large	
  or	
  small	
  plane?	
  
•  How	
  about	
  here:	
  
•  Miss	
  Nelson	
  became	
  a	
  kind	
  of	
  big	
  sister	
  to	
  Benjamin.	
  
•  ?Miss	
  Nelson	
  became	
  a	
  kind	
  of	
  large	
  sister	
  to	
  Benjamin.	
  
•  Why?	
  
•  big	
  has	
  a	
  sense	
  that	
  means	
  being	
  older,	
  or	
  grown	
  up	
  
•  large	
  lacks	
  this	
  sense	
  
Lecture	
  4:	
  Word	
  Senses	
  25	
  
Synonymy:	
  Summary	
  
26	
   Lecture	
  4:	
  Word	
  Senses	
  
Other	
  seman$c	
  rela$ons…	
  
27	
   Lecture	
  4:	
  Word	
  Senses	
  
Antonyms	
  
•  Senses	
  that	
  are	
  opposites	
  with	
  respect	
  to	
  one	
  feature	
  of	
  meaning	
  
•  Otherwise,	
  they	
  are	
  very	
  similar!	
  
dark/light short/long !fast/slow !rise/fall!
hot/cold! up/down! in/out!
•  More	
  formally:	
  antonyms	
  can	
  
•  define	
  a	
  binary	
  opposi(on	
  
	
  or	
  be	
  at	
  opposite	
  ends	
  of	
  a	
  scale	
  
•  	
  long/short, fast/slow!
•  Be	
  reversives:	
  
•  rise/fall, up/down!
Lecture	
  4:	
  Word	
  Senses	
  
28	
  
Hyponymy	
  and	
  Hypernymy	
  
•  One	
  sense	
  is	
  a	
  hyponym	
  of	
  another	
  if	
  the	
  first	
  sense	
  is	
  more	
  
specific,	
  deno(ng	
  a	
  subclass	
  of	
  the	
  other	
  
•  car	
  is	
  a	
  hyponym	
  of	
  vehicle	
  
•  mango	
  is	
  a	
  hyponym	
  of	
  fruit	
  
•  Conversely	
  hypernym/superordinate	
  (“hyper	
  is	
  super”)	
  
•  vehicle	
  is	
  a	
  hypernym	
  	
  of	
  car	
  
•  fruit	
  is	
  a	
  hypernym	
  of	
  mango	
  
Superordinate/hyper vehicle fruit furniture
Subordinate/hyponym car mango chair Lecture	
  4:	
  Word	
  Senses	
  29	
  
Hyponymy	
  more	
  formally	
  
•  Extensional:	
  
•  The	
  class	
  denoted	
  by	
  the	
  superordinate	
  extensionally	
  includes	
  the	
  class	
  
denoted	
  by	
  the	
  hyponym	
  
•  Entailment:	
  
•  A	
  sense	
  A	
  is	
  a	
  hyponym	
  of	
  sense	
  B	
  if	
  being	
  an	
  A	
  entails	
  being	
  a	
  B	
  
•  Hyponymy	
  is	
  usually	
  transi(ve	
  	
  
•  (A	
  hypo	
  B	
  and	
  B	
  hypo	
  C	
  entails	
  A	
  hypo	
  C)	
  
•  Another	
  name:	
  the	
  IS-­‐A	
  hierarchy	
  
•  A	
  IS-­‐A	
  B	
  	
  	
  	
  	
  	
  (or	
  A	
  ISA	
  B)	
  
•  B	
  subsumes	
  A	
  
Lecture	
  4:	
  Word	
  Senses	
  30	
  
Hyponyms	
  and	
  Instances	
  
•  WordNet	
  has	
  both	
  classes	
  and	
  instances.	
  
•  An	
  instance	
  is	
  an	
  individual,	
  a	
  proper	
  noun	
  that	
  is	
  a	
  unique	
  en(ty	
  
•  San Francisco is	
  an	
  instance	
  of	
  city!
•  But	
  city	
  is	
  a	
  class	
  
•  city	
  is	
  a	
  hyponym	
  of	
  	
  	
  	
  municipality...location...!
31	
   Lecture	
  4:	
  Word	
  Senses	
  
WordNet	
  and	
  other	
  
Online	
  Thesauri	
  
Applica$ons	
  of	
  Thesauri	
  and	
  Ontologies	
  
•  Informa(on	
  Extrac(on	
  
•  Informa(on	
  Retrieval	
  
•  Ques(on	
  Answering	
  
•  Bioinforma(cs	
  and	
  Medical	
  Informa(cs	
  
•  Machine	
  Transla(on	
  
Lecture	
  4:	
  Word	
  Senses	
  33	
  
WordNet	
  
34	
   Lecture	
  4:	
  Word	
  Senses	
  
Synsets	
  
35	
   Lecture	
  4:	
  Word	
  Senses	
  
How	
  is	
  “sense”	
  defined	
  in	
  WordNet?	
  
•  The	
  synset	
  (synonym	
  set),	
  the	
  set	
  of	
  near-­‐synonyms,	
  
instan(ates	
  a	
  sense	
  or	
  concept,	
  with	
  a	
  gloss	
  
•  Example:	
  chump	
  as	
  a	
  noun	
  with	
  the	
  gloss:	
  
“a	
  person	
  who	
  is	
  gullible	
  and	
  easy	
  to	
  take	
  advantage	
  of”	
  
•  This	
  sense	
  of	
  “chump”	
  is	
  shared	
  by	
  9	
  words:	
  
chump1, fool2, gull1, mark9, patsy1, fall guy1,
sucker1, soft touch1, mug2!
•  Each	
  of	
  these	
  senses	
  have	
  this	
  same	
  gloss	
  
•  (Not	
  every	
  sense;	
  sense	
  2	
  of	
  gull	
  is	
  the	
  aqua(c	
  bird)	
  
	
  
Lecture	
  4:	
  Word	
  Senses	
  
36	
  
gullible=naive	
  
Tree-­‐like	
  Structure	
  
37	
   Lecture	
  4:	
  Word	
  Senses	
  
WordNet:	
  bar	
  1/6	
  
38	
   Lecture	
  4:	
  Word	
  Senses	
  
2/6	
  
39	
   Lecture	
  4:	
  Word	
  Senses	
  
3/6	
  
40	
   Lecture	
  4:	
  Word	
  Senses	
  
4/6	
  
41	
   Lecture	
  4:	
  Word	
  Senses	
  
5/6	
  
42	
   Lecture	
  4:	
  Word	
  Senses	
  
6/6	
  
43	
   Lecture	
  4:	
  Word	
  Senses	
  
Polysemy	
  
44	
   Lecture	
  4:	
  Word	
  Senses	
  
WordNet	
  3.0	
  
•  A	
  hierarchically	
  organized	
  lexical	
  database	
  
•  On-­‐line	
  thesaurus	
  +	
  aspects	
  of	
  a	
  dic(onary	
  
•  Some	
  other	
  languages	
  available	
  or	
  under	
  development	
  
•  (Arabic,	
  Finnish,	
  German,	
  Portuguese…)	
  
Category	
   Unique	
  Strings	
  
Noun	
   117,798	
  
Verb	
   11,529	
  
Adjec(ve	
   22,479	
  
Adverb	
   4,481	
  
Lecture	
  4:	
  Word	
  Senses	
  
45	
  
Senses	
  of	
  “bass”	
  in	
  Wordnet	
  
Lecture	
  4:	
  Word	
  Senses	
  46	
  
WordNet	
  Hypernym	
  Hierarchy	
  for	
  “bass”	
  
Lecture	
  4:	
  Word	
  Senses	
  47	
  
WordNet	
  Noun	
  Rela$ons	
  
Lecture	
  4:	
  Word	
  Senses	
  48	
  
WordNet	
  3.0	
  
•  Where	
  it	
  is:	
  
•  hgp://wordnetweb.princeton.edu/perl/webwn	
  
•  Libraries	
  
•  Python:	
  	
  WordNet	
  	
  from	
  NLTK	
  
•  hgp://www.nltk.org/Home	
  
•  Java:	
  
•  JWNL,	
  extJWNL	
  on	
  sourceforge	
  Lecture	
  4:	
  Word	
  Senses	
  
Synset
•  MeSH	
  (Medical	
  Subject	
  Headings)	
  
•  177,000	
  entry	
  terms	
  	
  that	
  correspond	
  to	
  26,142	
  biomedical	
  
“headings”	
  
•  Hemoglobins	
  
Entry	
  Terms:	
  	
  Eryhem,	
  Ferrous	
  Hemoglobin,	
  Hemoglobin	
  
Defini$on:	
  	
  The	
  oxygen-­‐carrying	
  proteins	
  of	
  ERYTHROCYTES.	
  
They	
  are	
  found	
  in	
  all	
  vertebrates	
  and	
  some	
  invertebrates.	
  
The	
  number	
  of	
  globin	
  subunits	
  in	
  the	
  hemoglobin	
  quaternary	
  
structure	
  differs	
  between	
  species.	
  Structures	
  range	
  from	
  
monomeric	
  to	
  a	
  variety	
  of	
  mul(meric	
  arrangements	
  
MeSH:	
  Medical	
  Subject	
  Headings	
  
thesaurus	
  from	
  the	
  Na$onal	
  Library	
  of	
  Medicine	
  
Lecture	
  4:	
  Word	
  Senses	
  50	
  
The	
  MeSH	
  Hierarchy	
  
•  a	
  
51	
   Lecture	
  4:	
  Word	
  Senses	
  
Uses	
  of	
  the	
  MeSH	
  Ontology	
  
•  Provide	
  synonyms	
  (“entry	
  terms”)	
  
•  E.g.,	
  glucose	
  and	
  dextrose	
  
•  Provide	
  hypernyms	
  (from	
  the	
  hierarchy)	
  
•  E.g.,	
  glucose	
  ISA	
  monosaccharide	
  
•  Indexing	
  in	
  MEDLINE/PubMED	
  database	
  
•  NLM’s	
  bibliographic	
  database:	
  	
  
•  20	
  million	
  journal	
  ar(cles	
  
•  Each	
  ar(cle	
  hand-­‐assigned	
  10-­‐20	
  MeSH	
  terms	
   Lecture	
  4:	
  Word	
  Senses	
  52	
  
Other	
  resources	
  
53	
   Lecture	
  4:	
  Word	
  Senses	
  
BabelNet	
  
54	
   Lecture	
  4:	
  Word	
  Senses	
  
Selectional
Restrictions
Selec$onal	
  Restric$ons	
  
Consider	
  the	
  two	
  interpreta(ons	
  of:	
  
	
  I	
  want	
  to	
  eat	
  someplace	
  nearby.	
  	
  
a)  sensible: 	
  	
  
Eat	
  is	
  intransi(ve	
  and	
  “someplace	
  nearby”	
  is	
  a	
  loca(on	
  adjunct	
  
b)  Speaker	
  is	
  Godzilla:	
  a	
  monster	
  that	
  likes	
  ea(ng	
  buildings!!!	
  
	
  	
  	
  	
  	
  Eat	
  is	
  transi(ve	
  and	
  “someplace	
  nearby”	
  is	
  a	
  direct	
  object	
  
How	
  do	
  we	
  know	
  speaker	
  didn’t	
  mean	
  b)	
  	
  ?	
  
	
  Because	
  the	
  THEME	
  of	
  ea(ng	
  tends	
  to	
  be	
  something	
  edible	
  
56	
   Lecture	
  4:	
  Word	
  Senses	
  
Selec$onal	
  restric$ons	
  are	
  associated	
  with	
  senses	
  
•  The	
  restaurant	
  serves	
  green-­‐lipped	
  mussels.	
  	
  
•  THEME	
  is	
  some	
  kind	
  of	
  food	
  
•  Which	
  airlines	
  serve	
  Denver?	
  	
  
•  THEME	
  is	
  an	
  appropriate	
  loca(on	
  
57	
   Lecture	
  4:	
  Word	
  Senses	
  
apply	
  zeugma	
  test	
  
Represen$ng	
  selec$onal	
  restric$ons	
  
t consists of a single variable that stands for the event, a predicat
f event, and variables and relations for the event roles. Ignoring t
ctures and using thematic roles rather than deep event roles, the
on of a verb like eat might look like the following:
9e,x,y Eating(e)^Agent(e,x)^Theme(e,y)
representation, all we know about y, the filler of the THEME ro
iated with an Eating event through the Theme relation. To sti
l restriction that y must be something edible, we simply add a ne
:
9e,x,y Eating(e)^Agent(e,x)^Theme(e,y)^EdibleThing(y)58	
  
ntribution of a verb like eat might look like the following:
9e,x,y Eating(e)^Agent(e,x)^Theme(e,y)
th this representation, all we know about y, the filler of the THEME role, is th
s associated with an Eating event through the Theme relation. To stipulate t
ectional restriction that y must be something edible, we simply add a new term
t effect:
9e,x,y Eating(e)^Agent(e,x)^Theme(e,y)^EdibleThing(y)
=> dish
=> nutriment, nourishment, nutrition...
=> food, nutrient
=> substance
=> matter
=> physical entity
=> entity
Figure 22.6 Evidence from WordNet that hamburgers are edible.
When a phrase like ate a hamburger is encountered, a semantic analyzer can
form the following kind of representation:
9e,x,y Eating(e)^Eater(e,x)^Theme(e,y)^EdibleThing(y)^Hamburger(y)
This representation is perfectly reasonable since the membership of y in the category
Hamburger is consistent with its membership in the category EdibleThing, assuming
Instead	
  of	
  represen(ng	
  “eat”	
  as:	
  
Just	
  add:	
  
And	
  “eat	
  a	
  hamburger”	
  becomes	
  
But	
  this	
  assumes	
  we	
  have	
  a	
  large	
  knowledge	
  base	
  of	
  facts	
  
about	
  edible	
  things	
  and	
  hamburgers	
  and	
  whatnot.	
  
Let’s	
  use	
  WordNet	
  synsets	
  to	
  specify	
  
selec$onal	
  restric$ons	
  
•  The	
  THEME	
  of	
  eat	
  must	
  be	
  WordNet	
  synset	
  {food,	
  nutrient}	
  	
  
“any	
  substance	
  that	
  can	
  be	
  metabolized	
  by	
  an	
  animal	
  to	
  give	
  energy	
  and	
  build	
  8ssue”	
  
•  Similarly	
  
THEME	
  of	
  imagine:	
  synset	
  {en(ty}	
  
THEME	
  of	
  li=:	
  synset	
  {physical	
  en(ty}	
  
THEME	
  of	
  diagonalize:	
  synset	
  {matrix}	
  	
  
•  This	
  allows	
  
imagine	
  a	
  hamburger	
  	
  and	
  	
  li=	
  a	
  hamburger,	
  	
  
•  Correctly	
  rules	
  out	
  	
  
diagonalize	
  a	
  hamburger.	
  	
  59	
   Lecture	
  4:	
  Word	
  Senses	
  
The end

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Lexical Relations By Nasir
Lexical Relations By NasirLexical Relations By Nasir
Lexical Relations By Nasir
 
Semantics
SemanticsSemantics
Semantics
 
Syntax
SyntaxSyntax
Syntax
 
Tree diagram
Tree diagramTree diagram
Tree diagram
 
Dictionaries
DictionariesDictionaries
Dictionaries
 
Lexical semantics
Lexical semanticsLexical semantics
Lexical semantics
 
Lexicography
LexicographyLexicography
Lexicography
 
Lexical semantics
Lexical semanticsLexical semantics
Lexical semantics
 
Semantics
SemanticsSemantics
Semantics
 
Constituency, Trees and Rules
Constituency, Trees and Rules Constituency, Trees and Rules
Constituency, Trees and Rules
 
Lecture 2 sentence structure constituents
Lecture 2 sentence structure constituentsLecture 2 sentence structure constituents
Lecture 2 sentence structure constituents
 
Wordnet
WordnetWordnet
Wordnet
 
Sense relations 2
Sense relations 2Sense relations 2
Sense relations 2
 
1. An Overview of Functional Grammar
1. An Overview of Functional Grammar1. An Overview of Functional Grammar
1. An Overview of Functional Grammar
 
Deep structure and surface structure
Deep structure and surface structureDeep structure and surface structure
Deep structure and surface structure
 
Minimalism.pptx
Minimalism.pptxMinimalism.pptx
Minimalism.pptx
 
Generative grammar
Generative grammarGenerative grammar
Generative grammar
 
Sense relations (linguistics)
Sense relations (linguistics)Sense relations (linguistics)
Sense relations (linguistics)
 
Sense relation
Sense relationSense relation
Sense relation
 
Syntax
SyntaxSyntax
Syntax
 

Andere mochten auch

Radial categories franklin delacruz
Radial categories franklin delacruzRadial categories franklin delacruz
Radial categories franklin delacruzFranklin De la Cruz
 
SYNONYMS, ANTONYMS, POLYSEMY, HOMONYM, AND HOMOGRAPH
SYNONYMS, ANTONYMS, POLYSEMY,  HOMONYM, AND HOMOGRAPHSYNONYMS, ANTONYMS, POLYSEMY,  HOMONYM, AND HOMOGRAPH
SYNONYMS, ANTONYMS, POLYSEMY, HOMONYM, AND HOMOGRAPHLili Lulu
 
Lexical relations
Lexical relationsLexical relations
Lexical relationsHina Honey
 
Edad 820 ghada to ambiguity 2 3 12
Edad 820 ghada to ambiguity 2 3 12Edad 820 ghada to ambiguity 2 3 12
Edad 820 ghada to ambiguity 2 3 12professorlkb
 
Alexandra Arapinis : From ontological structures to semantic lexical structur...
Alexandra Arapinis : From ontological structures to semantic lexical structur...Alexandra Arapinis : From ontological structures to semantic lexical structur...
Alexandra Arapinis : From ontological structures to semantic lexical structur...PhiloWeb
 
Compound Noun Polysemy and Sense Enumeration in WordNet
Compound Noun Polysemy and Sense Enumeration in WordNet Compound Noun Polysemy and Sense Enumeration in WordNet
Compound Noun Polysemy and Sense Enumeration in WordNet Biswanath Dutta
 
Semantics relations among words
Semantics relations among wordsSemantics relations among words
Semantics relations among wordsRatna Nurhidayati
 
semantic ambiguity final
semantic ambiguity finalsemantic ambiguity final
semantic ambiguity finalRuby Slabicki
 
Seven types of ambiguity - the role of communications in the modern media soc...
Seven types of ambiguity - the role of communications in the modern media soc...Seven types of ambiguity - the role of communications in the modern media soc...
Seven types of ambiguity - the role of communications in the modern media soc...Johannes Meier
 
Lex bài tập
Lex bài tậpLex bài tập
Lex bài tậpNana D
 
Ad polysemy
Ad polysemyAd polysemy
Ad polysemySanja B
 
Seven types of ambiguity
Seven types of ambiguity Seven types of ambiguity
Seven types of ambiguity Johannes Meier
 
homophone, homonomy, polysemy
homophone, homonomy, polysemyhomophone, homonomy, polysemy
homophone, homonomy, polysemystefani llontop
 
Some problems of ambiguity in translation with reference to english and arabic
Some problems of ambiguity in translation with reference to english and arabicSome problems of ambiguity in translation with reference to english and arabic
Some problems of ambiguity in translation with reference to english and arabicfalah_hasan77
 

Andere mochten auch (20)

Radial categories franklin delacruz
Radial categories franklin delacruzRadial categories franklin delacruz
Radial categories franklin delacruz
 
Semantic relation among words
Semantic relation among wordsSemantic relation among words
Semantic relation among words
 
SYNONYMS, ANTONYMS, POLYSEMY, HOMONYM, AND HOMOGRAPH
SYNONYMS, ANTONYMS, POLYSEMY,  HOMONYM, AND HOMOGRAPHSYNONYMS, ANTONYMS, POLYSEMY,  HOMONYM, AND HOMOGRAPH
SYNONYMS, ANTONYMS, POLYSEMY, HOMONYM, AND HOMOGRAPH
 
Lexical relations
Lexical relationsLexical relations
Lexical relations
 
Homonym & polysemy
Homonym & polysemyHomonym & polysemy
Homonym & polysemy
 
A Pattern-Based Approach to Hyponymy Relation Acquisition for the Agricultura...
A Pattern-Based Approach to Hyponymy Relation Acquisition for the Agricultura...A Pattern-Based Approach to Hyponymy Relation Acquisition for the Agricultura...
A Pattern-Based Approach to Hyponymy Relation Acquisition for the Agricultura...
 
Edad 820 ghada to ambiguity 2 3 12
Edad 820 ghada to ambiguity 2 3 12Edad 820 ghada to ambiguity 2 3 12
Edad 820 ghada to ambiguity 2 3 12
 
Alexandra Arapinis : From ontological structures to semantic lexical structur...
Alexandra Arapinis : From ontological structures to semantic lexical structur...Alexandra Arapinis : From ontological structures to semantic lexical structur...
Alexandra Arapinis : From ontological structures to semantic lexical structur...
 
Compound Noun Polysemy and Sense Enumeration in WordNet
Compound Noun Polysemy and Sense Enumeration in WordNet Compound Noun Polysemy and Sense Enumeration in WordNet
Compound Noun Polysemy and Sense Enumeration in WordNet
 
Semantics 2
Semantics 2Semantics 2
Semantics 2
 
Semantics relations among words
Semantics relations among wordsSemantics relations among words
Semantics relations among words
 
semantic ambiguity final
semantic ambiguity finalsemantic ambiguity final
semantic ambiguity final
 
Seven types of ambiguity - the role of communications in the modern media soc...
Seven types of ambiguity - the role of communications in the modern media soc...Seven types of ambiguity - the role of communications in the modern media soc...
Seven types of ambiguity - the role of communications in the modern media soc...
 
Lex bài tập
Lex bài tậpLex bài tập
Lex bài tập
 
Ad polysemy
Ad polysemyAd polysemy
Ad polysemy
 
Polysemi warohmah hasanah
Polysemi warohmah hasanahPolysemi warohmah hasanah
Polysemi warohmah hasanah
 
Seven types of ambiguity
Seven types of ambiguity Seven types of ambiguity
Seven types of ambiguity
 
homophone, homonomy, polysemy
homophone, homonomy, polysemyhomophone, homonomy, polysemy
homophone, homonomy, polysemy
 
Some problems of ambiguity in translation with reference to english and arabic
Some problems of ambiguity in translation with reference to english and arabicSome problems of ambiguity in translation with reference to english and arabic
Some problems of ambiguity in translation with reference to english and arabic
 
AMBIGUITY IN A LANGUAGE
AMBIGUITY IN A LANGUAGEAMBIGUITY IN A LANGUAGE
AMBIGUITY IN A LANGUAGE
 

Ähnlich wie Lecture: Word Senses

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
Semantics: The Meaning of Language
Semantics: The Meaning of LanguageSemantics: The Meaning of Language
Semantics: The Meaning of LanguageJeffren Miguel
 
how to teach vocabulary
how to teach vocabularyhow to teach vocabulary
how to teach vocabularyKhalid Hanafi
 
Teach student vocabulary comprehensively
Teach student vocabulary comprehensivelyTeach student vocabulary comprehensively
Teach student vocabulary comprehensivelyHuynTrn222573
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsRoelof Pieters
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Saurabh Kaushik
 
Inquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptxInquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptxutcrash88
 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationMarina Santini
 
リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)
リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)
リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)Yu Kanazawa / Osaka University
 
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Scottish Language Dictionaries
 
Natural Language Processing - Unit 1
Natural Language Processing - Unit 1Natural Language Processing - Unit 1
Natural Language Processing - Unit 1Mithun B N
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
discourse analysis0 1.pptx
discourse analysis0 1.pptxdiscourse analysis0 1.pptx
discourse analysis0 1.pptxAlkadumiHamletto
 
Jarrar.lecture notes.lexicalsemanticsandmultilingualism
Jarrar.lecture notes.lexicalsemanticsandmultilingualismJarrar.lecture notes.lexicalsemanticsandmultilingualism
Jarrar.lecture notes.lexicalsemanticsandmultilingualismBayan Maali
 
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Dawn Anderson MSc DigM
 
Interactional socio & training
Interactional socio & trainingInteractional socio & training
Interactional socio & trainingAnna Trester
 

Ähnlich wie Lecture: Word Senses (20)

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Semantics: The Meaning of Language
Semantics: The Meaning of LanguageSemantics: The Meaning of Language
Semantics: The Meaning of Language
 
how to teach vocabulary
how to teach vocabularyhow to teach vocabulary
how to teach vocabulary
 
Teach student vocabulary comprehensively
Teach student vocabulary comprehensivelyTeach student vocabulary comprehensively
Teach student vocabulary comprehensively
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
 
Inquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptxInquiry on the Philosophy of Language.pptx
Inquiry on the Philosophy of Language.pptx
 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense Disambiguation
 
リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)
リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)
リーディング研究会2014年6月輪読_最終版(関西学院大学大学院・金澤)
 
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
 
Natural Language Processing - Unit 1
Natural Language Processing - Unit 1Natural Language Processing - Unit 1
Natural Language Processing - Unit 1
 
Syntax
SyntaxSyntax
Syntax
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
discourse analysis0 1.pptx
discourse analysis0 1.pptxdiscourse analysis0 1.pptx
discourse analysis0 1.pptx
 
Word meaning
Word meaning Word meaning
Word meaning
 
Jarrar.lecture notes.lexicalsemanticsandmultilingualism
Jarrar.lecture notes.lexicalsemanticsandmultilingualismJarrar.lecture notes.lexicalsemanticsandmultilingualism
Jarrar.lecture notes.lexicalsemanticsandmultilingualism
 
Semantics
SemanticsSemantics
Semantics
 
Syntax.ppt
Syntax.pptSyntax.ppt
Syntax.ppt
 
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
 
Interactional socio & training
Interactional socio & trainingInteractional socio & training
Interactional socio & training
 

Mehr von Marina Santini

Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...Marina Santini
 
Towards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsTowards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsMarina Santini
 
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-Marina Santini
 
An Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability FeaturesAn Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability FeaturesMarina Santini
 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word CloudsMarina Santini
 
Lecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebLecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebMarina Santini
 
Lecture: Summarization
Lecture: SummarizationLecture: Summarization
Lecture: SummarizationMarina Santini
 
Lecture: Question Answering
Lecture: Question AnsweringLecture: Question Answering
Lecture: Question AnsweringMarina Santini
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)Marina Santini
 
Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Marina Santini
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role LabelingMarina Santini
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational SemanticsMarina Santini
 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Marina Santini
 
Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Marina Santini
 
Lecture 5: Interval Estimation
Lecture 5: Interval Estimation Lecture 5: Interval Estimation
Lecture 5: Interval Estimation Marina Santini
 
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioLecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioMarina Santini
 
Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part)Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part) Marina Santini
 
Lecture 3: Basic Concepts of Machine Learning - Induction & Evaluation
Lecture 3: Basic Concepts of Machine Learning - Induction & EvaluationLecture 3: Basic Concepts of Machine Learning - Induction & Evaluation
Lecture 3: Basic Concepts of Machine Learning - Induction & EvaluationMarina Santini
 

Mehr von Marina Santini (20)

Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
Can We Quantify Domainhood? Exploring Measures to Assess Domain-Specificity i...
 
Towards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsTowards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology Applications
 
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
A Web Corpus for eCare: Collection, Lay Annotation and Learning -First Results-
 
An Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability FeaturesAn Exploratory Study on Genre Classification using Readability Features
An Exploratory Study on Genre Classification using Readability Features
 
Lecture: Semantic Word Clouds
Lecture: Semantic Word CloudsLecture: Semantic Word Clouds
Lecture: Semantic Word Clouds
 
Lecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic WebLecture: Ontologies and the Semantic Web
Lecture: Ontologies and the Semantic Web
 
Lecture: Summarization
Lecture: SummarizationLecture: Summarization
Lecture: Summarization
 
Relation Extraction
Relation ExtractionRelation Extraction
Relation Extraction
 
Lecture: Question Answering
Lecture: Question AnsweringLecture: Question Answering
Lecture: Question Answering
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
 
Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)Lecture: Vector Semantics (aka Distributional Semantics)
Lecture: Vector Semantics (aka Distributional Semantics)
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role Labeling
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational Semantics
 
Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)Lecture 9: Machine Learning in Practice (2)
Lecture 9: Machine Learning in Practice (2)
 
Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1) Lecture 8: Machine Learning in Practice (1)
Lecture 8: Machine Learning in Practice (1)
 
Lecture 5: Interval Estimation
Lecture 5: Interval Estimation Lecture 5: Interval Estimation
Lecture 5: Interval Estimation
 
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain RatioLecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
 
Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part)Lecture 3b: Decision Trees (1 part)
Lecture 3b: Decision Trees (1 part)
 
Lecture 3: Basic Concepts of Machine Learning - Induction & Evaluation
Lecture 3: Basic Concepts of Machine Learning - Induction & EvaluationLecture 3: Basic Concepts of Machine Learning - Induction & Evaluation
Lecture 3: Basic Concepts of Machine Learning - Induction & Evaluation
 

Kürzlich hochgeladen

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 

Kürzlich hochgeladen (20)

The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 

Lecture: Word Senses

  • 1. Seman&c  Analysis  in  Language  Technology   http://stp.lingfil.uu.se/~santinim/sais/2016/sais_2016.htm 
 
 Word Senses
 Marina  San(ni   san$nim@stp.lingfil.uu.se     Department  of  Linguis(cs  and  Philology   Uppsala  University,  Uppsala,  Sweden     Spring  2016      
  • 2. Previous  Lecture:  Sen$ment  Analysis   •  Affec(ve  meaning  is  a  kind  of  connota(onal  meaning     •  The  importance  of  sen(ment  lexicons   •  Methods  for  the  automa(c  expansion  of  manually-­‐ annotated  lexicons   •  A  baseline  algorithm:  Naive  Bayes   •  Prac(cal  ac(vity:  ML-­‐based  sen(ment  classifier:   movie  reviews,  product  reviews,  restaurant  reviews…   •  Results…  uhm…    somehow  biassed  (short  text  vs   long  texts);  never  a  full  posi(ve  polarity;  etc.   2   Lecture  4:  Word  Senses  
  • 3. How  is  *meaning*  handled  in  Seman$c-­‐Based  LT-­‐Applica$ons?   •  Seman(c  Role  Labelling/Predicate-­‐Argument  Structure     •  Main  trend:   •  crea(on  of  annotated  resources  (PropBank,  FrameNet,  etc.);       •  use  of  supervised  machine  learning:  classifiers  are  trained  on  annotated  resources,  such  as  PropBank  and   FrameNet.     •  Sen(ment  Analysis     •  Main  trends:   •  iden(fica(on  of  sen(ment-­‐bearing  features  or  iden(fica(on  of  representa(ve  features  for  the  problem   •  use  of  supervervised  learning  à  cf  the  results  of  NLTK  classifier   •  Word  sense  disambigua(on  (???)   •  Informa(on  extrac(on  (???)   •  Ques(on  Answering  (???)   •  Ontologies  (???)   Lecture  4:  Word  Senses  3  
  • 4. Reminder:  Glossary  Entries   •  Which  concepts  are  the  most  salient  in  Lect  3?   •  Update  your  Glossary…   Lecture  4:  Word  Senses  4  
  • 5. Previous  lecture:  end   5   Lecture  4:  Word  Senses  
  • 6. Word  Senses   •  Master  Students  à  NLP  course   •  Bachelor  Students  à  Seman(cs   6   Lecture  4:  Word  Senses  
  • 7. From  Lect  2:  PropBank  &  Selec$onal  Restric$ons   •  PropBank  is  organized  by  word   senses  :  word  senses  are  different   aspects  of  meaning  of  a  word   •  Selec(onal  restric(ons…  we  can  use   seman(c  constraints  to   disambiguate  senses.  Ex:  eat,   serve….  bear  me  some  pa(ence…   7   Lecture  4:  Word  Senses  
  • 8. Acknowledgements Most  slides  borrowed  from:   Dan  Jurafsky  and  James  H.  Mar(n   Some  slides  borrowed  from  D.  Jurafsky  and  C.  Manning    and  D.  Radev  (Coursera)     J&M(2015,  draf):  hgps://web.stanford.edu/~jurafsky/slp3/            
  • 9. Outline   •  Word  Meaning   •  WordNet  and  Other  Lexical  Resources   •  Selec(onal  Restric(ons   9   Lecture  4:  Word  Senses  
  • 10. Logic:  meaning  representa$on:  uppercase  words!   •  Constant   •  Variables   •  Predicates   •  Boolean  connec(ves   •  Quan(fiers   •  Brackets  and  comma  to  group  the  symbols  together   •  Ex:  A  woman  crosses  Sunset  Boulevard   Lecture  4:  Word  Senses  10   Formal  Seman(cs  
  • 11. Defini$ons   •  Lexical  seman$cs  is  the  study  of  the  meaning  of  words  and  the  systema(c   meaning-­‐related  connec(ons  between  words.       •  A  word  sense  is  the  locus  of  word  meaning;  defini(ons  and  meaning  rela(ons   are  defined  at  the  level  of  the  word  sense  rather  than  wordforms.     •  Homonymy  is  the  rela(on  between  unrelated  senses  that  share  a  form.     •  Polysemy  is  the  rela(on  between  related  senses  that  share  a  form.   •  Synonymy  holds  between  different  words  with  the  same  meaning.   •  Hyponymy  and  hypernymy  rela(ons  hold  between  words  that  are  in  a  class   inclusion  rela(onship.     •  Meronymy  type  of  hierarchy  that  deals  with  part–whole  rela(onships.   •  WordNet  is  a  large  database  of  lexical  rela(ons  for  English   11     Lecture  4:  Word  Senses  
  • 12. Word Meaning and Similarity Word  Senses  and   Word  Rela(ons  
  • 13. Reminder:  lemma  and  wordform   •  A  lemma  or  cita$on  form   •  Same  stem,  part  of  speech,  rough  seman(cs   •  A  wordform   •  The  “inflected”  word  as  it  appears  in  text   Wordform   Lemma   banks   bank   sung   sing   duermes   dormir   Lecture  4:  Word  Senses  13   Cf.  token/type  ra(o:   crude  measure  of   lexical  densi(y:       If a text is 1,000 words long, it is said to have 1,000 "tokens". But a lot of these words will be repeated, and there may be only say 400 different words in the text. "Types", therefore, are the different words. The ratio between types and tokens in this example would be 40%. (source: wordsmith tools)  
  • 14. Lemmas  have  senses   •  One  lemma  “bank”  can  have  many  meanings:   •  …a bank can hold the investments in a custodial account…! •  “…as agriculture burgeons on the east bank the river will shrink even more”   •  Sense  (or  word  sense)   •  A  discrete  representa(on                                        of  an  aspect  of  a  word’s  meaning.   •  The  lemma  bank  here  has  two  senses   1! 2! Sense  1:   Sense  2:   Lecture  4:  Word  Senses   14  
  • 15. Other  examples?   Lecture  4:  Word  Senses  15  
  • 16. Homonymy   Homonyms:  words  that  share  a  form  but  have   unrelated,  dis(nct  meanings:   •  bank1:  financial  ins(tu(on,        bank2:    sloping  land   •  bat1:  club  for  hiqng  a  ball,        bat2:    nocturnal  flying  mammal   1.  Homographs  (bank/bank,  bat/bat)   2.  Homophones:   1.  Write  and  right   2.  Piece  and  peace   Lecture  4:  Word  Senses  16  
  • 17. Homonymy  causes  problems  for  NLP   applica$ons   •  Informa(on  retrieval   •  “bat care”! •  Machine  Transla(on   •  bat:    murciélago    (animal)  or    bate  (for  baseball)   •  Text-­‐to-­‐Speech   •  bass  (stringed  instrument)  vs.  bass  (fish)   •  There  would  be  no  ambiguity  for  Speech  to  Text:  why?   Lecture  4:  Word  Senses  17  
  • 18. Polysemy   •  1.  The  bank  was  constructed  in  1875  out  of  local  red  brick.   •  2.  I  withdrew  the  money  from  the  bank     •  Are  those  the  same  sense?   •  Sense  2:  “A  financial  ins(tu(on”   •  Sense  1:  “The  building  belonging  to  a  financial  ins(tu(on”   •  A  polysemous  word  has  related  meanings   •  Most  non-­‐rare  words  have  mul(ple  meanings   Lecture  4:  Word  Senses  18  
  • 19. Polysemy   19   Lecture  4:  Word  Senses  
  • 20. •  Lots  of  types  of  polysemy  are  systema(c   •  School, university, hospital! •  All  can  mean  the  ins(tu(on  or  the  building.   •  A  systema(c  rela(onship:   •  Building                        Organiza(on   •  Other  such  kinds  of  systema(c  polysemy:     Author  (Jane Austen wrote Emma)            Works  of  Author  (I love Jane Austen)   Tree  (Plums have beautiful blossoms) ! !Fruit  (I ate a preserved plum)! Metonymy  or  Systema$c  Polysemy:     A  systema$c  rela$onship  between  senses   Lecture  4:  Word  Senses  20  
  • 21. How  do  we  know  when  a  word  has  more   than  one  sense?   Lecture  4:  Word  Senses  21  
  • 22. How  do  we  know  when  a  word  has  more   than  one  sense?   •  The  “zeugma”  test:  Two  senses  of  serve?   •  Which flights serve breakfast?! •  Does Lufthansa serve Philadelphia?! •  ?Does  Lufhansa  serve  breakfast  and  San  Jose?   •  Since  this  conjunc(on  sounds  weird,     •  we  say  that  these  are  two  different  senses  of  “serve”   Lecture  4:  Word  Senses  22  
  • 23. Synonyms   •  Word  that  have  the  same  meaning  in  some  or  all  contexts.   •  filbert  /  hazelnut   •  couch  /  sofa   •  big  /  large   •  automobile  /  car   •  vomit  /  throw  up   •  Water  /  H20   •  Two  lexemes  are  synonyms     •  if  they  can  be  subs(tuted  for  each  other  in  all  situa(ons   •  If  so  they  have  the  same  proposi$onal  meaning   Lecture  4:  Word  Senses  23  
  • 24. Synonyms   •  But  there  are  few  (or  no)  examples  of  perfect  synonymy.   •  Even  if  many  aspects  of  meaning  are  iden(cal   •  S(ll  may  not  preserve  the  acceptability  based  on  no(ons  of  politeness,   slang,  register,  genre,  etc.   •  Example:   •  Water/H20   •  Big/large   •  Brave/courageous        high  brow:  la(nate  words   Lecture  4:  Word  Senses  24  
  • 25. Synonymy  is  a  rela$on     between  senses  rather  than  words   •  Consider  the  words  big  and  large   •  Are  they  synonyms?   •  How  big  is  that  plane?   •  Would  I  be  flying  on  a  large  or  small  plane?   •  How  about  here:   •  Miss  Nelson  became  a  kind  of  big  sister  to  Benjamin.   •  ?Miss  Nelson  became  a  kind  of  large  sister  to  Benjamin.   •  Why?   •  big  has  a  sense  that  means  being  older,  or  grown  up   •  large  lacks  this  sense   Lecture  4:  Word  Senses  25  
  • 26. Synonymy:  Summary   26   Lecture  4:  Word  Senses  
  • 27. Other  seman$c  rela$ons…   27   Lecture  4:  Word  Senses  
  • 28. Antonyms   •  Senses  that  are  opposites  with  respect  to  one  feature  of  meaning   •  Otherwise,  they  are  very  similar!   dark/light short/long !fast/slow !rise/fall! hot/cold! up/down! in/out! •  More  formally:  antonyms  can   •  define  a  binary  opposi(on    or  be  at  opposite  ends  of  a  scale   •   long/short, fast/slow! •  Be  reversives:   •  rise/fall, up/down! Lecture  4:  Word  Senses   28  
  • 29. Hyponymy  and  Hypernymy   •  One  sense  is  a  hyponym  of  another  if  the  first  sense  is  more   specific,  deno(ng  a  subclass  of  the  other   •  car  is  a  hyponym  of  vehicle   •  mango  is  a  hyponym  of  fruit   •  Conversely  hypernym/superordinate  (“hyper  is  super”)   •  vehicle  is  a  hypernym    of  car   •  fruit  is  a  hypernym  of  mango   Superordinate/hyper vehicle fruit furniture Subordinate/hyponym car mango chair Lecture  4:  Word  Senses  29  
  • 30. Hyponymy  more  formally   •  Extensional:   •  The  class  denoted  by  the  superordinate  extensionally  includes  the  class   denoted  by  the  hyponym   •  Entailment:   •  A  sense  A  is  a  hyponym  of  sense  B  if  being  an  A  entails  being  a  B   •  Hyponymy  is  usually  transi(ve     •  (A  hypo  B  and  B  hypo  C  entails  A  hypo  C)   •  Another  name:  the  IS-­‐A  hierarchy   •  A  IS-­‐A  B            (or  A  ISA  B)   •  B  subsumes  A   Lecture  4:  Word  Senses  30  
  • 31. Hyponyms  and  Instances   •  WordNet  has  both  classes  and  instances.   •  An  instance  is  an  individual,  a  proper  noun  that  is  a  unique  en(ty   •  San Francisco is  an  instance  of  city! •  But  city  is  a  class   •  city  is  a  hyponym  of        municipality...location...! 31   Lecture  4:  Word  Senses  
  • 32. WordNet  and  other   Online  Thesauri  
  • 33. Applica$ons  of  Thesauri  and  Ontologies   •  Informa(on  Extrac(on   •  Informa(on  Retrieval   •  Ques(on  Answering   •  Bioinforma(cs  and  Medical  Informa(cs   •  Machine  Transla(on   Lecture  4:  Word  Senses  33  
  • 34. WordNet   34   Lecture  4:  Word  Senses  
  • 35. Synsets   35   Lecture  4:  Word  Senses  
  • 36. How  is  “sense”  defined  in  WordNet?   •  The  synset  (synonym  set),  the  set  of  near-­‐synonyms,   instan(ates  a  sense  or  concept,  with  a  gloss   •  Example:  chump  as  a  noun  with  the  gloss:   “a  person  who  is  gullible  and  easy  to  take  advantage  of”   •  This  sense  of  “chump”  is  shared  by  9  words:   chump1, fool2, gull1, mark9, patsy1, fall guy1, sucker1, soft touch1, mug2! •  Each  of  these  senses  have  this  same  gloss   •  (Not  every  sense;  sense  2  of  gull  is  the  aqua(c  bird)     Lecture  4:  Word  Senses   36   gullible=naive  
  • 37. Tree-­‐like  Structure   37   Lecture  4:  Word  Senses  
  • 38. WordNet:  bar  1/6   38   Lecture  4:  Word  Senses  
  • 39. 2/6   39   Lecture  4:  Word  Senses  
  • 40. 3/6   40   Lecture  4:  Word  Senses  
  • 41. 4/6   41   Lecture  4:  Word  Senses  
  • 42. 5/6   42   Lecture  4:  Word  Senses  
  • 43. 6/6   43   Lecture  4:  Word  Senses  
  • 44. Polysemy   44   Lecture  4:  Word  Senses  
  • 45. WordNet  3.0   •  A  hierarchically  organized  lexical  database   •  On-­‐line  thesaurus  +  aspects  of  a  dic(onary   •  Some  other  languages  available  or  under  development   •  (Arabic,  Finnish,  German,  Portuguese…)   Category   Unique  Strings   Noun   117,798   Verb   11,529   Adjec(ve   22,479   Adverb   4,481   Lecture  4:  Word  Senses   45  
  • 46. Senses  of  “bass”  in  Wordnet   Lecture  4:  Word  Senses  46  
  • 47. WordNet  Hypernym  Hierarchy  for  “bass”   Lecture  4:  Word  Senses  47  
  • 48. WordNet  Noun  Rela$ons   Lecture  4:  Word  Senses  48  
  • 49. WordNet  3.0   •  Where  it  is:   •  hgp://wordnetweb.princeton.edu/perl/webwn   •  Libraries   •  Python:    WordNet    from  NLTK   •  hgp://www.nltk.org/Home   •  Java:   •  JWNL,  extJWNL  on  sourceforge  Lecture  4:  Word  Senses  
  • 50. Synset •  MeSH  (Medical  Subject  Headings)   •  177,000  entry  terms    that  correspond  to  26,142  biomedical   “headings”   •  Hemoglobins   Entry  Terms:    Eryhem,  Ferrous  Hemoglobin,  Hemoglobin   Defini$on:    The  oxygen-­‐carrying  proteins  of  ERYTHROCYTES.   They  are  found  in  all  vertebrates  and  some  invertebrates.   The  number  of  globin  subunits  in  the  hemoglobin  quaternary   structure  differs  between  species.  Structures  range  from   monomeric  to  a  variety  of  mul(meric  arrangements   MeSH:  Medical  Subject  Headings   thesaurus  from  the  Na$onal  Library  of  Medicine   Lecture  4:  Word  Senses  50  
  • 51. The  MeSH  Hierarchy   •  a   51   Lecture  4:  Word  Senses  
  • 52. Uses  of  the  MeSH  Ontology   •  Provide  synonyms  (“entry  terms”)   •  E.g.,  glucose  and  dextrose   •  Provide  hypernyms  (from  the  hierarchy)   •  E.g.,  glucose  ISA  monosaccharide   •  Indexing  in  MEDLINE/PubMED  database   •  NLM’s  bibliographic  database:     •  20  million  journal  ar(cles   •  Each  ar(cle  hand-­‐assigned  10-­‐20  MeSH  terms   Lecture  4:  Word  Senses  52  
  • 53. Other  resources   53   Lecture  4:  Word  Senses  
  • 54. BabelNet   54   Lecture  4:  Word  Senses  
  • 56. Selec$onal  Restric$ons   Consider  the  two  interpreta(ons  of:    I  want  to  eat  someplace  nearby.     a)  sensible:     Eat  is  intransi(ve  and  “someplace  nearby”  is  a  loca(on  adjunct   b)  Speaker  is  Godzilla:  a  monster  that  likes  ea(ng  buildings!!!            Eat  is  transi(ve  and  “someplace  nearby”  is  a  direct  object   How  do  we  know  speaker  didn’t  mean  b)    ?    Because  the  THEME  of  ea(ng  tends  to  be  something  edible   56   Lecture  4:  Word  Senses  
  • 57. Selec$onal  restric$ons  are  associated  with  senses   •  The  restaurant  serves  green-­‐lipped  mussels.     •  THEME  is  some  kind  of  food   •  Which  airlines  serve  Denver?     •  THEME  is  an  appropriate  loca(on   57   Lecture  4:  Word  Senses   apply  zeugma  test  
  • 58. Represen$ng  selec$onal  restric$ons   t consists of a single variable that stands for the event, a predicat f event, and variables and relations for the event roles. Ignoring t ctures and using thematic roles rather than deep event roles, the on of a verb like eat might look like the following: 9e,x,y Eating(e)^Agent(e,x)^Theme(e,y) representation, all we know about y, the filler of the THEME ro iated with an Eating event through the Theme relation. To sti l restriction that y must be something edible, we simply add a ne : 9e,x,y Eating(e)^Agent(e,x)^Theme(e,y)^EdibleThing(y)58   ntribution of a verb like eat might look like the following: 9e,x,y Eating(e)^Agent(e,x)^Theme(e,y) th this representation, all we know about y, the filler of the THEME role, is th s associated with an Eating event through the Theme relation. To stipulate t ectional restriction that y must be something edible, we simply add a new term t effect: 9e,x,y Eating(e)^Agent(e,x)^Theme(e,y)^EdibleThing(y) => dish => nutriment, nourishment, nutrition... => food, nutrient => substance => matter => physical entity => entity Figure 22.6 Evidence from WordNet that hamburgers are edible. When a phrase like ate a hamburger is encountered, a semantic analyzer can form the following kind of representation: 9e,x,y Eating(e)^Eater(e,x)^Theme(e,y)^EdibleThing(y)^Hamburger(y) This representation is perfectly reasonable since the membership of y in the category Hamburger is consistent with its membership in the category EdibleThing, assuming Instead  of  represen(ng  “eat”  as:   Just  add:   And  “eat  a  hamburger”  becomes   But  this  assumes  we  have  a  large  knowledge  base  of  facts   about  edible  things  and  hamburgers  and  whatnot.  
  • 59. Let’s  use  WordNet  synsets  to  specify   selec$onal  restric$ons   •  The  THEME  of  eat  must  be  WordNet  synset  {food,  nutrient}     “any  substance  that  can  be  metabolized  by  an  animal  to  give  energy  and  build  8ssue”   •  Similarly   THEME  of  imagine:  synset  {en(ty}   THEME  of  li=:  synset  {physical  en(ty}   THEME  of  diagonalize:  synset  {matrix}     •  This  allows   imagine  a  hamburger    and    li=  a  hamburger,     •  Correctly  rules  out     diagonalize  a  hamburger.    59   Lecture  4:  Word  Senses