SlideShare ist ein Scribd-Unternehmen logo
1 von 69
Downloaden Sie, um offline zu lesen
Alexander Panchenko
Inducing Interpretable Word
Senses for WSD and Enrichment of
Lexical Resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 2/54
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54
Inducing word sense representations:
word sense embeddings via retrofitting
[Pelevina et al., 2016, Remus & Biemann, 2018];
inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a,
Ustalov et al., 2018]
inducing semantic classes [Panchenko et al., 2018]
Overview
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54
Inducing word sense representations:
word sense embeddings via retrofitting
[Pelevina et al., 2016, Remus & Biemann, 2018];
inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a,
Ustalov et al., 2018]
inducing semantic classes [Panchenko et al., 2018]
Making induced senses interpretable
[Panchenko et al., 2017b, Panchenko et al., 2017c]
Overview
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54
Inducing word sense representations:
word sense embeddings via retrofitting
[Pelevina et al., 2016, Remus & Biemann, 2018];
inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a,
Ustalov et al., 2018]
inducing semantic classes [Panchenko et al., 2018]
Making induced senses interpretable
[Panchenko et al., 2017b, Panchenko et al., 2017c]
Linking induced word senses to lexical
resources [Panchenko, 2016, Faralli et al., 2016,
Panchenko et al., 2017a, Biemann et al., 2018]
Overview
Overview
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 4/54
Inducing word sense representations
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 5/54
Inducing word sense representations
Word vs sense embeddings
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 6/54
Inducing word sense representations
Word vs sense embeddings
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 7/54
Inducing word sense representations
Related work
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 8/54
AutoExtend [Rothe & Schütze, 2015]
* image is reproduced from the original paper
Inducing word sense representations
Related work: knowledge-based
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54
Adagram [Bartunov et al., 2016]
Multiple vector representations θ for each word:
Inducing word sense representations
Related work: knowledge-free
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54
Adagram [Bartunov et al., 2016]
Multiple vector representations θ for each word:
p(Y, Z, β|X, α, θ) =
V∏
w=1
∞∏
k=1
p(βwk|α)
N∏
i=1
[p(zi|xi, β)
C∏
j=1
p(yij|zi, xi, θ)],
zi – a hidden variable: a sense index of word xi in context C;
α – a meta-parameter controlling number of senses.
Inducing word sense representations
Related work: knowledge-free
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54
Adagram [Bartunov et al., 2016]
Multiple vector representations θ for each word:
p(Y, Z, β|X, α, θ) =
V∏
w=1
∞∏
k=1
p(βwk|α)
N∏
i=1
[p(zi|xi, β)
C∏
j=1
p(yij|zi, xi, θ)],
zi – a hidden variable: a sense index of word xi in context C;
α – a meta-parameter controlling number of senses.
See also: [Neelakantan et al., 2014] and [Li and Jurafsky, 2015]
Inducing word sense representations
Related work: knowledge-free
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 10/54
Word sense induction (WSI) based on graph clustering:
[Lin, 1998]
[Pantel and Lin, 2002]
[Widdows and Dorow, 2002]
Chinese Whispers [Biemann, 2006]
[Hope and Keller, 2013]
Inducing word sense representations
Related work: word sense induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 11/54
* source of the image: http://ic.pics.livejournal.com/blagin_anton/33716210/2701748/2701748_800.jpg
Inducing word sense representations
Related work: Chinese Whispers#1
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 12/54
Inducing word sense representations
Related work: Chinese Whispers#2
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 13/54
Inducing word sense representations
Related work: Chinese Whispers#2
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 14/54
Inducing word sense representations
Related work: Chinese Whispers#2
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 15/54
RepL4NLP@ACL’16 [Pelevina et al., 2016], LREC’18 [Remus & Biemann, 2018]
Prior methods:
Induce inventory by clustering of word instances
Use existing sense inventories
Our method:
Input: word embeddings
Output: word sense embeddings
Word sense induction by clustering of word ego-networks
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 16/54
From word embeddings to sense embeddings
Calculate Word
Similarity Graph
Learning Word Vectors
Word Sense Induction
Text Corpus
Word Vectors
Word Similarity Graph
Pooling of Word Vectors
Sense Inventory
Sense Vectors
1 2
4 3
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 17/54
Word sense induction using ego-network clustering
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54
Neighbours of Word and Sense Vectors
Vector Nearest Neighbors
table tray, bottom, diagram, bucket, brackets, stack, basket,
list, parenthesis, cup, saucer, pile, playfield, bracket,
pot, drop-down, cue, plate
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54
Neighbours of Word and Sense Vectors
Vector Nearest Neighbors
table tray, bottom, diagram, bucket, brackets, stack, basket,
list, parenthesis, cup, saucer, pile, playfield, bracket,
pot, drop-down, cue, plate
table#0 leftmost#0, column#1, tableau#1, indent#1, bracket#3,
pointer#0, footer#1, cursor#1, diagram#0, grid#0
table#1 pile#1, stool#1, tray#0, basket#0, bowl#1, bucket#0,
box#0, cage#0, saucer#3, mirror#1, pan#1, lid#0
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 19/54
Word Sense Disambiguation
1 Context extraction: use context words around the target
word
2 Context filtering: based on context word’s relevance for
disambiguation
3 Sense choice in context: maximise similarity between a
context vector and a sense vector
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 20/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 21/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 22/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 23/54
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54
Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]:
comparable to SOTA, incl. Adagram sense embeddings.
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54
Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]:
comparable to SOTA, incl. Adagram sense embeddings.
Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]:
autoextend
adagram
SGNS
.
glove
.
sympat
.
LSAbow
.
LSAhal
.
paragramSL
.
SimLex999 0.45 0.29 0.44 0.37 0.54 0.30 0.27 0.68
MEN 0.72 0.67 0.77 0.73 0.53 0.67 0.71 0.77
SimVerb 0.43 0.27 0.36 0.23 0.37 0.15 0.19 0.53
WordSim353 0.58 0.61 0.70 0.61 0.47 0.67 0.59 0.72
SimLex999-N 0.44 0.33 0.45 0.39 0.48 0.32 0.34 0.68
MEN-N 0.72 0.68 0.77 ____ 0.76 ____ 0.57 ____ 0.71 ____ 0.73 ____ 0.78 ____
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 25/54
Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]:
comparable to SOTA, incl. sense embeddings.
Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]:
autoextend
adagram
SGNS
SGNS+senses
glove
glove+senses
sympat
sympat+senses
LSAbow
LSAbow+senses
LSAhal
LSAhal+senses
paragramSL
paragramSL+senses
SimLex999 0.45 0.29 0.44 0.46 0.37 0.41 0.54 0.55 0.30 0.39 0.27 0.38 0.68 0.64
MEN 0.72 0.67 0.77 0.78 0.73 0.77 0.53 0.68 0.67 0.70 0.71 0.74 0.77 0.80
SimVerb 0.43 0.27 0.36 0.39 0.23 0.30 0.37 0.45 0.15 0.22 0.19 0.28 0.53 0.53
WordSim353 0.58 0.61 0.70 0.69 0.61 0.65 0.47 0.62 0.67 0.66 0.59 0.63 0.72 0.73
SimLex999-N 0.44 0.33 0.45 0.50 0.39 0.47 0.48 0.55 0.32 0.46 0.34 0.44 0.68 0.66
MEN-N 0.72 0.68 0.77 0.79 0.76 0.80 0.57 0.74 0.71 0.73 0.73 0.76 0.78 0.81
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 26/54
Word and sense embeddings
of words iron and vitamin.
LREC’18 [Remus & Biemann, 2018]
Inducing word sense representations
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 27/54
ACL’17 [Ustalov et al., 2017b]
Examples of extracted synsets:
Size Synset
2 {decimal point, dot}
3 {gullet, throat, food pipe}
4 {microwave meal, ready meal, TV dinner, frozen dinner}
5 {objective case, accusative case, oblique case, object
case, accusative}
6 {radiotheater, dramatizedaudiobook, audiotheater, ra-
dio play, radio drama, audio play}
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 28/54
Outline of the ’Watset’ method:
Background Corpus
Synonymy Dictionary
Learning

Word
Embeddings
Graph
Construction

Synsets
Word
Similarities
Ambiguous
Weighted Graph
Local
Clustering:
Word Sense Induction
Global
Clustering:

Synset Induction
Sense Inventory
Disambiguation
of

Neighbors

Disambiguated
Weighted Graph
Local-Global
Fuzzy
Graph
Clustering
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 29/54
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 30/54
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 31/54
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
WordNet (English)
F−score
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
BabelNet (English)
F−score
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
WordNet (English)
F−score
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
BabelNet (English)
F−score
CW MCL MaxMax ECO CPM Watset
0.00
0.05
0.10
0.15
0.20
RuWordNet (Russian)
F−score
CW MCL MaxMax ECO CPM Watset
0.0
0.1
0.2
0.3
0.4
YARN (Russian)
F−score
Inducing word sense representations
Synset induction
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 33/54
Examples of semantic classes:
ID Sense Cluster Hypernyms
1 peach#1, banana#1, pineapple#0, berry#0,
blackberry#0, grapefruit#0, strawberry#0, blue-
berry#0, grape#0, melon#0, orange#0, pear#0,
plum#0, raspberry#0, watermelon#0, apple#0,
apricot#0, …
fruit#0, crop#0, ingredi-
ent#0, food#0, ·
2 C#4, Basic#2, Haskell#5, Flash#1, Java#1, Pas-
cal#0, Ruby#6, PHP#0, Ada#1, Oracle#3, Python#3,
Apache#3, Visual Basic#1, ASP#2, Delphi#2, SQL
Server#0, CSS#0, AJAX#0, the Java#0, …
programming language#3,
technology#0, language#0,
format#2, app#0
Inducing word sense representations
Induction of semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 34/54
Text
Corpus
Representing
Senses

with
Ego
Networks
Semantic
Classes
Word
Sense
Induction

from
Text
Corpus
Sense
Graph

Construction
Clustering
of

Word
Senes
Labeling
Sense
Clusters

with
Hypernyms

Induced Word Senses Sense Ego-Networks Global Sense Graph
s
Noisy
Hypernyms
Cleansed
Hypernyms
Induction
of
Semantic
Classes
Global Sense Clusters
Inducing word sense representations
Induction of semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 35/54
Filtering noisy hypernyms with semantic classes
LREC’18 [Panchenko et al., 2018]:
fruit#1
food#0

apple#2 mango#0 pear#0
Hypernyms,
Sense Cluster,
mangosteen#0
city#2
Removed
Wrong
Added
Missing
Inducing word sense representations
Induction of sense semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 36/54
Filtering of a noisy hypernymy database with semantic classes.
LREC’18 [Panchenko et al., 2018]
Precision Recall F-score
Original Hypernyms (Seitner et al., 2016) 0.475 0.546 0.508
Semantic Classes (coarse-grained) 0.541 0.679 0.602
Inducing word sense representations
Induction of sense semantic classes
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 37/54
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 38/54
Knowledge-based sense representations are interpretable
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 39/54
Most knowledge-free sense representations are uninterpretable
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 40/54
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 41/54
Hypernymy prediction in context. EMNLP’17 [Panchenko et al., 2017b]
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54
11.702 sentences, 863 words with avg.polysemy of 3.1.
WSD Model Accuracy
Inventory Features Hypers HyperHypers
Word Senses Random 0.257 0.610
Word Senses MFS 0.292 0.682
Word Senses Cluster Words 0.291 0.650
Word Senses Context Words 0.308 0.686
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54
11.702 sentences, 863 words with avg.polysemy of 3.1.
WSD Model Accuracy
Inventory Features Hypers HyperHypers
Word Senses Random 0.257 0.610
Word Senses MFS 0.292 0.682
Word Senses Cluster Words 0.291 0.650
Word Senses Context Words 0.308 0.686
Super Senses Random 0.001 0.001
Super Senses MFS 0.001 0.001
Super Senses Cluster Words 0.174 0.365
Super Senses Context Words 0.086 0.188
Making induced senses interpretable
Making induced senses interpretable
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 43/54
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 44/54
Text Corpus
Linking
Induced
Senses

to

Senses
of
the
LR


Induce
a
Graph
of
Sem.
Related
Words
Enriched
Lexical Resource
Graph of Related Words
Word
Sense

Induction

Labeling
Senses

with
Hypernyms

Disambiguation

of
Neighbours

Typing
of
the
Unmapped

Induced
Senses



Word Sense Inventory Labeled Word Senses
PCZ
Construction
of
Proto-Conceptualization (PCZ)
Linking
Proto-Conceptualization
to
Lexical
Resource

Part. Linked Senses to the LR
Lexical Resource (LR):
WordNet, BabelNet, ...
Construction
of
sense

feature
representations

Graph of Related Senses
LREC’16 [Panchenko, 2016], ISWC’16 [Faralli et al., 2016],
SENSE@EACL’17 [Panchenko et al., 2017a],
NLE’18 [Biemann et al., 2018]
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 45/54
Word AdaGram BabelNet AdaGram BoW BabelNet BoW
python 2 bn:01713224n perl, php, java, smalltalk, ruby,
lua, tcl, scripting, javascript,
bindings, binding, programming,
coldfusion, actionscript, net, . . .
language, programming, python-
ista, python programming,
python3, python2, level, com-
puter, pythonistas, python3000,
python 1 bn:01157670n monty, circus, spamalot, python,
magoo, muppet, snoopy, fea-
turette, disney, tunes, tune, clas-
sic, shorts, short, apocalypse, . . .
monty, comedy, monty python,
british, monte, monte python,
troupe, pythonesque, foot, artist,
record, surreal, terry, . . .
python 3 bn:00046456n spectacled, unicornis, snake, gi-
ant, caiman, leopard, squirrel,
crocodile, horned, cat, mole, ele-
phant, opossum, pheasant, . . .
molurus, indian, boa, tigris,
tiger python, rock, tiger, indian
python, reptile, python molurus,
indian rock python, coluber, . . .
python 4 bn:01157670n circus, fly, flying, dusk, lizard,
moth, unicorn, puff, adder, vul-
ture, tyrannosaurus, zephyr, bad-
ger, . . .
monty, comedy, monty python,
british, monte, monte python,
troupe, pythonesque, foot, artist,
record, surreal, terry, . . .
python 1 bn:00473212n monty, circus, spamalot, python,
magoo, muppet, snoopy, fea-
turette, disney, tunes, tune, clas-
sic, shorts, short, apocalypse, . . .
pictures, monty, python monty
pictures, limited, company,
python pictures limited, king-
dom, picture, serve, director,
. . .
python 1 bn:03489893n monty, circus, spamalot, python,
magoo, muppet, snoopy, fea-
turette, disney, tunes, tune, clas-
sic, shorts, short, apocalypse, . . .
film, horror, movie, clabaugh,
richard, monster, century, direct,
snake, python movie, television,
giant, natural, language, for-tv,
. . .
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 46/54
Model Representation of the Sense ”disk (medium)”
WordNet memory, device, floppy, disk, hard, disk, disk, computer, science, computing, diskette, fixed, disk,
floppy, magnetic, disc, magnetic, disk, hard, disc, storage, device
WordNet + Linked recorder, disk, floppy, console, diskette, handset, desktop, iPhone, iPod, HDTV, kit, RAM, Discs, Blu-
ray, computer, GB, microchip, site, cartridge, printer, tv, VCR, Disc, player, LCD, software, component,
camcorder, cellphone, card, monitor, display, burner, Web, stereo, internet, model, iTunes, turntable,
chip, cable, camera, iphone, notebook, device, server, surface, wafer, page, drive, laptop, screen,
pc, television, hardware, YouTube, dvr, DVD, product, folder, VCR, radio, phone, circuitry, partition,
megabyte, peripheral, format, machine, tuner, website, merchandise, equipment, gb, discs, MP3,
hard-drive, piece, video, storage device, memory device, microphone, hd, EP, content, soundtrack,
webcam, system, blade, graphic, microprocessor, collection, document, programming, battery, key-
board, HD, handheld, CDs, reel, web, material, hard-disk, ep, chart, debut, configuration, recording,
album, broadcast, download, fixed disk, planet, pda, microfilm, iPod, videotape, text, cylinder, cpu,
canvas, label, sampler, workstation, electrode, magnetic disc, catheter, magnetic disk, Video, mo-
bile, cd, song, modem, mouse, tube, set, ipad, signal, substrate, vinyl, music, clip, pad, audio, com-
pilation, memory, message, reissue, ram, CD, subsystem, hdd, touchscreen, electronics, demo, shell,
sensor, file, shelf, processor, cassette, extra, mainframe, motherboard, floppy disk, lp, tape, version,
kilobyte, pacemaker, browser, Playstation, pager, module, cache, DVD, movie, Windows, cd-rom, e-
book, valve, directory, harddrive, smartphone, audiotape, technology, hard disk, show, computing,
computer science, Blu-Ray, blu-ray, HDD, HD-DVD, scanner, hard disc, gadget, booklet, copier, play-
back, TiVo, controller, filter, DVDs, gigabyte, paper, mp3, CPU, dvd-r, pipe, cd-r, playlist, slot, VHS, film,
videocassette, interface, adapter, database, manual, book, channel, changer, storage
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 47/54
Evaluation of linking accuracy:
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 48/54
Evaluation of enriched representations based on WSD:
Linking induced senses to resources
Linking induced senses to resources
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 49/54
Conclusion
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 50/54
Conclusion
Vectors + Graphs = ♡
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54
We can induce word senses, synsets and semantic classes in
a knowledge-free way using graph clustering and
distributional models.
Conclusion
Take home messages
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54
We can induce word senses, synsets and semantic classes in
a knowledge-free way using graph clustering and
distributional models.
We can make the induced word senses interpretable in a
knowledge-free way with hypernyms, images, definitions.
Conclusion
Take home messages
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54
We can induce word senses, synsets and semantic classes in
a knowledge-free way using graph clustering and
distributional models.
We can make the induced word senses interpretable in a
knowledge-free way with hypernyms, images, definitions.
We can link induced senses to lexical resources to
improve performance of WSD;
enrich lexical resources with emerging senses.
Conclusion
Take home messages
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 52/54
Participate in an ACL SIGSLAV sponsored shared task on
word sense induction and disambiguation for Russian!
More details: http://russe.nlpub.org/2018/wsi
Conclusion
An ongoing shared task on WSI&D
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 53/54
Thank you! Questions?
This research was supported by
Conclusion
Acknowledgments
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Evaluation on SemEval 2013 Task 13 WSI&D:
Model Jacc. Tau WNDCG F.NMI F.B-Cubed
AI-KU (add1000) 0.176 0.609 0.205 0.033 0.317
AI-KU 0.176 0.619 0.393 0.066 0.382
AI-KU (remove5-add1000) 0.228 0.654 0.330 0.040 0.463
Unimelb (5p) 0.198 0.623 0.374 0.056 0.475
Unimelb (50k) 0.198 0.633 0.384 0.060 0.494
UoS (#WN senses) 0.171 0.600 0.298 0.046 0.186
UoS (top-3) 0.220 0.637 0.370 0.044 0.451
La Sapienza (1) 0.131 0.544 0.332 – –
La Sapienza (2) 0.131 0.535 0.394 – –
AdaGram, α = 0.05, 100 dim 0.274 0.644 0.318 0.058 0.470
w2v 0.197 0.615 0.291 0.011 0.615
w2v (nouns) 0.179 0.626 0.304 0.011 0.623
JBT 0.205 0.624 0.291 0.017 0.598
JBT (nouns) 0.198 0.643 0.310 0.031 0.595
TWSI (nouns) 0.215 0.651 0.318 0.030 0.573
Conclusion
Sense embeddings using retrofitting
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Bartunov, S., Kondrashkin, D., Osokin, A., & Vetrov, D. (2016).
Breaking sticks and ambiguities with adaptive skip-gram.
In Artificial Intelligence and Statistics (pp. 130–138).
Biemann, C., Faralli, S., Panchenko, A., & Ponzetto, S. P. (2018).
A framework for enriching lexical semantic resources with
distributional semantics.
In Journal of Natural Language Engineering (pp. 56–64).:
Cambridge Press.
Faralli, S., Panchenko, A., Biemann, C., & Ponzetto, S. P. (2016).
Linked disambiguated distributional semantic networks.
In International Semantic Web Conference (pp. 56–64).:
Springer.
Panchenko, A. (2016).
Best of both worlds: Making word sense embeddings
interpretable.
In LREC.
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Panchenko, A., Faralli, S., Ponzetto, S. P., & Biemann, C.
(2017a).
Using linked disambiguated distributional networks for word
sense disambiguation.
In Proceedings of the 1st Workshop on Sense, Concept and
Entity Representations and their Applications (pp. 72–78).
Valencia, Spain: Association for Computational Linguistics.
Panchenko, A., Marten, F., Ruppert, E., Faralli, S., Ustalov, D.,
Ponzetto, S. P., & Biemann, C. (2017b).
Unsupervised, knowledge-free, and interpretable word sense
disambiguation.
In Proceedings of the 2017 Conference on Empirical Methods in
Natural Language Processing: System Demonstrations (pp.
91–96). Copenhagen, Denmark: Association for
Computational Linguistics.
Panchenko, A., Ruppert, E., Faralli, S., Ponzetto, S. P., &
Biemann, C. (2017c).
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Unsupervised does not mean uninterpretable: The case for
word sense induction and disambiguation.
In Proceedings of the 15th Conference of the European Chapter
of the Association for Computational Linguistics: Volume 1,
Long Papers (pp. 86–98). Valencia, Spain: Association for
Computational Linguistics.
Panchenko, A., Ustalov, D., Faralli, S., Ponzetto, S. P., &
Biemann, C. (2018).
Improving hypernymy extraction with distributional
semantic classes.
In Proceedings of the LREC 2018 Miyazaki, Japan: European
Language Resources Association.
Pelevina, M., Arefiev, N., Biemann, C., & Panchenko, A. (2016).
Making sense of word embeddings.
In Proceedings of the 1st Workshop on Representation
Learning for NLP (pp. 174–183). Berlin, Germany: Association
for Computational Linguistics.
Remus, S. & Biemann, C. (2018).
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Retrofittingword representations for unsupervised sense
aware word similarities.
In Proceedings of the LREC 2018 Miyazaki, Japan: European
Language Resources Association.
Rothe, S. & Schütze, H. (2015).
Autoextend: Extending word embeddings to embeddings for
synsets and lexemes.
In Proceedings of the 53rd Annual Meeting of the Association
for Computational Linguistics and the 7th International Joint
Conference on Natural Language Processing (Volume 1: Long
Papers) (pp. 1793–1803). Beijing, China: Association for
Computational Linguistics.
Ustalov, D., Chernoskutov, M., Biemann, C., & Panchenko, A.
(2017a).
Fighting with the sparsity of synonymy dictionaries for
automatic synset induction.
In International Conference on Analysis of Images, Social
Networks and Texts (pp. 94–105).: Springer.
Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54
Ustalov, D., Panchenko, A., & Biemann, C. (2017b).
Watset: Automatic induction of synsets from a graph of
synonyms.
In Proceedings of the 55th Annual Meeting of the Association
for Computational Linguistics (Volume 1: Long Papers) (pp.
1579–1590). Vancouver, Canada: Association for
Computational Linguistics.
Ustalov, D., Teslenko, D., Panchenko, A., Chernoskutov, M., &
Biemann, C. (2018).
Word sense disambiguation based on automatically induced
synsets.
In LREC 2018, 11th International Conference on Language
Resources and Evaluation : 7-12 May 2018, Miyazaki (Japan)
(pp. tba). Paris: European Language Resources Association,
ELRA-ELDA.
Accepted for publication.

Weitere ähnliche Inhalte

Mehr von Alexander Panchenko

IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...Alexander Panchenko
 
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...Alexander Panchenko
 
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...Alexander Panchenko
 
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense DisambiguationUsing Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense DisambiguationAlexander Panchenko
 
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...Alexander Panchenko
 
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...Alexander Panchenko
 
Getting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part IIGetting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part IIAlexander Panchenko
 
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...Alexander Panchenko
 
Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataAlexander Panchenko
 
Неологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукНеологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукAlexander Panchenko
 
Sentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking FacebookSentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking FacebookAlexander Panchenko
 
Similarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation ExtractionSimilarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation ExtractionAlexander Panchenko
 
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...Alexander Panchenko
 
Detecting Gender by Full Name: Experiments with the Russian Language
Detecting Gender by Full Name:  Experiments with the Russian LanguageDetecting Gender by Full Name:  Experiments with the Russian Language
Detecting Gender by Full Name: Experiments with the Russian LanguageAlexander Panchenko
 
Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...Alexander Panchenko
 
Semantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionSemantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionAlexander Panchenko
 

Mehr von Alexander Panchenko (18)

IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
 
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
 
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
 
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense DisambiguationUsing Linked Disambiguated Distributional Networks for Word Sense Disambiguation
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
 
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction...
 
Making Sense of Word Embeddings
Making Sense of Word EmbeddingsMaking Sense of Word Embeddings
Making Sense of Word Embeddings
 
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
Noun Sense Induction and Disambiguation using Graph-Based Distributional Sema...
 
Getting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part IIGetting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part II
 
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
 
Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK Data
 
Неологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукНеологизмы в социальной сети Фейсбук
Неологизмы в социальной сети Фейсбук
 
Sentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking FacebookSentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking Facebook
 
Similarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation ExtractionSimilarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation Extraction
 
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
 
Detecting Gender by Full Name: Experiments with the Russian Language
Detecting Gender by Full Name:  Experiments with the Russian LanguageDetecting Gender by Full Name:  Experiments with the Russian Language
Detecting Gender by Full Name: Experiments with the Russian Language
 
Document
DocumentDocument
Document
 
Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...
 
Semantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionSemantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation Extraction
 

Kürzlich hochgeladen

PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfTukamushabaBismark
 

Kürzlich hochgeladen (20)

PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
chemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdfchemical bonding Essentials of Physical Chemistry2.pdf
chemical bonding Essentials of Physical Chemistry2.pdf
 

Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources

  • 1. Alexander Panchenko Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources
  • 2. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 2/54 Overview
  • 3. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54 Inducing word sense representations: word sense embeddings via retrofitting [Pelevina et al., 2016, Remus & Biemann, 2018]; inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a, Ustalov et al., 2018] inducing semantic classes [Panchenko et al., 2018] Overview Overview
  • 4. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54 Inducing word sense representations: word sense embeddings via retrofitting [Pelevina et al., 2016, Remus & Biemann, 2018]; inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a, Ustalov et al., 2018] inducing semantic classes [Panchenko et al., 2018] Making induced senses interpretable [Panchenko et al., 2017b, Panchenko et al., 2017c] Overview Overview
  • 5. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 3/54 Inducing word sense representations: word sense embeddings via retrofitting [Pelevina et al., 2016, Remus & Biemann, 2018]; inducing synsets [Ustalov et al., 2017b, Ustalov et al., 2017a, Ustalov et al., 2018] inducing semantic classes [Panchenko et al., 2018] Making induced senses interpretable [Panchenko et al., 2017b, Panchenko et al., 2017c] Linking induced word senses to lexical resources [Panchenko, 2016, Faralli et al., 2016, Panchenko et al., 2017a, Biemann et al., 2018] Overview Overview
  • 6. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 4/54 Inducing word sense representations
  • 7. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 5/54 Inducing word sense representations Word vs sense embeddings
  • 8. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 6/54 Inducing word sense representations Word vs sense embeddings
  • 9. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 7/54 Inducing word sense representations Related work
  • 10. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 8/54 AutoExtend [Rothe & Schütze, 2015] * image is reproduced from the original paper Inducing word sense representations Related work: knowledge-based
  • 11. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54 Adagram [Bartunov et al., 2016] Multiple vector representations θ for each word: Inducing word sense representations Related work: knowledge-free
  • 12. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54 Adagram [Bartunov et al., 2016] Multiple vector representations θ for each word: p(Y, Z, β|X, α, θ) = V∏ w=1 ∞∏ k=1 p(βwk|α) N∏ i=1 [p(zi|xi, β) C∏ j=1 p(yij|zi, xi, θ)], zi – a hidden variable: a sense index of word xi in context C; α – a meta-parameter controlling number of senses. Inducing word sense representations Related work: knowledge-free
  • 13. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 9/54 Adagram [Bartunov et al., 2016] Multiple vector representations θ for each word: p(Y, Z, β|X, α, θ) = V∏ w=1 ∞∏ k=1 p(βwk|α) N∏ i=1 [p(zi|xi, β) C∏ j=1 p(yij|zi, xi, θ)], zi – a hidden variable: a sense index of word xi in context C; α – a meta-parameter controlling number of senses. See also: [Neelakantan et al., 2014] and [Li and Jurafsky, 2015] Inducing word sense representations Related work: knowledge-free
  • 14. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 10/54 Word sense induction (WSI) based on graph clustering: [Lin, 1998] [Pantel and Lin, 2002] [Widdows and Dorow, 2002] Chinese Whispers [Biemann, 2006] [Hope and Keller, 2013] Inducing word sense representations Related work: word sense induction
  • 15. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 11/54 * source of the image: http://ic.pics.livejournal.com/blagin_anton/33716210/2701748/2701748_800.jpg Inducing word sense representations Related work: Chinese Whispers#1
  • 16. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 12/54 Inducing word sense representations Related work: Chinese Whispers#2
  • 17. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 13/54 Inducing word sense representations Related work: Chinese Whispers#2
  • 18. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 14/54 Inducing word sense representations Related work: Chinese Whispers#2
  • 19. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 15/54 RepL4NLP@ACL’16 [Pelevina et al., 2016], LREC’18 [Remus & Biemann, 2018] Prior methods: Induce inventory by clustering of word instances Use existing sense inventories Our method: Input: word embeddings Output: word sense embeddings Word sense induction by clustering of word ego-networks Inducing word sense representations Sense embeddings using retrofitting
  • 20. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 16/54 From word embeddings to sense embeddings Calculate Word Similarity Graph Learning Word Vectors Word Sense Induction Text Corpus Word Vectors Word Similarity Graph Pooling of Word Vectors Sense Inventory Sense Vectors 1 2 4 3 Inducing word sense representations Sense embeddings using retrofitting
  • 21. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 17/54 Word sense induction using ego-network clustering Inducing word sense representations Sense embeddings using retrofitting
  • 22. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54 Neighbours of Word and Sense Vectors Vector Nearest Neighbors table tray, bottom, diagram, bucket, brackets, stack, basket, list, parenthesis, cup, saucer, pile, playfield, bracket, pot, drop-down, cue, plate Inducing word sense representations Sense embeddings using retrofitting
  • 23. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 18/54 Neighbours of Word and Sense Vectors Vector Nearest Neighbors table tray, bottom, diagram, bucket, brackets, stack, basket, list, parenthesis, cup, saucer, pile, playfield, bracket, pot, drop-down, cue, plate table#0 leftmost#0, column#1, tableau#1, indent#1, bracket#3, pointer#0, footer#1, cursor#1, diagram#0, grid#0 table#1 pile#1, stool#1, tray#0, basket#0, bowl#1, bucket#0, box#0, cage#0, saucer#3, mirror#1, pan#1, lid#0 Inducing word sense representations Sense embeddings using retrofitting
  • 24. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 19/54 Word Sense Disambiguation 1 Context extraction: use context words around the target word 2 Context filtering: based on context word’s relevance for disambiguation 3 Sense choice in context: maximise similarity between a context vector and a sense vector Inducing word sense representations Sense embeddings using retrofitting
  • 25. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 20/54 Inducing word sense representations Sense embeddings using retrofitting
  • 26. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 21/54 Inducing word sense representations Sense embeddings using retrofitting
  • 27. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 22/54 Inducing word sense representations Sense embeddings using retrofitting
  • 28. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 23/54 Inducing word sense representations Sense embeddings using retrofitting
  • 29. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54 Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]: comparable to SOTA, incl. Adagram sense embeddings. Inducing word sense representations Sense embeddings using retrofitting
  • 30. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 24/54 Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]: comparable to SOTA, incl. Adagram sense embeddings. Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]: autoextend adagram SGNS . glove . sympat . LSAbow . LSAhal . paragramSL . SimLex999 0.45 0.29 0.44 0.37 0.54 0.30 0.27 0.68 MEN 0.72 0.67 0.77 0.73 0.53 0.67 0.71 0.77 SimVerb 0.43 0.27 0.36 0.23 0.37 0.15 0.19 0.53 WordSim353 0.58 0.61 0.70 0.61 0.47 0.67 0.59 0.72 SimLex999-N 0.44 0.33 0.45 0.39 0.48 0.32 0.34 0.68 MEN-N 0.72 0.68 0.77 ____ 0.76 ____ 0.57 ____ 0.71 ____ 0.73 ____ 0.78 ____ Inducing word sense representations Sense embeddings using retrofitting
  • 31. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 25/54 Unsupervised WSD SemEval’13, ReprL4NLP [Pelevina et al., 2016]: comparable to SOTA, incl. sense embeddings. Semantic relatedness, LREC’2018 [Remus & Biemann, 2018]: autoextend adagram SGNS SGNS+senses glove glove+senses sympat sympat+senses LSAbow LSAbow+senses LSAhal LSAhal+senses paragramSL paragramSL+senses SimLex999 0.45 0.29 0.44 0.46 0.37 0.41 0.54 0.55 0.30 0.39 0.27 0.38 0.68 0.64 MEN 0.72 0.67 0.77 0.78 0.73 0.77 0.53 0.68 0.67 0.70 0.71 0.74 0.77 0.80 SimVerb 0.43 0.27 0.36 0.39 0.23 0.30 0.37 0.45 0.15 0.22 0.19 0.28 0.53 0.53 WordSim353 0.58 0.61 0.70 0.69 0.61 0.65 0.47 0.62 0.67 0.66 0.59 0.63 0.72 0.73 SimLex999-N 0.44 0.33 0.45 0.50 0.39 0.47 0.48 0.55 0.32 0.46 0.34 0.44 0.68 0.66 MEN-N 0.72 0.68 0.77 0.79 0.76 0.80 0.57 0.74 0.71 0.73 0.73 0.76 0.78 0.81 Inducing word sense representations Sense embeddings using retrofitting
  • 32. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 26/54 Word and sense embeddings of words iron and vitamin. LREC’18 [Remus & Biemann, 2018] Inducing word sense representations Sense embeddings using retrofitting
  • 33. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 27/54 ACL’17 [Ustalov et al., 2017b] Examples of extracted synsets: Size Synset 2 {decimal point, dot} 3 {gullet, throat, food pipe} 4 {microwave meal, ready meal, TV dinner, frozen dinner} 5 {objective case, accusative case, oblique case, object case, accusative} 6 {radiotheater, dramatizedaudiobook, audiotheater, ra- dio play, radio drama, audio play} Inducing word sense representations Synset induction
  • 34. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 28/54 Outline of the ’Watset’ method: Background Corpus Synonymy Dictionary Learning Word Embeddings Graph Construction Synsets Word Similarities Ambiguous Weighted Graph Local Clustering: Word Sense Induction Global Clustering: Synset Induction Sense Inventory Disambiguation of Neighbors Disambiguated Weighted Graph Local-Global Fuzzy Graph Clustering Inducing word sense representations Synset induction
  • 35. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 29/54 Inducing word sense representations Synset induction
  • 36. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 30/54 Inducing word sense representations Synset induction
  • 37. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 31/54 Inducing word sense representations Synset induction
  • 38. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54 CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 WordNet (English) F−score CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 BabelNet (English) F−score Inducing word sense representations Synset induction
  • 39. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 32/54 CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 WordNet (English) F−score CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 BabelNet (English) F−score CW MCL MaxMax ECO CPM Watset 0.00 0.05 0.10 0.15 0.20 RuWordNet (Russian) F−score CW MCL MaxMax ECO CPM Watset 0.0 0.1 0.2 0.3 0.4 YARN (Russian) F−score Inducing word sense representations Synset induction
  • 40. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 33/54 Examples of semantic classes: ID Sense Cluster Hypernyms 1 peach#1, banana#1, pineapple#0, berry#0, blackberry#0, grapefruit#0, strawberry#0, blue- berry#0, grape#0, melon#0, orange#0, pear#0, plum#0, raspberry#0, watermelon#0, apple#0, apricot#0, … fruit#0, crop#0, ingredi- ent#0, food#0, · 2 C#4, Basic#2, Haskell#5, Flash#1, Java#1, Pas- cal#0, Ruby#6, PHP#0, Ada#1, Oracle#3, Python#3, Apache#3, Visual Basic#1, ASP#2, Delphi#2, SQL Server#0, CSS#0, AJAX#0, the Java#0, … programming language#3, technology#0, language#0, format#2, app#0 Inducing word sense representations Induction of semantic classes
  • 41. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 34/54 Text Corpus Representing Senses with Ego Networks Semantic Classes Word Sense Induction from Text Corpus Sense Graph Construction Clustering of Word Senes Labeling Sense Clusters with Hypernyms Induced Word Senses Sense Ego-Networks Global Sense Graph s Noisy Hypernyms Cleansed Hypernyms Induction of Semantic Classes Global Sense Clusters Inducing word sense representations Induction of semantic classes
  • 42. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 35/54 Filtering noisy hypernyms with semantic classes LREC’18 [Panchenko et al., 2018]: fruit#1 food#0 apple#2 mango#0 pear#0 Hypernyms, Sense Cluster, mangosteen#0 city#2 Removed Wrong Added Missing Inducing word sense representations Induction of sense semantic classes
  • 43. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 36/54 Filtering of a noisy hypernymy database with semantic classes. LREC’18 [Panchenko et al., 2018] Precision Recall F-score Original Hypernyms (Seitner et al., 2016) 0.475 0.546 0.508 Semantic Classes (coarse-grained) 0.541 0.679 0.602 Inducing word sense representations Induction of sense semantic classes
  • 44. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 37/54 Making induced senses interpretable
  • 45. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 38/54 Knowledge-based sense representations are interpretable Making induced senses interpretable Making induced senses interpretable
  • 46. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 39/54 Most knowledge-free sense representations are uninterpretable Making induced senses interpretable Making induced senses interpretable
  • 47. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 40/54 Making induced senses interpretable Making induced senses interpretable
  • 48. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 41/54 Hypernymy prediction in context. EMNLP’17 [Panchenko et al., 2017b] Making induced senses interpretable Making induced senses interpretable
  • 49. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54 11.702 sentences, 863 words with avg.polysemy of 3.1. WSD Model Accuracy Inventory Features Hypers HyperHypers Word Senses Random 0.257 0.610 Word Senses MFS 0.292 0.682 Word Senses Cluster Words 0.291 0.650 Word Senses Context Words 0.308 0.686 Making induced senses interpretable Making induced senses interpretable
  • 50. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 42/54 11.702 sentences, 863 words with avg.polysemy of 3.1. WSD Model Accuracy Inventory Features Hypers HyperHypers Word Senses Random 0.257 0.610 Word Senses MFS 0.292 0.682 Word Senses Cluster Words 0.291 0.650 Word Senses Context Words 0.308 0.686 Super Senses Random 0.001 0.001 Super Senses MFS 0.001 0.001 Super Senses Cluster Words 0.174 0.365 Super Senses Context Words 0.086 0.188 Making induced senses interpretable Making induced senses interpretable
  • 51. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 43/54 Linking induced senses to resources
  • 52. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 44/54 Text Corpus Linking Induced Senses to Senses of the LR Induce a Graph of Sem. Related Words Enriched Lexical Resource Graph of Related Words Word Sense Induction Labeling Senses with Hypernyms Disambiguation of Neighbours Typing of the Unmapped Induced Senses Word Sense Inventory Labeled Word Senses PCZ Construction of Proto-Conceptualization (PCZ) Linking Proto-Conceptualization to Lexical Resource Part. Linked Senses to the LR Lexical Resource (LR): WordNet, BabelNet, ... Construction of sense feature representations Graph of Related Senses LREC’16 [Panchenko, 2016], ISWC’16 [Faralli et al., 2016], SENSE@EACL’17 [Panchenko et al., 2017a], NLE’18 [Biemann et al., 2018] Linking induced senses to resources Linking induced senses to resources
  • 53. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 45/54 Word AdaGram BabelNet AdaGram BoW BabelNet BoW python 2 bn:01713224n perl, php, java, smalltalk, ruby, lua, tcl, scripting, javascript, bindings, binding, programming, coldfusion, actionscript, net, . . . language, programming, python- ista, python programming, python3, python2, level, com- puter, pythonistas, python3000, python 1 bn:01157670n monty, circus, spamalot, python, magoo, muppet, snoopy, fea- turette, disney, tunes, tune, clas- sic, shorts, short, apocalypse, . . . monty, comedy, monty python, british, monte, monte python, troupe, pythonesque, foot, artist, record, surreal, terry, . . . python 3 bn:00046456n spectacled, unicornis, snake, gi- ant, caiman, leopard, squirrel, crocodile, horned, cat, mole, ele- phant, opossum, pheasant, . . . molurus, indian, boa, tigris, tiger python, rock, tiger, indian python, reptile, python molurus, indian rock python, coluber, . . . python 4 bn:01157670n circus, fly, flying, dusk, lizard, moth, unicorn, puff, adder, vul- ture, tyrannosaurus, zephyr, bad- ger, . . . monty, comedy, monty python, british, monte, monte python, troupe, pythonesque, foot, artist, record, surreal, terry, . . . python 1 bn:00473212n monty, circus, spamalot, python, magoo, muppet, snoopy, fea- turette, disney, tunes, tune, clas- sic, shorts, short, apocalypse, . . . pictures, monty, python monty pictures, limited, company, python pictures limited, king- dom, picture, serve, director, . . . python 1 bn:03489893n monty, circus, spamalot, python, magoo, muppet, snoopy, fea- turette, disney, tunes, tune, clas- sic, shorts, short, apocalypse, . . . film, horror, movie, clabaugh, richard, monster, century, direct, snake, python movie, television, giant, natural, language, for-tv, . . . Linking induced senses to resources Linking induced senses to resources
  • 54. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 46/54 Model Representation of the Sense ”disk (medium)” WordNet memory, device, floppy, disk, hard, disk, disk, computer, science, computing, diskette, fixed, disk, floppy, magnetic, disc, magnetic, disk, hard, disc, storage, device WordNet + Linked recorder, disk, floppy, console, diskette, handset, desktop, iPhone, iPod, HDTV, kit, RAM, Discs, Blu- ray, computer, GB, microchip, site, cartridge, printer, tv, VCR, Disc, player, LCD, software, component, camcorder, cellphone, card, monitor, display, burner, Web, stereo, internet, model, iTunes, turntable, chip, cable, camera, iphone, notebook, device, server, surface, wafer, page, drive, laptop, screen, pc, television, hardware, YouTube, dvr, DVD, product, folder, VCR, radio, phone, circuitry, partition, megabyte, peripheral, format, machine, tuner, website, merchandise, equipment, gb, discs, MP3, hard-drive, piece, video, storage device, memory device, microphone, hd, EP, content, soundtrack, webcam, system, blade, graphic, microprocessor, collection, document, programming, battery, key- board, HD, handheld, CDs, reel, web, material, hard-disk, ep, chart, debut, configuration, recording, album, broadcast, download, fixed disk, planet, pda, microfilm, iPod, videotape, text, cylinder, cpu, canvas, label, sampler, workstation, electrode, magnetic disc, catheter, magnetic disk, Video, mo- bile, cd, song, modem, mouse, tube, set, ipad, signal, substrate, vinyl, music, clip, pad, audio, com- pilation, memory, message, reissue, ram, CD, subsystem, hdd, touchscreen, electronics, demo, shell, sensor, file, shelf, processor, cassette, extra, mainframe, motherboard, floppy disk, lp, tape, version, kilobyte, pacemaker, browser, Playstation, pager, module, cache, DVD, movie, Windows, cd-rom, e- book, valve, directory, harddrive, smartphone, audiotape, technology, hard disk, show, computing, computer science, Blu-Ray, blu-ray, HDD, HD-DVD, scanner, hard disc, gadget, booklet, copier, play- back, TiVo, controller, filter, DVDs, gigabyte, paper, mp3, CPU, dvd-r, pipe, cd-r, playlist, slot, VHS, film, videocassette, interface, adapter, database, manual, book, channel, changer, storage Linking induced senses to resources Linking induced senses to resources
  • 55. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 47/54 Evaluation of linking accuracy: Linking induced senses to resources Linking induced senses to resources
  • 56. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 48/54 Evaluation of enriched representations based on WSD: Linking induced senses to resources Linking induced senses to resources
  • 57. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 49/54 Conclusion
  • 58. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 50/54 Conclusion Vectors + Graphs = ♡
  • 59. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54 We can induce word senses, synsets and semantic classes in a knowledge-free way using graph clustering and distributional models. Conclusion Take home messages
  • 60. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54 We can induce word senses, synsets and semantic classes in a knowledge-free way using graph clustering and distributional models. We can make the induced word senses interpretable in a knowledge-free way with hypernyms, images, definitions. Conclusion Take home messages
  • 61. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 51/54 We can induce word senses, synsets and semantic classes in a knowledge-free way using graph clustering and distributional models. We can make the induced word senses interpretable in a knowledge-free way with hypernyms, images, definitions. We can link induced senses to lexical resources to improve performance of WSD; enrich lexical resources with emerging senses. Conclusion Take home messages
  • 62. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 52/54 Participate in an ACL SIGSLAV sponsored shared task on word sense induction and disambiguation for Russian! More details: http://russe.nlpub.org/2018/wsi Conclusion An ongoing shared task on WSI&D
  • 63. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 53/54 Thank you! Questions? This research was supported by Conclusion Acknowledgments
  • 64. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Evaluation on SemEval 2013 Task 13 WSI&D: Model Jacc. Tau WNDCG F.NMI F.B-Cubed AI-KU (add1000) 0.176 0.609 0.205 0.033 0.317 AI-KU 0.176 0.619 0.393 0.066 0.382 AI-KU (remove5-add1000) 0.228 0.654 0.330 0.040 0.463 Unimelb (5p) 0.198 0.623 0.374 0.056 0.475 Unimelb (50k) 0.198 0.633 0.384 0.060 0.494 UoS (#WN senses) 0.171 0.600 0.298 0.046 0.186 UoS (top-3) 0.220 0.637 0.370 0.044 0.451 La Sapienza (1) 0.131 0.544 0.332 – – La Sapienza (2) 0.131 0.535 0.394 – – AdaGram, α = 0.05, 100 dim 0.274 0.644 0.318 0.058 0.470 w2v 0.197 0.615 0.291 0.011 0.615 w2v (nouns) 0.179 0.626 0.304 0.011 0.623 JBT 0.205 0.624 0.291 0.017 0.598 JBT (nouns) 0.198 0.643 0.310 0.031 0.595 TWSI (nouns) 0.215 0.651 0.318 0.030 0.573 Conclusion Sense embeddings using retrofitting
  • 65. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Bartunov, S., Kondrashkin, D., Osokin, A., & Vetrov, D. (2016). Breaking sticks and ambiguities with adaptive skip-gram. In Artificial Intelligence and Statistics (pp. 130–138). Biemann, C., Faralli, S., Panchenko, A., & Ponzetto, S. P. (2018). A framework for enriching lexical semantic resources with distributional semantics. In Journal of Natural Language Engineering (pp. 56–64).: Cambridge Press. Faralli, S., Panchenko, A., Biemann, C., & Ponzetto, S. P. (2016). Linked disambiguated distributional semantic networks. In International Semantic Web Conference (pp. 56–64).: Springer. Panchenko, A. (2016). Best of both worlds: Making word sense embeddings interpretable. In LREC.
  • 66. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Panchenko, A., Faralli, S., Ponzetto, S. P., & Biemann, C. (2017a). Using linked disambiguated distributional networks for word sense disambiguation. In Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications (pp. 72–78). Valencia, Spain: Association for Computational Linguistics. Panchenko, A., Marten, F., Ruppert, E., Faralli, S., Ustalov, D., Ponzetto, S. P., & Biemann, C. (2017b). Unsupervised, knowledge-free, and interpretable word sense disambiguation. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (pp. 91–96). Copenhagen, Denmark: Association for Computational Linguistics. Panchenko, A., Ruppert, E., Faralli, S., Ponzetto, S. P., & Biemann, C. (2017c).
  • 67. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Unsupervised does not mean uninterpretable: The case for word sense induction and disambiguation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers (pp. 86–98). Valencia, Spain: Association for Computational Linguistics. Panchenko, A., Ustalov, D., Faralli, S., Ponzetto, S. P., & Biemann, C. (2018). Improving hypernymy extraction with distributional semantic classes. In Proceedings of the LREC 2018 Miyazaki, Japan: European Language Resources Association. Pelevina, M., Arefiev, N., Biemann, C., & Panchenko, A. (2016). Making sense of word embeddings. In Proceedings of the 1st Workshop on Representation Learning for NLP (pp. 174–183). Berlin, Germany: Association for Computational Linguistics. Remus, S. & Biemann, C. (2018).
  • 68. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Retrofittingword representations for unsupervised sense aware word similarities. In Proceedings of the LREC 2018 Miyazaki, Japan: European Language Resources Association. Rothe, S. & Schütze, H. (2015). Autoextend: Extending word embeddings to embeddings for synsets and lexemes. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (pp. 1793–1803). Beijing, China: Association for Computational Linguistics. Ustalov, D., Chernoskutov, M., Biemann, C., & Panchenko, A. (2017a). Fighting with the sparsity of synonymy dictionaries for automatic synset induction. In International Conference on Analysis of Images, Social Networks and Texts (pp. 94–105).: Springer.
  • 69. Jan 11, 2018 Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources, Alexander Panchenko 54/54 Ustalov, D., Panchenko, A., & Biemann, C. (2017b). Watset: Automatic induction of synsets from a graph of synonyms. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 1579–1590). Vancouver, Canada: Association for Computational Linguistics. Ustalov, D., Teslenko, D., Panchenko, A., Chernoskutov, M., & Biemann, C. (2018). Word sense disambiguation based on automatically induced synsets. In LREC 2018, 11th International Conference on Language Resources and Evaluation : 7-12 May 2018, Miyazaki (Japan) (pp. tba). Paris: European Language Resources Association, ELRA-ELDA. Accepted for publication.