SlideShare a Scribd company logo
1 of 35
Download to read offline
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Associative thesauri: structure and analysis
Brown bag seminar

Ekaterina Vylomova
Fulbright scholar at Montclair State University

February 21, 2014

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Brief bio

Brief Bio
2011: MSc, Bauman Moscow State Technical University

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Brief bio

Brief Bio
2011: MSc, Bauman Moscow State Technical University
2009: BSc, Bauman Moscow State Technical University

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Brief bio

Brief Bio
2011: MSc, Bauman Moscow State Technical University
2009: BSc, Bauman Moscow State Technical University
2009: Yandex School of Data Analysis (Moscow Institute of
Physics & Technology)

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's AE?

Associative Experiments
What's AE?
Associative experiment is one of methods of psycholinguistics. It's
based on method of free associations.
Sir Francis Galton conducted the rst experiment in 1879.

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's AE?

Associative Experiments
What's AE?
Associative experiment is one of methods of psycholinguistics. It's
based on method of free associations.
Sir Francis Galton conducted the rst experiment in 1879.
Types of AE
Single Free Association
Multiple Free Associations
Single Controlled Association (synonym, noun, verb, hyponym,
etc.)
Multiple Controlled Associations
E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's associative thesaurus?
Example of data
AT for dierent languages
Slavic Associative Thesauri

What's associative thesaurus?

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's associative thesaurus?
Example of data
AT for dierent languages
Slavic Associative Thesauri

Example of data
EAT Word Associations
CAT stimulated the following associations:
DOG 49 0.52
MOUSE 8 0.08
BLACK 4 0.04
MAT 3 0.03
ANIMAL 2 0.02
EYES 2 0.02
GUT 2 0.02
KITTEN 2 0.02
E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's associative thesaurus?
Example of data
AT for dierent languages
Slavic Associative Thesauri

AT for dierent languages
English
The Structure of Associations in Language and Thought
(Deese, 1965)
Word association (Cramer, 1968)
An associative thesaurus of English and its computer analysis
(Kiss et al., 1973)
Word Association, rhyme and fragment norms (Nelson,
McEvoy  Schreiber, 1999)

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's associative thesaurus?
Example of data
AT for dierent languages
Slavic Associative Thesauri

AT for dierent languages
Dutch
Word association norms with response times (De Groot, 1988)
Word associations: Norms for 1,424 Dutch words in a
continuous task (De Deyne  Storms, 2008)
Swedish
A Swedish Associative Thesaurus (Lonngren, 1998)

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's associative thesaurus?
Example of data
AT for dierent languages
Slavic Associative Thesauri

AT for dierent languages
Japanese
Construction of associative concept dictionary with distance
information, and comparison with electronic concept dictionary
(Okamoto  Ishizaki, 2001)
Building a word association database for basic Japanese
vocabulary (Joyce, 2005)
Korean
Network analysis of Korean Word Associations(Jung et al.,
2010)
E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's associative thesaurus?
Example of data
AT for dierent languages
Slavic Associative Thesauri

AT for dierent languages

Czech
Volne slovni parove asociace v cestine (Novak, 1988)
Hebrew
Free association norms in the Hebrew language (Rubinsten,
2005)

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

What's associative thesaurus?
Example of data
AT for dierent languages
Slavic Associative Thesauri

Slavic Associative Thesauri
Dictionary of associative norms in Russian (Leontiev,1973)
Russian Associative Thesaurus (Karaulov et al.,2002)
Slavic Associative Thesaurus(Russian, Belorussian,Bulgarian,
Ukrainian) (Umtseva et al., 2004)
Normas asociativas del espanol y del ruso(Sanchez
Puig,Karaulov,Cherkasova, 2000)

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Data
Research

Russian associative experiment description
Time frame: 1988-1998
Participants: 11,000 1st-3rd year students; 34 specialities
Stimuli: 6,624(initial list: 1,277)
Associative pairs:1,032,522 (dierent - 462,500)
Reactions:102,926
Subset used for analysis
Stimuli: 6,577
Reactions:21,312
Associative pairs:102,516
Dataset
Set of triplets:  c , r , w , where w =
i

j

ij

E. Vylomova

ij

freqij

n

1 freqij
Associative thesauri
j=

.
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Data
Research

Comparison with frequency dictionary of Russian language
Frequency dictionary
Frequency dictionary of modern Russian language (Lyashevskaya,
Sharov, 2009).
Based on the texts from Russian National Corpus
(www.ruscorpora.ru) and includes information about 20,000 most
common words in Russian language.
RAT Lemmatisation
RAT-MyStem(Segalovich, 2003)-lemmas

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Data
Research

Comparison with frequency dictionary of Russian language
TOP-11 Nouns
RAT

FreqDict

E. Vylomova

Associative thesauri

Human
Home, House
Money
Day
Friend
Home
Male
Fool
Business
Life
Illness

Year
Human
Time
Business
Life
Day
Hand
Work
Word
Place
Friend
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Data
Research

Comparison with frequency dictionary of Russian language

Semantic primes?
Concept Human: human child friend male
Concept Time: day time
Adjectives: good bad big.
These concepts don't change over the time.

Positive correlation with semantic primes (Wierzbicka)

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Description
Associative Network based on RAT'98
Network analysis

Description
Nodes correspond to words(lemmas)
Edges correspond to associations
Edge's weight correspond to association strength

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Description
Associative Network based on RAT'98
Network analysis

Main characteristics of the network
Nodes: |V | = 23, 195, among them:

nodes with outgoing edges(stimuli): |S | = 1, 883
nodes with incoming edges(reactions): |R | = 16, 618
nodes with both types of edges: |SR | = 4, 694

Edges: |E | = 102, 516

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Description
Associative Network based on RAT'98
Network analysis

Table of network characteristics
Sign

N
L
D
k
ψ

Description

Number of nodes
average shortest path length
Diameter
Average node degree
Degree distribution (P(k)) par.

Directed

23,195
3.98
9
4.42
2.2

Directed to undirected

w

ij

= wji = wij + wji

Degree distribution function

P (k ) ≈ k −ψ

E. Vylomova

Associative thesauri

Undirected

23,195
3.83
8
8.83
1.85
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Description
Associative Network based on RAT'98
Network analysis

Small-world networks
Denition
Introduced by Milgram, 1967 (The small world problem)
L ∝ log (N ),i.e. distance L between two randomly chosen nodes
grows proportionally to the logarithm of the number of nodes N in
the network
Also known as Six degrees of separation
Examples
World Wide Web (WWW; Adamic, 1999; Albert, Jeong, 
Barabasi, 1999), networks of scientic collaboration (Newman,
2001),metabolic networks in biology (Jeong, Tombor, Albert,
Oltval,  Barabasi, 2000)
E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Description
Associative Network based on RAT'98
Network analysis

Scale-free networks
Description
Amaral, Scala et al., 2000 studied small-world networks and
compared degree distribution function P (k ).
2 types of distribution:
exponential(power grid system in USA, neural system of
C.elegans)
power law(WWW, metabolic networks): P (k ) = k −ψ ,
ψ ∈ (2..4)
Scale-free networks provide better signal propagation.
E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Description
Associative Network based on RAT'98
Network analysis

Scale-free networks

Other examples
Similar results were obtained for Roget thesaurus(Roget,
1911),WordNet and associative networks(Steyvers and Tenenbaum,
2005).
E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Three Models of Associative Network
Concept-based model
Vector-based models
Multidimensional scaling(Torgerson,1958)
Latent Semantic analysis(Landauer, Dumais, 1997)

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Data
Core of the network: 4,692 lemmas with 59,392 connections
The structure is similar to associative network(nodes-lemmas, edges
- associations)
Activity accumulation
1. Initial state: random activity
2. Spreading of activation: S = S −1 +
w S −1 , where S is
activity of neuron i at the moment t .
3. Activation exceeds the threshold = produce the reaction.
S = 0.
t

t

i

i

t

j

ij

t

i

E. Vylomova

Associative thesauri

t

j

i
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Pros and cons
Pros

very simple model
easy to understand
easy to modify(no need in reevaluation of the model)

Cons
unclear how to choose the threshold value(we did series of
experiment to nd optimal value)
once activation is released, should we also do modication for
neighbouring neurons?
E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Multidimensional scaling
From concept to vector
Distance matrix:

δ1,1 δ1,2 · · ·
δ2,1 δ2,2 · · ·

= .
. ..
.
 .
.
.
.
δI ,1 δI ,2 · · ·


δ1,I
δ2,I 

. 
. 
.
δ I ,I

where I means number of objects(words).
Our goal is to nd such vectors x1 , ..., x ∈ R that
for all i , j ∈ I .
In other words:
2
min 1 ,..,
 ( x −x −δ ) .
I

x

xI

i

j

i

j

N

ij

E. Vylomova

Associative thesauri

x

i

− xj ≈ δij
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Latent Semantic Analysis
From concept to vector-2
Technique of analysing relationships between a set of documents
and the terms they contain by producing a set of concepts related
to the documents and terms.
In my case
Terms are lemmas, document is a set of associations for a given
stimulus.
Inputs: term-document matrix with TF*IDF values
Term frequency: TF = w =
, Inverse document
freqij

ij

N

1 freqij

j=

frequency: IDF = log
|S | is a total number of
stimuli.Singular Value Decomposition = vector representations.
|S |
|s ∈ S : r ∈ s | ,

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Clustering
k-means
So, we've got vectors. What's next?
Let's evaluate similarity:
First, set a distance metric, e.g. d =
=1 |x − x |
And use it with k-means clustering:
min =1 ∈ (x − µ )2 ,
where k is a number of clusters, S are evaluated clusters,µ are
centers of the clusters.
So, the technique is based on nding the nearest cluster.
r

ij

N
k

r

ik

jk

k
i

xj

Si

j

i

i

E. Vylomova

i

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Clustering

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Pros and cons
Pros

easy to operate with vectors: add, multiple, subtract, etc.
possible to set preferred dimensionality and visualize

Cons
problem with storage: matrices are huge
complexity: MDS and LSA are based on SVD; it takes O (n3 )
choosing optimal number of clusters and dimensionality

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Tip of the tongueapplication
DataMethod
Data: RAT+Abramov's synonym dictionary
Method: LSA+k-means

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Models of Associative Network
Concept-based model

Tip of the tongueapplication
DataMethod
Usage of associative thesauri for solving tasks related
to the “tip of the tongue” phenomenon
Ekaterina Vylomova
Bauman Moscow State Technical University, Moscow State University of Printing Arts

DATA

INTRODUCTION
The tip-of-the-tongue(TOT) phenomenon is the failure to retrieve a word
from memory, combined with partial recall and the feeling that retrieval is
imminent. People in a tip-of-the-tongue state can often recall one or more
features of the target word, such as the first letter, its syllabic stress, and
words similar in sound and/or meaning.
•TOT appears to be universal (Brennenet al. 2007)
•An occasional tip-of-the-tongue state is normal for people of all ages
•TOT becomes more frequent as people age.
R. Braun, D. McNeill and A. Luria consider the processes of
recalling and naming the words as processes of probabilistic choice of a
word from involuntary associations’ chain and relate them to the
construction of human semantic memory.

Abramov. Dictionary of Russian synonyms and similar expressions,
1890-1999
19,297 words  phrases
18,136 synonym articles
Karaulov Y.,, Tarasov E., Sorokin Y., Ufimtseva N., Cherkasova G..
1999. Associative thesaurus of modern russian language. RAS,
Moscow.
56,540 associative pairs
50,923 associative pairs (after lemmatization)
26,803 lemmas
Overall (synonym and associative pairs combined together)
316,018

METHODOLOGY

RAT
RAT
Abramov

RAT
Abramov
dict.
lemmas

Lemmatization

Abramov
RAT
dict.

Lemmatization using Yandex
mystem stemmer

LSA  k-NN

Apply Latent Semantic Algorithm to
get vector representation of words
and k-nearest neighbours for
clustering

Clusters containing similar by
meaning and association words

EXAMPLE

Hmm...What's the name
Hmm...What's the name
of that Ukranian food?
of that Ukranian food?

Associative thesauri+Abramov dictionary:
Комильфо - приличие
After clustering:
не выходить из пределов благопристойности

0.001

степенный

0.591

чинный

0.591

благочинный

0.646

бонтонный

0.646

комильфотный

0.646

пристойный

0.646

благонравный

0.684

благоприличный

0.684

благопристойный

0.684

корректный

0.684

REFERENCES
FUTURE PLANS
1. Expand synonym and associative thesauri with new ones
2. Add first letter filtering (see above)
3. Add hyponyms and hyperonyms

RHF #12-04-12039B

E. Vylomova

1. Brown, R., and McNeill, D. (1966). The tip of the tongue phenomenon.
Journal of Verbal Learning and Verbal Behavior 5, 325-337.
2. Караулов Ю.Н., Тарасов Е.Ф., Сорокин Ю.А., Уфимцева Н.В., Черкасова
Г.А. (1999). Ассоциативный тезаурус современного русского языка. РАН.
(russian)
3. Лурия А.Р. (1979). Язык и сознание.//под редакцией Хомско Е.Д., МГУ,
Москва - 320 стр.(russian)

E-mail: evylomova@gmail.com

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

RAT'10
Time frame: 20 years after the rst one(2009-2010)
Location: dierent regions of Russia.
Stimuli included 1000 most frequent words in Russian language.
The participants: young people at the age of 17-25.

E. Vylomova

Associative thesauri
Introduction
Associative Experiments
Associative Thesauri
Russian Associative Thesaurus'98
Associative Network(Graph)
Modelling of Associative Network
Future work

Thank you!
Questions?

E. Vylomova

Associative thesauri

More Related Content

Similar to Ekaterina Vylomova/Brown Bag seminar presentation

Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataAlexander Panchenko
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalBhaskar Mitra
 
Designing, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural NetworksDesigning, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural Networksconnectbeubax
 
EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...
EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...
EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...Enrico Santus Aversano
 
Citation metrics and the stories they tell
Citation metrics and the stories they tellCitation metrics and the stories they tell
Citation metrics and the stories they tellCarl Bergstrom
 
Aletras, Nikolaos and Stevenson, Mark (2013) "Evaluating Topic Coherence Us...
Aletras, Nikolaos  and  Stevenson, Mark (2013) "Evaluating Topic Coherence Us...Aletras, Nikolaos  and  Stevenson, Mark (2013) "Evaluating Topic Coherence Us...
Aletras, Nikolaos and Stevenson, Mark (2013) "Evaluating Topic Coherence Us...pathsproject
 

Similar to Ekaterina Vylomova/Brown Bag seminar presentation (8)

Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK Data
 
Semeval Deep Learning In Semantic Similarity
Semeval Deep Learning In Semantic SimilaritySemeval Deep Learning In Semantic Similarity
Semeval Deep Learning In Semantic Similarity
 
Using Text Embeddings for Information Retrieval
Using Text Embeddings for Information RetrievalUsing Text Embeddings for Information Retrieval
Using Text Embeddings for Information Retrieval
 
Designing, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural NetworksDesigning, Visualizing and Understanding Deep Neural Networks
Designing, Visualizing and Understanding Deep Neural Networks
 
AINL 2016: Malykh
AINL 2016: MalykhAINL 2016: Malykh
AINL 2016: Malykh
 
EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...
EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...
EVALution 1.0 - An Evolving Semantic Dataset for Trainining and Evaluation of...
 
Citation metrics and the stories they tell
Citation metrics and the stories they tellCitation metrics and the stories they tell
Citation metrics and the stories they tell
 
Aletras, Nikolaos and Stevenson, Mark (2013) "Evaluating Topic Coherence Us...
Aletras, Nikolaos  and  Stevenson, Mark (2013) "Evaluating Topic Coherence Us...Aletras, Nikolaos  and  Stevenson, Mark (2013) "Evaluating Topic Coherence Us...
Aletras, Nikolaos and Stevenson, Mark (2013) "Evaluating Topic Coherence Us...
 

More from Katerina Vylomova

Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages Katerina Vylomova
 
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...Katerina Vylomova
 
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflectionSigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflectionKaterina Vylomova
 
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...Katerina Vylomova
 
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological InflectionSIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological InflectionKaterina Vylomova
 
Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2Katerina Vylomova
 
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1Katerina Vylomova
 
Evaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in PsychologyEvaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in PsychologyKaterina Vylomova
 
Contextualization of Morphological Inflection
Contextualization of Morphological InflectionContextualization of Morphological Inflection
Contextualization of Morphological InflectionKaterina Vylomova
 
Paradigm Completion for Derivational Morphology
Paradigm Completion for Derivational MorphologyParadigm Completion for Derivational Morphology
Paradigm Completion for Derivational MorphologyKaterina Vylomova
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingKaterina Vylomova
 
Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017Katerina Vylomova
 
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...Katerina Vylomova
 
Neural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chantsNeural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chantsKaterina Vylomova
 
Russia, Russians and Russian language
Russia, Russians and Russian languageRussia, Russians and Russian language
Russia, Russians and Russian languageKaterina Vylomova
 

More from Katerina Vylomova (16)

Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages
 
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
 
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflectionSigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
 
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
 
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological InflectionSIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
 
Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2
 
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1
 
Evaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in PsychologyEvaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in Psychology
 
Contextualization of Morphological Inflection
Contextualization of Morphological InflectionContextualization of Morphological Inflection
Contextualization of Morphological Inflection
 
Paradigm Completion for Derivational Morphology
Paradigm Completion for Derivational MorphologyParadigm Completion for Derivational Morphology
Paradigm Completion for Derivational Morphology
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language Processing
 
Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017
 
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
 
Working with text data
Working with text dataWorking with text data
Working with text data
 
Neural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chantsNeural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chants
 
Russia, Russians and Russian language
Russia, Russians and Russian languageRussia, Russians and Russian language
Russia, Russians and Russian language
 

Recently uploaded

ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 

Recently uploaded (20)

YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 

Ekaterina Vylomova/Brown Bag seminar presentation

  • 1. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Associative thesauri: structure and analysis Brown bag seminar Ekaterina Vylomova Fulbright scholar at Montclair State University February 21, 2014 E. Vylomova Associative thesauri
  • 2. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Brief bio Brief Bio 2011: MSc, Bauman Moscow State Technical University E. Vylomova Associative thesauri
  • 3. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Brief bio Brief Bio 2011: MSc, Bauman Moscow State Technical University 2009: BSc, Bauman Moscow State Technical University E. Vylomova Associative thesauri
  • 4. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Brief bio Brief Bio 2011: MSc, Bauman Moscow State Technical University 2009: BSc, Bauman Moscow State Technical University 2009: Yandex School of Data Analysis (Moscow Institute of Physics & Technology) E. Vylomova Associative thesauri
  • 5. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's AE? Associative Experiments What's AE? Associative experiment is one of methods of psycholinguistics. It's based on method of free associations. Sir Francis Galton conducted the rst experiment in 1879. E. Vylomova Associative thesauri
  • 6. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's AE? Associative Experiments What's AE? Associative experiment is one of methods of psycholinguistics. It's based on method of free associations. Sir Francis Galton conducted the rst experiment in 1879. Types of AE Single Free Association Multiple Free Associations Single Controlled Association (synonym, noun, verb, hyponym, etc.) Multiple Controlled Associations E. Vylomova Associative thesauri
  • 7. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's associative thesaurus? Example of data AT for dierent languages Slavic Associative Thesauri What's associative thesaurus? E. Vylomova Associative thesauri
  • 8. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's associative thesaurus? Example of data AT for dierent languages Slavic Associative Thesauri Example of data EAT Word Associations CAT stimulated the following associations: DOG 49 0.52 MOUSE 8 0.08 BLACK 4 0.04 MAT 3 0.03 ANIMAL 2 0.02 EYES 2 0.02 GUT 2 0.02 KITTEN 2 0.02 E. Vylomova Associative thesauri
  • 9. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's associative thesaurus? Example of data AT for dierent languages Slavic Associative Thesauri AT for dierent languages English The Structure of Associations in Language and Thought (Deese, 1965) Word association (Cramer, 1968) An associative thesaurus of English and its computer analysis (Kiss et al., 1973) Word Association, rhyme and fragment norms (Nelson, McEvoy Schreiber, 1999) E. Vylomova Associative thesauri
  • 10. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's associative thesaurus? Example of data AT for dierent languages Slavic Associative Thesauri AT for dierent languages Dutch Word association norms with response times (De Groot, 1988) Word associations: Norms for 1,424 Dutch words in a continuous task (De Deyne Storms, 2008) Swedish A Swedish Associative Thesaurus (Lonngren, 1998) E. Vylomova Associative thesauri
  • 11. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's associative thesaurus? Example of data AT for dierent languages Slavic Associative Thesauri AT for dierent languages Japanese Construction of associative concept dictionary with distance information, and comparison with electronic concept dictionary (Okamoto Ishizaki, 2001) Building a word association database for basic Japanese vocabulary (Joyce, 2005) Korean Network analysis of Korean Word Associations(Jung et al., 2010) E. Vylomova Associative thesauri
  • 12. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's associative thesaurus? Example of data AT for dierent languages Slavic Associative Thesauri AT for dierent languages Czech Volne slovni parove asociace v cestine (Novak, 1988) Hebrew Free association norms in the Hebrew language (Rubinsten, 2005) E. Vylomova Associative thesauri
  • 13. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work What's associative thesaurus? Example of data AT for dierent languages Slavic Associative Thesauri Slavic Associative Thesauri Dictionary of associative norms in Russian (Leontiev,1973) Russian Associative Thesaurus (Karaulov et al.,2002) Slavic Associative Thesaurus(Russian, Belorussian,Bulgarian, Ukrainian) (Umtseva et al., 2004) Normas asociativas del espanol y del ruso(Sanchez Puig,Karaulov,Cherkasova, 2000) E. Vylomova Associative thesauri
  • 14. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Data Research Russian associative experiment description Time frame: 1988-1998 Participants: 11,000 1st-3rd year students; 34 specialities Stimuli: 6,624(initial list: 1,277) Associative pairs:1,032,522 (dierent - 462,500) Reactions:102,926 Subset used for analysis Stimuli: 6,577 Reactions:21,312 Associative pairs:102,516 Dataset Set of triplets: c , r , w , where w = i j ij E. Vylomova ij freqij n 1 freqij Associative thesauri j= .
  • 15. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Data Research Comparison with frequency dictionary of Russian language Frequency dictionary Frequency dictionary of modern Russian language (Lyashevskaya, Sharov, 2009). Based on the texts from Russian National Corpus (www.ruscorpora.ru) and includes information about 20,000 most common words in Russian language. RAT Lemmatisation RAT-MyStem(Segalovich, 2003)-lemmas E. Vylomova Associative thesauri
  • 16. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Data Research Comparison with frequency dictionary of Russian language TOP-11 Nouns RAT FreqDict E. Vylomova Associative thesauri Human Home, House Money Day Friend Home Male Fool Business Life Illness Year Human Time Business Life Day Hand Work Word Place Friend
  • 17. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Data Research Comparison with frequency dictionary of Russian language Semantic primes? Concept Human: human child friend male Concept Time: day time Adjectives: good bad big. These concepts don't change over the time. Positive correlation with semantic primes (Wierzbicka) E. Vylomova Associative thesauri
  • 18. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Description Associative Network based on RAT'98 Network analysis Description Nodes correspond to words(lemmas) Edges correspond to associations Edge's weight correspond to association strength E. Vylomova Associative thesauri
  • 19. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Description Associative Network based on RAT'98 Network analysis Main characteristics of the network Nodes: |V | = 23, 195, among them: nodes with outgoing edges(stimuli): |S | = 1, 883 nodes with incoming edges(reactions): |R | = 16, 618 nodes with both types of edges: |SR | = 4, 694 Edges: |E | = 102, 516 E. Vylomova Associative thesauri
  • 20. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Description Associative Network based on RAT'98 Network analysis Table of network characteristics Sign N L D k ψ Description Number of nodes average shortest path length Diameter Average node degree Degree distribution (P(k)) par. Directed 23,195 3.98 9 4.42 2.2 Directed to undirected w ij = wji = wij + wji Degree distribution function P (k ) ≈ k −ψ E. Vylomova Associative thesauri Undirected 23,195 3.83 8 8.83 1.85
  • 21. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Description Associative Network based on RAT'98 Network analysis Small-world networks Denition Introduced by Milgram, 1967 (The small world problem) L ∝ log (N ),i.e. distance L between two randomly chosen nodes grows proportionally to the logarithm of the number of nodes N in the network Also known as Six degrees of separation Examples World Wide Web (WWW; Adamic, 1999; Albert, Jeong, Barabasi, 1999), networks of scientic collaboration (Newman, 2001),metabolic networks in biology (Jeong, Tombor, Albert, Oltval, Barabasi, 2000) E. Vylomova Associative thesauri
  • 22. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Description Associative Network based on RAT'98 Network analysis Scale-free networks Description Amaral, Scala et al., 2000 studied small-world networks and compared degree distribution function P (k ). 2 types of distribution: exponential(power grid system in USA, neural system of C.elegans) power law(WWW, metabolic networks): P (k ) = k −ψ , ψ ∈ (2..4) Scale-free networks provide better signal propagation. E. Vylomova Associative thesauri
  • 23. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Description Associative Network based on RAT'98 Network analysis Scale-free networks Other examples Similar results were obtained for Roget thesaurus(Roget, 1911),WordNet and associative networks(Steyvers and Tenenbaum, 2005). E. Vylomova Associative thesauri
  • 24. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Three Models of Associative Network Concept-based model Vector-based models Multidimensional scaling(Torgerson,1958) Latent Semantic analysis(Landauer, Dumais, 1997) E. Vylomova Associative thesauri
  • 25. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Data Core of the network: 4,692 lemmas with 59,392 connections The structure is similar to associative network(nodes-lemmas, edges - associations) Activity accumulation 1. Initial state: random activity 2. Spreading of activation: S = S −1 + w S −1 , where S is activity of neuron i at the moment t . 3. Activation exceeds the threshold = produce the reaction. S = 0. t t i i t j ij t i E. Vylomova Associative thesauri t j i
  • 26. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Pros and cons Pros very simple model easy to understand easy to modify(no need in reevaluation of the model) Cons unclear how to choose the threshold value(we did series of experiment to nd optimal value) once activation is released, should we also do modication for neighbouring neurons? E. Vylomova Associative thesauri
  • 27. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Multidimensional scaling From concept to vector Distance matrix:  δ1,1 δ1,2 · · · δ2,1 δ2,2 · · ·  = . . .. .  . . . . δI ,1 δI ,2 · · ·  δ1,I δ2,I   .  .  . δ I ,I where I means number of objects(words). Our goal is to nd such vectors x1 , ..., x ∈ R that for all i , j ∈ I . In other words: 2 min 1 ,.., ( x −x −δ ) . I x xI i j i j N ij E. Vylomova Associative thesauri x i − xj ≈ δij
  • 28. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Latent Semantic Analysis From concept to vector-2 Technique of analysing relationships between a set of documents and the terms they contain by producing a set of concepts related to the documents and terms. In my case Terms are lemmas, document is a set of associations for a given stimulus. Inputs: term-document matrix with TF*IDF values Term frequency: TF = w = , Inverse document freqij ij N 1 freqij j= frequency: IDF = log |S | is a total number of stimuli.Singular Value Decomposition = vector representations. |S | |s ∈ S : r ∈ s | , E. Vylomova Associative thesauri
  • 29. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Clustering k-means So, we've got vectors. What's next? Let's evaluate similarity: First, set a distance metric, e.g. d = =1 |x − x | And use it with k-means clustering: min =1 ∈ (x − µ )2 , where k is a number of clusters, S are evaluated clusters,µ are centers of the clusters. So, the technique is based on nding the nearest cluster. r ij N k r ik jk k i xj Si j i i E. Vylomova i Associative thesauri
  • 30. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Clustering E. Vylomova Associative thesauri
  • 31. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Pros and cons Pros easy to operate with vectors: add, multiple, subtract, etc. possible to set preferred dimensionality and visualize Cons problem with storage: matrices are huge complexity: MDS and LSA are based on SVD; it takes O (n3 ) choosing optimal number of clusters and dimensionality E. Vylomova Associative thesauri
  • 32. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Tip of the tongueapplication DataMethod Data: RAT+Abramov's synonym dictionary Method: LSA+k-means E. Vylomova Associative thesauri
  • 33. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Models of Associative Network Concept-based model Tip of the tongueapplication DataMethod Usage of associative thesauri for solving tasks related to the “tip of the tongue” phenomenon Ekaterina Vylomova Bauman Moscow State Technical University, Moscow State University of Printing Arts DATA INTRODUCTION The tip-of-the-tongue(TOT) phenomenon is the failure to retrieve a word from memory, combined with partial recall and the feeling that retrieval is imminent. People in a tip-of-the-tongue state can often recall one or more features of the target word, such as the first letter, its syllabic stress, and words similar in sound and/or meaning. •TOT appears to be universal (Brennenet al. 2007) •An occasional tip-of-the-tongue state is normal for people of all ages •TOT becomes more frequent as people age. R. Braun, D. McNeill and A. Luria consider the processes of recalling and naming the words as processes of probabilistic choice of a word from involuntary associations’ chain and relate them to the construction of human semantic memory. Abramov. Dictionary of Russian synonyms and similar expressions, 1890-1999 19,297 words phrases 18,136 synonym articles Karaulov Y.,, Tarasov E., Sorokin Y., Ufimtseva N., Cherkasova G.. 1999. Associative thesaurus of modern russian language. RAS, Moscow. 56,540 associative pairs 50,923 associative pairs (after lemmatization) 26,803 lemmas Overall (synonym and associative pairs combined together) 316,018 METHODOLOGY RAT RAT Abramov RAT Abramov dict. lemmas Lemmatization Abramov RAT dict. Lemmatization using Yandex mystem stemmer LSA k-NN Apply Latent Semantic Algorithm to get vector representation of words and k-nearest neighbours for clustering Clusters containing similar by meaning and association words EXAMPLE Hmm...What's the name Hmm...What's the name of that Ukranian food? of that Ukranian food? Associative thesauri+Abramov dictionary: Комильфо - приличие After clustering: не выходить из пределов благопристойности 0.001 степенный 0.591 чинный 0.591 благочинный 0.646 бонтонный 0.646 комильфотный 0.646 пристойный 0.646 благонравный 0.684 благоприличный 0.684 благопристойный 0.684 корректный 0.684 REFERENCES FUTURE PLANS 1. Expand synonym and associative thesauri with new ones 2. Add first letter filtering (see above) 3. Add hyponyms and hyperonyms RHF #12-04-12039B E. Vylomova 1. Brown, R., and McNeill, D. (1966). The tip of the tongue phenomenon. Journal of Verbal Learning and Verbal Behavior 5, 325-337. 2. Караулов Ю.Н., Тарасов Е.Ф., Сорокин Ю.А., Уфимцева Н.В., Черкасова Г.А. (1999). Ассоциативный тезаурус современного русского языка. РАН. (russian) 3. Лурия А.Р. (1979). Язык и сознание.//под редакцией Хомско Е.Д., МГУ, Москва - 320 стр.(russian) E-mail: evylomova@gmail.com Associative thesauri
  • 34. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work RAT'10 Time frame: 20 years after the rst one(2009-2010) Location: dierent regions of Russia. Stimuli included 1000 most frequent words in Russian language. The participants: young people at the age of 17-25. E. Vylomova Associative thesauri
  • 35. Introduction Associative Experiments Associative Thesauri Russian Associative Thesaurus'98 Associative Network(Graph) Modelling of Associative Network Future work Thank you! Questions? E. Vylomova Associative thesauri