Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Metaphor Mining in Historical German Novels: Using Unsupervised Learning to Uncover Conceptual Systems in Literature
1. DH2016
Krakow , July 14th 2016
METAPHOR MINING IN HISTORICAL GERMAN NOVELS:
USING UNSUPERVISED LEARNING TO UNCOVER
CONCEPTUAL SYSTEMS IN LITERATURE.
STEFAN PERNES
UNIVERSITY OF WÜRZBURG
2. ‣ a systematic mapping, or rather, the projection of one
domain (e.g. light) onto another (e.g. beauty)
‣ a (usually abstract) concept can be experienced through
another (usually concrete) concept
DEFINITIONS
Der Mensch ist ein Wolf.
Die Tante meckert.
Es ist strahlend schön.
‣ linguistic metaphor
Man is a wolf.
The aunt bleats (complains).
It is radiantly beautiful.
3. ‣ conceptual metaphor
DEFINITIONS
Wie geht’s?
Es läuft gut.
Es geht voran.
LIFE IS A JOURNEY ARGUMENT IS WAR
Das Argument ist vernichtend.
Die Theorie hält stand.
Die Kritik trifft gezielt.
‣ a cognitive ‘bracket’ that encompasses all possible specific realisations
‣ a cognitive phenomenon that expresses itself in language
It’s going well. The theory withstands.
5. ‣ metaphor as lexeme vs. metaphor as an utterance
DEFINITIONS
6. ‣ noun metaphor
‣ copula-construction (“ist”, “wie”, “als”)
“Peter ist ein schräger Vogel.”
‣ genitive metaphor
“der Strom des Lebens”
‣ compound metaphor
“Geistesblitz”
FEATURES
“the stream of life”
“Peter is an awkward
bird (a crazy guy)”
“flash of inspiration”
7. ‣ verb metaphor:
‣ adjective metaphor:
FEATURES
“die Ideen sprudelten”
“ein strahlender Tag”
overview on metaphor constructions in
German, cf. Skirl & Schwarz-Friesel 2007
“ideas gush”
“a radiant (bright) day”
‣ preposition metaphor
“in Gedanken versunken”
“deep in thought”
10. PREPROCESSING
‣ process corpus with POS-tagging and dependency parsing
‣ extract metaphor candidate constructions
‣ build similarity matrix using a distance/divergence measure
(here: Jensen-Shannon divergence)
12. EVALUATION
"HGFC operates with a precision of P = 0.69, whereas the baselines
attain P = 0.36 (AGG) and P = 0.29 (WN). The precision of annotator
judgements against each other (the human ceiling) is P = 0.80,
suggesting that this is a challenging task."
Shutova & Sun 2013
➡ a manually annotated metaphor corpus for German is in the works
13. ‣ a combination of texts from TextGrid's digital library and from
the Gutenberg project
‣ 1418 works
‣ 116,423,486 tokens
DATASET
24. THANK YOU!
• Shutova, E. and Sun, L., 2013, June. Unsupervised Metaphor Identification Using
Hierarchical Graph Factorization Clustering. In HLT-NAACL (pp. 978-988).
• Skirl, Helge, and Monika Schwarz-Friesel. 2007. Metapher. Universitätsverlag Winter.
• Yu, K., Yu, S. and Tresp, V., 2005. Soft clustering on graphs. In Advances in neural
information processing systems (pp. 1553-1560).
References