This document summarizes a project that models music metadata in semantic graphs to improve music description and foster music exchange. The project aims to connect different music data sources, enrich the user experience, and make vocabularies and data publicly available as linked open data. The project develops a music-specific data model and tools for visualization, recommendation, and other applications. State-of-the-art music ontologies are extended and complex metadata from different sources is converted and interlinked using controlled vocabularies and natural language processing.
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Modeling Music Metadata Complexity in Semantic Graphs
1. Modeling the Complexity of Music Metadata
in Semantic Graphs for Exploration and Discovery
ANR-14-CE24-0020
@pasqlisena
pasquale.lisena@eurecom.fr
Pasquale Lisena, Raphaël Troncy, Konstantin Todorov, Manel Achichi
Digital Libraries for Musicology (DLfM) Workshop
28th October 2017 | Shanghai Conservatory of Music
3. Information contained in librarian knowledge
but not publicly available
Hard question for current
music models and ontologies
Different practical implications
(MIR, concert and radio programming, music recommendation)
3
5. Project Goals • Improve music description to foster
music exchange and reuse
• Connect sources, multiply usage,
enrich user experience
• Music specific data model
• Vocabularies and data public available as
Linked Open Data
• Tools for visualization, interconnections,
recommendation
• Experience and praxis for other institutions
5
6. Works
62 550 | XML
Scores
9 154 | XML
Concerts
340 609 | XML
Discs
9 500 | XML
Works
6 846 | UNIMARC
Scores
30 319 | UNIMARC
Concerts
5 164 | XML
Discs
8 602 | XML
Source Datasets
Works
135 940 | INTERMARC
Scores
89 184 | INTERMARC
6
8. State of the Art: MusicOntology
- One of the first example of describing
music using Semantic Web
- Extend FRBR, Timeline Ontology,
Event Ontology
- Uses vocabularies for Keys, Musical
Instrument (by MusicBrainz), Genres
(DBpedia)
8
Raimond, Samer A. Abdallah, Mark B. Sandler, and Frederick Giasson. 2007. The Music Ontology. In 15th
International Conference on Music Information Retrieval (ISMIR). 417–422
9. The DOREMUS model
F15
Work
F22
Expression
F28
Expression
Creation
- Music specific extension of
FRBRoo
- Triplet pattern:
Work-Expression-Event
- Dynamic:
every triplet is autonomous, and
linkable to the other ones
- Relies on Linked Data principles
(everything is an URI,
RDF model)
9http://data.doremus.org/ontology
10. F14
Work
F22
Expression
M2
Opus
Statement
F28
Expression
Creation
R3 is
realized in
E7
Activity
5
1
“Sonate pour violoncelle et piano no 1”@fr
“Sonates" , "Sonata in F"
Ludwig van
Beethoven
Ludwig von Beethoven
composer
compositeur@fr
compositore@it
U17 has opus
statement
U12 has
genre
P102 has title
U31 had
function of
type
P14 carried
out by
P9 consists
of
P4 has time
span1796
Sonata
sonata@it , sonate@fr ,
klaviersonate@de
M42 Performed
Expression
Creation
M43
Performed
Expression
Berlin
P4 has time
span
1796
P7 took
place at
F24 Publication
Expression
F30
Publication
Event
P4 has
time span
1797
P7 took
place at
Vienna
U4 had princeps
publication
U54 is performed
expression of
P165
incorporates
1770
1827
P98
born
P100
died
F Major
F Dur@de , Fa majeur@fr,
Fa maggiore@it , Fa mayor@es
M6
Casting
M23
Casting
Detail 1
U30
quantity
U2
foresees
mop
Piano
Pianoforte@it
Fortepian@pl
M23
Casting
Detail
1
U30
quantity
U2
foresees
mop
Cello
Violoncello@it
Violoncelle@fr
F15
Complex
Work
F19
Publication
Work
M44
Performed
Work
U5 had
premiere
U38 has
descriptive
expression
R10 has member
15. 001 FRBNF139081882FR
100 $313891295$w.0..b.....$aBeethoven$mLudwig van$d1770-1827
144 $w....b.fre.$aSonates$bPiano$pOp. 27, no 2$tDo dièse mineur
LANG TITLE MOP OPUS KEY
“MARC must die” -- Roy Tennant, 2002
http://lj.libraryjournal.com/2002/10/ljarchives/marc-must-die/#_
16. MARC issues
16
• Different variants
UNIMARC, INTERMARC
• Free text field
different practices in describing the same information
“Op. 27 n. 2” - “Op. 27 no 2”
• Frequent mistakes in editorial work
wrong fields, typos, wrong punctuation
19. Interlinking: Works
19
1. Data cleaning
removing “noisy” properties, i.e. identifiers, comments, …
2. Instance profiling
represent each resource as sub-graph
3. Instance indexing and matching
convert the sub-graph in a set of keywords in order to
apply text document matching techniques
4. Post-processing
Clustering of the datasets, identify false positive of
previous points
21. Future Work
21
• Pivot Vocabularies of Genres and MoPs
as result of the interconnection task
• Recommendation System
first step: “Combining Music Specific Embeddings for
Computing Artist Similarity” @ISMIR2017
• Schema.org injection in all pages
goals: SEO optimization, simplification of the data in
order to extend their usage