SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Networks of Music Groups as
Success Predictors
Dmitry Zinoviev
Department of Mathematics and Computer Science
Suffolk University, Boston
Dmitry Zinoviev * Suffolk University 2
Research Question
Who Rocks and Why?
Dmitry Zinoviev * Suffolk University 3
Real Research Questions
● Does sharing performers with other groups
influence the groups' eventual success?
● If so, is the success predictable from the
performers' sharing network?
● What is the linguocultural and genre structure
of the ex-Soviet music universe?
Dmitry Zinoviev * Suffolk University 4
Research Strategy
● Collect data about sharing and success
● Build a network based on shared musicians
● Define “success”
● Correlate network measures (such as centralities)
with success measures
● Attempt to predict success from the network
measures using machine learning techniques
● Look into genres/languages and communities
Dmitry Zinoviev * Suffolk University 5
DATA
Dmitry Zinoviev * Suffolk University 6
Data Set
● 4,560 non-academic music groups performing in
the USSR and post-Soviet countries in 1960–2015
● 17,000 performers (at least 3,600 shared)
● 275 genres (rock, pop, disco, jazz, folk, etc.)
● Wikipedia pages in 122 languages
Dmitry Zinoviev * Suffolk University 7
New Groups by Year
Dmitry Zinoviev * Suffolk University 8
2,216 Groups on Wikipedia
● Russia
● Estonia
● Ukraine
● Latvia
● Lithuania
● Belarus
● Moldova
Dmitry Zinoviev * Suffolk University 9
NETWORK
Dmitry Zinoviev * Suffolk University 10
Network Construction
●
Group → node; labels in the original language
● Two nodes connected if the groups shared at least
one musician over their lifetime
● Undirected, unweighted, unconnected graph with
no loops and no parallel edges
● For each node, calculate degree, average neighbors
degree, closeness, betweenness, and eigenvalue
centrality, and clustering coefficient
Dmitry Zinoviev * Suffolk University 11
Network
Overview
● Node size
represents
degree
(number of
shares)
Dmitry Zinoviev * Suffolk University 12
Network Description
● 80% of the groups (3,602) are in the giant
connected component; all other connected
components have <13 groups each
● Excellent community structure (m=0.76), 43
communities; each of the largest 25 communities
has 20+ groups
● Community = groups that have a lot of mutual
musician sharing
Dmitry Zinoviev * Suffolk University 13
SUCCESS
Dmitry Zinoviev * Suffolk University 14
What's “Success”?
● No sales data!
● No charts!
● Informal/semi-legal/illegal status
● Proxies for long-term success (we still remember them!):
– Wikipedia page(s) visit frequency within last 3 years (collected
from http://stats.grok.se)
– Wikipedia page(s) Google PageRank
– Available for 2,000 groups
Dmitry Zinoviev * Suffolk University 15
PageRank (PR) Correlations
Dmitry Zinoviev * Suffolk University 16
Visit Frequency (VF) Correlations
Dmitry Zinoviev * Suffolk University 17
Prediction
● Random Decision Forest (RDF) machine learning
predictor
● Predict above-median VF vs below-median VF:
accuracy 71% (expected by chance: 50%)
● Predict Google PR: accuracy 49% (expected by
chance: 17%)
● Quite poor, but not hopeless
Dmitry Zinoviev * Suffolk University 18
GENRES
Dmitry Zinoviev * Suffolk University 19
Genres and Sharing
● Build a network of similar genres (recursive
generalized similarity):
– Two genres are similar if used by similar groups
– Two groups are similar if play similar genres
●
Genre → node; two nodes are connected if the
genres are “very similar”
● Community structure (m=0.3):
– Punk/jazz, metal, disco/pop, blues/hip-hop, light rock
Dmitry Zinoviev * Suffolk University 20
Genre
Network
Metal
Light rock
Punk
Soul
Folk/jazz/hh
Disco
Ethno
Some genres are
hierarchical
(rock/metal/black metal).
TODO: Assign them to
different levels.
Dmitry Zinoviev * Suffolk University 21
Musicians Prefer Similar Genres
Dmitry Zinoviev * Suffolk University 22
LINGUOCULTURAL
STRUCTURE
Dmitry Zinoviev * Suffolk University 23
Languages, Genres, and Sharing
● Group sharing network has 25 communities with
20+ groups in each
● Preferred language = language of the most
frequently visited Wikipedia page
● Look into genres and preferred languages within
each community: Are they homo- or
heterogeneous?
Dmitry Zinoviev * Suffolk University 24
Genres per Community
In 9
communities,
>50% of groups
perform the one
genre.
In 23
communities,
>50% of groups
perform in no
more than 2
genres.
71% of all
shares—
homogeneous
Dmitry Zinoviev * Suffolk University 25
Preferred Languages per Community
In 24
communities,
>50% of groups
have the same
preferred
language!
84% of all shares
—homogeneous
Dmitry Zinoviev * Suffolk University 26
Language and Genre Homogeneity: Either or Both?
Language-defined
Genre-defined
Not very convincing?
Mixed
Dmitry Zinoviev * Suffolk University 27
Conclusion
● Musician sharing networks of non-academic music
groups in the USSR and post-Soviet countries have
community structure inspired by preferred
language and musical genre
● Centrality and clustering measures of this network
are correlated with long-term success of groups in
terms of popularity on Wikipedia and to some
extent can serve as success predictors

Weitere ähnliche Inhalte

Mehr von Dmitry Zinoviev

Machine Learning Basics for Dummies (no math!)
Machine Learning Basics for Dummies (no math!)Machine Learning Basics for Dummies (no math!)
Machine Learning Basics for Dummies (no math!)Dmitry Zinoviev
 
WHat is star discourse in post-Soviet film journals?
WHat is star discourse in post-Soviet film journals?WHat is star discourse in post-Soviet film journals?
WHat is star discourse in post-Soviet film journals?Dmitry Zinoviev
 
The “Musk” Effect at Twitter
The “Musk” Effect at TwitterThe “Musk” Effect at Twitter
The “Musk” Effect at TwitterDmitry Zinoviev
 
Are Twitter Networks of Regional Entrepreneurs Gendered?
Are Twitter Networks of Regional Entrepreneurs Gendered?Are Twitter Networks of Regional Entrepreneurs Gendered?
Are Twitter Networks of Regional Entrepreneurs Gendered?Dmitry Zinoviev
 
Using Complex Network Analysis for Periodization
Using Complex Network Analysis for PeriodizationUsing Complex Network Analysis for Periodization
Using Complex Network Analysis for PeriodizationDmitry Zinoviev
 
Text analysis of The Book Club Play
Text analysis of The Book Club PlayText analysis of The Book Club Play
Text analysis of The Book Club PlayDmitry Zinoviev
 
Exploring the History of Mental Stigma
Exploring the History of Mental StigmaExploring the History of Mental Stigma
Exploring the History of Mental StigmaDmitry Zinoviev
 
Roles and Words in a massive NSSI-Related Interaction Network
Roles and Words in a massive NSSI-Related Interaction NetworkRoles and Words in a massive NSSI-Related Interaction Network
Roles and Words in a massive NSSI-Related Interaction NetworkDmitry Zinoviev
 
“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...
“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...
“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...Dmitry Zinoviev
 
Network analysis of the 2016 USA presidential campaign tweets
Network analysis of the 2016 USA presidential campaign tweetsNetwork analysis of the 2016 USA presidential campaign tweets
Network analysis of the 2016 USA presidential campaign tweetsDmitry Zinoviev
 
Network Analysis of The Shining
Network Analysis of The ShiningNetwork Analysis of The Shining
Network Analysis of The ShiningDmitry Zinoviev
 
The Lord of the Ring. A Network Analysis
The Lord of the Ring. A Network AnalysisThe Lord of the Ring. A Network Analysis
The Lord of the Ring. A Network AnalysisDmitry Zinoviev
 
DaVinci Code. Network Analysis
DaVinci Code. Network AnalysisDaVinci Code. Network Analysis
DaVinci Code. Network AnalysisDmitry Zinoviev
 
C for Java programmers (part 2)
C for Java programmers (part 2)C for Java programmers (part 2)
C for Java programmers (part 2)Dmitry Zinoviev
 

Mehr von Dmitry Zinoviev (20)

Machine Learning Basics for Dummies (no math!)
Machine Learning Basics for Dummies (no math!)Machine Learning Basics for Dummies (no math!)
Machine Learning Basics for Dummies (no math!)
 
WHat is star discourse in post-Soviet film journals?
WHat is star discourse in post-Soviet film journals?WHat is star discourse in post-Soviet film journals?
WHat is star discourse in post-Soviet film journals?
 
The “Musk” Effect at Twitter
The “Musk” Effect at TwitterThe “Musk” Effect at Twitter
The “Musk” Effect at Twitter
 
Are Twitter Networks of Regional Entrepreneurs Gendered?
Are Twitter Networks of Regional Entrepreneurs Gendered?Are Twitter Networks of Regional Entrepreneurs Gendered?
Are Twitter Networks of Regional Entrepreneurs Gendered?
 
Using Complex Network Analysis for Periodization
Using Complex Network Analysis for PeriodizationUsing Complex Network Analysis for Periodization
Using Complex Network Analysis for Periodization
 
Algorithms
AlgorithmsAlgorithms
Algorithms
 
Text analysis of The Book Club Play
Text analysis of The Book Club PlayText analysis of The Book Club Play
Text analysis of The Book Club Play
 
Exploring the History of Mental Stigma
Exploring the History of Mental StigmaExploring the History of Mental Stigma
Exploring the History of Mental Stigma
 
Roles and Words in a massive NSSI-Related Interaction Network
Roles and Words in a massive NSSI-Related Interaction NetworkRoles and Words in a massive NSSI-Related Interaction Network
Roles and Words in a massive NSSI-Related Interaction Network
 
“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...
“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...
“A Quaint and Curious Volume of Forgotten Lore,” or an Exercise in Digital Hu...
 
Network analysis of the 2016 USA presidential campaign tweets
Network analysis of the 2016 USA presidential campaign tweetsNetwork analysis of the 2016 USA presidential campaign tweets
Network analysis of the 2016 USA presidential campaign tweets
 
Network Analysis of The Shining
Network Analysis of The ShiningNetwork Analysis of The Shining
Network Analysis of The Shining
 
The Lord of the Ring. A Network Analysis
The Lord of the Ring. A Network AnalysisThe Lord of the Ring. A Network Analysis
The Lord of the Ring. A Network Analysis
 
Pickling and CSV
Pickling and CSVPickling and CSV
Pickling and CSV
 
Python overview
Python overviewPython overview
Python overview
 
Welcome to CS310!
Welcome to CS310!Welcome to CS310!
Welcome to CS310!
 
Programming languages
Programming languagesProgramming languages
Programming languages
 
The P4 of Networkacy
The P4 of NetworkacyThe P4 of Networkacy
The P4 of Networkacy
 
DaVinci Code. Network Analysis
DaVinci Code. Network AnalysisDaVinci Code. Network Analysis
DaVinci Code. Network Analysis
 
C for Java programmers (part 2)
C for Java programmers (part 2)C for Java programmers (part 2)
C for Java programmers (part 2)
 

Kürzlich hochgeladen

Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingNetHelix
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx023NiWayanAnggiSriWa
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXDole Philippines School
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxnoordubaliya2003
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 

Kürzlich hochgeladen (20)

Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
Bioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptxBioteknologi kelas 10 kumer smapsa .pptx
Bioteknologi kelas 10 kumer smapsa .pptx
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTXALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
ALL ABOUT MIXTURES IN GRADE 7 CLASS PPTX
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 

Networks of Music Groups as Success Predictors

  • 1. Networks of Music Groups as Success Predictors Dmitry Zinoviev Department of Mathematics and Computer Science Suffolk University, Boston
  • 2. Dmitry Zinoviev * Suffolk University 2 Research Question Who Rocks and Why?
  • 3. Dmitry Zinoviev * Suffolk University 3 Real Research Questions ● Does sharing performers with other groups influence the groups' eventual success? ● If so, is the success predictable from the performers' sharing network? ● What is the linguocultural and genre structure of the ex-Soviet music universe?
  • 4. Dmitry Zinoviev * Suffolk University 4 Research Strategy ● Collect data about sharing and success ● Build a network based on shared musicians ● Define “success” ● Correlate network measures (such as centralities) with success measures ● Attempt to predict success from the network measures using machine learning techniques ● Look into genres/languages and communities
  • 5. Dmitry Zinoviev * Suffolk University 5 DATA
  • 6. Dmitry Zinoviev * Suffolk University 6 Data Set ● 4,560 non-academic music groups performing in the USSR and post-Soviet countries in 1960–2015 ● 17,000 performers (at least 3,600 shared) ● 275 genres (rock, pop, disco, jazz, folk, etc.) ● Wikipedia pages in 122 languages
  • 7. Dmitry Zinoviev * Suffolk University 7 New Groups by Year
  • 8. Dmitry Zinoviev * Suffolk University 8 2,216 Groups on Wikipedia ● Russia ● Estonia ● Ukraine ● Latvia ● Lithuania ● Belarus ● Moldova
  • 9. Dmitry Zinoviev * Suffolk University 9 NETWORK
  • 10. Dmitry Zinoviev * Suffolk University 10 Network Construction ● Group → node; labels in the original language ● Two nodes connected if the groups shared at least one musician over their lifetime ● Undirected, unweighted, unconnected graph with no loops and no parallel edges ● For each node, calculate degree, average neighbors degree, closeness, betweenness, and eigenvalue centrality, and clustering coefficient
  • 11. Dmitry Zinoviev * Suffolk University 11 Network Overview ● Node size represents degree (number of shares)
  • 12. Dmitry Zinoviev * Suffolk University 12 Network Description ● 80% of the groups (3,602) are in the giant connected component; all other connected components have <13 groups each ● Excellent community structure (m=0.76), 43 communities; each of the largest 25 communities has 20+ groups ● Community = groups that have a lot of mutual musician sharing
  • 13. Dmitry Zinoviev * Suffolk University 13 SUCCESS
  • 14. Dmitry Zinoviev * Suffolk University 14 What's “Success”? ● No sales data! ● No charts! ● Informal/semi-legal/illegal status ● Proxies for long-term success (we still remember them!): – Wikipedia page(s) visit frequency within last 3 years (collected from http://stats.grok.se) – Wikipedia page(s) Google PageRank – Available for 2,000 groups
  • 15. Dmitry Zinoviev * Suffolk University 15 PageRank (PR) Correlations
  • 16. Dmitry Zinoviev * Suffolk University 16 Visit Frequency (VF) Correlations
  • 17. Dmitry Zinoviev * Suffolk University 17 Prediction ● Random Decision Forest (RDF) machine learning predictor ● Predict above-median VF vs below-median VF: accuracy 71% (expected by chance: 50%) ● Predict Google PR: accuracy 49% (expected by chance: 17%) ● Quite poor, but not hopeless
  • 18. Dmitry Zinoviev * Suffolk University 18 GENRES
  • 19. Dmitry Zinoviev * Suffolk University 19 Genres and Sharing ● Build a network of similar genres (recursive generalized similarity): – Two genres are similar if used by similar groups – Two groups are similar if play similar genres ● Genre → node; two nodes are connected if the genres are “very similar” ● Community structure (m=0.3): – Punk/jazz, metal, disco/pop, blues/hip-hop, light rock
  • 20. Dmitry Zinoviev * Suffolk University 20 Genre Network Metal Light rock Punk Soul Folk/jazz/hh Disco Ethno Some genres are hierarchical (rock/metal/black metal). TODO: Assign them to different levels.
  • 21. Dmitry Zinoviev * Suffolk University 21 Musicians Prefer Similar Genres
  • 22. Dmitry Zinoviev * Suffolk University 22 LINGUOCULTURAL STRUCTURE
  • 23. Dmitry Zinoviev * Suffolk University 23 Languages, Genres, and Sharing ● Group sharing network has 25 communities with 20+ groups in each ● Preferred language = language of the most frequently visited Wikipedia page ● Look into genres and preferred languages within each community: Are they homo- or heterogeneous?
  • 24. Dmitry Zinoviev * Suffolk University 24 Genres per Community In 9 communities, >50% of groups perform the one genre. In 23 communities, >50% of groups perform in no more than 2 genres. 71% of all shares— homogeneous
  • 25. Dmitry Zinoviev * Suffolk University 25 Preferred Languages per Community In 24 communities, >50% of groups have the same preferred language! 84% of all shares —homogeneous
  • 26. Dmitry Zinoviev * Suffolk University 26 Language and Genre Homogeneity: Either or Both? Language-defined Genre-defined Not very convincing? Mixed
  • 27. Dmitry Zinoviev * Suffolk University 27 Conclusion ● Musician sharing networks of non-academic music groups in the USSR and post-Soviet countries have community structure inspired by preferred language and musical genre ● Centrality and clustering measures of this network are correlated with long-term success of groups in terms of popularity on Wikipedia and to some extent can serve as success predictors