Our presentation about open-collaboration given at the International Conference on Collaborative Innovation Networks (COINs2011) in Basel, Switzerland, Sep. 9, 2011. The video of this presentation is available at the Livestream site http://www.livestream.com/coinsconference
Editorial Collaboration Networks of Wikipedia Articles in Various Languages
1. COINs2011
Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
Takashi Iba
Ko Matsuzuka
Daiki Muramatsu
Faculty of Policy Management, Keio University
2. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
• The characteristics of collaboration patterns of
all articles in a certain language.
• The commonality and differences of collaboration
patterns among Wikipedias written in various
languages.
3. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
Method: Sequential collaboration network
Analysis 1: Comparison of 12 different languages
Analysis 2: Distribution of account and IP users
Analysis 3: Distribution of Featured Articles
4. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
Method: Sequential collaboration network
Analysis 1: Comparison of 12 different languages
Analysis 2: Distribution of account and IP users
Analysis 3: Distribution of Featured Articles
5. Method: Sequential collaboration network
Building a sequential collaboration network, connecting a
relation from editor A to editor B, if editor B follows on
work done by editor A.
order
1 A A
2
B
3
4 A B
C C
5
6. Sequential Collaboration Network of Article
“Collaborative Innovation Networks” in English Wikipedia
The number of Nodes = 51
Average path length = 6.399
10. Method: Sequential collaboration network
Building a sequential collaboration network, connecting a
relation from editor A to editor B, if editor B follows on
work done by editor A.
order
1 A A
2
B
3
4 A B
C C
5
11. Our Previous Study: Featured Articles in English Wikipedia
each sequential collaboration network
The average path length of Linear graph
2,545 articles [Jun 27 2009]
The order of each sequential collaboration network
(The number of editors in each article)
T. Iba, K. Nemoto, B. Peters & P. Gloor, "Analyzing the Creative Editing Behavior of Wikipedia
Editors Through Dynamical Social Network Analysis", COINs2011, 2009
T. Iba and S. Itoh, "Sequential Collaboration Network of Open Collaboration", NetSci'09, 2009
12. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
Method: Sequential collaboration network
Analysis 1: Comparison of 12 different languages
Analysis 2: Distribution of account and IP users
Analysis 3: Distribution of Featured Articles
13. Analysis 1: Comparison of 12 different languages
Target Languages
Rank 1: English
Rank 2: German
Rank 3: French
Rank 4: Polish
Rank 5: Italian
Rank 6: Japanese
Rank 7: Spanish
Rank 8: Dutch
Rank 9: Portuguese
Analyzing ALL articles as of Rank 10: Russian
January 1st, 2011 in each language. …
Rank 15: Finnish
The ranking based on the data as of …
January 6th, 2011. Rank 20: Turkish
14. English Rank 1
3,490,325 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
15. English Rank 1
3,490,325 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
16. German Rank 2
1,155,210 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
17. French Rank 3
1,039,251 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
18. Polish Rank 4
752,734 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
19. Italian Rank 5
750,634 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
20. Japanese Rank 6
718,974 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
21. Spanish Rank 7
676,866 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
22. Dutch Rank 8
656,079 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
23. Portuguese Rank 9
638,747 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
24. Russian Rank 10
627,139 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
25. Finnish Rank 15
255,712 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
26. Turkish Rank 20
152,262 articles
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
27. English German French Polish
Italian Japanese Spanish Dutch
Portuguese Russian Finnish Turkish
28. Result of Analysis 1: Comparison of 12 different languages
• Scatter plot of all articles exhibits a tilted triangle
in all languages.
• The height of triangle gets shorter as the number of
articles decreases.
29. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
Method: Sequential collaboration network
Analysis 1: Comparison of 12 different languages
Analysis 2: Distribution of account and IP users
Analysis 3: Distribution of Featured Articles
31. Scatter plot of articles in English Wikipedia
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
32. Scatter plot of articles with
number of IP users / number of total editors
each sequential collaboration network
The average path length of
0.0 PIP 1.0
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
33. Scatter plot of articles with
number of IP users / number of total editors
PIP = 0.0 PIP = 0.1 PIP = 0.2
PIP = 0.3 PIP = 0.4 PIP = 0.5
PIP = 0.6 PIP = 0.7 PIP = 0.8
34. Scatter plot of articles with
number of IP users / number of total editors
each sequential collaboration network
The average path length of
0.0 PIP 1.0
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
35. Result of Analysis 2: Distribution of account and IP users
• Top and right area of the “triangle” in scatter plot
consist of articles which ratios of users is high.
• As a result, both the average path length and order
of network can be large in these areas.
PIP = 0.0 PIP = 0.6
36. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
Method: Sequential collaboration network
Analysis 1: Comparison of 12 different languages
Analysis 2: Distribution of account and IP users
Analysis 3: Distribution of Featured Articles
37. Analysis 3: Distribution of Featured Articles
3,372 featured articles / 3,732,033 articles
In English Wikipedia
38. Scatter plot of all articles in English Wikipedia
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
39. Scatter plot of featured articles on the all articles
in English Wikipedia
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
40. Scatter plot of featured articles on the all articles
in English Wikipedia
each sequential collaboration network
The average path length of
Double logarithmic graph
The order of each sequential collaboration network
(The number of editors in each article)
41. Result of Analysis 3: Distribution of Featured Articles
• Features articles are located at a certain area in the
scatter plot.
• It implies that there would be characteristic
patterns of collaboration producing good results.
42. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
Method: Sequential collaboration network
Analysis 1: Comparison of 12 different languages
Analysis 2: Distribution of account and IP users
Analysis 3: Distribution of Featured Articles
43. Editorial Collaboration Networks of
Wikipedia Articles in Various Languages
• Scatter plot of all articles commonly
exhibits a tilted triangle in all languages,
but the height of triangle gets shorter as the
number of articles decreases.
• Top and right area of the “triangle” in
scatter plot consist of articles which the
ratios of IP users are high.
• Features articles are located at a certain
area in the scatter plot.
44. Collaborators “Editorial Collaboration Networks of
Wikipedia Articles in Various Languages”
Contact us: e-mail to iba@sfc.keio.ac.jp
Takashi Iba
Associate Professor,
Faculty of Policy Management,
Keio University
Ph.D. in media and governance
Natsumi Yotsumoto
Ko Matsuzuka
Former student of Iba Lab.
Iba Lab. Faculty of Policy Management,
Faculty of Policy Management, Keio University
Keio University
Bui Hong Ha
Daiki Muramatsu
Former student of Iba Lab.
Iba Lab. Faculty of Policy Management,
Faculty of Policy Management, Keio University
Keio University