SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Downloaden Sie, um offline zu lesen
NESSHI and GEPHI

Sociology of science as a breeding ground
for tool building in the digital humanities

Clement Levallois
Meertens Institute, 28 November 2013

1
Bio notes
PhD
(2003-2008)

Post-Doc
(2008-2013)

• Member of the Gephi Consortium

• Starting at EMLyon in January 2014
2
Plan
1. NESSHI (2011-2014)
2. Building tools to reveal insights from data in
the NESSHI project
3. A focus on GEPHI as a platform of
distribution for data processing tools (not
just networks!)
3
The take away
• NESSHI and other empirically oriented
projects are very likely to lead to the creation
of data processing tools to support the
analysis.
• Instead of “wasting” these tools in scripts that
will never be reused, one could look for
platforms of distribution to release them as
apps. Gephi is such a platform.
4
1. NESSHI
The Neuro-Turn in European Social Sciences and Humanities:
Impact of neurosciences on economics, marketing and
philosophy
5
NESSHI

6
The neuro-turn
• A focus on empirical investigations
• Questioning the turn itself:
– do we find it?
– is it unidirectional?

• A look at academia in relation to society
7
Challenges
• We tried to track a broad, undefined
intellectual movement through
– Time ( ~1985-~2013?)
– Scientific disciplines (all of them?)
– Science/society spheres (beware dichotomies!)
8
• Archival work

NRC, 1996
9
• Archival work – on the web

Neuromarketing in the press, 2003
(and 2007)
10
• Bibliographical data
– Neuroeconomics papers overlaid on an ISI map
(Rafols, Porter and Leydesdorff overlay toolkit approach)

– Co-author networks
– Tried journal maps…

11
• Online surveys
– 630+ individual emails:
“Who do you discuss with in neuroeconomics?”

A
I discuss with…

B
… this person

12
• Textual data
– Web of Knowledge:
coocurrences in abstracts of
papers in neuroeconomics
– Mainz bibliography in
neuroethics: clustering of
papers in neuroethics based
on a semi-manual
classification scheme.

13
+
• Interviews (NL),
• Ethnographies (UK),
• Experimental work (France),
• Philosophical analysis (France, Germany)
14
Results (NL team)
• Neuroeconomics (in 2012): a complete survey of the field
• Neuromarketing (2001 - ): a first history based on
archives, in academia and industry
• Neuroethics (1995 – 2012): a definition based on the
Mainz bibliography – collab with DE team.
• The neuro-turn society? Pictures of the neuro in Dutch
newspapers (1987-2011) (on going)
15
2. DEVELOPMENT OF TOOLS
FOR NESSHI

16
(note)
All software discussed here are available in open
source with Apache License 2 or equivalent at:
https://github.com/seinecle

17
Why new tools?
• Unstructured data brings specific needs
• Technologies evolve quickly – tools built 3 years ago don’t
exploit all possibilities available today
• From scripting to releasing:

– A script can take days to create, why not add a few hours of
work to make it useable by the larger public?

• Creating tools can be fun (actually, addictive).
• Is it worth always? Does it work always? No…
18
1. Calenicon (show demo at www.calenicon.org)
• Problem: how to edit / share / archive pictures in an online
environment?
• Using the vocabulary of VRAcore for annotations

VRAcore: data standard for the description of works of visual culture as well as the images that
document them. http://www.loc.gov/standards/vracore/

• Uses the API of www.datahub.io (CKAN), the repository of the Open
Knowledge Foundation (Figshare and DANS were other candidates).
• Developed in Java EE.
• www.calenicon.org

19
2. Alphabetical sorter
• Problem: how to get a neat alphabetical list of
scientists in Inkscape or Illustrator?
• Developed in Java as a plugin in Gephi
• Trivial (40 lines of code).
(open pdf to show example of use)
20
3. Overlay toolkit
•

Problem: exploratory work for the history of
neuroethics project

– can we see an evolution of a neuroethics in terms of ISI
categories?

•

Overlay toolkit by Rafols, Porter and Leydesdorff
(2010)

– Originally a series of command-line utilities available
from Loet Leydesdorff website, with an output to Pajek.

•

Excel macro to enable an export to Gephi (2011)

•

Overlyde (2012): stand alone Java program to
create dynamic viz in Gephi.

•

Now a Gephi plugin (2013)

– Allows for dynamic datasets, filtering, relative views...

21
4. Gaze
• Problem: how to create
networks of agents, based
on attributes they share?
(attempt to create a map of
journals)
(launch gephi file to
illustrate)

• Gaze computes
similarities between
agents and creates the
resulting network
22
5. Cowo
• Problem:
For the map of cooccurences in
neuroeconomics abstracts I need:
- “functional magnetic resonance
imaging” (4-gram) in VosViewer!
- And adjectives too. And no
stemming. And no td-idf
weighting. Etc.

• Cowo provides an n-gram
approach instead of a POS
tagging approach for text
analysis.
23
6. Force Atlas 3D
• Problem: can’t we get more insight on the
structure of a network with a layout in 3D
space?
(GraphInsight provides 3D layouts, but not as
convenient to use as in Gephi IMHO)

• Solution: plugin in Gephi
(built on top of Force Atlas 2)
24
3. TOOLS – LESSONS LEARNED

25
Some things fail
• Calenicon? Project members did not use it
• I am the only user, not a good effort/results ratio!
• Got no star or forks on Github 
• Time of development would be too heavy to make it
useable by anybody with a www.datahub.io account.
• Not sure of the extent of the potential user base
26
Some things work
but remain confidential
• Cowo: 6x
• Overlyde: 0x

Monthly downloads from
www.clementlevallois.net
in the 3 months leading to Nov 2013

• Gaze: 0x

A few stars on Github, a couple of emails showing that some users do find Gaze and Cowo very useful in their professional activities.
27
Some things work
• Alphabetical Sorter: 20x
• Force Atlas 3D: 120x
Monthly downloads from
https://marketplace.gephi.org
Since their release (all sometime in 2013)

• Excel/csv importer: 300x
• Map of countries: 210x

Figures can be checked on the website
above.

2 general purpose plugins I released for the
Gephi community

28
(the big difference in downloads is
probably due to the distribution
channel)

29
4. HOW TO MAXIMIZE THE
IMPACT OF TOOLS?
Choose the distribution channel wisely

30
DISCLAIMER: next slides might read like an advertisement for Gephi.
I am a member of the “Gephi Community” support team and I have
no stake in the software – just sharing lessons learned here.

31
Several ingredients needed

1.
2.
3.
4.
5.

Easy to use
Cross platform support
Easy to install
Can be found
Meets a need

(2. and 3. are necessary but not sufficient – see the low
number of downloads on my apps reported above).
32
How to share tools?
•

Desktop applications

– Loet Leydesdorff command-line tools
– VosViewer
– Zotero desktop

•

Web applications

– Issue Crawler https://www.issuecrawler.net/
– Sc-Po MediaLab http://www.medialab.sciences-po.fr/tools/

•

Sharing the source code

– Git (Github, Bitbucket, SourceForge…)
– Run My Code (http://www.runmycode.org/)

•

Via platforms

– Firefox: offers plugins like Zotero
– Gephi: can be extended with plugins
– R: packages can be reviewed and shared widely

33
Can be discovered? Runs on Mac, PC,
Linux

Easy to install /
user friendliness

Desktop app

Depends on your
own notoriety.

NO (except for
Java)

NO, with
exceptions (well
designed UI and
good doc. Example:
Vosviewer)

Web app

Depends on your
own notoriety.

YES

YES

Run my code / Git

YES – for techy
audiences

Not guaranteed

NO, except for
specialized
audiences
(example: an R
script for R users)

Gephi

YES - wide audience

YES

YES

34
Gephi is a platform because
•

Extensible with plugins (your tool!) with extensive support to create them
– https://github.com/gephi/gephi-plugins-bootcamp

•

Functions for branding: your tool can have its own identity (not obliged to blend
into Gephi)

•

License: free including for commercial purposes. Your plugin can have a different
license.

•

OpenGL visualization engine

•

GUI all managed by the underlying Netbeans platform

•

All java libraries available

•

Includes a “data laboratory” for spreadsheet-like operations

•

And I/O, client-server communication, hardware acceleration, auto-update
functionalities, etc.
35
Example: API client for Europeana
• Use case: creating an app to query Europeana

– To retrieve meta-data, including pictures in some cases

• Desktop app or Europeana website?

– Desktop app: how to make it cross platform? How to make this
app discoverable by a large user base?
– Europeana website: no possibility to export in a personalized
way

• Create the Europeana as a Gephi plugin:

– gives the flexibility of a rich client application (RCA),
– without the pain to develop a RCA from scratch,
– and provides a popular distribution channel for the app
36
The take away (again)
• NESSHI and other empirically oriented projects are very likely
to lead to the creation of data processing tools to support the
analysis.

• Instead of “wasting” these tools in scripts that will rarely be
reused, one could look for platforms of distribution to release
them as apps. Gephi is such a platform.

37
Thank you.
@seinecle
These slides are online at:

www.clementlevallois.net

38

Weitere ähnliche Inhalte

Ähnlich wie NESSHI and GEPHI: sociology of science as a breeding ground for tool building in the digital humanities

How to sustain a tool building community-driven effort
How to sustain a tool building community-driven effortHow to sustain a tool building community-driven effort
How to sustain a tool building community-driven effortJordi Cabot
 
GoOpen 2010: Reidar Conradi
GoOpen 2010: Reidar ConradiGoOpen 2010: Reidar Conradi
GoOpen 2010: Reidar ConradiFriprogsenteret
 
Giving Researchers Credit for their Data
Giving Researchers Credit for their DataGiving Researchers Credit for their Data
Giving Researchers Credit for their DataFiona Murphy
 
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc20205212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020JoannaStachera1
 
Demystifying Digital Scholarship Workshop 6 Slides
Demystifying Digital Scholarship Workshop 6 SlidesDemystifying Digital Scholarship Workshop 6 Slides
Demystifying Digital Scholarship Workshop 6 SlidesPaige Morgan
 
Visualization for Software Analytics
Visualization for Software AnalyticsVisualization for Software Analytics
Visualization for Software AnalyticsMargaret-Anne Storey
 
EGU GA 2017 OSGeo Townhall
EGU GA 2017 OSGeo TownhallEGU GA 2017 OSGeo Townhall
EGU GA 2017 OSGeo TownhallPeter Löwe
 
20180718 dirk-frigne-foss4 g-europe-final
20180718 dirk-frigne-foss4 g-europe-final20180718 dirk-frigne-foss4 g-europe-final
20180718 dirk-frigne-foss4 g-europe-finalDirk Frigne
 
Pathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesPathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesTao Xie
 
Foundation Comparison
Foundation ComparisonFoundation Comparison
Foundation ComparisonJody Garnett
 
Tools to help and engage emerging open source communities towards development...
Tools to help and engage emerging open source communities towards development...Tools to help and engage emerging open source communities towards development...
Tools to help and engage emerging open source communities towards development...Jose Astrain
 
Introducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applicationsIntroducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applicationsRokesh Jankie
 
Open source softwares, 2011
Open source softwares, 2011Open source softwares, 2011
Open source softwares, 2011Florent Renucci
 
Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012
Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012
Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012Jeroen Ticheler
 
Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesTao Xie
 
xAPI: The Landscape
xAPI: The LandscapexAPI: The Landscape
xAPI: The LandscapeMegan Bowe
 
Gephi Consortium Presentation
Gephi Consortium PresentationGephi Consortium Presentation
Gephi Consortium PresentationGephi Consortium
 

Ähnlich wie NESSHI and GEPHI: sociology of science as a breeding ground for tool building in the digital humanities (20)

How to sustain a tool building community-driven effort
How to sustain a tool building community-driven effortHow to sustain a tool building community-driven effort
How to sustain a tool building community-driven effort
 
GoOpen 2010: Reidar Conradi
GoOpen 2010: Reidar ConradiGoOpen 2010: Reidar Conradi
GoOpen 2010: Reidar Conradi
 
Giving Researchers Credit for their Data
Giving Researchers Credit for their DataGiving Researchers Credit for their Data
Giving Researchers Credit for their Data
 
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc20205212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
5212303961620480 1585670953 joanna_stachera_proposal_g_soc2020
 
Demystifying Digital Scholarship Workshop 6 Slides
Demystifying Digital Scholarship Workshop 6 SlidesDemystifying Digital Scholarship Workshop 6 Slides
Demystifying Digital Scholarship Workshop 6 Slides
 
Visualization for Software Analytics
Visualization for Software AnalyticsVisualization for Software Analytics
Visualization for Software Analytics
 
EGU GA 2017 OSGeo Townhall
EGU GA 2017 OSGeo TownhallEGU GA 2017 OSGeo Townhall
EGU GA 2017 OSGeo Townhall
 
20180718 dirk-frigne-foss4 g-europe-final
20180718 dirk-frigne-foss4 g-europe-final20180718 dirk-frigne-foss4 g-europe-final
20180718 dirk-frigne-foss4 g-europe-final
 
Pathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesPathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and Challenges
 
Foundation Comparison
Foundation ComparisonFoundation Comparison
Foundation Comparison
 
Tools to help and engage emerging open source communities towards development...
Tools to help and engage emerging open source communities towards development...Tools to help and engage emerging open source communities towards development...
Tools to help and engage emerging open source communities towards development...
 
Intro-EOSC.pptx
Intro-EOSC.pptxIntro-EOSC.pptx
Intro-EOSC.pptx
 
Network Automation e-Academy
Network Automation e-AcademyNetwork Automation e-Academy
Network Automation e-Academy
 
Introducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applicationsIntroducing TensorFlow: The game changer in building "intelligent" applications
Introducing TensorFlow: The game changer in building "intelligent" applications
 
Benoit Visual Only Retrieval
Benoit Visual Only RetrievalBenoit Visual Only Retrieval
Benoit Visual Only Retrieval
 
Open source softwares, 2011
Open source softwares, 2011Open source softwares, 2011
Open source softwares, 2011
 
Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012
Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012
Introduction to GeoNetwork and GeoCat Bridge - teknologiforum Oslo 11-2012
 
Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and Challenges
 
xAPI: The Landscape
xAPI: The LandscapexAPI: The Landscape
xAPI: The Landscape
 
Gephi Consortium Presentation
Gephi Consortium PresentationGephi Consortium Presentation
Gephi Consortium Presentation
 

Kürzlich hochgeladen

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...RKavithamani
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 

Kürzlich hochgeladen (20)

Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
Privatization and Disinvestment - Meaning, Objectives, Advantages and Disadva...
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 

NESSHI and GEPHI: sociology of science as a breeding ground for tool building in the digital humanities

  • 1. NESSHI and GEPHI Sociology of science as a breeding ground for tool building in the digital humanities Clement Levallois Meertens Institute, 28 November 2013 1
  • 2. Bio notes PhD (2003-2008) Post-Doc (2008-2013) • Member of the Gephi Consortium • Starting at EMLyon in January 2014 2
  • 3. Plan 1. NESSHI (2011-2014) 2. Building tools to reveal insights from data in the NESSHI project 3. A focus on GEPHI as a platform of distribution for data processing tools (not just networks!) 3
  • 4. The take away • NESSHI and other empirically oriented projects are very likely to lead to the creation of data processing tools to support the analysis. • Instead of “wasting” these tools in scripts that will never be reused, one could look for platforms of distribution to release them as apps. Gephi is such a platform. 4
  • 5. 1. NESSHI The Neuro-Turn in European Social Sciences and Humanities: Impact of neurosciences on economics, marketing and philosophy 5
  • 7. The neuro-turn • A focus on empirical investigations • Questioning the turn itself: – do we find it? – is it unidirectional? • A look at academia in relation to society 7
  • 8. Challenges • We tried to track a broad, undefined intellectual movement through – Time ( ~1985-~2013?) – Scientific disciplines (all of them?) – Science/society spheres (beware dichotomies!) 8
  • 10. • Archival work – on the web Neuromarketing in the press, 2003 (and 2007) 10
  • 11. • Bibliographical data – Neuroeconomics papers overlaid on an ISI map (Rafols, Porter and Leydesdorff overlay toolkit approach) – Co-author networks – Tried journal maps… 11
  • 12. • Online surveys – 630+ individual emails: “Who do you discuss with in neuroeconomics?” A I discuss with… B … this person 12
  • 13. • Textual data – Web of Knowledge: coocurrences in abstracts of papers in neuroeconomics – Mainz bibliography in neuroethics: clustering of papers in neuroethics based on a semi-manual classification scheme. 13
  • 14. + • Interviews (NL), • Ethnographies (UK), • Experimental work (France), • Philosophical analysis (France, Germany) 14
  • 15. Results (NL team) • Neuroeconomics (in 2012): a complete survey of the field • Neuromarketing (2001 - ): a first history based on archives, in academia and industry • Neuroethics (1995 – 2012): a definition based on the Mainz bibliography – collab with DE team. • The neuro-turn society? Pictures of the neuro in Dutch newspapers (1987-2011) (on going) 15
  • 16. 2. DEVELOPMENT OF TOOLS FOR NESSHI 16
  • 17. (note) All software discussed here are available in open source with Apache License 2 or equivalent at: https://github.com/seinecle 17
  • 18. Why new tools? • Unstructured data brings specific needs • Technologies evolve quickly – tools built 3 years ago don’t exploit all possibilities available today • From scripting to releasing: – A script can take days to create, why not add a few hours of work to make it useable by the larger public? • Creating tools can be fun (actually, addictive). • Is it worth always? Does it work always? No… 18
  • 19. 1. Calenicon (show demo at www.calenicon.org) • Problem: how to edit / share / archive pictures in an online environment? • Using the vocabulary of VRAcore for annotations VRAcore: data standard for the description of works of visual culture as well as the images that document them. http://www.loc.gov/standards/vracore/ • Uses the API of www.datahub.io (CKAN), the repository of the Open Knowledge Foundation (Figshare and DANS were other candidates). • Developed in Java EE. • www.calenicon.org 19
  • 20. 2. Alphabetical sorter • Problem: how to get a neat alphabetical list of scientists in Inkscape or Illustrator? • Developed in Java as a plugin in Gephi • Trivial (40 lines of code). (open pdf to show example of use) 20
  • 21. 3. Overlay toolkit • Problem: exploratory work for the history of neuroethics project – can we see an evolution of a neuroethics in terms of ISI categories? • Overlay toolkit by Rafols, Porter and Leydesdorff (2010) – Originally a series of command-line utilities available from Loet Leydesdorff website, with an output to Pajek. • Excel macro to enable an export to Gephi (2011) • Overlyde (2012): stand alone Java program to create dynamic viz in Gephi. • Now a Gephi plugin (2013) – Allows for dynamic datasets, filtering, relative views... 21
  • 22. 4. Gaze • Problem: how to create networks of agents, based on attributes they share? (attempt to create a map of journals) (launch gephi file to illustrate) • Gaze computes similarities between agents and creates the resulting network 22
  • 23. 5. Cowo • Problem: For the map of cooccurences in neuroeconomics abstracts I need: - “functional magnetic resonance imaging” (4-gram) in VosViewer! - And adjectives too. And no stemming. And no td-idf weighting. Etc. • Cowo provides an n-gram approach instead of a POS tagging approach for text analysis. 23
  • 24. 6. Force Atlas 3D • Problem: can’t we get more insight on the structure of a network with a layout in 3D space? (GraphInsight provides 3D layouts, but not as convenient to use as in Gephi IMHO) • Solution: plugin in Gephi (built on top of Force Atlas 2) 24
  • 25. 3. TOOLS – LESSONS LEARNED 25
  • 26. Some things fail • Calenicon? Project members did not use it • I am the only user, not a good effort/results ratio! • Got no star or forks on Github  • Time of development would be too heavy to make it useable by anybody with a www.datahub.io account. • Not sure of the extent of the potential user base 26
  • 27. Some things work but remain confidential • Cowo: 6x • Overlyde: 0x Monthly downloads from www.clementlevallois.net in the 3 months leading to Nov 2013 • Gaze: 0x A few stars on Github, a couple of emails showing that some users do find Gaze and Cowo very useful in their professional activities. 27
  • 28. Some things work • Alphabetical Sorter: 20x • Force Atlas 3D: 120x Monthly downloads from https://marketplace.gephi.org Since their release (all sometime in 2013) • Excel/csv importer: 300x • Map of countries: 210x Figures can be checked on the website above. 2 general purpose plugins I released for the Gephi community 28
  • 29. (the big difference in downloads is probably due to the distribution channel) 29
  • 30. 4. HOW TO MAXIMIZE THE IMPACT OF TOOLS? Choose the distribution channel wisely 30
  • 31. DISCLAIMER: next slides might read like an advertisement for Gephi. I am a member of the “Gephi Community” support team and I have no stake in the software – just sharing lessons learned here. 31
  • 32. Several ingredients needed 1. 2. 3. 4. 5. Easy to use Cross platform support Easy to install Can be found Meets a need (2. and 3. are necessary but not sufficient – see the low number of downloads on my apps reported above). 32
  • 33. How to share tools? • Desktop applications – Loet Leydesdorff command-line tools – VosViewer – Zotero desktop • Web applications – Issue Crawler https://www.issuecrawler.net/ – Sc-Po MediaLab http://www.medialab.sciences-po.fr/tools/ • Sharing the source code – Git (Github, Bitbucket, SourceForge…) – Run My Code (http://www.runmycode.org/) • Via platforms – Firefox: offers plugins like Zotero – Gephi: can be extended with plugins – R: packages can be reviewed and shared widely 33
  • 34. Can be discovered? Runs on Mac, PC, Linux Easy to install / user friendliness Desktop app Depends on your own notoriety. NO (except for Java) NO, with exceptions (well designed UI and good doc. Example: Vosviewer) Web app Depends on your own notoriety. YES YES Run my code / Git YES – for techy audiences Not guaranteed NO, except for specialized audiences (example: an R script for R users) Gephi YES - wide audience YES YES 34
  • 35. Gephi is a platform because • Extensible with plugins (your tool!) with extensive support to create them – https://github.com/gephi/gephi-plugins-bootcamp • Functions for branding: your tool can have its own identity (not obliged to blend into Gephi) • License: free including for commercial purposes. Your plugin can have a different license. • OpenGL visualization engine • GUI all managed by the underlying Netbeans platform • All java libraries available • Includes a “data laboratory” for spreadsheet-like operations • And I/O, client-server communication, hardware acceleration, auto-update functionalities, etc. 35
  • 36. Example: API client for Europeana • Use case: creating an app to query Europeana – To retrieve meta-data, including pictures in some cases • Desktop app or Europeana website? – Desktop app: how to make it cross platform? How to make this app discoverable by a large user base? – Europeana website: no possibility to export in a personalized way • Create the Europeana as a Gephi plugin: – gives the flexibility of a rich client application (RCA), – without the pain to develop a RCA from scratch, – and provides a popular distribution channel for the app 36
  • 37. The take away (again) • NESSHI and other empirically oriented projects are very likely to lead to the creation of data processing tools to support the analysis. • Instead of “wasting” these tools in scripts that will rarely be reused, one could look for platforms of distribution to release them as apps. Gephi is such a platform. 37
  • 38. Thank you. @seinecle These slides are online at: www.clementlevallois.net 38