The document discusses using provenance to analyze social machines. It presents provenance templating as a way to standardize provenance capture from social machines. Templates define a provenance graph structure with variable placeholders that can be populated with observed values. Summarizing and applying network analysis to provenance data can provide insights into the behaviors and patterns within social machines. The work is demonstrated through a Pokémon Go simulation and aims to also study citizen science projects on exoplanet discovery.
2. SOCIAM All-hands – Sep 19, 2017
Provenance as the Foundation for
Analytics of Social Machines
SOCIALMACHINES
INSIGHTS
Provenance Capture Provenance Analytics
3. A Case Study
Ingress location-based AR mobile game by Niantic.
Gameplay involves capturing portals at places of cultural
significance and linking them over geographical areas.
Portals are crowdsourced 24+ million submissions 2012-
2015
Social machine observing Pokémon GO Pixelmon
A highly observed social machine + radar apps
/r/PokemonGO
11. Provenance Summarisation
• Essence of Provenance: a
provenance summary should
capture the essence of the
provenance graph that it
summarises.
• Outliers: It should be possible
to detect anomalies or outliers
in a provenance summary
• Conformance: It should be
possible to decide whether a
provenance graph is
compatible, or conformant,
with a provenance summary.
SOCIAM All-hands – Sep 19, 2017
Understanding Provenance at Scale
12. Pokémon GO Provenance Summary
pgo:PokemonNormal_6
pgo:Player-pgo:Instinct_7
pgo:PokemonCapture_1
pgo:BallCollection_2
pgo:PokemonStrong_0 pgo:PokemonWeak_4
pgo:Pokestop_5
pgo:Valor-pgo:Player_3
pgo:Player-pgo:Mystic_8
Original Summary
Nodes 1488 9
Edges 3384 46
• Clear Narrative
• Common Patterns
• Outliers
Paper: Luc Moreau. Aggregation by provenance types: A technique for summarising provenance
graphs. In Graphs as Models 2015 (An ETAPS'15 workshop), Electronic Proceedings in Theoretical
Computer Science, pages 129-144, London, UK, April 2015.
14. Provenance Network Analytics
SOCIAM All-hands – Sep 19, 2017
Paper: Trung Dong Huynh, Mark Ebden,
Joel Fischer, Stephen Roberts, and Luc
Moreau. Provenance Network Analytics: An
approach to data analytics using data
provenance. In Data Mining and Knowledge
Discovery, (in review).
15. PNA over Provenance Summaries
SOCIAM All-hands – Sep 19, 2017
Initial results on Pokémon GO provenance:
• Identifying correctly Yellow team (100%)
• Cannot distinguish the summaries of Blue and Red
16. Summary
SOCIAM All-hands – Sep 19, 2017
b
operation
player_before
use
pokemon_before
use
type:var:operation_type
type: var:player_type
label: var:user_before_label
team: var:team
type: var:pokemon_type
label: var:object_before_label
species: var:species
strength: var:strength
der
player_after
type: var:player_type
label:var:user_after_label
team: var:team
pokemon_after
gender
der
type: var:pokemon_type
label: var:object_after_label
pgo:PokemonNormal_5
pgo:Player-pgo:Instinct_6
pgo:PokemonCapture_1
pgo:Pokestop_4
pgo:PokemonStrong_0
pgo:BallCollection_2
pgo:PokemonWeak_3
pgo:Player-pgo:Mystic_4
pgo:PokemonWeak_2
pgo:PokemonCapture_0
pgo:Pokestop_3
pgo:BallCollection_1
pgo:BallCollection_2
pgo:Pokestop_4
pgo:Valor-pgo:Player_3
pgo:PokemonCapture_1
pgo:PokemonStrong_0
A Social Machine
Insights
Provenance Templating
Summarisation
Network Analytics
17. Where’s next…
• Pokémon Go
• Extending provenance network
metrics for provenance
summaries
• Enriching the simulator with
more player behaviours
• StarGazing Live Datasets
• Planet 9: 5m classifications
• Exoplanet Explorers: 6m
classifications
• Discovery of a new 4-planet
system
• Data: classifications & forum
interactions
(between contributors &
researchers)
SOCIAM All-hands – Sep 19, 2017
pgo:BallCollection_2
pgo:Pokestop_4
pgo:Valor-pgo:Player_3
pgo:PokemonCapture_1
pgo:PokemonStrong_0
pgo:Player-pgo:Mystic_4
pgo:PokemonWeak_2
pgo:PokemonCapture_0
pgo:Pokestop_3
pgo:BallCollection_1
Hinweis der Redaktion
It’s a location-based augmented reality game (+ cultural heritage app) with massive take-up.
The app earned $35 million from its 30 million users by the end of its first month, and was back at the top of the App store last month after the Generation 2 update.
But daily user numbers have dropped significantly, from 28 million down to 5 million accoring to comScore’s latest mega data report, as reported by recode.
Pokemon Go Fest last July in Chicago with 20,000 attendees, ticket price $20, but 1 reached $800 on ebay
A behavioural intervention (see “Death Tracker”)
Leads to real-world social interactions
Standardised 2013.
After 4 years, we’re seeing evidence of adoptions by various organisations around the globe
Obviously, should we provenance enable a social machine that has significant take up, the type of provenance representation that we would obtain would very quickly become unmanageable. The point of the picture here is not for you to be able to follow each edge in this graph. Instead, it is to show that very quickly our screens can become filled with information, and it is critical to extract the essence of provenance.
If we talk about accountability, we may identify behaviour that is the norm and behaviour that does not correspond to the norm, without the norm being defined ‘a priori’. Being able to detect outlier behaviour is critical in gaining insights in past actions of a social machine
Finally, if we know that a social machine “behaved well” over a period of time, can we decide if today’s behaviour is compatible with the past? That’s the idea of conformance.
We developed a summarisation technique, to be presented next month
Planet 9: find the ninth planet in our Solar System
Exoplanet Explorers: Discover new planets orbiting stars in our galaxy
Question: any patterns emerge