SlideShare ist ein Scribd-Unternehmen logo
1 von 77
Downloaden Sie, um offline zu lesen
BIG DATA
OPPORTUNITÀ E RISCHI
Le parole dell'innovazione e il lavoro
Paolo Bajardi, PhD 

Applied Data Science Manager, ISI Foundation
Torino - Aprile 4, 2019
ISI Foundation
www.isi.it
‣ basic and applied research
‣ 35+ years of history
‣ ~50+ reseachers
‣ Turin, Italy & New York, USA
‣ international network
‣ supported by:
• institutional philanthropy
• research grants
• industrial partnerships
‣ focus on
• data science & AI
• complex systems science
• comp. soc. sci, comp. epi.
Avanzamenti scientifici e
trasferimento nelle applicazioni
GOOGLE TRENDS
digital transformation
Avanzamenti scientifici e
trasferimento nelle applicazioni
[…] Companies are placing big bets on data and analytics. But
adapting to an era of more data-driven decision making has not
always proven to be a simple proposition for people or
organizations. Many are struggling to develop talent, business
processes, and organizational muscle to capture real value
from analytics.
McKinsey Insights (2016)
?
dati
decisioni
modelli
(big) data
toddwschneider.com
1.1 miliardi di chiamate taxi

6 anni

350 Gb di dati
New York City
tracce digitali
prospettiva storica

orizzonte temporale limitato

riproducibilità limitata

contesto limitato

privacy e protezione dei dati
disponibili come effetto collaterale di attività ordinarie

alto livello di copertura, accesso alle grandi scale

possibilità di elaborazione automatica
73% della popolazione accede ad Internet

57% della popolazione usa social media

51% accede da smartphone

6+ ore al giorno online
wearesocial.com/blog/2018/01/global-digital-report-2018
Italia:
twitter.github.io/interactive/sotu2015
{"id"=>12296272736,
"text"=>
"An early look at Annotations:
http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453",
"created_at"=>"Fri Apr 16 17:55:46 +0000 2010",
"in_reply_to_user_id"=>nil,
"in_reply_to_screen_name"=>nil,
"in_reply_to_status_id"=>nil
"favorited"=>false,
"truncated"=>false,
"user"=>
{"id"=>6253282,
"screen_name"=>"twitterapi",
"name"=>"Twitter API",
"description"=>
"The Real Twitter API. I tweet about API changes, service issues and
happily answer questions about Twitter and our API. Don't get an answer? It's on my website.",
"url"=>"http://apiwiki.twitter.com",
"location"=>"San Francisco, CA",
"profile_background_color"=>"c1dfee",
"profile_background_image_url"=>
"http://a3.twimg.com/profile_background_images/59931895/twitterapi-background-new.png",
"profile_background_tile"=>false,
"profile_image_url"=>"http://a3.twimg.com/profile_images/689684365/api_normal.png",
"profile_link_color"=>"0000ff",
"profile_sidebar_border_color"=>"87bc44",
"profile_sidebar_fill_color"=>"e0ff92",
"profile_text_color"=>"000000",
"created_at"=>"Wed May 23 06:01:13 +0000 2007",
"contributors_enabled"=>true,
"favourites_count"=>1,
"statuses_count"=>1628,
"friends_count"=>13,
"time_zone"=>"Pacific Time (US & Canada)",
"utc_offset"=>-28800,
"lang"=>"en",
"protected"=>false,
"followers_count"=>100581,
The tweet's unique ID. These
IDs are roughly sorted &
developers should treat them
as opaque (http://bit.ly/dCkppc).
Text of the tweet.
Consecutive duplicate tweets
are rejected. 140 character
max (http://bit.ly/4ud3he).
Tweet's
creation
date.
DEPRECATED
The ID of an existing tweet that
this tweet is in reply to. Won't
be set unless the author of the
referenced tweet is mentioned.
The screen name &
user ID of replied to
tweet author.
Truncated to 140
characters. Only
possible from SMS.
Theauthorofthetweet.This
embeddedobjectcangetoutofsync.
Theauthor's
userID.
The author's
user name.
The author's
screen name.
The author's
biography.
The author's
URL.
The author's "location". This is a free-form text field, and
there are no guarantees on whether it can be geocoded.
Rendering information
for the author. Colors
are encoded in hex
values (RGB).
The creation date
for this account.
Whether this account has
contributors enabled
(http://bit.ly/50npuu). Number of
favorites this
user has.
Numberoftweets
thisuserhas.
Number of
users this user
is following.The timezone and offset
(in seconds) for this user.
The user's selected
language.
metadati
"profile_sidebar_border_color"=>"87bc44",
"profile_sidebar_fill_color"=>"e0ff92",
"profile_text_color"=>"000000",
"created_at"=>"Wed May 23 06:01:13 +0000 2007",
"contributors_enabled"=>true,
"favourites_count"=>1,
"statuses_count"=>1628,
"friends_count"=>13,
"time_zone"=>"Pacific Time (US & Canada)",
"utc_offset"=>-28800,
"lang"=>"en",
"protected"=>false,
"followers_count"=>100581,
"geo_enabled"=>true,
"notifications"=>false,
"following"=>true,
"verified"=>true},
"contributors"=>[3191321],
"geo"=>nil,
"coordinates"=>nil,
"place"=>
{"id"=>"2b6ff8c22edd9576",
"url"=>"http://api.twitter.com/1/geo/id/2b6ff8c22edd9576.json",
"name"=>"SoMa",
"full_name"=>"SoMa, San Francisco",
"place_type"=>"neighborhood",
"country_code"=>"US",
"country"=>"The United States of America",
"bounding_box"=>
{"coordinates"=>
[[[-122.42284884, 37.76893497],
[-122.3964, 37.76893497],
[-122.3964, 37.78752897],
[-122.42284884, 37.78752897]]],
"type"=>"Polygon"}},
"source"=>"web"}
em
The creation date
for this account.
Whether this account has
contributors enabled
(http://bit.ly/50npuu). Number of
favorites this
user has.
Numberoftweets
thisuserhas.
Number of
users this user
is following.The timezone and offset
(in seconds) for this user.
The user's selected
language.
Whether this user is protected
or not. If the user is protected,
then this tweet is not visible
except to "friends".
Number of
followers for
this user.
Whetherthisuserhasgeo
enabled(http://bit.ly/4pFY77).
DEPRECATED
in this context
Whether this user
has a verified badge.
Thegeotagonthistweetin
GeoJSON(http://bit.ly/b8L1Cp).
The contributors' (if any) user
IDs (http://bit.ly/50npuu).
DEPRECATED
The place associated with this
Tweet (http://bit.ly/b8L1Cp).
The place ID
The URL to fetch a detailed
polygon for this placeThe printable names of this place
The type of this
place - can be a
"neighborhood"
or "city"
The country this place is in
The bounding
box for this
place
The application
that sent this
tweet
Map of a Twitter Status Object
Raffi Krikorian <raffi@twitter.com>
18 April 2010
{"id"=>12296272736,
"text"=>
"An early look at Annotations:
http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453",
"created_at"=>"Fri Apr 16 17:55:46 +0000 2010",
"in_reply_to_user_id"=>nil,
"in_reply_to_screen_name"=>nil,
"in_reply_to_status_id"=>nil
"favorited"=>false,
"truncated"=>false,
"user"=>
{"id"=>6253282,
"screen_name"=>"twitterapi",
"name"=>"Twitter API",
"description"=>
"The Real Twitter API. I tweet about API changes, service issues and
happily answer questions about Twitter and our API. Don't get an answer? It's on my website.",
"url"=>"http://apiwiki.twitter.com",
"location"=>"San Francisco, CA",
"profile_background_color"=>"c1dfee",
"profile_background_image_url"=>
"http://a3.twimg.com/profile_background_images/59931895/twitterapi-background-new.png",
"profile_background_tile"=>false,
"profile_image_url"=>"http://a3.twimg.com/profile_images/689684365/api_normal.png",
"profile_link_color"=>"0000ff",
"profile_sidebar_border_color"=>"87bc44",
"profile_sidebar_fill_color"=>"e0ff92",
"profile_text_color"=>"000000",
"created_at"=>"Wed May 23 06:01:13 +0000 2007",
"contributors_enabled"=>true,
"favourites_count"=>1,
"statuses_count"=>1628,
"friends_count"=>13,
"time_zone"=>"Pacific Time (US & Canada)",
"utc_offset"=>-28800,
"lang"=>"en",
"protected"=>false,
"followers_count"=>100581,
The tweet's unique ID. These
IDs are roughly sorted &
developers should treat them
as opaque (http://bit.ly/dCkppc).
Text of the tweet.
Consecutive duplicate tweets
are rejected. 140 character
max (http://bit.ly/4ud3he).
Tweet's
creation
date.
DEPRECATED
The ID of an existing tweet that
this tweet is in reply to. Won't
be set unless the author of the
referenced tweet is mentioned.
The screen name &
user ID of replied to
tweet author.
Truncated to 140
characters. Only
possible from SMS.
Theauthorofthetweet.This
embeddedobjectcangetoutofsync.
Theauthor's
userID.
The author's
user name.
The author's
screen name.
The author's
biography.
The author's
URL.
The author's "location". This is a free-form text field, and
there are no guarantees on whether it can be geocoded.
Rendering information
for the author. Colors
are encoded in hex
values (RGB).
The creation date
for this account.
Whether this account has
contributors enabled
(http://bit.ly/50npuu). Number of
favorites this
user has.
Numberoftweets
thisuserhas.
Number of
users this user
is following.The timezone and offset
(in seconds) for this user.
The user's selected
language.
metadata
"profile_sidebar_border_color"=>"87bc44",
"profile_sidebar_fill_color"=>"e0ff92",
"profile_text_color"=>"000000",
"created_at"=>"Wed May 23 06:01:13 +0000 2007",
"contributors_enabled"=>true,
"favourites_count"=>1,
"statuses_count"=>1628,
"friends_count"=>13,
"time_zone"=>"Pacific Time (US & Canada)",
"utc_offset"=>-28800,
"lang"=>"en",
"protected"=>false,
"followers_count"=>100581,
"geo_enabled"=>true,
"notifications"=>false,
"following"=>true,
"verified"=>true},
"contributors"=>[3191321],
"geo"=>nil,
"coordinates"=>nil,
"place"=>
{"id"=>"2b6ff8c22edd9576",
"url"=>"http://api.twitter.com/1/geo/id/2b6ff8c22edd9576.json",
"name"=>"SoMa",
"full_name"=>"SoMa, San Francisco",
"place_type"=>"neighborhood",
"country_code"=>"US",
"country"=>"The United States of America",
"bounding_box"=>
{"coordinates"=>
[[[-122.42284884, 37.76893497],
[-122.3964, 37.76893497],
[-122.3964, 37.78752897],
[-122.42284884, 37.78752897]]],
"type"=>"Polygon"}},
"source"=>"web"}
em
The creation date
for this account.
Whether this account has
contributors enabled
(http://bit.ly/50npuu). Number of
favorites this
user has.
Numberoftweets
thisuserhas.
Number of
users this user
is following.The timezone and offset
(in seconds) for this user.
The user's selected
language.
Whether this user is protected
or not. If the user is protected,
then this tweet is not visible
except to "friends".
Number of
followers for
this user.
Whetherthisuserhasgeo
enabled(http://bit.ly/4pFY77).
DEPRECATED
in this context
Whether this user
has a verified badge.
Thegeotagonthistweetin
GeoJSON(http://bit.ly/b8L1Cp).
The contributors' (if any) user
IDs (http://bit.ly/50npuu).
DEPRECATED
The place associated with this
Tweet (http://bit.ly/b8L1Cp).
The place ID
The URL to fetch a detailed
polygon for this placeThe printable names of this place
The type of this
place - can be a
"neighborhood"
or "city"
The country this place is in
The bounding
box for this
place
The application
that sent this
tweet
Map of a Twitter Status Object
Raffi Krikorian <raffi@twitter.com>
18 April 2010
J. Ginsberg et al., Nature 457, 1012 (2009)
google.org/flutrends
Segnali “impliciti”
sensori
jawbone.com/blog/napa-earthquake-effect-on-sleep
dati da sensori
Ospedale Pediatrico
Bambino Gesù
doctors
nurses
auxiliaries
children
parents
A
N D
C
P
www.sociopatterns.org
Lyon, France

primary school

231 students

10 teachers
J. Stehlé et al.,
PLoS ONE 6(8), e23176 (2011)
emergency response exercise
PATIENT&FLOW&
L. Ozella et al., arxiv.org/abs/1809.06887 (JMIR, in press)
reti sociali
P. Butler
reti sociali
healthmap.org
thehumanproject.org
A. Okan et al., “Using Big Data to Understand the Human Condition:
The Kavli HUMAN Project”, Big Data 3:3 (2015)
www.projectbaseline.com
‣ più dati comportamentali da piattaforme digitali

‣ grandi coorti, visibilità di intere comunità,
risoluzione di comportamenti individuali

su lunghi orizzonti temporali

‣ uso crescente di dati non-strutturati
‣ connessione sempre più stretta fra mondo fisico e
mondo digitale: sensori, ambienti intelligenti,
Internet of Things
‣ uso crescente di dati non tradizionali e/o esterni,

nuove partnership legate allo scambio dei dati
trend
‣ è possibile usare metodi automatici per estrarre
regolarità e generare ipotesi, usando statistica
inferenziale, data mining, machine learning, analisi del
linguaggio naturale, visualizzazione dati, etc.

‣ i modelli matematici sono costruiti su un ricco
substrato di dati (transazioni, social media, mobilità,
preferenze espresse o inferite) e sono informati da grandi
basi di dati e da flussi di dati in tempo reale
‣ è possibile confrontare modello e realtà di un sistema

a velocità e scale che non hanno precedenti
l’immagine digitale del mondo
è sempre più fedele alla realtà
trend
modelli
“modello” ?
• modello matematico

• modello statistico

• modello generativo

• modello di apprendimento automatico

• modello descrittivo

• modello dinamico

• modello ad agenti

• modello predittivo (di fattori ignoti)

• modello predittivo (del futuro)

• …
mobilità umanapopolazione
scala geografica
short range
mobility layerpopulation layer
long range
mobility layer101
105
101
105
Balcan et al. PNAS 2009
pendolarismo viaggio aereo
esempio: predire un'epidemica
www.gleamviz.org
epidemic forecast
45
“Digital Phenotype”
Computational
Social Science
+
+
Behavioral Economics,
Social Sciences,
Game Theory, …
models of people
machine learning
Y. Abu-Mustafa, M. Magdon-Ismail, H.-T. Lin

Learning from Data, http://amlbook.com
Learning from Data, http://amlbook.com
machine learning
decision boundary
yes
no
decisione algoritmica
sign(
lX
i=1
yi(↵i/⇢)(K(xi, x) + b))
decisione algoritmica
50
“Bestiario”
Principali applicazioni dell’ IA
Perchè è così difficile?
Un nuovo paradigma
Un esempio
0
1
2
3
4
5
6
7
8
9
Un esempio
“The CNN (convolutional neural network) achieves performance
on par with all tested experts across both tasks, demonstrating
an artificial intelligence capable of classifying skin cancer
with a level of competence comparable to dermatologists.”
decisioni

e politiche
modelli matematici,

sistemi complessi,

comp. soc. sci.,

statistica, …
data mining,

machine learning,

natural language
processing, …
dati da piattaforme digitali
expertise

di dominio
decisioni & politiche
dai dati ai modelli alle decisioni
“This is a world where massive amounts of data and applied
mathematics replace every other tool that might be brought to
bear. Out with every theory of human behavior, from linguistics
to sociology. Forget taxonomy, ontology, and psychology. Who
knows why people do what they do? The point is they do it, and
we can track and measure it with unprecedented fidelity. With
enough data, the numbers speak for themselves.”
Chris Anderson (2008)
Bias e discriminazione algoritmica:

sfide etiche e regolatorie 
‣ accesso ai dati

‣ bias delle sorgenti di dati

‣ leggibilità dei modelli

‣ big (personal) data 

‣ dati industriali e nuove partnership
sign(
lX
i=1
yi(↵i/⇢)(K(xi, x) + b))
decisione algoritmica
P. Butler
Accesso ai dati
69
Bias
“ […] ensure that by using big data algorithms [firms] are not
accidentally classifying people based on categories that
society has decided— by law or ethics— not to use, such
as race, ethnic background, gender, and sexual orientation.”
Edith Ramirez, chair of the Federal Trade Commission
discriminazione algoritmica
+
++
+
+
+
+
+
+
+
+
+
+
+
+++
+
o
o
o
o
oo
o
o
o
oo
oo
o
o
(attributo sensibile)
attributononsensibile
M F
sign(
lX
i=1
yi(↵i/⇢)(K(xi, x) + b))
decisione algoritmica
blog.openai.com/unsupervised-sentiment-neuron
un esempio
di “black box”
Explainable AI
Moritz Hardt, medium.com/@mrtz/how-big-data-is-unfair-9aa544d739de
Understanding sources of unfairness in data driven decision making
correct in 90% of the cases because of noise

or
correct in 90% because it misclassifies a 10% minority
algorithmic discrimination
sign(
lX
i=1
yi(↵i/⇢)(K(xi, x) + b))
decisione algoritmica
Attacchi a sistemi algoritmici
Attacchi a sistemi algoritmici
Attacchi a sistemi algoritmici
big (personal) data
la nuova prospettiva sui dati personali
http://www.weforum.org/issues/rethinking-personal-data
sfida: nuove partnership istituzionali
competenze accesso ai dati+
non tradizionali non tradizionali
RESEARCH & INSIGHTS
REVENUE
RESPONSIBILITY
RECIPROCITY
REPUTATION
REGULATORY COMPLIANCE
• dati commerciali

• dati sensibili

• big / fast data
datacollaboratives.org

Weitere ähnliche Inhalte

Ähnlich wie Big data. Opportunità e rischi

Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightMatthew Russell
 
Intro to developing for @twitterapi
Intro to developing for @twitterapiIntro to developing for @twitterapi
Intro to developing for @twitterapiRaffi Krikorian
 
Intro to developing for @twitterapi (updated)
Intro to developing for @twitterapi (updated)Intro to developing for @twitterapi (updated)
Intro to developing for @twitterapi (updated)Raffi Krikorian
 
Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...
Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...
Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...Angus Fox
 
What to expect when you are visualizing
What to expect when you are visualizingWhat to expect when you are visualizing
What to expect when you are visualizingKrist Wongsuphasawat
 
What I tell myself before visualizing
What I tell myself before visualizingWhat I tell myself before visualizing
What I tell myself before visualizingKrist Wongsuphasawat
 
Talentbin Sales Deck
Talentbin Sales DeckTalentbin Sales Deck
Talentbin Sales DeckVishal Kumar
 
Talent Bin
Talent BinTalent Bin
Talent BinRyan Gum
 
Seattle bot + Twitter data prezo
Seattle bot + Twitter data prezoSeattle bot + Twitter data prezo
Seattle bot + Twitter data prezoHarrison Neff
 
Social Media Mining using R
Social Media Mining using RSocial Media Mining using R
Social Media Mining using RSubhankar Mishra
 
Build 2017 - B8002 - Introducing Adaptive Cards
Build 2017 - B8002 - Introducing Adaptive CardsBuild 2017 - B8002 - Introducing Adaptive Cards
Build 2017 - B8002 - Introducing Adaptive CardsWindows Developer
 
GraphQL, l'avenir du REST par François ZANINOTTO
GraphQL, l'avenir du REST par François ZANINOTTOGraphQL, l'avenir du REST par François ZANINOTTO
GraphQL, l'avenir du REST par François ZANINOTTOLa Cuisine du Web
 
Government Next: NIC Presentation
Government Next: NIC PresentationGovernment Next: NIC Presentation
Government Next: NIC PresentationTara Hunt
 
Goodle Developer Days Munich 2008 - Open Social Update
Goodle Developer Days Munich 2008 - Open Social UpdateGoodle Developer Days Munich 2008 - Open Social Update
Goodle Developer Days Munich 2008 - Open Social UpdatePatrick Chanezon
 
Sps mad2019 es el momento, empieza a desarrollar para microsoft teams
Sps mad2019   es el momento, empieza a desarrollar para microsoft teams Sps mad2019   es el momento, empieza a desarrollar para microsoft teams
Sps mad2019 es el momento, empieza a desarrollar para microsoft teams Ruben Ramos
 
Lies you have been told about REST
Lies you have been told about RESTLies you have been told about REST
Lies you have been told about RESTdarrelmiller71
 
Connecting to the Pulse of the Planet with the Twitter Platform
Connecting to the Pulse of the Planet with the Twitter PlatformConnecting to the Pulse of the Planet with the Twitter Platform
Connecting to the Pulse of the Planet with the Twitter PlatformAndy Piper
 
Open Social Introduction - JUG SummerCamp 2010
Open Social Introduction - JUG SummerCamp 2010Open Social Introduction - JUG SummerCamp 2010
Open Social Introduction - JUG SummerCamp 2010Tugdual Grall
 

Ähnlich wie Big data. Opportunità e rischi (20)

Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and Insight
 
Intro to developing for @twitterapi
Intro to developing for @twitterapiIntro to developing for @twitterapi
Intro to developing for @twitterapi
 
Intro to developing for @twitterapi (updated)
Intro to developing for @twitterapi (updated)Intro to developing for @twitterapi (updated)
Intro to developing for @twitterapi (updated)
 
Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...
Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...
Embedded Tweets, Timelines and Twitter Cards - Social Developers London 09 Ja...
 
What to expect when you are visualizing
What to expect when you are visualizingWhat to expect when you are visualizing
What to expect when you are visualizing
 
What I tell myself before visualizing
What I tell myself before visualizingWhat I tell myself before visualizing
What I tell myself before visualizing
 
Talentbin Sales Deck
Talentbin Sales DeckTalentbin Sales Deck
Talentbin Sales Deck
 
Talent Bin
Talent BinTalent Bin
Talent Bin
 
Seattle bot + Twitter data prezo
Seattle bot + Twitter data prezoSeattle bot + Twitter data prezo
Seattle bot + Twitter data prezo
 
Social Media Mining using R
Social Media Mining using RSocial Media Mining using R
Social Media Mining using R
 
Build 2017 - B8002 - Introducing Adaptive Cards
Build 2017 - B8002 - Introducing Adaptive CardsBuild 2017 - B8002 - Introducing Adaptive Cards
Build 2017 - B8002 - Introducing Adaptive Cards
 
The Value of Twitter
The Value of TwitterThe Value of Twitter
The Value of Twitter
 
GraphQL, l'avenir du REST par François ZANINOTTO
GraphQL, l'avenir du REST par François ZANINOTTOGraphQL, l'avenir du REST par François ZANINOTTO
GraphQL, l'avenir du REST par François ZANINOTTO
 
Government Next: NIC Presentation
Government Next: NIC PresentationGovernment Next: NIC Presentation
Government Next: NIC Presentation
 
Goodle Developer Days Munich 2008 - Open Social Update
Goodle Developer Days Munich 2008 - Open Social UpdateGoodle Developer Days Munich 2008 - Open Social Update
Goodle Developer Days Munich 2008 - Open Social Update
 
Sps mad2019 es el momento, empieza a desarrollar para microsoft teams
Sps mad2019   es el momento, empieza a desarrollar para microsoft teams Sps mad2019   es el momento, empieza a desarrollar para microsoft teams
Sps mad2019 es el momento, empieza a desarrollar para microsoft teams
 
Lies you have been told about REST
Lies you have been told about RESTLies you have been told about REST
Lies you have been told about REST
 
API Design - 3rd Edition
API Design - 3rd EditionAPI Design - 3rd Edition
API Design - 3rd Edition
 
Connecting to the Pulse of the Planet with the Twitter Platform
Connecting to the Pulse of the Planet with the Twitter PlatformConnecting to the Pulse of the Planet with the Twitter Platform
Connecting to the Pulse of the Planet with the Twitter Platform
 
Open Social Introduction - JUG SummerCamp 2010
Open Social Introduction - JUG SummerCamp 2010Open Social Introduction - JUG SummerCamp 2010
Open Social Introduction - JUG SummerCamp 2010
 

Mehr von Ismel - Istituto per la Memoria e la Cultura del Lavoro, dell'Impresa e dei Diritti Sociali

Mehr von Ismel - Istituto per la Memoria e la Cultura del Lavoro, dell'Impresa e dei Diritti Sociali (20)

Conoscenza è libertà - Francesco Fiermonte
Conoscenza è libertà - Francesco FiermonteConoscenza è libertà - Francesco Fiermonte
Conoscenza è libertà - Francesco Fiermonte
 
Oltre il '68 - Gianfranco Marocchi
Oltre il '68 - Gianfranco MarocchiOltre il '68 - Gianfranco Marocchi
Oltre il '68 - Gianfranco Marocchi
 
Oltre il '68 - Fiorenzo Alfieri
Oltre il '68 - Fiorenzo AlfieriOltre il '68 - Fiorenzo Alfieri
Oltre il '68 - Fiorenzo Alfieri
 
Oltre il '68 - Giancarlo Gonella
Oltre il '68 - Giancarlo GonellaOltre il '68 - Giancarlo Gonella
Oltre il '68 - Giancarlo Gonella
 
Oltre il '68 - Danila Mezzano
Oltre il '68 - Danila MezzanoOltre il '68 - Danila Mezzano
Oltre il '68 - Danila Mezzano
 
Oltre il '68 - Nicoletta Fratta
Oltre il '68 - Nicoletta FrattaOltre il '68 - Nicoletta Fratta
Oltre il '68 - Nicoletta Fratta
 
Oltre il '68 - Giovanni Ferrero
Oltre il '68 - Giovanni FerreroOltre il '68 - Giovanni Ferrero
Oltre il '68 - Giovanni Ferrero
 
Torino. Automotive Heritage - Rossella Maspoli
Torino. Automotive Heritage - Rossella MaspoliTorino. Automotive Heritage - Rossella Maspoli
Torino. Automotive Heritage - Rossella Maspoli
 
Festival del Mutualismo | Il mutuo soccorso aumentato: chiama il tuo gemello ...
Festival del Mutualismo | Il mutuo soccorso aumentato: chiama il tuo gemello ...Festival del Mutualismo | Il mutuo soccorso aumentato: chiama il tuo gemello ...
Festival del Mutualismo | Il mutuo soccorso aumentato: chiama il tuo gemello ...
 
Cambiamenti del lavoro, modelli organizzativi e partecipazione dei lavoratori...
Cambiamenti del lavoro, modelli organizzativi e partecipazione dei lavoratori...Cambiamenti del lavoro, modelli organizzativi e partecipazione dei lavoratori...
Cambiamenti del lavoro, modelli organizzativi e partecipazione dei lavoratori...
 
Il dialogo sociale nel contesto delle Istituzioni europee: riconoscimento deg...
Il dialogo sociale nel contesto delle Istituzioni europee: riconoscimento deg...Il dialogo sociale nel contesto delle Istituzioni europee: riconoscimento deg...
Il dialogo sociale nel contesto delle Istituzioni europee: riconoscimento deg...
 
Lavoro, partecipazione e impresa nella Costituzione italiana - Paolo Tosi
Lavoro, partecipazione e impresa nella Costituzione italiana - Paolo TosiLavoro, partecipazione e impresa nella Costituzione italiana - Paolo Tosi
Lavoro, partecipazione e impresa nella Costituzione italiana - Paolo Tosi
 
La ricerca scientifica nell'era dei Big Data - Sabina Leonelli
La ricerca scientifica nell'era dei Big Data - Sabina LeonelliLa ricerca scientifica nell'era dei Big Data - Sabina Leonelli
La ricerca scientifica nell'era dei Big Data - Sabina Leonelli
 
La condizione delle donne nel mercato del lavoro in Piemonte - Mauro Zangola
La condizione delle donne nel mercato del lavoro in Piemonte - Mauro ZangolaLa condizione delle donne nel mercato del lavoro in Piemonte - Mauro Zangola
La condizione delle donne nel mercato del lavoro in Piemonte - Mauro Zangola
 
Valore lavoro. Strategie e vissuti di donne nel mondo del lavoro
Valore lavoro. Strategie e vissuti di donne nel mondo del lavoroValore lavoro. Strategie e vissuti di donne nel mondo del lavoro
Valore lavoro. Strategie e vissuti di donne nel mondo del lavoro
 
Internet delle Cose: tecnologie e campi di applicazione
Internet delle Cose: tecnologie e campi di applicazioneInternet delle Cose: tecnologie e campi di applicazione
Internet delle Cose: tecnologie e campi di applicazione
 
Internet delle Cose: casi di industria 4.0 e azione sindacale
Internet delle Cose: casi di industria 4.0 e azione sindacaleInternet delle Cose: casi di industria 4.0 e azione sindacale
Internet delle Cose: casi di industria 4.0 e azione sindacale
 
Una valutazione critica di industria 4.0
Una valutazione critica di industria 4.0Una valutazione critica di industria 4.0
Una valutazione critica di industria 4.0
 
L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...
L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...
L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...
 
L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...
L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...
L'industria della finanza, la digitalizzazione dei processi lavorativi e la b...
 

Kürzlich hochgeladen

Guide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDFGuide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDFChandresh Chudasama
 
Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...Americas Got Grants
 
Fordham -How effective decision-making is within the IT department - Analysis...
Fordham -How effective decision-making is within the IT department - Analysis...Fordham -How effective decision-making is within the IT department - Analysis...
Fordham -How effective decision-making is within the IT department - Analysis...Peter Ward
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfrichard876048
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfRbc Rbcua
 
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptxThe-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptxmbikashkanyari
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCRashishs7044
 
Call Girls Contact Number Andheri 9920874524
Call Girls Contact Number Andheri 9920874524Call Girls Contact Number Andheri 9920874524
Call Girls Contact Number Andheri 9920874524najka9823
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Pereraictsugar
 
8447779800, Low rate Call girls in Dwarka mor Delhi NCR
8447779800, Low rate Call girls in Dwarka mor Delhi NCR8447779800, Low rate Call girls in Dwarka mor Delhi NCR
8447779800, Low rate Call girls in Dwarka mor Delhi NCRashishs7044
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCRashishs7044
 
TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024Adnet Communications
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Riya Pathan
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCRashishs7044
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607dollysharma2066
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
Entrepreneurship lessons in Philippines
Entrepreneurship lessons in  PhilippinesEntrepreneurship lessons in  Philippines
Entrepreneurship lessons in PhilippinesDavidSamuel525586
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationAnamaria Contreras
 
Memorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMMemorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMVoces Mineras
 

Kürzlich hochgeladen (20)

Guide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDFGuide Complete Set of Residential Architectural Drawings PDF
Guide Complete Set of Residential Architectural Drawings PDF
 
Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...Church Building Grants To Assist With New Construction, Additions, And Restor...
Church Building Grants To Assist With New Construction, Additions, And Restor...
 
Fordham -How effective decision-making is within the IT department - Analysis...
Fordham -How effective decision-making is within the IT department - Analysis...Fordham -How effective decision-making is within the IT department - Analysis...
Fordham -How effective decision-making is within the IT department - Analysis...
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdf
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdf
 
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptxThe-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
The-Ethical-issues-ghhhhhhhhjof-Byjus.pptx
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
 
Call Girls Contact Number Andheri 9920874524
Call Girls Contact Number Andheri 9920874524Call Girls Contact Number Andheri 9920874524
Call Girls Contact Number Andheri 9920874524
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Perera
 
8447779800, Low rate Call girls in Dwarka mor Delhi NCR
8447779800, Low rate Call girls in Dwarka mor Delhi NCR8447779800, Low rate Call girls in Dwarka mor Delhi NCR
8447779800, Low rate Call girls in Dwarka mor Delhi NCR
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
 
TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024TriStar Gold Corporate Presentation - April 2024
TriStar Gold Corporate Presentation - April 2024
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
Entrepreneurship lessons in Philippines
Entrepreneurship lessons in  PhilippinesEntrepreneurship lessons in  Philippines
Entrepreneurship lessons in Philippines
 
PSCC - Capability Statement Presentation
PSCC - Capability Statement PresentationPSCC - Capability Statement Presentation
PSCC - Capability Statement Presentation
 
Memorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMMemorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQM
 

Big data. Opportunità e rischi

  • 1. BIG DATA OPPORTUNITÀ E RISCHI Le parole dell'innovazione e il lavoro Paolo Bajardi, PhD Applied Data Science Manager, ISI Foundation Torino - Aprile 4, 2019
  • 2. ISI Foundation www.isi.it ‣ basic and applied research ‣ 35+ years of history ‣ ~50+ reseachers ‣ Turin, Italy & New York, USA ‣ international network ‣ supported by: • institutional philanthropy • research grants • industrial partnerships ‣ focus on • data science & AI • complex systems science • comp. soc. sci, comp. epi.
  • 4.
  • 5. GOOGLE TRENDS digital transformation Avanzamenti scientifici e trasferimento nelle applicazioni
  • 6. […] Companies are placing big bets on data and analytics. But adapting to an era of more data-driven decision making has not always proven to be a simple proposition for people or organizations. Many are struggling to develop talent, business processes, and organizational muscle to capture real value from analytics. McKinsey Insights (2016)
  • 9. toddwschneider.com 1.1 miliardi di chiamate taxi 6 anni 350 Gb di dati New York City
  • 10. tracce digitali prospettiva storica orizzonte temporale limitato riproducibilità limitata contesto limitato privacy e protezione dei dati disponibili come effetto collaterale di attività ordinarie alto livello di copertura, accesso alle grandi scale possibilità di elaborazione automatica
  • 11. 73% della popolazione accede ad Internet 57% della popolazione usa social media
 51% accede da smartphone 6+ ore al giorno online wearesocial.com/blog/2018/01/global-digital-report-2018 Italia:
  • 13. {"id"=>12296272736, "text"=> "An early look at Annotations: http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453", "created_at"=>"Fri Apr 16 17:55:46 +0000 2010", "in_reply_to_user_id"=>nil, "in_reply_to_screen_name"=>nil, "in_reply_to_status_id"=>nil "favorited"=>false, "truncated"=>false, "user"=> {"id"=>6253282, "screen_name"=>"twitterapi", "name"=>"Twitter API", "description"=> "The Real Twitter API. I tweet about API changes, service issues and happily answer questions about Twitter and our API. Don't get an answer? It's on my website.", "url"=>"http://apiwiki.twitter.com", "location"=>"San Francisco, CA", "profile_background_color"=>"c1dfee", "profile_background_image_url"=> "http://a3.twimg.com/profile_background_images/59931895/twitterapi-background-new.png", "profile_background_tile"=>false, "profile_image_url"=>"http://a3.twimg.com/profile_images/689684365/api_normal.png", "profile_link_color"=>"0000ff", "profile_sidebar_border_color"=>"87bc44", "profile_sidebar_fill_color"=>"e0ff92", "profile_text_color"=>"000000", "created_at"=>"Wed May 23 06:01:13 +0000 2007", "contributors_enabled"=>true, "favourites_count"=>1, "statuses_count"=>1628, "friends_count"=>13, "time_zone"=>"Pacific Time (US & Canada)", "utc_offset"=>-28800, "lang"=>"en", "protected"=>false, "followers_count"=>100581, The tweet's unique ID. These IDs are roughly sorted & developers should treat them as opaque (http://bit.ly/dCkppc). Text of the tweet. Consecutive duplicate tweets are rejected. 140 character max (http://bit.ly/4ud3he). Tweet's creation date. DEPRECATED The ID of an existing tweet that this tweet is in reply to. Won't be set unless the author of the referenced tweet is mentioned. The screen name & user ID of replied to tweet author. Truncated to 140 characters. Only possible from SMS. Theauthorofthetweet.This embeddedobjectcangetoutofsync. Theauthor's userID. The author's user name. The author's screen name. The author's biography. The author's URL. The author's "location". This is a free-form text field, and there are no guarantees on whether it can be geocoded. Rendering information for the author. Colors are encoded in hex values (RGB). The creation date for this account. Whether this account has contributors enabled (http://bit.ly/50npuu). Number of favorites this user has. Numberoftweets thisuserhas. Number of users this user is following.The timezone and offset (in seconds) for this user. The user's selected language. metadati
  • 14. "profile_sidebar_border_color"=>"87bc44", "profile_sidebar_fill_color"=>"e0ff92", "profile_text_color"=>"000000", "created_at"=>"Wed May 23 06:01:13 +0000 2007", "contributors_enabled"=>true, "favourites_count"=>1, "statuses_count"=>1628, "friends_count"=>13, "time_zone"=>"Pacific Time (US & Canada)", "utc_offset"=>-28800, "lang"=>"en", "protected"=>false, "followers_count"=>100581, "geo_enabled"=>true, "notifications"=>false, "following"=>true, "verified"=>true}, "contributors"=>[3191321], "geo"=>nil, "coordinates"=>nil, "place"=> {"id"=>"2b6ff8c22edd9576", "url"=>"http://api.twitter.com/1/geo/id/2b6ff8c22edd9576.json", "name"=>"SoMa", "full_name"=>"SoMa, San Francisco", "place_type"=>"neighborhood", "country_code"=>"US", "country"=>"The United States of America", "bounding_box"=> {"coordinates"=> [[[-122.42284884, 37.76893497], [-122.3964, 37.76893497], [-122.3964, 37.78752897], [-122.42284884, 37.78752897]]], "type"=>"Polygon"}}, "source"=>"web"} em The creation date for this account. Whether this account has contributors enabled (http://bit.ly/50npuu). Number of favorites this user has. Numberoftweets thisuserhas. Number of users this user is following.The timezone and offset (in seconds) for this user. The user's selected language. Whether this user is protected or not. If the user is protected, then this tweet is not visible except to "friends". Number of followers for this user. Whetherthisuserhasgeo enabled(http://bit.ly/4pFY77). DEPRECATED in this context Whether this user has a verified badge. Thegeotagonthistweetin GeoJSON(http://bit.ly/b8L1Cp). The contributors' (if any) user IDs (http://bit.ly/50npuu). DEPRECATED The place associated with this Tweet (http://bit.ly/b8L1Cp). The place ID The URL to fetch a detailed polygon for this placeThe printable names of this place The type of this place - can be a "neighborhood" or "city" The country this place is in The bounding box for this place The application that sent this tweet Map of a Twitter Status Object Raffi Krikorian <raffi@twitter.com> 18 April 2010
  • 15. {"id"=>12296272736, "text"=> "An early look at Annotations: http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453", "created_at"=>"Fri Apr 16 17:55:46 +0000 2010", "in_reply_to_user_id"=>nil, "in_reply_to_screen_name"=>nil, "in_reply_to_status_id"=>nil "favorited"=>false, "truncated"=>false, "user"=> {"id"=>6253282, "screen_name"=>"twitterapi", "name"=>"Twitter API", "description"=> "The Real Twitter API. I tweet about API changes, service issues and happily answer questions about Twitter and our API. Don't get an answer? It's on my website.", "url"=>"http://apiwiki.twitter.com", "location"=>"San Francisco, CA", "profile_background_color"=>"c1dfee", "profile_background_image_url"=> "http://a3.twimg.com/profile_background_images/59931895/twitterapi-background-new.png", "profile_background_tile"=>false, "profile_image_url"=>"http://a3.twimg.com/profile_images/689684365/api_normal.png", "profile_link_color"=>"0000ff", "profile_sidebar_border_color"=>"87bc44", "profile_sidebar_fill_color"=>"e0ff92", "profile_text_color"=>"000000", "created_at"=>"Wed May 23 06:01:13 +0000 2007", "contributors_enabled"=>true, "favourites_count"=>1, "statuses_count"=>1628, "friends_count"=>13, "time_zone"=>"Pacific Time (US & Canada)", "utc_offset"=>-28800, "lang"=>"en", "protected"=>false, "followers_count"=>100581, The tweet's unique ID. These IDs are roughly sorted & developers should treat them as opaque (http://bit.ly/dCkppc). Text of the tweet. Consecutive duplicate tweets are rejected. 140 character max (http://bit.ly/4ud3he). Tweet's creation date. DEPRECATED The ID of an existing tweet that this tweet is in reply to. Won't be set unless the author of the referenced tweet is mentioned. The screen name & user ID of replied to tweet author. Truncated to 140 characters. Only possible from SMS. Theauthorofthetweet.This embeddedobjectcangetoutofsync. Theauthor's userID. The author's user name. The author's screen name. The author's biography. The author's URL. The author's "location". This is a free-form text field, and there are no guarantees on whether it can be geocoded. Rendering information for the author. Colors are encoded in hex values (RGB). The creation date for this account. Whether this account has contributors enabled (http://bit.ly/50npuu). Number of favorites this user has. Numberoftweets thisuserhas. Number of users this user is following.The timezone and offset (in seconds) for this user. The user's selected language. metadata
  • 16. "profile_sidebar_border_color"=>"87bc44", "profile_sidebar_fill_color"=>"e0ff92", "profile_text_color"=>"000000", "created_at"=>"Wed May 23 06:01:13 +0000 2007", "contributors_enabled"=>true, "favourites_count"=>1, "statuses_count"=>1628, "friends_count"=>13, "time_zone"=>"Pacific Time (US & Canada)", "utc_offset"=>-28800, "lang"=>"en", "protected"=>false, "followers_count"=>100581, "geo_enabled"=>true, "notifications"=>false, "following"=>true, "verified"=>true}, "contributors"=>[3191321], "geo"=>nil, "coordinates"=>nil, "place"=> {"id"=>"2b6ff8c22edd9576", "url"=>"http://api.twitter.com/1/geo/id/2b6ff8c22edd9576.json", "name"=>"SoMa", "full_name"=>"SoMa, San Francisco", "place_type"=>"neighborhood", "country_code"=>"US", "country"=>"The United States of America", "bounding_box"=> {"coordinates"=> [[[-122.42284884, 37.76893497], [-122.3964, 37.76893497], [-122.3964, 37.78752897], [-122.42284884, 37.78752897]]], "type"=>"Polygon"}}, "source"=>"web"} em The creation date for this account. Whether this account has contributors enabled (http://bit.ly/50npuu). Number of favorites this user has. Numberoftweets thisuserhas. Number of users this user is following.The timezone and offset (in seconds) for this user. The user's selected language. Whether this user is protected or not. If the user is protected, then this tweet is not visible except to "friends". Number of followers for this user. Whetherthisuserhasgeo enabled(http://bit.ly/4pFY77). DEPRECATED in this context Whether this user has a verified badge. Thegeotagonthistweetin GeoJSON(http://bit.ly/b8L1Cp). The contributors' (if any) user IDs (http://bit.ly/50npuu). DEPRECATED The place associated with this Tweet (http://bit.ly/b8L1Cp). The place ID The URL to fetch a detailed polygon for this placeThe printable names of this place The type of this place - can be a "neighborhood" or "city" The country this place is in The bounding box for this place The application that sent this tweet Map of a Twitter Status Object Raffi Krikorian <raffi@twitter.com> 18 April 2010
  • 17.
  • 18. J. Ginsberg et al., Nature 457, 1012 (2009) google.org/flutrends Segnali “impliciti”
  • 19.
  • 20.
  • 25. Lyon, France primary school 231 students 10 teachers J. Stehlé et al., PLoS ONE 6(8), e23176 (2011)
  • 27.
  • 28.
  • 30. L. Ozella et al., arxiv.org/abs/1809.06887 (JMIR, in press)
  • 34. thehumanproject.org A. Okan et al., “Using Big Data to Understand the Human Condition: The Kavli HUMAN Project”, Big Data 3:3 (2015)
  • 36. ‣ più dati comportamentali da piattaforme digitali ‣ grandi coorti, visibilità di intere comunità, risoluzione di comportamenti individuali
 su lunghi orizzonti temporali ‣ uso crescente di dati non-strutturati ‣ connessione sempre più stretta fra mondo fisico e mondo digitale: sensori, ambienti intelligenti, Internet of Things ‣ uso crescente di dati non tradizionali e/o esterni,
 nuove partnership legate allo scambio dei dati trend
  • 37. ‣ è possibile usare metodi automatici per estrarre regolarità e generare ipotesi, usando statistica inferenziale, data mining, machine learning, analisi del linguaggio naturale, visualizzazione dati, etc. ‣ i modelli matematici sono costruiti su un ricco substrato di dati (transazioni, social media, mobilità, preferenze espresse o inferite) e sono informati da grandi basi di dati e da flussi di dati in tempo reale ‣ è possibile confrontare modello e realtà di un sistema
 a velocità e scale che non hanno precedenti l’immagine digitale del mondo è sempre più fedele alla realtà trend
  • 39. “modello” ? • modello matematico • modello statistico • modello generativo • modello di apprendimento automatico • modello descrittivo • modello dinamico • modello ad agenti • modello predittivo (di fattori ignoti) • modello predittivo (del futuro) • …
  • 40.
  • 41. mobilità umanapopolazione scala geografica short range mobility layerpopulation layer long range mobility layer101 105 101 105 Balcan et al. PNAS 2009 pendolarismo viaggio aereo esempio: predire un'epidemica
  • 43. 45 “Digital Phenotype” Computational Social Science + + Behavioral Economics, Social Sciences, Game Theory, … models of people
  • 44. machine learning Y. Abu-Mustafa, M. Magdon-Ismail, H.-T. Lin Learning from Data, http://amlbook.com
  • 45. Learning from Data, http://amlbook.com machine learning
  • 47. sign( lX i=1 yi(↵i/⇢)(K(xi, x) + b)) decisione algoritmica
  • 49.
  • 51. Perchè è così difficile?
  • 53.
  • 56. “The CNN (convolutional neural network) achieves performance on par with all tested experts across both tasks, demonstrating an artificial intelligence capable of classifying skin cancer with a level of competence comparable to dermatologists.”
  • 57.
  • 59. modelli matematici, sistemi complessi, comp. soc. sci., statistica, … data mining, machine learning, natural language processing, … dati da piattaforme digitali expertise di dominio decisioni & politiche dai dati ai modelli alle decisioni
  • 60. “This is a world where massive amounts of data and applied mathematics replace every other tool that might be brought to bear. Out with every theory of human behavior, from linguistics to sociology. Forget taxonomy, ontology, and psychology. Who knows why people do what they do? The point is they do it, and we can track and measure it with unprecedented fidelity. With enough data, the numbers speak for themselves.” Chris Anderson (2008)
  • 61. Bias e discriminazione algoritmica:
 sfide etiche e regolatorie  ‣ accesso ai dati ‣ bias delle sorgenti di dati ‣ leggibilità dei modelli ‣ big (personal) data ‣ dati industriali e nuove partnership
  • 62. sign( lX i=1 yi(↵i/⇢)(K(xi, x) + b)) decisione algoritmica
  • 65. “ […] ensure that by using big data algorithms [firms] are not accidentally classifying people based on categories that society has decided— by law or ethics— not to use, such as race, ethnic background, gender, and sexual orientation.” Edith Ramirez, chair of the Federal Trade Commission discriminazione algoritmica + ++ + + + + + + + + + + + +++ + o o o o oo o o o oo oo o o (attributo sensibile) attributononsensibile M F
  • 66. sign( lX i=1 yi(↵i/⇢)(K(xi, x) + b)) decisione algoritmica
  • 69. Moritz Hardt, medium.com/@mrtz/how-big-data-is-unfair-9aa544d739de Understanding sources of unfairness in data driven decision making correct in 90% of the cases because of noise or correct in 90% because it misclassifies a 10% minority algorithmic discrimination
  • 70. sign( lX i=1 yi(↵i/⇢)(K(xi, x) + b)) decisione algoritmica
  • 71. Attacchi a sistemi algoritmici
  • 72. Attacchi a sistemi algoritmici
  • 73. Attacchi a sistemi algoritmici
  • 75. la nuova prospettiva sui dati personali http://www.weforum.org/issues/rethinking-personal-data
  • 76. sfida: nuove partnership istituzionali competenze accesso ai dati+ non tradizionali non tradizionali RESEARCH & INSIGHTS REVENUE RESPONSIBILITY RECIPROCITY REPUTATION REGULATORY COMPLIANCE • dati commerciali • dati sensibili • big / fast data