SlideShare ist ein Scribd-Unternehmen logo
1 von 32
Downloaden Sie, um offline zu lesen
@I seek ‘fb.me’:
Identifying Users across Multiple
Online Social Networks
Workshop	
  on	
  Web	
  of	
  Linked	
  En11es	
  (WoLE)
Paridhi	
  Jain¶,	
  Ponnurangam	
  Kumaraguru¶,	
  Anupam	
  Joshi*
¶Indraprastha	
  Ins1tute	
  of	
  Informa1on	
  Technology	
  (IIIT-­‐Delhi)

*University	
  of	
  Maryland,	
  Bal1more	
  County	
  (UMBC)

1
Motivation
Multiple OSNs
Multiple Identities

Difficult to manage? Difficult to find?

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

2

2
Motivation
Multiple OSNs
Multiple Identities

Social Aggregation site

Difficult to manage? Difficult to find?

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

2

2
Motivation
Multiple OSNs
Multiple Identities

Social Aggregation site

Difficult to manage? Difficult to find?

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

Friend	
  Finder?
Malicious	
  user?
Influen1al	
  user?
User	
  of	
  interest?

2

2
Motivation
Multiple OSNs
Multiple Identities

Social Aggregation site

Difficult to manage? Difficult to find?

Friend	
  Finder?
Malicious	
  user?
Influen1al	
  user?
User	
  of	
  interest?

Identity Resolution Problem
13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

2

2
Identity Resolution
• For a user I, given a user identity IA on a social network A, find user
identity IB on social network B.

{IA}

Alice

13/05/13

{IB}

??

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

3
3
Identity Resolution =
Identity Search + Identity Matching
•

Identity Search
For a user I, given her identity IA on a social network A, and a search
parameter S, find the set of identities IBj on social network B such that
S(IA) ⋍ S(IB).

{IA,S}
•

{IB1, ... IBj, ... , IBN} = Q

Identity Matching
Given a user identity IA on a social network A, a set of candidate
identities Q on social network B, and a match function M, locate an
identity pair (IA, IBj) such that M(IA, IBj) = max{M(IA, IB1), M(IA, IBN)}

{IA, Q, M}
13/05/13

{IA, IBj}

{IB}

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

4
4
Research Gaps?
– Till	
  now,	
  focus	
  on	
  bePer	
  iden1ty	
  matching	
  algorithms
– Only	
  profile	
  aPributes	
  (private	
  and	
  public)	
  for	
  Iden1ty	
  Search
– Limita1ons	
  of	
  Profile	
  Search	
  -­‐
– Restric1ve	
  search,	
  owing	
  to	
  non-­‐availability	
  of	
  common	
  aPributes	
  across	
  
networks.	
  [Gender	
  on	
  Facebook,	
  but	
  not	
  on	
  TwiPer]
– Search	
  with	
  Limited	
  aPributes	
  →	
  Large	
  candidate	
  set	
  size	
  →	
  Intensive	
  
Iden1ty	
  Matching	
  computa1on
– Users	
  may	
  choose	
  different	
  profile	
  aPributes	
  →	
  Miss	
  out	
  correct	
  iden1ty	
  in	
  
the	
  candidate	
  set
– LiPle	
  research	
  on	
  using	
  content	
  and	
  network	
  aPributes	
  to	
  search	
  for	
  candidate	
  
iden11es
– Extensive	
  use	
  of	
  both	
  private	
  and	
  public	
  aPributes.	
  Need	
  user	
  authoriza1on	
  for	
  
iden1ty	
  search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

5
5
Research Gaps?
– Till	
  now,	
  focus	
  on	
  bePer	
  iden1ty	
  matching	
  algorithms
– Only	
  profile	
  aPributes	
  (private	
  and	
  public)	
  for	
  Iden1ty	
  Search
– Limita1ons	
  of	
  Profile	
  Search	
  -­‐
– Restric1ve	
  search,	
  owing	
  to	
  non-­‐availability	
  of	
  common	
  aPributes	
  across	
  
networks.	
  [Gender	
  on	
  Facebook,	
  but	
  not	
  on	
  TwiPer]
– Search	
  with	
  Limited	
  aPributes	
  →	
  Large	
  candidate	
  set	
  size	
  →	
  Intensive	
  
Iden1ty	
  Matching	
  computa1on
– Users	
  may	
  choose	
  different	
  profile	
  aPributes	
  →	
  Miss	
  out	
  correct	
  iden1ty	
  in	
  
the	
  candidate	
  set
– LiPle	
  research	
  on	
  using	
  content	
  and	
  network	
  aPributes	
  to	
  search	
  for	
  candidate	
  
iden11es
– Extensive	
  use	
  of	
  both	
  private	
  and	
  public	
  aPributes.	
  Need	
  user	
  authoriza1on	
  for	
  
iden1ty	
  search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

6
6
Research Gaps?
– Till	
  now,	
  focus	
  on	
  bePer	
  iden1ty	
  matching	
  algorithms
– Only	
  profile	
  aPributes	
  (private	
  and	
  public)	
  for	
  Iden1ty	
  Search
– Limita1ons	
  of	
  Profile	
  Search	
  -­‐
– Restric1ve	
  search,	
  owing	
  to	
  non-­‐availability	
  of	
  common	
  aPributes	
  across	
  
networks.	
  [Gender	
  on	
  Facebook,	
  but	
  not	
  on	
  TwiPer]
– Search	
  with	
  Limited	
  aPributes	
  →	
  Large	
  candidate	
  set	
  size	
  →	
  Intensive	
  
Iden1ty	
  Matching	
  computa1on
– Users	
  may	
  choose	
  different	
  profile	
  aPributes	
  →	
  Miss	
  out	
  correct	
  iden1ty	
  in	
  
the	
  candidate	
  set
– LiPle	
  research	
  on	
  using	
  content	
  and	
  network	
  aPributes	
  to	
  search	
  for	
  candidate	
  
iden11es
– Extensive	
  use	
  of	
  both	
  private	
  and	
  public	
  aPributes.	
  Need	
  user	
  authoriza1on	
  for	
  
iden1ty	
  search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

7
7
Proposal
– Include	
  content	
  and	
  network	
  aPributes	
  as	
  search	
  parameters
– Access	
  only	
  publicly	
  accessible	
  aPributes
– Focus	
  on	
  two	
  popular	
  social	
  networks	
  -­‐	
  TwiPer	
  and	
  Facebook

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

8
8
Contribution
– Proposed	
  novel	
  iden1ty	
  search	
  methods	
  on	
  social	
  networks
– Our	
  iden1ty	
  resolu1on	
  methods	
  return	
  correct	
  Facebook	
  iden1ty	
  for	
  39%	
  
TwiPer	
  users	
  within	
  top-­‐2	
  ranks
– We	
  observe	
  an	
  increase	
  in	
  accuracy	
  of	
  iden1ty	
  resolu1on	
  by	
  11.6%	
  owing	
  to	
  
inclusion	
  of	
  content	
  and	
  network	
  iden1ty	
  search,	
  along	
  with	
  improvised	
  profile	
  
search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

9
9
Methodology
?
?
?
?

Search

13/05/13

Candidate
Identities

If self-identified /
returned by
more than one
search method

Yes

No

Syntactic
and Image

Manual
Verification

Match

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

10
10
Identity Matching
– Syntac1c	
  Matching
– Jaro	
  Distance	
  comparison	
  between	
  username	
  and	
  name
– Example:	
  {alice123,	
  jane_alice},	
  {Alice	
  Naura,	
  Alice	
  N.	
  Janice}

– Image	
  Matching

where	
  hIA	
  and	
  hIBj	
  are	
  the	
  RGB	
  histograms	
  of	
  the	
  profile	
  image	
  and	
  Ns	
  represent	
  
histogram	
  size	
  of	
  IA

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

11
11
Profile Search
Self	
  -­‐	
  Iden1fica1on	
  

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

12
12
Profile Search
Self	
  -­‐	
  Iden1fica1on	
  

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

12
12
Content Search

13

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

13
Content Search

13

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

13
Self-mention Search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

14
14
Self-mention Search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

14
14
Network Search

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

15
15
Instance,

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

16
16
Instance,

Public	
  Friend	
  List	
  
of	
  a	
  user	
  extracted	
  
from	
  public	
  feeds

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

16
16
Integrated System -

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

17
17
Evaluation
Dataset

# of users

Social Graph API

543

Method (543 users)

% Accurate

Profile (P)

205

37.7

Content (C + SM)

34

6.3

Network (N)

1

0.2

Finding Nemo

13/05/13

# of users

212

39

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

18
18
Evaluation
Dataset

# of users

Social Graph API

543

Method (543 users)

% Accurate

Profile (P)

205

37.7

Content (C + SM)

34

6.3

Network (N)

1

0.2

Finding Nemo

212

39

Search Algorithm

# of users
identified

Accuracy

P (without URL)

149

27.4%

P (with URL) + C + N +
SM
13/05/13

# of users

149+56+6+1 =
149+71

27.4% +
11.6%

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

18
18
Mean Average Precision

↓

Matching algorithm
Image (profile image)

0.83

Syntactic (username)

0.76

Syntactic (name)

13/05/13

MAP Score

0.80

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

19
19
Demo

hPp://www.youtube.com/watch?v=-­‐AFsCtKwO0c

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

20
20
Take away

Inclusion	
  of	
  content	
  and	
  network	
  a9ributes	
  for	
  iden1ty	
  search	
  
not	
  only	
  improves	
  iden1ty	
  resolu1on	
  accuracy	
  but	
  returns	
  
correct	
  Facebook	
  iden1ty	
  within	
  top-­‐2	
  ranks	
  for	
  majority	
  of	
  the	
  
TwiPer	
  users.

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

21
21
Current and Future Work
– Extend	
  the	
  social	
  networks	
  to	
  search	
  for	
  a	
  given	
  iden1ty.	
  
Example,	
  Google+,	
  Foursquare,	
  etc.
– Extend	
  the	
  search	
  methods	
  to	
  include	
  social-­‐network	
  specific	
  
features
– Find	
  mul1ple	
  (fake)	
  iden11es	
  of	
  users	
  within	
  social	
  networks

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

22
22
Questions?
paridhij@iiitd.ac.in,	
  pk@iiitd.ac.in,	
  joshi@cs.umbc.edu
precog.iiitd.edu.in
Paper:	
  hPp://precog.iiitd.edu.in/publica1ons.html

13/05/13

@I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks

23
23
For	
  any	
  further	
  informa1on,	
  please	
  write	
  to	
  
pk@iiitd.ac.in
precog.iiitd.edu.in

24

Weitere ähnliche Inhalte

Mehr von IIIT Hyderabad

Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityBeyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityIIIT Hyderabad
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias...
Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...
Data Science for Social Good: #LegalNLP #AlgorithmicBias...IIIT Hyderabad
 
How to Write a (Good) Research Paper
How to Write a (Good) Research Paper How to Write a (Good) Research Paper
How to Write a (Good) Research Paper IIIT Hyderabad
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBiasData Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBiasIIIT Hyderabad
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in IndiaIIIT Hyderabad
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in IndiaIIIT Hyderabad
 
Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...IIIT Hyderabad
 
Privacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT BombayPrivacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT BombayIIIT Hyderabad
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...IIIT Hyderabad
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...IIIT Hyderabad
 
Leveraging Social Media for Financial Advice
Leveraging Social Media for Financial AdviceLeveraging Social Media for Financial Advice
Leveraging Social Media for Financial AdviceIIIT Hyderabad
 
Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...IIIT Hyderabad
 
A Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian LanguagesA Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian LanguagesIIIT Hyderabad
 
A Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian LanguagesA Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian LanguagesIIIT Hyderabad
 
Exposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake NewsExposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake NewsIIIT Hyderabad
 
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
 It's MY JOB: Identifying and Improving Content Quality for Online recruitmen... It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...IIIT Hyderabad
 
De-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and OwnershipDe-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and OwnershipIIIT Hyderabad
 
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...IIIT Hyderabad
 
NLP / Language Research at Precog
NLP / Language Research at PrecogNLP / Language Research at Precog
NLP / Language Research at PrecogIIIT Hyderabad
 
“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...IIIT Hyderabad
 

Mehr von IIIT Hyderabad (20)

Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityBeyond the Surface: A Computational Exploration of Linguistic Ambiguity
Beyond the Surface: A Computational Exploration of Linguistic Ambiguity
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias...
Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...Data Science for Social Good:                      #LegalNLP #AlgorithmicBias...
Data Science for Social Good: #LegalNLP #AlgorithmicBias...
 
How to Write a (Good) Research Paper
How to Write a (Good) Research Paper How to Write a (Good) Research Paper
How to Write a (Good) Research Paper
 
Data Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBiasData Science for Social Good: #LegalNLP #AlgorithmicBias
Data Science for Social Good: #LegalNLP #AlgorithmicBias
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in India
 
Social Computing Research in India
Social Computing Research in IndiaSocial Computing Research in India
Social Computing Research in India
 
Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...Modeling Online User Interactions and their Offline effects on Socio-Technica...
Modeling Online User Interactions and their Offline effects on Socio-Technica...
 
Privacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT BombayPrivacy. Winter School on “Topics in Digital Trust”. IIT Bombay
Privacy. Winter School on “Topics in Digital Trust”. IIT Bombay
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...
 
It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...It is our choices, Harry, that show what we truly are, far more than our abil...
It is our choices, Harry, that show what we truly are, far more than our abil...
 
Leveraging Social Media for Financial Advice
Leveraging Social Media for Financial AdviceLeveraging Social Media for Financial Advice
Leveraging Social Media for Financial Advice
 
Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...Development of Stress Induction and Detection System to Study its Effect on B...
Development of Stress Induction and Detection System to Study its Effect on B...
 
A Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian LanguagesA Framework for Automatic Question Answering in Indian Languages
A Framework for Automatic Question Answering in Indian Languages
 
A Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian LanguagesA Framework For Automatic Question Answering in Indian Languages
A Framework For Automatic Question Answering in Indian Languages
 
Exposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake NewsExposing, Examining and Intervening Fake News
Exposing, Examining and Intervening Fake News
 
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
 It's MY JOB: Identifying and Improving Content Quality for Online recruitmen... It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
It's MY JOB: Identifying and Improving Content Quality for Online recruitmen...
 
De-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and OwnershipDe-anonymizing, Preserving and Democratizing Data Privacy and Ownership
De-anonymizing, Preserving and Democratizing Data Privacy and Ownership
 
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
Justice Delayed is Justice Denied: Enabling Legal Artificial Intelligence via...
 
NLP / Language Research at Precog
NLP / Language Research at PrecogNLP / Language Research at Precog
NLP / Language Research at Precog
 
“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...“It is our choices, Harry, that show what we truly are, far more than our abi...
“It is our choices, Harry, that show what we truly are, far more than our abi...
 

Kürzlich hochgeladen

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 

Kürzlich hochgeladen (20)

SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 

@I seek 'fb.me': Identifying Users across Multiple Online Social Networks

  • 1. @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks Workshop  on  Web  of  Linked  En11es  (WoLE) Paridhi  Jain¶,  Ponnurangam  Kumaraguru¶,  Anupam  Joshi* ¶Indraprastha  Ins1tute  of  Informa1on  Technology  (IIIT-­‐Delhi) *University  of  Maryland,  Bal1more  County  (UMBC) 1
  • 2. Motivation Multiple OSNs Multiple Identities Difficult to manage? Difficult to find? 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 2 2
  • 3. Motivation Multiple OSNs Multiple Identities Social Aggregation site Difficult to manage? Difficult to find? 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 2 2
  • 4. Motivation Multiple OSNs Multiple Identities Social Aggregation site Difficult to manage? Difficult to find? 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks Friend  Finder? Malicious  user? Influen1al  user? User  of  interest? 2 2
  • 5. Motivation Multiple OSNs Multiple Identities Social Aggregation site Difficult to manage? Difficult to find? Friend  Finder? Malicious  user? Influen1al  user? User  of  interest? Identity Resolution Problem 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 2 2
  • 6. Identity Resolution • For a user I, given a user identity IA on a social network A, find user identity IB on social network B. {IA} Alice 13/05/13 {IB} ?? @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 3 3
  • 7. Identity Resolution = Identity Search + Identity Matching • Identity Search For a user I, given her identity IA on a social network A, and a search parameter S, find the set of identities IBj on social network B such that S(IA) ⋍ S(IB). {IA,S} • {IB1, ... IBj, ... , IBN} = Q Identity Matching Given a user identity IA on a social network A, a set of candidate identities Q on social network B, and a match function M, locate an identity pair (IA, IBj) such that M(IA, IBj) = max{M(IA, IB1), M(IA, IBN)} {IA, Q, M} 13/05/13 {IA, IBj} {IB} @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 4 4
  • 8. Research Gaps? – Till  now,  focus  on  bePer  iden1ty  matching  algorithms – Only  profile  aPributes  (private  and  public)  for  Iden1ty  Search – Limita1ons  of  Profile  Search  -­‐ – Restric1ve  search,  owing  to  non-­‐availability  of  common  aPributes  across   networks.  [Gender  on  Facebook,  but  not  on  TwiPer] – Search  with  Limited  aPributes  →  Large  candidate  set  size  →  Intensive   Iden1ty  Matching  computa1on – Users  may  choose  different  profile  aPributes  →  Miss  out  correct  iden1ty  in   the  candidate  set – LiPle  research  on  using  content  and  network  aPributes  to  search  for  candidate   iden11es – Extensive  use  of  both  private  and  public  aPributes.  Need  user  authoriza1on  for   iden1ty  search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 5 5
  • 9. Research Gaps? – Till  now,  focus  on  bePer  iden1ty  matching  algorithms – Only  profile  aPributes  (private  and  public)  for  Iden1ty  Search – Limita1ons  of  Profile  Search  -­‐ – Restric1ve  search,  owing  to  non-­‐availability  of  common  aPributes  across   networks.  [Gender  on  Facebook,  but  not  on  TwiPer] – Search  with  Limited  aPributes  →  Large  candidate  set  size  →  Intensive   Iden1ty  Matching  computa1on – Users  may  choose  different  profile  aPributes  →  Miss  out  correct  iden1ty  in   the  candidate  set – LiPle  research  on  using  content  and  network  aPributes  to  search  for  candidate   iden11es – Extensive  use  of  both  private  and  public  aPributes.  Need  user  authoriza1on  for   iden1ty  search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 6 6
  • 10. Research Gaps? – Till  now,  focus  on  bePer  iden1ty  matching  algorithms – Only  profile  aPributes  (private  and  public)  for  Iden1ty  Search – Limita1ons  of  Profile  Search  -­‐ – Restric1ve  search,  owing  to  non-­‐availability  of  common  aPributes  across   networks.  [Gender  on  Facebook,  but  not  on  TwiPer] – Search  with  Limited  aPributes  →  Large  candidate  set  size  →  Intensive   Iden1ty  Matching  computa1on – Users  may  choose  different  profile  aPributes  →  Miss  out  correct  iden1ty  in   the  candidate  set – LiPle  research  on  using  content  and  network  aPributes  to  search  for  candidate   iden11es – Extensive  use  of  both  private  and  public  aPributes.  Need  user  authoriza1on  for   iden1ty  search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 7 7
  • 11. Proposal – Include  content  and  network  aPributes  as  search  parameters – Access  only  publicly  accessible  aPributes – Focus  on  two  popular  social  networks  -­‐  TwiPer  and  Facebook 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 8 8
  • 12. Contribution – Proposed  novel  iden1ty  search  methods  on  social  networks – Our  iden1ty  resolu1on  methods  return  correct  Facebook  iden1ty  for  39%   TwiPer  users  within  top-­‐2  ranks – We  observe  an  increase  in  accuracy  of  iden1ty  resolu1on  by  11.6%  owing  to   inclusion  of  content  and  network  iden1ty  search,  along  with  improvised  profile   search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 9 9
  • 13. Methodology ? ? ? ? Search 13/05/13 Candidate Identities If self-identified / returned by more than one search method Yes No Syntactic and Image Manual Verification Match @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 10 10
  • 14. Identity Matching – Syntac1c  Matching – Jaro  Distance  comparison  between  username  and  name – Example:  {alice123,  jane_alice},  {Alice  Naura,  Alice  N.  Janice} – Image  Matching where  hIA  and  hIBj  are  the  RGB  histograms  of  the  profile  image  and  Ns  represent   histogram  size  of  IA 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 11 11
  • 15. Profile Search Self  -­‐  Iden1fica1on   13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 12 12
  • 16. Profile Search Self  -­‐  Iden1fica1on   13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 12 12
  • 17. Content Search 13 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 13
  • 18. Content Search 13 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 13
  • 19. Self-mention Search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 14 14
  • 20. Self-mention Search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 14 14
  • 21. Network Search 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 15 15
  • 22. Instance, 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 16 16
  • 23. Instance, Public  Friend  List   of  a  user  extracted   from  public  feeds 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 16 16
  • 24. Integrated System - 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 17 17
  • 25. Evaluation Dataset # of users Social Graph API 543 Method (543 users) % Accurate Profile (P) 205 37.7 Content (C + SM) 34 6.3 Network (N) 1 0.2 Finding Nemo 13/05/13 # of users 212 39 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 18 18
  • 26. Evaluation Dataset # of users Social Graph API 543 Method (543 users) % Accurate Profile (P) 205 37.7 Content (C + SM) 34 6.3 Network (N) 1 0.2 Finding Nemo 212 39 Search Algorithm # of users identified Accuracy P (without URL) 149 27.4% P (with URL) + C + N + SM 13/05/13 # of users 149+56+6+1 = 149+71 27.4% + 11.6% @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 18 18
  • 27. Mean Average Precision ↓ Matching algorithm Image (profile image) 0.83 Syntactic (username) 0.76 Syntactic (name) 13/05/13 MAP Score 0.80 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 19 19
  • 28. Demo hPp://www.youtube.com/watch?v=-­‐AFsCtKwO0c 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 20 20
  • 29. Take away Inclusion  of  content  and  network  a9ributes  for  iden1ty  search   not  only  improves  iden1ty  resolu1on  accuracy  but  returns   correct  Facebook  iden1ty  within  top-­‐2  ranks  for  majority  of  the   TwiPer  users. 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 21 21
  • 30. Current and Future Work – Extend  the  social  networks  to  search  for  a  given  iden1ty.   Example,  Google+,  Foursquare,  etc. – Extend  the  search  methods  to  include  social-­‐network  specific   features – Find  mul1ple  (fake)  iden11es  of  users  within  social  networks 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 22 22
  • 31. Questions? paridhij@iiitd.ac.in,  pk@iiitd.ac.in,  joshi@cs.umbc.edu precog.iiitd.edu.in Paper:  hPp://precog.iiitd.edu.in/publica1ons.html 13/05/13 @I seek ‘fb.me’: Identifying Users across Multiple Online Social Networks 23 23
  • 32. For  any  further  informa1on,  please  write  to   pk@iiitd.ac.in precog.iiitd.edu.in 24