Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Research Using Behavioral Big Data (BBD)
1. Research Using Behavioral Big Data
Methodological, Practical, Ethical & Moral Issues
IEEE BigData Congress, Taipei Satellite Session, May 2016
Galit Shmueli 徐茉莉
Institute of Service Science
2. What is Behavioral Big Data (BBD)
• Special type of Big Data
• Behavioral: people’s actions, interactions,
self-reported opinions, thoughts, feelings
• Human and social aspects: Intentions,
deception, emotion, reciprocation, herding,…
• When aware of data collection -> modified
behavior (legal risks, embarrassment,
unwanted solicitation)
6. BBD on Citizens, Customers, Employees: Internet!
• BBD now also available to small companies & organizations
• Online platforms have BBD (e-commerce, gaming, search,
social networks…)
• Voluntarily entered by users: personal details, photos,
comments, messages, search terms, bids in auctions, likes,
payment information, connections with “friends”
• Passive footprints: duration on the website, pages browsed,
sequence, referring website, Internet browser, operating
system, location, IP address.
• BBD now available to individuals: Quantified Self (and apps)
15. ONE WAY MIRRORS IN
ONLINE DATING
A Randomized Field
Experiment
Ravi Bapna, University of Minnesota
Jui Ramaprasad, Mcgill University
Galit Shmueli, National Tsing Hua University
Akhmed Umyarov, University of Minnesota
16. Online Dating
46of the single population in the US uses online dating
to find a partner (Gelles 2011)
%
20. Research Question (in simple words)
How does
anonymous browsing
affect user behavior?
… and matching?
21. Formal Research Question
what is the relative causal effect
of social inhibitions on search
preferences vs. social inhibitions
of contact initiation in dating
markets?
given known gender
asymmetries, how does this
effect differ for men vs. women?
23. Results
Users treated with anonymity
become disinhibited
view more profiles, view more
same-sex and interracial mates
get less matches
lose ability to leave a weak signal
- especially harmful for women!
25. In Academia
Causal Qs are most popular
• Methodological challenges:
• scalability of stat models
• small-sample stat inference
• self-selection
Predictive Qs (quite rare)
• How to use results beyond
application-specific? 6 uses of
predictive analytics for theory
building [Shmueli & Koppius,
2011]
In Industry
Purpose: evaluate or improve
products, service, operations, etc.
• Netflix Prize: movie recommender
system
• Yahoo!, LinkedIn: personalized
news content to increase user
engagement/clicks [Agarwal &
Chen 2016]
• Target: pregnancy prediction
• Amazon: pricing, etc.
• Government: campaign targeting
BBD-based Research Questions
34. Observational BBD: Issues
Ethical and Moral Issues
• Privacy (Netflix)
• Data protection and reproducible
research
• Conflict of interest company-vs-users
(Study conclusions lead to
operational actions that trade-off
the company’s interest with user
well-being)
• AMT – payment to workers
Methodological Issues
1. Self-selection Bias
Users choose treatment
• Scaling of PSM to big data?
2. Simpson’s Paradox
Causal direction reverses when data are
disaggregated
• Does a dataset have a paradox?
3. Contamination by Experiments
4. Data Size & Dimension
Need very large+rich data to answer predictive Qs
[Junque de Fortuny et al. 2014]
43. Methodical Analysis Cycle of BBD
Inspired by Lifecycle view [Kenett, 2014], and stat thinking
building blocks [Hoerl et al. 2014]
1. understand company context and BBD
2. set up the research question
3. determine experimental design
4. obtain IRB approval (if needed)
5. possibly: pilot experiment
6. communicate design with company; assure feasibility
7. company deploys experiment and collects the data
8. company shares the data with the researchers
9. researchers analyze the data and arrive at conclusions
10. researchers share the insights and conclusions with company and research community
11. company operationalizes the insights to improve their business
12. company deploys impact study