SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Downloaden Sie, um offline zu lesen
Continuous Experience-aware
Language Model
Subhabrata Mukherjee†
, Stephan Günnemann‡
, Gerhard Weikum†
†
Max Planck Institute for Informatik, ‡
Technical University of Munich
22nd ACM SIGKDD Conference on
Knowledge Discovery and Data Mining, San Francisco, 2016
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 2
Outline
● Motivation
● Prior Work
● Current Work
– Modeling continuous experience evolution
– Modeling continuous language model evolution
– Inference
● Experiments
● Conclusions
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 3
Motivation
● Online communities are dynamic
– Users join and leave
– Adopt vocabulary, adapt to evolving trends
– Mature over time
● How to capture evolving user maturity?
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 4
Example 1
● Consider following camera reviews by John:
– My first DSLR. Excellent camera, takes great
pictures with high definition, without a doubt it
makes honor to its name. [Aug, 1997]
– The EF 75-300 mm lens is only good to be used
outside. The 2.2X HD lens can only be used for
specific items; filters are useless if ISO, AP,... .
The short 18-55mm lens is cheap and should
have a hood to keep light off lens.
[Oct, 2012]
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 5
Example 2
● Beer: Moosehead Lager
– The smell of grains a malts on the nose with
the slight hop aroma in there. The taste of
the beer is crisp ....
– The beer tastes absolutely terrible ...
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 6
Example
● Consider following camera reviews by John:
– My first DSLR. Excellent camera, take great
pictures with high definition, without a doubt it
makes honor to its name. [Aug, 1997]
– The EF 75-300 mm lens is only good to be used
outside. The 2.2X HD lens can only be used for
specific items; filters are useless if ISO, AP,... .
The short 18-55mm lens is cheap and should
have a hood to keep light off lens.
[Oct, 2012]
RQ1: How to quantify this change in user maturity
(referred to as experience in our work) ?
RQ2: How to model this evolution or
progression in maturity?
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 7
Use-cases
● Recommend item to a user based on her
maturity to consume it
– Maturity / Experience evolves over time
● Identify experienced users
– E.g.: Medical professionals in Health
communities
● Crowd-sourcing applications / aggregation /
community Q&A
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 8
Outline
● Motivation
● Prior Work
● Current Work
– Modeling continuous experience evolution
– Modeling continuous language model evolution
– Inference
● Experiments
● Conclusions
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 9
Prior Work: Experience Evolution
(1) Users at similar levels of experience have
similar rating behavior and facet preferences
[J. McAuley, J. Leskovec, WWW '13]
(2) Additionally, users at similar levels of
experience have similar writing style
[S. Mukherjee, H. Lamba, G, Weikum, ICDM '15]
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 10
Abrupt Transition
Prior Works: Discrete Experience
Assumption: At each timepoint (of writing a review) a user remains
at the same level of experience or moves to the next level
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 11
Current Work: Continuous Experience
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 12
Effect of Discrete Evolution on
Language Model
Language Model changes at the same
(discrete) level of experience.
Exchangeability principle violated in discrete model.
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 13
Outline
● Motivation
● Prior Work
● Current Work
– Modeling continuous experience evolution
– Modeling continuous language model evolution
– Inference
● Experiments
● Conclusions
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 14
Continuous Experience Evolution:
Assumptions
● Continuous-time process, always positive
● Markovian assumption: Experience at current time
depends only on the latest observed experience
● Drift: Overall trend to increase over time
● Volatility: Progression may not be smooth with occasional
volatility. E.g.: series of expert reviews followed by a
sloppy one
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 15
● Continuous-time process, always positive
● Markovian assumption: Experience at current time
depends only on the latest observed experience
● Drift: Overall trend to increase over time
● Volatility: Progression may not be smooth with occasional
volatility. E.g.: series of expert reviews followed by a
sloppy one
We show these properties to be satisfied by the
continuous-time stochastic process:
Geometric Brownian Motion
Continuous Experience Evolution:
Assumptions
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 16
Geometric Brownian Motion
● Stochastic process to model population growth,
financial processes like stock price behavior with
random noise
● Natural continuous-state alternative to discrete-state
space Hidden Markov Model
● Continuous time stochastic process, where log(Et
)
follows Brownian Motion with volatility and drift
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 17
Geometric Brownian Motion
● Stochastic Differential Equation: dEt
= μEt
dt + σEt
dWt
● Analytic solution: Et
= E0
exp( ( μ – σ2
/ 2 ) t ) + σ Wt
)
deterministic
trend
unpredictable
volatility
Et
→ Experience at time 't'
Starting experience trend volatility Wiener Process /
Standard Brownian Motion
t → time
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 18
Geometric Brownian Motion
● Stochastic Differential Equation: dEt
= μEt
dt + σEt
dWt
● Analytic solution: Et
= E0
exp( ( μ – σ2
/ 2 ) t ) + σ Wt
)
● Log(Et
) is log-normally distributed
deterministic
trend
Et
→ Experience at time 't'
Starting experience trend volatilityt → time
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 19
Outline
● Motivation
● Prior Work
● Current Work
– Modeling continuous experience evolution
– Modeling continuous language model evolution
– Inference
● Experiments
● Conclusions
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 20
Continuous Language Model (LM)
Evolution: Assumptions
● Users' LM also evolves with experience evolution
● Smoothly evolve over time preserving Markov property of
experience evolution
● Variance changes with experience change between timepoints
– If user's experience does not change between
successive timepoints, LM remains almost same
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 21
Continuous Language Model
βt,z,w
= Probability of observing word 'w' for facet
'z' at time 't'
βt,z,w
~ Normal ( βt-1,z,w
, σ ●
|et
- et-1
|)
LM at time 't' LM at time 't-1' Experience change
The smell of grains a malts on the nose with the slight hop aroma....
The taste of the beer is crisp
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 22
Continuous Language Model
βt,z,w
= Probability of observing word 'w' for facet
'z' at time 't'
βt,z,w
~ Normal ( βt-1,z,w
, σ ●
|et
- et-1
|)
LM at time 't' LM at time 't-1' Experience change
Following principles of standard dynamic systems
with Gaussian noise
The smell of grains a malts on the nose with the slight hop aroma....
The taste of the beer is crisp
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 23
Continuous Language Model (LM):
Challenges
● Experience and LM are continuous
distributions, but words in documents have to
be generated from discrete distribution
● Temporal granularity: LM does not evolve at
the same resolution as experience does
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 24
Outline
● Motivation
● Prior Work
● Current Work
– Modeling continuous experience evolution
– Modeling continuous language model evolution
– Inference
● Experiments
● Conclusions
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 25
Inference
Facets
(Latent)
Experience
(Latent)
Language Model
Words (Observed) at
(Observed) Timepoints
E.g.: The smell of grains a malts
on the nose with the slight hop aroma....
The taste of the beer is crisp
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 26
Inference
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 27
Inference
Topic (or, Facet) Model (Blei et al., JMLR '03)
+ Users
( Author-topic model, Rosen-Zvi et al., UAI '04)
+ Continuous Time
(Continuous time dynamic topic model,
Wang et al., UAI '08)
+ Continuous Experience
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 28
Inference: Estimate Facets
Facets
(Latent)
Estimate facets by Gibbs sampling as in standard
Latent Dirichlet Allocation
keeping LM and experience unchanged
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 29
Inference: Estimate LM
Language Model (Latent)
Words (Observed) at
(Observed) Timepoints
Estimate the following state transition model:
βt
~ Normal ( βt-1
, σ ●
Δet
)
wn
~ Multinomial ( f ( βt
) )
with Kalman Filter and previously inferred latent facets
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 30
Inference: Estimate LM
➢ Models sequential LM evolution
➢ Continuous-state analog to
discrete HMM
Estimate the following state transition model:
βt
~ Normal ( βt-1
, σ ●
Δet
)
wn
~ Multinomial ( f ( βt
) )
with Kalman Filter and previously inferred latent facets
Language Model (Latent)
Words (Observed) at
(Observed) Timepoints
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 31
Inference: Estimate Experience
Experience
(Latent)
et-1
et
et+1
βt-1
βt βt+1
Change in experience at time 't' affects
language models at time 't' and 't+1'
Exploit this to derive proposal distribution for
Metropolis Hastings (MCMC sampling)
βt
~ Normal ( βt-1
, σ |et
- et-1
|)
et
→ experience at time 't'
βt
→ LM at time 't'
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 32
Outline
● Motivation
● Prior Work
● Current Work
– Modeling continuous experience evolution
– Modeling continuous language model evolution
– Inference
● Experiments
● Conclusions
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 33
Dataset Statistics
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 34
Log-Likelihood, Smoothness, Convergence
Loglikelihood, Smoothness and
Convergence
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 35
Mean Squared Error:
Item Rating Prediction
RQ3: Can we recommend items better, if we consider
user experience ?
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 36
Mean Squared Error:
Item Rating Prediction
RQ3: Can we recommend items better if we consider
user experience ?
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 37
Experience Progression: Insights
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 38
Experience Progression: Insights
Experienced users mature faster than amateurs,
exhibit a higher variance
Progression depends more on time spent in
community than on activity
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 39
Overall usage of each word increased over time.
However evolution path is different.
Trace Evolving Norms with Time
and Experience
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 40
Interpretability: Top Words by Experienced Users
Interpretability: Top Words by
Experienced Users
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 41
Interpretability: Top Words by Experienced Users
Interpretability: Top Words by
Experienced Users
Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 42
Conclusions
● Users' experience evolve continuously in nature, along
with their language usage
● Experienced users have distinctive writing style, maturing
over time
● Recommendation models can be improved by explicitly
considering user experience
● Finally, we proposed a Brownian Motion based stochastic
model to capture the above phenomena

Weitere ähnliche Inhalte

Ähnlich wie Continuous Experience-aware Language Model

Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)Deep Learning Italia
 
UX Research in an Agile World
UX Research in an Agile WorldUX Research in an Agile World
UX Research in an Agile WorldHirajaved10
 
Supporting Learners in Adaptive Learning Environments through the enhancement...
Supporting Learners in Adaptive Learning Environments through the enhancement...Supporting Learners in Adaptive Learning Environments through the enhancement...
Supporting Learners in Adaptive Learning Environments through the enhancement...Luca Mazzola
 
Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?Hernan Wilkinson
 
CMPE255: Short Story Assignment FinGPT
CMPE255: Short Story Assignment FinGPTCMPE255: Short Story Assignment FinGPT
CMPE255: Short Story Assignment FinGPTshawnchumbar
 
Your learning journey to scrum mastery
Your learning journey to scrum masteryYour learning journey to scrum mastery
Your learning journey to scrum masterySiew Onn Wan
 
Extreme programming - Kent Beck
Extreme programming - Kent BeckExtreme programming - Kent Beck
Extreme programming - Kent BeckAngelLuisBlasco
 
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...UXPA International
 
Online learning platforms
Online learning platformsOnline learning platforms
Online learning platformsSergio Cigoli
 
Introducing Usability Concerns Early in the DSL Development Cycle: FlowSL Ex...
Introducing Usability Concerns Early in the DSL Development Cycle:  FlowSL Ex...Introducing Usability Concerns Early in the DSL Development Cycle:  FlowSL Ex...
Introducing Usability Concerns Early in the DSL Development Cycle: FlowSL Ex...Ankica Barisic
 
Designing Object Oriented Software - lecture slides 2013
Designing Object Oriented Software - lecture slides 2013Designing Object Oriented Software - lecture slides 2013
Designing Object Oriented Software - lecture slides 2013Jouni Smed
 
Pragmatic software testing education - SIGCSE 2019
Pragmatic software testing education - SIGCSE 2019Pragmatic software testing education - SIGCSE 2019
Pragmatic software testing education - SIGCSE 2019Maurício Aniche
 
How to use Ai for UX UI Design | ChatGPT
How to use Ai for UX UI Design | ChatGPTHow to use Ai for UX UI Design | ChatGPT
How to use Ai for UX UI Design | ChatGPTThink 360 Studio
 
Human-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesHuman-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesMohammad Hassany
 
How to Create Interactive Videos - Rohan Dasgupta.pdf
How to Create Interactive Videos - Rohan Dasgupta.pdfHow to Create Interactive Videos - Rohan Dasgupta.pdf
How to Create Interactive Videos - Rohan Dasgupta.pdfRohan Dasgupta
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processinggokulprasath06
 
How to deliver the right software (Specification by example)
How to deliver the right software (Specification by example)How to deliver the right software (Specification by example)
How to deliver the right software (Specification by example)Asier Barrenetxea
 

Ähnlich wie Continuous Experience-aware Language Model (20)

Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
 
Chi2017
Chi2017Chi2017
Chi2017
 
UX Research in an Agile World
UX Research in an Agile WorldUX Research in an Agile World
UX Research in an Agile World
 
2013-04-02-the_fog_of_spa
2013-04-02-the_fog_of_spa2013-04-02-the_fog_of_spa
2013-04-02-the_fog_of_spa
 
Supporting Learners in Adaptive Learning Environments through the enhancement...
Supporting Learners in Adaptive Learning Environments through the enhancement...Supporting Learners in Adaptive Learning Environments through the enhancement...
Supporting Learners in Adaptive Learning Environments through the enhancement...
 
Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?Go/Ruby/Java: What's next?
Go/Ruby/Java: What's next?
 
CMPE255: Short Story Assignment FinGPT
CMPE255: Short Story Assignment FinGPTCMPE255: Short Story Assignment FinGPT
CMPE255: Short Story Assignment FinGPT
 
Your learning journey to scrum mastery
Your learning journey to scrum masteryYour learning journey to scrum mastery
Your learning journey to scrum mastery
 
Extreme programming - Kent Beck
Extreme programming - Kent BeckExtreme programming - Kent Beck
Extreme programming - Kent Beck
 
3 Day Structure
3 Day Structure3 Day Structure
3 Day Structure
 
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
UXPA 2023: Experience Maps - A designer's framework for working in Agile team...
 
Online learning platforms
Online learning platformsOnline learning platforms
Online learning platforms
 
Introducing Usability Concerns Early in the DSL Development Cycle: FlowSL Ex...
Introducing Usability Concerns Early in the DSL Development Cycle:  FlowSL Ex...Introducing Usability Concerns Early in the DSL Development Cycle:  FlowSL Ex...
Introducing Usability Concerns Early in the DSL Development Cycle: FlowSL Ex...
 
Designing Object Oriented Software - lecture slides 2013
Designing Object Oriented Software - lecture slides 2013Designing Object Oriented Software - lecture slides 2013
Designing Object Oriented Software - lecture slides 2013
 
Pragmatic software testing education - SIGCSE 2019
Pragmatic software testing education - SIGCSE 2019Pragmatic software testing education - SIGCSE 2019
Pragmatic software testing education - SIGCSE 2019
 
How to use Ai for UX UI Design | ChatGPT
How to use Ai for UX UI Design | ChatGPTHow to use Ai for UX UI Design | ChatGPT
How to use Ai for UX UI Design | ChatGPT
 
Human-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming ClassesHuman-AI Co-Creation of Worked Examples for Programming Classes
Human-AI Co-Creation of Worked Examples for Programming Classes
 
How to Create Interactive Videos - Rohan Dasgupta.pdf
How to Create Interactive Videos - Rohan Dasgupta.pdfHow to Create Interactive Videos - Rohan Dasgupta.pdf
How to Create Interactive Videos - Rohan Dasgupta.pdf
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
How to deliver the right software (Specification by example)
How to deliver the right software (Specification by example)How to deliver the right software (Specification by example)
How to deliver the right software (Specification by example)
 

Mehr von Subhabrata Mukherjee

XtremeDistil: Multi-stage Distillation for Massive Multilingual Models
XtremeDistil: Multi-stage Distillation for Massive Multilingual ModelsXtremeDistil: Multi-stage Distillation for Massive Multilingual Models
XtremeDistil: Multi-stage Distillation for Massive Multilingual ModelsSubhabrata Mukherjee
 
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Subhabrata Mukherjee
 
OpenTag: Open Attribute Value Extraction From Product Profiles
OpenTag: Open Attribute Value Extraction From Product ProfilesOpenTag: Open Attribute Value Extraction From Product Profiles
OpenTag: Open Attribute Value Extraction From Product ProfilesSubhabrata Mukherjee
 
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Subhabrata Mukherjee
 
Experience aware Item Recommendation in Evolving Review Communities
Experience aware Item Recommendation in Evolving Review CommunitiesExperience aware Item Recommendation in Evolving Review Communities
Experience aware Item Recommendation in Evolving Review CommunitiesSubhabrata Mukherjee
 
Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...
Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...
Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...Subhabrata Mukherjee
 
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...Subhabrata Mukherjee
 
Joint Author Sentiment Topic Model
Joint Author Sentiment Topic ModelJoint Author Sentiment Topic Model
Joint Author Sentiment Topic ModelSubhabrata Mukherjee
 
TwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
TwiSent: A Multi-Stage System for Analyzing Sentiment in TwitterTwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
TwiSent: A Multi-Stage System for Analyzing Sentiment in TwitterSubhabrata Mukherjee
 
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...Subhabrata Mukherjee
 
Leveraging Sentiment to Compute Word Similarity
Leveraging Sentiment to Compute Word SimilarityLeveraging Sentiment to Compute Word Similarity
Leveraging Sentiment to Compute Word SimilaritySubhabrata Mukherjee
 
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...Subhabrata Mukherjee
 
Feature specific analysis of reviews
Feature specific analysis of reviewsFeature specific analysis of reviews
Feature specific analysis of reviewsSubhabrata Mukherjee
 
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...Subhabrata Mukherjee
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSubhabrata Mukherjee
 

Mehr von Subhabrata Mukherjee (16)

XtremeDistil: Multi-stage Distillation for Massive Multilingual Models
XtremeDistil: Multi-stage Distillation for Massive Multilingual ModelsXtremeDistil: Multi-stage Distillation for Massive Multilingual Models
XtremeDistil: Multi-stage Distillation for Massive Multilingual Models
 
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
 
Fact Checking from Text
Fact Checking from TextFact Checking from Text
Fact Checking from Text
 
OpenTag: Open Attribute Value Extraction From Product Profiles
OpenTag: Open Attribute Value Extraction From Product ProfilesOpenTag: Open Attribute Value Extraction From Product Profiles
OpenTag: Open Attribute Value Extraction From Product Profiles
 
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
 
Experience aware Item Recommendation in Evolving Review Communities
Experience aware Item Recommendation in Evolving Review CommunitiesExperience aware Item Recommendation in Evolving Review Communities
Experience aware Item Recommendation in Evolving Review Communities
 
Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...
Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...
Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construc...
 
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
 
Joint Author Sentiment Topic Model
Joint Author Sentiment Topic ModelJoint Author Sentiment Topic Model
Joint Author Sentiment Topic Model
 
TwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
TwiSent: A Multi-Stage System for Analyzing Sentiment in TwitterTwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
TwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
 
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
 
Leveraging Sentiment to Compute Word Similarity
Leveraging Sentiment to Compute Word SimilarityLeveraging Sentiment to Compute Word Similarity
Leveraging Sentiment to Compute Word Similarity
 
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
 
Feature specific analysis of reviews
Feature specific analysis of reviewsFeature specific analysis of reviews
Feature specific analysis of reviews
 
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
 

Kürzlich hochgeladen

Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
detection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxdetection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxAleenaJamil4
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 

Kürzlich hochgeladen (20)

Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
detection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxdetection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptx
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 

Continuous Experience-aware Language Model

  • 1. Continuous Experience-aware Language Model Subhabrata Mukherjee† , Stephan Günnemann‡ , Gerhard Weikum† † Max Planck Institute for Informatik, ‡ Technical University of Munich 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, San Francisco, 2016
  • 2. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 2 Outline ● Motivation ● Prior Work ● Current Work – Modeling continuous experience evolution – Modeling continuous language model evolution – Inference ● Experiments ● Conclusions
  • 3. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 3 Motivation ● Online communities are dynamic – Users join and leave – Adopt vocabulary, adapt to evolving trends – Mature over time ● How to capture evolving user maturity?
  • 4. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 4 Example 1 ● Consider following camera reviews by John: – My first DSLR. Excellent camera, takes great pictures with high definition, without a doubt it makes honor to its name. [Aug, 1997] – The EF 75-300 mm lens is only good to be used outside. The 2.2X HD lens can only be used for specific items; filters are useless if ISO, AP,... . The short 18-55mm lens is cheap and should have a hood to keep light off lens. [Oct, 2012]
  • 5. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 5 Example 2 ● Beer: Moosehead Lager – The smell of grains a malts on the nose with the slight hop aroma in there. The taste of the beer is crisp .... – The beer tastes absolutely terrible ...
  • 6. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 6 Example ● Consider following camera reviews by John: – My first DSLR. Excellent camera, take great pictures with high definition, without a doubt it makes honor to its name. [Aug, 1997] – The EF 75-300 mm lens is only good to be used outside. The 2.2X HD lens can only be used for specific items; filters are useless if ISO, AP,... . The short 18-55mm lens is cheap and should have a hood to keep light off lens. [Oct, 2012] RQ1: How to quantify this change in user maturity (referred to as experience in our work) ? RQ2: How to model this evolution or progression in maturity?
  • 7. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 7 Use-cases ● Recommend item to a user based on her maturity to consume it – Maturity / Experience evolves over time ● Identify experienced users – E.g.: Medical professionals in Health communities ● Crowd-sourcing applications / aggregation / community Q&A
  • 8. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 8 Outline ● Motivation ● Prior Work ● Current Work – Modeling continuous experience evolution – Modeling continuous language model evolution – Inference ● Experiments ● Conclusions
  • 9. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 9 Prior Work: Experience Evolution (1) Users at similar levels of experience have similar rating behavior and facet preferences [J. McAuley, J. Leskovec, WWW '13] (2) Additionally, users at similar levels of experience have similar writing style [S. Mukherjee, H. Lamba, G, Weikum, ICDM '15]
  • 10. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 10 Abrupt Transition Prior Works: Discrete Experience Assumption: At each timepoint (of writing a review) a user remains at the same level of experience or moves to the next level
  • 11. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 11 Current Work: Continuous Experience
  • 12. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 12 Effect of Discrete Evolution on Language Model Language Model changes at the same (discrete) level of experience. Exchangeability principle violated in discrete model.
  • 13. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 13 Outline ● Motivation ● Prior Work ● Current Work – Modeling continuous experience evolution – Modeling continuous language model evolution – Inference ● Experiments ● Conclusions
  • 14. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 14 Continuous Experience Evolution: Assumptions ● Continuous-time process, always positive ● Markovian assumption: Experience at current time depends only on the latest observed experience ● Drift: Overall trend to increase over time ● Volatility: Progression may not be smooth with occasional volatility. E.g.: series of expert reviews followed by a sloppy one
  • 15. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 15 ● Continuous-time process, always positive ● Markovian assumption: Experience at current time depends only on the latest observed experience ● Drift: Overall trend to increase over time ● Volatility: Progression may not be smooth with occasional volatility. E.g.: series of expert reviews followed by a sloppy one We show these properties to be satisfied by the continuous-time stochastic process: Geometric Brownian Motion Continuous Experience Evolution: Assumptions
  • 16. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 16 Geometric Brownian Motion ● Stochastic process to model population growth, financial processes like stock price behavior with random noise ● Natural continuous-state alternative to discrete-state space Hidden Markov Model ● Continuous time stochastic process, where log(Et ) follows Brownian Motion with volatility and drift
  • 17. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 17 Geometric Brownian Motion ● Stochastic Differential Equation: dEt = μEt dt + σEt dWt ● Analytic solution: Et = E0 exp( ( μ – σ2 / 2 ) t ) + σ Wt ) deterministic trend unpredictable volatility Et → Experience at time 't' Starting experience trend volatility Wiener Process / Standard Brownian Motion t → time
  • 18. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 18 Geometric Brownian Motion ● Stochastic Differential Equation: dEt = μEt dt + σEt dWt ● Analytic solution: Et = E0 exp( ( μ – σ2 / 2 ) t ) + σ Wt ) ● Log(Et ) is log-normally distributed deterministic trend Et → Experience at time 't' Starting experience trend volatilityt → time
  • 19. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 19 Outline ● Motivation ● Prior Work ● Current Work – Modeling continuous experience evolution – Modeling continuous language model evolution – Inference ● Experiments ● Conclusions
  • 20. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 20 Continuous Language Model (LM) Evolution: Assumptions ● Users' LM also evolves with experience evolution ● Smoothly evolve over time preserving Markov property of experience evolution ● Variance changes with experience change between timepoints – If user's experience does not change between successive timepoints, LM remains almost same
  • 21. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 21 Continuous Language Model βt,z,w = Probability of observing word 'w' for facet 'z' at time 't' βt,z,w ~ Normal ( βt-1,z,w , σ ● |et - et-1 |) LM at time 't' LM at time 't-1' Experience change The smell of grains a malts on the nose with the slight hop aroma.... The taste of the beer is crisp
  • 22. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 22 Continuous Language Model βt,z,w = Probability of observing word 'w' for facet 'z' at time 't' βt,z,w ~ Normal ( βt-1,z,w , σ ● |et - et-1 |) LM at time 't' LM at time 't-1' Experience change Following principles of standard dynamic systems with Gaussian noise The smell of grains a malts on the nose with the slight hop aroma.... The taste of the beer is crisp
  • 23. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 23 Continuous Language Model (LM): Challenges ● Experience and LM are continuous distributions, but words in documents have to be generated from discrete distribution ● Temporal granularity: LM does not evolve at the same resolution as experience does
  • 24. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 24 Outline ● Motivation ● Prior Work ● Current Work – Modeling continuous experience evolution – Modeling continuous language model evolution – Inference ● Experiments ● Conclusions
  • 25. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 25 Inference Facets (Latent) Experience (Latent) Language Model Words (Observed) at (Observed) Timepoints E.g.: The smell of grains a malts on the nose with the slight hop aroma.... The taste of the beer is crisp
  • 26. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 26 Inference
  • 27. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 27 Inference Topic (or, Facet) Model (Blei et al., JMLR '03) + Users ( Author-topic model, Rosen-Zvi et al., UAI '04) + Continuous Time (Continuous time dynamic topic model, Wang et al., UAI '08) + Continuous Experience
  • 28. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 28 Inference: Estimate Facets Facets (Latent) Estimate facets by Gibbs sampling as in standard Latent Dirichlet Allocation keeping LM and experience unchanged
  • 29. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 29 Inference: Estimate LM Language Model (Latent) Words (Observed) at (Observed) Timepoints Estimate the following state transition model: βt ~ Normal ( βt-1 , σ ● Δet ) wn ~ Multinomial ( f ( βt ) ) with Kalman Filter and previously inferred latent facets
  • 30. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 30 Inference: Estimate LM ➢ Models sequential LM evolution ➢ Continuous-state analog to discrete HMM Estimate the following state transition model: βt ~ Normal ( βt-1 , σ ● Δet ) wn ~ Multinomial ( f ( βt ) ) with Kalman Filter and previously inferred latent facets Language Model (Latent) Words (Observed) at (Observed) Timepoints
  • 31. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 31 Inference: Estimate Experience Experience (Latent) et-1 et et+1 βt-1 βt βt+1 Change in experience at time 't' affects language models at time 't' and 't+1' Exploit this to derive proposal distribution for Metropolis Hastings (MCMC sampling) βt ~ Normal ( βt-1 , σ |et - et-1 |) et → experience at time 't' βt → LM at time 't'
  • 32. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 32 Outline ● Motivation ● Prior Work ● Current Work – Modeling continuous experience evolution – Modeling continuous language model evolution – Inference ● Experiments ● Conclusions
  • 33. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 33 Dataset Statistics
  • 34. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 34 Log-Likelihood, Smoothness, Convergence Loglikelihood, Smoothness and Convergence
  • 35. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 35 Mean Squared Error: Item Rating Prediction RQ3: Can we recommend items better, if we consider user experience ?
  • 36. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 36 Mean Squared Error: Item Rating Prediction RQ3: Can we recommend items better if we consider user experience ?
  • 37. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 37 Experience Progression: Insights
  • 38. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 38 Experience Progression: Insights Experienced users mature faster than amateurs, exhibit a higher variance Progression depends more on time spent in community than on activity
  • 39. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 39 Overall usage of each word increased over time. However evolution path is different. Trace Evolving Norms with Time and Experience
  • 40. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 40 Interpretability: Top Words by Experienced Users Interpretability: Top Words by Experienced Users
  • 41. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 41 Interpretability: Top Words by Experienced Users Interpretability: Top Words by Experienced Users
  • 42. Subhabrata Mukherjee Continuous Experience-aware Language Model SIGKDD 2016 42 Conclusions ● Users' experience evolve continuously in nature, along with their language usage ● Experienced users have distinctive writing style, maturing over time ● Recommendation models can be improved by explicitly considering user experience ● Finally, we proposed a Brownian Motion based stochastic model to capture the above phenomena