10. What are Trending Topics?
• Twitter: a global communication network.
• Tweet: a short, public message.
11. What are Trending Topics?
• Twitter: a global communication network.
• Tweet: a short, public message.
• Topic: a phrase in a tweet.
12. What are Trending Topics?
• Twitter: a global communication network.
• Tweet: a short, public message.
• Topic: a phrase in a tweet.
• Trending topic (a “trend”): a topic that
becomes popular.
13. A Parametric Model
• Expect certain type of pattern (e.g.
constant + jumps).
activity
time
14. A Parametric Model
• Expect certain type of pattern (e.g.
constant + jumps).
• Fit parameters to data (e.g. how much of
a jump).
activity
time
15. A Parametric Model
• Expect certain type of pattern (e.g.
constant + jumps).
• Fit parameters to data (e.g. how much of
a jump).
activity
p = 0.1
time
16. A Parametric Model!
• Expect certain type of pattern (e.g.
constant + jumps).
• Fit parameters to data (e.g. how much of
a jump).
activity
p = 0.6
time
17. A Parametric Model!
• Expect certain type of pattern (e.g.
constant + jumps).
• Fit parameters to data (e.g. how much of
a jump).
activity
p = 4.1
time
18. A Parametric Model!
• Expect certain type of pattern (e.g.
constant + jumps).
• Fit parameters to data (e.g. how much of
a jump).
• Decide if jump is big enough.
trend detected!
activity
p = 4.1
time
25. A Data-Driven Approach!
• All of the information is in the data.
• Hypothesis
– Tweets are written by people.
26. A Data-Driven Approach
• All of the information is in the data.
• Hypothesis
– Tweets are written by people.
– People are simple.
27. A Data-Driven Approach!
• All of the information is in the data.
• Hypothesis
– Tweets are written by people.
– People are simple.
• In how they spread information.
28. A Data-Driven Approach!
• All of the information is in the data.
• Hypothesis
– Tweets are written by people.
– People are simple.
• In how they spread information.
• In how they connect to one another.
29. A Data-Driven Approach!
• All of the information is in the data.
• Hypothesis
– Tweets are written by people.
– People are simple.
• In how they spread information.
• In how they connect to one another.
– Small number of distinct “ways” in which a
topic can become trending.
46. Properties
• Simple (just compute distances)
• Scalable (can compute distances in
parallel)
• Non-parametric – model “parameters”
scale with the data