Lecture 1 - Manuel Gomez Rodriguez

Download Report

Transcript Lecture 1 - Manuel Gomez Rodriguez

Machine learning for
Dynamic Social Network Analysis
Manuel Gomez Rodriguez
Max Planck Institute for Software Systems
UNIVERSITY OF SYDNEY, JANUARY 2017
Interconnected World
SOCIAL
NETWORKS
TRANSPORTATION
NETWORKS
WORLD WIDE
WEB
PROTEIN
INTERACTIONS
INFORMATION
NETWORKS
INTERNET OF
THINGS
2
Many discrete events in continuous time
3
Qmee, 2013
Variety of processes behind these events
Events are (noisy) observations of a
variety of complex dynamic processes…
News spread
in Twitter
Product reviews
and sales in
Amazon
Video
becomes viral
in Youtube
FAST
A user gains
recognition in
Quora
Article
creation in
Wikipedia
SLOW
…in a wide range of temporal scales.
4
Example I: Idea adoption/viral marketing
S means D
Christine
D follows S
Bob
3.00pm
3.25pm
Beth
3.27pm
Joe
David
4.15pm
Friggeri et al., 2014
They can have an impact
in the off-line world
5
Example II: Information creation & curation
✗
Addition
Refutation
Question
Answer
Upvote
Example III: Learning trajectories
1st year computer science student
Introduction to programming
Discrete math
Project presentation
For/do-while
loops
Define
Set theory
functions
Powerpoint
Graph Theory
Class
vs. Keynote
inheritance
Export
Geometrypptx to pdf
t
If … else
How to write
switch
Logic
Private
functions
PP
templates
Class
destructor
Plot
library
7
DETAILED TRACES OF ACTIVITY
Detailed event traces
The availability of event traces
boosts a new generation of
data-driven models and
algorithms
#greece
retweet
s
8
Previously: discrete-time models & algorithms
Epoch 1
Epoch 2
Epoch 3
Epoch 4
Discrete-time models artificially introduce epochs:
1. How long is each epoch? Data is very heterogeneous.
2. How to aggregate events within an epoch?
3. What if no event within an epoch?
4. Time is treated as index or conditioning variable, not easy
to deal with time-related queries.
9
Outline of the Seminar
REPRESENTATION: TEMPORAL POINT PROCESSES
1. Intensity function
2. Basic building blocks
3. Superposition
4. Marks and SDEs with jumps
APPLICATIONS: MODELS
1. Information propagation
2. Opinion dynamics
3. Information reliability
4. Knowledge acquisition
APPLICATIONS: CONTROL
1. Influence maximization
2. Activity shaping
3. When-to-post
Slides/references: learning.mpi-sws.org/sydney-seminar
10
Representation:
Temporal Point Processes
1. Intensity function
2. Basic building blocks
3. Superposition
4. Marks and SDEs with jumps
11
Temporal point processes
Temporal point process:
A random process whose realization consists of
discrete events localized in time
Discrete events
time
History,
Dirac delta function
Formally:
12
Model time as a random variable
density
Prob. between [t, t+dt)
time
History,
Likelihood of a timeline:
Prob. not before t
13
Problem of parametrizing density
time
Likelihood is not concave in w:
It is difficult for model design and interpretability:
1. Densities need to integrate to 1
2. Combination of timelines results in density convolutions
14
Intensity function
density
Prob. between [t, t+dt)
time
History,
Prob. not before t
Intensity:
Probability between [t, t+dt) but not before t
Observation:
15
Advantages of parametrizing intensity
time
Likelihood is concave in w:
Suitable for model design and interpretable:
1. Intensities only need to be nonnegative
2. Combination of timelines results in intensity additions
16
Relation between f*, F*, S*, λ*
Central quantity
we will use!
17
Representation:
Temporal Point Processes
1. Intensity function
2. Basic building blocks
3. Superposition
4. Marks and SDEs with jumps
18
Poisson process
time
Intensity of a Poisson process
Observations:
1. Intensity independent of history
2. Uniformly random occurrence
3. Time interval follows exponential distribution
19
Inhomogeneous Poisson process
time
Intensity of an inhomogeneous Poisson process
Observations:
1. Intensity independent of history
20
Nonparametric inhomogeneous Poisson process
Positive combination of (Gaussian) RFB kernels:
21
Terminating (or survival) process
time
Intensity of a terminating (or survival) process
Observations:
1. Limited number of occurrences
22
Self-exciting (or Hawkes) process
time
History,
Triggering kernel
Intensity of self-exciting
(or Hawkes) process:
Observations:
1. Clustered (or bursty) occurrence of events
2. Intensity is stochastic and history dependent
23
How do we sample from a Hawkes process?
time
Thinning procedure (similar to rejection sampling):
1. Sample
from Poisson process with intensity
2. Keep the sample with probability
24
Summary
Building blocks to represent different dynamic processes:
Poisson processes:
Inhomogeneous Poisson processes:
Terminating point processes:
Self-exciting point processes:
25
Representation:
Temporal Point Processes
1. Intensity function
2. Basic building blocks
3. Superposition
4. Marks and SDEs with jumps
26
Superposition of processes
time
Sample each intensity + take minimum = Additive intensity
27
Mutually exciting process
time
Bob
History
Christine
time
History
Clustered occurrence affected by neighbors
28
Mutually exciting terminating process
time
Bob
Christine
time
History
Clustered occurrence affected by neighbors
29
Representation:
Temporal Point Processes
1. Intensity function
2. Basic building blocks
3. Superposition
4. Marks and SDEs with jumps
30
Marked temporal point processes
Marked temporal point process:
A random process whose realization consists of discrete
marked events localized in time
time
time
31
History,
Independent identically distributed marks
time
Distribution for the marks:
Observations:
1. Marks independent of the temporal dynamics
2. Independent identically distributed (I.I.D.)
32
Dependent marks: SDEs with jumps
time
History,
Marks given by stochastic differential equation with jumps:
Observations:
Drift
Event influence
1. Marks dependent of the temporal dynamics
2. Defined for all values of t
33
Dependent marks: distribution + SDE with jumps
time
History,
Distribution for the marks:
Drift
Event influence
Observations:
1. Marks dependent on the temporal dynamics
34
2. Distribution represents additional source of uncertainty
Mutually exciting + marks
Bob
time
Christine
Marks affected by neighbors
35
Drift
Neighbor influence
REPRESENTATION: TEMPORAL POINT PROCESSES
1. Intensity function
2. Basic building blocks
3. Superposition
4. Marks and SDEs with jumps
This
lecture
APPLICATIONS: MODELS
1. Information propagation
2. Opinion dynamics
3. Information reliability
4. Knowledge acquisition
Next
lecture
APPLICATIONS: CONTROL
1. Influence maximization
2. Activity shaping
3. When-to-post
Slides/references: learning.mpi-sws.org/sydney-seminar
36