diff options
| author | jeanpouget-abadie <jean.pougetabadie@gmail.com> | 2015-01-31 12:41:53 -0500 |
|---|---|---|
| committer | jeanpouget-abadie <jean.pougetabadie@gmail.com> | 2015-01-31 12:41:53 -0500 |
| commit | 57b3fb8c1fdd998c0245672599de2249d44cfdf2 (patch) | |
| tree | 2029aaca0c688c693a78d5bf98cabb3cb6b4343e | |
| parent | 6e8c086ad5e54b68eb5653f1c9d02682b7813aa1 (diff) | |
| download | cascades-57b3fb8c1fdd998c0245672599de2249d44cfdf2.tar.gz | |
additional changes to model section
| -rw-r--r-- | paper/sections/model.tex | 19 |
1 files changed, 11 insertions, 8 deletions
diff --git a/paper/sections/model.tex b/paper/sections/model.tex index a645fcf..ae49830 100644 --- a/paper/sections/model.tex +++ b/paper/sections/model.tex @@ -1,6 +1,6 @@ We consider a graph ${\cal G}= (V, E, \Theta)$, with $\Theta$ a set of parameters to the graph. Let $m\defeq |V|$. A \emph{cascade model} is a finite state space Markov process over $\{0, 1, \dots \}^V$. An \emph{influence cascade} is a realisation of the random process described by a cascade model. -In practice, we restrict ourselves to discrete-time homogeneous cascade models. $\Theta$ usually represents the edge weights of ${\cal G}$. Finally, the state of each node can often be reduced to two classes: able to infect neighboring nodes at the next round, or unable to infect its neighbors. +In practice, we restrict ourselves to discrete-time homogeneous cascade models. $\Theta$ usually represents the edge weights of ${\cal G}$. Each influence cascade begins at $t=0$ with a designated source `state' (e.g ``active''). All nodes in the source state at $t=0$ are called \emph{source nodes}. A standard theoretical assumption we can make is that each node at $t=0$ has a constant probability $p_{\text{init}}$ of being in the source state. This is more realistic and less restrictive than ``single source'' assumption made in \cite{Daneshmand:2014} and \cite{Abrahao:13}. Recovering the intrinsic graph parameters $\Theta$ from observed influence cascades is the central question of the present work and is important for the following two reasons: \begin{enumerate} @@ -13,23 +13,26 @@ We note that recovering the edges in $E$ from observed influence cascades is a well-identified problem known as the \emph{Graph Inference} problem. However, recovering the influence parameters is no less important and has been seemingly overlooked so far. The machinery developped in this work addresses both -problems. +problems, in the context of generalized linear cascade models, defined below. We show that this definition encompasses a large class of well-known cascade models. -\subsection{Generalized Linear Models} +\subsection{Generalized Linear Cascade Models} -Denoting by $X^t$ the state of the cascade at time step $t-1$, we interpret $X^t_j = 1$ for some node $j\in V$ as ``node $j$ can infect other nodes at time step $t+1$''. By the Markov property of cascades, $$\mathbb{P}(X^t | X^0, X^1, \dots, X^{t-1}) = \mathbb{P}(X^t | X^{t-1})$$ -We consider the following \emph{generalized linear model} (GLM) framework for cascades. +Denoting by $X^t$ the state of the cascade at time step $t-1$, we interpret $X^t_j = 1$ for some node $j\in V$ as ``node $j$ exhibits the source nodes' behavior at time step $t+1$''. By the Markov property of cascades, $$\mathbb{P}(X^t | X^0, X^1, \dots, X^{t-1}) = \mathbb{P}(X^t | X^{t-1})$$ +We draw inspiration from \emph{generalized linear models} (GLM) to define the following: \begin{definition} -Let .... A \emph{generalized linear cascade} is: +Let $f: \mathbb{R} \leftarrow \mathbb{R}$ be an inverse link function. Let ${\cal F}_{< t}$ be the filtration defined by $\{ X^0, X^1, \dots, X^{t-1} \}$. A \emph{generalized linear cascade} is defined as: +$$\mathbb{P}(X^t = 1|{\cal F}_{< t}) = \exp(blabla$$ \end{definition} \subsection{Examples} \label{subsec:examples} +In this section, we show that both the Independent Cascade Model and the Voter model are Generalized Linear Cascades. Discussion about the Linear Threshold model has been deferred to Section~\ref{sec:linear_threshold} + \subsubsection{Independent Cascade Model} -In the independent cascade model, nodes can be either susceptible, active/infected or inactive. All nodes start either as susceptible or active. At each time step $t$, for each edge $(i,j)$ where $j$ is susceptible and $i$ is active, $i$ attempts to infect $j$ with probability $p_{i,j}\in[0,1]$. If $i$ succeeds, $j$ will become active at time step $t+1$. Regardless of $i$'s success, node $i$ will be inactive at time $t+1$: nodes stay active for only one time step. The cascade continues until no active nodes remain. +In the independent cascade model, nodes can be either susceptible, active/infected or inactive. At $t=0$, all source nodes are ``active'' and all remaining nodes are ``susceptible''. At each time step $t$, for each edge $(i,j)$ where $j$ is susceptible and $i$ is active, $i$ attempts to infect $j$ with probability $p_{i,j}\in[0,1]$. If $i$ succeeds, $j$ will become active at time step $t+1$. Regardless of $i$'s success, node $i$ will be inactive at time $t+1$. In other words, nodes stay active for only one time step. The cascade process terminates When no active nodes remain. If we denote by $X^t$ the indicator variable of the set of active nodes at time step $t-1$, then if $j$ is susceptible at time step $t-1$, we have: \begin{displaymath} @@ -46,7 +49,7 @@ Defining $\Theta_{i,j} \defeq \log(1-p_{i,j})$, this can be rewritten as: \subsubsection{The Voter Model} - +In the Voter Model, nodes have two states \subsection{Maximum Likelihood Estimation} |
