07/06/2020

Generating Realistic Interest-Driven Information Cascades

Federico Cinus, Francesco Bonchi, Corrado Monti, André Panisson

Keywords: cascades, communities, embeddings, graphs, influences, level homophily, memes, network structure, networks, spaces, structure, topic

Abstract: We propose a model for the synthetic generation of information cascades in social media. In our model the information ``memes" propagating in the social network are characterized by a probability distribution in a topic space, accompanied by a textual description, i.e., a bag of keywords coherent with the topic distribution. Similarly, every person is described by a vector of interests defined over the same topic space. Information cascades are governed by the topic of the meme, its level of virality, the interests of each person, community pressure, and social influence. The main technical challenge we face towards our goal is the generation of realistic interest vectors, given a known network structure and a tunable level of homophily. We tackle this problem by means of a method based on non-negative matrix factorization, which is shown experimentally to outperform non-trivial baselines based on label propagation and random-walk-based graph embedding. As we showcase in our experiments, our model offers a small set of simple and easily interpretable ``knobs´´ which allow to study, \emph{in vitro}, how each set of assumptions affects the resulting propagations. Finally, we show how to generate synthetic cascades that have similar macro-statistics to the real-world cascades for a dataset containing both the network and the cascades.

 0
 0
 0
 0
This is an embedded video. Talk and the respective paper are published at ICWSM 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment
no comments yet
code of conduct: tbd Characters remaining: 140

Similar Papers