Pina-Garcia, CA and Gu, Dongbing (2013) Collecting Random Samples from Facebook: An Efficient Heuristic for Sampling Large and Undirected Graphs via a Metropolis-Hastings Random Walk. In: 2013 IEEE International Conference on Systems, Man and Cybernetics (SMC 2013), 2013-10-13 - 2013-10-16.
Pina-Garcia, CA and Gu, Dongbing (2013) Collecting Random Samples from Facebook: An Efficient Heuristic for Sampling Large and Undirected Graphs via a Metropolis-Hastings Random Walk. In: 2013 IEEE International Conference on Systems, Man and Cybernetics (SMC 2013), 2013-10-13 - 2013-10-16.
Pina-Garcia, CA and Gu, Dongbing (2013) Collecting Random Samples from Facebook: An Efficient Heuristic for Sampling Large and Undirected Graphs via a Metropolis-Hastings Random Walk. In: 2013 IEEE International Conference on Systems, Man and Cybernetics (SMC 2013), 2013-10-13 - 2013-10-16.
Abstract
The lack of a sampling frame (i.e., a complete list of users) for most Online Social Networks (OSNs) makes sampling methods especially difficult. Thus, reliable and efficient sampling methods are essential for practical estimation of OSN properties. Recent work in this area has thus focused on sampling methods that allow precise inference from a relatively large-scale social networks such as Facebook. We propose a sampling method on OSNs, based on a Metropolis-Hastings Random Walk (MHRW) algorithm. In this regard, we have developed a social explorer in order to collect random samples from Facebook. In addition, we address the question whether different probability distributions may be able to alter the behavior of the MHRW and enhance the effectiveness of yielding a representative sample. Thus, in this paper, we seek to understand whether the MHRW algorithm can be exploited by switching the random generator to provide better results. We evaluated the performance of our MHRW algorithm providing a descriptive statistics of the collected data. Moreover, we sketch the collecting procedure carried out on Facebook in real-time. Finally, we provide a formal convergence analysis to evaluate whether the sample of draws has attained an equilibrium state to get a rough estimate of the sample quality. © 2013 IEEE.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | Published proceedings: Proceedings - 2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013 |
Uncontrolled Keywords: | RandomWalks; Social Networks; Markov Chain Monte Carlo Methods; Facebook; Probability Distributions |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 08 Jan 2015 13:52 |
Last Modified: | 05 Dec 2024 21:44 |
URI: | http://repository.essex.ac.uk/id/eprint/12195 |