LiveJournal Data Set
Download: Dataset
Abstract: LiveJournal (LJ) is a virtual community where Internet users can keep a blog, journal or diary.
|
|
Number of Nodes: |
2238731 |
Number of Edges: |
14608137 |
Missing Values? |
no |
Source:
Reza Zafarani*, William D. Cole*, Huan Liu*
* School of Computing, Informatics and Decision Systems Engineering, Arizona State University. E-mail: reza@asu.edu, wcole@asu.edu, huan.liu@asu.edu
Data Set Information:
2 files are included:
1. nodes.csv
-- it's the file of all the users. This file works as a dictionary of all the users in this data set. It's useful for fast reference. It contains
all the node ids used in the dataset
2. edges.csv
-- this is the friendship network among the user. The user's friends are represented using edges. Here is an example.
1,2
This means user with id "1" is friend with user id "2".
Attribute Information:
This is the data set crawled on July, 2010 from LiveJournal ( http://www.livejournal.com ).
This contains the friendship network crawled. For easier understanding, all the contents are organized in CSV file format.
-. Basic statistics
Number of users : 88,784
Number of friendship pairs: 4,186,390
Relevant Papers:
Reza Zafarani, William D. Cole, and Huan Liu. "Sentiment Propagation in Social Networks: A Case Study in LiveJournal", Advances in Social Computing: Third International Conference on Social Computing, Behavioral Modeling, and Prediction, SBP 2010, Bethesda, MD, USA, March 30-31, 2010, Proceedings, pp. 413-420.
Citation Request:
Please refer to the Social Computing Data
Repository's citation policy
|