orkut-2007

If you publish results based on this graph, please cite the references suggested in the dataset page.

Orkut is a social networking and discussion site operated by Google. This snapshot is part of the IMC 2007 Data Sets. The IMC 2007 Data Sets site provides a link list that can be easily converted into any WebGraph format. Note that after conversion the resulting graph must be symmetrized, as some of the links in the list lack their reverse.
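The symmetrization step mentioned above can be illustrated with a minimal Python sketch. It assumes the link list has already been parsed into pairs of integer node identifiers; the function names `symmetrize` and `count_missing_reverse` are hypothetical and are not part of WebGraph. For a graph of this size an in-memory set is unlikely to be practical, so this only shows the operation itself, not a recommended pipeline.

```python
from typing import Iterable, List, Tuple

Arc = Tuple[int, int]


def count_missing_reverse(edges: Iterable[Arc]) -> int:
    """Count arcs (u, v) whose reverse (v, u) is absent from the list."""
    arcs = set(edges)
    return sum(1 for (u, v) in arcs if (v, u) not in arcs)


def symmetrize(edges: Iterable[Arc]) -> List[Arc]:
    """Return the sorted, duplicate-free arc list with every reverse arc added."""
    arcs = set()
    for u, v in edges:
        arcs.add((u, v))
        arcs.add((v, u))  # add the opposite direction if it was missing
    return sorted(arcs)


if __name__ == "__main__":
    # Toy link list: the arc (1, 2) lacks its reverse (2, 1).
    links = [(0, 1), (1, 0), (1, 2)]
    print(count_missing_reverse(links))
    print(symmetrize(links))
```

After symmetrization every arc has its reverse, so indegree and outdegree coincide for every node, which is consistent with the identical maximum indegree and outdegree reported in the basic data.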

Basic data
nodes                   3 072 626
arcs                    234 370 166
bits/link               11.046 (65.98%)
bits/link (transpose)   11.046 (65.98%)
average degree          76.277
maximum indegree        33 313
maximum outdegree       33 313
dangling nodes          0.01%
buckets                 99.99%
largest component       3 072 441 (99.99%)
spid                    0.13 (± 0.000)
average distance        4.21 (± 0.001)
reachable pairs         100.00% (± 0.295)
median distance         4 (64.22%)
harmonic diameter       4.06 (± 0.011)
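As a quick consistency check on the figures above, the average degree should equal the number of arcs divided by the number of nodes. The sketch below also derives a rough size estimate from the bits/link figure, under the assumption that bits/link is the size of the compressed graph in bits divided by the number of arcs; the variable names are illustrative only.

```python
# Values copied from the basic data table.
nodes = 3_072_626
arcs = 234_370_166
bits_per_link = 11.046

# Average degree is arcs per node.
average_degree = arcs / nodes

# Rough compressed size, assuming bits/link = total graph bits / arcs.
compressed_bytes = arcs * bits_per_link / 8

print(f"average degree: {average_degree:.3f}")
print(f"approximate compressed size: {compressed_bytes / 1e6:.1f} MB")
```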
Random access (recommended)
Filename                      Size
orkut-2007.properties         4.0K
orkut-2007-t.properties       4.0K
orkut-2007.stats              4.0K
orkut-2007.indegree           68K
orkut-2007.outdegree          68K

Sequential access (high compression)
Filename                      Size
orkut-2007-hc.properties      4.0K
orkut-2007-hc-t.properties    4.0K

Natural order (random access)
Filename                      Size
orkut-2007-nat.properties     4.0K

Indegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)
Distance probability mass function
Connected-components size distribution
Large connected components
Distribution of the logarithm of the successor gaps
Distribution of successor gaps