enwiki-2013

If you publish results based on this graph, please quote the references suggested in the dataset page.

This graph represent a snapshot of the English part of Wikipedia as of late February 2013. The identifiers are the titles of the pages. Redirects have been carefully taken into account when computing the links, but redirect pages are not part of the final graph. The graph does not contain namespaced pages such as Template:something.

This data set has been collected with support of the EU-FET grant NADINE (GA 288956).

Basic data
nodes4 206 785
arcs101 355 853
bits/link12.639 (67.03%)
bits/link (transpose)10.841 (57.49%)
average degree24.093
maximum indegree431 795
maximum outdegree8 104
dangling nodes0.19%
buckets0.00%
largest component3 744 228 (89.00%)
average distance4.87 (± 0.003)
reachable pairs89.88% (± 0.316)
median distance5 (72.26%)
harmonic diameter5.24 (± 0.016)
Random access (recommended)
FilenameSize
enwiki-2013.graph159M
enwiki-2013.properties4.0K
enwiki-2013-t.graph133M
enwiki-2013-t.properties4.0K
enwiki-2013.map45M
enwiki-2013.smap45M
enwiki-2013.md5sums4.0K
enwiki-2013.lmap112M
enwiki-2013.fcl100M
enwiki-2013.ids.gz31M
enwiki-2013.stats4.0K
enwiki-2013.indegree848K
enwiki-2013.outdegree20K
enwiki-2013.scc17M
enwiki-2013.sccsizes1.8M
Sequential access (high compression)
FilenameSize
enwiki-2013-hc.graph153M
enwiki-2013-hc.properties4.0K
enwiki-2013-hc-t.graph131M
enwiki-2013-hc-t.properties4.0K
Natural order (random access)
FilenameSize
enwiki-2013-nat.graph229M
enwiki-2013-nat.properties4.0K
enwiki-2013-nat.fcl84M
enwiki-2013-nat.ids.gz38M
JGraphT serialized succinct representation
FilenameSize
enwiki-2013.suxdir498M
enwiki-2013.suxmap79M
Indegree-frequency plotIndegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plotOutdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)Outdegree-rank plot (cumulative)
Distance probability mass functiondistance probability mass function
Connected-components size distributionConnected-components size distribution
Large connected componentsLarge connected components
Distribution of the logarithm of successor gapsDistribution of the logarithm of the successor gaps
Distribution of successor gapsDistribution of successor gaps