enwiki-2016

If you publish results based on this graph, please quote the references suggested in the dataset page.

This graph represent a snapshot of the English part of Wikipedia as of mid 2016. The identifiers are the titles of the pages. Redirects have been carefully taken into account when computing the links, but redirect pages are not part of the final graph. The graph does not contain namespaced pages such as Template:something.

Basic data
nodes5 096 287
arcs113 095 771
bits/link13.185 (68.49%)
bits/link (transpose)11.56 (60.05%)
average degree22.192
maximum indegree311 872
maximum outdegree7 337
dangling nodes0.43%
buckets0.00%
largest component4 411 242 (86.56%)
average distance4.96 (± 0.000)
reachable pairs86.41% (± 0.055)
median distance5 (66.34%)
harmonic diameter5.55 (± 0.003)
Random access (recommended)
FilenameSize
enwiki-2016.graph184M
enwiki-2016.properties4.0K
enwiki-2016-t.graph158M
enwiki-2016-t.properties4.0K
enwiki-2016.map55M
enwiki-2016.smap55M
enwiki-2016.md5sums4.0K
enwiki-2016.lmap138M
enwiki-2016.fcl122M
enwiki-2016.ids.gz39M
enwiki-2016.stats4.0K
enwiki-2016.indegree612K
enwiki-2016.outdegree16K
enwiki-2016.scc20M
enwiki-2016.sccsizes2.6M
Sequential access (high compression)
FilenameSize
enwiki-2016-hc.graph178M
enwiki-2016-hc.properties4.0K
enwiki-2016-hc-t.graph156M
enwiki-2016-hc-t.properties4.0K
Natural order (random access)
FilenameSize
enwiki-2016-nat.graph263M
enwiki-2016-nat.properties4.0K
enwiki-2016-nat.fcl103M
enwiki-2016-nat.ids.gz46M
JGraphT serialized succinct representation
FilenameSize
enwiki-2016.suxdir565M
enwiki-2016.suxmap96M
Indegree-frequency plotIndegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plotOutdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)Outdegree-rank plot (cumulative)
Distance probability mass functiondistance probability mass function
Connected-components size distributionConnected-components size distribution
Large connected componentsLarge connected components
Distribution of the logarithm of successor gapsDistribution of the logarithm of the successor gaps
Distribution of successor gapsDistribution of successor gaps