enwiki-2017

If you publish results based on this graph, please cite the references suggested on the dataset page.

This graph represents a snapshot of the English part of Wikipedia as of mid-2017. The identifiers are the titles of the pages. Redirects have been carefully taken into account when computing the links, but redirect pages are not part of the final graph. The graph does not contain namespaced pages such as Template:something.

Basic data
nodes                  5 409 498
arcs                   122 008 994
bits/link              13.153 (68.10%)
bits/link (transpose)  11.506 (59.57%)
average degree         22.555
maximum indegree       286 564
maximum outdegree      7 781
dangling nodes         0.42%
buckets                0.00%
largest component      4 663 907 (86.22%)
average distance       4.96 (± 0.001)
reachable pairs        86.11% (± 0.277)
median distance        5 (66.22%)
harmonic diameter      5.57 (± 0.017)
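
Two of the figures above follow directly from the others, so they can be recomputed as a sanity check. The short Python sketch below does that arithmetic; the variable names are illustrative, and the assumption that "average degree" means arcs divided by nodes is ours, although it agrees with the published value.

```python
# Figures copied from the "Basic data" table.
nodes = 5_409_498
arcs = 122_008_994
largest_component = 4_663_907

# Assumed definition: average degree is the number of arcs per node.
average_degree = arcs / nodes

# Share of all nodes that lie in the largest component, as a percentage.
largest_share = 100 * largest_component / nodes

print(f"average degree: {average_degree:.3f}")      # average degree: 22.555
print(f"largest component: {largest_share:.2f}%")   # largest component: 86.22%
```

Both results match the table to the precision given there.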
Random access (recommended)
Filename                 Size
enwiki-2017.graph        197M
enwiki-2017.properties   4.0K
enwiki-2017-t.graph      170M
enwiki-2017-t.properties 4.0K
enwiki-2017.map          58M
enwiki-2017.smap         58M
enwiki-2017.md5sums      4.0K
enwiki-2017.lmap         146M
enwiki-2017.fcl          130M
enwiki-2017.ids.gz       41M
enwiki-2017.stats        4.0K
enwiki-2017.indegree     564K
enwiki-2017.outdegree    20K
enwiki-2017.scc          21M
enwiki-2017.sccsizes     2.8M
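
Since several of these files are large, it can be worth checking downloads against enwiki-2017.md5sums. The following Python sketch shows one way to do so. It assumes the file uses the common md5sum layout (a hex digest, whitespace, then a filename on each line), which this page does not confirm; the function names md5_of and verify_md5sums are hypothetical helpers, not part of any distributed tool.

```python
import hashlib
from pathlib import Path


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5sums(sums_path):
    """Check every file listed in an md5sum-style checksum file.

    Assumption: each non-empty line is '<hex digest> <filename>', with
    filenames relative to the directory holding the checksum file.
    Returns a dict mapping filename to True (match), False (mismatch),
    or None (file not present locally).
    """
    sums_path = Path(sums_path)
    results = {}
    for line in sums_path.read_text().splitlines():
        line = line.strip()
        if not line:
            continue
        expected, name = line.split(maxsplit=1)
        name = name.lstrip("*")  # md5sum prefixes binary-mode names with '*'
        target = sums_path.parent / name
        if target.exists():
            results[name] = md5_of(target) == expected.lower()
        else:
            results[name] = None
    return results
```

A call such as verify_md5sums("enwiki-2017.md5sums") would then report, per listed file, whether the local copy matches, differs, or is missing.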
Sequential access (high compression)
Filename                    Size
enwiki-2017-hc.graph        192M
enwiki-2017-hc.properties   4.0K
enwiki-2017-hc-t.graph      168M
enwiki-2017-hc-t.properties 4.0K
Natural order (random access)
Filename                   Size
enwiki-2017-nat.graph      285M
enwiki-2017-nat.properties 4.0K
enwiki-2017-nat.fcl        109M
enwiki-2017-nat.ids.gz     49M
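
Because node identifiers are page titles, the .ids.gz files are the natural place to look up the title of a given node. The Python sketch below illustrates such a lookup under an assumption this page does not state explicitly: that each .ids.gz file holds one UTF-8 title per line and that line k (counting from 0) names node k of the graph with the same basename. The helper name title_of is hypothetical.

```python
import gzip
from itertools import islice


def title_of(ids_path, node):
    """Return the title stored on line `node` (0-based) of a gzipped ids file.

    Assumption: one UTF-8 title per line, with line k naming node k.
    The file is streamed, so it is never held in memory as a whole.
    """
    if node < 0:
        raise IndexError("node numbers start at 0")
    with gzip.open(ids_path, "rt", encoding="utf-8") as handle:
        line = next(islice(handle, node, node + 1), None)
    if line is None:
        raise IndexError(f"node {node} is beyond the end of {ids_path}")
    return line.rstrip("\n")
```

For repeated lookups it would be more sensible to read the file once into a list; the streaming version is shown only because it keeps the example short.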
JGraphT serialized succinct representation
Filename           Size
enwiki-2017.suxdir 610M
enwiki-2017.suxmap 103M
Indegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)
Distance probability mass function
Connected-components size distribution
Large connected components
Distribution of the logarithm of successor gaps
Distribution of successor gaps