enwiki-2020

If you publish results based on this graph, please cite the references suggested on the dataset page.

This graph represents a snapshot of the English part of Wikipedia as of mid-2020. The identifiers are the titles of the pages. Redirects have been carefully taken into account when computing the links, but redirect pages are not part of the final graph. The graph does not contain namespaced pages such as Template:something.

Basic data
nodes                    6 047 510
arcs                     142 691 609
bits/link                13.128 (67.63%)
bits/link (transpose)    11.515 (59.32%)
average degree           23.595
maximum indegree         236 348
maximum outdegree        10 694
dangling nodes           0.51%
buckets                  0.00%
largest component        5 253 206 (86.87%)
average distance         4.96 (± 0.001)
reachable pairs          87.43% (± 0.325)
median distance          5 (67.63%)
harmonic diameter        5.48 (± 0.019)
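
Some of the figures above can be cross-checked against each other. The following Python sketch recomputes the average degree and the largest-component percentage from the node and arc counts, and estimates how much space the successor lists would take at the stated bits/link. The function names are illustrative, and the size estimate assumes that the compressed size is simply arcs × bits/link, which ignores any per-file overhead.

```python
# Figures copied from the "Basic data" table.
NODES = 6_047_510
ARCS = 142_691_609
LARGEST_COMPONENT = 5_253_206
BITS_PER_LINK = 13.128
BITS_PER_LINK_TRANSPOSE = 11.515


def average_degree(nodes: int, arcs: int) -> float:
    """Number of arcs per node."""
    return arcs / nodes


def fraction(part: int, whole: int) -> float:
    """Share of `whole` covered by `part`, as a value in [0, 1]."""
    return part / whole


def estimated_size_mib(arcs: int, bits_per_link: float) -> float:
    """Rough size in MiB, assuming size = arcs * bits/link (no overhead)."""
    return arcs * bits_per_link / 8 / 2**20


if __name__ == "__main__":
    print(f"average degree:    {average_degree(NODES, ARCS):.3f}")
    print(f"largest component: {100 * fraction(LARGEST_COMPONENT, NODES):.2f}%")
    print(f"graph estimate:    {estimated_size_mib(ARCS, BITS_PER_LINK):.1f} MiB")
    print(f"transpose estimate: {estimated_size_mib(ARCS, BITS_PER_LINK_TRANSPOSE):.1f} MiB")
```

The two recomputed ratios agree with the table (23.595 and 86.87%). The size estimates come out at roughly 223 MiB and 196 MiB, the same order as the .graph files listed on this page. Which of those files the bits/link figures actually describe is not stated here, so treat that comparison as a plausibility check only.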

Random access (recommended)
Filename                       Size
enwiki-2020.graph              230M
enwiki-2020.properties         4.0K
enwiki-2020-t.graph            198M
enwiki-2020-t.properties       4.0K
enwiki-2020.map                65M
enwiki-2020.smap               65M
enwiki-2020.md5sums            4.0K
enwiki-2020.lmap               165M
enwiki-2020.fcl                146M
enwiki-2020.ids.gz             46M
enwiki-2020.stats              4.0K
enwiki-2020.indegree           464K
enwiki-2020.outdegree          24K
enwiki-2020.scc                24M
enwiki-2020.sccsizes           3.0M
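
Since the list includes an .md5sums file, downloads can be checked after transfer. The sketch below assumes that enwiki-2020.md5sums uses the usual `md5sum` text layout (a hexadecimal digest, whitespace, then a file name on each line). That layout is an assumption, not something stated on this page, and the helper names are illustrative.

```python
from __future__ import annotations

import hashlib
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so that large .graph files are not read at once."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_md5sums(text: str) -> dict[str, str]:
    """Parse lines of the assumed form '<hex digest>  <filename>'."""
    expected = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        digest, name = line.split(maxsplit=1)
        # md5sum marks binary mode with a leading '*' on the file name.
        expected[name.lstrip("*")] = digest.lower()
    return expected


def verify(directory: Path, md5sums_name: str = "enwiki-2020.md5sums") -> dict[str, bool]:
    """Check every listed file that is present in `directory`."""
    expected = parse_md5sums((directory / md5sums_name).read_text())
    results = {}
    for name, digest in expected.items():
        target = directory / name
        if target.exists():  # skip files that were not downloaded
            results[name] = md5_of_file(target) == digest
    return results
```

Calling `verify(Path("."))` in the download directory returns a mapping from file name to `True` or `False`. Files named in the checksum list but not downloaded are skipped rather than reported as failures.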

Sequential access (high compression)
Filename                       Size
enwiki-2020-hc.graph           224M
enwiki-2020-hc.properties      4.0K
enwiki-2020-hc-t.graph         196M
enwiki-2020-hc-t.properties    4.0K

Natural order (random access)
Filename                       Size
enwiki-2020-nat.graph          335M
enwiki-2020-nat.properties     4.0K
enwiki-2020-nat.fcl            123M
enwiki-2020-nat.ids.gz         55M
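
Because the node identifiers are page titles, the .ids.gz files are the natural way to translate between node numbers and titles. The following sketch assumes that an .ids.gz file is gzip-compressed UTF-8 text with one title per line, and that line k (counting from 0) holds the title of node k in the graph with the same basename. Neither point is stated on this page, so check both against a few known pages before relying on them. The function names are illustrative.

```python
import gzip
from itertools import islice


def read_titles(path, limit=None):
    """Return page titles in file order; `limit` caps how many lines are read."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in islice(handle, limit)]


def title_index(titles):
    """Map each title to its (assumed) node number, i.e. its line position."""
    return {title: node for node, title in enumerate(titles)}


if __name__ == "__main__":
    # Peek at the first few identifiers without loading all six million.
    for node, title in enumerate(read_titles("enwiki-2020-nat.ids.gz", limit=5)):
        print(node, title)
```

Reading the whole file into a list keeps lookups by node number constant-time. If memory is a concern, iterate over the file instead of materializing it.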

JGraphT serialized succinct representation
Filename                       Size
enwiki-2020.suxdir             714M
enwiki-2020.suxmap             115M

Indegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)
Distance probability mass function
Connected-components size distribution
Large connected components
Distribution of the logarithm of the successor gaps
Distribution of successor gaps