enwiki-2021

If you publish results based on this graph, please cite the references suggested in the dataset page.

This graph represents a snapshot of the English part of Wikipedia as of mid-2021. The identifiers are the titles of the pages. Redirects have been carefully taken into account when computing the links, but redirect pages are not part of the final graph. The graph does not contain namespaced pages such as Template:something.

Basic data
nodes                    6 261 502
arcs                     150 124 927
bits/link                13.139 (67.60%)
bits/link (transpose)    11.577 (59.56%)
average degree           23.976
average distance         4.95 (± 0.001)
reachable pairs          87.48% (± 0.310)
median distance          5 (67.93%)
harmonic diameter        5.47 (± 0.018)
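
As a quick sanity check on the basic data, the average degree should equal the number of arcs divided by the number of nodes. The short sketch below recomputes it from the two counts listed above; the variable names are illustrative only.

```python
# Node and arc counts copied from the "Basic data" table.
nodes = 6_261_502
arcs = 150_124_927

# In a directed graph, the average outdegree (and the average indegree)
# is the number of arcs divided by the number of nodes.
average_degree = arcs / nodes

print(f"average degree: {average_degree:.3f}")
```

Rounded to three decimals, the result agrees with the 23.976 reported in the table.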
Random access (recommended)
Filename                    Size
enwiki-2021.graph           242M
enwiki-2021.properties      4.0K
enwiki-2021-t.graph         210M
enwiki-2021-t.properties    4.0K
enwiki-2021.map             67M
enwiki-2021.smap            67M
enwiki-2021.md5sums         4.0K
enwiki-2021.lmap            171M
enwiki-2021.fcl             152M
enwiki-2021.ids.gz          47M
enwiki-2021.stats           4.0K
enwiki-2021.indegree        456K
enwiki-2021.outdegree       28K
enwiki-2021.scc             24M
enwiki-2021.sccsizes        3.1M
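
Since several of these files are large, it may be worth checking downloads against enwiki-2021.md5sums. The sketch below assumes that the file follows the usual `md5sum` output layout (one line per file: a hexadecimal digest, whitespace, then the file name); that layout is an assumption, not something stated on this page, and the function names `md5_of` and `verify` are hypothetical.

```python
import hashlib
from pathlib import Path


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(md5sums_path, directory="."):
    """Map each file name listed in md5sums_path to True (digest matches)
    or False (digest differs or the file is missing from directory).

    Assumes md5sum-style lines: "<hex digest>  <file name>".
    """
    results = {}
    for line in Path(md5sums_path).read_text().splitlines():
        if not line.strip():
            continue
        expected, name = line.split(maxsplit=1)
        name = name.lstrip("*")  # md5sum marks binary mode with a leading '*'
        target = Path(directory) / name
        results[name] = target.exists() and md5_of(target) == expected.lower()
    return results
```

For example, `verify("enwiki-2021.md5sums")` run in the download directory would report which of the listed files are present and intact; files that were not downloaded show up as `False`.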
Sequential access (high compression)
Filename                       Size
enwiki-2021-hc.graph           236M
enwiki-2021-hc.properties      4.0K
enwiki-2021-hc-t.graph         208M
enwiki-2021-hc-t.properties    4.0K
Natural order (random access)
Filename                     Size
enwiki-2021-nat.graph        353M
enwiki-2021-nat.properties   4.0K
enwiki-2021-nat.fcl          128M
enwiki-2021-nat.ids.gz       57M
Indegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)
Distance probability mass function
Connected-components size distribution
Large connected components
Distribution of the logarithm of the successor gaps
Distribution of successor gaps
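
The degree-frequency plots are described as using Fibonacci binning. As a rough illustration of the idea, and not the code used to produce these plots, the sketch below groups degrees into half-open intervals whose endpoints are consecutive Fibonacci numbers and reports the mean frequency per degree value in each interval. The input format (a dictionary from degree to node count), the choice of bin midpoint as the x coordinate, and the function names are all assumptions made for this example.

```python
def fibonacci_bins(max_value):
    """Yield half-open intervals [lo, hi) with consecutive Fibonacci
    endpoints: [1, 2), [2, 3), [3, 5), [5, 8), ... up to max_value."""
    lo, hi = 1, 2
    while lo <= max_value:
        yield lo, hi
        lo, hi = hi, lo + hi


def fibonacci_binning(freq):
    """Bin a degree -> count mapping.

    Returns a list of (bin midpoint, mean count per degree value in the bin).
    Degree 0 is ignored, as it cannot appear on a log-log plot;
    bins that contain no nodes are skipped.
    """
    if not freq:
        return []
    points = []
    for lo, hi in fibonacci_bins(max(freq)):
        total = sum(freq.get(d, 0) for d in range(lo, hi))
        if total:
            points.append(((lo + hi - 1) / 2, total / (hi - lo)))
    return points
```

The bins widen roughly geometrically, which smooths the sparse, noisy tail of a heavy-tailed degree distribution while leaving the small degrees, where data is dense, almost untouched.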