uk-2014

If you publish results based on this graph, please quote the references suggested in the dataset page.

This graph is a large snapshot of the .uk domain taken at the end of 2014 by BUbiNG starting from the BBC site (there was a special exception for bbc.com, as bbc.co.uk redirects to bbc.com). The maximim number of pages per host was set to 10000.

You can also get the graph of hosts and top private domains (explained here).

Basic data
nodes787 801 471
arcs47 614 527 250
bits/link1.027 (4.10%)
bits/link (transpose)0.818 (3.26%)
average degree60.440
maximum indegree8 605 490
maximum outdegree16 365
dangling nodes12.03%
buckets3.04%
largest component538 924 839 (68.41%)
spid76.94 (± 0.170)
average distance20.61 (± 0.047)
reachable pairs67.27% (± 0.871)
median distance20 (53.33%)
harmonic diameter24.63 (± 0.280)
Random access (recommended)
FilenameSize
uk-2014.graph7.3G
uk-2014.properties4.0K
uk-2014-t.graph5.2G
uk-2014-t.properties4.0K
uk-2014.map8.9G
uk-2014.smap8.9G
uk-2014.md5sums4.0K
uk-2014.lmap28G
uk-2014.fcl26G
uk-2014.urls.gz6.5G
uk-2014.stats4.0K
uk-2014.indegree17M
uk-2014.outdegree44K
uk-2014.scc3.0G
uk-2014.sccsizes405M
Sequential access (high compression)
FilenameSize
uk-2014-hc.graph5.7G
uk-2014-hc.properties4.0K
uk-2014-hc-t.graph4.6G
uk-2014-hc-t.properties4.0K
Natural order (random access)
FilenameSize
uk-2014-nat.graph11G
uk-2014-nat.properties4.0K
uk-2014-nat.fcl23G
uk-2014-nat.urls.gz5.9G
Indegree-frequency plotIndegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plotOutdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)Outdegree-rank plot (cumulative)
Distance probability mass functiondistance probability mass function
Connected-components size distributionConnected-components size distribution
Large connected componentsLarge connected components
Distribution of the logarithm of successor gapsDistribution of the logarithm of the successor gaps
Distribution of successor gapsDistribution of successor gaps