eu-2015

If you publish results based on this graph, please quote the references suggested in the dataset page.

This graph is a large snapshot of the EU countries (that is, domains .ad, .al, .at, .be, .bg, .ch, .cz, .de, .dk, .ee, .es, .eu, .fi, .fo, .fr, .gb, .gr, .hr, .hu, .ie, .im, .is, .it, .li, .lt, .lu, .lv, .mc, .md, .me, .nl, .no, .pl, .pt, .ro, .se, .si, .sk, .sm, .uk, .va) taken in 2015 by BUbiNG starting from the site http://europa.eu/. The maximum number of pages per host was set to 10M (and never reached).

You can also get the graph of hosts and top private domains (explained here).

Basic data
nodes1 070 557 254
arcs91 792 261 600
bits/link0.899 (3.59%)
bits/link (transpose)0.723 (2.89%)
average degree85.743
maximum indegree20 252 239
maximum outdegree35 340
dangling nodes10.06%
buckets1.26%
largest component897 758 878 (83.86%)
spid0.35 (± 0.007)
average distance12.45 (± 0.033)
reachable pairs85.14% (± 1.412)
median distance13 (60.75%)
harmonic diameter14.18 (± 0.220)
Random access (recommended)
FilenameSize
eu-2015.graph13G
eu-2015.properties4.0K
eu-2015-t.graph9.1G
eu-2015-t.properties4.0K
eu-2015.map12G
eu-2015.smap12G
eu-2015.md5sums4.0K
eu-2015.lmap39G
eu-2015.fcl35G
eu-2015.urls.gz11G
eu-2015.stats4.0K
eu-2015.indegree39M
eu-2015.outdegree84K
eu-2015.scc4.0G
eu-2015.sccsizes461M
Sequential access (high compression)
FilenameSize
eu-2015-hc.graph9.7G
eu-2015-hc.properties4.0K
eu-2015-hc-t.graph7.8G
eu-2015-hc-t.properties4.0K
Natural order (random access)
FilenameSize
eu-2015-nat.graph20G
eu-2015-nat.properties4.0K
eu-2015-nat.fcl31G
eu-2015-nat.urls.gz9.4G
Indegree-frequency plotIndegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plotOutdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)Outdegree-rank plot (cumulative)
Distance probability mass functiondistance probability mass function
Connected-components size distributionConnected-components size distribution
Large connected componentsLarge connected components
Distribution of the logarithm of successor gapsDistribution of the logarithm of the successor gaps
Distribution of successor gapsDistribution of successor gaps