sk-2005

If you publish results based on this graph, please quote the references suggested in the dataset page.

This graph has been obtained from a 2005 crawl of the .sk domain performed by UbiCrawler for a group of Slovakian researchers. An interesting feature of this crawl is that we were provided a very large seed.

Basic data
nodes50 636 154
arcs1 949 412 601
bits/link1.576 (7.24%)
bits/link (transpose)1.289 (5.92%)
average degree38.498
maximum indegree8 563 808
maximum outdegree12 870
dangling nodes13.63%
buckets2.92%
largest component35 874 412 (70.85%)
spid1.58 (± 0.042)
average distance13.71 (± 0.051)
reachable pairs70.28% (± 0.699)
median distance15 (50.67%)
harmonic diameter17.56 (± 0.144)
Random access (recommended)
FilenameSize
sk-2005.graph495M
sk-2005.properties4.0K
sk-2005-t.graph327M
sk-2005-t.properties4.0K
sk-2005.map168M
sk-2005.smap556M
sk-2005.md5sums4.0K
sk-2005.lmap1.6G
sk-2005.fcl1.4G
sk-2005.urls.gz287M
sk-2005.stats4.0K
sk-2005.indegree17M
sk-2005.outdegree28K
sk-2005.scc194M
sk-2005.sccsizes34M
Sequential access (high compression)
FilenameSize
sk-2005-hc.graph367M
sk-2005-hc.properties4.0K
sk-2005-hc-t.graph300M
sk-2005-hc-t.properties4.0K
Natural order (random access)
FilenameSize
sk-2005-nat.graph897M
sk-2005-nat.properties4.0K
sk-2005-nat.fcl1.2G
sk-2005-nat.urls.gz261M
Indegree-frequency plotIndegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plotOutdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)Outdegree-rank plot (cumulative)
Distance probability mass functiondistance probability mass function
Connected-components size distributionConnected-components size distribution
Large connected componentsLarge connected components
Distribution of the logarithm of successor gapsDistribution of the logarithm of the successor gaps
Distribution of successor gapsDistribution of successor gaps