uk-union-2006-06-2007-05

If you publish results based on this graph, please quote the references suggested in the dataset page.

This graph is a time-aware graph generated by combining twelve monthly snapshot of the .uk domain collected for the the DELIS project (see also this paper for some elaboration).

The DELIS (Dynamically Evolving Large-scale Information Systems) European FP6 project dealt with methods, techniques, tools, and prototypical implementations coping with challenges imposed by the size and dynamics of today's and especially future information systems. The "DELIS dataset" is a collection of web graphs collected within such project by taking snapshots at a monthly rate focussing on the .uk domain: the choice of the domain was the most obvious, given the European nature of the project and in consideration of the linguistic and social centrality of the UK within Europe; the frequency chosen is the largest possible that does not raise issues of unpoliteness. The first snapshot was collected in May 2006. All snapshots were collected at the DSI, using hardware partly funded by the DELIS project. A detailed description of the dataset can be found at the end of this page.

The graph is an ArcLabelledImmutableGraph whose labels are instances of CompressedIntLabel. The documentation of the latter class explains how to access both node and the arc labels. In both cases, a label is a 12-bit mask telling in which of the twelve snapshot the node or arc was present.

If you publish results based on these graphs, please acknowledge our work by quoting the following paper:

@ARTICLE{BSVLTAG,
        AUTHOR = "Paolo Boldi and Massimo Santini and Sebastiano Vigna",
        TITLE = "A Large Time-Aware Graph",
        JOURNAL = "SIGIR Forum",
        YEAR = 2008,
        VOLUME = 42,
        NUMBER = 2,
        PAGES = "33--38"
}
Basic data
nodes133 633 040
arcs5 507 679 822
bits/link2.631 (11.40%)
average degree41.215
maximum indegree6 366 525
maximum outdegree22 429
dangling nodes9.08%
Random access (recommended)
FilenameSize
uk-union-2006-06-2007-05.properties4.0K
uk-union-2006-06-2007-05.urls.gz793M
uk-union-2006-06-2007-05.stats4.0K
uk-union-2006-06-2007-05.labeldecoders66M
uk-union-2006-06-2007-05.labeloffsets142M
uk-union-2006-06-2007-05.labels1.1G
uk-union-2006-06-2007-05.nodelabels192M
uk-union-2006-06-2007-05-underlying.graph1.7G
uk-union-2006-06-2007-05-underlying.properties4.0K
Full text
DatasetPagesSize (Gb)GZip Size (Gb)
06/06 112 386 7631 893.11 402.45
06/07 136 956 5592 287.36 477.03
06/08 141 395 8952 424.82 507.59
06/09 148 965 2982 756.61 546.70
06/10 129 558 4912 336.19 478.31
06/11 150 146 1322 637.70 546.81
06/12 144 489 4462 552.80 525.77
07/01 151 578 1132 651.65 553.97
07/02 153 966 5402 692.88 564.98
07/03 151 427 4612 568.80 545.80
07/04 150 606 6892 700.06 559.84
07/05 150 054 5512 658.18 556.46
Host count (overlap)
 06/0606/0706/0806/0906/1006/1106/1207/0107/0207/0307/0407/05
06/0694 96773 30471 68669 89965 51664 50159 47862 45962 44758 95357 67157 747
06/07 130 778102 25099 48989 95190 90981 49186 74188 14384 08282 73182 138
06/08  128 505102 87384 99990 37881 02386 48986 76281 90880 06679 637
06/09   136 60588 00694 65584 33590 88789 62084 99382 09781 156
06/10    109 91886 17575 83181 13081 61476 61675 66075 128
06/11     121 20886 71491 46191 54984 12582 32281 664
06/12      113 47188 85284 33579 29876 25475 850
07/01       125 13494 25986 40284 47483 127
07/02        122 95691 09487 86486 708
07/03         122 50684 97183 839
07/04          113 15791 636
07/05           114 529
Static URLs (overlap)
 06/0606/0706/0806/0906/1006/1106/1207/0107/0207/0307/0407/05
06/0631 316 40319 034 35518 260 76217 169 96515 264 48414 997 44213 833 22513 675 12613 211 56612 321 65711 912 70311 142 177
06/07 35 160 31923 301 31321 706 03218 531 81318 266 51516 195 04616 407 10915 968 92915 167 84514 545 24313 577 199
06/08  37 263 27824 265 50719 372 50919 379 72417 130 69217 336 16316 709 68615 507 10115 024 06413 802 044
06/09   39 946 09721 240 95521 246 30218 743 74919 047 73518 089 15416 877 62816 304 74614 740 879
06/10    33 812 04322 246 36719 041 23319 059 26718 264 34116 626 25316 355 45715 007 365
06/11     37 337 24222 297 44821 882 83021 279 91118 826 43618 264 55316 568 057
06/12      36 641 05623 526 46720 984 35818 999 38617 903 95216 621 065
07/01       39 042 25723 702 05820 773 37319 932 71718 394 875
07/02        37 693 73223 076 72822 180 33719 977 866
07/03         38 109 72222 364 12620 204 640
07/04          36 896 84924 202 971
07/05           36 864 749