arabic-2005

If you publish results based on this graph, please quote the references suggested in the dataset page.

This graph has been obtained from a 2005 crawl performed by UbiCrawler. The crawl aimed at countries whose web sites could contain (at least potentially) pages written in Arabic.

Basic data
nodes22 744 080
arcs639 999 458
bits/link1.383 (6.56%)
bits/link (transpose)1.131 (5.37%)
average degree28.139
maximum indegree575 618
maximum outdegree9 905
dangling nodes14.51%
buckets3.30%
largest component15 177 163 (66.73%)
spid1.83 (± 0.024)
average distance16.58 (± 0.048)
reachable pairs66.33% (± 0.675)
median distance20 (51.69%)
harmonic diameter22.39 (± 0.197)
Random access (recommended)
FilenameSize
arabic-2005.graph141M
arabic-2005.properties4.0K
arabic-2005-t.graph96M
arabic-2005-t.properties4.0K
arabic-2005.map250M
arabic-2005.smap250M
arabic-2005.md5sums4.0K
arabic-2005.lmap726M
arabic-2005.fcl651M
arabic-2005.urls.gz131M
arabic-2005.stats4.0K
arabic-2005.indegree1.2M
arabic-2005.outdegree24K
arabic-2005.scc87M
arabic-2005.sccsizes16M
Sequential access (high compression)
FilenameSize
arabic-2005-hc.graph106M
arabic-2005-hc.properties4.0K
arabic-2005-hc-t.graph87M
arabic-2005-hc-t.properties4.0K
Natural order (random access)
FilenameSize
arabic-2005-nat.graph214M
arabic-2005-nat.properties4.0K
arabic-2005-nat.fcl564M
arabic-2005-nat.urls.gz118M
Indegree-frequency plotIndegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plotOutdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)Outdegree-rank plot (cumulative)
Distance probability mass functiondistance probability mass function
Connected-components size distributionConnected-components size distribution
Large connected componentsLarge connected components
Distribution of the logarithm of successor gapsDistribution of the logarithm of the successor gaps
Distribution of successor gapsDistribution of successor gaps