amazon-2008

If you publish results based on this graph, please cite the references suggested on the dataset page.

A directed graph describing similarity among books as reported by the Amazon store. More precisely, the data was obtained through the Amazon E-Commerce Service APIs using SimilarityLookup queries.

Basic data
nodes                     735 323
arcs                      5 158 388
bits/link                 9.153 (50.51%)
bits/link (transpose)     8.819 (48.67%)
average degree            7.015
maximum indegree          1 076
maximum outdegree         10
dangling nodes            12.04%
buckets                   1.36%
largest component         627 646 (85.36%)
average distance          12.06 (± 0.021)
reachable pairs           84.40% (± 0.695)
median distance           12 (51.65%)
harmonic diameter         13.42 (± 0.098)
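
Several of the figures in the basic data are derived from the others, so they can be cross-checked with a few lines of arithmetic. The following is a minimal sketch in Python, not part of the dataset distribution. The variable names are illustrative, and the size estimate assumes that the .graph file consists almost entirely of the compressed bitstream.

```python
# Figures copied from the "Basic data" table.
nodes = 735_323
arcs = 5_158_388
largest_component = 627_646
bits_per_link = 9.153

# Average degree is the number of arcs divided by the number of nodes.
average_degree = arcs / nodes

# Share of nodes that belong to the largest component, as a percentage.
largest_fraction = 100 * largest_component / nodes

# Rough size of the compressed graph in MiB (assumption: the file holds
# little besides bits_per_link * arcs bits of compressed data).
graph_mib = bits_per_link * arcs / 8 / 2**20

print(f"average degree:    {average_degree:.3f}")
print(f"largest component: {largest_fraction:.2f}%")
print(f"graph size:        {graph_mib:.2f} MiB")
```

The first two values agree with the table to the precision shown there, and the size estimate is close to the 5.7M listed for amazon-2008.graph.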
Random access (recommended)
Filename                      Size
amazon-2008.graph             5.7M
amazon-2008.properties        4.0K
amazon-2008-t.graph           5.5M
amazon-2008-t.properties      4.0K
amazon-2008.md5sums           4.0K
amazon-2008.stats             4.0K
amazon-2008.indegree          4.0K
amazon-2008.outdegree         4.0K
amazon-2008.scc               2.9M
amazon-2008.sccsizes          356K
Sequential access (high compression)
Filename                      Size
amazon-2008-hc.graph          5.7M
amazon-2008-hc.properties     4.0K
amazon-2008-hc-t.graph        5.5M
amazon-2008-hc-t.properties   4.0K
Natural order (random access)
Filename                      Size
amazon-2008-nat.graph         11M
amazon-2008-nat.properties    4.0K
JGraphT serialized succinct representation
Filename                      Size
amazon-2008.suxdir            25M
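
The file listing includes amazon-2008.md5sums, which can be used to check that downloads arrived intact. The sketch below is one way to do that in Python. It assumes the file uses the usual md5sum output layout (a hex digest, whitespace, then a filename on each line), which has not been verified against the actual file. The function names md5_of and verify_md5sums are illustrative, not part of any distributed tool.

```python
import hashlib
from pathlib import Path


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5sums(md5sums_path):
    """Check files listed in an md5sum-style file (assumed layout).

    Returns a dict mapping each filename to True (digest matches),
    False (digest differs), or None (file not present locally).
    """
    md5sums_path = Path(md5sums_path)
    base = md5sums_path.parent
    results = {}
    for line in md5sums_path.read_text().splitlines():
        if not line.strip():
            continue
        expected, name = line.split(maxsplit=1)
        # md5sum prefixes the filename with "*" when run in binary mode.
        name = name.strip().lstrip("*")
        target = base / name
        if target.exists():
            results[name] = md5_of(target) == expected.lower()
        else:
            results[name] = None
    return results


# Example call, assuming the files were downloaded into the current directory:
# print(verify_md5sums("amazon-2008.md5sums"))
```

Files that were not downloaded are reported as None rather than as failures, since only a subset of the listed files is needed for most uses.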
Indegree-frequency plot
Indegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plot
Outdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)
Distance probability mass function
Connected-components size distribution
Large connected components
Distribution of the logarithm of the successor gaps
Distribution of successor gaps