enron

If you publish results based on this graph, please quote the references suggested in the dataset page.

This dataset was made public by the Federal Energy Regulatory Commission during its investigations: it is a partially anonymised corpus of e-mail messages exchanged by some Enron employees (mostly part of the senior management). We turned this dataset into a directed graph, whose nodes represent people and with an arc from x to y whenever y was the recipient of (at least) a message sent by x.

Basic data
nodes69 244
arcs276 143
bits/link5.874 (37.83%)
bits/link (transpose)7.388 (47.58%)
average degree3.988
maximum indegree1 394
maximum outdegree1 392
dangling nodes74.63%
buckets0.39%
largest component8 271 (11.94%)
average distance4.25 (± 0.007)
reachable pairs11.74% (± 0.113)
median distance
harmonic diameter34.28 (± 0.295)
Random access (recommended)
FilenameSize
enron.graph208K
enron.properties4.0K
enron-t.graph276K
enron-t.properties4.0K
enron.md5sums4.0K
enron.stats4.0K
enron.indegree4.0K
enron.outdegree4.0K
enron.scc272K
enron.sccsizes240K
Sequential access (high compression)
FilenameSize
enron-hc.graph200K
enron-hc.properties4.0K
enron-hc-t.graph252K
enron-hc-t.properties4.0K
Natural order (random access)
FilenameSize
enron-nat.graph460K
enron-nat.properties4.0K
JGraphT serialized succinct representation
FilenameSize
enron.suxdir1.2M
Indegree-frequency plotIndegree-frequency plot (with Fibonacci binning)
Outdegree-frequency plotOutdegree-frequency plot (with Fibonacci binning)
Indegree-rank plot (cumulative)Indegree-rank plot (cumulative)
Outdegree-rank plot (cumulative)Outdegree-rank plot (cumulative)
Distance probability mass functiondistance probability mass function
Connected-components size distributionConnected-components size distribution
Large connected componentsLarge connected components
Distribution of the logarithm of successor gapsDistribution of the logarithm of the successor gaps
Distribution of successor gapsDistribution of successor gaps