public class GenerateGraphMap
extends java.lang.Object
The input to the tool is a set of TAB-separated files (one per store), each containing triples <URL,digest,position>, which are assumed to be stably reverse-sorted starting at the digest (the position is the local final position). Positions are set to -1 for duplicate pages.
The output consists of a text list of URLs, a corresponding binary list of long values (the node assigned to each URL), and a TAB-separated list of nodes with their corresponding archetype URLs.
This tool will print on standard output the number of nodes of the resulting graph.
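To make the input format described above concrete, the following is a minimal sketch of how one line of such a file could be read. It is not the tool's own code: the class `TripleLineSketch`, its nested `Triple` type, the `parse` method, and the sample URLs and hexadecimal digests are all hypothetical and exist only for illustration. It assumes the digest can be handled as an opaque string and that the position fits in a `long`.

```java
public class TripleLineSketch {
    /** One input record (hypothetical type): URL, digest, and local position. */
    public static final class Triple {
        public final String url;
        public final String digest;
        public final long position;

        Triple(String url, String digest, long position) {
            this.url = url;
            this.digest = digest;
            this.position = position;
        }

        /** A position of -1 marks a duplicate page, as stated in the class description. */
        public boolean isDuplicate() {
            return position == -1;
        }
    }

    /** Splits one TAB-separated line into its three fields. */
    public static Triple parse(String line) {
        // The -1 limit keeps trailing empty fields, so malformed lines are detected.
        String[] fields = line.split("\t", -1);
        if (fields.length != 3) {
            throw new IllegalArgumentException("Expected 3 TAB-separated fields: " + line);
        }
        return new Triple(fields[0], fields[1], Long.parseLong(fields[2]));
    }

    public static void main(String[] args) {
        // Made-up sample lines; the digest format is an assumption.
        String[] lines = {
            "http://example.com/a\tff01\t0",
            "http://example.com/a-copy\tff01\t-1",
            "http://example.com/b\tff00\t1"
        };
        for (String line : lines) {
            Triple t = parse(line);
            System.out.println(t.url + " -> "
                + (t.isDuplicate() ? "duplicate" : "position " + t.position));
        }
    }
}
```

In this sketch, the two lines sharing the digest `ff01` stand for a page and its duplicate; only the first carries a non-negative position.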
Constructor | Description |
---|---|
GenerateGraphMap() | |
Modifier and Type | Method | Description |
---|---|---|
static void | main(java.lang.String[] arg) | |
public static void main(java.lang.String[] arg) throws java.io.IOException, JSAPException
Throws:
java.io.IOException
JSAPException