protected static final class ParsingThread.FrontierEnqueuer
extends java.lang.Object
| Modifier and Type | Field | Description |
|---|---|---|
int |
outlinks |
|
int |
scheduledLinks |
| Constructor | Description |
|---|---|
FrontierEnqueuer(Frontier frontier,
RuntimeConfiguration rc) |
Creates the enqueuer.
|
| Modifier and Type | Method | Description |
|---|---|---|
void |
enqueue(java.net.URI url) |
Enqueues the given URL, provided that it passes the schedule filter, its host is
blacklisted. |
void |
init(java.net.URI uri,
byte[] schemeAuthority,
char[][] robotsFilter) |
Initializes the enqueuer for parsing a page with a specific scheme+authority and robots filter.
|
public FrontierEnqueuer(Frontier frontier, RuntimeConfiguration rc)
frontier - the frontier instantiating the enqueuer.rc - the configuration to be used.public void init(java.net.URI uri,
byte[] schemeAuthority,
char[][] robotsFilter)
schemeAuthority - the scheme+authority of the page to be parsed.robotsFilter - the robots filter of the (authority of the) page to be parsed.public void enqueue(java.net.URI url)
blacklisted.
Moreover, if the scheme+authority is the same as the one of the page being parsed, we check that the URL respects the robots filter.url - the URL to be enqueued.