protected static final class ParsingThread.FrontierEnqueuer
extends java.lang.Object
Modifier and Type | Field | Description |
---|---|---|
int |
outlinks |
|
int |
scheduledLinks |
Constructor | Description |
---|---|
FrontierEnqueuer(Frontier frontier,
RuntimeConfiguration rc) |
Creates the enqueuer.
|
Modifier and Type | Method | Description |
---|---|---|
void |
enqueue(java.net.URI url) |
Enqueues the given URL, provided that it passes the schedule filter, its host is
blacklisted . |
void |
init(java.net.URI uri,
byte[] schemeAuthority,
char[][] robotsFilter) |
Initializes the enqueuer for parsing a page with a specific scheme+authority and robots filter.
|
public FrontierEnqueuer(Frontier frontier, RuntimeConfiguration rc)
frontier
- the frontier instantiating the enqueuer.rc
- the configuration to be used.public void init(java.net.URI uri, byte[] schemeAuthority, char[][] robotsFilter)
schemeAuthority
- the scheme+authority of the page to be parsed.robotsFilter
- the robots filter of the (authority of the) page to be parsed.public void enqueue(java.net.URI url)
blacklisted
.
Moreover, if the scheme+authority is the same as the one of the page being parsed, we check that the URL respects the robots filter.url
- the URL to be enqueued.