Package it.unimi.dsi.law.warc.parser
Extensions of the BulletParser
.
-
Interface Summary Interface Description Parser A generic parser forresponses
.Parser.LinkReceiver A class that can receive URLs discovered during parsing. -
Class Summary Class Description BinaryParser A universal binary parser that just computes digests.Digester A callback computing the digest of a page.HTMLParser An HTML parser with additional responsibilities (such as guessing the character encoding and resolving relative URLs).HTMLParser.SetLinkReceiver