- backToRoot - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
The amount of ranking going back to the root.
- backup(int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Backup a number of characters.
- BAD_CHAR - Static variable in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
A list of bad characters.
- BAD_CHAR_SUBSTITUTE - Static variable in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
- beginColumn - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
-
The column number of the first character of this Token.
- beginLine - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
-
The line number of the first character of this Token.
- BeginToken() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Start.
- BFS - Class in it.unimi.dsi.law.graph
-
Computes the visit order with respect to a depth-first visit.
- BFS() - Constructor for class it.unimi.dsi.law.graph.BFS
-
- bfsperm(ImmutableGraph, int, int[]) - Static method in class it.unimi.dsi.law.graph.BFS
-
Return the permutation induced by the visit order of a depth-first visit.
- BINARY_CHECK_SCAN_LENGTH - Static variable in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
-
- BinaryParser - Class in it.unimi.dsi.law.warc.parser
-
A universal binary parser that just computes digests.
- BinaryParser(MessageDigest) - Constructor for class it.unimi.dsi.law.warc.parser.BinaryParser
-
Builds a parser for digesting a page.
- BinaryParser(String) - Constructor for class it.unimi.dsi.law.warc.parser.BinaryParser
-
Builds a parser for digesting a page.
- block - Variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
The warc block
.
- BoundedCountingInputStream - Class in it.unimi.dsi.law.warc.io
-
- BoundedCountingInputStream() - Constructor for class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
Builds the bounded stream.
- BoundedCountingInputStream(InputStream, long, CRC32) - Constructor for class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
Builds the bounded stream.
- BoundedCountingInputStream(InputStream, long) - Constructor for class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
Builds the bounded stream.
- buckets - Variable in class it.unimi.dsi.law.rank.PageRank
-
- buckets - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
-
- buckets() - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Returns the set of buckets of this consistent hash function.
- bufcolumn - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- buffer - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- buffer - Variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
The buffer.
- buffer - Variable in class it.unimi.dsi.law.warc.parser.HTMLParser
-
The character buffer.
- bufline - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- bufpos - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Position in buffer.
- buildProperties(String) - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
Returns a Properties object that contains all the parameters used by the computation.
- buildProperties(String, String) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
Returns a Properties object that contains all the parameters used by the computation.
- buildProperties(String) - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
Returns a Properties object that contains all the parameters used by the computation.
- buildProperties(String, String, String) - Method in class it.unimi.dsi.law.rank.PageRank
-
Returns a
Properties
object that contains all the parameters used by the computation.
- buildProperties(String, String) - Method in class it.unimi.dsi.law.rank.PowerSeries
-
Returns a Properties object that contains all the parameters used by the computation.
- buildProperties(String) - Method in class it.unimi.dsi.law.rank.SpectralRanking
-
Returns a
Properties
object that contains all parameters used by the computation.
- BURL - Class in it.unimi.dsi.law.bubing.util
-
- BYTE_REPRESENTATION_TO_CONTENT_TYPE - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
- BYTE_REPRESENTATION_TO_RECORD_TYPE - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
- ByteArrayCharSequence - Class in it.unimi.dsi.law.warc.util
-
An adapter exposing a byte array as an ISO-8859-1-encoded
character sequence.
- ByteArrayCharSequence(byte[], int, int) - Constructor for class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
Creates a new byte-array character sequence using the provided byte-array fragment.
- ByteArrayCharSequence(byte[]) - Constructor for class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
Creates a new byte-array character sequence using the provided byte array.
- ByteArrayCharSequence() - Constructor for class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
Creates a new empty byte-array character sequence.
- byteRepresentation - Variable in enum it.unimi.dsi.law.warc.io.WarcRecord.ContentType
-
- byteRepresentation - Variable in enum it.unimi.dsi.law.warc.io.WarcRecord.RecordType
-
- cachedContent - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- CASE_INSENSITIVE_STRING_HASH_STRATEGY - Static variable in class it.unimi.dsi.law.warc.util.Util
-
The strategy used to decide whether two header names are the same: we require that they are equal up to case.
- cdata(Element, char[], int, int) - Method in class it.unimi.dsi.law.warc.parser.Digester
-
- changed() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- changed(int) - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- CHAR_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.parser.HTMLParser
-
The size of the internal Jericho buffer.
- characters(char[], int, int, boolean) - Method in class it.unimi.dsi.law.warc.parser.Digester
-
- charAt(int) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
- checkCRC(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
- clear() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
- clear() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
- clear() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
- clear() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
- clear() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
- clear() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
- clear() - Method in class it.unimi.dsi.law.rank.PageRankPush
-
- clear() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- clear() - Method in class it.unimi.dsi.law.rank.PowerSeries
-
- clear() - Method in class it.unimi.dsi.law.rank.SpectralRanking
-
Clears all data and releases resources by nulling
SpectralRanking.rank
(i.e., results we no longer be available).
- clear() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- clone() - Method in class it.unimi.dsi.law.warc.parser.BinaryParser
-
- clone() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
-
- close() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Makes this stream ready.
- close() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
-
- CLOSEPAREN - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
RegularExpression Id.
- coeffBasename - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
If not null
, the basename for coefficents.
- column - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- comment - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
The (optional) comment
gzip field.
- CompressedIntLabel - Class in it.unimi.dsi.law.webgraph
-
An integer label that uses a coder/decoder pair depending on the the source node.
- CompressedIntLabel(String, int, String, LongBigList, Int2ObjectMap<Coder>, Int2ObjectMap<Decoder>) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
- CompressedIntLabel(String, int, LongBigList, Int2ObjectMap<Decoder>, Int2ObjectMap<Coder>) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
Creates a compressed integer label.
- CompressedIntLabel(Object, String, String, String, String, String, String) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
Creates a compressed integer label from a specification that includes decoders and coders.
- CompressedIntLabel(Object, String, String, String, String, String) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
Creates a compressed integer label from a specification that includes just decoders.
- compressedSkipLength - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
The compressed-skip-length
warc-required extra gzip field.
- CompressWarc - Class in it.unimi.dsi.law.warc.tool
-
A tool to compress a WARC file.
- CompressWarc() - Constructor for class it.unimi.dsi.law.warc.tool.CompressWarc
-
- compute(String, int, String, double[], int[]) - Static method in class it.unimi.dsi.law.rank.PageRankFromCoefficients
-
Computes PageRank and its derivatives for given damping factor values.
- compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.AveragePrecisionCorrelation
-
Computes AP correlation between two score vectors.
- compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors.
- compute(CharSequence, Class<?>, CharSequence, Class<?>, boolean, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two (possibly reversed) score vectors with a given precision.
- compute(CharSequence, Class<?>, CharSequence, Class<?>, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two (possibly reversed) score vectors.
- compute(CharSequence, Class<?>, CharSequence, Class<?>, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors with a given precision.
- compute(CharSequence, CharSequence, Class<?>) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors.
- compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.KendallTau
-
Computes Kendall's τ between two score vectors.
- compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.WeightedTau
-
Computes the symmetrized weighted τ between two score vectors.
- compute(double[], double[], int[]) - Method in class it.unimi.dsi.law.stat.WeightedTau
-
Computes the weighted τ between two score vectors, given a reference rank.
- compute(byte[]) - Static method in class it.unimi.dsi.law.util.CRC64
-
Computes the 64 bit CRC of a byte array.
- compute(char[], int, int) - Static method in class it.unimi.dsi.law.util.CRC64
-
Computes the 64 bit CRC of a character array fragment.
- compute(String) - Static method in class it.unimi.dsi.law.util.CRC64
-
Computes the 64 bit CRC of a string, using the ISO8859-1 representation of its Unicode
characters.
- compute(MutableString) - Static method in class it.unimi.dsi.law.util.CRC64
-
Computes the 64 bit CRC of a mutable string, using the ISO8859-1 representation of its
Unicode characters.
- compute(double[]) - Method in enum it.unimi.dsi.law.util.Norm
-
Computes the norm of a vector.
- compute(double[], double[]) - Method in enum it.unimi.dsi.law.util.Norm
-
Computes the norm of the difference of two vectors.
- compute(double[]) - Static method in class it.unimi.dsi.law.util.NormL1
-
Deprecated.
- compute(double[], double[]) - Static method in class it.unimi.dsi.law.util.NormL1
-
Deprecated.
- compute(double[]) - Static method in class it.unimi.dsi.law.util.NormL2
-
Deprecated.
- compute(double[], double[]) - Static method in class it.unimi.dsi.law.util.NormL2
-
Deprecated.
- computeDoubles(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors.
- computeDoubles(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two (possible reversed) score vectors.
- computeDoubles(CharSequence, CharSequence, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors with a given precision.
- computeDoubles(CharSequence, CharSequence, boolean, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two (possible reversed) score vectors with a given precision.
- computeFloats(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors.
- computeFloats(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two (possibly reversed) score vectors.
- computeFloats(CharSequence, CharSequence, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors with a given precision.
- computeFloats(CharSequence, CharSequence, boolean, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two (possibly reversed) score vectors with a given precision.
- computeInts(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors.
- computeInts(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two (possibly reversed) score vectors.
- computeLabels(double) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
- computeLabels(double, int) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
Computes the labels of a graph for a given value of γ.
- computeLongs(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between two score vectors.
- computeLongs(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Computes the correlation between (possibly reversed) two score vectors.
- computePermutation(String) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
- computePermutation(double[], String) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
- computePermutation(double[], String, int) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
Computes the final permutation of the graph.
- configure(BulletParser) - Method in class it.unimi.dsi.law.warc.parser.Digester
-
- connect(InputStream) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Connects to a given input stream, and fills the buffer accordingly.
- ConsistentHashFunction<T extends java.lang.Comparable<? super T>> - Class in it.unimi.dsi.law.util
-
Provides an implementation of consistent hashing.
- ConsistentHashFunction() - Constructor for class it.unimi.dsi.law.util.ConsistentHashFunction
-
Creates a new consistent hash function.
- ConsistentHashFunction(ConsistentHashFunction.SkipStrategy<T>) - Constructor for class it.unimi.dsi.law.util.ConsistentHashFunction
-
Creates a new consistent hash function with given skip strategy.
- ConsistentHashFunction.SkipStrategy<T> - Interface in it.unimi.dsi.law.util
-
Allows to skip suitable items when searching for the closest replica.
- consume() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
Invokes
EntityUtils.consume(HttpEntity)
on the entity returned by the underlying
Apache HTTP Components
HttpResponse
.
- consume(InputStream, long) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Consumes a given number of bytes from a stream.
- consume(InputStream) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Consumes all the bytes of a stream.
- contains(int) - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- containsKey(int) - Method in class it.unimi.dsi.law.stat.WeightedTau.AbstractWeigher
-
- containsKey(Object) - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
-
- contentAsStream() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- contentAsStream() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
-
Returns the content of this response as a stream.
- contentAsStream(MeasurableInputStream) - Method in class it.unimi.dsi.law.warc.util.MutableHttpResponse
-
Sets the content.
- contentAsStream() - Method in class it.unimi.dsi.law.warc.util.MutableHttpResponse
-
- contentAsStream() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
-
- contentReady - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- contentType - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
The warc content-type
.
- ContentTypeStartsWith - Class in it.unimi.dsi.law.warc.filters
-
Accepts only fetched response whose content type starts with a given string.
- ContentTypeStartsWith(String) - Constructor for class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
-
- copy(WarcRecord) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
-
Copies this warc record fields from another warc record.
- copy(WarcRecord.Header) - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
Copies this heaer fields from another header.
- copy() - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
- CorrelationIndex - Class in it.unimi.dsi.law.stat
-
- CorrelationIndex() - Constructor for class it.unimi.dsi.law.stat.CorrelationIndex
-
- CosineSimilarityStrategy - Class in it.unimi.dsi.law.vector
-
A class that compute the similarity between pattern using cosine similarity.
- CosineSimilarityStrategy() - Constructor for class it.unimi.dsi.law.vector.CosineSimilarityStrategy
-
- count() - Method in class it.unimi.dsi.law.util.ExchangeCounter
-
Computes the number of exchanges.
- crc - Variable in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
A class to compute the crc of read bytes.
- crc - Variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
- CRC64 - Class in it.unimi.dsi.law.util
-
Provides static methods to compute 64-bit CRCs of strings and byte arrays.
- createHierarchicalTempFile(File, int, String, String) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Creates a temporary file with a random hierachical path.
- creationDate - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
The warc creation-date
.
- CRLF - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
Some constant strings in their byte equivalent.
- curChar - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
- currentToken - Variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
-
This is the last token that has been consumed successfully.
- CutWarc - Class in it.unimi.dsi.law.warc.tool
-
A class to extract specific records from a WARC file.
- CutWarc() - Constructor for class it.unimi.dsi.law.warc.tool.CutWarc
-
- danglingNodeDistribution - Variable in class it.unimi.dsi.law.rank.PageRank
-
The vector used used to patch null rows of the adjacency matrix (u in the general formula).
- DataInput2Text - Class in it.unimi.dsi.law.io.tool
-
The main method of this class converts a binary DataOutput
file containing numbers to text format.
- DataInput2Text.Type - Enum in it.unimi.dsi.law.io.tool
-
- dataLength - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
The warc data-length
.
- DEBUG - Static variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- DEBUG - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
- DEBUG - Static variable in class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
- debugStream - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Debug output.
- DEFAULT - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
Lexical state.
- DEFAULT_ALPHA - Static variable in class it.unimi.dsi.law.rank.PageRank
-
The default damping factor.
- DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
The default buffer size (64KiB).
- DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
The default size of the internal buffer used for headers read/write.
- DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.tool.ExtractDigestUrls
-
- DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.tool.ExtractLinks
-
- DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.tool.ListGZWarcComments
-
- DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.util.RemappedStringMap
-
- DEFAULT_DOMINANT_EIGENVECTOR_NORM - Static variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
- DEFAULT_GAMMAS - Static variable in class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
The list of default γ values.
- DEFAULT_MAX_ITER - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
-
Default maximum number of iterations (note that this value is used as a default by main methods).
- DEFAULT_NORM - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
-
- DEFAULT_THRESHOLD - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
-
Default threshold (note that this value is used as a default by main methods).
- DenseVector - Class in it.unimi.dsi.law.vector
-
A mutable implementation of
Vector
optimized for dense vectors.
- dequeue() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- derivative - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
- DFS - Class in it.unimi.dsi.law.graph
-
Computes the visit order with respect to a depth-first visit.
- DFS() - Constructor for class it.unimi.dsi.law.graph.DFS
-
- dfsperm(ImmutableGraph, int[]) - Static method in class it.unimi.dsi.law.graph.DFS
-
Return the permutation induced by the visit order of a depth-first visit.
- digest() - Method in class it.unimi.dsi.law.warc.parser.Digester
-
Returns the digest computed.
- digest() - Method in interface it.unimi.dsi.law.warc.util.DigestBasedDuplicateDetection
-
Returns the content digest.
- digest() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
-
- DIGEST_HEADER - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
-
The WARC anvl-filed
name to store the digest.
- DigestBasedDuplicateDetection - Interface in it.unimi.dsi.law.warc.util
-
- DigestEquals - Class in it.unimi.dsi.law.warc.filters
-
Accepts only records of given digest, specified as a hexadecimal string.
- Digester - Class in it.unimi.dsi.law.warc.parser
-
A callback computing the digest of a page.
- Digester(String) - Constructor for class it.unimi.dsi.law.warc.parser.Digester
-
Creates a new callback using the given message digest.
- digits(int) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns the number of decimal digits that are necessary to represent the argument.
- digits(long) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns the number of decimal digits that are necessary to represent the argument.
- disable_tracing() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Disable tracing.
- dispose() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Disposes this stream, deleting the overflow file and nulling the buffer.
- DominantEigenvectorParallelPowerMethod - Class in it.unimi.dsi.law.rank
-
Computes the left dominant eigenvalue and eigenvector of a graph using a parallel implementation of the power method.
- DominantEigenvectorParallelPowerMethod(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
Creates a new instance.
- DominantEigenvectorParallelPowerMethod(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
Creates a new instance.
- DominantEigenvectorParallelPowerMethod(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
Creates a new instance.
- Done() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reset buffer when finished.
- dotProduct(Vector) - Method in class it.unimi.dsi.law.vector.DenseVector
-
- dotProduct(Vector) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
- dotProduct(Vector) - Method in class it.unimi.dsi.law.vector.Vector
-
Returns the dot product between v and this vector.
- downHeap(double[], int[], int[], int, int) - Static method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- DuplicateSegmentsLessThan - Class in it.unimi.dsi.law.warc.filters
-
Accepts only URIs whose path does not contain too many duplicate segments.
- DuplicateSegmentsLessThan(int) - Constructor for class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
-
Creates a filter that only accepts URIs whose path does contains less duplicate consecutive segments than
the given threshold.
- ell1Norm() - Method in class it.unimi.dsi.law.vector.DenseVector
-
- ell1Norm() - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
- ell1norm - Variable in class it.unimi.dsi.law.vector.Vector
-
The value of computed L1 norm.
- ell1Norm() - Method in class it.unimi.dsi.law.vector.Vector
-
Returns the l1 norm of this vector.
- ell2Norm() - Method in class it.unimi.dsi.law.vector.DenseVector
-
- ell2Norm() - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
- ell2norm - Variable in class it.unimi.dsi.law.vector.Vector
-
The value of computed norm.
- ell2Norm() - Method in class it.unimi.dsi.law.vector.Vector
-
Returns the l2 norm of this vector.
- EMPTY_ARRAY - Static variable in class it.unimi.dsi.law.warc.filters.Filters
-
- EmptyQueueStoppingCritertion() - Constructor for class it.unimi.dsi.law.rank.PageRankPush.EmptyQueueStoppingCritertion
-
- enable_tracing() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Enable tracing.
- endColumn - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
-
The column number of the last character of this Token.
- endDocument() - Method in class it.unimi.dsi.law.warc.parser.Digester
-
- endElement(Element) - Method in class it.unimi.dsi.law.warc.parser.Digester
-
- endLine - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
-
The line number of the last character of this Token.
- enqueue(int) - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- ensureDirectory(File) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Checks if the given File exists and is a directory, or if not existent, it makes a directory (and its parent).
- entrySet - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
-
- EOF - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
End of File.
- EOL - Static variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
-
The end of line string for this machine.
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.HostEndsWith
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.HostEquals
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.SchemeEquals
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.StatusCategory
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.URLEquals
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.URLMatchesRegex
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.filters.URLShorterThan
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
- equals(Object) - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
- euclideanDistance(Vector) - Method in class it.unimi.dsi.law.vector.DenseVector
-
- euclideanDistance(Vector) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
- euclideanDistance(Vector) - Method in class it.unimi.dsi.law.vector.Vector
-
Returns the euclidean distance between v and this vector.
- EuclideanSimilarityStrategy - Class in it.unimi.dsi.law.vector
-
A class that compute the similarity between pattern using the euclidean distance.
- EuclideanSimilarityStrategy() - Constructor for class it.unimi.dsi.law.vector.EuclideanSimilarityStrategy
-
- ExchangeCounter - Class in it.unimi.dsi.law.util
-
Computes the number of discordances between two score vectors
using Knight's O(n log n)
MergeSort-based algorithm.
- ExchangeCounter(int[], double[], int[]) - Constructor for class it.unimi.dsi.law.util.ExchangeCounter
-
Creates a new exchange counter with a provided support array.
- ExchangeCounter(int[], double[]) - Constructor for class it.unimi.dsi.law.util.ExchangeCounter
-
Creates a new exchange counter.
- ExchangeWeigher - Class in it.unimi.dsi.law.util
-
Computes the weight of discordances using a generalisation of Knight's algorithm.
- ExchangeWeigher(Int2DoubleFunction, int[], double[], int[], boolean, int[]) - Constructor for class it.unimi.dsi.law.util.ExchangeWeigher
-
Creates a new exchange weigher.
- ExpandBuff(boolean) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- expectedTokenSequences - Variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
-
Each entry in this array is an array of integers.
- ExtractDigestUrls - Class in it.unimi.dsi.law.warc.tool
-
A tool to extract digests and URLs from response records of a WARC file.
- ExtractDigestUrls() - Constructor for class it.unimi.dsi.law.warc.tool.ExtractDigestUrls
-
- ExtractLinks - Class in it.unimi.dsi.law.warc.tool
-
Extracts links from a WARC file.
- ExtractLinks() - Constructor for class it.unimi.dsi.law.warc.tool.ExtractLinks
-
- falling(int, int) - Static method in class it.unimi.dsi.law.Util
-
Computes falling powers.
- FALSE - Static variable in class it.unimi.dsi.law.warc.filters.Filters
-
- FALSE - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
RegularExpression Id.
- fill(long) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Reads the underlying input stream up to a given limit.
- fill() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Reads fully the underlying input stream.
- fillAndRewind() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Reads fully the underlying input stream and rewinds.
- FillBuff() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- Filter<T> - Interface in it.unimi.dsi.law.warc.filters
-
A filter is a strategy to decide whether to accept a given
object or not.
- FILTER_PACKAGE_NAME - Static variable in interface it.unimi.dsi.law.warc.filters.Filter
-
The name of the package that contains this interface as well as
most filters.
- FilterParser<T> - Class in it.unimi.dsi.law.warc.filters.parser
-
A simple parser that transforms a filter expression into a filter.
- FilterParser(Class<T>) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
- FilterParser(InputStream) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Constructor with InputStream.
- FilterParser(InputStream, String) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Constructor with InputStream and supplied encoding
- FilterParser(Reader) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Constructor.
- FilterParser(FilterParserTokenManager) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Constructor with generated Token Manager.
- FilterParserConstants - Interface in it.unimi.dsi.law.warc.filters.parser
-
Token literal values and constants.
- FilterParserTokenManager - Class in it.unimi.dsi.law.warc.filters.parser
-
Token Manager.
- FilterParserTokenManager(SimpleCharStream) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Constructor.
- FilterParserTokenManager(SimpleCharStream, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Constructor.
- Filters - Class in it.unimi.dsi.law.warc.filters
-
A collection of static methods to deal with
filters
.
- Filters() - Constructor for class it.unimi.dsi.law.warc.filters.Filters
-
- finalize() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- fixedWidth() - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
- fixURL(MutableString) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Fixes a given URL so that it is
BURL
-parsable.
- FORBIDDEN_CHARS - Static variable in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Characters that will cause a URI spec to be rejected.
- FormatException(String) - Constructor for exception it.unimi.dsi.law.warc.io.WarcRecord.FormatException
-
- fromBitStream(InputBitStream, int) - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
- fromHexString(String) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns a byte array corresponding to the given number.
- fromNormalizedByteArray(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Creates a new BUbiNG URL from a normalized ASCII string represented by a byte array.
- fromNormalizedSchemeAuthorityAndPathQuery(String, byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Creates a new BUbiNG URL from a normalized ASCII string representing scheme and
authority and a byte-array representation of a normalized ASCII path and query.
- fromWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- fromWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.MutableHttpResponse
-
- fromWarcRecord(WarcRecord) - Method in interface it.unimi.dsi.law.warc.util.Response
-
Fills this response with the content of a
WarcRecord
(optional operation).
- fromWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
-
- generateParseException() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Generate ParseException.
- get(int) - Method in class it.unimi.dsi.law.vector.DenseVector
-
- get(int) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
- get(int) - Method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
-
- get(int) - Method in class it.unimi.dsi.law.vector.Vector
-
Gets the value at index idx.
- get(Object) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
-
- getASCIIBytes(String) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns the given ASCII string as a byte array; characters are filtered through the 1111111(=0x7F) mask.
- getASCIIBytes(MutableString) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns the given ASCII mutable string as a byte array; characters are filtered through the 1111111(=0x7F) mask.
- getBeginColumn() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Get token beginning column number.
- getBeginLine() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Get token beginning line number.
- getCharsetName(byte[], int) - Static method in class it.unimi.dsi.law.warc.parser.HTMLParser
-
Returns the charset name as indicated by a META
HTTP-EQUIV
element, if
present, interpreting the provided byte array as a sequence of
ISO-8859-1-encoded characters.
- getCharsetNameFromHeader(String) - Static method in class it.unimi.dsi.law.warc.parser.HTMLParser
-
Extracts the charset name from the header value of a content-type
header.
- getColumn() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Deprecated.
- getEndColumn() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Get token end column number.
- getEndLine() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Get token end line number.
- getFilterFromSpec(String, String, Class<T>) - Static method in class it.unimi.dsi.law.warc.filters.Filters
-
Creates a filter from a filter class name and an external form.
- GetImage() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Get token literal value.
- getInstance(int, int) - Static method in class it.unimi.dsi.law.vector.DenseVector
-
Returns an instance of given size with zero values.
- getInstance(double[], int) - Static method in class it.unimi.dsi.law.vector.DenseVector
-
Returns an instance from an array of values.
- getInstance(int, double[], int[], int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
Returns an instance of given size from an array of values.
- getInstance(Vector, double, int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
Returns an instance containing all the values of a given vector larger or smaller of a given threshold.
- getInstance(double[], double, int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
Returns an instance containing all the values of a given array larger or smaller of a given threshold.
- getInstance(int[], double[], int, double, int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
Returns an instance containing all the values of a given array larger than a given threshold.
- getInstance(int, int) - Static method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
-
Returns an instance of given size with zero values.
- getInstance(int, Int2DoubleMap, int) - Static method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
-
- getLine() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Deprecated.
- getLong(Object) - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
-
- getMessage() - Method in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
You can also modify the body of this method to customize your error messages.
- getNextToken() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Get the next Token.
- getNextToken() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Get the next Token.
- getString(byte[]) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns the given byte array as an ASCII string.
- getString(byte[], int, int) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns the given byte array as an ASCII string.
- GetSuffix(int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Get the suffix.
- getTabSize() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- getToken(int) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Get the specific Token.
- getValue() - Method in class it.unimi.dsi.law.warc.filters.parser.Token
-
An optional attribute value of the Token.
- graph - Variable in class it.unimi.dsi.law.rank.SpectralRanking
-
The graph.
- GrepWarc - Class in it.unimi.dsi.law.warc.tool
-
A "grep" for WARC files.
- GrepWarc() - Constructor for class it.unimi.dsi.law.warc.tool.GrepWarc
-
- ground() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
- GUESSED_CHARSET_HEADER - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
-
The WARC anvl-filed
name to store the charset recognized during parsing.
- guessedCharset() - Method in class it.unimi.dsi.law.warc.parser.BinaryParser
-
- guessedCharset() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
-
- guessedCharset() - Method in interface it.unimi.dsi.law.warc.parser.Parser
-
Returns a guessed charset for the document, or null
if the charset could
not be guessed.
- gzheader - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
The GZip headers used by this object.
- GZHeader() - Constructor for class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
- GZWarcRecord - Class in it.unimi.dsi.law.warc.io
-
A class to read/write WARC/0.9 records in compressed form (for format details, please see the
WARC and
GZip format specifications).
- GZWarcRecord() - Constructor for class it.unimi.dsi.law.warc.io.GZWarcRecord
-
- GZWarcRecord.GZHeader - Class in it.unimi.dsi.law.warc.io
-
A class to contain fields contained in the gzip header.
- GZWarcStats - Class in it.unimi.dsi.law.warc.tool
-
A tool to compute some statistics about a gzipped WARC file.
- GZWarcStats() - Constructor for class it.unimi.dsi.law.warc.tool.GZWarcStats
-
- hash(long, int) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Returns an array of buckets whose replicae are close to the given point.
- hash(Object, int) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Returns an array of buckets whose replicae are close to the given object.
- hash(long) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Returns the bucket of the replica that is closest to the given point.
- hash(Object) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Returns the bucket of the replica that is closest to the given key.
- hashCode() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
- hashCode() - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
- hashCode() - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
- hasNext() - Method in class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
-
- hasNext() - Method in class it.unimi.dsi.law.warc.io.WarcFilteredIterator
-
- header - Variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
The warc header
.
- Header() - Constructor for class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
- HEADER_CHARSET - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
-
The Charset
used to encode/decode the HTTP headers.
- headerMap - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- headerMap - Variable in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
The header map.
- HeaderMap() - Constructor for class it.unimi.dsi.law.warc.util.MetadataHttpResponse.HeaderMap
-
- headers() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- headers() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
-
Returns the headers of this response.
- headers() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
- headers(Object2ObjectMap<String, String>) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
Sets the headers.
- headers() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
-
- host(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Extracts the host of an absolute BUbiNG URL in its byte-array representation.
- HostEndsWith - Class in it.unimi.dsi.law.warc.filters
-
Accepts only URIs whose host ends with (case-insensitively) a certain suffix.
- HostEndsWith(String) - Constructor for class it.unimi.dsi.law.warc.filters.HostEndsWith
-
Creates a filter that only accepts URLs with a given suffix.
- HostEquals - Class in it.unimi.dsi.law.warc.filters
-
Accepts only URIs whose host equals (case-insensitively) a certain string.
- HostEquals(String) - Constructor for class it.unimi.dsi.law.warc.filters.HostEquals
-
Creates a filter that only accepts URLs with a given host.
- hostFromSchemeAndAuthority(String) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Extracts the host part from a scheme and authority by removing the scheme, the user info and the port number.
- HTMLParser - Class in it.unimi.dsi.law.warc.parser
-
An HTML parser with additional responsibilities (such as guessing the character encoding
and resolving relative URLs).
- HTMLParser(MessageDigest) - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser
-
Builds a parser for link extraction and, possibly, digesting a page.
- HTMLParser(String) - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser
-
Builds a parser for link extraction and, possibly, digesting a page.
- HTMLParser() - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser
-
Builds a parser for link extraction.
- HTMLParser.SetLinkReceiver - Class in it.unimi.dsi.law.warc.parser
-
- HttpComponentsHttpResponse - Class in it.unimi.dsi.law.warc.util
-
- HttpComponentsHttpResponse() - Constructor for class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
Creates a new instance.
- HttpComponentsHttpResponse(URI, HttpResponse) - Constructor for class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
Creates a new instance wrapping a given Apache HTTP Components
HttpResponse
.
- HttpComponentsHttpResponse.HttpResponseHeaderMap - Class in it.unimi.dsi.law.warc.util
-
- httpResponse - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- HttpResponse - Interface in it.unimi.dsi.law.warc.util
-
Provides high level access to WARC records with record-type
equal to
response
and content-type
equal to HTTP
(or HTTPS
).
- HttpResponseFilteredIterator - Class in it.unimi.dsi.law.warc.io
-
A class to iterate over WARC files getting only records corresponding to
HttpResponse
that satisfy a given filter.
- HttpResponseFilteredIterator(FastBufferedInputStream, WarcRecord, WarcHttpResponse, Filter<HttpResponse>, ProgressLogger) - Constructor for class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
-
Builds the filtered iterator.
- HttpResponseFilteredIterator(FastBufferedInputStream, WarcRecord, WarcHttpResponse, Filter<HttpResponse>) - Constructor for class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
-
Builds the filtered iterator.
- HttpResponseHeaderMap() - Constructor for class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
-
- HYPERBOLIC - Static variable in class it.unimi.dsi.law.stat.WeightedTau
-
A singleton instance of the symmetric hyperbolic additive τ.
- HYPERBOLIC_WEIGHER - Static variable in class it.unimi.dsi.law.stat.WeightedTau
-
A hyperbolic weigher (the default one).
- id - Variable in class it.unimi.dsi.law.vector.Vector
-
The description ID associated with this vector (immutable).
- image - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
-
The string image of the token.
- ImmutableSparseVector - Class in it.unimi.dsi.law.vector
-
An immutable implementation of
Vector
optimized for sparse vectors.
- inBuf - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- index - Variable in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
An arrays containing the indexes of the vector values.
- IndexWarc - Class in it.unimi.dsi.law.warc.tool
-
A tool to index a WARC file.
- IndexWarc() - Constructor for class it.unimi.dsi.law.warc.tool.IndexWarc
-
- init() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
- init() - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
- init() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
- init() - Method in class it.unimi.dsi.law.rank.PageRank
-
Basic initialization: we log the damping factor, check that the preference vector is correctly sized and stochastic,
fill
SpectralRanking.rank
with the preference vector and set the dangling-node distribution
depending on the value of
PageRank.stronglyPreferential
.
- init() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
- init() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
- init() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
- init() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
- init() - Method in class it.unimi.dsi.law.rank.PageRankPush
-
- init() - Method in class it.unimi.dsi.law.rank.PowerSeries
-
- init() - Method in class it.unimi.dsi.law.rank.SpectralRanking
-
- init(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
-
- init(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
-
Initializes this receiver for a new page.
- input_stream - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
- inputStream - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- inspectable - Variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
The number of bytes read in the buffer, when connected.
- InspectableBufferedInputStream - Class in it.unimi.dsi.law.warc.io
-
An input stream that wraps an underlying input stream to make it
rewindable and partially inspectable, using a bounded-capacity memory buffer and an overflow file.
- InspectableBufferedInputStream(int, File) - Constructor for class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Creates a new ready stream.
- InspectableBufferedInputStream(int) - Constructor for class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Creates a new ready stream using default temporary directory for the overflow file.
- InspectableBufferedInputStream() - Constructor for class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Creates a new ready stream with default buffer size, and using default temporary directory for the overflow file.
- InspectableBufferedInputStream.State - Enum in it.unimi.dsi.law.warc.io
-
The possible states of this stream, as explained above.
- INSTANCE - Static variable in class it.unimi.dsi.law.stat.AveragePrecisionCorrelation
-
The singleton instance of this class.
- INSTANCE - Static variable in class it.unimi.dsi.law.stat.KendallTau
-
The singleton instance of this class.
- INSTANCE - Static variable in class it.unimi.dsi.law.warc.filters.IsHttpResponse
-
- INSTANCE - Static variable in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
-
- Int2DoubleMapVector - Class in it.unimi.dsi.law.vector
-
A mutable implementation of
Vector
for sparse vectors.
- intermediateRank - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
The rank vector obtained after the first half of a round.
- IntHeapIndirectPriorityQueue() - Constructor for class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- INVALID_ID - Static variable in class it.unimi.dsi.law.vector.Vector
-
A value indicating that the norm is not computed for current values.
- INVALID_LEXICAL_STATE - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
Tried to change to an invalid lexical state.
- INVALID_NORM - Static variable in class it.unimi.dsi.law.vector.Vector
-
A value indicating that the norm is not computed for current values.
- isDuplicate() - Method in interface it.unimi.dsi.law.warc.util.DigestBasedDuplicateDetection
-
Returns the duplicate status of this response.
- isDuplicate() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
-
- ISDUPLICATE_HEADER - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
-
The WARC anvl-filed
name to store the digest.
- isEmpty() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- IsHttpResponse - Class in it.unimi.dsi.law.warc.filters
-
Accepts only records that are http/https responses.
- isMutable() - Method in class it.unimi.dsi.law.vector.Vector
-
Returns the mutability status of this vector.
- IsProbablyBinary - Class in it.unimi.dsi.law.warc.filters
-
Accepts only http responses whose content stream appears to be binary.
- isSkippable(T) - Method in interface it.unimi.dsi.law.util.ConsistentHashFunction.SkipStrategy
-
Checks whether a bucket can be returned or should be skipped.
- isStochastic(DoubleList) - Static method in class it.unimi.dsi.law.rank.SpectralRanking
-
- it.unimi.dsi.law - package it.unimi.dsi.law
-
Basic classes.
- it.unimi.dsi.law.bubing.util - package it.unimi.dsi.law.bubing.util
-
- it.unimi.dsi.law.graph - package it.unimi.dsi.law.graph
-
Graph-related classes.
- it.unimi.dsi.law.io.tool - package it.unimi.dsi.law.io.tool
-
Tools manipulating and converting files.
- it.unimi.dsi.law.rank - package it.unimi.dsi.law.rank
-
- it.unimi.dsi.law.stat - package it.unimi.dsi.law.stat
-
Statistical tools (in particular,
Kendall's τ) for large-size data.
- it.unimi.dsi.law.util - package it.unimi.dsi.law.util
-
Utility classes.
- it.unimi.dsi.law.vector - package it.unimi.dsi.law.vector
-
- it.unimi.dsi.law.warc.filters - package it.unimi.dsi.law.warc.filters
-
A comprehensive filtering system.
- it.unimi.dsi.law.warc.filters.parser - package it.unimi.dsi.law.warc.filters.parser
-
- it.unimi.dsi.law.warc.io - package it.unimi.dsi.law.warc.io
-
Provides classes performing low and high level WARC I/O (for format details, please see the
ISO draft).
- it.unimi.dsi.law.warc.io.examples - package it.unimi.dsi.law.warc.io.examples
-
- it.unimi.dsi.law.warc.parser - package it.unimi.dsi.law.warc.parser
-
- it.unimi.dsi.law.warc.tool - package it.unimi.dsi.law.warc.tool
-
Command-line tools that manipulate WARC files.
- it.unimi.dsi.law.warc.util - package it.unimi.dsi.law.warc.util
-
- it.unimi.dsi.law.webgraph - package it.unimi.dsi.law.webgraph
-
- iteration - Variable in class it.unimi.dsi.law.rank.SpectralRanking
-
- IterationNumberStoppingCriterion(int) - Constructor for class it.unimi.dsi.law.rank.SpectralRanking.IterationNumberStoppingCriterion
-
Creates an instance with a given number of iterations.
- iterator() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
-
- L1NormStoppingCritertion() - Constructor for class it.unimi.dsi.law.rank.PageRankPush.L1NormStoppingCritertion
-
- labelBasename(String) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
Sets the basename for label files.
- labelPropagation(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- lambda - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
The dominant eigenvalue.
- largestIndegree(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- largestOutdegree(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- LayeredLabelPropagation - Class in it.unimi.dsi.law.graph
-
An implementation of the layered label propagation algorithm described by
by Paolo Boldi, Sebastiano Vigna, Marco Rosa, Massimo Santini, and Sebastiano Vigna in “Layered label propagation:
A multiresolution coordinate-free ordering for compressing social networks”,
Proceedings of the 20th international conference on World Wide Web, pages 587−596, ACM, 2011.
- LayeredLabelPropagation(ImmutableGraph, long) - Constructor for class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
Creates a new instance.
- LayeredLabelPropagation(ImmutableGraph, int[], long) - Constructor for class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
Creates a new instance using a specific initial permutation.
- LayeredLabelPropagation(ImmutableGraph, int[], long, boolean) - Constructor for class it.unimi.dsi.law.graph.LayeredLabelPropagation
-
Creates a new instance using a specific initial permutation.
- LeftSingularVectorParallelPowerMethod - Class in it.unimi.dsi.law.rank
-
Computes the left singular vector of a graph using a parallel implementation of the power method.
- LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
Creates a new instance.
- LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
Creates a new instance.
- LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph, int) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
Creates a new instance.
- LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
Creates a new instance.
- length() - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
- length() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Returns the overall length of this input stream.
- length() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
-
- length() - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
- LEXICAL_ERROR - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
Lexical error occurred.
- LexicalErr(boolean, int, int, int, String, int) - Static method in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
Returns a detailed message for the Error when it is thrown by the
token manager to indicate a lexical error.
- lexStateNames - Static variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Lexer state names.
- line - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- link(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
-
- link(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
-
Handles a link.
- list() - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
-
- ListGZWarcComments - Class in it.unimi.dsi.law.warc.tool
-
A tool to list the GZip header comments contained in a compressed WARC file.
- ListGZWarcComments() - Constructor for class it.unimi.dsi.law.warc.tool.ListGZWarcComments
-
- loadAsDoubles(CharSequence, Class<?>, boolean) - Static method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Loads a vector of doubles, either in binary or textual form.
- location() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
-
Returns the BURL location header, if present; if it is not present, but the page contains a valid metalocation, the latter
is returned.
- location(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
-
- location(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
-
Handles the location defined by headers.
- log10(int) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns ⌊ log10(x
) ⌋.
- log10(long) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns ⌊ log10(x
) ⌋.
- LOGARITHMIC_WEIGHER - Static variable in class it.unimi.dsi.law.stat.WeightedTau
-
A logarithmic weigher.
- logger - Variable in class it.unimi.dsi.law.rank.SpectralRanking
-
A logger defined by the implementing subclasses.
- LOGGER - Static variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- LOOP_DETECTED - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
Detected (and bailed out of) an infinite loop in the token manager.
- n - Variable in class it.unimi.dsi.law.rank.SpectralRanking
-
- name - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
The (optional) name
gzip field.
- newToken(int, String) - Static method in class it.unimi.dsi.law.warc.filters.parser.Token
-
Returns a new Token object, by default.
- newToken(int) - Static method in class it.unimi.dsi.law.warc.filters.parser.Token
-
- next - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
-
A reference to the next regular (non-special) token from the input
stream.
- next() - Method in class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
-
- next() - Method in class it.unimi.dsi.law.warc.io.WarcFilteredIterator
-
- node2Seen - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
A map from nodes to the seen-order.
- nodeLabels - Variable in class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
The node labels stream containing the node labels.
- nonZero - Variable in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
The number of non-zero entry in this vector.
- norm - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
The norm.
- norm - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
The norm.
- Norm - Enum in it.unimi.dsi.law.util
-
An Enum
providing different ℓ norms.
- normalize(double[], double) - Method in enum it.unimi.dsi.law.util.Norm
-
Normalizes a vector to a given norm value.
- normalize(double[], double) - Static method in class it.unimi.dsi.law.util.NormL1
-
Deprecated.
- normalize(double[], double) - Static method in class it.unimi.dsi.law.util.NormL2
-
Deprecated.
- normDelta() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
- normDelta() - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
- normDelta() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
- normDelta() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
Return an upper bound to the ℓ1 norm of the error, obtained multiplying by
α / (1 − α) the ℓ1 norm of the difference between the last two approximations (this idea arose in discussions with David Gleich).
- normDelta() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
Return the following values: if a
suitable norm vector has been set, an upper bound on the error (the ℓ
∞ distance from the rank to be computed);
otherwise, an upper bound to the ℓ
1 norm of the error, obtained multiplying by
α / (1 − α) the ℓ
1 norm of the difference between the last two approximations (this idea arose in discussions with David Gleich).
- normDelta() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
- normDelta() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
- normDelta() - Method in class it.unimi.dsi.law.rank.PowerSeries
-
- normDelta() - Method in class it.unimi.dsi.law.rank.SpectralRanking
-
Returns the norm of an estimation of the distance to the limit of the iterative process: depending
on the implementation, this can be an actual bound or, for example, just the difference between the
last two approximations.
- NormL1 - Class in it.unimi.dsi.law.util
-
- NormL2 - Class in it.unimi.dsi.law.util
-
- NormStoppingCriterion(double) - Constructor for class it.unimi.dsi.law.rank.SpectralRanking.NormStoppingCriterion
-
Creates an instance with given threshold.
- normVector(String, double) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
Sets the norm vector.
- normVector(double[], double) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
Sets the norm vector.
- normVector(String, double) - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
Sets the norm vector.
- normVector(double[], double) - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
Sets the norm vector.
- not(Filter<T>) - Static method in class it.unimi.dsi.law.warc.filters.Filters
-
Produces the negation of the given filter.
- NOT - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
RegularExpression Id.
- NULL_LINK_RECEIVER - Static variable in interface it.unimi.dsi.law.warc.parser.Parser
-
- NumberDistinctLines - Class in it.unimi.dsi.law.io.tool
-
The main method of this class reads a UTF-8 file containg a newline separated
list of strings and writes a
DataOutputStream
containing a
list of ints such that the
i-th int is equal to the
j-th
int iff the (
crc of the)
i-th
string is equal to the (
crc of
the)
j-th string.
- pageRank(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- PageRank - Class in it.unimi.dsi.law.rank
-
An abstract class defining methods and attributes supporting PageRank computations.
- PageRank(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRank
-
Creates a new instance.
- PageRankFromCoefficients - Class in it.unimi.dsi.law.rank
-
Computes PageRank using its power series.
- PageRankFromCoefficients() - Constructor for class it.unimi.dsi.law.rank.PageRankFromCoefficients
-
- PageRankGaussSeidel - Class in it.unimi.dsi.law.rank
-
Computes PageRank of a graph using the Gauß–Seidel method.
- PageRankGaussSeidel(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
Creates a new instance.
- PageRankGaussSeidel(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
Creates a new instance.
- PageRankParallelGaussSeidel - Class in it.unimi.dsi.law.rank
-
Computes PageRank using a parallel (multicore) implementation of the
Gauß–Seidel method.
- PageRankParallelGaussSeidel(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
Creates a new instance.
- PageRankParallelGaussSeidel(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
Creates a new instance.
- PageRankParallelGaussSeidel(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
Creates a new instance.
- PageRankParallelPowerSeries - Class in it.unimi.dsi.law.rank
-
Computes PageRank using a parallel (multicore) implementation of the
power-series method, which runs
the power method starting from the preference vector, thus evaluating the truncated PageRank power series (see
PageRankPowerSeries
).
- PageRankParallelPowerSeries(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
Creates a new instance.
- PageRankParallelPowerSeries(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
Creates a new instance.
- PageRankPowerSeries - Class in it.unimi.dsi.law.rank
-
Computes PageRank (and possibly its derivatives in the damping factor) using its power series.
- PageRankPowerSeries(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankPowerSeries
-
Creates a new instance.
- PageRankPowerSeries(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankPowerSeries
-
Creates a new instance.
- PageRankPush - Class in it.unimi.dsi.law.rank
-
Computes strongly preferential PageRank for a preference vector concentrated on a node using the push algorithm.
- PageRankPush(ImmutableGraph, Logger, boolean) - Constructor for class it.unimi.dsi.law.rank.PageRankPush
-
Creates a new instance.
- PageRankPush(ImmutableGraph, boolean) - Constructor for class it.unimi.dsi.law.rank.PageRankPush
-
Creates a new instance.
- PageRankPush.EmptyQueueStoppingCritertion - Class in it.unimi.dsi.law.rank
-
- PageRankPush.IntHeapIndirectPriorityQueue - Class in it.unimi.dsi.law.rank
-
- PageRankPush.L1NormStoppingCritertion - Class in it.unimi.dsi.law.rank
-
- parse(String) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Creates a new BUbiNG URL from a string specification if possible, or returns null
otherwise.
- parse(MutableString) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Creates a new BUbiNG URL from a
mutable string
specification if possible, or returns
null
otherwise.
- parse(String) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
- parse(Response, Parser.LinkReceiver) - Method in class it.unimi.dsi.law.warc.parser.BinaryParser
-
- parse(Response, Parser.LinkReceiver) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
-
- parse(Response, Parser.LinkReceiver) - Method in interface it.unimi.dsi.law.warc.parser.Parser
-
Parses a response.
- parseCommaSeparatedProperty(String) - Static method in class it.unimi.dsi.law.warc.util.Util
-
The given string is parsed as a comma-separated list of items, and the items are returned
in the form of an array, possibly after resolving an indirection.
- ParseException - Exception in it.unimi.dsi.law.warc.filters.parser
-
This exception is thrown when parse errors are encountered.
- ParseException(Token, int[][], String[]) - Constructor for exception it.unimi.dsi.law.warc.filters.parser.ParseException
-
This constructor is used by the method "generateParseException"
in the generated parser.
- ParseException() - Constructor for exception it.unimi.dsi.law.warc.filters.parser.ParseException
-
The following constructors are for use by you for whatever
purpose you can think of.
- ParseException(String) - Constructor for exception it.unimi.dsi.law.warc.filters.parser.ParseException
-
Constructor with message.
- parseInputTypes(JSAPResult) - Static method in class it.unimi.dsi.law.stat.CorrelationIndex
-
Commodity method to extract from a
JSAPResult
instance the file type information provided by
the user, or supply the default (doubles in binary form).
- Parser - Interface in it.unimi.dsi.law.warc.parser
-
- Parser.LinkReceiver - Interface in it.unimi.dsi.law.warc.parser
-
A class that can receive URLs discovered during parsing.
- pathAndQuery(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Returns the concatenated raw path and raw query of a BUbiNG URL.
- pathAndQueryAsByteArray(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Returns an ASCII byte-array representation of
the raw path and raw query of a BUbiNG URL.
- pathAndQueryAsByteArray(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Extracts the path and query of an absolute BUbiNG URL in its byte-array representation.
- PathEndsWithOneOf - Class in it.unimi.dsi.law.warc.filters
-
Accepts only URIs whose path ends (case-insensitively) with one of a given set of suffixes.
- PathEndsWithOneOf(String[]) - Constructor for class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
-
Creates a filter that only accepts URLs whose path ends with one of a given set of suffixes.
- pNorm - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
- position() - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
- position() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- position() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
-
- PowerSeries - Class in it.unimi.dsi.law.rank
-
Computes a power series on a graph using a parallel implementation.
- PowerSeries(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
-
Creates a new instance.
- PowerSeries(ImmutableGraph, int) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
-
Creates a new instance.
- PowerSeries(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
-
Creates a new instance.
- PowerSeries(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
-
Creates a new instance.
- pr(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- Precision - Class in it.unimi.dsi.law.util
-
A set of commodity methods to manipulate precision of doubles.
- Precision() - Constructor for class it.unimi.dsi.law.util.Precision
-
- preference - Variable in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
The preference vector to be used (or null
if the uniform preference vector should be used).
- preference - Variable in class it.unimi.dsi.law.rank.PageRank
-
The preference vector to be used (or null
if the uniform preference vector should be used).
- preference - Variable in class it.unimi.dsi.law.rank.PowerSeries
-
The preference vector to be used (or null
if the uniform preference vector should be used).
- prevCharIsCR - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- prevCharIsLF - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- previousRank - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
The rank vector after the last iteration (only meaningful after at least one step).
- previousRank - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
The rank vector at the end of the last round.
- previousRank - Variable in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
The rank vector after the last iteration (only meaningful after at least one step).
- previousRank - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
The rank vector after the last iteration (only meaningful after at least one step).
- previousRank - Variable in class it.unimi.dsi.law.rank.PowerSeries
-
The approximation obtained after the last iteration (only meaningful after at least one step).
- progressLogger - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
A progress logger.
- pseudoRank - Variable in class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
If true, an everywhere zero dangling-node distribution will be simulated, resulting in the computation of a pseudorank.
- pseudoRank - Variable in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
If true, an everywhere zero dangling-node distribution will be simulated, resulting in the computation of a pseudorank.
- put(String, String) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse.HeaderMap
-
- random(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- random - Variable in class it.unimi.dsi.law.warc.util.AbstractHttpResponse
-
A high-quality pseudorandom generator to generate UUIDs.
- rank(ImmutableGraph, double[]) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- rank - Variable in class it.unimi.dsi.law.rank.SpectralRanking
-
The current rank vector.
- read() - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
- read(byte[], int, int) - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
- read(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
- read(byte[], int, int) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- read(byte[]) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- read() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- read() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
-
- read(byte[], int, int) - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
-
- read(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
-
A method to read a record from an InputStream
.
- readANVLHeaders(MeasurableInputStream, Map<String, String>, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Parses headers from the given stream.
- readBytes() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
The number of bytes read so far from the underlying stream.
- readChar() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Read a character.
- readHeaderLine(InputStream, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Return byte array from an (unchunked) input stream.
- readStatusLine(MeasurableInputStream, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
-
- recordId - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
The warc record id
.
- recordType - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
The warc record-type
.
- refArray - Variable in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
The reference array.
- ReInit(InputStream) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Reinitialise.
- ReInit(InputStream, String) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Reinitialise.
- ReInit(Reader) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Reinitialise.
- ReInit(FilterParserTokenManager) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Reinitialise.
- ReInit(SimpleCharStream) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Reinitialise parser.
- ReInit(SimpleCharStream, int) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Reinitialise parser.
- ReInit(Reader, int, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(Reader, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(Reader) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(InputStream, String, int, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(InputStream, int, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(InputStream, String) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(InputStream) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(InputStream, String, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- ReInit(InputStream, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Reinitialise.
- RemappedStringMap - Class in it.unimi.dsi.law.warc.util
-
- RemappedStringMap(StringMap<? extends CharSequence>, int[]) - Constructor for class it.unimi.dsi.law.warc.util.RemappedStringMap
-
Creates a new remapped minimal perfect hash.
- remove(T) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Removes a bucket.
- remove() - Method in class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
-
- remove() - Method in class it.unimi.dsi.law.warc.io.WarcFilteredIterator
-
- RemoveHubs - Class in it.unimi.dsi.law.graph
-
Removes nodes from a graph following a number of strategies.
- RemoveHubs() - Constructor for class it.unimi.dsi.law.graph.RemoveHubs
-
- replicae - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Maps points in the unit interval to buckets.
- REPLICAE_PER_BUCKET - Static variable in class it.unimi.dsi.law.util.ConsistentHashFunction
-
Each bucket is replicated this number of times.
- reset() - Method in class it.unimi.dsi.law.util.KahanSummation
-
Resets the current value and correction to zero.
- resetRead() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
- resetRead() - Method in class it.unimi.dsi.law.warc.io.WarcRecord
-
A method to allow the reuse of the present object for non consecutive reads.
- residual - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
- response(HttpResponse) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
-
Sets the response whose headers will be wrapped by this map.
- Response - Interface in it.unimi.dsi.law.warc.util
-
Provides high level access to WARC records with record-type
equal to
response
.
- rewind() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Rewinds this stream.
- root - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
The node where the preference vector is concentrated.
- run(FastBufferedInputStream, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.CompressWarc
-
This method reads from a given input stream a sequence of uncompressed
WARC records and writes to a given output stream a compressed version of
them.
- run(FastBufferedInputStream, RandomAccessFile, boolean, boolean, long[], int, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.CutWarc
-
- run(FastBufferedInputStream, boolean, Filter<HttpResponse>, PrintWriter) - Static method in class it.unimi.dsi.law.warc.tool.ExtractDigestUrls
-
- run(FastBufferedInputStream, boolean, Filter<HttpResponse>, PrintWriter, StringMap<? extends CharSequence>, StringMap<? extends CharSequence>) - Static method in class it.unimi.dsi.law.warc.tool.ExtractLinks
-
Extracts links from a WARC file.
- run(FastBufferedInputStream, boolean, Filter<WarcRecord>, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.GrepWarc
-
This method acts as a sort of "grep" for WARC files.
- run(FastBufferedInputStream, SummaryStats, SummaryStats, SummaryStats) - Static method in class it.unimi.dsi.law.warc.tool.GZWarcStats
-
- run(FastBufferedInputStream, boolean, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.IndexWarc
-
This method reads from a given input stream a sequence of WARC records and writes to a given output stream
the byte offset of the read records.
- run(FastBufferedInputStream, PrintWriter) - Static method in class it.unimi.dsi.law.warc.tool.ListGZWarcComments
-
Writes on the given writer the GZip header comment filed.
- run(String, String, StringMap<? extends CharSequence>, String, int) - Static method in class it.unimi.dsi.law.warc.util.RemappedStringMap
-
- salsa - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
Compute the SALSA score (only for historical and testing reasons: please use the Salsa
class instead).
- scale - Variable in class it.unimi.dsi.law.rank.PowerSeries
-
The overall scaling that has been applied to the current approximation.
- scale(double) - Method in class it.unimi.dsi.law.vector.DenseVector
-
- scale(double) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
- scale(double) - Method in class it.unimi.dsi.law.vector.Vector
-
Scale values in this vector by a value alpha.
- schemeAndAuthority(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Returns the concatenated URI.getScheme() and raw authority
of a BUbiNG URL.
- schemeAndAuthority(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Extracts the scheme and authority of an absolute BUbiNG URL in its byte-array representation.
- SchemeEquals - Class in it.unimi.dsi.law.warc.filters
-
Accepts only URIs whose scheme equals a certain string (typically, http
).
- SchemeEquals(String) - Constructor for class it.unimi.dsi.law.warc.filters.SchemeEquals
-
Creates a filter that only accepts URIs with a given scheme.
- seen2Node - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
A map from seen-order to nodes.
- SequentialHttpResponseRead - Class in it.unimi.dsi.law.warc.io.examples
-
- SequentialHttpResponseRead() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialHttpResponseRead
-
- SequentialHttpResponseWrite - Class in it.unimi.dsi.law.warc.io.examples
-
- SequentialHttpResponseWrite() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialHttpResponseWrite
-
- SequentialWarcRecordRead - Class in it.unimi.dsi.law.warc.io.examples
-
- SequentialWarcRecordRead() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialWarcRecordRead
-
- SequentialWarcRecordWrite - Class in it.unimi.dsi.law.warc.io.examples
-
- SequentialWarcRecordWrite() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialWarcRecordWrite
-
- set(int, double) - Method in class it.unimi.dsi.law.vector.DenseVector
-
- set(int, double) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
- set(int, double) - Method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
-
- set(int, double) - Method in class it.unimi.dsi.law.vector.Vector
-
Sets the value val at index idx.
- set(URI, HttpResponse) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
Sets the response wrapped by this instance.
- setDebugStream(PrintStream) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Set debug output.
- setInput(InputStream, long, CRC32) - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
Resets the bounded stream fields, for reusing it.
- setInput(InputStream, long) - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
-
Resets the bounded stream fields, for reusing it.
- SetLinkReceiver() - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
-
- setTabSize(int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- shift - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
A shift.
- shift - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
A shift.
- shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.PageRankPush.EmptyQueueStoppingCritertion
-
- shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.PageRankPush.L1NormStoppingCritertion
-
- shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.SpectralRanking.IterationNumberStoppingCriterion
-
- shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.SpectralRanking.NormStoppingCriterion
-
- shouldStop(SpectralRanking) - Method in interface it.unimi.dsi.law.rank.SpectralRanking.StoppingCriterion
-
Determines if the computation should be stopped.
- similarity(Vector, Vector) - Method in class it.unimi.dsi.law.vector.CosineSimilarityStrategy
-
- similarity(Vector, Vector) - Method in class it.unimi.dsi.law.vector.EuclideanSimilarityStrategy
-
- similarity(Vector, Vector) - Method in interface it.unimi.dsi.law.vector.SimilarityStrategy
-
Returns the similarity value between two vectors.
- SimilarityStrategy - Interface in it.unimi.dsi.law.vector
-
An interface specifying methods used to obtain pattern similarities.
- SimpleCharStream - Class in it.unimi.dsi.law.warc.filters.parser
-
An implementation of interface CharStream, where the stream is assumed to
contain only ASCII characters (without unicode processing).
- SimpleCharStream(Reader, int, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(Reader, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(Reader) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(InputStream, String, int, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(InputStream, int, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(InputStream, String, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(InputStream, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(InputStream, String) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- SimpleCharStream(InputStream) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Constructor.
- size() - Method in class it.unimi.dsi.law.stat.WeightedTau.AbstractWeigher
-
- size - Variable in class it.unimi.dsi.law.vector.Vector
-
The vector size (immutable).
- size() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
-
- size() - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
-
- size64() - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
-
- sizes - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
-
For each bucket, its size.
- skip(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
- skip(long) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
- skip(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
-
A method to skip a record from an InputStream
.
- skipStrategy - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
-
The optional strategy to skip buckets, or null
.
- specialToken - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
-
This field is used to access special tokens that occur prior to this
token, but after the immediately preceding regular (non-special) token.
- SpectralRanking - Class in it.unimi.dsi.law.rank
-
- SpectralRanking(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.SpectralRanking
-
Creates a new instance.
- SpectralRanking.IterationNumberStoppingCriterion - Class in it.unimi.dsi.law.rank
-
A stopping criterion that stops whenever the number of iterations exceeds a given bound.
- SpectralRanking.NormStoppingCriterion - Class in it.unimi.dsi.law.rank
-
- SpectralRanking.StoppingCriterion - Interface in it.unimi.dsi.law.rank
-
A a strategy that decides when a computation should be stopped.
- standardFilters() - Static method in class it.unimi.dsi.law.warc.filters.Filters
-
Returns a list of the standard filter classes.
- start() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Parser.
- startDocument() - Method in class it.unimi.dsi.law.warc.parser.Digester
-
- startElement(Element, Map<Attribute, MutableString>) - Method in class it.unimi.dsi.law.warc.parser.Digester
-
- STATIC_LEXER_ERROR - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
An attempt was made to create a second instance of a static token manager.
- staticFlag - Static variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
Whether parser is static.
- status() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- status() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
-
Returns the response status.
- status() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
- StatusCategory - Class in it.unimi.dsi.law.warc.filters
-
Accepts only fetched response whose status category (status/100) has a certain value.
- StatusCategory(int) - Constructor for class it.unimi.dsi.law.warc.filters.StatusCategory
-
Creates a filter that only accepts responses of the given category.
- statusLine() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- statusLine() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
-
Returns the response status line.
- statusLine - Variable in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
The status line of this response.
- statusLine() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
- statusLine(StatusLine) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
Sets the status line
- step() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
- step() - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
- step() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
- step() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
- step() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
- step() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
- step() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
- step() - Method in class it.unimi.dsi.law.rank.PageRankPush
-
- step() - Method in class it.unimi.dsi.law.rank.PowerSeries
-
- step() - Method in class it.unimi.dsi.law.rank.SpectralRanking
-
Performs one computation step.
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PowerSeries
-
- stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.SpectralRanking
-
- STOCHASTIC_TOLERANCE - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
-
- store(ImmutableGraph, ImmutableGraph, double[], int[], String, String) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- stronglyPreferential - Variable in class it.unimi.dsi.law.rank.PageRank
-
Decides whether we use the strongly or weakly (the default) preferential algorithm.
- subjectUri - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
The warc subject-uri
.
- subSequence(int, int) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
- subset - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
-
If not null
, the subset of nodes over which the derivatives should be computed.
- SwitchTo(int) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
-
Switch to specified lex state.
- symPageRank(ImmutableGraph, ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- tabSize - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- Text2DataOutput - Class in it.unimi.dsi.law.io.tool
-
The main method of this class converts converts a text file containing numbers to binary DataOutput
format.
- threshold - Variable in class it.unimi.dsi.law.rank.PageRankPush
-
The threshold for stopping.
- THRESHOLD - Static variable in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
-
The number of zeroes that must appear to cause the page to be considered probably
binary.
- toBitStream(OutputBitStream, int) - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
- toByteArray(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Returns an ASCII byte-array representation of a BUbiNG URL.
- toHexString(byte[]) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Returns a mutable string representing in hexadecimal a digest.
- toImmutableSparseVector() - Method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
-
- token - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Current token.
- Token - Class in it.unimi.dsi.law.warc.filters.parser
-
Describes the input token stream.
- Token() - Constructor for class it.unimi.dsi.law.warc.filters.parser.Token
-
No-argument constructor
- Token(int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.Token
-
Constructs a new token for the specified Image.
- Token(int, String) - Constructor for class it.unimi.dsi.law.warc.filters.parser.Token
-
Constructs a new token for the specified Image and Kind.
- token_source - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParser
-
Generated Token Manager.
- tokenImage - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
Literal token values.
- tokenImage - Variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
-
This is a reference to the "tokenImage" array of the generated
parser within which the parse error occurred.
- TokenMgrError - Error in it.unimi.dsi.law.warc.filters.parser
-
Token Manager Error.
- TokenMgrError() - Constructor for error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
No arg constructor.
- TokenMgrError(String, int) - Constructor for error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
Constructor with message and reason.
- TokenMgrError(boolean, int, int, int, String, int, int) - Constructor for error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
-
Full Constructor.
- toSpec() - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
-
- toString(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
-
Deprecated.
Returns an ASCII byte-array representation of a BUbiNG URL.
- toString() - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
-
- toString(Object...) - Method in class it.unimi.dsi.law.warc.filters.AbstractFilter
-
A helper method that generates a string version of this filter (mainly
useful for atomic, i.e., class-based, filters).
- toString() - Method in class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.DigestEquals
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.HostEndsWith
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.HostEquals
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.IsHttpResponse
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.parser.Token
-
Returns the image.
- toString() - Method in class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.SchemeEquals
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.StatusCategory
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.URLEquals
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.URLMatchesRegex
-
- toString() - Method in class it.unimi.dsi.law.warc.filters.URLShorterThan
-
- toString() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
- toString() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
- toString() - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
-
- toString() - Method in class it.unimi.dsi.law.warc.io.WarcRecord
-
- toString() - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
- toWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.AbstractHttpResponse
-
Populates a WARC record with contents from this response.
- trackLineColumn - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- TRUE - Static variable in class it.unimi.dsi.law.warc.filters.Filters
-
The constantly true filter.
- TRUE - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
RegularExpression Id.
- truncate(double, int) - Static method in class it.unimi.dsi.law.util.Precision
-
Truncates the given double value to the given number of fractional binary digits.
- truncate(double[], int) - Static method in class it.unimi.dsi.law.util.Precision
-
- truncate(long) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
-
Truncates the overflow file to a given size.
- uncompressedSkipLength - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
-
The uncompressed-skip-length
warc-required extra gzip field.
- UpdateLineColumn(char) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
-
- upHeap(double[], int[], int[], int) - Static method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
-
- uri() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- uri - Variable in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
The URI that is currently contained in this response.
- uri() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
- uri(URI) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
-
Sets the url.
- uri() - Method in interface it.unimi.dsi.law.warc.util.Response
-
Returns the URI associated with this response.
- url(ImmutableGraph, FastBufferedReader) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
-
- url(URI) - Method in class it.unimi.dsi.law.warc.parser.Digester
-
Sets the URI that will be used to tune the next digest.
- url - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
-
- URLEquals - Class in it.unimi.dsi.law.warc.filters
-
Accepts only a given URIs.
- URLEquals(String) - Constructor for class it.unimi.dsi.law.warc.filters.URLEquals
-
Creates a filter that only accepts URIs equal to a given URI.
- URLMatchesRegex - Class in it.unimi.dsi.law.warc.filters
-
Accepts only URIs that match a certain regular expression.
- URLMatchesRegex(String) - Constructor for class it.unimi.dsi.law.warc.filters.URLMatchesRegex
-
Creates a filter that only accepts URLs matching a given regular expression.
- URLShorterThan - Class in it.unimi.dsi.law.warc.filters
-
Accepts only URIs whose overall length is below a given threshold.
- URLShorterThan(int) - Constructor for class it.unimi.dsi.law.warc.filters.URLShorterThan
-
Creates a filter that only accepts URLs shorter than the given threshold.
- USE_POSITION_INSTEAD_OF_SKIP - Static variable in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
Tells what method to use to skip bytes in the input stream.
- USE_POSITION_INSTEAD_OF_SKIP - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
Tells what method to use to skip bytes in the input stream.
- Util - Class in it.unimi.dsi.law
-
A static container of utility methods for all LAW software.
- Util() - Constructor for class it.unimi.dsi.law.Util
-
- Util - Class in it.unimi.dsi.law.warc.util
-
Static utility methods.
- UUID_FIELD_NAME - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
Some constant strings in their byte equivalent.
- value() - Method in class it.unimi.dsi.law.util.KahanSummation
-
Returns the sum computed so far.
- value - Variable in class it.unimi.dsi.law.vector.ImmutableSparseVector
-
An arrays containing vector values.
- valueOf(String) - Static method in enum it.unimi.dsi.law.io.tool.DataInput2Text.Type
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum it.unimi.dsi.law.util.Norm
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.DigestEquals
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.HostEndsWith
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.HostEquals
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.IsHttpResponse
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.SchemeEquals
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.StatusCategory
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.URLEquals
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.URLMatchesRegex
-
- valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.URLShorterThan
-
- valueOf(String) - Static method in enum it.unimi.dsi.law.warc.io.InspectableBufferedInputStream.State
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.ContentType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.RecordType
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum it.unimi.dsi.law.io.tool.DataInput2Text.Type
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum it.unimi.dsi.law.util.Norm
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum it.unimi.dsi.law.warc.io.InspectableBufferedInputStream.State
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.ContentType
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- values() - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.RecordType
-
Returns an array containing the constants of this enum type, in
the order they are declared.
- Vector - Class in it.unimi.dsi.law.vector
-
A class representing a vector of double
.
- Vector(int, boolean, int) - Constructor for class it.unimi.dsi.law.vector.Vector
-
Build a vector of given size and set the mutability status of this vector.
- WARC_ID - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
-
Some constant strings in their byte equivalent.
- WarcFilteredIterator - Class in it.unimi.dsi.law.warc.io
-
A class to iterate over WARC files getting only records that satisfy a given filter.
- WarcFilteredIterator(FastBufferedInputStream, WarcRecord, Filter<WarcRecord>, ProgressLogger) - Constructor for class it.unimi.dsi.law.warc.io.WarcFilteredIterator
-
Builds the filtered iterator.
- WarcFilteredIterator(FastBufferedInputStream, WarcRecord, Filter<WarcRecord>) - Constructor for class it.unimi.dsi.law.warc.io.WarcFilteredIterator
-
Builds the filtered iterator.
- WarcHttpResponse - Class in it.unimi.dsi.law.warc.util
-
- WarcHttpResponse() - Constructor for class it.unimi.dsi.law.warc.util.WarcHttpResponse
-
- WarcRecord - Class in it.unimi.dsi.law.warc.io
-
A class to read/write WARC/0.9 records (for format details, please see the
WARC format specifications).
- WarcRecord(byte[]) - Constructor for class it.unimi.dsi.law.warc.io.WarcRecord
-
Builds a warc record.
- WarcRecord() - Constructor for class it.unimi.dsi.law.warc.io.WarcRecord
-
Builds a warc record.
- WarcRecord.ContentType - Enum in it.unimi.dsi.law.warc.io
-
Content types.
- WarcRecord.FormatException - Exception in it.unimi.dsi.law.warc.io
-
An exception to denote parsing errors during reads.
- WarcRecord.Header - Class in it.unimi.dsi.law.warc.io
-
A class to contain fields contained in the warc header
.
- WarcRecord.RecordType - Enum in it.unimi.dsi.law.warc.io
-
Record types.
- weigh() - Method in class it.unimi.dsi.law.util.ExchangeWeigher
-
Computes the weight of exchanges for the current data.
- WeightedTau - Class in it.unimi.dsi.law.stat
-
Computes the weighted τ between two score vectors.
- WeightedTau() - Constructor for class it.unimi.dsi.law.stat.WeightedTau
-
Create an additive hyperbolic τ.
- WeightedTau(Int2DoubleFunction) - Constructor for class it.unimi.dsi.law.stat.WeightedTau
-
Create an additive weighted τ using the specified weigher.
- WeightedTau(Int2DoubleFunction, boolean) - Constructor for class it.unimi.dsi.law.stat.WeightedTau
-
Create an additive or multiplicative weighted τ using the specified weigher and combination strategy.
- WeightedTau.AbstractWeigher - Class in it.unimi.dsi.law.stat
-
- WORD - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
-
RegularExpression Id.
- wrap(byte[], int, int) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
Wraps a byte-array fragment into this byte-array character sequence.
- wrap(byte[]) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
-
Wraps a byte array into this byte-array character sequence.
- write(OutputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
-
- write(OutputStream) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
-
A method to write this record to an OutputStream
.
- writeANVLHeaders(OutputStream, Map<String, String>, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
-
Writes a (name, value) map as an ANVL segment in a given stream.