A B C D E F G H I J K L M N O P Q R S T U V W Z 

A

AbstractFilter<T> - Class in it.unimi.dsi.law.warc.filters
An abstract implementation of a Filter providing a method that helps implement Object.toString() properly for atomic (i.e., class-based) filters.
AbstractFilter() - Constructor for class it.unimi.dsi.law.warc.filters.AbstractFilter
 
AbstractHttpResponse - Class in it.unimi.dsi.law.warc.util
An abstract implementation of HttpResponse providing an AbstractHttpResponse.toWarcRecord(WarcRecord) method that can be used to populate a WARC record (in order to write it).
AbstractHttpResponse() - Constructor for class it.unimi.dsi.law.warc.util.AbstractHttpResponse
 
AbstractWeigher() - Constructor for class it.unimi.dsi.law.stat.WeightedTau.AbstractWeigher
 
adaptFilterString2URI(Filter<String>) - Static method in class it.unimi.dsi.law.warc.filters.Filters
Adapts a filter with String base type to a filter with URI base type.
adaptFilterURI2HttpResponse(Filter<URI>) - Static method in class it.unimi.dsi.law.warc.filters.Filters
Adapts a filter with URI base type to a filter with HttpResponse base type.
adaptFilterURI2WarcRecord(Filter<URI>) - Static method in class it.unimi.dsi.law.warc.filters.Filters
Adapts a filter with URI base type to a filter with WarcRecord base type.
add(T, int) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
Adds a bucket to the map.
add(double) - Method in class it.unimi.dsi.law.util.KahanSummation
Adds a value.
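The idea behind compensated (Kahan) summation can be sketched as follows. This is a minimal, self-contained illustration and not the library class: the name KahanSketch and its methods value() and naiveSum() are hypothetical and chosen only for this example.

```java
/** Illustrative compensated (Kahan) accumulator; hypothetical, not the library class. */
public final class KahanSketch {
    private double sum;          // running total
    private double compensation; // rounding error carried over from previous additions

    /** Adds a value, feeding the previously lost low-order bits back in. */
    public void add(final double x) {
        final double y = x - compensation; // correct the incoming value
        final double t = sum + y;          // low-order bits of y may be lost here
        compensation = (t - sum) - y;      // recover what was lost (with sign)
        sum = t;
    }

    /** Returns the current compensated sum. */
    public double value() {
        return sum;
    }

    /** Plain left-to-right summation, for comparison only. */
    public static double naiveSum(final double[] a) {
        double s = 0;
        for (final double x : a) s += x;
        return s;
    }
}
```

Adding many values that are individually smaller than half an ulp of the running total leaves a plain sum unchanged, whereas the compensated accumulator keeps track of them.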
add(double, Vector) - Method in class it.unimi.dsi.law.vector.DenseVector
 
add(double, Vector) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
add(double, Vector) - Method in class it.unimi.dsi.law.vector.Vector
Adds values in vector v scaled by alpha to this vector.
addEscapes(String) - Static method in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
Replaces unprintable characters by their escaped (or Unicode-escaped) equivalents in the given string.
adjustBeginLineColumn(int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Method to adjust line and column numbers for the start of a token.
alpha - Variable in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
The attenuation factor.
alpha - Variable in class it.unimi.dsi.law.rank.PageRank
The damping factor.
alpha - Variable in class it.unimi.dsi.law.rank.PowerSeries
The attenuation factor.
and(SpectralRanking.StoppingCriterion, SpectralRanking.StoppingCriterion) - Static method in class it.unimi.dsi.law.rank.SpectralRanking
Composes two stopping criteria, producing a single stopping criterion (the computation stops iff both conditions become true; lazy boolean evaluation is applied).
and(Filter<T>...) - Static method in class it.unimi.dsi.law.warc.filters.Filters
Produces the conjunction of the given filters.
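A conjunction of filters can be illustrated with the standard java.util.function.Predicate interface. The sketch below is an assumption-laden stand-in: it does not use the library's Filter type, and the class name FilterConjunctionSketch is hypothetical.

```java
import java.util.function.Predicate;

/** Illustrative conjunction of predicates; hypothetical, not the library's Filters class. */
public final class FilterConjunctionSketch {
    private FilterConjunctionSketch() {}

    /**
     * Returns a predicate that accepts x only if every given predicate accepts x.
     * Evaluation is lazy: it stops at the first predicate that rejects x.
     * With no predicates, the result accepts everything.
     */
    @SafeVarargs
    public static <T> Predicate<T> and(final Predicate<T>... filters) {
        return x -> {
            for (final Predicate<T> f : filters) {
                if (!f.test(x)) return false;
            }
            return true;
        };
    }
}
```

For example, combining a prefix check with a length check yields a single predicate that accepts only strings passing both.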
and() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
 
AND - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
anvlFields - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
The warc anvl-fields.
apply(HttpResponse) - Method in class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
 
apply(WarcRecord) - Method in class it.unimi.dsi.law.warc.filters.DigestEquals
 
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
 
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.HostEndsWith
 
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.HostEquals
 
apply(WarcRecord) - Method in class it.unimi.dsi.law.warc.filters.IsHttpResponse
 
apply(HttpResponse) - Method in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
This method implements a simple heuristic for guessing whether a page is binary.
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
 
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.SchemeEquals
 
apply(HttpResponse) - Method in class it.unimi.dsi.law.warc.filters.StatusCategory
 
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.URLEquals
 
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.URLMatchesRegex
 
apply(URI) - Method in class it.unimi.dsi.law.warc.filters.URLShorterThan
 
apply(Response) - Method in class it.unimi.dsi.law.warc.parser.BinaryParser
 
apply(Response) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
 
approximateNormVector(DoubleIterator) - Method in class it.unimi.dsi.law.rank.SpectralRanking
Returns a compact logarithmic approximation of a norm vector.
ARGS - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
ASSERTS - Static variable in class it.unimi.dsi.law.warc.io.GZWarcRecord
 
ASSERTS - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
 
atom() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
 
available() - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
 
available() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
AveragePrecisionCorrelation - Class in it.unimi.dsi.law.stat
Computes the AP (average-precision) correlation between two score vectors without ties.

B

backToRoot - Variable in class it.unimi.dsi.law.rank.PageRankPush
The amount of ranking going back to the root.
backup(int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Backup a number of characters.
BAD_CHAR - Static variable in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
A list of bad characters.
BAD_CHAR_SUBSTITUTE - Static variable in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Substitutes for bad characters.
beginColumn - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
The column number of the first character of this Token.
beginLine - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
The line number of the first character of this Token.
BeginToken() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Start.
BFS - Class in it.unimi.dsi.law.graph
Computes the visit order with respect to a breadth-first visit.
BFS() - Constructor for class it.unimi.dsi.law.graph.BFS
 
bfsperm(ImmutableGraph, int, int[]) - Static method in class it.unimi.dsi.law.graph.BFS
Returns the permutation induced by the visit order of a breadth-first visit.
BINARY_CHECK_SCAN_LENGTH - Static variable in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
 
BinaryParser - Class in it.unimi.dsi.law.warc.parser
A universal binary parser that just computes digests.
BinaryParser(MessageDigest) - Constructor for class it.unimi.dsi.law.warc.parser.BinaryParser
Builds a parser for digesting a page.
BinaryParser(String) - Constructor for class it.unimi.dsi.law.warc.parser.BinaryParser
Builds a parser for digesting a page.
block - Variable in class it.unimi.dsi.law.warc.io.WarcRecord
The warc block.
BoundedCountingInputStream - Class in it.unimi.dsi.law.warc.io
A class that decorates an InputStream to obtain a MeasurableInputStream.
BoundedCountingInputStream() - Constructor for class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
Builds the bounded stream.
BoundedCountingInputStream(InputStream, long, CRC32) - Constructor for class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
Builds the bounded stream.
BoundedCountingInputStream(InputStream, long) - Constructor for class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
Builds the bounded stream.
buckets - Variable in class it.unimi.dsi.law.rank.PageRank
If not null, the set of buckets of SpectralRanking.graph.
buckets - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
The cached key set of ConsistentHashFunction.sizes.
buckets() - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
Returns the set of buckets of this consistent hash function.
bufcolumn - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
buffer - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
buffer - Variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
The buffer.
buffer - Variable in class it.unimi.dsi.law.warc.parser.HTMLParser
The character buffer.
bufline - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
bufpos - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Position in buffer.
buildProperties(String) - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
Returns a Properties object that contains all the parameters used by the computation.
buildProperties(String, String) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
Returns a Properties object that contains all the parameters used by the computation.
buildProperties(String) - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
Returns a Properties object that contains all the parameters used by the computation.
buildProperties(String, String, String) - Method in class it.unimi.dsi.law.rank.PageRank
Returns a Properties object that contains all the parameters used by the computation.
buildProperties(String, String) - Method in class it.unimi.dsi.law.rank.PowerSeries
Returns a Properties object that contains all the parameters used by the computation.
buildProperties(String) - Method in class it.unimi.dsi.law.rank.SpectralRanking
Returns a Properties object that contains all parameters used by the computation.
BURL - Class in it.unimi.dsi.law.bubing.util
Deprecated.
BYTE_REPRESENTATION_TO_CONTENT_TYPE - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
 
BYTE_REPRESENTATION_TO_RECORD_TYPE - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
 
ByteArrayCharSequence - Class in it.unimi.dsi.law.warc.util
An adapter exposing a byte array as an ISO-8859-1-encoded character sequence.
ByteArrayCharSequence(byte[], int, int) - Constructor for class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
Creates a new byte-array character sequence using the provided byte-array fragment.
ByteArrayCharSequence(byte[]) - Constructor for class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
Creates a new byte-array character sequence using the provided byte array.
ByteArrayCharSequence() - Constructor for class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
Creates a new empty byte-array character sequence.
byteRepresentation - Variable in enum it.unimi.dsi.law.warc.io.WarcRecord.ContentType
 
byteRepresentation - Variable in enum it.unimi.dsi.law.warc.io.WarcRecord.RecordType
 

C

cachedContent - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
A cache for the content of the entity returned by HttpComponentsHttpResponse.httpResponse.
CASE_INSENSITIVE_STRING_HASH_STRATEGY - Static variable in class it.unimi.dsi.law.warc.util.Util
The strategy used to decide whether two header names are the same: we require that they are equal up to case.
cdata(Element, char[], int, int) - Method in class it.unimi.dsi.law.warc.parser.Digester
 
changed() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
changed(int) - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
CHAR_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.parser.HTMLParser
The size of the internal Jericho buffer.
characters(char[], int, int, boolean) - Method in class it.unimi.dsi.law.warc.parser.Digester
 
charAt(int) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
 
checkCRC(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
 
clear() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
 
clear() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
 
clear() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
 
clear() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
 
clear() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
 
clear() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
 
clear() - Method in class it.unimi.dsi.law.rank.PageRankPush
 
clear() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
clear() - Method in class it.unimi.dsi.law.rank.PowerSeries
 
clear() - Method in class it.unimi.dsi.law.rank.SpectralRanking
Clears all data and releases resources by nulling SpectralRanking.rank (i.e., results will no longer be available).
clear() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
Clears this HttpComponentsHttpResponse, in particular trimming the content cache.
clone() - Method in class it.unimi.dsi.law.warc.parser.BinaryParser
 
clone() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
 
close() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Makes this stream ready.
close() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
 
CLOSEPAREN - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
coeffBasename - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
If not null, the basename for coefficients.
column - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
comment - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
The (optional) comment gzip field.
CompressedIntLabel - Class in it.unimi.dsi.law.webgraph
An integer label that uses a coder/decoder pair depending on the source node.
CompressedIntLabel(String, int, String, LongBigList, Int2ObjectMap<Coder>, Int2ObjectMap<Decoder>) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
 
CompressedIntLabel(String, int, LongBigList, Int2ObjectMap<Decoder>, Int2ObjectMap<Coder>) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
Creates a compressed integer label.
CompressedIntLabel(Object, String, String, String, String, String, String) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
Creates a compressed integer label from a specification that includes decoders and coders.
CompressedIntLabel(Object, String, String, String, String, String) - Constructor for class it.unimi.dsi.law.webgraph.CompressedIntLabel
Creates a compressed integer label from a specification that includes just decoders.
compressedSkipLength - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
The compressed-skip-length warc-required extra gzip field.
CompressWarc - Class in it.unimi.dsi.law.warc.tool
A tool to compress a WARC file.
CompressWarc() - Constructor for class it.unimi.dsi.law.warc.tool.CompressWarc
 
compute(String, int, String, double[], int[]) - Static method in class it.unimi.dsi.law.rank.PageRankFromCoefficients
Computes PageRank and its derivatives for given damping factor values.
compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.AveragePrecisionCorrelation
Computes AP correlation between two score vectors.
compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors.
compute(CharSequence, Class<?>, CharSequence, Class<?>, boolean, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors with a given precision.
compute(CharSequence, Class<?>, CharSequence, Class<?>, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors.
compute(CharSequence, Class<?>, CharSequence, Class<?>, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors with a given precision.
compute(CharSequence, CharSequence, Class<?>) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors.
compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.KendallTau
Computes Kendall's τ between two score vectors.
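For orientation, Kendall's τ can be computed directly from its definition by counting concordant and discordant pairs. The sketch below is a quadratic-time illustration for score vectors without ties (the τ-a variant); the class name KendallTauSketch is hypothetical, and the library class may treat ties differently and use a faster algorithm.

```java
/** Illustrative quadratic-time Kendall's tau (tau-a); hypothetical, not the library class. */
public final class KendallTauSketch {
    private KendallTauSketch() {}

    /**
     * Returns (concordant - discordant) / (n choose 2).
     * Pairs tied in either vector count as neither concordant nor discordant.
     */
    public static double tau(final double[] x, final double[] y) {
        if (x.length != y.length) throw new IllegalArgumentException("Vectors differ in length");
        final int n = x.length;
        if (n < 2) throw new IllegalArgumentException("At least two scores are required");
        long concordant = 0, discordant = 0;
        for (int i = 0; i < n; i++) {
            for (int j = i + 1; j < n; j++) {
                // The sign of the product tells whether the pair is ordered the same way in both vectors.
                final double s = Math.signum(x[i] - x[j]) * Math.signum(y[i] - y[j]);
                if (s > 0) concordant++;
                else if (s < 0) discordant++;
            }
        }
        final long pairs = (long) n * (n - 1) / 2;
        return (double) (concordant - discordant) / pairs;
    }
}
```

Identical rankings give 1, reversed rankings give -1, and swapping a single adjacent pair in three elements gives 1/3.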
compute(double[], double[]) - Method in class it.unimi.dsi.law.stat.WeightedTau
Computes the symmetrized weighted τ between two score vectors.
compute(double[], double[], int[]) - Method in class it.unimi.dsi.law.stat.WeightedTau
Computes the weighted τ between two score vectors, given a reference rank.
compute(byte[]) - Static method in class it.unimi.dsi.law.util.CRC64
Computes the 64 bit CRC of a byte array.
compute(char[], int, int) - Static method in class it.unimi.dsi.law.util.CRC64
Computes the 64 bit CRC of a character array fragment.
compute(String) - Static method in class it.unimi.dsi.law.util.CRC64
Computes the 64 bit CRC of a string, using the ISO8859-1 representation of its Unicode characters.
compute(MutableString) - Static method in class it.unimi.dsi.law.util.CRC64
Computes the 64 bit CRC of a mutable string, using the ISO8859-1 representation of its Unicode characters.
compute(double[]) - Method in enum it.unimi.dsi.law.util.Norm
Computes the norm of a vector.
compute(double[], double[]) - Method in enum it.unimi.dsi.law.util.Norm
Computes the norm of the difference of two vectors.
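As a reminder of what these two methods compute, here is a minimal sketch of the usual ℓ1, ℓ2 and ℓ∞ norms of a vector, plus the norm of a difference. The class NormSketch and its method names are hypothetical and do not reproduce the library enum.

```java
/** Illustrative vector norms; hypothetical, not the library's Norm enum. */
public final class NormSketch {
    private NormSketch() {}

    /** Sum of absolute values. */
    public static double l1(final double[] v) {
        double s = 0;
        for (final double x : v) s += Math.abs(x);
        return s;
    }

    /** Square root of the sum of squares. */
    public static double l2(final double[] v) {
        double s = 0;
        for (final double x : v) s += x * x;
        return Math.sqrt(s);
    }

    /** Largest absolute value (0 for an empty vector). */
    public static double lInfinity(final double[] v) {
        double m = 0;
        for (final double x : v) m = Math.max(m, Math.abs(x));
        return m;
    }

    /** Componentwise difference v - w, so that a norm of the difference is l1(difference(v, w)), etc. */
    public static double[] difference(final double[] v, final double[] w) {
        if (v.length != w.length) throw new IllegalArgumentException("Vectors differ in length");
        final double[] d = new double[v.length];
        for (int i = 0; i < d.length; i++) d[i] = v[i] - w[i];
        return d;
    }
}
```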
compute(double[]) - Static method in class it.unimi.dsi.law.util.NormL1
Deprecated.
 
compute(double[], double[]) - Static method in class it.unimi.dsi.law.util.NormL1
Deprecated.
 
compute(double[]) - Static method in class it.unimi.dsi.law.util.NormL2
Deprecated.
 
compute(double[], double[]) - Static method in class it.unimi.dsi.law.util.NormL2
Deprecated.
 
computeDoubles(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors.
computeDoubles(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors.
computeDoubles(CharSequence, CharSequence, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors with a given precision.
computeDoubles(CharSequence, CharSequence, boolean, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors with a given precision.
computeFloats(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors.
computeFloats(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors.
computeFloats(CharSequence, CharSequence, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors with a given precision.
computeFloats(CharSequence, CharSequence, boolean, int) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors with a given precision.
computeInts(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors.
computeInts(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors.
computeLabels(double) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
Computes the labels of a graph for a given value of γ using the default maximum number of updates.
computeLabels(double, int) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
Computes the labels of a graph for a given value of γ.
computeLongs(CharSequence, CharSequence) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two score vectors.
computeLongs(CharSequence, CharSequence, boolean) - Method in class it.unimi.dsi.law.stat.CorrelationIndex
Computes the correlation between two (possibly reversed) score vectors.
computePermutation(double[], String) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
Computes the final permutation of the graph using the default maximum number of updates.
computePermutation(double[], String, int) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
Computes the final permutation of the graph.
configure(BulletParser) - Method in class it.unimi.dsi.law.warc.parser.Digester
 
connect(InputStream) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Connects to a given input stream, and fills the buffer accordingly.
ConsistentHashFunction<T extends Comparable<? super T>> - Class in it.unimi.dsi.law.util
Provides an implementation of consistent hashing.
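The general technique of consistent hashing can be sketched with a sorted ring of replica points: each bucket is placed at several pseudo-random positions, and a key is assigned to the bucket owning the first position at or after the key's own position. The code below is a generic illustration under stated assumptions, not the library's implementation: the class name ConsistentHashSketch, the fixed number of replicas per bucket, the mixing function and the method signatures are all hypothetical.

```java
import java.util.Map;
import java.util.TreeMap;

/** Illustrative ring-based consistent hashing; hypothetical, not the library class. */
public final class ConsistentHashSketch<T> {
    /** Replica positions on the ring, mapped to the bucket that owns them. */
    private final TreeMap<Long, T> ring = new TreeMap<>();
    private final int replicasPerBucket;

    public ConsistentHashSketch(final int replicasPerBucket) {
        if (replicasPerBucket <= 0) throw new IllegalArgumentException("Need at least one replica");
        this.replicasPerBucket = replicasPerBucket;
    }

    /** A 64-bit mixing function (SplitMix64-style finalizer), used to spread positions. */
    private static long mix(long z) {
        z = (z ^ (z >>> 30)) * 0xBF58476D1CE4E5B9L;
        z = (z ^ (z >>> 27)) * 0x94D049BB133111EBL;
        return z ^ (z >>> 31);
    }

    /** Places the replicas of a bucket on the ring. */
    public void add(final T bucket) {
        for (int r = 0; r < replicasPerBucket; r++) {
            // Combine the bucket's hash code and the replica index into one 64-bit seed.
            ring.put(mix(((long) bucket.hashCode() << 32) | r), bucket);
        }
    }

    /** Returns the bucket owning the first replica at or after the key's position, wrapping around. */
    public T hash(final long key) {
        if (ring.isEmpty()) throw new IllegalStateException("No buckets");
        final Map.Entry<Long, T> e = ring.ceilingEntry(mix(key));
        return (e != null ? e : ring.firstEntry()).getValue();
    }
}
```

The property that makes the scheme useful is that adding a bucket only moves keys to the new bucket; keys never migrate between the buckets that were already present.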
ConsistentHashFunction() - Constructor for class it.unimi.dsi.law.util.ConsistentHashFunction
Creates a new consistent hash function.
ConsistentHashFunction(ConsistentHashFunction.SkipStrategy<T>) - Constructor for class it.unimi.dsi.law.util.ConsistentHashFunction
Creates a new consistent hash function with given skip strategy.
ConsistentHashFunction.SkipStrategy<T> - Interface in it.unimi.dsi.law.util
Allows skipping suitable items when searching for the closest replica.
consume() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
Invokes EntityUtils.consume(HttpEntity) on the entity returned by the underlying Apache HTTP Components HttpResponse.
consume(InputStream, long) - Static method in class it.unimi.dsi.law.warc.util.Util
Consumes a given number of bytes from a stream.
consume(InputStream) - Static method in class it.unimi.dsi.law.warc.util.Util
Consumes all the bytes of a stream.
contains(int) - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
containsKey(int) - Method in class it.unimi.dsi.law.stat.WeightedTau.AbstractWeigher
 
containsKey(Object) - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
 
contentAsStream() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
 
contentAsStream() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
Returns the content of this response as a stream.
contentAsStream(MeasurableInputStream) - Method in class it.unimi.dsi.law.warc.util.MutableHttpResponse
Sets the content.
contentAsStream() - Method in class it.unimi.dsi.law.warc.util.MutableHttpResponse
 
contentAsStream() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
 
contentReady - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
Whether the content of the entity returned by HttpComponentsHttpResponse.httpResponse has been cached in HttpComponentsHttpResponse.cachedContent.
contentType - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
The warc content-type.
ContentTypeStartsWith - Class in it.unimi.dsi.law.warc.filters
Accepts only fetched responses whose content type starts with a given string.
ContentTypeStartsWith(String) - Constructor for class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
 
copy(WarcRecord) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
Copies this WARC record's fields from another WARC record.
copy(WarcRecord.Header) - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
Copies this header's fields from another header.
copy() - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
 
CorrelationIndex - Class in it.unimi.dsi.law.stat
An abstract class providing basic infrastructure for all classes computing some correlation index between two score vectors, such as KendallTau, WeightedTau and AveragePrecisionCorrelation.
CorrelationIndex() - Constructor for class it.unimi.dsi.law.stat.CorrelationIndex
 
CosineSimilarityStrategy - Class in it.unimi.dsi.law.vector
A class that computes the similarity between patterns using cosine similarity.
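Cosine similarity is the dot product of two vectors divided by the product of their ℓ2 norms. A minimal sketch on plain double arrays follows; the class name CosineSketch is hypothetical, the library works on its own Vector type, and returning 0 for a zero vector is a convention assumed here.

```java
/** Illustrative cosine similarity on double arrays; hypothetical, not the library class. */
public final class CosineSketch {
    private CosineSketch() {}

    /** Returns dot(a, b) / (|a| * |b|), or 0 if either vector has zero norm (assumed convention). */
    public static double cosine(final double[] a, final double[] b) {
        if (a.length != b.length) throw new IllegalArgumentException("Vectors differ in length");
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0 || normB == 0) return 0;
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }
}
```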
CosineSimilarityStrategy() - Constructor for class it.unimi.dsi.law.vector.CosineSimilarityStrategy
 
count() - Method in class it.unimi.dsi.law.util.ExchangeCounter
Computes the number of exchanges.
crc - Variable in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
A class to compute the crc of read bytes.
crc - Variable in class it.unimi.dsi.law.warc.io.WarcRecord
The class used in WarcRecord.write(OutputStream) to compute CRC32 of the content for GZWarcRecord.
CRC64 - Class in it.unimi.dsi.law.util
Provides static methods to compute 64-bit CRCs of strings and byte arrays.
createHierarchicalTempFile(File, int, String, String) - Static method in class it.unimi.dsi.law.warc.util.Util
Creates a temporary file with a random hierarchical path.
creationDate - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
The warc creation-date.
CRLF - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
Some constant strings in their byte equivalent.
curChar - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
 
currentToken - Variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
This is the last token that has been consumed successfully.
CutWarc - Class in it.unimi.dsi.law.warc.tool
A class to extract specific records from a WARC file.
CutWarc() - Constructor for class it.unimi.dsi.law.warc.tool.CutWarc
 

D

danglingNodeDistribution - Variable in class it.unimi.dsi.law.rank.PageRank
The vector used to patch null rows of the adjacency matrix (u in the general formula).
DataInput2Text - Class in it.unimi.dsi.law.io.tool
The main method of this class converts a binary DataOutput file containing numbers to text format.
DataInput2Text.Type - Enum in it.unimi.dsi.law.io.tool
 
dataLength - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
The warc data-length.
DEBUG - Static variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
DEBUG - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
 
DEBUG - Static variable in class it.unimi.dsi.law.webgraph.CompressedIntLabel
 
debugStream - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Debug output.
DEFAULT - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
Lexical state.
DEFAULT_ALPHA - Static variable in class it.unimi.dsi.law.rank.PageRank
The default damping factor.
DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
The default buffer size (64KiB).
DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
The default size of the internal buffer used for headers read/write.
DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.tool.ExtractDigestUrls
 
DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.tool.ExtractLinks
 
DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.tool.ListGZWarcComments
 
DEFAULT_BUFFER_SIZE - Static variable in class it.unimi.dsi.law.warc.util.RemappedStringMap
 
DEFAULT_DOMINANT_EIGENVECTOR_NORM - Static variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
The default norm (Norm.L_2).
DEFAULT_MAX_ITER - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
Default maximum number of iterations (note that this value is used as a default by main methods).
DEFAULT_NORM - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
The default norm (Norm.L_INFINITY).
DEFAULT_THRESHOLD - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
Default threshold (note that this value is used as a default by main methods).
DenseVector - Class in it.unimi.dsi.law.vector
A mutable implementation of Vector optimized for dense vectors.
dequeue() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
derivative - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
The value of derivatives (only for the subset of nodes specified in PageRankPowerSeries.subset, if not null).
DFS - Class in it.unimi.dsi.law.graph
Computes the visit order with respect to a depth-first visit.
DFS() - Constructor for class it.unimi.dsi.law.graph.DFS
 
dfsperm(ImmutableGraph, int[]) - Static method in class it.unimi.dsi.law.graph.DFS
Returns the permutation induced by the visit order of a depth-first visit.
digest() - Method in class it.unimi.dsi.law.warc.parser.Digester
Returns the digest computed.
digest() - Method in interface it.unimi.dsi.law.warc.util.DigestBasedDuplicateDetection
Returns the content digest.
digest() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
 
DIGEST_HEADER - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
The WARC anvl-field name to store the digest.
DigestBasedDuplicateDetection - Interface in it.unimi.dsi.law.warc.util
Allows determining whether an HttpResponse is a duplicate.
DigestEquals - Class in it.unimi.dsi.law.warc.filters
Accepts only records of given digest, specified as a hexadecimal string.
Digester - Class in it.unimi.dsi.law.warc.parser
A callback computing the digest of a page.
Digester(String) - Constructor for class it.unimi.dsi.law.warc.parser.Digester
Creates a new callback using the given message digest.
digits(int) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns the number of decimal digits that are necessary to represent the argument.
digits(long) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns the number of decimal digits that are necessary to represent the argument.
disable_tracing() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Disable tracing.
dispose() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Disposes this stream, deleting the overflow file and nulling the buffer.
DominantEigenvectorParallelPowerMethod - Class in it.unimi.dsi.law.rank
Computes the left dominant eigenvalue and eigenvector of a graph using a parallel implementation of the power method.
DominantEigenvectorParallelPowerMethod(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
Creates a new instance.
DominantEigenvectorParallelPowerMethod(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
Creates a new instance.
DominantEigenvectorParallelPowerMethod(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
Creates a new instance.
Done() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reset buffer when finished.
dotProduct(Vector) - Method in class it.unimi.dsi.law.vector.DenseVector
 
dotProduct(Vector) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
dotProduct(Vector) - Method in class it.unimi.dsi.law.vector.Vector
Returns the dot product between v and this vector.
downHeap(double[], int[], int[], int, int) - Static method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
DuplicateSegmentsLessThan - Class in it.unimi.dsi.law.warc.filters
Accepts only URIs whose path does not contain too many duplicate segments.
DuplicateSegmentsLessThan(int) - Constructor for class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
Creates a filter that only accepts URIs whose path contains fewer duplicate consecutive segments than the given threshold.

E

ell1Norm() - Method in class it.unimi.dsi.law.vector.DenseVector
 
ell1Norm() - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
ell1norm - Variable in class it.unimi.dsi.law.vector.Vector
The value of computed L1 norm.
ell1Norm() - Method in class it.unimi.dsi.law.vector.Vector
Returns the l1 norm of this vector.
ell2Norm() - Method in class it.unimi.dsi.law.vector.DenseVector
 
ell2Norm() - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
ell2norm - Variable in class it.unimi.dsi.law.vector.Vector
The value of the computed L2 norm.
ell2Norm() - Method in class it.unimi.dsi.law.vector.Vector
Returns the l2 norm of this vector.
EMPTY_ARRAY - Static variable in class it.unimi.dsi.law.warc.filters.Filters
 
EmptyQueueStoppingCritertion() - Constructor for class it.unimi.dsi.law.rank.PageRankPush.EmptyQueueStoppingCritertion
 
enable_tracing() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Enable tracing.
endColumn - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
The column number of the last character of this Token.
endDocument() - Method in class it.unimi.dsi.law.warc.parser.Digester
 
endElement(Element) - Method in class it.unimi.dsi.law.warc.parser.Digester
 
endLine - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
The line number of the last character of this Token.
enqueue(int) - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
ensureDirectory(File) - Static method in class it.unimi.dsi.law.warc.util.Util
Checks whether the given File exists and is a directory; if it does not exist, creates the directory (and its parent).
entrySet - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
The cached key set of ConsistentHashFunction.replicae.
EOF - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
End of File.
EOL - Static variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
The end of line string for this machine.
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.HostEndsWith
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.HostEquals
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.SchemeEquals
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.StatusCategory
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.URLEquals
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.URLMatchesRegex
 
equals(Object) - Method in class it.unimi.dsi.law.warc.filters.URLShorterThan
 
equals(Object) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
 
equals(Object) - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
 
euclideanDistance(Vector) - Method in class it.unimi.dsi.law.vector.DenseVector
 
euclideanDistance(Vector) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
euclideanDistance(Vector) - Method in class it.unimi.dsi.law.vector.Vector
Returns the Euclidean distance between v and this vector.
EuclideanSimilarityStrategy - Class in it.unimi.dsi.law.vector
A class that computes the similarity between patterns using the Euclidean distance.
EuclideanSimilarityStrategy() - Constructor for class it.unimi.dsi.law.vector.EuclideanSimilarityStrategy
 
ExchangeCounter - Class in it.unimi.dsi.law.util
Computes the number of discordances between two score vectors using Knight's O(n log n) MergeSort-based algorithm.
ExchangeCounter(int[], double[], int[]) - Constructor for class it.unimi.dsi.law.util.ExchangeCounter
Creates a new exchange counter with a provided support array.
ExchangeCounter(int[], double[]) - Constructor for class it.unimi.dsi.law.util.ExchangeCounter
Creates a new exchange counter.
ExchangeWeigher - Class in it.unimi.dsi.law.util
Computes the weight of discordances using a generalisation of Knight's algorithm.
ExchangeWeigher(Int2DoubleFunction, int[], double[], int[], boolean, int[]) - Constructor for class it.unimi.dsi.law.util.ExchangeWeigher
Creates a new exchange weigher.
ExpandBuff(boolean) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
expectedTokenSequences - Variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
Each entry in this array is an array of integers.
ExtractDigestUrls - Class in it.unimi.dsi.law.warc.tool
A tool to extract digests and URLs from response records of a WARC file.
ExtractDigestUrls() - Constructor for class it.unimi.dsi.law.warc.tool.ExtractDigestUrls
 
ExtractLinks - Class in it.unimi.dsi.law.warc.tool
Extracts links from a WARC file.
ExtractLinks() - Constructor for class it.unimi.dsi.law.warc.tool.ExtractLinks
 

F

falling(int, int) - Static method in class it.unimi.dsi.law.Util
Computes falling powers.
FALSE - Static variable in class it.unimi.dsi.law.warc.filters.Filters
 
FALSE - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
fill(long) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Reads the underlying input stream up to a given limit.
fill() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Reads fully the underlying input stream.
fillAndRewind() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Reads fully the underlying input stream and rewinds.
FillBuff() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
Filter<T> - Interface in it.unimi.dsi.law.warc.filters
A filter is a strategy to decide whether to accept a given object or not.
FILTER_PACKAGE_NAME - Static variable in interface it.unimi.dsi.law.warc.filters.Filter
The name of the package that contains this interface as well as most filters.
FilterParser<T> - Class in it.unimi.dsi.law.warc.filters.parser
A simple parser that transforms a filter expression into a filter.
FilterParser(Class<T>) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
 
FilterParser(InputStream) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
Constructor with InputStream.
FilterParser(InputStream, String) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
Constructor with InputStream and supplied encoding.
FilterParser(Reader) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
Constructor.
FilterParser(FilterParserTokenManager) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParser
Constructor with generated Token Manager.
FilterParserConstants - Interface in it.unimi.dsi.law.warc.filters.parser
Token literal values and constants.
FilterParserTokenManager - Class in it.unimi.dsi.law.warc.filters.parser
Token Manager.
FilterParserTokenManager(SimpleCharStream) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Constructor.
FilterParserTokenManager(SimpleCharStream, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Constructor.
Filters - Class in it.unimi.dsi.law.warc.filters
A collection of static methods to deal with filters.
Filters() - Constructor for class it.unimi.dsi.law.warc.filters.Filters
 
finalize() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
fixedWidth() - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
 
fixURL(MutableString) - Static method in class it.unimi.dsi.law.warc.util.Util
Fixes a given URL so that it is BURL-parsable.
FORBIDDEN_CHARS - Static variable in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Characters that will cause a URI spec to be rejected.
FormatException(String) - Constructor for exception it.unimi.dsi.law.warc.io.WarcRecord.FormatException
 
fromBitStream(InputBitStream, int) - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
 
fromHexString(String) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns a byte array corresponding to the given number.
fromNormalizedByteArray(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Creates a new BUbiNG URL from a normalized ASCII string represented by a byte array.
fromNormalizedSchemeAuthorityAndPathQuery(String, byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Creates a new BUbiNG URL from a normalized ASCII string representing scheme and authority and a byte-array representation of a normalized ASCII path and query.
fromWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
 
fromWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.MutableHttpResponse
 
fromWarcRecord(WarcRecord) - Method in interface it.unimi.dsi.law.warc.util.Response
Fills this response with the content of a WarcRecord (optional operation).
fromWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
 

G

generateParseException() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Generate ParseException.
get(int) - Method in class it.unimi.dsi.law.vector.DenseVector
 
get(int) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
get(int) - Method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
 
get(int) - Method in class it.unimi.dsi.law.vector.Vector
Gets the value at index idx.
get(Object) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
 
getASCIIBytes(String) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns the given ASCII string as a byte array; characters are filtered through the 1111111(=0x7F) mask.
getASCIIBytes(MutableString) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns the given ASCII mutable string as a byte array; characters are filtered through the 1111111(=0x7F) mask.
getBeginColumn() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Get token beginning column number.
getBeginLine() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Get token beginning line number.
getCharsetName(byte[], int) - Static method in class it.unimi.dsi.law.warc.parser.HTMLParser
Returns the charset name as indicated by a META HTTP-EQUIV element, if present, interpreting the provided byte array as a sequence of ISO-8859-1-encoded characters.
getCharsetNameFromHeader(String) - Static method in class it.unimi.dsi.law.warc.parser.HTMLParser
Extracts the charset name from the header value of a content-type header.
getColumn() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Deprecated.
getEndColumn() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Get token end column number.
getEndLine() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Get token end line number.
getFilterFromSpec(String, String, Class<T>) - Static method in class it.unimi.dsi.law.warc.filters.Filters
Creates a filter from a filter class name and an external form.
GetImage() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Get token literal value.
getInstance(int, int) - Static method in class it.unimi.dsi.law.vector.DenseVector
Returns an instance of given size with zero values.
getInstance(double[], int) - Static method in class it.unimi.dsi.law.vector.DenseVector
Returns an instance from an array of values.
getInstance(int, double[], int[], int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
Returns an instance of given size from an array of values.
getInstance(Vector, double, int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
Returns an instance containing all the values of a given vector larger or smaller than a given threshold.
getInstance(double[], double, int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
Returns an instance containing all the values of a given array larger or smaller than a given threshold.
getInstance(int[], double[], int, double, int) - Static method in class it.unimi.dsi.law.vector.ImmutableSparseVector
Returns an instance containing all the values of a given array larger than a given threshold.
getInstance(int, int) - Static method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
Returns an instance of given size with zero values.
getInstance(int, Int2DoubleMap, int) - Static method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
Returns an instance from the given Int2DoubleMap.
getLine() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Deprecated.
getLong(Object) - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
 
getMessage() - Method in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
You can also modify the body of this method to customize your error messages.
getNextToken() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Get the next Token.
getNextToken() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Get the next Token.
getString(byte[]) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns the given byte array as an ASCII string.
getString(byte[], int, int) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns the given byte array as an ASCII string.
GetSuffix(int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Get the suffix.
getTabSize() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
getToken(int) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Get the specific Token.
getValue() - Method in class it.unimi.dsi.law.warc.filters.parser.Token
An optional attribute value of the Token.
graph - Variable in class it.unimi.dsi.law.rank.SpectralRanking
The graph.
GrepWarc - Class in it.unimi.dsi.law.warc.tool
A "grep" for WARC files.
GrepWarc() - Constructor for class it.unimi.dsi.law.warc.tool.GrepWarc
 
ground() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
 
GUESSED_CHARSET_HEADER - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
The WARC anvl-field name to store the charset recognized during parsing.
guessedCharset() - Method in class it.unimi.dsi.law.warc.parser.BinaryParser
 
guessedCharset() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
 
guessedCharset() - Method in interface it.unimi.dsi.law.warc.parser.Parser
Returns a guessed charset for the document, or null if the charset could not be guessed.
gzheader - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord
The GZip headers used by this object.
GZHeader() - Constructor for class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
 
GZWarcRecord - Class in it.unimi.dsi.law.warc.io
A class to read/write WARC/0.9 records in compressed form (for format details, please see the WARC and GZip format specifications).
GZWarcRecord() - Constructor for class it.unimi.dsi.law.warc.io.GZWarcRecord
 
GZWarcRecord.GZHeader - Class in it.unimi.dsi.law.warc.io
A class holding the fields contained in the gzip header.
GZWarcStats - Class in it.unimi.dsi.law.warc.tool
A tool to compute some statistics about a gzipped WARC file.
GZWarcStats() - Constructor for class it.unimi.dsi.law.warc.tool.GZWarcStats
 

H

hash(long, int) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
Returns an array of buckets whose replicae are close to the given point.
hash(Object, int) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
Returns an array of buckets whose replicae are close to the given object.
hash(long) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
Returns the bucket of the replica that is closest to the given point.
hash(Object) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
Returns the bucket of the replica that is closest to the given key.
hashCode() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
 
hashCode() - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
 
hashCode() - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
 
hasNext() - Method in class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
 
hasNext() - Method in class it.unimi.dsi.law.warc.io.WarcFilteredIterator
 
header - Variable in class it.unimi.dsi.law.warc.io.WarcRecord
The warc header.
Header() - Constructor for class it.unimi.dsi.law.warc.io.WarcRecord.Header
 
HEADER_CHARSET - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
The Charset used to encode/decode the HTTP headers.
headerMap - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
The header map wrapping HttpComponentsHttpResponse.httpResponse's headers.
headerMap - Variable in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
The header map.
HeaderMap() - Constructor for class it.unimi.dsi.law.warc.util.MetadataHttpResponse.HeaderMap
 
headers() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
 
headers() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
Returns the headers of this response.
headers() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
 
headers(Object2ObjectMap<String, String>) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
Sets the headers.
headers() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
 
host(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Extracts the host of an absolute BUbiNG URL in its byte-array representation.
HostEndsWith - Class in it.unimi.dsi.law.warc.filters
Accepts only URIs whose host ends with (case-insensitively) a certain suffix.
HostEndsWith(String) - Constructor for class it.unimi.dsi.law.warc.filters.HostEndsWith
Creates a filter that only accepts URLs with a given suffix.
HostEquals - Class in it.unimi.dsi.law.warc.filters
Accepts only URIs whose host equals (case-insensitively) a certain string.
HostEquals(String) - Constructor for class it.unimi.dsi.law.warc.filters.HostEquals
Creates a filter that only accepts URLs with a given host.
hostFromSchemeAndAuthority(String) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Extracts the host part from a scheme and authority by removing the scheme, the user info and the port number.
HTMLParser - Class in it.unimi.dsi.law.warc.parser
An HTML parser with additional responsibilities (such as guessing the character encoding and resolving relative URLs).
HTMLParser(MessageDigest) - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser
Builds a parser for link extraction and, possibly, digesting a page.
HTMLParser(String) - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser
Builds a parser for link extraction and, possibly, digesting a page.
HTMLParser() - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser
Builds a parser for link extraction.
HTMLParser.SetLinkReceiver - Class in it.unimi.dsi.law.warc.parser
 
HttpComponentsHttpResponse - Class in it.unimi.dsi.law.warc.util
A concrete subclass of AbstractHttpResponse that implements missing methods by wrapping an Apache HTTP Components HttpResponse.
HttpComponentsHttpResponse() - Constructor for class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
Creates a new instance.
HttpComponentsHttpResponse(URI, HttpResponse) - Constructor for class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
Creates a new instance wrapping a given Apache HTTP Components HttpResponse.
HttpComponentsHttpResponse.HttpResponseHeaderMap - Class in it.unimi.dsi.law.warc.util
A wrapper class exposing headers in HttpResponse.headers() format by delegating to an HttpResponse.
httpResponse - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
The response wrapped by this HttpComponentsHttpResponse.
HttpResponse - Interface in it.unimi.dsi.law.warc.util
Provides high level access to WARC records with record-type equal to response and content-type equal to HTTP (or HTTPS).
HttpResponseFilteredIterator - Class in it.unimi.dsi.law.warc.io
A class to iterate over WARC files getting only records corresponding to HttpResponse that satisfy a given filter.
HttpResponseFilteredIterator(FastBufferedInputStream, WarcRecord, WarcHttpResponse, Filter<HttpResponse>, ProgressLogger) - Constructor for class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
Builds the filtered iterator.
HttpResponseFilteredIterator(FastBufferedInputStream, WarcRecord, WarcHttpResponse, Filter<HttpResponse>) - Constructor for class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
Builds the filtered iterator.
HttpResponseHeaderMap() - Constructor for class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
 
HYPERBOLIC - Static variable in class it.unimi.dsi.law.stat.WeightedTau
A singleton instance of the symmetric hyperbolic additive τ.
HYPERBOLIC_WEIGHER - Static variable in class it.unimi.dsi.law.stat.WeightedTau
A hyperbolic weigher (the default one).

I

id - Variable in class it.unimi.dsi.law.vector.Vector
The description ID associated with this vector (immutable).
image - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
The string image of the token.
ImmutableSparseVector - Class in it.unimi.dsi.law.vector
An immutable implementation of Vector optimized for sparse vectors.
inBuf - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
index - Variable in class it.unimi.dsi.law.vector.ImmutableSparseVector
An array containing the indexes of the vector values.
IndexWarc - Class in it.unimi.dsi.law.warc.tool
A tool to index a WARC file.
IndexWarc() - Constructor for class it.unimi.dsi.law.warc.tool.IndexWarc
 
init() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
 
init() - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
 
init() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
 
init() - Method in class it.unimi.dsi.law.rank.PageRank
Basic initialization: we log the damping factor, check that the preference vector is correctly sized and stochastic, fill SpectralRanking.rank with the preference vector and set the dangling-node distribution depending on the value of PageRank.stronglyPreferential.
init() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
 
init() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
 
init() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
 
init() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
 
init() - Method in class it.unimi.dsi.law.rank.PageRankPush
 
init() - Method in class it.unimi.dsi.law.rank.PowerSeries
 
init() - Method in class it.unimi.dsi.law.rank.SpectralRanking
Initializes the rank vector, zeroes SpectralRanking.iteration and logs basic data.
init(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
 
init(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
Initializes this receiver for a new page.
input_stream - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
 
inputStream - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
inspectable - Variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
The number of bytes read in the buffer, when connected.
InspectableBufferedInputStream - Class in it.unimi.dsi.law.warc.io
An input stream that wraps an underlying input stream to make it rewindable and partially inspectable, using a bounded-capacity memory buffer and an overflow file.
InspectableBufferedInputStream(int, File) - Constructor for class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Creates a new ready stream.
InspectableBufferedInputStream(int) - Constructor for class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Creates a new ready stream using default temporary directory for the overflow file.
InspectableBufferedInputStream() - Constructor for class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Creates a new ready stream with default buffer size, and using default temporary directory for the overflow file.
InspectableBufferedInputStream.State - Enum in it.unimi.dsi.law.warc.io
The possible states of this stream, as explained above.
INSTANCE - Static variable in class it.unimi.dsi.law.stat.AveragePrecisionCorrelation
The singleton instance of this class.
INSTANCE - Static variable in class it.unimi.dsi.law.stat.KendallTau
The singleton instance of this class.
INSTANCE - Static variable in class it.unimi.dsi.law.warc.filters.IsHttpResponse
 
INSTANCE - Static variable in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
 
Int2DoubleMapVector - Class in it.unimi.dsi.law.vector
A mutable implementation of Vector for sparse vectors.
intermediateRank - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
The rank vector obtained after the first half of a round.
IntHeapIndirectPriorityQueue() - Constructor for class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
INVALID_ID - Static variable in class it.unimi.dsi.law.vector.Vector
A value indicating that the norm is not computed for current values.
INVALID_LEXICAL_STATE - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
Tried to change to an invalid lexical state.
INVALID_NORM - Static variable in class it.unimi.dsi.law.vector.Vector
A value indicating that the norm is not computed for current values.
isDuplicate() - Method in interface it.unimi.dsi.law.warc.util.DigestBasedDuplicateDetection
Returns the duplicate status of this response.
isDuplicate() - Method in class it.unimi.dsi.law.warc.util.WarcHttpResponse
 
ISDUPLICATE_HEADER - Static variable in interface it.unimi.dsi.law.warc.util.HttpResponse
The WARC anvl-field name to store the digest.
isEmpty() - Method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
IsHttpResponse - Class in it.unimi.dsi.law.warc.filters
Accepts only records that are http/https responses.
isMutable() - Method in class it.unimi.dsi.law.vector.Vector
Returns the mutability status of this vector.
IsProbablyBinary - Class in it.unimi.dsi.law.warc.filters
Accepts only http responses whose content stream appears to be binary.
isSkippable(T) - Method in interface it.unimi.dsi.law.util.ConsistentHashFunction.SkipStrategy
Checks whether a bucket can be returned or should be skipped.
isStochastic(DoubleList) - Static method in class it.unimi.dsi.law.rank.SpectralRanking
Convenience method checking whether a vector is stochastic (nonnegative entries summing up to one within SpectralRanking.STOCHASTIC_TOLERANCE).
it.unimi.dsi.law - package it.unimi.dsi.law
Basic classes.
it.unimi.dsi.law.bubing.util - package it.unimi.dsi.law.bubing.util
 
it.unimi.dsi.law.graph - package it.unimi.dsi.law.graph
Graph-related classes.
it.unimi.dsi.law.io.tool - package it.unimi.dsi.law.io.tool
Tools manipulating and converting files.
it.unimi.dsi.law.rank - package it.unimi.dsi.law.rank
Computation of spectral rankings and associated utilities.
it.unimi.dsi.law.stat - package it.unimi.dsi.law.stat
Statistical tools (in particular, Kendall's τ) for large-size data.
it.unimi.dsi.law.util - package it.unimi.dsi.law.util
Utility classes.
it.unimi.dsi.law.vector - package it.unimi.dsi.law.vector
 
it.unimi.dsi.law.warc.filters - package it.unimi.dsi.law.warc.filters
A comprehensive filtering system.
it.unimi.dsi.law.warc.filters.parser - package it.unimi.dsi.law.warc.filters.parser
 
it.unimi.dsi.law.warc.io - package it.unimi.dsi.law.warc.io
Provides classes performing low and high level WARC I/O (for format details, please see the ISO draft).
it.unimi.dsi.law.warc.io.examples - package it.unimi.dsi.law.warc.io.examples
 
it.unimi.dsi.law.warc.parser - package it.unimi.dsi.law.warc.parser
Extensions of the BulletParser.
it.unimi.dsi.law.warc.tool - package it.unimi.dsi.law.warc.tool
Command-line tools that manipulate WARC files.
it.unimi.dsi.law.warc.util - package it.unimi.dsi.law.warc.util
 
it.unimi.dsi.law.webgraph - package it.unimi.dsi.law.webgraph
 
iteration - Variable in class it.unimi.dsi.law.rank.SpectralRanking
The current step (0 after initialization).
IterationNumberStoppingCriterion(int) - Constructor for class it.unimi.dsi.law.rank.SpectralRanking.IterationNumberStoppingCriterion
Creates an instance with a given number of iterations.
iterator() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
 

J

jj_nt - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Next token.
jjFillToken() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
 
jjstrLiteralImages - Static variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Token literal values.

K

KahanSummation - Class in it.unimi.dsi.law.util
Kahan's summation algorithm encapsulated in an object.
KahanSummation() - Constructor for class it.unimi.dsi.law.util.KahanSummation
 
KatzParallelGaussSeidel - Class in it.unimi.dsi.law.rank
Computes Katz's index using a parallel implementation of the Gauß–Seidel method; this is the implementation of choice to be used when computing Katz's index.
KatzParallelGaussSeidel(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
Creates a new instance.
KatzParallelGaussSeidel(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
Creates a new instance.
KendallTau - Class in it.unimi.dsi.law.stat
Computes Kendall's τ between two score vectors.
kind - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
An integer that describes the kind of this token.

L

L1NormStoppingCritertion() - Constructor for class it.unimi.dsi.law.rank.PageRankPush.L1NormStoppingCritertion
 
labelBasename(String) - Method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
Sets the basename for label files.
labelPropagation(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
lambda - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
The dominant eigenvalue.
largestIndegree(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
largestOutdegree(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
LayeredLabelPropagation - Class in it.unimi.dsi.law.graph
An implementation of the layered label propagation algorithm described by Paolo Boldi, Marco Rosa, Massimo Santini, and Sebastiano Vigna in “Layered label propagation: A multiresolution coordinate-free ordering for compressing social networks”, Proceedings of the 20th international conference on World Wide Web, pages 587–596, ACM, 2011.
LayeredLabelPropagation(ImmutableGraph, long) - Constructor for class it.unimi.dsi.law.graph.LayeredLabelPropagation
Creates a new instance.
LayeredLabelPropagation(ImmutableGraph, int[], long) - Constructor for class it.unimi.dsi.law.graph.LayeredLabelPropagation
Creates a new instance using a specific initial permutation.
LayeredLabelPropagation(ImmutableGraph, int[], long, boolean) - Constructor for class it.unimi.dsi.law.graph.LayeredLabelPropagation
Creates a new instance using a specific initial permutation.
LeftSingularVectorParallelPowerMethod - Class in it.unimi.dsi.law.rank
Computes the left singular vector of a graph using a parallel implementation of the power method.
LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
Creates a new instance.
LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
Creates a new instance.
LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph, int) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
Creates a new instance.
LeftSingularVectorParallelPowerMethod(ImmutableGraph, ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
Creates a new instance.
length() - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
 
length() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Returns the overall length of this input stream.
length() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
 
length() - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
 
LEXICAL_ERROR - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
Lexical error occurred.
LexicalErr(boolean, int, int, int, String, int) - Static method in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
Returns a detailed message for the Error when it is thrown by the token manager to indicate a lexical error.
lexStateNames - Static variable in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Lexer state names.
line - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
link(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
 
link(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
Handles a link.
list() - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
 
ListGZWarcComments - Class in it.unimi.dsi.law.warc.tool
A tool to list the GZip header comments contained in a compressed WARC file.
ListGZWarcComments() - Constructor for class it.unimi.dsi.law.warc.tool.ListGZWarcComments
 
loadAsDoubles(CharSequence, Class<?>, boolean) - Static method in class it.unimi.dsi.law.stat.CorrelationIndex
Loads a vector of doubles, either in binary or textual form.
location() - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
Returns the BURL location header, if present; if it is not present, but the page contains a valid metalocation, the latter is returned.
location(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
 
location(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
Handles the location defined by headers.
log10(int) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns ⌊ log10(x) ⌋.
log10(long) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns ⌊ log10(x) ⌋.
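As an illustration of what the two Util.log10 entries above compute, here is a minimal sketch that obtains ⌊ log10(x) ⌋ with integer arithmetic only. The class name FloorLog10Sketch, the method floorLog10 and the rejection of non-positive arguments are assumptions made for this example; the library's own implementation and its treatment of zero are not shown in this index.

```java
final class FloorLog10Sketch {
    /**
     * Returns floor(log10(x)) for x > 0 by counting how many times x can be
     * divided by 10 (hypothetical helper, not the library's code).
     */
    static int floorLog10(long x) {
        if (x <= 0) throw new IllegalArgumentException("x must be positive: " + x);
        int result = 0;
        while (x >= 10) {
            x /= 10;   // drop the least significant decimal digit
            result++;  // one more power of ten fits into the original value
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(floorLog10(12345)); // prints 4
    }
}
```

Integer division avoids the rounding surprises that Math.log10 can show near exact powers of ten, at the cost of a loop of at most 18 iterations for a long.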
LOGARITHMIC_WEIGHER - Static variable in class it.unimi.dsi.law.stat.WeightedTau
A logarithmic weigher.
logger - Variable in class it.unimi.dsi.law.rank.SpectralRanking
A logger defined by the implementing subclasses.
LOGGER - Static variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
LOOP_DETECTED - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
Detected (and bailed out of) an infinite loop in the token manager.

M

main(String[]) - Static method in class it.unimi.dsi.law.graph.BFS
 
main(String[]) - Static method in class it.unimi.dsi.law.graph.DFS
 
main(String[]) - Static method in class it.unimi.dsi.law.graph.LayeredLabelPropagation
 
main(String[]) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
main(String[]) - Static method in class it.unimi.dsi.law.io.tool.DataInput2Text
 
main(String[]) - Static method in class it.unimi.dsi.law.io.tool.NumberDistinctLines
 
main(String[]) - Static method in class it.unimi.dsi.law.io.tool.Text2DataOutput
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.PageRankFromCoefficients
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.PageRankPowerSeries
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.PageRankPush
 
main(String[]) - Static method in class it.unimi.dsi.law.rank.PowerSeries
 
main(String[]) - Static method in class it.unimi.dsi.law.stat.AveragePrecisionCorrelation
 
main(String[]) - Static method in class it.unimi.dsi.law.stat.KendallTau
 
main(String[]) - Static method in class it.unimi.dsi.law.stat.WeightedTau
 
main(String[]) - Static method in enum it.unimi.dsi.law.util.Norm
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.io.examples.SequentialHttpResponseRead
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.io.examples.SequentialHttpResponseWrite
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.io.examples.SequentialWarcRecordRead
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.io.examples.SequentialWarcRecordWrite
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.CompressWarc
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.CutWarc
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.ExtractDigestUrls
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.ExtractLinks
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.GrepWarc
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.GZWarcStats
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.IndexWarc
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.tool.ListGZWarcComments
 
main(String[]) - Static method in class it.unimi.dsi.law.warc.util.RemappedStringMap
 
markovian - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
If true, the matrix will be stochasticized.
markovian - Variable in class it.unimi.dsi.law.rank.PowerSeries
If true, the adjacency matrix of the graph will be stochasticised, thus computing a pseudorank.
MAX_RATIO_STOPPING_CRITERION - Static variable in class it.unimi.dsi.law.rank.PowerSeries
A stopping criterion that stops when PowerSeries.maxRatio is smaller than the reciprocal of PowerSeries.alpha.
MAX_UPDATES - Static variable in class it.unimi.dsi.law.graph.LayeredLabelPropagation
The default maximum number of updates.
maxNextCharInd - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
maxRatio - Variable in class it.unimi.dsi.law.rank.PowerSeries
The maximum ratio between components.
MeasurableSequenceInputStream - Class in it.unimi.dsi.law.warc.io
MeasurableSequenceInputStream(MeasurableInputStream...) - Constructor for class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
Constructs a sequence from an array of input streams.
memoryUsageOf(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Returns the memory usage associated to a byte array.
MetadataHttpResponse - Class in it.unimi.dsi.law.warc.util
An abstract extension of AbstractHttpResponse which additionally provides support for getting and setting metadata (i.e., MetadataHttpResponse.uri(), MetadataHttpResponse.statusLine(), MetadataHttpResponse.status() and MetadataHttpResponse.headers()).
MetadataHttpResponse() - Constructor for class it.unimi.dsi.law.warc.util.MetadataHttpResponse
 
MetadataHttpResponse.HeaderMap - Class in it.unimi.dsi.law.warc.util
A special map used for headers: keys are case-insensitive, and multiple puts are converted into comma-separated values.
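The behaviour described for MetadataHttpResponse.HeaderMap (case-insensitive keys, repeated puts merged into comma-separated values) can be sketched as follows. This is only an illustration: the class HeaderMapSketch, its TreeMap-based storage and the bare "," separator are assumptions, not the library's actual implementation.

```java
import java.util.Map;
import java.util.TreeMap;

final class HeaderMapSketch {
    // A case-insensitive comparator makes "Accept" and "ACCEPT" the same key.
    private final Map<String, String> headers = new TreeMap<>(String.CASE_INSENSITIVE_ORDER);

    /**
     * Stores a header value. If the key is already present, the new value is
     * appended after a comma. Returns the previous value, or null if none.
     */
    String put(String name, String value) {
        String previous = headers.get(name);
        headers.put(name, previous == null ? value : previous + "," + value);
        return previous;
    }

    /** Returns the (possibly comma-separated) value for a header, or null. */
    String get(String name) {
        return headers.get(name);
    }

    public static void main(String[] args) {
        HeaderMapSketch map = new HeaderMapSketch();
        map.put("Accept", "text/html");
        map.put("accept", "text/plain");
        System.out.println(map.get("ACCEPT")); // prints text/html,text/plain
    }
}
```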
metaLocation(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
 
metaLocation(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
Handles the location defined by a META element.
metaRefresh(URI) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
 
metaRefresh(URI) - Method in interface it.unimi.dsi.law.warc.parser.Parser.LinkReceiver
Handles the refresh defined by a META element.
MIN_SCALE - Static variable in class it.unimi.dsi.law.rank.PowerSeries
Below this scale, we stop the iterative process.
minRatio - Variable in class it.unimi.dsi.law.rank.PowerSeries
The minimum ratio between components.
mtime - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
The mtime gzip field.
MutableHttpResponse - Class in it.unimi.dsi.law.warc.util
A mutable extension of MetadataHttpResponse that provides support for setting the content stream.
MutableHttpResponse() - Constructor for class it.unimi.dsi.law.warc.util.MutableHttpResponse
 

N

n - Variable in class it.unimi.dsi.law.rank.SpectralRanking
The number of nodes of SpectralRanking.graph, cached.
name - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
The (optional) name gzip field.
newToken(int, String) - Static method in class it.unimi.dsi.law.warc.filters.parser.Token
Returns a new Token object, by default.
newToken(int) - Static method in class it.unimi.dsi.law.warc.filters.parser.Token
 
next - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
A reference to the next regular (non-special) token from the input stream.
next() - Method in class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
 
next() - Method in class it.unimi.dsi.law.warc.io.WarcFilteredIterator
 
node2Seen - Variable in class it.unimi.dsi.law.rank.PageRankPush
A map from nodes to the seen-order.
nodeLabels - Variable in class it.unimi.dsi.law.webgraph.CompressedIntLabel
The node labels stream containing the node labels.
nonZero - Variable in class it.unimi.dsi.law.vector.ImmutableSparseVector
The number of non-zero entries in this vector.
norm - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
The norm.
norm - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
The norm.
Norm - Enum in it.unimi.dsi.law.util
An Enum providing different ℓ norms.
normalize(double[], double) - Method in enum it.unimi.dsi.law.util.Norm
Normalizes a vector to a given norm value.
normalize(double[], double) - Static method in class it.unimi.dsi.law.util.NormL1
Deprecated.
 
normalize(double[], double) - Static method in class it.unimi.dsi.law.util.NormL2
Deprecated.
 
normDelta() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
 
normDelta() - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
 
normDelta() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
 
normDelta() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
Returns an upper bound on the ℓ1 norm of the error, obtained by multiplying the ℓ1 norm of the difference between the last two approximations by α / (1 − α) (this idea arose in discussions with David Gleich).
normDelta() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
Returns the following value: if a suitable norm vector has been set, an upper bound on the error (the ℓ distance from the rank to be computed); otherwise, an upper bound on the ℓ1 norm of the error, obtained by multiplying the ℓ1 norm of the difference between the last two approximations by α / (1 − α) (this idea arose in discussions with David Gleich).
normDelta() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
 
normDelta() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
 
normDelta() - Method in class it.unimi.dsi.law.rank.PowerSeries
 
normDelta() - Method in class it.unimi.dsi.law.rank.SpectralRanking
Returns the norm of an estimation of the distance to the limit of the iterative process: depending on the implementation, this can be an actual bound or, for example, just the difference between the last two approximations.
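The ℓ1 bound mentioned in the PageRankGaussSeidel and PageRankParallelGaussSeidel entries above, α / (1 − α) times the ℓ1 norm of the difference between the last two approximations, can be computed as in the following sketch. The class NormDeltaSketch, the method l1ErrorBound and its array parameters are hypothetical names chosen for this example; the real classes keep their vectors in fields.

```java
final class NormDeltaSketch {
    /**
     * Returns alpha / (1 - alpha) * ||current - previous||_1, the bound
     * described in the index entries (hypothetical helper).
     */
    static double l1ErrorBound(double[] previous, double[] current, double alpha) {
        if (previous.length != current.length)
            throw new IllegalArgumentException("Vectors must have the same length");
        if (alpha < 0 || alpha >= 1)
            throw new IllegalArgumentException("alpha must be in [0, 1): " + alpha);
        double l1 = 0;
        for (int i = 0; i < current.length; i++) l1 += Math.abs(current[i] - previous[i]);
        return alpha / (1 - alpha) * l1;
    }

    public static void main(String[] args) {
        double[] previous = { 0.5, 0.5 };
        double[] current = { 0.25, 0.75 };
        System.out.println(l1ErrorBound(previous, current, 0.5)); // prints 0.5
    }
}
```

With α = 0.5 the factor α / (1 − α) is 1, so the bound equals the ℓ1 distance between the two approximations; as α approaches 1 the factor grows without limit.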
NormL1 - Class in it.unimi.dsi.law.util
Deprecated.
NormL2 - Class in it.unimi.dsi.law.util
Deprecated.
NormStoppingCriterion(double) - Constructor for class it.unimi.dsi.law.rank.SpectralRanking.NormStoppingCriterion
Creates an instance with given threshold.
normVector(String, double) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
Sets the norm vector.
normVector(double[], double) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
Sets the norm vector.
normVector(String, double) - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
Sets the norm vector.
normVector(double[], double) - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
Sets the norm vector.
not(Filter<T>) - Static method in class it.unimi.dsi.law.warc.filters.Filters
Produces the negation of the given filter.
NOT - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
NULL_LINK_RECEIVER - Static variable in interface it.unimi.dsi.law.warc.parser.Parser
A no-op implementation of Parser.LinkReceiver.
NumberDistinctLines - Class in it.unimi.dsi.law.io.tool
The main method of this class reads a UTF-8 file containing a newline-separated list of strings and writes a DataOutputStream containing a list of ints such that the i-th int is equal to the j-th int iff the (crc of the) i-th string is equal to the (crc of the) j-th string.
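The numbering property stated for NumberDistinctLines can be illustrated with a small in-memory sketch. It assumes java.util.zip.CRC32 as the checksum and assigns numbers in order of first appearance; both choices, and the names NumberDistinctLinesSketch and number, are assumptions for this example. The actual tool reads a file and writes a DataOutputStream.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.zip.CRC32;

final class NumberDistinctLinesSketch {
    /**
     * Returns an array whose i-th and j-th elements are equal iff the CRC32
     * of the i-th and j-th strings (encoded in UTF-8) are equal.
     */
    static int[] number(List<String> lines) {
        Map<Long, Integer> idOfCrc = new HashMap<>();
        int[] ids = new int[lines.size()];
        CRC32 crc = new CRC32();
        for (int i = 0; i < ids.length; i++) {
            crc.reset();
            crc.update(lines.get(i).getBytes(StandardCharsets.UTF_8));
            Integer id = idOfCrc.get(crc.getValue());
            if (id == null) {
                id = idOfCrc.size(); // next unused number, in order of first appearance
                idOfCrc.put(crc.getValue(), id);
            }
            ids[i] = id;
        }
        return ids;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(number(Arrays.asList("a", "b", "a"))));
    }
}
```

Because the comparison is on checksums rather than on the strings themselves, two different strings whose checksums collide receive the same number.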

O

object2ObjectEntrySet() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
 
OPENPAREN - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
or(SpectralRanking.StoppingCriterion, SpectralRanking.StoppingCriterion) - Static method in class it.unimi.dsi.law.rank.SpectralRanking
Composes two stopping criteria, producing a single stopping criterion (the computation stops iff either condition becomes true; lazy boolean evaluation is applied).
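The lazy disjunction described for SpectralRanking.or can be sketched with a short-circuit operator. The generic interface StoppingCriterion<S> below is a hypothetical stand-in declared only for this example; it is not the library's SpectralRanking.StoppingCriterion.

```java
final class StoppingCriteriaSketch {
    /** Hypothetical stand-in: decides whether a computation in the given state should stop. */
    interface StoppingCriterion<S> {
        boolean shouldStop(S state);
    }

    /**
     * Composes two criteria: the result stops iff either one does. The second
     * criterion is consulted only when the first returns false (lazy evaluation).
     */
    static <S> StoppingCriterion<S> or(StoppingCriterion<S> first, StoppingCriterion<S> second) {
        return state -> first.shouldStop(state) || second.shouldStop(state);
    }

    public static void main(String[] args) {
        StoppingCriterion<Integer> tooManyIterations = iteration -> iteration >= 100;
        StoppingCriterion<Integer> never = iteration -> false;
        System.out.println(or(tooManyIterations, never).shouldStop(100)); // prints true
    }
}
```

Laziness matters when the second criterion is expensive to evaluate or has side effects, since it is skipped whenever the first one already asks to stop.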
or(Filter<T>...) - Static method in class it.unimi.dsi.law.warc.filters.Filters
Produces the disjunction of the given filters.
or() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
 
OR - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
order - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
The order of the derivatives.
outdegree - Variable in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
The outdegree of each node (initialized after the first computation).
OVERFLOW_FILE_RANDOM_PATH_ELEMENTS - Static variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
The number of path elements for the hierarchical overflow file (see Util.createHierarchicalTempFile(File, int, String, String)).
overflowFile - Variable in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
The overflow file used by this stream: it is created at construction time, and deleted on InspectableBufferedInputStream.dispose(), finalization, or exit.
overflowLength() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Returns the current length of the overflow file.

P

pageRank(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
PageRank - Class in it.unimi.dsi.law.rank
An abstract class defining methods and attributes supporting PageRank computations.
PageRank(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRank
Creates a new instance.
PageRankFromCoefficients - Class in it.unimi.dsi.law.rank
Computes PageRank using its power series.
PageRankFromCoefficients() - Constructor for class it.unimi.dsi.law.rank.PageRankFromCoefficients
 
PageRankGaussSeidel - Class in it.unimi.dsi.law.rank
Computes PageRank of a graph using the Gauß–Seidel method.
PageRankGaussSeidel(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankGaussSeidel
Creates a new instance.
PageRankGaussSeidel(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankGaussSeidel
Creates a new instance.
PageRankParallelGaussSeidel - Class in it.unimi.dsi.law.rank
Computes PageRank using a parallel (multicore) implementation of the Gauß–Seidel method.
PageRankParallelGaussSeidel(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
Creates a new instance.
PageRankParallelGaussSeidel(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
Creates a new instance.
PageRankParallelGaussSeidel(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
Creates a new instance.
PageRankParallelPowerSeries - Class in it.unimi.dsi.law.rank
Computes PageRank using a parallel (multicore) implementation of the power-series method, which runs the power method starting from the preference vector, thus evaluating the truncated PageRank power series (see PageRankPowerSeries).
PageRankParallelPowerSeries(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
Creates a new instance.
PageRankParallelPowerSeries(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
Creates a new instance.
PageRankPowerSeries - Class in it.unimi.dsi.law.rank
Computes PageRank (and possibly its derivatives in the damping factor) using its power series.
PageRankPowerSeries(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PageRankPowerSeries
Creates a new instance.
PageRankPowerSeries(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PageRankPowerSeries
Creates a new instance.
PageRankPush - Class in it.unimi.dsi.law.rank
Computes strongly preferential PageRank for a preference vector concentrated on a node using the push algorithm.
PageRankPush(ImmutableGraph, Logger, boolean) - Constructor for class it.unimi.dsi.law.rank.PageRankPush
Creates a new instance.
PageRankPush(ImmutableGraph, boolean) - Constructor for class it.unimi.dsi.law.rank.PageRankPush
Creates a new instance.
PageRankPush.EmptyQueueStoppingCritertion - Class in it.unimi.dsi.law.rank
 
PageRankPush.IntHeapIndirectPriorityQueue - Class in it.unimi.dsi.law.rank
 
PageRankPush.L1NormStoppingCritertion - Class in it.unimi.dsi.law.rank
 
parse(String) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Creates a new BUbiNG URL from a string specification if possible, or returns null otherwise.
parse(MutableString) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Creates a new BUbiNG URL from a mutable string specification if possible, or returns null otherwise.
parse(String) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
 
parse(Response, Parser.LinkReceiver) - Method in class it.unimi.dsi.law.warc.parser.BinaryParser
 
parse(Response, Parser.LinkReceiver) - Method in class it.unimi.dsi.law.warc.parser.HTMLParser
 
parse(Response, Parser.LinkReceiver) - Method in interface it.unimi.dsi.law.warc.parser.Parser
Parses a response.
parseCommaSeparatedProperty(String) - Static method in class it.unimi.dsi.law.warc.util.Util
The given string is parsed as a comma-separated list of items, and the items are returned in the form of an array, possibly after resolving an indirection.
ParseException - Exception in it.unimi.dsi.law.warc.filters.parser
This exception is thrown when parse errors are encountered.
ParseException(Token, int[][], String[]) - Constructor for exception it.unimi.dsi.law.warc.filters.parser.ParseException
This constructor is used by the method "generateParseException" in the generated parser.
ParseException() - Constructor for exception it.unimi.dsi.law.warc.filters.parser.ParseException
The following constructors are for use by you for whatever purpose you can think of.
ParseException(String) - Constructor for exception it.unimi.dsi.law.warc.filters.parser.ParseException
Constructor with message.
parseInputTypes(JSAPResult) - Static method in class it.unimi.dsi.law.stat.CorrelationIndex
Convenience method to extract from a JSAPResult instance the file type information provided by the user, or supply the default (doubles in binary form).
Parser - Interface in it.unimi.dsi.law.warc.parser
A generic parser for responses.
Parser.LinkReceiver - Interface in it.unimi.dsi.law.warc.parser
A class that can receive URLs discovered during parsing.
pathAndQuery(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Returns the concatenated raw path and raw query of a BUbiNG URL.
pathAndQueryAsByteArray(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Returns an ASCII byte-array representation of the raw path and raw query of a BUbiNG URL.
pathAndQueryAsByteArray(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Extracts the path and query of an absolute BUbiNG URL in its byte-array representation.
PathEndsWithOneOf - Class in it.unimi.dsi.law.warc.filters
Accepts only URIs whose path ends (case-insensitively) with one of a given set of suffixes.
PathEndsWithOneOf(String[]) - Constructor for class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
Creates a filter that only accepts URLs whose path ends with one of a given set of suffixes.
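The check performed by a filter such as PathEndsWithOneOf can be sketched as a case-insensitive suffix test on the path of a java.net.URI. The helper below is an assumption-laden illustration: the name PathSuffixSketch, the use of the decoded path from URI.getPath() and the rejection of URIs without a path are choices made for this example, not documented behaviour of the library class.

```java
import java.net.URI;
import java.util.Locale;

final class PathSuffixSketch {
    /** Returns true iff the URI path ends, ignoring case, with one of the given suffixes. */
    static boolean pathEndsWithOneOf(URI uri, String... suffixes) {
        String path = uri.getPath();
        if (path == null) return false; // opaque URIs such as mailto: have no path
        String lowerPath = path.toLowerCase(Locale.ROOT);
        for (String suffix : suffixes) {
            if (lowerPath.endsWith(suffix.toLowerCase(Locale.ROOT))) return true;
        }
        return false;
    }

    public static void main(String[] args) {
        URI uri = URI.create("http://example.com/a/IMG.JPG?x=1");
        System.out.println(pathEndsWithOneOf(uri, ".jpg", ".png")); // prints true
    }
}
```

Only the path is inspected, so a query string such as ?x=1 does not affect the result.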
pNorm - Variable in class it.unimi.dsi.law.rank.PageRankPush
The norm of the SpectralRanking.rank.
position() - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
 
position() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
position() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
 
PowerSeries - Class in it.unimi.dsi.law.rank
Computes a power series on a graph using a parallel implementation.
PowerSeries(ImmutableGraph, int, Logger) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
Creates a new instance.
PowerSeries(ImmutableGraph, int) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
Creates a new instance.
PowerSeries(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
Creates a new instance.
PowerSeries(ImmutableGraph) - Constructor for class it.unimi.dsi.law.rank.PowerSeries
Creates a new instance.
pr(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
Precision - Class in it.unimi.dsi.law.util
A set of convenience methods to manipulate precision of doubles.
Precision() - Constructor for class it.unimi.dsi.law.util.Precision
 
preference - Variable in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
The preference vector to be used (or null if the uniform preference vector should be used).
preference - Variable in class it.unimi.dsi.law.rank.PageRank
The preference vector to be used (or null if the uniform preference vector should be used).
preference - Variable in class it.unimi.dsi.law.rank.PowerSeries
The preference vector to be used (or null if the uniform preference vector should be used).
prevCharIsCR - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
prevCharIsLF - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
previousRank - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
The rank vector after the last iteration (only meaningful after at least one step).
previousRank - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
The rank vector at the end of the last round.
previousRank - Variable in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
The rank vector after the last iteration (only meaningful after at least one step).
previousRank - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
The rank vector after the last iteration (only meaningful after at least one step).
previousRank - Variable in class it.unimi.dsi.law.rank.PowerSeries
The approximation obtained after the last iteration (only meaningful after at least one step).
progressLogger - Variable in class it.unimi.dsi.law.rank.PageRankPush
A progress logger.
pseudoRank - Variable in class it.unimi.dsi.law.rank.PageRankGaussSeidel
If true, an everywhere zero dangling-node distribution will be simulated, resulting in the computation of a pseudorank.
pseudoRank - Variable in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
If true, an everywhere zero dangling-node distribution will be simulated, resulting in the computation of a pseudorank.
put(String, String) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse.HeaderMap
 

Q

QUADRATIC_WEIGHER - Static variable in class it.unimi.dsi.law.stat.WeightedTau
A quadratic weigher.
queueIsEmpty() - Method in class it.unimi.dsi.law.rank.PageRankPush
 

R

random(ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
random - Variable in class it.unimi.dsi.law.warc.util.AbstractHttpResponse
A high-quality pseudorandom generator to generate UUIDs.
rank(ImmutableGraph, double[]) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
rank - Variable in class it.unimi.dsi.law.rank.SpectralRanking
The current rank vector.
read() - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
 
read(byte[], int, int) - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
 
read(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
 
read(byte[], int, int) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
read(byte[]) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
read() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
read() - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
 
read(byte[], int, int) - Method in class it.unimi.dsi.law.warc.io.MeasurableSequenceInputStream
 
read(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
A method to read a record from an InputStream.
readANVLHeaders(MeasurableInputStream, Map<String, String>, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
Parses headers from the given stream.
readBytes() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
The number of bytes read so far from the underlying stream.
readChar() - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Read a character.
readHeaderLine(InputStream, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns a byte array from an (unchunked) input stream.
readStatusLine(MeasurableInputStream, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
 
recordId - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
The warc record id.
recordType - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
The warc record-type.
refArray - Variable in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
The reference array.
ReInit(InputStream) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Reinitialise.
ReInit(InputStream, String) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Reinitialise.
ReInit(Reader) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Reinitialise.
ReInit(FilterParserTokenManager) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Reinitialise.
ReInit(SimpleCharStream) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Reinitialise parser.
ReInit(SimpleCharStream, int) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Reinitialise parser.
ReInit(Reader, int, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(Reader, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(Reader) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(InputStream, String, int, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(InputStream, int, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(InputStream, String) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(InputStream) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(InputStream, String, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
ReInit(InputStream, int, int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Reinitialise.
RemappedStringMap - Class in it.unimi.dsi.law.warc.util
A StringMap that remaps values returned by another StringMap.
RemappedStringMap(StringMap<? extends CharSequence>, int[]) - Constructor for class it.unimi.dsi.law.warc.util.RemappedStringMap
Creates a new remapped minimal perfect hash.
remove(T) - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
Removes a bucket.
remove() - Method in class it.unimi.dsi.law.warc.io.HttpResponseFilteredIterator
 
remove() - Method in class it.unimi.dsi.law.warc.io.WarcFilteredIterator
 
RemoveHubs - Class in it.unimi.dsi.law.graph
Removes nodes from a graph following a number of strategies.
RemoveHubs() - Constructor for class it.unimi.dsi.law.graph.RemoveHubs
 
replicae - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
Maps points in the unit interval to buckets.
REPLICAE_PER_BUCKET - Static variable in class it.unimi.dsi.law.util.ConsistentHashFunction
Each bucket is replicated this number of times.
reset() - Method in class it.unimi.dsi.law.util.KahanSummation
Resets the current value and correction to zero.
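The "current value and correction" mentioned for KahanSummation.reset() are the two pieces of state kept by compensated (Kahan) summation. The following minimal sketch shows the standard algorithm; the class KahanSketch and its field and method names are assumptions for this example and are not taken from the library source.

```java
final class KahanSketch {
    private double value;      // running sum
    private double correction; // rounding error accumulated by previous additions

    /** Adds a value, compensating for the rounding error of earlier additions. */
    void add(double v) {
        double y = v - correction;    // re-inject the error lost so far
        double t = value + y;         // low-order bits of y may be lost here
        correction = (t - value) - y; // recover what was lost, with opposite sign
        value = t;
    }

    /** Returns the current sum. */
    double value() {
        return value;
    }

    /** Resets the current value and correction to zero. */
    void reset() {
        value = 0;
        correction = 0;
    }

    public static void main(String[] args) {
        KahanSketch sum = new KahanSketch();
        sum.add(1.0);
        for (int i = 0; i < 10; i++) sum.add(1e-16); // each term alone is too small to change 1.0
        System.out.println(sum.value() > 1.0);       // prints true
    }
}
```

A plain running sum would stay at exactly 1.0 in the example above, because each 1e-16 term is below half a unit in the last place of 1.0; the correction term lets the small contributions accumulate.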
resetRead() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
 
resetRead() - Method in class it.unimi.dsi.law.warc.io.WarcRecord
A method to allow the reuse of the present object for non-consecutive reads.
residual - Variable in class it.unimi.dsi.law.rank.PageRankPush
The vector r (the role of p is covered by SpectralRanking.rank).
response(HttpResponse) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
Sets the response whose headers will be wrapped by this map.
Response - Interface in it.unimi.dsi.law.warc.util
Provides high level access to WARC records with record-type equal to response.
rewind() - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Rewinds this stream.
root - Variable in class it.unimi.dsi.law.rank.PageRankPush
The node where the preference vector is concentrated.
run(FastBufferedInputStream, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.CompressWarc
This method reads from a given input stream a sequence of uncompressed WARC records and writes to a given output stream a compressed version of them.
run(FastBufferedInputStream, RandomAccessFile, boolean, boolean, long[], int, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.CutWarc
 
run(FastBufferedInputStream, boolean, Filter<HttpResponse>, PrintWriter) - Static method in class it.unimi.dsi.law.warc.tool.ExtractDigestUrls
 
run(FastBufferedInputStream, boolean, Filter<HttpResponse>, PrintWriter, StringMap<? extends CharSequence>, StringMap<? extends CharSequence>) - Static method in class it.unimi.dsi.law.warc.tool.ExtractLinks
Extracts links from a WARC file.
run(FastBufferedInputStream, boolean, Filter<WarcRecord>, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.GrepWarc
This method acts as a sort of "grep" for WARC files.
run(FastBufferedInputStream, SummaryStats, SummaryStats, SummaryStats) - Static method in class it.unimi.dsi.law.warc.tool.GZWarcStats
 
run(FastBufferedInputStream, boolean, OutputStream) - Static method in class it.unimi.dsi.law.warc.tool.IndexWarc
This method reads from a given input stream a sequence of WARC records and writes to a given output stream the byte offset of the read records.
run(FastBufferedInputStream, PrintWriter) - Static method in class it.unimi.dsi.law.warc.tool.ListGZWarcComments
Writes to the given writer the GZip header comment field.
run(String, String, StringMap<? extends CharSequence>, String, int) - Static method in class it.unimi.dsi.law.warc.util.RemappedStringMap
 

S

salsa - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
Compute the SALSA score (only for historical and testing reasons: please use the Salsa class instead).
scale - Variable in class it.unimi.dsi.law.rank.PowerSeries
The overall scaling that has been applied to the current approximation.
scale(double) - Method in class it.unimi.dsi.law.vector.DenseVector
 
scale(double) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
scale(double) - Method in class it.unimi.dsi.law.vector.Vector
Scales values in this vector by a value alpha.
schemeAndAuthority(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Returns the concatenated URI.getScheme() and raw authority of a BUbiNG URL.
schemeAndAuthority(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Extracts the scheme and authority of an absolute BUbiNG URL in its byte-array representation.
SchemeEquals - Class in it.unimi.dsi.law.warc.filters
Accepts only URIs whose scheme equals a certain string (typically, http).
SchemeEquals(String) - Constructor for class it.unimi.dsi.law.warc.filters.SchemeEquals
Creates a filter that only accepts URIs with a given scheme.
seen2Node - Variable in class it.unimi.dsi.law.rank.PageRankPush
A map from seen-order to nodes.
SequentialHttpResponseRead - Class in it.unimi.dsi.law.warc.io.examples
 
SequentialHttpResponseRead() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialHttpResponseRead
 
SequentialHttpResponseWrite - Class in it.unimi.dsi.law.warc.io.examples
 
SequentialHttpResponseWrite() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialHttpResponseWrite
 
SequentialWarcRecordRead - Class in it.unimi.dsi.law.warc.io.examples
 
SequentialWarcRecordRead() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialWarcRecordRead
 
SequentialWarcRecordWrite - Class in it.unimi.dsi.law.warc.io.examples
 
SequentialWarcRecordWrite() - Constructor for class it.unimi.dsi.law.warc.io.examples.SequentialWarcRecordWrite
 
set(int, double) - Method in class it.unimi.dsi.law.vector.DenseVector
 
set(int, double) - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
set(int, double) - Method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
 
set(int, double) - Method in class it.unimi.dsi.law.vector.Vector
Sets the value val at index idx.
set(URI, HttpResponse) - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
Sets the response wrapped by this instance.
setDebugStream(PrintStream) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Set debug output.
setInput(InputStream, long, CRC32) - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
Resets the bounded stream fields, for reusing it.
setInput(InputStream, long) - Method in class it.unimi.dsi.law.warc.io.BoundedCountingInputStream
Resets the bounded stream fields, for reusing it.
SetLinkReceiver() - Constructor for class it.unimi.dsi.law.warc.parser.HTMLParser.SetLinkReceiver
 
setTabSize(int) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
shift - Variable in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
A shift.
shift - Variable in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
A shift.
shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.PageRankPush.EmptyQueueStoppingCritertion
 
shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.PageRankPush.L1NormStoppingCritertion
 
shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.SpectralRanking.IterationNumberStoppingCriterion
 
shouldStop(SpectralRanking) - Method in class it.unimi.dsi.law.rank.SpectralRanking.NormStoppingCriterion
 
shouldStop(SpectralRanking) - Method in interface it.unimi.dsi.law.rank.SpectralRanking.StoppingCriterion
Determines if the computation should be stopped.
similarity(Vector, Vector) - Method in class it.unimi.dsi.law.vector.CosineSimilarityStrategy
 
similarity(Vector, Vector) - Method in class it.unimi.dsi.law.vector.EuclideanSimilarityStrategy
 
similarity(Vector, Vector) - Method in interface it.unimi.dsi.law.vector.SimilarityStrategy
Returns the similarity value between two vectors.
SimilarityStrategy - Interface in it.unimi.dsi.law.vector
An interface specifying methods used to obtain pattern similarities.
SimpleCharStream - Class in it.unimi.dsi.law.warc.filters.parser
An implementation of interface CharStream, where the stream is assumed to contain only ASCII characters (without unicode processing).
SimpleCharStream(Reader, int, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(Reader, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(Reader) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(InputStream, String, int, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(InputStream, int, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(InputStream, String, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(InputStream, int, int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(InputStream, String) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
SimpleCharStream(InputStream) - Constructor for class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Constructor.
size() - Method in class it.unimi.dsi.law.stat.WeightedTau.AbstractWeigher
 
size - Variable in class it.unimi.dsi.law.vector.Vector
The vector size (immutable).
size() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse.HttpResponseHeaderMap
 
size() - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
 
size64() - Method in class it.unimi.dsi.law.warc.util.RemappedStringMap
 
sizes - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
For each bucket, its size.
skip(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
 
skip(long) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
 
skip(FastBufferedInputStream) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
A method to skip a record from an InputStream.
skipStrategy - Variable in class it.unimi.dsi.law.util.ConsistentHashFunction
The optional strategy to skip buckets, or null.
specialToken - Variable in class it.unimi.dsi.law.warc.filters.parser.Token
This field is used to access special tokens that occur prior to this token, but after the immediately preceding regular (non-special) token.
SpectralRanking - Class in it.unimi.dsi.law.rank
A base abstract class defining methods and attributes supporting computations of graph spectral rankings such as the dominant eigenvector, PageRank or Katz's index.
SpectralRanking(ImmutableGraph, Logger) - Constructor for class it.unimi.dsi.law.rank.SpectralRanking
Creates a new instance.
SpectralRanking.IterationNumberStoppingCriterion - Class in it.unimi.dsi.law.rank
A stopping criterion that stops whenever the number of iterations exceeds a given bound.
SpectralRanking.NormStoppingCriterion - Class in it.unimi.dsi.law.rank
A stopping criterion that evaluates SpectralRanking.normDelta(), and stops if this value is smaller than a given threshold.
SpectralRanking.StoppingCriterion - Interface in it.unimi.dsi.law.rank
A strategy that decides when a computation should be stopped.
standardFilters() - Static method in class it.unimi.dsi.law.warc.filters.Filters
Returns a list of the standard filter classes.
start() - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Parser.
startDocument() - Method in class it.unimi.dsi.law.warc.parser.Digester
 
startElement(Element, Map<Attribute, MutableString>) - Method in class it.unimi.dsi.law.warc.parser.Digester
 
STATIC_LEXER_ERROR - Static variable in error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
An attempt was made to create a second instance of a static token manager.
staticFlag - Static variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
Whether parser is static.
status() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
 
status() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
Returns the response status.
status() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
 
StatusCategory - Class in it.unimi.dsi.law.warc.filters
Accepts only fetched responses whose status category (status/100) has a certain value.
StatusCategory(int) - Constructor for class it.unimi.dsi.law.warc.filters.StatusCategory
Creates a filter that only accepts responses of the given category.
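The status category mentioned above is the status code divided by 100 using integer division, so 200 and 204 both fall in category 2. The following is a minimal, self-contained sketch of that arithmetic only; StatusCategorySketch and its methods are hypothetical names for illustration and are not part of it.unimi.dsi.law.warc.filters.

```java
// Hypothetical helper illustrating the "status/100" rule; not the library's filter class.
final class StatusCategorySketch {
    /** Returns the status category: the status code divided by 100 (integer division). */
    static int category(int status) {
        return status / 100;
    }

    /** Returns true if the given status code falls in the given category. */
    static boolean accepts(int status, int category) {
        return category(status) == category;
    }

    public static void main(String[] args) {
        // A 2xx code is accepted for category 2, a 4xx code is not.
        System.out.println(accepts(200, 2));
        System.out.println(accepts(404, 2));
    }
}
```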
statusLine() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
 
statusLine() - Method in interface it.unimi.dsi.law.warc.util.HttpResponse
Returns the response status line.
statusLine - Variable in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
The status line of this response.
statusLine() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
 
statusLine(StatusLine) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
Sets the status line.
step() - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
 
step() - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
 
step() - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
 
step() - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
 
step() - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
 
step() - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
 
step() - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
 
step() - Method in class it.unimi.dsi.law.rank.PageRankPush
 
step() - Method in class it.unimi.dsi.law.rank.PowerSeries
 
step() - Method in class it.unimi.dsi.law.rank.SpectralRanking
Performs one computation step.
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.DominantEigenvectorParallelPowerMethod
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.KatzParallelGaussSeidel
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.LeftSingularVectorParallelPowerMethod
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankGaussSeidel
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankParallelGaussSeidel
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankParallelPowerSeries
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PageRankPowerSeries
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.PowerSeries
 
stepUntil(SpectralRanking.StoppingCriterion) - Method in class it.unimi.dsi.law.rank.SpectralRanking
Calls SpectralRanking.init() and steps until a given stopping criterion is met.
STOCHASTIC_TOLERANCE - Static variable in class it.unimi.dsi.law.rank.SpectralRanking
The admitted tolerance in the verification that a vector is a stochastic one.
store(ImmutableGraph, ImmutableGraph, double[], int[], String, String) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
stronglyPreferential - Variable in class it.unimi.dsi.law.rank.PageRank
Decides whether we use the strongly or weakly (the default) preferential algorithm.
subjectUri - Variable in class it.unimi.dsi.law.warc.io.WarcRecord.Header
The warc subject-uri.
subSequence(int, int) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
 
subset - Variable in class it.unimi.dsi.law.rank.PageRankPowerSeries
If not null, the subset of nodes over which the derivatives should be computed.
SwitchTo(int) - Method in class it.unimi.dsi.law.warc.filters.parser.FilterParserTokenManager
Switch to specified lex state.
symPageRank(ImmutableGraph, ImmutableGraph) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 

T

tabSize - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
Text2DataOutput - Class in it.unimi.dsi.law.io.tool
The main method of this class converts a text file containing numbers to binary DataOutput format.
threshold - Variable in class it.unimi.dsi.law.rank.PageRankPush
The threshold for stopping.
THRESHOLD - Static variable in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
The number of zeroes that must appear to cause the page to be considered probably binary.
toBitStream(OutputBitStream, int) - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
 
toByteArray(URI) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Returns an ASCII byte-array representation of a BUbiNG URL.
toHexString(byte[]) - Static method in class it.unimi.dsi.law.warc.util.Util
Returns a mutable string representing in hexadecimal a digest.
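As an illustration of what a hexadecimal rendering of a digest looks like, here is a minimal sketch that emits two hex characters per byte. It is not the library's implementation: HexSketch and toHex are hypothetical names, the use of StringBuilder as the mutable string and the choice of lowercase digits are assumptions.

```java
// Hypothetical sketch; the real Util.toHexString may differ in return type and letter case.
final class HexSketch {
    private static final char[] HEX = "0123456789abcdef".toCharArray();

    /** Renders each byte of the digest as two hexadecimal characters (high nibble first). */
    static StringBuilder toHex(byte[] digest) {
        StringBuilder sb = new StringBuilder(digest.length * 2);
        for (byte b : digest) {
            sb.append(HEX[(b >> 4) & 0xF]); // high four bits
            sb.append(HEX[b & 0xF]);        // low four bits
        }
        return sb;
    }

    public static void main(String[] args) {
        System.out.println(toHex(new byte[] { 0x00, (byte) 0xff, 0x1a }));
    }
}
```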
toImmutableSparseVector() - Method in class it.unimi.dsi.law.vector.Int2DoubleMapVector
 
token - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Current token.
Token - Class in it.unimi.dsi.law.warc.filters.parser
Describes the input token stream.
Token() - Constructor for class it.unimi.dsi.law.warc.filters.parser.Token
No-argument constructor
Token(int) - Constructor for class it.unimi.dsi.law.warc.filters.parser.Token
Constructs a new token for the specified Image.
Token(int, String) - Constructor for class it.unimi.dsi.law.warc.filters.parser.Token
Constructs a new token for the specified Image and Kind.
token_source - Variable in class it.unimi.dsi.law.warc.filters.parser.FilterParser
Generated Token Manager.
tokenImage - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
Literal token values.
tokenImage - Variable in exception it.unimi.dsi.law.warc.filters.parser.ParseException
This is a reference to the "tokenImage" array of the generated parser within which the parse error occurred.
TokenMgrError - Error in it.unimi.dsi.law.warc.filters.parser
Token Manager Error.
TokenMgrError() - Constructor for error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
No arg constructor.
TokenMgrError(String, int) - Constructor for error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
Constructor with message and reason.
TokenMgrError(boolean, int, int, int, String, int, int) - Constructor for error it.unimi.dsi.law.warc.filters.parser.TokenMgrError
Full Constructor.
toSpec() - Method in class it.unimi.dsi.law.webgraph.CompressedIntLabel
 
toString(byte[]) - Static method in class it.unimi.dsi.law.bubing.util.BURL
Deprecated.
Returns an ASCII byte-array representation of a BUbiNG URL.
toString() - Method in class it.unimi.dsi.law.util.ConsistentHashFunction
 
toString(Object...) - Method in class it.unimi.dsi.law.warc.filters.AbstractFilter
A helper method that generates a string version of this filter (mainly useful for atomic, i.e., class-based, filters).
toString() - Method in class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
 
toString() - Method in class it.unimi.dsi.law.warc.filters.DigestEquals
 
toString() - Method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
 
toString() - Method in class it.unimi.dsi.law.warc.filters.HostEndsWith
 
toString() - Method in class it.unimi.dsi.law.warc.filters.HostEquals
 
toString() - Method in class it.unimi.dsi.law.warc.filters.IsHttpResponse
 
toString() - Method in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
 
toString() - Method in class it.unimi.dsi.law.warc.filters.parser.Token
Returns the image.
toString() - Method in class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
 
toString() - Method in class it.unimi.dsi.law.warc.filters.SchemeEquals
 
toString() - Method in class it.unimi.dsi.law.warc.filters.StatusCategory
 
toString() - Method in class it.unimi.dsi.law.warc.filters.URLEquals
 
toString() - Method in class it.unimi.dsi.law.warc.filters.URLMatchesRegex
 
toString() - Method in class it.unimi.dsi.law.warc.filters.URLShorterThan
 
toString() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
 
toString() - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
 
toString() - Method in class it.unimi.dsi.law.warc.io.WarcRecord.Header
 
toString() - Method in class it.unimi.dsi.law.warc.io.WarcRecord
 
toString() - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
 
toWarcRecord(WarcRecord) - Method in class it.unimi.dsi.law.warc.util.AbstractHttpResponse
Populates a WARC record with contents from this response.
trackLineColumn - Variable in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
TRUE - Static variable in class it.unimi.dsi.law.warc.filters.Filters
The constantly true filter.
TRUE - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
truncate(double, int) - Static method in class it.unimi.dsi.law.util.Precision
Truncates the given double value to the given number of fractional binary digits.
truncate(double[], int) - Static method in class it.unimi.dsi.law.util.Precision
Applies Precision.truncate(double, int) to the given array.
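Truncating to a number of fractional binary digits can be pictured as scaling by a power of two, dropping the fractional part, and scaling back. The sketch below works under stated assumptions: TruncateSketch is a hypothetical class, truncation is taken to be toward zero, and the array variant is assumed to work in place. The actual behavior of Precision.truncate should be checked against its own documentation.

```java
// Hypothetical sketch of truncation to k fractional binary digits; not the library's code.
final class TruncateSketch {
    /** Keeps only the first {@code digits} fractional binary digits of {@code value} (toward zero). */
    static double truncate(double value, int digits) {
        double scale = Math.scalb(1.0, digits); // 2^digits
        double scaled = value * scale;
        double whole = scaled < 0 ? Math.ceil(scaled) : Math.floor(scaled);
        return whole / scale;
    }

    /** Applies the scalar truncation to every element, in place (an assumption), and returns the array. */
    static double[] truncate(double[] values, int digits) {
        for (int i = 0; i < values.length; i++) {
            values[i] = truncate(values[i], digits);
        }
        return values;
    }

    public static void main(String[] args) {
        // 0.8125 is 0.1101 in binary; keeping two fractional digits leaves 0.11, i.e. 0.75.
        System.out.println(truncate(0.8125, 2));
    }
}
```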
truncate(long) - Method in class it.unimi.dsi.law.warc.io.InspectableBufferedInputStream
Truncates the overflow file to a given size.

U

uncompressedSkipLength - Variable in class it.unimi.dsi.law.warc.io.GZWarcRecord.GZHeader
The uncompressed-skip-length warc-required extra gzip field.
UpdateLineColumn(char) - Method in class it.unimi.dsi.law.warc.filters.parser.SimpleCharStream
 
upHeap(double[], int[], int[], int) - Static method in class it.unimi.dsi.law.rank.PageRankPush.IntHeapIndirectPriorityQueue
 
uri() - Method in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
 
uri - Variable in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
The URI that is currently contained in this response.
uri() - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
 
uri(URI) - Method in class it.unimi.dsi.law.warc.util.MetadataHttpResponse
Sets the URI.
uri() - Method in interface it.unimi.dsi.law.warc.util.Response
Returns the URI associated with this response.
url(ImmutableGraph, FastBufferedReader) - Static method in class it.unimi.dsi.law.graph.RemoveHubs
 
url(URI) - Method in class it.unimi.dsi.law.warc.parser.Digester
Sets the URI that will be used to tune the next digest.
url - Variable in class it.unimi.dsi.law.warc.util.HttpComponentsHttpResponse
URLEquals - Class in it.unimi.dsi.law.warc.filters
Accepts only a given URI.
URLEquals(String) - Constructor for class it.unimi.dsi.law.warc.filters.URLEquals
Creates a filter that only accepts URIs equal to a given URI.
URLMatchesRegex - Class in it.unimi.dsi.law.warc.filters
Accepts only URIs that match a certain regular expression.
URLMatchesRegex(String) - Constructor for class it.unimi.dsi.law.warc.filters.URLMatchesRegex
Creates a filter that only accepts URLs matching a given regular expression.
URLShorterThan - Class in it.unimi.dsi.law.warc.filters
Accepts only URIs whose overall length is below a given threshold.
URLShorterThan(int) - Constructor for class it.unimi.dsi.law.warc.filters.URLShorterThan
Creates a filter that only accepts URLs shorter than the given threshold.
USE_POSITION_INSTEAD_OF_SKIP - Static variable in class it.unimi.dsi.law.warc.io.GZWarcRecord
Tells what method to use to skip bytes in the input stream.
USE_POSITION_INSTEAD_OF_SKIP - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
Tells what method to use to skip bytes in the input stream.
Util - Class in it.unimi.dsi.law
A static container of utility methods for all LAW software.
Util() - Constructor for class it.unimi.dsi.law.Util
 
Util - Class in it.unimi.dsi.law.warc.util
Static utility methods.
UUID_FIELD_NAME - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
Some constant strings in their byte equivalent.

V

value() - Method in class it.unimi.dsi.law.util.KahanSummation
Returns the sum computed so far.
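For readers unfamiliar with the technique, Kahan (compensated) summation keeps a running correction term that recovers low-order bits lost in each addition. The sketch below shows the textbook algorithm with an add/value pair mirroring the method names in this index; KahanSketch is a hypothetical class and is not claimed to match the internals of it.unimi.dsi.law.util.KahanSummation.

```java
// Hypothetical sketch of textbook Kahan summation; not the library's implementation.
final class KahanSketch {
    private double sum;          // running sum
    private double compensation; // accumulated rounding error of previous additions

    /** Adds a value, correcting for the rounding error of earlier additions. */
    void add(double v) {
        double y = v - compensation;   // apply the pending correction
        double t = sum + y;            // low-order bits of y may be lost here
        compensation = (t - sum) - y;  // recover what was lost
        sum = t;
    }

    /** Returns the sum computed so far. */
    double value() {
        return sum;
    }

    public static void main(String[] args) {
        KahanSketch k = new KahanSketch();
        k.add(1.0);
        // Each term is too small to change 1.0 on its own in plain double addition.
        for (int i = 0; i < 1_000_000; i++) k.add(1e-16);
        System.out.println(k.value());
    }
}
```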
value - Variable in class it.unimi.dsi.law.vector.ImmutableSparseVector
An array containing the vector values.
valueOf(String) - Static method in enum it.unimi.dsi.law.io.tool.DataInput2Text.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum it.unimi.dsi.law.util.Norm
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.ContentTypeStartsWith
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.DigestEquals
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.DuplicateSegmentsLessThan
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.HostEndsWith
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.HostEquals
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.IsHttpResponse
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.IsProbablyBinary
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.PathEndsWithOneOf
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.SchemeEquals
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.StatusCategory
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.URLEquals
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.URLMatchesRegex
 
valueOf(String) - Static method in class it.unimi.dsi.law.warc.filters.URLShorterThan
 
valueOf(String) - Static method in enum it.unimi.dsi.law.warc.io.InspectableBufferedInputStream.State
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.ContentType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.RecordType
Returns the enum constant of this type with the specified name.
values() - Static method in enum it.unimi.dsi.law.io.tool.DataInput2Text.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum it.unimi.dsi.law.util.Norm
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum it.unimi.dsi.law.warc.io.InspectableBufferedInputStream.State
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.ContentType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum it.unimi.dsi.law.warc.io.WarcRecord.RecordType
Returns an array containing the constants of this enum type, in the order they are declared.
Vector - Class in it.unimi.dsi.law.vector
A class representing a vector of double.
Vector(int, boolean, int) - Constructor for class it.unimi.dsi.law.vector.Vector
Builds a vector of the given size and sets its mutability status.

W

WARC_ID - Static variable in class it.unimi.dsi.law.warc.io.WarcRecord
Some constant strings in their byte equivalent.
WarcFilteredIterator - Class in it.unimi.dsi.law.warc.io
A class to iterate over WARC files getting only records that satisfy a given filter.
WarcFilteredIterator(FastBufferedInputStream, WarcRecord, Filter<WarcRecord>, ProgressLogger) - Constructor for class it.unimi.dsi.law.warc.io.WarcFilteredIterator
Builds the filtered iterator.
WarcFilteredIterator(FastBufferedInputStream, WarcRecord, Filter<WarcRecord>) - Constructor for class it.unimi.dsi.law.warc.io.WarcFilteredIterator
Builds the filtered iterator.
WarcHttpResponse - Class in it.unimi.dsi.law.warc.util
An AbstractHttpResponse implementation that reads the response content from a WARC record (via the WarcHttpResponse.fromWarcRecord(WarcRecord) method).
WarcHttpResponse() - Constructor for class it.unimi.dsi.law.warc.util.WarcHttpResponse
 
WarcRecord - Class in it.unimi.dsi.law.warc.io
A class to read/write WARC/0.9 records (for format details, please see the WARC format specifications).
WarcRecord(byte[]) - Constructor for class it.unimi.dsi.law.warc.io.WarcRecord
Builds a warc record.
WarcRecord() - Constructor for class it.unimi.dsi.law.warc.io.WarcRecord
Builds a warc record.
WarcRecord.ContentType - Enum in it.unimi.dsi.law.warc.io
Content types.
WarcRecord.FormatException - Exception in it.unimi.dsi.law.warc.io
An exception to denote parsing errors during reads.
WarcRecord.Header - Class in it.unimi.dsi.law.warc.io
A class holding the fields of the WARC header.
WarcRecord.RecordType - Enum in it.unimi.dsi.law.warc.io
Record types.
weigh() - Method in class it.unimi.dsi.law.util.ExchangeWeigher
Computes the weight of exchanges for the current data.
WeightedTau - Class in it.unimi.dsi.law.stat
Computes the weighted τ between two score vectors.
WeightedTau() - Constructor for class it.unimi.dsi.law.stat.WeightedTau
Create an additive hyperbolic τ.
WeightedTau(Int2DoubleFunction) - Constructor for class it.unimi.dsi.law.stat.WeightedTau
Create an additive weighted τ using the specified weigher.
WeightedTau(Int2DoubleFunction, boolean) - Constructor for class it.unimi.dsi.law.stat.WeightedTau
Create an additive or multiplicative weighted τ using the specified weigher and combination strategy.
WeightedTau.AbstractWeigher - Class in it.unimi.dsi.law.stat
 
WORD - Static variable in interface it.unimi.dsi.law.warc.filters.parser.FilterParserConstants
RegularExpression Id.
wrap(byte[], int, int) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
Wraps a byte-array fragment into this byte-array character sequence.
wrap(byte[]) - Method in class it.unimi.dsi.law.warc.util.ByteArrayCharSequence
Wraps a byte array into this byte-array character sequence.
write(OutputStream) - Method in class it.unimi.dsi.law.warc.io.GZWarcRecord
 
write(OutputStream) - Method in class it.unimi.dsi.law.warc.io.WarcRecord
A method to write this record to an OutputStream.
writeANVLHeaders(OutputStream, Map<String, String>, Charset) - Static method in class it.unimi.dsi.law.warc.util.Util
Writes a (name, value) map as an ANVL segment in a given stream.

Z

zero() - Method in class it.unimi.dsi.law.vector.DenseVector
 
zero() - Method in class it.unimi.dsi.law.vector.ImmutableSparseVector
 
zero() - Method in class it.unimi.dsi.law.vector.Vector
Resets this vector to zero.
ZERO_WEIGHER - Static variable in class it.unimi.dsi.law.stat.WeightedTau
A constant zero weigher.