All Classes
Class |
Description |
AbstractArrayEncodedNgramLanguageModel<W> |
|
AbstractContextEncodedNgramLanguageModel<W> |
|
AbstractNgramLanguageModel<W> |
|
AbstractNgramMap<T> |
|
AbstractTMap<T extends java.lang.Comparable> |
Contains some limited shared functionality between Custom[type]Maps
|
AbstractTMap.Functionality<T extends java.lang.Comparable> |
|
AbstractTMap.MapType |
|
Annotations |
|
Annotations.Option |
|
Annotations.OutputParameter |
Just a fancy-pants comment.
|
Annotations.PrintMemoryCount |
Fields annotated with this annotation will have their memory usage added
to the memory usage map returned by countApproximateMemoryUsage.
|
Annotations.SkipMemoryCount |
|
ArpaLmReader<W> |
A parser for ARPA LM files.
|
ArpaLmReaderCallback<V> |
Callback that is called for each n-gram in the collection
|
ArrayEncodedCachingLmWrapper<W> |
|
ArrayEncodedDirectMappedLmCache |
A direct-mapped cache.
|
ArrayEncodedLmCache |
|
ArrayEncodedNgramLanguageModel<W> |
Top-level interface for an n-gram language model which accepts n-grams in an
array-of-integers encoding.
|
ArrayEncodedNgramLanguageModel.DefaultImplementations |
|
ArrayEncodedProbBackoffLm<W> |
Language model implementation which uses Kneser-Ney-style backoff
computation.
|
BitList |
|
BitStream |
Wraps a portion of a long[] array with iterator-like functionality over a
stream of bits.
|
BitUtils |
|
BoundedList<E> |
List which returns special boundary symbols when get() is called outside the
range of the list.
|
CompressedNgramMap<T> |
|
CompressibleProbBackoffValueContainer |
|
CompressibleValueContainer<V> |
|
CompressionUtils |
|
ComputeLogProbabilityOfTextStream |
Computes the log probability of a list of files.
|
ConfigOptions |
Stores some configuration options, with useful defaults.
|
ContextEncodedCachingLmWrapper<T> |
|
ContextEncodedDirectMappedLmCache |
|
ContextEncodedLmCache |
|
ContextEncodedNgramLanguageModel<W> |
Interface for language models which expose the internal context-encoding for
more efficient queries.
|
ContextEncodedNgramLanguageModel.DefaultImplementations |
|
ContextEncodedNgramLanguageModel.LmContextInfo |
Simple class for returning context offsets
|
ContextEncodedNgramMap<T> |
|
ContextEncodedProbBackoffLm<W> |
Language model implementation which uses Kneser-Ney-style backoff
computation.
|
Counter<E> |
A map from objects to doubles.
|
CountValueContainer |
|
CustomWidthArray |
An array with a custom word "width" in bits.
|
FirstPassCallback<V extends LongRepresentable<V>> |
Reader callback which adds n-grams to an NgramMap
|
GoogleLmReader<W> |
Reads in n-gram count collections in the format that the Google n-grams Web1T
corpus comes in.
|
HashNgramMap<T> |
|
Indexer<E extends java.lang.Comparable<E>> |
Maintains a two-way map between a set of objects and contiguous integers from
0 to the number of objects.
|
IOUtils |
Some IO utility functions.
|
Iterators |
Utilities for dealing with Iterators
|
Iterators.IteratorIterator<T> |
Wraps a two-level iteration scenario in an iterator.
|
Iterators.Transform<S,T> |
Wraps a base iterator with a transformation function.
|
KneserNeyCountValueContainer |
Stores type and token counts necessary for estimating a Kneser-Ney language
model.
|
KneserNeyCountValueContainer.KneserNeyCounts |
Warning: type counts are stored internally as 32-bit ints.
|
KneserNeyFileWritingLmReaderCallback<W> |
Class for producing a Kneser-Ney language model in ARPA format from raw text.
|
KneserNeyLmReaderCallback<W> |
Class for producing a Kneser-Ney language model in ARPA format from raw text.
|
LmReader<V,C extends LmReaderCallback<V>> |
|
LmReaderCallback<V> |
Callback that is called for each n-gram in the collection
|
LmReaders |
This class contains a number of static methods for reading/writing/estimating
n-gram language models.
|
Logger |
Basic logging singleton class.
|
Logger.CompoundLogger |
Convenience class for stringing together loggers.
|
Logger.LogInterface |
Logging interface.
|
Logger.NullLogger |
Default logging goes nowhere.
|
Logger.SystemLogger |
Logs to System.out and System.err
|
LongArray |
|
LongArray.StaticMethods |
|
LongHashSet |
Open address hash set with linear probing.
|
LongRef |
|
LongRepresentable<T> |
|
LongToIntHashMap |
Open address hash map with linear probing.
|
MakeKneserNeyArpaFromText |
Estimates a Kneser-Ney language model from raw text, and writes the language
model out in ARPA-format.
|
MakeLmBinaryFromArpa |
Given a language model in ARPA format, builds a binary representation of the
language model and writes it to disk.
|
MakeLmBinaryFromGoogle |
Given a directory in Google n-grams format, builds a binary representation of
a stupid-backoff language model and writes it to disk.
|
MakeNgramMapBinaryFromGoogle |
Like MakeLmBinaryFromGoogle, except it only writes the NgramMap
portion of the LM, meaning the binary does not contain the vocabulary.
|
MosesPhraseTable |
Experimental class for reading Moses phrase tables and storing them
efficiently in memory using a trie.
|
MosesPhraseTable.TargetSideTranslation |
|
MosesPhraseTableReader<W> |
|
MosesPhraseTableReaderCallback<W> |
Class for representing phrase tables efficiently in memory.
|
MurmurHash |
Taken/modified from
http://d3s.mff.cuni.cz/~holub/sw/javamurmurhash/MurmurHash.java
|
NgramIterableWrapper<W,V> |
Wraps an NgramMap as an Iterable, so it is easy to iterate over the n-grams
and associated values.
|
NgramLanguageModel<W> |
Base interface for an n-gram language model, which exposes only inefficient
convenience methods.
|
NgramLanguageModel.StaticMethods |
|
NgramMap<V> |
|
NgramMap.Entry<T> |
|
NgramMapAddingCallback<V> |
Reader callback which adds n-grams to an NgramMap
|
NgramMapWrapper<W,V> |
Wraps an NgramMap as a Java Map, with n-grams of all orders mixed together.
|
NgramOrderedLmReaderCallback<V> |
Callback that is called for each n-gram in the collection
|
NgramsForOrderIterableWrapper<W,V> |
Wraps an NgramMap as an Iterable, so it is easy to iterate over the n-grams
of a particular order.
|
NgramsForOrderMapWrapper<W,V> |
Wraps an NgramMap as a Java Map, but containing only n-grams of a particular order.
|
Pair<F,S> |
A generic-typed pair of objects.
|
PhraseTableCounts |
|
PhraseTableValueContainer |
Stores type and token counts necessary for estimating a Kneser-Ney language
model.
|
PhraseTableValueContainer.FeaturePhraseTableValues |
|
PhraseTableValueContainer.PhraseTableValues |
|
PhraseTableValueContainer.TargetTranslationsValues |
|
ProbBackoffPair |
|
ProbBackoffValueContainer |
|
StringWordIndexer |
Implementation of a WordIndexer in which words are represented as strings.
|
StrUtils |
|
StupidBackoffLm<W> |
Language model implementation which uses stupid backoff (Brants et al., 2007)
computation.
|
TextReader<W> |
Class for reading raw text files.
|
TIntMap<T extends java.lang.Comparable> |
Provides a map from objects to non-negative integers.
|
UncompressedProbBackoffValueContainer |
|
UnrankedUncompressedProbBackoffValueContainer |
|
ValueContainer<V> |
Manages storage of arbitrary values in an NgramMap
|
VariableLengthBitCompressor |
|
WordIndexer<W> |
Enumerates words in the vocabulary of a language model.
|
WordIndexer.StaticMethods |
|
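
Several entries above describe open addressing with linear probing. For readers unfamiliar with the technique, the following is a minimal, self-contained sketch of a long-to-int map built that way. It is an illustration only and does not reproduce the library's LongToIntHashMap: the class name LinearProbingLongToIntMap, the reserved EMPTY key (-1), the 0.5 load factor and the mixing function are all assumptions made for this example, and removal is deliberately left out.

```java
import java.util.Arrays;

/**
 * Illustrative sketch only; NOT the library's LongToIntHashMap.
 * Open-addressing map from long keys to int values with linear probing.
 */
class LinearProbingLongToIntMap {
    // Assumption for this sketch: -1 marks an empty slot and is never a valid key.
    private static final long EMPTY = -1L;

    private long[] keys;
    private int[] values;
    private int size;

    LinearProbingLongToIntMap(int initialCapacity) {
        int capacity = Math.max(4, initialCapacity);
        keys = new long[capacity];
        values = new int[capacity];
        Arrays.fill(keys, EMPTY);
    }

    /** Inserts the key, or overwrites the value if the key is already present. */
    void put(long key, int value) {
        if (key == EMPTY) throw new IllegalArgumentException("reserved key");
        // Keep the load factor at or below 0.5 so probe sequences stay short
        // and an empty slot always exists.
        if ((size + 1) * 2 > keys.length) resize();
        int slot = findSlot(keys, key);
        if (keys[slot] == EMPTY) {
            keys[slot] = key;
            size++;
        }
        values[slot] = value;
    }

    /** Returns the stored value, or defaultValue if the key is absent. */
    int get(long key, int defaultValue) {
        if (key == EMPTY) return defaultValue;
        int slot = findSlot(keys, key);
        return keys[slot] == EMPTY ? defaultValue : values[slot];
    }

    int size() {
        return size;
    }

    // Linear probing: start at the hashed slot and step forward one slot at a
    // time (wrapping around) until the key or an empty slot is found.
    private static int findSlot(long[] table, long key) {
        int slot = (int) ((mix(key) & Long.MAX_VALUE) % table.length);
        while (table[slot] != EMPTY && table[slot] != key) {
            slot = (slot + 1) % table.length;
        }
        return slot;
    }

    // Cheap bit mixer so that sequential keys spread across the table.
    private static long mix(long key) {
        key ^= key >>> 33;
        key *= 0xff51afd7ed558ccdL;
        key ^= key >>> 33;
        return key;
    }

    // Doubles the table and re-inserts every occupied slot.
    private void resize() {
        long[] oldKeys = keys;
        int[] oldValues = values;
        keys = new long[oldKeys.length * 2];
        values = new int[oldKeys.length * 2];
        Arrays.fill(keys, EMPTY);
        for (int i = 0; i < oldKeys.length; i++) {
            if (oldKeys[i] != EMPTY) {
                int slot = findSlot(keys, oldKeys[i]);
                keys[slot] = oldKeys[i];
                values[slot] = oldValues[i];
            }
        }
    }
}
```

Storing keys and values in parallel primitive arrays avoids boxing and per-entry objects, which is the usual motivation for this layout when a map holds very large numbers of entries.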