Package org.languagetool.languagemodel
Class BaseLanguageModel
- java.lang.Object
  - org.languagetool.languagemodel.BaseLanguageModel
- All Implemented Interfaces:
java.lang.AutoCloseable, LanguageModel
- Direct Known Subclasses:
LuceneLanguageModel, LuceneSingleIndexLanguageModel
public abstract class BaseLanguageModel extends java.lang.Object implements LanguageModel
The algorithm of a language model, independent of the way the data is stored (see the subclasses for that).
- Since:
- 3.2
-
Field Summary
Fields (Modifier and Type, Field, Description)
private static boolean DEBUG
private java.lang.Long totalTokenCount
-
Fields inherited from interface org.languagetool.languagemodel.LanguageModel
GOOGLE_SENTENCE_END, GOOGLE_SENTENCE_START
-
Constructor Summary
Constructors (Constructor, Description)
BaseLanguageModel()
-
Method Summary
Methods (Modifier and Type, Method, Description)
private void debug(java.lang.String message, java.lang.Object... vars)
abstract long getCount(java.lang.String token1)
    Get the occurrence count for token.
abstract long getCount(java.util.List<java.lang.String> tokens)
    Get the occurrence count for the given token sequence.
Probability getPseudoProbability(java.util.List<java.lang.String> context)
    This is not always guaranteed to be a real probability (0.0 to 1.0).
Probability getPseudoProbabilityStupidBackoff(java.util.List<java.lang.String> context)
abstract long getTotalTokenCount()
private long tryGetCount(java.util.List<java.lang.String> context)
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.languagetool.languagemodel.LanguageModel
close
-
Field Detail
-
DEBUG
private static final boolean DEBUG
- See Also:
- Constant Field Values
-
totalTokenCount
private java.lang.Long totalTokenCount
-
Method Detail
-
tryGetCount
private long tryGetCount(java.util.List<java.lang.String> context)
-
getPseudoProbabilityStupidBackoff
public Probability getPseudoProbabilityStupidBackoff(java.util.List<java.lang.String> context)
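This page gives no description for this method. Its name suggests the "stupid backoff" scoring scheme (Brants et al., 2007): use the relative frequency of the longest n-gram that has been seen, and multiply by a fixed factor each time the history is shortened. The following is a minimal sketch of that general scheme, not the implementation in this class. The class name `StupidBackoffSketch`, the in-memory count map, and the factor 0.4 are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Illustrative stupid-backoff scorer over in-memory n-gram counts (hypothetical class). */
public class StupidBackoffSketch {

    // Assumed discount per backoff step; 0.4 is the value commonly cited for this scheme.
    private static final double BACKOFF_FACTOR = 0.4;

    private final Map<List<String>, Long> counts = new HashMap<>();
    private long totalTokenCount = 0;

    /** Registers an n-gram count; unigram counts also feed the total token count. */
    public void add(List<String> ngram, long count) {
        counts.put(new ArrayList<>(ngram), count);
        if (ngram.size() == 1) {
            totalTokenCount += count;
        }
    }

    private long count(List<String> ngram) {
        return counts.getOrDefault(ngram, 0L);
    }

    /** Scores the last token of the n-gram given the tokens before it. */
    public double score(List<String> ngram) {
        double lambda = 1.0;
        List<String> current = ngram;
        while (current.size() > 1) {
            long full = count(current);
            long history = count(current.subList(0, current.size() - 1));
            if (full > 0 && history > 0) {
                // Longest seen n-gram: relative frequency, discounted by the steps taken so far.
                return lambda * full / history;
            }
            // Unseen: drop the oldest token and apply the discount.
            current = current.subList(1, current.size());
            lambda *= BACKOFF_FACTOR;
        }
        // Unigram fallback: relative frequency among all tokens.
        return totalTokenCount == 0 ? 0.0 : lambda * count(current) / totalTokenCount;
    }
}
```

The returned scores are not normalized, so they do not sum to 1 over the vocabulary; they are relative scores rather than true probabilities.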
-
getPseudoProbability
public Probability getPseudoProbability(java.util.List<java.lang.String> context)
Description copied from interface: LanguageModel
This is not always guaranteed to be a real probability (0.0 to 1.0). Throws an exception if the context is longer than the n-gram index supports.
- Specified by:
getPseudoProbability in interface LanguageModel
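Because an over-long context causes an exception, a caller may want to shorten the token list before querying. The sketch below shows one way to do that. The helper `lastTokens` and the parameter `maxNgram` are hypothetical, since this page does not state the maximum length the index supports; keeping the last tokens rather than the first is also an assumption.

```java
import java.util.List;

/** Illustrative helper for bounding the context length (hypothetical class). */
public class ContextWindowSketch {

    /**
     * Returns at most the last maxNgram tokens of the context.
     * The result is a view on the original list, not a copy.
     */
    public static List<String> lastTokens(List<String> context, int maxNgram) {
        if (maxNgram < 1) {
            throw new IllegalArgumentException("maxNgram must be at least 1: " + maxNgram);
        }
        int from = Math.max(0, context.size() - maxNgram);
        return context.subList(from, context.size());
    }
}
```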
-
getCount
public abstract long getCount(java.lang.String token1)
Get the occurrence count for token.
-
getCount
public abstract long getCount(java.util.List<java.lang.String> tokens)
Get the occurrence count for the given token sequence.
-
getTotalTokenCount
public abstract long getTotalTokenCount()
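The three abstract methods (both getCount overloads and getTotalTokenCount) are what a storage backend has to supply. The following sketch shows an in-memory backend. To stay self-contained it extends a stand-in class, `CountSource`, which only mirrors those three signatures and is not the real BaseLanguageModel; `MapCountSource` and its `add` method are likewise hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class InMemoryCountsSketch {

    /** Stand-in mirroring the three abstract signatures; not the real BaseLanguageModel. */
    public abstract static class CountSource {
        public abstract long getCount(String token1);
        public abstract long getCount(List<String> tokens);
        public abstract long getTotalTokenCount();
    }

    /** Keeps n-gram counts in a map instead of an on-disk index. */
    public static class MapCountSource extends CountSource {
        private final Map<List<String>, Long> counts = new HashMap<>();
        private long totalTokenCount;

        /** Adds to the count of an n-gram; unigrams also increase the total token count. */
        public void add(List<String> tokens, long count) {
            counts.merge(new ArrayList<>(tokens), count, Long::sum);
            if (tokens.size() == 1) {
                totalTokenCount += count;
            }
        }

        @Override
        public long getCount(String token1) {
            return getCount(Collections.singletonList(token1));
        }

        @Override
        public long getCount(List<String> tokens) {
            return counts.getOrDefault(tokens, 0L);
        }

        @Override
        public long getTotalTokenCount() {
            return totalTokenCount;
        }
    }
}
```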
-
debug
private void debug(java.lang.String message, java.lang.Object... vars)