Package edu.berkeley.nlp.lm
Class AbstractNgramLanguageModel<W>
java.lang.Object
    edu.berkeley.nlp.lm.AbstractNgramLanguageModel<W>
All Implemented Interfaces: NgramLanguageModel<W>, java.io.Serializable
Direct Known Subclasses: AbstractArrayEncodedNgramLanguageModel, AbstractContextEncodedNgramLanguageModel
public abstract class AbstractNgramLanguageModel<W> extends java.lang.Object implements NgramLanguageModel<W>, java.io.Serializable
See Also: Serialized Form
Nested Class Summary
Nested classes/interfaces inherited from interface edu.berkeley.nlp.lm.NgramLanguageModel:
NgramLanguageModel.StaticMethods
Field Summary
Fields (Modifier and Type, Field, Description)
protected int lmOrder
protected float oovWordLogProb
Fixed constant returned when computing the log probability for an n-gram whose last word is not in the vocabulary.
Constructor Summary
Constructors (Constructor, Description)
AbstractNgramLanguageModel(int lmOrder, WordIndexer<W> wordIndexer, float oovWordLogProb)
Method Summary
All Methods: Instance Methods, Concrete Methods (Modifier and Type, Method, Description)
int getLmOrder()
Maximum size of n-grams stored by the model.
WordIndexer<W> getWordIndexer()
Each LM must have a WordIndexer which assigns integer IDs to each word W in the language.
void setOovWordLogProb(float oovWordLogProb)
Sets the (log) probability for an OOV word.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface edu.berkeley.nlp.lm.NgramLanguageModel
getLogProb, scoreSentence
Constructor Detail
AbstractNgramLanguageModel
public AbstractNgramLanguageModel(int lmOrder, WordIndexer<W> wordIndexer, float oovWordLogProb)
Method Detail
getLmOrder
public int getLmOrder()
Description copied from interface: NgramLanguageModel
Maximum size of n-grams stored by the model.
Specified by: getLmOrder in interface NgramLanguageModel<W>
getWordIndexer
public WordIndexer<W> getWordIndexer()
Description copied from interface: NgramLanguageModel
Each LM must have a WordIndexer which assigns integer IDs to each word W in the language.
Specified by: getWordIndexer in interface NgramLanguageModel<W>
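The word-to-ID mapping described above can be pictured with a minimal sketch. ToyWordIndexer and all of its method names are hypothetical stand-ins written for illustration; they are not the library's WordIndexer interface, whose actual methods should be checked in its own documentation.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for illustration only; NOT the library's WordIndexer.
class ToyWordIndexer {
    private final Map<String, Integer> idByWord = new HashMap<>();
    private final List<String> wordById = new ArrayList<>();

    // Returns the existing ID for a word, or assigns the next free ID.
    int getOrAdd(String word) {
        Integer id = idByWord.get(word);
        if (id == null) {
            id = wordById.size();
            idByWord.put(word, id);
            wordById.add(word);
        }
        return id;
    }

    // Returns the word's ID, or -1 if the word was never indexed.
    int lookup(String word) {
        return idByWord.getOrDefault(word, -1);
    }

    // Inverse mapping: ID back to word.
    String wordFor(int id) {
        return wordById.get(id);
    }

    // Number of distinct words indexed so far.
    int size() {
        return wordById.size();
    }
}
```

In this sketch IDs are dense and assigned in first-seen order, which is what lets a model store n-grams as integer arrays rather than as word objects.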
setOovWordLogProb
public void setOovWordLogProb(float oovWordLogProb)
Description copied from interface: NgramLanguageModel
Sets the (log) probability for an OOV word. Note that this is in general different from the log probability of the unk tag.
Specified by: setOovWordLogProb in interface NgramLanguageModel<W>
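The distinction between the OOV constant and the unk tag can be illustrated with a small self-contained sketch. ToyOovModel is a hypothetical class, not part of the library: it ignores context entirely and scores only the last word, and the "<unk>" string and all numeric values are assumptions chosen for the example.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical toy model for illustration only; NOT a subclass of
// AbstractNgramLanguageModel. It looks at the last word of the n-gram only.
class ToyOovModel {
    private final Map<String, Float> logProbByWord = new HashMap<>();
    private float oovWordLogProb;

    ToyOovModel(float oovWordLogProb) {
        this.oovWordLogProb = oovWordLogProb;
    }

    // Registers an in-vocabulary word with its log probability.
    void put(String word, float logProb) {
        logProbByWord.put(word, logProb);
    }

    // Changes the constant returned for out-of-vocabulary last words.
    void setOovWordLogProb(float oovWordLogProb) {
        this.oovWordLogProb = oovWordLogProb;
    }

    // Returns the stored log probability of the last word, or the fixed
    // OOV constant when the last word is not in the vocabulary.
    float getLogProb(List<String> ngram) {
        String last = ngram.get(ngram.size() - 1);
        Float logProb = logProbByWord.get(last);
        return logProb == null ? oovWordLogProb : logProb;
    }
}
```

Here a model built with `new ToyOovModel(-100.0f)` and an entry `put("<unk>", -5.0f)` would score an n-gram ending in the literal token "<unk>" with the stored value, while an n-gram ending in a word that was never registered falls back to the OOV constant.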