Class LuceneLanguageModel

    • Constructor Detail

      • LuceneLanguageModel

        public LuceneLanguageModel​(java.io.File topIndexDir)
        Parameters:
        topIndexDir - a directory which contains either: 1) sub directories called 1grams, 2grams, 3grams, which are Lucene indexes with ngram occurrences as created by org.languagetool.dev.FrequencyIndexCreator or 2) sub directories index-1, index-2 etc that contain the sub directories described under 1)
    • Method Detail

      • validateDirectory

        public static void validateDirectory​(java.io.File topIndexDir)
      • getSubDirectoriesOrNull

        @Nullable
        private static @Nullable java.io.File[] getSubDirectoriesOrNull​(java.io.File topIndexDir)
      • getCount

        public long getCount​(java.util.List<java.lang.String> tokens)
        Description copied from class: BaseLanguageModel
        Get the occurrence count for the given token sequence.
        Specified by:
        getCount in class BaseLanguageModel
      • close

        public void close()
      • toString

        public java.lang.String toString()
        Overrides:
        toString in class java.lang.Object