Class RuleCreator


  • public class RuleCreator
    extends java.lang.Object
    Takes the output of HomophoneOccurrenceDumper and automatically writes the error probabilities of variants (e.g. "0.977 public hair") when XML_MODE = false or writes XML rules when XML_MODE = true.
    Since:
    2.8
    • Field Detail

      • ngramToOccurrence

        private final java.util.Map<java.lang.String,​java.lang.Long> ngramToOccurrence
      • wordTokenizer

        private final org.languagetool.tokenizers.WordTokenizer wordTokenizer
      • minErrorProb

        private final float minErrorProb
      • ruleCount

        private int ruleCount
      • tokenFilteredRules

        private int tokenFilteredRules
      • probFilteredRules

        private int probFilteredRules
    • Constructor Detail

      • RuleCreator

        public RuleCreator​(float minErrorProb)
    • Method Detail

      • run

        private void run​(java.io.File homophoneOccurrences,
                         java.lang.String homophonePath)
                  throws java.io.IOException
        Throws:
        java.io.IOException
      • printRule

        private void printRule​(RuleCreator.OccurrenceInfo occurrenceInfo,
                               java.lang.String[] parts,
                               java.lang.String variant)
      • initMaps

        private void initMaps​(java.io.File homophoneOccurrenceFile)
                       throws java.io.FileNotFoundException
        Throws:
        java.io.FileNotFoundException
      • main

        public static void main​(java.lang.String[] args)
                         throws java.io.IOException
        Throws:
        java.io.IOException