Class SRXSentenceTokenizer

    • Field Detail

      • srxDocument

        private final net.loomchild.segment.srx.SrxDocument srxDocument
      • language

        private final Language language
      • parCode

        private java.lang.String parCode
    • Constructor Detail

      • SRXSentenceTokenizer

        public SRXSentenceTokenizer​(Language language)
        Build a sentence tokenizer based on the rules in the segment.srx file that comes with LanguageTool.
      • SRXSentenceTokenizer

        public SRXSentenceTokenizer​(Language language,
                                    java.lang.String srxInClassPath)
        Parameters:
        srxInClassPath - the path to an SRX file in the classpath
        Since:
        3.2
    • Method Detail

      • setSingleLineBreaksMarksParagraph

        public final void setSingleLineBreaksMarksParagraph​(boolean lineBreakParagraphs)
        Specified by:
        setSingleLineBreaksMarksParagraph in interface SentenceTokenizer
        Parameters:
        lineBreakParagraphs - if true, single lines breaks are assumed to end a paragraph; if false, only two ore more consecutive line breaks end a paragraph