Class FastTextIterator

  • All Implemented Interfaces:
    java.util.Iterator<java.lang.String>, TextIterator

    public class FastTextIterator
    extends AbstractTextIterator
    Represents fast text iterator that splits text according to SRX rules.
    • Constructor Summary

      Constructors 
      Constructor Description
      FastTextIterator​(SrxDocument document, java.lang.String languageCode, java.io.Reader reader)
      Creates streaming text iterator with no additional parameters.
      FastTextIterator​(SrxDocument document, java.lang.String languageCode, java.io.Reader reader, java.util.Map<java.lang.String,​java.lang.Object> parameterMap)
      Creates streaming text iterator that obtains language rules form given document using given language code.
      FastTextIterator​(SrxDocument document, java.lang.String languageCode, java.lang.CharSequence text)
      Creates text iterator with no additional parameters.
      FastTextIterator​(SrxDocument document, java.lang.String languageCode, java.lang.CharSequence text, java.util.Map<java.lang.String,​java.lang.Object> parameterMap)
      Creates text iterator that obtains language rules form given document using given language code.
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      boolean hasNext()
      java.lang.String next()
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
      • Methods inherited from interface java.util.Iterator

        forEachRemaining
    • Field Detail

      • text

        private java.lang.CharSequence text
      • segment

        private java.lang.String segment
      • startPosition

        private int startPosition
      • endPosition

        private int endPosition
    • Constructor Detail

      • FastTextIterator

        public FastTextIterator​(SrxDocument document,
                                java.lang.String languageCode,
                                java.lang.CharSequence text,
                                java.util.Map<java.lang.String,​java.lang.Object> parameterMap)
        Creates text iterator that obtains language rules form given document using given language code. To retrieve language rules calls SrxDocument.getLanguageRuleList(String). Supported parameters: SrxTextIterator.MAX_LOOKBEHIND_CONSTRUCT_LENGTH_PARAMETER.
        Parameters:
        document - document containing language rules
        languageCode - language code to select the rule
        text -
        parameterMap - additional segmentation parameters
      • FastTextIterator

        public FastTextIterator​(SrxDocument document,
                                java.lang.String languageCode,
                                java.lang.CharSequence text)
        Creates text iterator with no additional parameters.
        Parameters:
        document - document containing language rules
        languageCode - language code to select the rule
        text -
        See Also:
        FastTextIterator(SrxDocument, String, CharSequence, Map)
      • FastTextIterator

        public FastTextIterator​(SrxDocument document,
                                java.lang.String languageCode,
                                java.io.Reader reader,
                                java.util.Map<java.lang.String,​java.lang.Object> parameterMap)
        Creates streaming text iterator that obtains language rules form given document using given language code. To retrieve language rules calls SrxDocument.getLanguageRuleList(String). To handle streams uses ReaderCharSequence, so not all possible regular expressions are accepted. See ReaderCharSequence for details. Supported parameters: SrxTextIterator.BUFFER_LENGTH_PARAMETER, SrxTextIterator.MAX_LOOKBEHIND_CONSTRUCT_LENGTH_PARAMETER.
        Parameters:
        document - document containing language rules
        languageCode - language code to select the rules
        reader - reader from which text will be read
        parameterMap - additional segmentation parameters
      • FastTextIterator

        public FastTextIterator​(SrxDocument document,
                                java.lang.String languageCode,
                                java.io.Reader reader)
        Creates streaming text iterator with no additional parameters.
        Parameters:
        document - document containing language rules
        languageCode - language code to select the rules
        reader - reader from which text will be read
        See Also:
        FastTextIterator(SrxDocument, String, Reader, Map)
    • Method Detail

      • next

        public java.lang.String next()
        Returns:
        next segment in text, or null if end of text has been reached.
      • hasNext

        public boolean hasNext()
        Returns:
        true if there are more segments