Class EsperantoWordTokenizer

  • All Implemented Interfaces:
    org.languagetool.tokenizers.Tokenizer

    public class EsperantoWordTokenizer
    extends org.languagetool.tokenizers.WordTokenizer
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      java.util.List<java.lang.String> tokenize​(java.lang.String text)
      Tokenizes just like WordTokenizer with the exception that words such as "dank'" contain an apostrophe within it.
      • Methods inherited from class org.languagetool.tokenizers.WordTokenizer

        getProtocols, getTokenizingCharacters, isEMail, isUrl, joinEMails, joinEMailsAndUrls, joinUrls
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • EsperantoWordTokenizer

        public EsperantoWordTokenizer()
    • Method Detail

      • tokenize

        public java.util.List<java.lang.String> tokenize​(java.lang.String text)
        Tokenizes just like WordTokenizer with the exception that words such as "dank'" contain an apostrophe within it.
        Specified by:
        tokenize in interface org.languagetool.tokenizers.Tokenizer
        Overrides:
        tokenize in class org.languagetool.tokenizers.WordTokenizer
        Parameters:
        text - - Text to tokenize
        Returns:
        List of tokens. Note: a special string EO@APOS is used to replace apostrophe during tokenizing.