Package org.languagetool.tokenizers
Class SimpleSentenceTokenizer
- java.lang.Object
  - org.languagetool.tokenizers.SRXSentenceTokenizer
    - org.languagetool.tokenizers.SimpleSentenceTokenizer
- All Implemented Interfaces: SentenceTokenizer, Tokenizer
public class SimpleSentenceTokenizer extends SRXSentenceTokenizer
A very simple sentence tokenizer that splits on [.!?…] followed by whitespace or an uppercase letter. You probably want to use an adapted SRXSentenceTokenizer instead.
- Since: 2.6
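The splitting rule described above can be illustrated with a small standalone sketch. This is not LanguageTool's implementation (the class extends SRXSentenceTokenizer, so the real behavior comes from SRX rules); it is a minimal regex approximation. The names SimpleSplitSketch, BOUNDARY, and split are hypothetical, and keeping trailing whitespace attached to the preceding sentence is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of LanguageTool.
class SimpleSplitSketch {

    // One of . ! ? … (U+2026), followed either by whitespace (consumed, so it
    // stays with the preceding sentence) or by an uppercase letter (lookahead only).
    private static final Pattern BOUNDARY =
            Pattern.compile("[.!?\u2026](?:\\s+|(?=\\p{Lu}))");

    static List<String> split(String text) {
        List<String> sentences = new ArrayList<>();
        Matcher m = BOUNDARY.matcher(text);
        int start = 0;
        while (m.find()) {
            // A sentence runs from the previous boundary to the end of this match.
            sentences.add(text.substring(start, m.end()));
            start = m.end();
        }
        // Whatever follows the last boundary is the final sentence.
        if (start < text.length()) {
            sentences.add(text.substring(start));
        }
        return sentences;
    }

    public static void main(String[] args) {
        for (String s : split("Hello world. How are you?Fine!")) {
            System.out.println("[" + s + "]");
        }
    }
}
```

A rule this simple also splits after abbreviations: under this sketch, "e.g. this" becomes two pieces. That limitation is presumably why the description recommends an adapted SRXSentenceTokenizer instead.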
Nested Class Summary
Nested Classes
Modifier and Type                 Class                                  Description
(package private) static class    SimpleSentenceTokenizer.AnyLanguage
Constructor Summary
Constructors
Constructor                  Description
SimpleSentenceTokenizer()
Method Summary
Methods inherited from class org.languagetool.tokenizers.SRXSentenceTokenizer
setSingleLineBreaksMarksParagraph, singleLineBreaksMarksPara, tokenize