Search/Lucene/Analysis/Analyzer.php
LICENSE
This source file is subject to the new BSD license that is bundled with this package in the file LICENSE.txt. It is also available through the world-wide-web at this URL: http://framework.zend.com/license/new-bsd If you did not receive a copy of the license and are unable to obtain it through the world-wide-web, please send an email to license@zend.com so we can send you a copy immediately.
- Category
- Zend
- Copyright
- Copyright (c) 2005-2012 Zend Technologies USA Inc. (http://www.zend.com)
- License
- New BSD License
- Package
- Zend_Search_Lucene
- Subpackage
- Analysis
- Version
- $Id: Analyzer.php 24593 2012-01-05 20:35:02Z matthew $
\Zend_Search_Lucene_Analysis_Analyzer
An Analyzer represents a policy for extracting index terms from text.
Note: The Java Lucene implementation is stream-oriented, which lets it work efficiently with huge documents (more than 20 MB). The engine itself, however, is not designed for such documents, so the Zend_Search_Lucene analysis API works with data strings and sets (arrays).
- Children
- \Zend_Search_Lucene_Analysis_Analyzer_Common
Properties


\Zend_Search_Lucene_Analysis_Analyzer $_defaultImpl
Methods


getDefault() : \Zend_Search_Lucene_Analysis_Analyzer
Return the default Analyzer implementation used by indexing code.
Type | Description |
---|---|
\Zend_Search_Lucene_Analysis_Analyzer |


nextToken() : \Zend_Search_Lucene_Analysis_Token | null
Tokenization stream API. Gets the next token; returns null at the end of the stream.
Tokens are returned in UTF-8 (the internal Zend_Search_Lucene encoding).
Type | Description |
---|---|
\Zend_Search_Lucene_Analysis_Token | null |


setDefault( $analyzer) : void
Set the default Analyzer implementation used by indexing code.
Name | Type | Description |
---|---|---|
$analyzer | \Zend_Search_Lucene_Analysis_Analyzer | |
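
The getDefault()/setDefault() pair amounts to a process-wide default held in the static $_defaultImpl property. The following is a minimal, self-contained sketch of that pattern and not the Zend_Search_Lucene source: RegistryAnalyzer, BuiltinAnalyzer and CustomAnalyzer are hypothetical stand-in classes, and the lazy fallback to a built-in analyzer when no default has been set is an assumption.

```php
<?php
// Stand-in classes for illustration only; NOT the Zend_Search_Lucene source.
abstract class RegistryAnalyzer
{
    /** @var RegistryAnalyzer|null Process-wide default, mirroring $_defaultImpl. */
    private static $_defaultImpl = null;

    /** Replace the default analyzer used by later indexing code. */
    public static function setDefault(RegistryAnalyzer $analyzer)
    {
        self::$_defaultImpl = $analyzer;
    }

    /** Return the current default analyzer. */
    public static function getDefault()
    {
        if (self::$_defaultImpl === null) {
            // Assumption: fall back lazily to some built-in implementation.
            self::$_defaultImpl = new BuiltinAnalyzer();
        }
        return self::$_defaultImpl;
    }
}

// Hypothetical concrete analyzers; real ones would also implement tokenization.
final class BuiltinAnalyzer extends RegistryAnalyzer {}
final class CustomAnalyzer extends RegistryAnalyzer {}

$before = RegistryAnalyzer::getDefault();           // built-in fallback
RegistryAnalyzer::setDefault(new CustomAnalyzer()); // swap the default
$after = RegistryAnalyzer::getDefault();            // the custom analyzer
```

Because the default is static state, it would normally be set once, before any indexing or searching, so that the same analyzer is applied to documents and queries alike.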


setInput(string $data, $encoding = '') : void
Tokenization stream API. Sets the input.
Name | Type | Description |
---|---|---|
$data | string | |
$encoding | string | Encoding of $data; defaults to '' |
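
Taken together, setInput() and nextToken() form a pull-style stream: the input is set once, then tokens are requested until null comes back. The sketch below illustrates that contract with self-contained stand-ins rather than the Zend classes themselves. SketchToken, SketchAnalyzer, WhitespaceAnalyzer and collectTerms() are hypothetical names, the getTermText() accessor on the stand-in token is an assumption, and the whitespace/lowercase tokenization rule is chosen only to keep the example short.

```php
<?php
// Stand-in types for illustration only; they mirror the signatures documented
// on this page but are NOT the Zend_Search_Lucene source.
final class SketchToken
{
    private $text;

    public function __construct($text)
    {
        $this->text = $text;
    }

    // Assumed accessor name for the token's term text.
    public function getTermText()
    {
        return $this->text;
    }
}

abstract class SketchAnalyzer
{
    /** Set the text to tokenize; mirrors setInput($data, $encoding = ''). */
    abstract public function setInput($data, $encoding = '');

    /** Return the next token, or null at the end of the stream. */
    abstract public function nextToken();
}

// Hypothetical analyzer: lowercases the input and splits it on whitespace.
final class WhitespaceAnalyzer extends SketchAnalyzer
{
    private $words = array();
    private $position = 0;

    public function setInput($data, $encoding = '')
    {
        // This sketch ignores $encoding and treats $data as plain ASCII/UTF-8.
        $this->words = preg_split('/\s+/', strtolower($data), -1, PREG_SPLIT_NO_EMPTY);
        $this->position = 0; // restart the stream for the new input
    }

    public function nextToken()
    {
        if ($this->position >= count($this->words)) {
            return null; // end of stream
        }
        return new SketchToken($this->words[$this->position++]);
    }
}

// Drive the stream: set the input once, then pull tokens until null.
function collectTerms(SketchAnalyzer $analyzer, $text)
{
    $terms = array();
    $analyzer->setInput($text, 'UTF-8');
    while (($token = $analyzer->nextToken()) !== null) {
        $terms[] = $token->getTermText();
    }
    return $terms;
}

$terms = collectTerms(new WhitespaceAnalyzer(), 'Index  terms from Text');
```

The strict `!== null` comparison matters in the loop: it ends the iteration only on the documented end-of-stream value and not on any other falsy result.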