AbstractCommon
extends AbstractAnalyzer
in package
AbstractYes
AbstractCommon implementation of the analyzerfunctionality.
There are several standard standard subclasses provided by Analysis subpackage.
Tags
Table of Contents
Properties
- $_encoding : string
- Input string encoding
- $_input : string
- Input string
- $_filters : array<string|int, mixed>
- The set of Token filters applied to the Token stream.
Methods
- addFilter() : void
- Add Token filter to the AnalyzerInterface
- normalize() : Token
- Apply filters to the token. Can return null when the token was removed.
- setInput() : void
- Tokenization stream API Set input
- tokenize() : array<string|int, mixed>
- Tokenize text to a terms Returns array of \ZendSearch\Lucene\Analysis\Token objects
Properties
$_encoding
Input string encoding
protected
string
$_encoding
= ''
$_input
Input string
protected
string
$_input
= null
$_filters
The set of Token filters applied to the Token stream.
private
array<string|int, mixed>
$_filters
= array()
Array of \ZendSearch\Lucene\Analysis\TokenFilter\TokenFilterInterface objects.
Methods
addFilter()
Add Token filter to the AnalyzerInterface
public
addFilter(TokenFilterInterface $filter) : void
Parameters
- $filter : TokenFilterInterface
normalize()
Apply filters to the token. Can return null when the token was removed.
public
normalize(Token $token) : Token
Parameters
- $token : Token
Return values
TokensetInput()
Tokenization stream API Set input
public
setInput(string $data[, mixed $encoding = '' ]) : void
Parameters
- $data : string
- $encoding : mixed = ''
tokenize()
Tokenize text to a terms Returns array of \ZendSearch\Lucene\Analysis\Token objects
public
tokenize(string $data[, mixed $encoding = '' ]) : array<string|int, mixed>
Tokens are returned in UTF-8 (internal Zend_Search_Lucene encoding)
Parameters
- $data : string
- $encoding : mixed = ''