Analysis
Table of Contents
Interfaces
- AnalyzerInterface
- An AnalyzerInterface is used to analyze text.
- TokenFilterInterface
- Token filter converts (normalizes) Token ore removes it from a token stream.
Classes
- AbstractAnalyzer
- General analyzer implementation.
- Analyzer
- AnalyzerInterface manager.
- AbstractCommon
- AbstractCommon implementation of the analyzerfunctionality.
- CaseInsensitive
- AbstractCommon implementation of the analyzerfunctionality.
- Text
- AbstractCommon implementation of the analyzerfunctionality.
- CaseInsensitive
- AbstractCommon implementation of the analyzerfunctionality.
- TextNum
- AbstractCommon implementation of the analyzerfunctionality.
- CaseInsensitive
- AbstractCommon implementation of the analyzerfunctionality.
- Utf8
- AbstractCommon implementation of the analyzerfunctionality.
- CaseInsensitive
- AbstractCommon implementation of the analyzerfunctionality.
- Utf8Num
- AbstractCommon implementation of the analyzerfunctionality.
- Token
- LowerCase
- Lower case Token filter.
- LowerCaseUtf8
- Lower case Token filter.
- ShortWords
- Token filter that removes short words. What is short word can be configured with constructor.
- StopWords
- Token filter that removes stop words. These words must be provided as array (set), example:
$stopwords = array('the' => 1, 'an' => '1');