Class EmailCjkSynonymAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.StopwordAnalyzerBase
com.apple.foundationdb.record.lucene.EmailCjkSynonymAnalyzer
- All Implemented Interfaces:
Closeable, AutoCloseable
public class EmailCjkSynonymAnalyzer
extends org.apache.lucene.analysis.StopwordAnalyzerBase
An analyzer that can handle emails, CJK, and synonyms. It essentially combines UAX29URLEmailAnalyzer, CJKUnigramFilter,
and SynonymGraphFilter.
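To make the three behaviours named above concrete, here is a minimal stdlib-only sketch. It is a toy stand-in, not the analyzer's implementation, and it does not use Lucene. The class name `ToyEmailCjkSynonymTokenizer`, the regular expression, the restriction of "CJK" to Han ideographs, and the flat synonym expansion are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical toy model for illustration only; not part of Lucene or the record layer. */
public class ToyEmailCjkSynonymTokenizer {
    // One alternative per token kind, tried in this order at each position.
    private static final Pattern TOKEN = Pattern.compile(
            "[\\p{Alnum}._%+-]+@[\\p{Alnum}-]+(?:\\.[\\p{Alnum}-]+)+" // e-mail address kept whole
            + "|\\p{IsHan}"                                          // one Han ideograph per token
            + "|[\\p{IsLatin}\\p{Nd}]+");                             // ordinary word or number

    /**
     * Splits text into lower-cased tokens and appends any synonyms directly
     * after the token they belong to (assumption: a flat list stands in for
     * the position graph a real synonym filter would build).
     */
    public static List<String> tokenize(String text, Map<String, List<String>> synonyms) {
        List<String> out = new ArrayList<>();
        Matcher m = TOKEN.matcher(text);
        while (m.find()) {
            String token = m.group().toLowerCase(Locale.ROOT);
            out.add(token);
            out.addAll(synonyms.getOrDefault(token, List.of()));
        }
        return out;
    }
}
```

With the input `Mail bob@example.com 東京 fast` and a synonym map that maps `fast` to `quick`, this toy returns `mail`, `bob@example.com`, `東`, `京`, `fast`, `quick`: the address survives as one token, each ideograph becomes its own token, and the synonym follows its source term. The real analyzer's output may differ in every one of these details.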
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer
org.apache.lucene.analysis.Analyzer.ReuseStrategy, org.apache.lucene.analysis.Analyzer.TokenStreamComponents
-
Field Summary
Fields
UNIQUE_IDENTIFIER
Fields inherited from class org.apache.lucene.analysis.StopwordAnalyzerBase
stopwords
Fields inherited from class org.apache.lucene.analysis.Analyzer
GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
-
Constructor Summary
Constructors
EmailCjkSynonymAnalyzer(org.apache.lucene.analysis.CharArraySet stopwords, int minTokenLength, int minAlphanumericTokenLength, int maxTokenLength, boolean withEmailTokenizer, boolean withSynonymGraphFilter, org.apache.lucene.analysis.synonym.SynonymMap synonymMap)
-
Method Summary
Modifier and Type  Method  Description
protected org.apache.lucene.analysis.Analyzer.TokenStreamComponents  createComponents(String fieldName)  Deprecated.
int  getMaxTokenLength()
int  getMinTokenLength()
protected org.apache.lucene.analysis.synonym.SynonymMap  getSynonymMap()
protected org.apache.lucene.analysis.TokenStream  normalize(String fieldName, org.apache.lucene.analysis.TokenStream in)
protected boolean  withEmailTokenizer()
protected boolean  withSynonymGraphFilter()
Methods inherited from class org.apache.lucene.analysis.StopwordAnalyzerBase
getStopwordSet, loadStopwordSet, loadStopwordSet, loadStopwordSet
Methods inherited from class org.apache.lucene.analysis.Analyzer
attributeFactory, close, getOffsetGap, getPositionIncrementGap, getReuseStrategy, getVersion, initReader, initReaderForNormalization, normalize, setVersion, tokenStream, tokenStream
-
Field Details
-
UNIQUE_IDENTIFIER
-
Constructor Details
-
EmailCjkSynonymAnalyzer
public EmailCjkSynonymAnalyzer(@Nonnull org.apache.lucene.analysis.CharArraySet stopwords, int minTokenLength, int minAlphanumericTokenLength, int maxTokenLength, boolean withEmailTokenizer, boolean withSynonymGraphFilter, @Nullable org.apache.lucene.analysis.synonym.SynonymMap synonymMap)
-
-
Method Details
-
createComponents
@Deprecated protected org.apache.lucene.analysis.Analyzer.TokenStreamComponents createComponents(String fieldName)
Deprecated.
- Specified by:
createComponents in class org.apache.lucene.analysis.Analyzer
-
normalize
protected org.apache.lucene.analysis.TokenStream normalize(String fieldName, org.apache.lucene.analysis.TokenStream in)
- Overrides:
normalize in class org.apache.lucene.analysis.Analyzer
-
getMinTokenLength
public int getMinTokenLength()
-
getMaxTokenLength
public int getMaxTokenLength()
-
withSynonymGraphFilter
protected boolean withSynonymGraphFilter()
-
withEmailTokenizer
protected boolean withEmailTokenizer()
-
getSynonymMap
@Nonnull protected org.apache.lucene.analysis.synonym.SynonymMap getSynonymMap()
-